Offline Reinforcement Learning for LLM Multi-Step Reasoning

Article URL: https://arxiv.org/abs/2412.16145

Comments URL: https://news.ycombinator.com/item?id=42493312

Points: 11

# Comments: 5

Created 26d | 23 Dec 2024, 11:40:07

Other posts in this group

Using ChatGPT is not bad for the environment

Article URL: https://andymasley.substack.com/p/individual-ai-use-is-not-bad-for

Comments URL:

18 Jan 2025, 06:50:05 | Hacker news

Is the TikTok ban a chance to rethink the whole internet?

Article URL: https://www.newyorker.com/news/annals-of-communications/is-t

18 Jan 2025, 04:30:06 | Hacker news

Can you read this cursive handwriting? The National Archives wants your help

Article URL: https://www.smithsonianmag.com/smart-news/ca

18 Jan 2025, 04:30:06 | Hacker news

EFF Statement on U.S. Supreme Court's Decision to Uphold TikTok Ban

Article URL: https://www.eff.org/deeplinks/2025/01/eff-statement-us-supreme-courts-decisi

18 Jan 2025, 02:20:05 | Hacker news

Spellbrush (YC W18) Is Hiring Game Programmers (Anime SRPG/Tactics)

Comments URL: https://news.ycombinator.com/item?id=42744820

Points: 0

# Comments: 0

18 Jan 2025, 02:20:03 | Hacker news

BYD just launched the largest car carrier to charge up its global EV ambitions

Article URL: https://electrek.co/2025/01/17/byd-launches-worlds-largest-car-carrier-fuel-g

18 Jan 2025, 02:20:02 | Hacker news

Higher potassium intake at dinner linked to fewer sleep disturbances – study

Article URL: https://www.nutraingredients-asia.com/Article/

17 Jan 2025, 23:50:08 | Hacker news
