Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

Article URL: https://limit-of-rlvr.github.io/

Comments URL: https://news.ycombinator.com/item?id=43760625

Points: 12

# Comments: 3

https://limit-of-rlvr.github.io/

Vytvořeno 3h | 22. 4. 2025 13:40:21

Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

The Cold Start Problem: Using Network Effects to Scale Your Product – A Review

The Cold Start Problem: Using Network Effects to Scale Your Product – A Review

Article URL: https://madhavajay.com/the-cold-start-problem-using-network-effects-to-scale-your-

22. 4. 2025 16:10:13 | Hacker news

Join the W3C Exploration Interest Group: where standards start

Join the W3C Exploration Interest Group: where standards start

Article URL: https://www.w3.org/blog/2025/join-the-w3c-exploration-interest-group-where-standa

22. 4. 2025 16:10:13 | Hacker news

Abusing DuckDB-WASM by making SQL draw 3D graphics (Sort Of)

Abusing DuckDB-WASM by making SQL draw 3D graphics (Sort Of)

Article URL: https://www.hey.earth/posts/duckdb-doom

Comments URL: https://news

22. 4. 2025 16:10:12 | Hacker news

I Open-Sourced My AI Toy Company That Runs on ESP32 and OpenAI Realtime API

I Open-Sourced My AI Toy Company That Runs on ESP32 and OpenAI Realtime API

Article URL: https://github.com/akdeb/ElatoAI

Comments URL: https://news.ycombinator.c

22. 4. 2025 16:10:11 | Hacker news

Using physics simulations to find targeting strategies in tenpin bowling

Using physics simulations to find targeting strategies in tenpin bowling

Article URL: https://pubs.aip.org/aip/adv/article/15/4/045222/3344017/Using-physics-s

22. 4. 2025 16:10:10 | Hacker news

Introduction to Graph Transformers

Introduction to Graph Transformers

Article URL: https://kumo.ai/research/introduction-to-graph-transformers/

Comments URL:

22. 4. 2025 16:10:10 | Hacker news

Launch HN: Infra.new (YC W23) – DevOps Copilot with Guardrails Built In

Launch HN: Infra.new (YC W23) – DevOps Copilot with Guardrails Built In

Hey HN, we’re Caleb, Michael, and Josh, the founders of infra.new (https://infra.new/), a DevOps Copilot that can configure and deploy apps on AWS,

22. 4. 2025 16:10:08 | Hacker news

Techie