Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

Article URL: https://limit-of-rlvr.github.io/

Comments URL: https://news.ycombinator.com/item?id=43760625

Points: 12

# Comments: 3

https://limit-of-rlvr.github.io/

Created 3h | Apr 22, 2025, 1:40:21 PM

Login to add comment

Other posts in this group

The Cold Start Problem: Using Network Effects to Scale Your Product – A Review

The Cold Start Problem: Using Network Effects to Scale Your Product – A Review

Article URL: https://madhavajay.com/the-cold-start-problem-using-network-effects-to-scale-your-

Apr 22, 2025, 4:10:13 PM | Hacker news

Join the W3C Exploration Interest Group: where standards start

Join the W3C Exploration Interest Group: where standards start

Article URL: https://www.w3.org/blog/2025/join-the-w3c-exploration-interest-group-where-standa

Apr 22, 2025, 4:10:13 PM | Hacker news

Abusing DuckDB-WASM by making SQL draw 3D graphics (Sort Of)

Abusing DuckDB-WASM by making SQL draw 3D graphics (Sort Of)

Article URL: https://www.hey.earth/posts/duckdb-doom

Comments URL: https://news

Apr 22, 2025, 4:10:12 PM | Hacker news

I Open-Sourced My AI Toy Company That Runs on ESP32 and OpenAI Realtime API

I Open-Sourced My AI Toy Company That Runs on ESP32 and OpenAI Realtime API

Article URL: https://github.com/akdeb/ElatoAI

Comments URL: https://news.ycombinator.c

Apr 22, 2025, 4:10:11 PM | Hacker news

Using physics simulations to find targeting strategies in tenpin bowling

Using physics simulations to find targeting strategies in tenpin bowling

Article URL: https://pubs.aip.org/aip/adv/article/15/4/045222/3344017/Using-physics-s

Apr 22, 2025, 4:10:10 PM | Hacker news

Introduction to Graph Transformers

Introduction to Graph Transformers

Article URL: https://kumo.ai/research/introduction-to-graph-transformers/

Comments URL:

Apr 22, 2025, 4:10:10 PM | Hacker news

Launch HN: Infra.new (YC W23) – DevOps Copilot with Guardrails Built In

Launch HN: Infra.new (YC W23) – DevOps Copilot with Guardrails Built In

Hey HN, we’re Caleb, Michael, and Josh, the founders of infra.new (https://infra.new/), a DevOps Copilot that can configure and deploy apps on AWS,

Apr 22, 2025, 4:10:08 PM | Hacker news

Techie