DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

Article URL: https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Comments URL: https://news.ycombinator.com/item?id=43017599

Points: 155

# Comments: 68

https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Vytvořeno 13d | 11. 2. 2025 22:10:13

Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

There Isn't Much Point to HTTP/2 Past the Load Balancer

There Isn't Much Point to HTTP/2 Past the Load Balancer

Article URL: https://byroot.github.io/ruby/performance/2025/02/24/http2-past-the-load-balancer.html

25. 2. 2025 8:20:11 | Hacker news

Xcode Constantly Phones Home

Xcode Constantly Phones Home

Article URL: https://lapcatsoftware.com/articles/2025/2/5.html

Comments URL:

25. 2. 2025 8:20:09 | Hacker news

What would happen if we didn't use TCP or UDP?

What would happen if we didn't use TCP or UDP?

Article URL: https://github.com/Hawzen/hdp

Comments URL: https://news.ycombinator.com/ite

25. 2. 2025 8:20:08 | Hacker news

DigiCert: Threat of legal action to stifle Bugzilla discourse

DigiCert: Threat of legal action to stifle Bugzilla discourse

Article URL: https://bugzilla.mozilla.org/show_bug.cgi?id=1950144

Comments URL:

25. 2. 2025 6:10:07 | Hacker news

History of CAD – David Weisberg

History of CAD – David Weisberg

Article URL: https://www.shapr3d.com/blog/history-of-cad

Comments URL: http

25. 2. 2025 6:10:06 | Hacker news

How to change your settings to make yourself less valuable to Meta

How to change your settings to make yourself less valuable to Meta

Article URL: https://johnoliverwantsyourraterotica.com/

Comments URL: https:

25. 2. 2025 6:10:03 | Hacker news

A16Z AI Voice Update 2025

A16Z AI Voice Update 2025

Article URL: https://gamma.app/docs/a16z-AI-Voice-Update-2025--ttkorld8iy6wfnj?mode=doc

Comments URL

25. 2. 2025 6:10:02 | Hacker news

Techie