DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

Article URL: https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Comments URL: https://news.ycombinator.com/item?id=43017599

Points: 155

# Comments: 68

https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Creado 14d | 11 feb. 2025 22:10:13

Inicia sesión para agregar comentarios

Otros mensajes en este grupo.

How Core Git Developers Configure Git

How Core Git Developers Configure Git

Article URL: https://blog.gitbutler.com/how-git-core-devs-configure-git/

Comments URL:

25 feb. 2025 10:40:06 | Hacker news

Tesla sales in Europe down 45% in January

Tesla sales in Europe down 45% in January

Article URL: https://www.ft.com/content/cdd0b5c8-2703-4fd4-9ebf-26087cac8523

Comments URL:

25 feb. 2025 10:40:04 | Hacker news

There Isn't Much Point to HTTP/2 Past the Load Balancer

There Isn't Much Point to HTTP/2 Past the Load Balancer

Article URL: https://byroot.github.io/ruby/performance/2025/02/24/http2-past-the-load-balancer.html

25 feb. 2025 8:20:11 | Hacker news

Xcode Constantly Phones Home

Xcode Constantly Phones Home

Article URL: https://lapcatsoftware.com/articles/2025/2/5.html

Comments URL:

25 feb. 2025 8:20:09 | Hacker news

What would happen if we didn't use TCP or UDP?

What would happen if we didn't use TCP or UDP?

Article URL: https://github.com/Hawzen/hdp

Comments URL: https://news.ycombinator.com/ite

25 feb. 2025 8:20:08 | Hacker news

DigiCert: Threat of legal action to stifle Bugzilla discourse

DigiCert: Threat of legal action to stifle Bugzilla discourse

Article URL: https://bugzilla.mozilla.org/show_bug.cgi?id=1950144

Comments URL:

25 feb. 2025 6:10:07 | Hacker news

History of CAD – David Weisberg

History of CAD – David Weisberg

Article URL: https://www.shapr3d.com/blog/history-of-cad

Comments URL: http

25 feb. 2025 6:10:06 | Hacker news

Techie