DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

Article URL: https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Comments URL: https://news.ycombinator.com/item?id=43017599

Points: 155

# Comments: 68

https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2

Établi 1mo | 11 févr. 2025, 22:10:13

Connectez-vous pour ajouter un commentaire

Autres messages de ce groupe

Show HN: Minimalytics – a standalone minimal analytics app built on SQLite

Hi everyone! I wanted to share my analytics app with you.

This project came from requirements to track certain very frequent events. I found that the cost to do it on a regular analytics product

20 mars 2025, 22:20:25 | Hacker news

Appeals court rules that Constitution protects possession of AI-generated CSAM

Article URL: https://www.techpolicy.press/court-rules-that-constitution-protec

20 mars 2025, 22:20:25 | Hacker news

NATS Server v2.11

Article URL: https://nats.io/blog/nats-server-2.11-release/

Comments URL:

20 mars 2025, 22:20:24 | Hacker news

Court Imposes over $1.6B in Penalties on a Toyota Subsidiary for Emissions Fraud

Article URL: https://www.justice.gov/opa/pr/court-sentences-hino

20 mars 2025, 22:20:24 | Hacker news

Show HN: AgentKit – JavaScript Alternative to OpenAI Agents SDK with Native MCP

Hi HN! I’m Tony, co-founder of Inngest. I wanted to share AgentKit, our Typescript multi-agent library we’ve been cooking and testing with some early users in prod for months.

Although OpenAI’s

20 mars 2025, 20:10:06 | Hacker news

How to Be Good at Dating

Article URL: https://fantasticanachronism.com/2025/03/20/how-to-be-good-at-dating/

Comments URL:

20 mars 2025, 20:10:06 | Hacker news

The Burnout Machine

Article URL: https://unionize.fyi

Comments URL: https://news.ycombinator.com/item?id=43427002

20 mars 2025, 20:10:04 | Hacker news

Techie