Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1

Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.

Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...

Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...

Comments URL: https://news.ycombinator.com/item?id=41915735

Points: 20

# Comments: 7

https://medium.com/@peakji/a-small-step-towards-reproducing-openai-o1-b9a756a00855

Created 6mo | Oct 22, 2024, 6:10:14 PM

Login to add comment

Other posts in this group

Nationwide Power Outages Also Disrupt Internet Traffic in Portugal and Spain

Nationwide Power Outages Also Disrupt Internet Traffic in Portugal and Spain

Article URL: https://twitter.com/CloudflareRadar/status/1916811587408536055

Comments URL:

Apr 28, 2025, 1:30:11 PM | Hacker news

Reports of widespread power cuts in Spain and Portugal

Reports of widespread power cuts in Spain and Portugal

Article URL: https://www.bbc.com/news/live/c9wpq8xrvd9t

Comments URL: https:

Apr 28, 2025, 1:30:10 PM | Hacker news

Deep dive into how DOS games do copy protection by making themselves unwinnable

Deep dive into how DOS games do copy protection by making themselves unwinnable

Article URL: https://mrwint.github.io/winter/writeup/writeup.html

Comments URL:

Apr 28, 2025, 1:30:10 PM | Hacker news

Making a game from scratch using only a guitar [video]

Making a game from scratch using only a guitar [video]

Apr 28, 2025, 1:30:08 PM | Hacker news

I built a hardware processor that runs Python

I built a hardware processor that runs Python

Article URL: https://www.runpyxl.com/gpio

Comments URL: https://news.ycombinator.com/item?

Apr 28, 2025, 1:30:07 PM | Hacker news

Optery (YC W22) – Engineering Team Lead and Engineers with Node.js (U.S., Latam)

Optery (YC W22) – Engineering Team Lead and Engineers with Node.js (U.S., Latam)

Article URL: https://jobs.ashbyhq.com/optery

Comments URL: https://news.ycombinator.com

Apr 28, 2025, 1:30:06 PM | Hacker news

Naur's "Programming as Theory Building" and LLMs replacing human programmers

Naur's "Programming as Theory Building" and LLMs replacing human programmers

Article URL: https://ratfactor.com/cards/naur-vs-llms

Comments URL: https://ne

Apr 28, 2025, 11:10:09 AM | Hacker news

Techie