Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1

Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.

Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...

Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...

Comments URL: https://news.ycombinator.com/item?id=41915735

Points: 20

# Comments: 7

https://medium.com/@peakji/a-small-step-towards-reproducing-openai-o1-b9a756a00855

Creată 6mo | 22 oct. 2024, 18:10:14

Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

Naur's "Programming as Theory Building" and LLMs replacing human programmers

Article URL: https://ratfactor.com/cards/naur-vs-llms

Comments URL: https://ne

28 apr. 2025, 11:10:09 | Hacker news

Reversing the Fossilization of Computer Science Conferences

Article URL: https://cacm.acm.org/blogcacm/reversing-the-fossilization-of-computer-science-conf

28 apr. 2025, 11:10:08 | Hacker news

Why do electrons not fall into the nucleus?

Article URL:

28 apr. 2025, 08:50:03 | Hacker news

Presentation Slides with Markdown

Article URL: https://sli.dev

Comments URL: https://news.ycombinator.com/item?id=43816634

Poi

28 apr. 2025, 06:30:13 | Hacker news

Ask HN: CS degrees, do they matter again?

tldr; skip to the --------

Last time I "Asked HN", I was in a very different place. Fresh out of a bootcamp, right at the peak, and subsequent collapse of the Covid hiring. It didn't go well. Ho

28 apr. 2025, 06:30:11 | Hacker news

Show HN: Cleverb.ee – open-source agent that writes a cited research report

Article URL: https://github.com/SureScaleAI/cleverbee

Comments URL: https://ne

28 apr. 2025, 06:30:10 | Hacker news

East German Stasi Tactics – Zersetzung (2021)

Article URL: https://www.maxhertzberg.co.uk/background/politics/stasi-tactics/

Comments URL:

28 apr. 2025, 06:30:07 | Hacker news

Techie