Show HN: Steiner – An open-source reasoning model inspired by OpenAI o1

Steiner is a series of reasoning models trained on synthetic data using reinforcement learning. These models can explore multiple reasoning paths in an autoregressive manner during inference and autonomously verify or backtrack when necessary, enabling a linear traversal of the implicit search tree.

Blog: https://medium.com/@peakji/a-small-step-towards-reproducing-...

Hugging Face: https://huggingface.co/collections/peakji/steiner-preview-67...


Comments URL: https://news.ycombinator.com/item?id=41915735

Points: 20

# Comments: 7

https://medium.com/@peakji/a-small-step-towards-reproducing-openai-o1-b9a756a00855

Created 6mo | Oct 22, 2024, 6:10:14 PM


Login to add comment