DeepSeek: Inference-Time Scaling for Generalist Reward Modeling

Article URL: https://arxiv.org/abs/2504.02495

Comments URL: https://news.ycombinator.com/item?id=43578430

Points: 64

# Comments: 11

https://arxiv.org/abs/2504.02495

созданный 13h | 4 апр. 2025 г., 19:20:33

Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Scaling Up Reinforcement Learning for Traffic Smoothing

Scaling Up Reinforcement Learning for Traffic Smoothing

Article URL: https://bair.berkeley.edu/blog/2025/03/25/rl-av-smoothing/

Comments URL:

5 апр. 2025 г., 06:50:09 | Hacker news

404s – gallery of error 404 page designs

404s – gallery of error 404 page designs

Article URL: https://www.404s.design/

Comments URL: https://news.ycombinator.com/item?id=43589

5 апр. 2025 г., 06:50:08 | Hacker news

Coqui TTS: Free Text-to-Speech

Coqui TTS: Free Text-to-Speech

Article URL: https://coquitts.com

Comments URL: https://news.ycombinator.com/item?id=43590570

5 апр. 2025 г., 06:50:08 | Hacker news

I don't like traveling anymore

I don't like traveling anymore

Article URL: https://sidverma.io/posts/i-dont-like-traveling-anymore/

Comments URL:

5 апр. 2025 г., 06:50:07 | Hacker news

$Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)$

Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)

Hi HN,

I’ve been working on an OCR pipeline specifically optimized for machine learning dataset preparation. It’s designed to process complex academic materials — including math formulas, tables

5 апр. 2025 г., 06:50:06 | Hacker news

Recreating Daft Punk's Something About Us

Recreating Daft Punk's Something About Us

Article URL: https://thoughts-and-things.ghost.io/recreating-daft-punks-something-about-us/

Comm

5 апр. 2025 г., 06:50:05 | Hacker news

Investigating MacPaint's Source Code

Investigating MacPaint's Source Code

Article URL: https://ztoz.blog/posts/macpaint-source-code/

Comments URL:

5 апр. 2025 г., 04:30:11 | Hacker news

Techie