Show HN: Beating Pokemon Red with RL and <10M Parameters

Hi everyone!

After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!

Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

Number of comments: 26

https://drubinstein.github.io/pokerl/

Created 1mo ago | Mar 5, 2025, 20:20:12

