Show HN: Beating Pokemon Red with RL and <10M Parameters

Hi everyone!

After hundreds of hours of work, we're excited to finally share our progress on a reinforcement learning system that beats Pokémon Red. The system completes the game using a policy with fewer than 10M parameters, trained with PPO, plus a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!

Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

Comments: 26

https://drubinstein.github.io/pokerl/

Created 1mo ago | 5 Mar 2025, 20:20:12


