Show HN: Beating Pokemon Red with RL and 10M Parameters

Hi everyone!

After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!


Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

# Comments: 26

https://drubinstein.github.io/pokerl/

Établi 10h | 5 mars 2025, 20:20:12


Connectez-vous pour ajouter un commentaire