Hi everyone!
After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26
Created
10h
|
Mar 5, 2025, 8:20:12 PM
Login to add comment
Other posts in this group


Requirements:
* Macbook is not an option
* I go through phases and switch between Windows and Linux as my primary OS.
* Want to be able to mess around with some local LLMs.
* I travel freq