Hi everyone!
After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26
Creato
10h
|
5 mar 2025, 20:20:12
Accedi per aggiungere un commento
Altri post in questo gruppo


Requirements:
* Macbook is not an option
* I go through phases and switch between Windows and Linux as my primary OS.
* Want to be able to mess around with some local LLMs.
* I travel freq