Hi everyone!
After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy under 10M parameters, PPO, and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26
Jelentkezéshez jelentkezzen be
EGYÉB POSTS Ebben a csoportban

Article URL: https://arxiv.org/abs/2504.06212
Comments URL: https://news.ycombinator.c

One year ago I open-sourced my very first 'real' project and shared it here. I was a college student in my senior year and desperately looking for a job. At the time of sharing it i couldn't even

Article URL: https://github.com/reteps/dockerfmt
Comments URL: https://news.ycombin