Hi everyone!
After hundreds of hours of work, we're excited to finally share our progress on a reinforcement learning system that beats Pokémon Red. The system completes the game using a policy with fewer than 10M parameters, trained with PPO, plus a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26