Hi everyone!
After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy with fewer than 10M parameters, trained with PPO and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.
We'd love to get feedback!
Comments URL: https://news.ycombinator.com/item?id=43269330
Points: 41
# Comments: 26
Posted: March 5, 2025, 20:20:12