Show HN: Beating Pokemon Red with RL and 10M Parameters

Hi everyone!

After spending hundreds of hours, we're excited to finally share our progress in developing a reinforcement learning system to beat Pokémon Red. Our system successfully completes the game using a policy with fewer than 10M parameters, trained with PPO and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!

Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

Comments: 26

https://drubinstein.github.io/pokerl/

Created 1mo ago | March 5, 2025, 20:20:12
