Show HN: Beating Pokemon Red with RL and 10M Parameters

Hi everyone!

After hundreds of hours of work, we're excited to finally share our progress on a reinforcement learning system that beats Pokémon Red. The system completes the game using a policy with fewer than 10M parameters, trained with PPO and a few novel techniques. With the release of Claude Plays Pokémon, now feels like the perfect time to showcase our work.

We'd love to get feedback!

Comments URL: https://news.ycombinator.com/item?id=43269330

Points: 41

# Comments: 26

https://drubinstein.github.io/pokerl/

Created 1mo | Mar 5, 2025, 8:20:12 PM

