Try out my pixel platformer demo with an RL agent you can play against

by textlapse · Mar 19, 2026 · 1 point · 0 comments

AI Analysis

●● Solid · Rabbit Hole · Wizardry

Play against a self-play RL agent in your browser—Sony GT Sophy energy for platformers.

Strengths
  • Rewrote Puffer's core with multithreaded GPU batching—contributing back to OSS.
  • Curriculum learning + self-play actually visible in gameplay, not just a blog claim.
  • Interactive demo lets you experience RL behavior firsthand instead of watching videos.
Weaknesses
  • Demo/experiment rather than a tool—novelty factor limits long-term utility.
  • Desktop browser + keyboard/controller requirement narrows the audience.
Target Audience

ML engineers, game developers curious about RL, reinforcement learning enthusiasts

Similar To

Sony GT Sophy · Puffer

Post Description

I trained an RL agent inside a pixel platformer, and you can play against it live in a desktop browser (needs a keyboard or a controller):

https://rlplays.com/game

This is NOT a quick one-off vibe-coded/LLM-coded project.

I started this project out of curiosity: I wanted to build an RL-based game, as there are very few out there (e.g. Sony GT Sophy), and I wanted to learn the core RL foundations in a practical, useful way.

I built on top of Puffer, but the training speed was not up to my needs, so I rewrote the core as a ground-up native eval/training loop with multithreaded GPU batching (gonna be part of the next Puffer release). [ Unaffiliated plug: Puffer is an excellent OSS library - check out https://puffer.ai ]
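
As a rough illustration of the general idea behind batching inference across environment threads, here is a minimal standard-library sketch. It is not the author's code and does not use Puffer's API: `batched_policy`, `env_worker`, `inference_server`, and `run` are hypothetical names, the "policy" is a toy function standing in for a single GPU forward pass, and the environment transition is made up.

```python
import queue
import threading


def batched_policy(observations):
    # Stand-in (assumption) for one batched GPU forward pass: a real policy
    # would run a neural network over the whole batch at once.
    return [obs % 4 for obs in observations]


def env_worker(worker_id, steps, request_q, results):
    # Each worker steps one toy environment on its own thread.
    reply_q = queue.Queue(maxsize=1)
    obs = worker_id
    for _ in range(steps):
        request_q.put((obs, reply_q))  # submit an observation
        action = reply_q.get()         # block until the batch is served
        results[worker_id].append(action)
        obs = obs + action + 1         # made-up environment transition


def inference_server(request_q, batch_size, total_requests):
    # Gathers pending observations into one batch per policy call.
    served = 0
    batch_sizes = []
    while served < total_requests:
        batch = [request_q.get()]      # wait for at least one request
        while len(batch) < batch_size:
            try:
                batch.append(request_q.get_nowait())
            except queue.Empty:
                break
        actions = batched_policy([obs for obs, _ in batch])
        for (_, reply_q), action in zip(batch, actions):
            reply_q.put(action)
        served += len(batch)
        batch_sizes.append(len(batch))
    return batch_sizes


def run(num_envs=8, steps=5, batch_size=8):
    request_q = queue.Queue()
    results = {i: [] for i in range(num_envs)}
    workers = [
        threading.Thread(target=env_worker, args=(i, steps, request_q, results))
        for i in range(num_envs)
    ]
    for w in workers:
        w.start()
    batch_sizes = inference_server(request_q, batch_size, num_envs * steps)
    for w in workers:
        w.join()
    return results, batch_sizes
```

The point of the pattern is that many environments share one policy call instead of each issuing its own, which is where a GPU tends to pay off. A native implementation would likely avoid Python threads and per-request queues entirely; this sketch only shows the shape of the data flow.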

I trained the RL agent using curriculum learning + self-play. The demo also showcases this self-play: you can play against yourself, like an RL agent would!
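
For readers unfamiliar with the two terms, the sketch below shows one common way curriculum learning and self-play are combined: the agent plays against snapshots of its earlier selves, and moves to a harder stage once its recent win rate clears a threshold. Everything here is an assumption for illustration (the class name `SelfPlayCurriculum`, the stage names, the 70% threshold, the window size); the post does not describe the actual training setup at this level of detail.

```python
import random


class SelfPlayCurriculum:
    """Toy bookkeeping for self-play opponents plus staged difficulty."""

    def __init__(self, stages, promote_at=0.7, window=20, seed=0):
        self.stages = stages          # ordered easiest to hardest (hypothetical)
        self.stage = 0                # index of the current stage
        self.promote_at = promote_at  # win rate needed to advance
        self.window = window          # number of recent games considered
        self.recent = []              # rolling record of wins and losses
        self.pool = []                # saved snapshots of past policies
        self.rng = random.Random(seed)

    def snapshot(self, policy_id):
        # Save the current policy so later versions can train against it.
        self.pool.append(policy_id)

    def sample_opponent(self):
        # Uniform sampling over past selves; returns None before any snapshot.
        if not self.pool:
            return None
        return self.rng.choice(self.pool)

    def record(self, won):
        # Log a result; return True if this result triggers a promotion.
        self.recent.append(bool(won))
        self.recent = self.recent[-self.window:]
        window_full = len(self.recent) == self.window
        win_rate = sum(self.recent) / self.window
        can_advance = self.stage < len(self.stages) - 1
        if window_full and win_rate >= self.promote_at and can_advance:
            self.stage += 1
            self.recent = []          # start fresh on the new stage
            return True
        return False
```

A training loop would call `sample_opponent()` before each match, `record(won)` after it, and `snapshot(...)` every so often. Real setups often weight recent or stronger snapshots more heavily rather than sampling uniformly.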

Technical details are on my blog, at the link above.
