Back to browse
GitHub Repository

Vibe coded NES emulator by Codex, few shots

6 starsHTML

Codex builds a working NES Emulator in one hour

by zi2zi-jit·Feb 26, 2026·6 points·4 comments

AI Analysis

●●SolidWizardryBig Brain

Codex built a working NES emulator from spec—neat LLM test, but the emulator itself isn't novel.

Strengths
  • Rigorous constraint (no reference code) makes this a legitimate LLM stress test.
  • Full CPU/PPU implementation with multiple mappers and Cython optimization; genuinely playable.
Weaknesses
  • NES emulators exist in dozens of languages; this is a tech demo, not a practical tool.
  • Framing as LLM achievement overshadows that the actual artifact (Python emulator) is useful but not differentiated.
Category
Target Audience

AI researchers, LLM enthusiasts, emulation hobbyists

Similar To

Existing NES emulators (Nestopia, FCEUX)

Post Description

Hi folks! I know NES emulators have been implemented countless times, in practically every language imaginable.

However, having an LLM fully replicate the spec purely from memory—without referencing existing code—is still a significant challenge. It requires the underlying model to have strong anti-hallucination capabilities and solid long-term planning to keep from going astray. Because of this, building an NES emulator makes for an excellent LLM stress test.

Here is how the emulator was built:

Data Gathering: I asked Codex to download the necessary developer manuals and test suites. It was strictly prohibited from searching for reference implementations online.

Development: I instructed Codex to build the emulator until all test suites passed. This process was mostly hands-free; I only chimed in to encourage it to continue when it paused.

First Draft: After just 4-5 prompts, Codex delivered a functional, pure-Python emulator—though it ran at a sluggish 7 FPS.

Optimization: Asking Codex to optimize the app completely on its own didn't work this time. Instead, I had it generate a flamegraph, which identified the PPU update as the bottleneck. I then instructed Codex to rewrite the PPU in Cython without breaking the passing tests.

Overall, I'm incredibly impressed by Codex. I already knew it was capable of the task, but the speed was astonishing. It finished the project in under an hour, using merely 2% of my weekly Pro quota.

While the NES might be a relatively easy system to emulate, I think emulation could serve as a fantastic benchmark for testing future LLMs.

Similar Projects

Developer Tools●●Solid

Claude Code for Mobile GUI Automation

Splitting planner (Claude/Codex) from an orchestrator/skill layer that handles retries, rollback and stateful sessions is the project's best idea — it directly targets the brittleness of long LLM GUI workflows. The repo gives practical bits (CLI, install script, direct coordinate tap mode and unified JSON outputs), but it's early and niche: useful if you're building LLM-controlled phone automation, less interesting for general automation folks.

Big BrainNiche Gem
UgOrange
103mo ago