Neural Abyss – PyTorch multi-agent combat simulator
Per-agent PPO runtime with tensor-first simulation state is genuinely clever architecture.
Vectorized multi-agent RL combat sim with deterministic checkpointing and telemetry logging.
ML researchers, RL practitioners, game AI researchers experimenting with emergent multi-agent behavior.
OpenAI Multi-Agent Environments · DeepMind SMAC · PettingZoo
Per-agent PPO runtime with tensor-first simulation state is genuinely clever architecture.
Autonomous agents evolve batch correction algorithms to super-human performance without human intervention.
PPO beats classical elevator dispatch 6x on wait times; niche but rigorous.
178K neural net beats Pokémon roguelike with clever 1386-dim state encoding.
Runs PPO training entirely in-browser via TinyJit WebGPU kernels.
Revives deprecated OpenAI gym-http-api with Docker images and built-in browser monitoring views.