AVE Database Open taxonomy of 50 failure modes in multi-agent AI systems
First structured CVE-style database for AI agent failures—nobody else is doing this.
Behavioral governance framework for AI agents. Named anti-patterns, incident-driven rules, and failure mode engineering.
Named failure modes for AI agents, but it's markdown files—not an actual tool or implementation.
Developers building or configuring AI agents
AI Safety Frameworks · Constitutional AI · RLHF Guidelines
First structured CVE-style database for AI agent failures—nobody else is doing this.
1921 electrical engineering oscillator math stops AI loops at convergence.
Synthetic e-commerce site with failure injection beats flaky live-site testing.
GPU-vectorized PPO arena with thousands of agents, but emergent behavior research remains niche.
Multi-turn adaptive testing finds agent failures static benchmarks miss, but eval space is crowded.
Turns subjective docs quality into objective pass/fail metrics using dumb agent harnesses.