I visualized how AI agent systems accidentally become org charts
Interactive essay mapping AI agent chaos to emergent org charts.

Warning labels on retrieved documents actually make attacks five times more successful.
AI/LLM developers building RAG systems
Interactive essay mapping AI agent chaos to emergent org charts.
Qualitative eval workflow for PMs when LangSmith and Arize target ML engineers.
Testing framework for AI agents with LLM judges and SQLite result tracking.
Replays agent traces step-by-step to pinpoint exact failure turns automatically.
Iteratively improves agent harnesses from 67% to 87% on tau-bench using production traces.
Research findings on Medium, not an actual tool or product you can deploy or test.