Back to browse
GitHub Repository
5 starsPython

Khaos – Every AI agent I tested broke in under 30 seconds

by exordex·Feb 13, 2026·1 point·1 comment

AI Analysis

●●SolidBig BrainNiche Gem
The Take

Auto-patching LLM calls to inject faults and log telemetry is a neat technical trick that lets you fuzz real agent runs without changing your stack. The repo ships six intentionally vulnerable example agents and a CLI (discover/run/ci) with eval packs for security and resilience, so you can reproduce attacks and gate releases. It feels like an early, practical toolkit that fills a gap in agent security testing — adoption and more community-playbooks will determine how far it goes.

Category
Target Audience

ML engineers, security engineers, SREs and backend developers shipping AI agents

Post Description

We're shipping AI agents that process payments, query databases, and handle customer PII. Most of them can be tricked into bypassing their own safety policies in under 30 seconds. I built Khaos to prove it. It's an open-source chaos engineering framework that adversarially tests AI agents — prompt injection, tool misuse, data exfiltration, and infrastructure faults before they hit production.

The repo includes 6 intentionally vulnerable example agents (support bot, SQL agent, code executor, payment processor, API agent, document processor) with real attack scenarios showing exactly how they break. Try breaking them yourself.

Three commands to test your own agent:

- pip install khaos-agent - khaos discover - khaos run my-agent --pack security

It works with raw OpenAI/Anthropic, Gemini, LangGraph, CrewAI, AutoGen — any Python agent. Khaos auto-patches LLM calls to inject faults and log telemetry. No cloud needed, runs 100% locally.

Some of what it tests:

- Prompt injection (policy bypass, developer mode exploits) - Tool misuse (unauthorized DB writes, unscoped API calls) - Data exfiltration (PII extraction, credential leakage) - Fault injection (timeouts, rate limits, malformed tool responses)

We are the first platform that focuses on testing the Agent's environment, not just the model in the harness.

Plus 4 tutorials using the free Gemini API if you want to learn without spending anything. Repo: https://github.com/ExordexLabs/khaos-sdk Examples: https://github.com/ExordexLabs/khaos-examples BSD licensed. v1.0 just shipped — the attack library and framework adapters are growing. What agents are you most worried about breaking?

Similar Projects

Security●●Solid

ClawGuard – Detect 42 prompt injection patterns in <10ms

42 prompt injection patterns detected in under 10ms with zero dependencies.

Niche GemShip It
joergmichno
233mo ago