I can no longer afford the silicon. Here is my autonomous HPC agent
Recursive benchmarking loop is clever, but 4 commits and gimmick license raise concerns.
Agentic pentest tooling. Currently achieving 81% (KIMI K2.5) on XBOW's benchmark in full black-box. Completely Self-hosted. Every model available on LiteLLM (Ollama, anthropic, openai...)
Autonomous pentesting agent scoring 78% XBOW with fully local, sandboxed execution and real exploits.
Security researchers, penetration testers, red teamers, teams building secure web applications
Burp Suite · Nuclei · Custom agent-based fuzzers
Deadend is an agentic pentest CLI that automates vulnerability research in webapps.
the problem we are trying to solve : removing the time consumed in repetitive assessments, report generation and extracting relevant information to let them focus on vulnerability research but powerful enough to find issues or leads by itself when we’re in a deadend.
highlights : As of today, we scored 78% on XBOW’s benchmarks with claude-sonnet-4.5 in blackbox (we are currently iterating over the architecture of the agent and running the newest to get better results overall).
The agent runs entirely locally with optional self-hosted models. Shell tooling is isolated in Docker, and the python interpreter with WASM.
Some cool ideas are on the roadmap : CI/CD integrations, code review, bash completion, OWASP Top 10 plugins…
Docker is needed and it currently works only on MacOS Arm64 and Linux 64bits installable in one bash command.
Github Repo : https://github.com/xoxruns/deadend-cli Discord server : https://discord.gg/zwUVa3E7KT
Love to hear your thoughts and feedbacks!
Recursive benchmarking loop is clever, but 4 commits and gimmick license raise concerns.
Sandboxed agent that writes its own Python tools and remembers mistakes in JSON.
Sandboxed Docker execution keeps AI agents from wrecking your host system during pentests.
Runtime pentesting on every PR beats SAST — actual exploits, not hypothetical vulnerabilities.
Self-hosted agent browser with noVNC handoff when automation stalls on logins.
Dual MCP server architecture lets agents safely exec shell commands inside isolated LXC containers.