Back to browse
GitHub Repository

Open, model-agnostic benchmark for prompt-injection detectors — scored on both axes (attack catch-rate and false positives on real traffic), threshold-agnostic, and reproducible from raw scores.

0 starsPython

An open source benchmark for prompt-injection detectors

by gugit·Jun 29, 2026·2 points·0 comments

AI Analysis

●●SolidBig Brain

Dual-axis measurement comparing detectors at same catch rate, not arbitrary thresholds.

Strengths
  • Measures false positives on real traffic, not just attack detection rates
  • Threshold-agnostic comparison prevents gaming via tuned cutoff points
  • Author discloses commercial interest and includes their own model's weaknesses
Weaknesses
  • Zero stars and fresh repo means no community adoption yet
  • Benchmark maintained by detector vendor creates inherent conflict of interest
Category
Target Audience

AI security teams, LLM application developers, security researchers

Similar To

PromptInject · Garak · LLM Security benchmarks

Similar Projects

AI/ML●●●Banger

AI image models hallucinate history, we built a method to fix it it

Naive prompts hallucinate history; structured knowledge injection raises accuracy from 12.5% to 83.3%.

Big BrainWizardrySolve My Problem
MysticBirdie
123mo ago