Back to browse
GitHub Repository

Turn noisy command output into a short, actionable first pass for coding agents.

62 starsTypeScript

Open-source CLI that turns 128 test failures into 2 root causes

by bimamoglu·Mar 19, 2026·2 points·1 comment

AI Analysis

●●SolidNiche GemShip It

Heuristic-first parsing cuts 198K tokens to 129 before the LLM ever sees output.

Strengths
  • Local heuristics group failures before LLM escalation — smart cost optimization for token-heavy workflows.
  • Returns structured diagnosis with anchor file, likely fix, and decision signal for agents.
  • Works with pytest, typecheck, lint, build, and audit workflows out of the box.
Weaknesses
  • Narrow scope — only useful for developers running AI agents against noisy test output.
  • Early stage with 12 stars — needs more real-world validation beyond the author's benchmarks.
Target Audience

Developers using AI coding agents for debugging

Similar To

pytest-rich · pytest-sugar

Post Description

I kept running into the same issue with coding agents.

A test run fails, you get a huge wall of output, and most of the effort goes into figuring out what actually went wrong.

In many cases, the failures are not independent. It’s the same issue repeated across many tests.

In one case: 128 failures → 2 root causes

I built a small CLI that groups repeated failures into shared root causes before passing the result to the model.

It’s mainly built for coding agents, but works on raw CLI output as well.

On my backend tests, this reduced debugging time and token usage quite a bit.

Similar Projects

Developer Tools●●Solid

Nightwatch, The open-source, read-only AI SRE

Read-only AI agent architecture prevents production accidents during incident response.

Big BrainShip It
egorferber
3397d ago