Kelet – Root Cause Analysis agent for your LLM apps

Name: Kelet – Root Cause Analysis agent for your LLM apps
Availability: InStock
Author: almogbaku

by almogbaku·Apr 14, 2026·47 points·24 comments

Visit Project View on HN

AI Analysis

●●●BangerSolve My ProblemBig Brain

Auto-clusters failure patterns across sessions and suggests prompt patches.

Strengths

•Clusters hypotheses across sessions instead of just displaying raw logs
•Validates fixes with before/after reliability metrics automatically

Weaknesses

•Observability market is crowded with Langfuse and Arize
•Requires significant trust to let AI suggest production fixes

Post Description

I've spent the past few years building 50+ AI agents in prod (some reached 1M+ sessions/day), and the hardest part was never building them — it was figuring out why they fail.

AI agents don't crash. They just quietly give wrong answers. You end up scrolling through traces one by one, trying to find a pattern across hundreds of sessions.

Kelet automates that investigation. Here's how it works:

1. You connect your traces and signals (user feedback, edits, clicks, sentiment, LLM-as-a-judge, etc.) 2. Kelet processes those signals and extracts facts about each session 3. It forms hypotheses about what went wrong in each case 4. It clusters similar hypotheses across sessions and investigates them together 5. It surfaces a root cause with a suggested fix you can review and apply

The key insight: individual session failures look random. But when you cluster the hypotheses, failure patterns emerge.

The fastest way to integrate is through the Kelet Skill for coding agents — it scans your codebase, discovers where signals should be collected, and sets everything up for you. There are also Python and TypeScript SDKs if you prefer manual setup.

It’s currently free during beta. No credit card required. Docs: https://kelet.ai/docs/

I'd love feedback on the approach, especially from anyone running agents in prod. Does automating the manual error analysis sound right?

Similar Projects

Developer Tools●●●Banger

Sift, a small CLI that groups noisy test failures into root causes

Compresses 198k tokens to 129 by grouping test failures before the agent sees them.

Big BrainSolve My Problem

bimamoglu

202mo ago

Developer Tools●●Solid

Open-source CLI that turns 128 test failures into 2 root causes

Heuristic-first parsing cuts 198K tokens to 129 before the LLM ever sees output.

Niche GemShip It

bimamoglu

212mo ago

Infrastructure●●Solid

Kroot – dependency-graph root cause analysis for Kubernetes

Kubernetes root cause via dependency graphs, but kubectl debug and observability tools already solve this.

Solve My ProblemNiche Gem

An0n_Jon

113mo ago

Developer Tools●●●Banger

Andon – Toyota Production System for LLM Coding Agents

Toyota factory discipline for runaway LLM agents—stops bad deploys, learns from failures.

Big BrainSolve My ProblemWizardry

allnew_llc

303mo ago

Developer Tools●●●Banger

Sift, a local-first CLI for failures, root causes and next steps

198k tokens down to 129 — local heuristics beat LLM summarization.

Solve My ProblemBig BrainShip It

bimamoglu

312mo ago

SaaS●●Solid

Khaga – AI Infrastructure Diagnosis for AWS, GCP, Azure and Kubernetes

Multi-cloud diagnosis in <30s, but infra observability (Datadog, New Relic) already solves this better.

Solve My ProblemDark Horse

Gowrishankarhq

123mo ago