
GitHub Repository

Evaluate agent skill quality. Find the weakest link. Fix it. Prove it worked.

224 stars · JavaScript

SkillCompass – open-source quality evaluator for your AI skills

by yo103jg·Apr 13, 2026·2 points·0 comments


AI Analysis

●● Solid · Niche Gem · Big Brain

Six-dimension scoring for Claude skills, at a time when nothing else measures skill quality this way.

Strengths

• Usage-driven suggestions track what's stale, unused, or risky across your skill library
• Auto-evaluation gate catches quality regressions whenever any tool edits a skill file
• Version tracking shows whether an improvement actually worked, replacing tweak-and-hope workflows

Weaknesses

• Works only with Claude Code and OpenClaw; other agent frameworks are excluded entirely
• Requires Claude Opus 4.6 for scoring, which adds significant API cost per evaluation

Category

Developer Tools

Target Audience

Claude Code users maintaining custom agent skills

Similar Projects

AI/ML · ●● Solid

SkillCompass – Diagnose and Improve AI Agent Skills Across 6 Dimensions

Six-dimension scoring framework beats guesswork for improving Claude Code skills.

Big Brain · Niche Gem

yo103jg

202mo ago

AI/ML · ●● Solid

TweakIdea – 14-dimension startup idea evaluation in Claude Code

Fourteen parallel Claude agents grade your startup idea's evidence before you quit your job.

Big Brain · Niche Gem

ephx

101mo ago

Developer Tools · ● Mid

Goodthinking – PM skills for Claude Code

Context-blind auditors catch assumption gaps, but plugin lock-in and single-model support limit the audience.

Big Brain

faizanbhat

203mo ago

AI/ML · ●● Solid

Claude skill that evaluates B2B vendors by talking to their AI agents

AI agents interrogating other AI agents is a genuinely novel vendor evaluation approach.

Big Brain · Niche Gem

ogotlieb

4562mo ago

AI/ML · ●● Solid

Skill Lab – CLI tool for testing and optimizing agent skills

Security scanning catches data exfiltration before skills go live.

Niche Gem · Ship It

qu4rk5314

102mo ago

Developer Tools · ● Mid

Agent-evals – Claude skill to build your own evals

A Claude skill for agent evals, but LangSmith and Arize already own this space.

Solve My Problem

sauercrowd

911mo ago