GitHub Repository

A production-ready MCP server that builds a world model for codebases, preventing hallucinations, repeated mistakes, and regressions in Claude Code.

4 stars · Python

Memory layer for Claude Code (+10.2 pts on SWE-bench Verified benchmark)

Author: saravanan2294

by saravanan2294 · Jun 24, 2026 · 2 points · 0 comments

AI Analysis

●●● Banger · Big Brain · Wizardry

+10.2 SWE-bench points with contradiction resolution across Claude Code and Cursor.

Strengths

• Empirical benchmark results with 375 tests and per-task breakdowns in RESULTS.md.
• Confidence-weighted contradiction resolution with provenance tracking per fact.
• Works across multiple agents via the MCP protocol, not locked to one vendor.
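
To make "confidence-weighted contradiction resolution with provenance tracking" concrete, here is a minimal Python sketch of the general idea. It is not taken from the project: the `Fact` and `FactStore` names, the fields, and the tie-breaking rule are all hypothetical, and the real server may work quite differently.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Fact:
    """A single claim about the codebase (hypothetical schema)."""
    key: str           # what the fact is about, e.g. "test_runner"
    value: str         # the claimed value, e.g. "pytest"
    confidence: float  # 0.0 to 1.0, how much the source is trusted
    source: str        # provenance: where the claim came from


@dataclass
class FactStore:
    """Keeps one winning fact per key and remembers the claims it overrode."""
    facts: dict[str, Fact] = field(default_factory=dict)
    superseded: list[Fact] = field(default_factory=list)

    def assert_fact(self, new: Fact) -> Fact:
        current = self.facts.get(new.key)
        if current is None:
            # First claim for this key: nothing to resolve.
            self.facts[new.key] = new
            return new
        if current.value == new.value:
            # Same value, no contradiction: keep the better-supported copy.
            if new.confidence > current.confidence:
                self.facts[new.key] = new
            return self.facts[new.key]
        # Contradiction: the higher-confidence claim wins.
        # Assumption: on a tie, the existing fact is kept.
        if new.confidence > current.confidence:
            winner, loser = new, current
        else:
            winner, loser = current, new
        self.facts[new.key] = winner
        self.superseded.append(loser)  # provenance of the losing claim is preserved
        return winner


store = FactStore()
store.assert_fact(Fact("test_runner", "unittest", 0.4, "inferred from README"))
winner = store.assert_fact(Fact("test_runner", "pytest", 0.9, "observed in pyproject.toml"))
print(winner.value, "from", winner.source)
```

Because every fact carries its `source`, an agent reading the store can see both which claim won and where the overridden claim came from.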

Weaknesses

• The MCP ecosystem is still emerging, so adoption depends on agent tool support.
• Setup is complex: 26 tools and 19 CLI commands may overwhelm casual users.

Similar Projects

AI/ML · ●●● Banger

97% on SWE-bench Verified with subscription-token agents

97% on SWE-bench Verified with full artifact transparency, not just a score claim.

Big Brain · Zero to One

kimjune01

201mo ago

Developer Tools · ●● Solid

Codex context bloat? 87% avg reduction on SWE-bench Verified traces

Transparent proxy cuts Codex context tokens by 87% via working memory.

Big Brain · Niche Gem

george_ciobanu

1022mo ago

AI/ML · ●●●● Gem

New Benchmark from SWE-bench team is 0% solved

Agents fail completely at rebuilding binaries from scratch without source code.

Big Brain · Bold Bet · Zero to One

lieret

2431mo ago

AI/ML · ●●● Banger

RewardHackBench: Using sandboxes to stop agents from cheating

LLM judge on outgoing requests achieves 0% cheat rate while preserving 58% fair-solve ceiling.

Big Brain · Dark Horse

rotemtam

936d ago

Developer Tools · ●●● Banger

Tarmac – Know what Claude Code will cost before you run it

Conformal prediction trained on 3K tasks hits 81% cost accuracy.

Wizardry · Solve My Problem · Big Brain

sarthakaggarwal

213mo ago

AI/ML · ○ Pass

All the LM solutions on SWE-bench are bloated compared to humans

Twitter thread with a chart; not a product or tool.

lieret

103mo ago