GitHub Repository

Autonomous ML research loops for Claude Code with mechanical anti-fabrication guards.

0 starsPython

Novum – Automated ML Research Pipeline with Anti-Fabrication Guards

Name: Novum – Automated ML Research Pipeline with Anti-Fabrication Guards
Availability: InStock
Author: euanai

by euanai·Mar 4, 2026·1 point·3 comments

Visit Project View on HN

AI Analysis

●●●BangerBig BrainWizardryZero to One

Anti-fabrication constraints on a 30-hour autonomous research run—addresses real MLR-Bench hallucination data.

Strengths

•Mechanical enforcement of result validity (not just prompts) tackles quantified 80% fabrication rate problem.
•30-hour autonomous case study with iteration tracking, regression detection, and hypothesis filtering—production evidence.
•Integrates with Claude Code's agent loop, enabling true iterative research cycles with real constraints.

Weaknesses

•Requires max Claude Code plan ($20k/month) and NVIDIA 8GB+ GPU—prohibitive for most researchers.
•Author explicitly declines to verify paper draft, undermining core claim of trustworthy autonomous research output.

Similar Projects

Developer Tools●●●Banger

Claude-Autopilot: autonomous dev pipeline with risk-tiered review

Risk-tiered Codex review gates autonomous merges better than GitHub Copilot.

Ship ItBig Brain

axledbetter01

422mo ago

Developer Tools●●Solid

Cc-pipeline – Autonomous Claude Code pipeline that builds your project

Automates Claude Code sprawl, but existing agentic frameworks already chain LLM steps.

Big BrainShip It

timothyjoh

204mo ago

AI/ML●Mid

Kai – macOS native fully autonomous AI agent.

Claude-powered UI automation for macOS, but lacks concrete differentiator from Anthropic's own agents.

Bold BetShip It

StephaneBessa

314mo ago

Developer Tools●●Solid

Claude Extender – Autonomous Agent Management for Claude Code

Using plain markdown + YAML as the canonical agent format is a smart, low-friction choice — edit agents in your editor, commit them, and the daemon runs scheduled, watcher, or persistent sessions. It persits run logs, memory and costs as browsable markdown and can start MCP tool servers, which makes it immediately useful if you already run Claude Code; the flip side is the tight coupling to Anthropic/MCP limits broader appeal.

Niche GemShip It

wbnns

215mo ago

Developer Tools●●Solid

Intellegix – Autonomous Claude Code toolkit with loop driver and MCP

Loop driver + 15 slash commands for Claude Code, but orchestration over integration.

Big BrainNiche Gem

intellegix

104mo ago

AI/ML●Mid

A Open Source Claude Code setup to publish Research papers 10x faster

Useful Claude Code skills wrapper but five minutes per paper claim is marketing hyperbole.

Solve My ProblemShip It

FurstFly

103mo ago