Digest AI vs HN About

Make a free 3.8B model as reliable as one 7× bigger at parsing data

Make a free 3.8B model as reliable as one 7× bigger at parsing data

by pcoz·Jun 1, 2026·4 points·1 comment

Visit Project View on HN

AI Analysis

●●SolidBig BrainDark Horse

Deterministic verification loop makes 3.8B models match 7x larger ones for structured extraction.

Strengths

•Regime gate plus exact graph analysis plus explicit refusal is genuinely novel architecture.
•Zero runtime dependencies and runs with no model at all is impressive flexibility.
•Bounded re-extraction loop fills gaps by re-asking with pointed-out missing fields.

Weaknesses

•No benchmarks shown to verify the 3.8B vs 7x larger claim in the README.
•Instructor, Pydantic, and guidance already handle structured LLM output.

Category

Target Audience

Developers using local LLMs for structured data extraction

Similar To

Instructor · Pydantic · guidance

Post Description

https://github.com/pcoz/llm-feedback-control

Similar Projects

Developer Tools●●●Banger

AgentCost – Track, control, and optimize your AI spending (MIT)

One-line wrapping eliminates invisible LLM spend; real cost forecasting and model recommendations.

Solve My ProblemSlick

agentcostin

314mo ago

AI/ML●●●Banger

Datetime-bench: which datetime formats LLMs get right (and wrong)

RFC 3339 hits 88% accuracy while unix epoch fails 50% of the time.

Solve My ProblemDark Horse

diwank

214mo ago

Productivity●●●Banger

I taught Claude Code to file Indian income-tax returns (ITR)

Deterministic tax engine prevents LLM math errors while keeping the agent UX.

Big BrainSolve My ProblemNiche Gem

karanb192

302d ago

Developer Tools●●Solid

N3MO – Deterministic code intelligence via AST parsing, no embeddings

Deterministic AST parsing beats embeddings for code graph accuracy.

Big BrainNiche Gem

RajX_dev

2020d ago

Developer Tools●●Solid

A deterministic middleware to compress LLM prompts by 50-80%

Deterministic prompt compression cuts tokens 50-80% without extra model calls.

Big BrainNiche Gem

rosspeili

304mo ago

Developer Tools●●Solid

SafeRun – Replay debugging and inline prevention for AI agents

Replay-first architecture beats LangSmith's static traces for debugging non-deterministic agents.

Ship ItSolve My Problem

Tidianez

112mo ago