Back to browse
GitHub Repository

Lightweight, model-agnostic hallucination risk engine for LLM outputs

3 starsPython

Hallx – Hallucination risk scoring for LLM outputs

by akadhanu·Apr 2, 2026·2 points·2 comments

AI Analysis

MidShip It

Yet another hallucination checker when Guardrails and LMQL already cover this.

Strengths
  • Three-check approach covers schema, consistency, and grounding simultaneously
  • Model-agnostic support for OpenAI, Anthropic, Gemini, and Ollama
  • Sync and async APIs with simple pip install distribution
Weaknesses
  • Heuristic scoring without novel detection methodology beyond existing tools
  • One star indicates minimal traction in crowded hallucination-detection space
Category
Target Audience

Developers building production LLM pipelines

Similar To

Guardrails AI · LMQL · Arize Phoenix

Post Description

I got tired of LLM outputs silently failing in pipelines, so I built a small scoring layer around it.

It checks three things before your output moves forward: does it match the schema you expected is it consistent across runs does it actually align with the context you provided

Returns a confidence score and a risk level. That's mostly it.

Works with OpenAI, Anthropic, Gemini, Ollama and a few others. Sync and async both supported. It's heuristic, not a guarantee. If your context is bad, the scores will be too. Hit a star, if you found this useful.

Try now: pip install hallx

Similar Projects

AI/ML●●Solid

UQLM – Closed-book hallucination detection with UQ

Peer-reviewed LLM hallucination detector using uncertainty quantification, published in JMLR and TMLR.

Niche GemSolve My Problem
virenbajaj
312d ago
AI/ML●●Solid

Runtime governance layer that refuses high-risk LLM outputs

The demo implements post-generation admissibility checks and returns structured refusals (decision codes, rule triggered, divergence metrics and a stable prompt fingerprint) so you can audit enforcement decisions. It's a crisp, focused proof-of-concept for runtime enforcement — useful as a starting pattern — but it stops short of addressing bypass/adversarial vectors, deployment integration, or guarantees that make it enforceable at scale.

Niche GemShip It
milarien
113mo ago
AI/ML●●Solid

Agenda Intel MD – schemas and CLI to audit LLM strategic-risk briefs

Schema-valid evidence packs for AI agents when generic evals miss domain nuance.

Niche GemBig Brain
vassilbek
1018d ago