GitHub Repository

Lightweight, model-agnostic hallucination risk Analysis Library for LLM outputs

4 starsPython

Hallx – Hallucination risk scoring for LLM outputs

Name: Hallx – Hallucination risk scoring for LLM outputs
Availability: InStock
Author: akadhanu

by akadhanu·Apr 2, 2026·2 points·2 comments

Visit Project View on HN

AI Analysis

●MidShip It

Yet another hallucination checker when Guardrails and LMQL already cover this.

Strengths

•Three-check approach covers schema, consistency, and grounding simultaneously
•Model-agnostic support for OpenAI, Anthropic, Gemini, and Ollama
•Sync and async APIs with simple pip install distribution

Weaknesses

•Heuristic scoring without novel detection methodology beyond existing tools
•One star indicates minimal traction in crowded hallucination-detection space

Post Description

I got tired of LLM outputs silently failing in pipelines, so I built a small scoring layer around it.

It checks three things before your output moves forward: does it match the schema you expected is it consistent across runs does it actually align with the context you provided

Returns a confidence score and a risk level. That's mostly it.

Works with OpenAI, Anthropic, Gemini, Ollama and a few others. Sync and async both supported. It's heuristic, not a guarantee. If your context is bad, the scores will be too. Hit a star, if you found this useful.

Try now: pip install hallx

Similar Projects

AI/ML●Mid

PsiGuard – real-time hallucination monitoring for LLM apps

Hallucination detector for LLMs, but existing tools like Guardrails and Langfuse already do this.

Solve My Problem

brad_o_ley

104mo ago

Developer Tools●●●Banger

AI-assert – Constraint verification for LLM outputs (278 lines, Python)

Lightweight retry loop that improves IFEval instruction-following from 69% to 76% accuracy.

Solve My ProblemShip It

kaantahti

104mo ago

Security●●●Banger

How to analyze your LLM output – A behavioural health monitor for LLMs

Detects sycophancy and jailbreak drift in LLMs without needing model weights.

Big BrainBold BetNiche Gem

k-thimmaraju

1072mo ago

AI/ML●●Solid

UQLM – Closed-book hallucination detection with UQ

Peer-reviewed LLM hallucination detector using uncertainty quantification, published in JMLR and TMLR.

Niche GemSolve My Problem

virenbajaj

311mo ago

AI/ML●●Solid

Runtime governance layer that refuses high-risk LLM outputs

The demo implements post-generation admissibility checks and returns structured refusals (decision codes, rule triggered, divergence metrics and a stable prompt fingerprint) so you can audit enforcement decisions. It's a crisp, focused proof-of-concept for runtime enforcement — useful as a starting pattern — but it stops short of addressing bypass/adversarial vectors, deployment integration, or guarantees that make it enforceable at scale.

Niche GemShip It

milarien

115mo ago

AI/ML●●Solid

Agenda Intel MD – schemas and CLI to audit LLM strategic-risk briefs

Schema-valid evidence packs for AI agents when generic evals miss domain nuance.

Niche GemBig Brain

vassilbek

102mo ago