Detecting LLM hallucinations in <1ms using hidden states (RTX3050, 4GB)
Detects hallucinations via hidden state geometry in under 1ms with no training required.
S₀ Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models
+23.6pp HumanEval gain with zero inference overhead beats LoRA on hybrid models.
ML engineers, researchers working with Mamba or hybrid models
LoRA · QLoRA · PEFT
Detects hallucinations via hidden state geometry in under 1ms with no training required.
Ancient Rome Q&A benchmark shows 81pp accuracy lift, but lacks adversarial defense evidence.
Continual learning pipeline that fine-tunes weights from text feedback, real distributed execution options.
Outperforms existing open-source injection detectors on ProtectAI and Qualifire benchmarks.
Duplicating transformer layers boosts benchmark scores without a single step of training.
Mountain Curriculum routing: 5× compute to hard samples, skip mastered ones.