GitHub Repository

A lightweight library for normalizing speech transcripts before computing WER

26 stars · Python

Reproducible open-source STT API benchmarks with full methodology

by jilijeanlouis·Mar 24, 2026·1 point·1 comment

AI Analysis

●● Solid · Solve My Problem · Cozy

Removes spurious WER errors by normalizing formatting variants such as '$50' and 'fifty dollars' to the same form before scoring.
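To see why this matters, here is a minimal sketch of word-level WER (word edit distance divided by reference length). It is plain Python written for illustration and does not use the library; the function name `wer` and the example sentences are assumptions.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Classic dynamic-programming edit distance, one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Same spoken content, different formatting: one substitution plus one deletion.
print(wer("it costs fifty dollars", "it costs $50"))            # 0.5
# Once both sides are normalized to the same form, the penalty disappears.
print(wer("it costs fifty dollars", "it costs fifty dollars"))  # 0.0
```

Without normalization, a transcript that is correct in every spoken word is scored as 50% wrong in this example.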

Strengths
  • Three-stage pipeline (pre-process, word, post-process) is deterministic and YAML-configurable.
  • Expands contractions, converts symbols, and removes fillers before WER computation.
  • Language-aware normalization prevents false penalties for formatting differences.
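The three-stage idea described above can be sketched roughly as follows. This is not the library's API: the function names, the `CONFIG` dictionary (standing in for a YAML file), and every rule in it are hypothetical, and number-to-word conversion such as '$50' to 'fifty dollars' is left out for brevity.

```python
import re

# Hypothetical rule set; a real setup would load something like this from YAML.
CONFIG = {
    "symbols": {"&": " and ", "%": " percent "},
    "contractions": {"don't": "do not", "it's": "it is", "i'm": "i am"},
    "fillers": {"um", "uh", "erm"},
}


def pre_process(text: str, config: dict) -> str:
    """Stage 1: lowercase and replace symbols with their spoken form."""
    text = text.lower()
    for symbol, spoken in config["symbols"].items():
        text = text.replace(symbol, spoken)
    return text


def word_stage(text: str, config: dict) -> str:
    """Stage 2: per-word rules (drop fillers, expand contractions)."""
    words = []
    for token in text.split():
        word = token.strip(".,!?;:")
        if word in config["fillers"]:
            continue
        words.append(config["contractions"].get(word, word))
    return " ".join(words)


def post_process(text: str, config: dict) -> str:
    """Stage 3: remove leftover punctuation and collapse whitespace."""
    text = re.sub(r"[^\w\s']", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def normalize(text: str, config: dict = CONFIG) -> str:
    """Run the stages in a fixed order so the output is deterministic."""
    for stage in (pre_process, word_stage, post_process):
        text = stage(text, config)
    return text


print(normalize("Um, I don't know, it's 50% off & free."))
# i do not know it is 50 percent off and free
```

Applying the same `normalize` call to both the reference and the hypothesis before scoring is what keeps formatting differences from counting as word errors.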
Weaknesses
  • Published by Gladia, an STT vendor, so the normalization rules may be biased toward its own evaluation needs.
  • Only 2 GitHub stars suggests limited community adoption so far.
Category
Target Audience

STT engineers, ML researchers evaluating speech models

Similar To

jiwer · SpeechBrain · Kaldi scoring tools

Similar Projects

AI/ML · Mid

STT.ai

Another Whisper wrapper with a nice UI, but lacks novelty against Hugging Face Spaces.

Slick
nadermx
201mo ago