Back to browse
I got frustrated with macOS transcription apps so I built my own

I got frustrated with macOS transcription apps so I built my own

by Neolio·Feb 18, 2026·2 points·3 comments

AI Analysis

●●SolidSolve My ProblemSlick

Whisper with built-in retry fallbacks and hallucination filtering; hotkey dictation anywhere.

Strengths
  • Addresses real Whisper pain: hallucinations ('thanks for watching') via post-processing
  • Fallback chain ensures something always works (model retry → cloud → local)
  • System-wide voice typing without app-switching; batch file transcription included
Weaknesses
  • Cloud mode optional but still closed-source; no open alternative mentioned
  • Hallucination post-processor details not disclosed; unclear how robust
Category
Target Audience

macOS users (Apple Silicon/Intel) doing voice dictation across apps

Similar To

Superwhisper · Monologue · macOS native dictation

Post Description

Every local speech-to-text app I tried had the same problems. Files getting stuck mid-transcription with no way to retry. Retranscribing was either gated or buggy. No fallback when something fails - just a silent failure and your recording(I tend to talk for 5-10mins) is gone.

So I built Whisnap. Hold a hotkey, talk, release - text just appears where your cursor is. Local Whisper with Metal on Apple Silicon, nothing leaves your machine if you don't want it.

I built fallbacks on top of fallbacks. If a model can't process your audio, it tries a different one. You can always retranscribe a recording. Even the cloud mode (optional) has its own fallback chain: WebSocket streaming to batch upload to local Whisper. Something always works.

One thing I spent a bunch of time on: a post-processing pipeline for Whisper's hallucination problem. Anyone who's worked with Whisper knows it hallucinates "Thanks for watching, don't forget to like and subscribe" from silent audio, or loops the same phrase endlessly. The filter handles bracketed artifacts, known hallucination phrases, word repetition, sentence loops, and cross-text deduplication. Not perfect, but catches most of it.

The same binary also works as a CLI, "whisnap recording.wav" just works. I run an AI agent (OpenClaw) on the same Mac and instead of paying for ElevenLabs or other cloud transcription APIs, it just calls Whisnap's CLI and gets clean text back. Same models, no extra setup.

Stack: Tauri v2, whisper-rs, RNNoise for denoising, SIMD audio mixing, rubato resampling.

It's free, Mac only for now. Would love to know if the hallucination filter holds up for anyone else's use cases. https://whisnap.com/

Similar Projects

AI/ML●●Solid

PrivateWhisper – Run Whisper locally on macOS (offline transcription)

Packages Whisper into a one-click Mac app that records system audio, batch-processes files, and exports SRT/MD/DOCX — all kept on-device. Very useful if you hate terminal hacks and want privacy-first transcription, but it’s not novel (several local Whisper GUIs already exist) and the Apple‑Silicon‑only restriction plus limited detail on model sizes and performance hold it back.

Solve My ProblemNiche GemSlick
matyashajek
213mo ago
Developer Tools●●Solid

Docker-whisper: Self-hosted Whisper speech-to-text server (OpenAI API)

One-command Docker deploy from hwdsl2, who maintains trusted WireGuard and OpenVPN images.

CozySolve My Problem
hwdsl2
611mo ago