MoonshineFlow
Local Moonshine transcription streams text to any app, no audio leaves your Mac.
Native macOS menu bar app for realtime dictation with optional LLM polishing. Connects to any OpenAI Realtime-compatible backend — fully local on Apple Silicon with voxmlx + mlx-lm.
Streaming speech-to-text on-device beats Whisper's wait-for-silence UX pattern.
macOS users needing low-latency dictation; developers building voice interfaces
Whisper (OpenAI) · Superwhisper · Monologue
Most dictation tools, even local ones, use Whisper or similar offline models: you record, then wait for the transcript. Localvoxtral uses Mistral's Voxtral Realtime, one of the first open-source speech models with a natively streaming architecture. Words appear as you speak, not after you stop. It feels closer to someone typing along as you talk.
Press a shortcut, speak, and text gets typed directly into whatever app you're in. No cloud, no subscription, no data leaving your machine.
Two backend options:
voxmlx on Apple Silicon: I forked voxmlx to add a WebSocket server and memory optimizations. Runs a 4-bit quantized model on an M1 Pro. Audio and inference stay fully on-device. vLLM on NVIDIA GPU: tested on an RTX 3090, noticeably faster.
The app is native Swift (~97%), lives in the menu bar, and stays out of your way. Configurable shortcut, mic selection, auto-paste. GitHub: https://github.com/T0mSIlver/localvoxtral
Pre-built DMG available in Releases
Local Moonshine transcription streams text to any app, no audio leaves your Mac.
Hold-to-talk Whisper dictation that beats Mac's built-in tool for privacy and control.
Free open-source alternative to Wispr Flow that runs Whisper and Qwen entirely offline.
System audio transcription without API keys—but Whisper Desktop and Otter do this already, better-known.
Native Swift menu bar timers with App Sandbox privacy and negligible battery footprint.
Menu bar terminal when iTerm2 and Warp already have this.