Docker-whisper: Self-hosted Whisper speech-to-text server (OpenAI API)
One-command Docker deploy from hwdsl2, who maintains trusted WireGuard and OpenVPN images.
Privacy-first, self-hosted real-time speech-to-speech translation.
Self-hosted speech translation with sentence-level context instead of word-by-word.
Privacy-conscious organizations needing real-time translation
Whisper · Faster Whisper · Speech-to-Text Live
The goal was that there are people and organisations who need privacy around the tool they are using, and for the speech-to-speech translation, we haven't had many options.
We built the tool with Ollama, Faster Whisper, and Piper.
The tool is not limited to speech-to-speech translation only, but you can also share any of your tabs, whether you're watching a YouTube video in another language, the tool will give you audio output in your target language.
We are aware of how often context and tone get lost in translation, so we ensured translation quality by processing complete sentences instead of individual words.
Now we are focused on context support and tone adaptation.
If you want to see the project, here is the GitHub repo: https://github.com/PolyTalkIO/polytalk
One-command Docker deploy from hwdsl2, who maintains trusted WireGuard and OpenVPN images.
48 ASR models + WebGPU TTS offline beats Whisper-only alternatives like Otter.ai.
Local Whisper + NLLB translation with 300ms latency overlay for Discord and games.
Local Whisper transcription beats cloud subtitle tools on privacy and cost.
Self-hosted article-to-podcast with local TTS when Speechify already exists.
Voice cloning on ESP32 without cloud beats Yoto's subscription model completely.