GitHub Repository

Privacy-first, self-hosted real-time speech-to-speech translation.

45 stars · Python

We built an open-source tool for real-time speech-to-speech translation

by Saurabh_06·Jun 23, 2026·1 point·0 comments

AI Analysis

●● Solid · Big Brain · Niche Gem

Self-hosted speech translation with sentence-level context instead of word-by-word.

Strengths
  • Processes complete sentences for better context and tone preservation
  • Docker Compose stack with mock mode for testing without API keys
  • Browser audio capture works with any tab including YouTube videos
Weaknesses
  • Only 45 stars and 3 forks suggest limited community traction so far
  • Real-time speech translation already has established self-hosted options
Target Audience

Privacy-conscious organizations needing real-time translation

Similar To

Whisper · Faster Whisper · Speech-to-Text Live

Post Description

For the past few days, we have been working on PolyTalk, an open-source, self-hosted tool for real-time speech-to-speech translation.

Some people and organisations need privacy in the tools they use, and for speech-to-speech translation there haven't been many options. That gap is why we built it.

We built the tool with Ollama, Faster Whisper, and Piper.
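The three components suggest a transcribe, translate, synthesize chain. The following is only a rough sketch of that shape, not PolyTalk's actual code: `SpeechPipeline` and its three callables are hypothetical names, and the stages are injected as plain functions so that no real Faster Whisper, Ollama, or Piper API is assumed.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SpeechPipeline:
    """Hypothetical three-stage chain: speech -> text -> translated text -> speech."""

    transcribe: Callable[[bytes], str]      # speech-to-text stage (assumed interface)
    translate: Callable[[str, str], str]    # (text, target language) -> translated text
    synthesize: Callable[[str], bytes]      # text-to-speech stage (assumed interface)

    def run(self, audio: bytes, target_lang: str) -> bytes:
        # Each stage only sees the output of the previous one.
        text = self.transcribe(audio)
        translated = self.translate(text, target_lang)
        return self.synthesize(translated)


# Stand-in stages so the sketch runs without any models or API keys.
pipeline = SpeechPipeline(
    transcribe=lambda audio: audio.decode("utf-8"),
    translate=lambda text, lang: f"[{lang}] {text}",
    synthesize=lambda text: text.encode("utf-8"),
)
result = pipeline.run(b"hello world", "de")
```

Keeping the stages as injected functions means each one can be swapped for a stub during testing, independently of the others.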

The tool is not limited to live speech. You can also share any browser tab, so if you're watching a YouTube video in another language, it will give you audio output in your target language.

We know how often context and tone get lost in translation, so to improve translation quality we process complete sentences instead of individual words.
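One way to get complete sentences out of a stream of partial transcripts is to buffer fragments until sentence-ending punctuation appears. This is a minimal sketch of that idea, not the project's implementation: `SentenceBuffer` is a hypothetical helper, and it assumes the speech-to-text stage emits punctuated text.

```python
import re

# Split after '.', '!' or '?' when followed by whitespace.
_SENTENCE_BREAK = re.compile(r"(?<=[.!?])\s+")


class SentenceBuffer:
    """Accumulates transcript fragments and releases only complete sentences."""

    def __init__(self) -> None:
        self._pending = ""

    def feed(self, fragment: str) -> list[str]:
        # Join the new fragment onto whatever was left over from earlier calls.
        self._pending = f"{self._pending} {fragment}".strip()
        parts = _SENTENCE_BREAK.split(self._pending)
        if parts[-1].endswith((".", "!", "?")):
            # The buffer ends on a sentence boundary, so everything is complete.
            complete, self._pending = parts, ""
        else:
            # Hold back the trailing partial sentence until more text arrives.
            complete, self._pending = parts[:-1], parts[-1]
        return [s for s in complete if s]

    def flush(self) -> str:
        # Return any leftover partial sentence, e.g. when the stream ends.
        leftover, self._pending = self._pending, ""
        return leftover


buf = SentenceBuffer()
first = buf.feed("Hello there")
second = buf.feed("my friend. How are")
third = buf.feed("you?")
```

A punctuation-based split like this will break on abbreviations such as "Dr.", so a real system would likely need a more careful segmenter or pause-based cues from the audio.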

Now we are focused on context support and tone adaptation.

If you want to see the project, here is the GitHub repo: https://github.com/PolyTalkIO/polytalk

Similar Projects

Developer Tools · ●● Solid

Docker-whisper: Self-hosted Whisper speech-to-text server (OpenAI API)

One-command Docker deploy from hwdsl2, who maintains trusted WireGuard and OpenVPN images.

Cozy · Solve My Problem
hwdsl2
612mo ago
Hardware · ●●● Banger

An Open-Source Yoto Toy with Qwen3-TTS

Voice cloning on ESP32 without cloud beats Yoto's subscription model completely.

Wizardry · Zero to One · Dark Horse
akadeb
313mo ago