I made HappySRT to transcribe, translate, & summarize easily
Threaded transcription + translation + summarization, but Opus Clip, Rev, and Descript own this category.

Native-script voice input for 11+ Indian languages, code-mixed, works everywhere.
Hindi, Bengali, Tamil, Telugu speakers who prefer speaking to typing in native scripts
Google Live Translate · Otter.ai · Whisper (OpenAI)
So I built him a tool where he could just speak.
He dictated a two-page letter in Bengali, read the transcription, and said: "I didn't even need to change a single comma."
That's when I knew this needed to exist.
Parrot transcribes and translates speech in 11+ Indian languages. The difference: it's significantly more accurate than Whisper on Hindi, Tamil, Telugu, Bengali, and handles code-mixed speech (Hinglish, etc.) naturally. Outputs in native scripts (নমস্কার not "nomoshkar") or Romanized.
Sits in your menu bar. Press a hotkey, speak in any app — WhatsApp Web, Word, Slack, browsers. It writes.
Built for everyone who thinks in their language but struggles with virtual keyboards that don't correct grammar or formatting.
Desktop app for Windows & macOS. Free to try.
Shipping improvements weekly. Would love feedback from anyone dealing with multilingual input.
Threaded transcription + translation + summarization, but Opus Clip, Rev, and Descript own this category.
Audio translation tool, but Whisper + translation APIs already commoditized this.
ISBN search for book translations across 100+ languages including Krio.
Microphone audio is captured in the browser as PCM and streamed over WebSockets to a Node server that pipes live segments into Mistral's Voxtral realtime API, then immediately hits DeepL for translation — a straightforward, usable demo of live STT->MT. It isn't reinventing the wheel, but the repo bundles the full flow (browser keys, Docker, server.mjs, and UI) so you can spin up an end-to-end test quickly; main downside is it's an integration demo that depends on external API keys and limits for real-world scale.
Offline CLI command translator, but model training data stays private—can't verify accuracy.