I made HappySRT to transcribe, translate, & summarize easily
Threaded transcription + translation + summarization, but Opus Clip, Rev, and Descript own this category.
Live AI-generated subtitles overlay for any browser video — Whisper transcription + translation, running locally.
Local Whisper transcription beats cloud subtitle tools on privacy and cost.
Multilingual viewers, accessibility users, content consumers watching non-native language videos
Chrome Live Caption · Language Reactor · Speechify
Rust CLI chains Podcast Index, Whisper, and YouTube into one command-line workflow.
80ms local OCR overlay for Genshin dialogue beats cloud translation latency hands down.
Microphone audio is captured in the browser as PCM and streamed over WebSockets to a Node server, which pipes live segments into Mistral's Voxtral realtime API and then immediately sends them to DeepL for translation: a straightforward, usable demo of live speech-to-text feeding machine translation (STT -> MT). It doesn't reinvent the wheel, but the repo bundles the full flow (browser keys, Docker, server.mjs, and UI), so you can spin up an end-to-end test quickly. The main downside is that it's an integration demo: it depends on external API keys and on those APIs' limits, which constrains real-world scale.
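The capture -> STT -> MT relay described here can be sketched in a few lines. This is a minimal illustration, not the repo's code (which is a Node server in server.mjs): it is written in Python, the names `float_to_pcm16`, `relay`, `fake_transcribe`, `fake_translate`, and `mic` are hypothetical, and the two `fake_*` coroutines are stand-ins for the Voxtral and DeepL calls, whose real request formats are not shown.

```python
import asyncio
import struct
from typing import AsyncIterator, Awaitable, Callable, List


def float_to_pcm16(samples: List[float]) -> bytes:
    """Convert float samples in [-1.0, 1.0] to little-endian 16-bit PCM."""
    clipped = [max(-1.0, min(1.0, s)) for s in samples]
    return struct.pack(f"<{len(clipped)}h", *(int(s * 32767) for s in clipped))


async def relay(
    chunks: AsyncIterator[bytes],
    transcribe: Callable[[bytes], Awaitable[str]],
    translate: Callable[[str], Awaitable[str]],
) -> List[str]:
    """Send each PCM chunk to STT, then translate every non-empty segment."""
    translated: List[str] = []
    async for chunk in chunks:
        segment = await transcribe(chunk)
        if segment:  # skip chunks that produced no text (e.g. silence)
            translated.append(await translate(segment))
    return translated


# Hypothetical stand-in for the speech-to-text service (assumption).
async def fake_transcribe(chunk: bytes) -> str:
    return f"{len(chunk) // 2} samples"  # 2 bytes per 16-bit sample


# Hypothetical stand-in for the translation service (assumption).
async def fake_translate(text: str) -> str:
    return f"[de] {text}"


async def mic(frames: List[List[float]]) -> AsyncIterator[bytes]:
    """Simulate the browser side: yield one PCM chunk per captured frame."""
    for frame in frames:
        yield float_to_pcm16(frame)


def run_demo() -> List[str]:
    frames = [[0.0, 0.5, -0.5], [1.0, -1.0]]
    return asyncio.run(relay(mic(frames), fake_transcribe, fake_translate))
```

In a real deployment the `mic` generator would be replaced by messages arriving on a WebSocket, and the two stand-ins by authenticated API clients; the sequential await-per-chunk loop is the simplest design and would likely need buffering or concurrency to keep latency low.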
Audio translation tool, but Whisper + translation APIs already commoditized this.
Free offline Whisper on Android, but its 88–93% accuracy lags Google Recorder's 95–96% because of design constraints.