Alpha of a Speech to Text iOS App (Feature Requests Welcome)
Offline voice-to-text for iOS, but Otter, Apple Notes, and Whisper alternatives exist.
An AI-infused teleprompter that runs entirely in the browser
Voice-tracked scrolling runs entirely in-browser using Moonshine ONNX, no backend needed.
Video creators, presenters, streamers
Descript · Teleprompter Premium · CapCut
Paste a script, press record, and it highlights the current word as you speak. If you pause, it waits; if you skip lines, it finds its place again.
Everything runs entirely in the browser — speech recognition (Moonshine ONNX), VAD, and fuzzy script matching.
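For readers wondering what fuzzy script matching could look like, here is a minimal sketch. It is not the project's actual code: the function names, the 0.75 similarity threshold, the 12-word lookahead, and the choice of Levenshtein distance are all assumptions made for illustration. The idea is to compare the last few recognized words against the script near the current position, and to fall back to a search of the whole script when nothing nearby matches.

```typescript
// Hypothetical sketch: every name and constant below is illustrative, not taken from the project.

/** Lowercase, strip punctuation, and split into words so "Fox," matches "fox" (ASCII only, for brevity). */
function normalize(text: string): string[] {
  return text
    .toLowerCase()
    .replace(/[^a-z0-9'\s]/g, " ")
    .split(/\s+/)
    .filter((w) => w.length > 0);
}

/** Levenshtein edit distance, computed one row at a time. */
function editDistance(a: string, b: string): number {
  let prev: number[] = [];
  for (let j = 0; j <= b.length; j++) prev.push(j);
  for (let i = 1; i <= a.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr.push(Math.min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost));
    }
    prev = curr;
  }
  return prev[b.length];
}

/** Word similarity in [0, 1]; 1 means identical. */
function similarity(a: string, b: string): number {
  const longest = Math.max(a.length, b.length);
  return longest === 0 ? 1 : 1 - editDistance(a, b) / longest;
}

/** Average similarity of the heard words against the script starting at `start`. */
function scoreAt(script: string[], heard: string[], start: number): number {
  let total = 0;
  for (let k = 0; k < heard.length; k++) {
    const word = script[start + k];
    // Words that would fall past the end of the script contribute nothing.
    total += word === undefined ? 0 : similarity(word, heard[k]);
  }
  return total / heard.length;
}

/** Best-scoring start position in the half-open range [from, to). */
function bestMatch(script: string[], heard: string[], from: number, to: number) {
  let result = { start: from, score: 0 };
  for (let start = from; start < to; start++) {
    const score = scoreAt(script, heard, start);
    if (score > result.score) result = { start, score };
  }
  return result;
}

/**
 * Returns the index of the next script word the speaker is expected to say.
 * `cursor` is the previous value; `transcript` is the latest recognizer output.
 */
function advance(
  script: string[],
  cursor: number,
  transcript: string,
  threshold = 0.75,
  lookahead = 12,
): number {
  const heard = normalize(transcript).slice(-4); // match on the tail only
  if (heard.length === 0) return cursor; // silence: hold position

  // Search near the cursor first so repeated phrases resolve to the closest one.
  const from = Math.max(0, cursor - heard.length);
  const to = Math.min(cursor + lookahead, script.length);
  const near = bestMatch(script, heard, from, to);

  // Nothing nearby: the speaker may have skipped lines, so search the whole script.
  const match = near.score >= threshold ? near : bestMatch(script, heard, 0, script.length);

  // Off-script speech or noise scores below the threshold and leaves the cursor alone.
  if (match.score < threshold) return cursor;
  return Math.min(match.start + heard.length, script.length);
}
```

A caller would keep `cursor` in state, call `advance(script, cursor, text)` whenever the recognizer emits text, and highlight the word at the returned index. Matching on a short tail keeps each update cheap, and the per-word similarity tolerates small misrecognitions such as "fax" for "fox". The obvious weakness is that a phrase repeated in distant parts of the script can pull the fallback search to the wrong occurrence.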
Demo: https://larsbaunwall.github.io/promptme-ai
Most of the project was initially built using Perplexity Computer, which made for an interesting agentic coding workflow.
Curious what people think about the script alignment approach.
48 ASR models + WebGPU TTS offline beats Whisper-only alternatives like Otter.ai.
Auto-timestamped voice comments for YouTube, but the Chrome Web Store listing is currently unavailable.
This repo bundles a complete local audio loop: the client captures audio, the backend transcribes it with Parakeet, queries a quantized Mistral LLM via Ollama, then renders speech with Kokoro, or with Qwen3-TTS for cloning. It reports a round trip of about 1 s on an RTX 5070. It's a practical, take-it-home demo for running privacy-first voice agents, but it is still a demo: it requires specific tooling (Ollama, GPU headroom), has obvious TODOs (VAD, better warmup for cloning), and isn't reinventing the architecture.
Apple's neglected on-device AI APIs finally get a real showcase here.
Processes one hour of audio in under three seconds while topping the Open ASR Leaderboard.