TTSLab – A voice AI agent and TTS lab running in the browser via WebGPU

Name: TTSLab – A voice AI agent and TTS lab running in the browser via WebGPU
Availability: InStock
Author: MbBrainz

by MbBrainz·Feb 23, 2026·5 points·3 comments

Visit Project View on HN

AI Analysis

●●●BangerWizardryZero to OneShip It

Full voice agent (STT→LLM→TTS) runs locally on GPU, no backend needed.

Strengths

•WebGPU acceleration eliminates network latency for real-time voice interactions.
•Local inference means text and audio never leave your device — genuine privacy win over cloud APIs.
•Instant model comparison and hardware benchmarking built in, useful for dev/research workflows.

Weaknesses

•WebGPU browser support is still fragmented; WASM fallback may be slow on older hardware.
•Voice agent capability is early-stage and may struggle with complex multi-turn conversations.

Post Description

I built TTSLab — a free, open-source tool for running text-to-speech and speech-to-text models directly in the browser using WebGPU and WASM.

No API keys, no backend, no data leaves your machine.

When you open the site, you'll hear it immediately — the landing page auto-generates speech from three different sentences right in your browser, no setup required.

You can then try any model yourself: type text, hit generate, hear it instantly. Models download once and get cached locally.

The most experimental feature: a fully in-browser Voice Agent. It chains speech-to-text → LLM → text-to-speech, all running locally on your GPU via WebGPU. You can have a spoken conversation with an AI without a single network request.

Currently supported models: - TTS: Kokoro 82M, SpeechT5, Piper (VITS) - STT: Whisper Tiny, Whisper Base

Other features: - Side-by-side model comparison - Speed benchmarking on your hardware - Streaming generation for supported models

Source: https://github.com/MbBrainz/ttslab (MIT)

Feedback I'd especially like: 1. How does performance feel on your hardware? 2. What models should I add next? 3. Did the Voice Agent work for you? That's the most experimental part.

Built on top of ONNX Runtime Web (https://onnxruntime.ai) and Transformers.js — huge thanks to those communities for making in-browser ML inference possible.

Similar Projects

Developer Tools●●●Banger

TTSLab – Text-to-speech that runs in the browser via WebGPU

Whisper + Kokoro entirely in-browser via WebGPU, no API keys or network requests.

WizardryShip ItSolve My Problem

MbBrainz

303mo ago

AI/ML●●Solid

TTS.ai - Text to Speech

20+ TTS models in one place, but Eleven Labs and Play.ht already own this space.

Crowd PleaserSlick

nadermx

103mo ago

AI/ML●●Solid

My 16MB vibe-coded voice cloning app

Shrinks the usual TTS bloat into a 16MB Electron-alternative wrapper while still letting you clone voices from a short sample and 'design' voices from text prompts. It handles model downloads for you, supports batch exports and macOS auto-updates — smart product trade-offs. Caveat: the app binary is tiny, but the underlying TTS models are downloaded on demand, so expect large model pulls behind the scenes.

Dark HorseWizardryShip It

yoav

203mo ago

AI/ML●●Solid

Local Voice Assistant

This repo bundles a complete local audio loop — client captures audio, backend transcribes with Parakeet, queries a quantized Mistral LLM via Ollama, then renders speech with Kokoro or Qwen3-TTS for cloning — and reports ~1s round-trip on an RTX5070. It’s a practical, take-it-home demo for running privacy-first voice agents, though it’s still a demo: requires specific tooling (Ollama, GPU headroom), has obvious TODOs (VAD, better warmup for cloning), and isn’t reinventing the architecture.

WizardryNiche Gem

armcat

203mo ago

AI/ML●●Solid

KokoClone – Zero-shot voice cloning using Kokoro TTS

Kokoro voice cloning with multilingual support, but voice cloning itself is crowded.

Niche GemShip It

Ashish106

213mo ago

Developer Tools●●●Banger

Sokuji – Open-source speech translator with on-device AI WASM/WebGPU

48 ASR models + WebGPU TTS offline beats Whisper-only alternatives like Otter.ai.

WizardryShip ItSolve My Problem

jiangzhuo

203mo ago