TTS.ai

Name: TTS.ai
Availability: InStock
Author: nadermx

by nadermx·Apr 18, 2026·3 points·0 comments

Visit Project View on HN

AI Analysis

●●SolidSlickCrowd Pleaser

Twenty-seven open-source TTS models in one UI with no signup required for the free tier.

Strengths

•Aggregates twenty-seven open-source models including Kokoro and Bark in one interface.
•No signup required for free tier removes friction for quick testing.
•Broad toolset includes voice cloning, audio enhancement, and speech-to-text features.

Weaknesses

•Wrapper around existing open models offers no proprietary technology or differentiation.
•Hosted TTS market is crowded with established players like ElevenLabs.

Similar Projects

AI/ML●●●Banger

Real-time local TTS (31M params, 5.6x CPU, voice cloning, ONNX)

5.6x realtime on CPU with voice cloning beats most local TTS options.

WizardryDark Horse

ZDisket

443mo ago

AI/ML●●Solid

My 16MB vibe-coded voice cloning app

Shrinks the usual TTS bloat into a 16MB Electron-alternative wrapper while still letting you clone voices from a short sample and 'design' voices from text prompts. It handles model downloads for you, supports batch exports and macOS auto-updates — smart product trade-offs. Caveat: the app binary is tiny, but the underlying TTS models are downloaded on demand, so expect large model pulls behind the scenes.

Dark HorseWizardryShip It

yoav

204mo ago

AI/ML●●Solid

KokoClone – Zero-shot voice cloning using Kokoro TTS

Kokoro voice cloning with multilingual support, but voice cloning itself is crowded.

Niche GemShip It

Ashish106

213mo ago

Hardware●●●Banger

An Open-Source Yoto Toy with Qwen3-TTS

Voice cloning on ESP32 without cloud beats Yoto's subscription model completely.

WizardryZero to OneDark Horse

akadeb

313mo ago

AI/ML●●Solid

Vui – open-source voice mode

300M TTS model running locally on consumer GPU or Apple Silicon.

Niche GemBig Brain

bazlan

209d ago

AI/ML●●Solid

Local Voice Assistant

This repo bundles a complete local audio loop — client captures audio, backend transcribes with Parakeet, queries a quantized Mistral LLM via Ollama, then renders speech with Kokoro or Qwen3-TTS for cloning — and reports ~1s round-trip on an RTX5070. It’s a practical, take-it-home demo for running privacy-first voice agents, though it’s still a demo: requires specific tooling (Ollama, GPU headroom), has obvious TODOs (VAD, better warmup for cloning), and isn’t reinventing the architecture.

WizardryNiche Gem

armcat

204mo ago