Local TTS for OpenClaw on Apple Silicon (MLX-Powered, Zero Setup)
MLX-powered local TTS plugin for OpenClaw—elegant but audience is Apple Silicon only.
Make Local AI Toys, Robots, Devices that with a MacBook and an Arduino ESP32
Free local CharacterAI with voice cloning under 10s audio, plus ESP32 hardware integration.
Privacy-focused users, hardware hackers, ESP32 tinkerers
The voice pipeline currently supports MLX on any M1 through M5 chip. I used Whisper-Turbo for STT, Qwen3.5-9B-4bit for the LLM and Qwen3-TTS-0.6B-4bit for TTS.
The repo also has a Websocket Transport to add these voices to devices powered by the ESP32 via secure websockets.
MLX-powered local TTS plugin for OpenClaw—elegant but audience is Apple Silicon only.
Tauri GUI wrapper around mlx-lm—useful for Mac users, but local fine-tuning UIs already exist.
Full MLX power in Ruby: lazy arrays, Metal GPU, transformer layers—but Ruby adoption risk.
GPU working set estimation catches memory overcommit before your 7B model swaps to SSD.
Native Swift inference with SSD streaming runs 100B MoE models without kernel panics.
Packages Whisper into a one-click Mac app that records system audio, batch-processes files, and exports SRT/MD/DOCX — all kept on-device. Very useful if you hate terminal hacks and want privacy-first transcription, but it’s not novel (several local Whisper GUIs already exist) and the Apple‑Silicon‑only restriction plus limited detail on model sizes and performance hold it back.