I built a sub-500ms latency voice agent from scratch
Outperformed Vapi 2× on latency by treating voice as turn-taking, not transcription.
Real-time voice conversations using OpenAI's native SIP integration. Sub-200ms latency for AI agents.
Native SIP speech-to-speech cuts latency vs. STT-LLM-TTS chains.
AI agent builders, contact centers, automation engineers
Twilio Voice · Retell AI · Daily.co real-time API
Outperformed Vapi 2× on latency by treating voice as turn-taking, not transcription.
Custom Metal shaders beat llama.cpp and MLX—1.67x faster on M4 Max.
Sub-600ms RAG across continents without GPU beats standard vector-DB-plus-LLM stacks.
1.2% WER in 150ms beats Whisper and Deepgram, but pricing undercuts adoption vs free.
Smart key management via 1Password keeps secrets out of Claude's context window.
Sub-sentence TTS streaming beats Piper/Sherpa-ONNX latency by token-level triggering on CPU.