Ava – AI Voice Agent for Traditional Phone Systems(Python+Asterisk/ARI)
Adds AI voice to legacy Asterisk systems without ripping out existing telephony.
Open source voice AI platform. Self-hosted alternative to Vapi and Retell. On Prem, BYOK across Speech to Speech or LLM/STT/TTS, with a visual workflow builder, MCP native and telephony support.
Open-source Vapi alternative with pre-recorded audio responses for lower latency calls.
Developers building voice bots, call center automation teams
Vapi · Retell AI · Bland AI
Hi HN,
We’re the Dograh team (YC alumni). While building voice bots, we found that wiring WebRTC/ Telephony + STT + LLM + TTS took more time than the bots themselves. Teams are spending weeks on plumbing - handling call flows, extracting variables, dealing with telephony edge cases, and redeploying for small changes. Tools like Vapi/Retell are easy to start with but come with lock-in and platform fees. So we built Dograh: a 100% open-source platform that handles the full stack, with a visual workflow builder and self-hosting by default.
Dograh v1.20 introduces two major additions: 1. Gemini 3.1 Live support Run fully real-time voice agents using Gemini’s streaming APIs, without stitching together separate STT + LLM + TTS components. 2. Pre-recorded audio (hybrid voice) Upload real voice clips (greetings, confirmations, etc.), and the agent plays them instantly while using TTS only for dynamic responses. This reduces latency, improves naturalness, and cuts TTS costs.
It also includes:
- Plug-and-play LLM / STT / TTS (including self-hosted models) - Telephony integrations (Twilio, Vonage, Telnyx) along with Call Transfer - Post-call QA, transcripts, and variable extraction - Observability via Langfuse (OpenTelemetry traces + prompt playground)
Try it now: If you have Docker, you can run the below command for a 2-minute setup (no API keys needed out of the box).
``` curl -o docker-compose.yaml https://raw.githubusercontent.com/dograh-hq/dograh/main/dock... REGISTRY=ghcr.io/dograh-hq ENABLE_TELEMETRY=true docker compose up --pull always ```
Looking Ahead: We’re expanding self-hosted model support: you can already bring any LLM (e.g. Llama, Qwen) or TTS (Kokoro, Voxtral) by configuring API endpoints. We are working on updates that will enable anyone to run everything on a single server - your AI models along with Dograh Orchestration.
Looking forward to hearing thoughts of the community.
Adds AI voice to legacy Asterisk systems without ripping out existing telephony.
Voice agent orchestration with no-code studio, but orchestrates off-the-shelf APIs like everyone else.
This repo bundles a complete local audio loop — client captures audio, backend transcribes with Parakeet, queries a quantized Mistral LLM via Ollama, then renders speech with Kokoro or Qwen3-TTS for cloning — and reports ~1s round-trip on an RTX5070. It’s a practical, take-it-home demo for running privacy-first voice agents, though it’s still a demo: requires specific tooling (Ollama, GPU headroom), has obvious TODOs (VAD, better warmup for cloning), and isn’t reinventing the architecture.
Outperformed Vapi 2× on latency by treating voice as turn-taking, not transcription.
Sub-sentence TTS streaming beats Piper/Sherpa-ONNX latency by token-level triggering on CPU.
CTF-style flags for voice prompt injection make learning LLM security actually fun.