Qiaohu – offline multimodal voice assistant on Snapdragon 8 Gen 2

Name: Qiaohu – offline multimodal voice assistant on Snapdragon 8 Gen 2
Availability: InStock
Author: donge

by donge·Apr 9, 2026·2 points·0 comments

Visit Project View on HN

AI Analysis

●●●BangerWizardryNiche GemZero to One

Full voice assistant pipeline with barge-in running entirely offline on Snapdragon GPU.

Strengths

•Barge-in capability interrupts TTS mid-sentence, requiring precise audio pipeline coordination.
•Gemma 4 2B via LiteRT-LM on mobile GPU with no cloud dependency whatsoever.
•Complete build instructions with exact library versions and model download paths.

Weaknesses

•Chinese-language only limits audience; English ASR/TTS would broaden appeal significantly.
•2.6GB model download on first run is substantial for mobile data constraints.

Post Description

Built a fully offline Chinese voice assistant that runs entirely on-device (no server, no cloud).

Pipeline: VAD (Silero, 16kHz) → LLM (Gemma 4 2B via LiteRT-LM on Snapdragon GPU) → TTS (sherpa-onnx + matcha-icefall-zh-baker, 22050Hz) → speaker. Barge-in interrupts TTS mid-sentence.

Demo video in the README. Code and full setup instructions on GitHub.