Parlor Jarvis – Realtime AI (audio+screen in, voice out) & multilingual
Multilingual local AI with screen sharing beats Parlor's English-only camera input.
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.
Runs Gemma 4 E2B and Kokoro TTS locally with barge-in and vision.
Developers building voice AI, privacy-focused users, language learners
OpenAI Realtime API · LiveKit Agents · Bule AI
Multilingual local AI with screen sharing beats Parlor's English-only camera input.
Only Apple Silicon toolkit streaming GCS data during audio fine-tuning without OOM.
Full voice assistant pipeline with barge-in running entirely offline on Snapdragon GPU.
2B model beats 12B on some tasks, saving hardware costs for edge deployment.
Clean Swift wrapper for Gemma 4 with vision and audio on iPhone.
50-token compact code output beats raw 5,000-token Excalidraw JSON — clever compression.