Recreate Thinking Machines 276B voice demo with duct tape and 8B model

Name: Recreate Thinking Machines 276B voice demo with duct tape and 8B model
Availability: InStock
Author: mrkn1

by mrkn1·Jun 12, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●●●BangerWizardryDark HorseBig Brain

Runs Thinking Machines-style voice agent on a laptop CPU with no GPU required.

Strengths

•Four complex behaviors work end-to-end on CPU: friend detection, translation, slouch detection, background search.
•Single asyncio loop orchestrates webcam, mic, speaker, and network calls without GPU acceleration.
•Honest about limitations—duct-tape orchestration, not claiming to match 276B architecture.

Weaknesses

•Depends on external APIs (DeepInfra, Serper) for LLM inference and web search.
•Demo replication rather than a general-purpose product with broader use cases.

Similar Projects

AI/ML●●●Banger

Replicating Thinking Machines Interaction Model demo for $0.01 [video]

Sub-cent CPU-only voice agent with vision-keyed proactivity beats cloud APIs on cost.

WizardryBig Brain

mrkn1

1025d ago

AI/ML●●●Banger

Cheap-IM – CPU-only voice agent approximating Thinking Machines' demo

Runs real-time vision-keyed voice agents on a laptop CPU without custom silicon or training.

Big BrainWizardryDark Horse

mrkn1

4025d ago

AI/ML●●Solid

Tired of duct-taping access control into agent prompts. Here's the fix

Identity and access control between agents solves the single-user assumption most frameworks make.

Bold BetNiche Gem

zwigglers

22239d ago

AI/ML●●Solid

Local Voice Assistant

This repo bundles a complete local audio loop — client captures audio, backend transcribes with Parakeet, queries a quantized Mistral LLM via Ollama, then renders speech with Kokoro or Qwen3-TTS for cloning — and reports ~1s round-trip on an RTX5070. It’s a practical, take-it-home demo for running privacy-first voice agents, though it’s still a demo: requires specific tooling (Ollama, GPU headroom), has obvious TODOs (VAD, better warmup for cloning), and isn’t reinventing the architecture.

WizardryNiche Gem

armcat

203mo ago

Productivity●●Solid

Voice control coding agents on your machine via smartwatch / CarPlay

CarPlay coding sessions over SSH is a commute workflow nobody else is tackling.

Niche GemBig Brain

Zante

7011d ago

AI/ML●Mid

Think Fu – Metacognition as a service

Prompt engineering library dressed up as metacognition infrastructure.

Big BrainNiche Gem

georgestrakhov

302mo ago