Back to browse
A reasoning hierarchical robotics pipeline you can run in the browser

A reasoning hierarchical robotics pipeline you can run in the browser

by avikde·Apr 7, 2026·4 points·0 comments

AI Analysis

●●●BangerWizardryBig BrainNiche Gem

MuJoCo physics meets Gemini reasoning entirely inside your browser tab.

Strengths
  • Decouples LLM reasoning from kinematic control, avoiding embodiment hallucinations.
  • Runs MuJoCo physics simulation via WebAssembly with no server backend.
  • Modular Sense-Plan-Act architecture allows swapping individual components easily.
Weaknesses
  • Requires external Gemini API key, limiting offline or private usage.
  • Browser-based simulation limits complexity compared to native C++ or Python stacks.
Category
Target Audience

Robotics researchers and embodied AI developers

Similar To

Google RT-2 · OpenVLA · NVIDIA Isaac Lab

Post Description

This demo combines the flexible task programming and reasoning of Gemini ER (what is the scene, and what should I do?) and classical camera calibration, kinematics, motion controllers. Each layer is independently swappable, and the AI model doesn't need to know anything about the robot's embodiment. This recreates the modularity of a Sense-Plan-Act architecture while retaining the semantic reasoning of a foundation AI model. A writeup explaining the tradeoffs is linked from the page https://www.avikde.me/building-a-reasoning-hierarchical.

Similar Projects

Developer Tools●●Solid

I built a desktop app combining Claude, GPT, Gemini with local Ollama

Hybrid pipeline splits reasoning (cloud) and execution (local), but multi-model orchestration is becoming crowded.

Solve My ProblemSlick
tsunamayo
113mo ago
Data●●Solid

Vector databases are the wrong primitive for AI agents

Knowledge graphs beat vector similarity for structured relationship queries.

Big BrainBold Bet
ajainvivek
112mo ago
AI/ML●●Solid

Clonar – A Node.js RAG pipeline with 8-stage multihop reasoning

Clonar codifies an explicit 8-stage agentic loop — query rewrite, clarification gates, filter extraction and critique loops — into a runnable Node backend you can call over HTTP. The README is refreshingly transparent about its experimental status and invites benchmarks and PRs, but there are no quantitative evaluations or stress tests to back the design claims. Good as a hands-on reference for multihop orchestration, not yet a proven replacement for established RAG stacks.

Big BrainNiche Gem
sowmith-tsrc
113mo ago