Back to browse
GitHub Repository

Browser-native AI workspace — multi-agent orchestration, HITL review, semantic vector RAG, drag-and-drop workflows. Single HTML file, zero backend.

9 starsHTML

Nemilia – multi-agent AI workspace in a single HTML file, no back end

by Nemilia·Mar 5, 2026·2 points·0 comments

AI Analysis

●●●BangerZero to OneWizardryShip It

Single HTML file multi-agent orchestration with drag-and-drop DAGs beats Replit AI, n8n, and typical SaaS tools.

Strengths
  • Genuine zero-backend architecture means full data sovereignty, real differentiation vs. cloud AI platforms
  • MCP tool execution + hybrid RAG (vectors + BM25) gives agents real external capability without framework overhead
  • Drag-and-drop workflow DAGs with HITL review checkpoints solve a real developer friction point in agentic systems
Weaknesses
  • WebGPU offline mode likely limited to small models; browser context constraints mean scaling to large reasoning tasks unclear
  • Early-stage stability/performance data sparse; comparing to Cursor/Continue makes the 'local-first' positioning slightly muddled
Category
Target Audience

AI engineers, researchers, and developers building autonomous agent systems wanting data sovereignty

Similar To

Replit AI · n8n · Zapier

Post Description

A full multi-agent AI workspace in a single HTML file. No backend. No install. No build step. No account. You own the entire runtime — open it in a browser and it works.

That's not a constraint. That's the point.

Nemilia is built around the idea of AI sovereignty — your agents, your workflows, your data, your keys, all running on your machine. Nothing phoned home, nothing locked to a platform, nothing that breaks when a SaaS company changes its pricing.

What you get: - Build custom agents with their own roles, personalities, system prompts, and model overrides - Design custom workflows with drag-and-drop pipeline builder — chain agents in any order, set HITL review checkpoints, run parallel DAG execution - MCP tool execution — build agents that call external tools in real time (filesystem, databases, APIs, code execution) over streamableHttp or SSE - Document RAG with hybrid Transformers.js vector embeddings + BM25 keyword search, fully client-side - WebGPU offline mode — run Llama, Qwen, Phi, Gemma entirely in the browser, no API key, no internet - Connect any provider: OpenAI, Anthropic, Groq, Gemini, Mistral, DeepSeek, Ollama, LM Studio — swap anytime, your workflows follow - API keys encrypted AES-256-GCM, never leave your browser - Workspace syncs to a real folder on disk as plain JSON and Markdown — version control it, edit it in VS Code, take it anywhere

One file. Any browser. Your AI team.

Getting started takes about 60 seconds: download the file, open it, paste any API key, point it at LM Studio/JAN, or load up a model directly into your browser via WebGPU.

Released it publicly last week. Would genuinely love feedback.

Site: https://nemilia.com GitHub: https://github.com/luislopez1212/nemilia

P.S. Nemilia means "to think, to remember, to imagine" in Nahuatl, the language of the Aztecs. Felt like the right name for something that does all three.

Similar Projects

AI/MLMid

Visual drag&drop workflow builder to our open-source AI agent platform

Visual AI workflow builder, but Flowise and LangFlow already dominate this crowded category.

Ship It
CrewForm
302mo ago