Back to browse
VoooAI – natural language to multi-modal AI pipelines

VoooAI – natural language to multi-modal AI pipelines

by hkljjkl·Feb 24, 2026·2 points·1 comment

AI Analysis

●●SolidBig BrainSolve My Problem

Pseudocode IR layer validates before execution, but orchestrates existing APIs like Midjourney and Runway.

Strengths
  • Intermediate representation (LLM → pseudocode → JSON) is genuine insight—deterministic interpretation beats direct JSON generation for validation and correction.
  • Real multi-modal orchestration across image (Midjourney), video (Runway/Pika), audio (Suno) with zero setup per API.
  • Clean visual editor with parameter panels reduces cognitive load vs. learning ComfyUI nodes or API docs.
Weaknesses
  • Still wraps existing closed-source APIs (Midjourney, Runway, Suno)—doesn't replace them, and API costs/rate limits still apply.
  • Crowded space: n8n, Make, Zapier already chain APIs; ComfyUI community has multi-modal workflows; no evidence VoooAI is faster or cheaper to use.
Category
Target Audience

Creative professionals and teams wanting to chain image/video/audio generation without learning individual APIs or node editors.

Similar To

n8n · Make.com · ComfyUI

Post Description

Hey HN,Built VoooAI over 4 months. It's a NL2Workflow engine – you describe what you want in plain English, it generates an executable multi-modal pipeline (image + video + audio).The problem: Midjourney for images, Runway/Pika for videos, Suno for music – each has its own API, interface, and learning curve ComfyUI gives you control but requires understanding node graphs n8n connects APIs but doesn't understand creative intent What we did differently:Instead of having LLMs output JSON directly (hard to validate), we introduced a pseudocode IR layer: LLM generates pseudocode → deterministic interpreter expands to executable workflow JSON. This gives us: Syntax validation before interpretation Automatic engine correction (if skill requires specific engine) Reliable edge connections Try it: voooai.com Compare us: voooai.com/workflow-comparison – we're not better at everything, but for "idea → multi-modal output" in one prompt, that's our niche.AMA about the architecture.

Similar Projects

Run end-to-end browser tests using natural language

Natural language E2E tests sound good until you need debugging or maintenance.

Ship It
jimmypk
102mo ago