Back to browse
GitHub Repository

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

1,198 starsPython

Lance – image/video generation and understanding in one model

by cleardusk·May 20, 2026·64 points·15 comments

AI Analysis

●●●BangerWizardryBig Brain

Unified video and image model trained from scratch on just 128 GPUs.

Strengths
  • Achieves competitive results with only 3B active parameters vs typical 7B+ models.
  • Single architecture handles understanding, generation, and editing simultaneously.
  • Training efficiency demonstrates viable path for resource-constrained research teams.
Weaknesses
  • Research prototype lacks the polish and API stability of commercial offerings.
  • Video generation quality still lags behind dedicated large-scale video models.
Category
Target Audience

ML researchers and engineers experimenting with efficient multimodal models

Similar To

Show-o · Emu3 · Chameleon

Post Description

The model has 3B active parameters. We put the homepage, paper and model links here:

- Homepage: https://lance-project.github.io/

- Paper: https://arxiv.org/abs/2605.18678

- Model: https://huggingface.co/bytedance-research/Lance

p.s. Lance is a research project, not a polished product. The model was trained using fewer than 128 GPUs.

Similar Projects

AI/ML●●●Banger

MicroGPT-C – C99 GPT for Edge Training and Tiny Model Pipelines

Karpathy's microgpt in C99, proves tiny coordinated models beat single large models on logic.

WizardryBig Brain
Ajay__soni
103mo ago
AI/MLMid

Trained YOLOX from scratch to avoid Ultralytics (iOS aircraft detect)

This is a practical, no-nonsense play: someone trained YOLOX from scratch, released MIT-licensed weights, and packaged a path toward running it on iOS. The value is procedural — dataset curation, training recipe, and an export/convert-for-iOS pipeline — but it's not a conceptual breakthrough; I'd like to see clear mAP numbers, model size and on-device latency benchmarks before recommending it for production.

Niche GemWizardry
auspiv
303mo ago
Education●●●Banger

How-to-Train-Your-GPT

Build a LLaMA-style model from scratch with zero ML prerequisites or math.

CozyBig Brain
RaiyanYahya
101mo ago