GitHub Repository

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

1,198 stars · Python

Lance – image/video generation and understanding in one model

Author: cleardusk

by cleardusk · May 20, 2026 · 64 points · 15 comments


AI Analysis

●●● Banger · Wizardry · Big Brain

Unified video and image model trained from scratch on fewer than 128 GPUs.

Strengths

• Achieves competitive results with only 3B active parameters vs typical 7B+ models.
• Single architecture handles understanding, generation, and editing simultaneously.
• Training efficiency demonstrates viable path for resource-constrained research teams.

Weaknesses

• Research prototype lacks the polish and API stability of commercial offerings.
• Video generation quality still lags behind dedicated large-scale video models.

Post Description

The model has 3B active parameters. We put the homepage, paper and model links here:

- Homepage: https://lance-project.github.io/

- Paper: https://arxiv.org/abs/2605.18678

- Model: https://huggingface.co/bytedance-research/Lance

p.s. Lance is a research project, not a polished product. The model was trained using fewer than 128 GPUs.
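For readers who want to fetch the weights linked above, here is a minimal sketch. It assumes the Hugging Face page is a standard model repository and that the `huggingface_hub` package is installed. The helper names `repo_id_from_url` and `download_weights` and the `lance-weights` directory are illustrative, not part of the Lance project, and the post does not describe an official loading procedure.

```python
from urllib.parse import urlparse


def repo_id_from_url(url: str) -> str:
    """Extract the 'owner/name' repo id from a Hugging Face model URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"not a model URL: {url}")
    return "/".join(parts[:2])


def download_weights(url: str, local_dir: str = "lance-weights") -> str:
    """Download a snapshot of the repo (hypothetical helper, needs huggingface_hub)."""
    # Imported lazily so the URL helper works without the dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id_from_url(url), local_dir=local_dir)


MODEL_URL = "https://huggingface.co/bytedance-research/Lance"
print(repo_id_from_url(MODEL_URL))  # bytedance-research/Lance
```

Calling `download_weights(MODEL_URL)` would pull the repository files into a local folder. How to run inference from them depends on the project's own code, which is not covered here.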

Similar Projects

AI/ML · ●●● Banger

TRiP – a complete transformer engine in C built from scratch just by me

From-scratch C transformer engine with training and vision, built by one person.

Wizardry · Dark Horse

carlovalenti

3861mo ago

AI/ML · ●●● Banger

MicroGPT-C – C99 GPT for Edge Training and Tiny Model Pipelines

Karpathy's microgpt in C99, proves tiny coordinated models beat single large models on logic.

Wizardry · Big Brain

Ajay__soni

103mo ago

AI/ML · ● Mid

Trained YOLOX from scratch to avoid Ultralytics (iOS aircraft detect)

This is a practical, no-nonsense play: someone trained YOLOX from scratch, released MIT-licensed weights, and packaged a path toward running it on iOS. The value is procedural — dataset curation, training recipe, and an export/convert-for-iOS pipeline — but it's not a conceptual breakthrough; I'd like to see clear mAP numbers, model size and on-device latency benchmarks before recommending it for production.

Niche Gem · Wizardry

auspiv

303mo ago

Education · ●●● Banger

How-to-Train-Your-GPT

Build a LLaMA-style model from scratch with zero ML prerequisites or math.

Cozy · Big Brain

RaiyanYahya

101mo ago

AI/ML · ●● Solid

I built a tiny LLM to demystify how language models work

Train a working LLM in 5 minutes on free Colab with a fish personality.

Cozy · Big Brain

armanified

9151342mo ago

AI/ML · ●● Solid

Efficient LLM Architectures for 32GB RAM (Ternary and Sparse Inference)

Native ternary training beats post-training quantization for memory efficiency.

Big Brain · Bold Bet

fatihturker

213mo ago