The current open-source video lineup
We benchmark these per project. There is no single winner - the right model depends on duration, motion type, render budget, and how much prompt-adherence you need.
LTX-Video 2.3
LightricksFastest production-quality OS video model. Best motion fidelity at <6s clips. Our default for branded b-roll.
Wan 2.1
AlibabaStrong physical realism and camera control. Great when motion accuracy matters more than render time.
HunyuanVideo
Tencent13B params, top-tier prompt adherence. Slower renders, best when prompt-to-shot accuracy is critical.
Mochi-1
GenmoOpen-weight 10B model with strong cinematic motion. Good middle-ground for narrative shots.
CogVideoX
TsinghuaLong-form (up to 10s) generation with consistent characters. Useful for storyboards and previz.
Render time, duration, fidelity
Reference numbers on a single H100. Real-world throughput depends on prompt, batching, and quantization.
| Model | Max duration | Resolution | Render speed (H100) | Best for |
|---|---|---|---|---|
| LTX-Video 2.3 | ~6s | 768×512 → 1216×704 | ~30s for 5s clip | Fast cinematic b-roll |
| Wan 2.1 | ~5s | Up to 720p | ~2 min for 5s clip | Physical realism, camera moves |
| HunyuanVideo | ~5s | 720p+ | ~4 min for 5s clip | Prompt-accurate single shots |
| Mochi-1 | ~5.4s | 480p → 720p | ~3 min for 5s clip | Narrative cinematic motion |
| CogVideoX | ~10s | 720p | ~6 min for 6s clip | Long shots, character consistency |
From brief to MP4
Generation is one box in a five-box system. We build the other four boxes too - that's the difference between a demo and a pipeline.
Script / brief → shotlist
An LLM step converts an editorial brief into a shotlist (subject, camera move, mood, duration). Each shot becomes a structured generation job.
Brand LoRA + style controls
We train a style LoRA on your existing footage (color, framing, depth of field). Applied at inference so every clip lands on-brand.
Generation + upscale
Base model generates at native resolution; second pass upscales to 1080p/4K (Real-ESRGAN, Topaz). Frame interpolation to 60fps where needed.
Audio + delivery
Optional MusicGen / Stable Audio scoring, branded TTS voiceover, then MP4 delivered straight into your NLE bin via API.
Where this lands
Marketing b-roll
Brand-styled cinematic clips at the pace of a campaign calendar.
Product visualisation
Concept videos for unreleased products without a film crew.
Social content engine
Daily reels / shorts at platform-native aspect ratios.
Storyboard / previz
Long-form CogVideoX shots for narrative previsualisation.