Meituan · 🎬 Video Generation

Longcat Distilled

private

Quick reference

Longcat Distilled — TLDR

🏢 Meituan's LongCat team built this open video model
🆕 Distilled image-to-video variant for faster, fewer-step inference
🎬 Animates a still image into coherent motion
📏 Designed for long-duration video generation
🧠 Diffusion Transformer base architecture
⚡ Distilled sampling cuts the step count for faster inference
🎯 Emphasizes subject coherence and temporal stability
🔒 Weights released under the permissive MIT license

💰 Pricing

$0.090 – $0.530

per generation

📅 On Venice since

Dec 4, 2025

227 days ago

Provider

Meituan

Meituan is a Chinese technology company founded in 2010 by Wang Xing and headquartered in Beijing. Best known for its massive local services platform — spanning on-demand food delivery, consumer reviews, hotel bookings, and instant retail — Meituan listed on…

4 models on Venice

4 video

Added Dec 4, 2025

About this model

Longcat Distilled (image-to-video) is part of Meituan's LongCat-Video family, an open-source video generation effort from the company better known for food delivery and local services. LongCat-Video uses a Diffusion Transformer (DiT) architecture and unifies text-to-video, image-to-video, and video-continuation tasks in a single model, distinguishing tasks by the number of conditional frames supplied. The family is positioned for long-duration video generation.
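The idea of selecting a task by the number of conditioning frames can be illustrated with a small sketch. This is not Meituan's code: the `Task` enum, the `infer_task` function, and the exact thresholds (0 frames for text-to-video, 1 for image-to-video, more than 1 for video continuation) are assumptions chosen for illustration, and the real pipeline may draw the boundaries differently.

```python
from enum import Enum


class Task(Enum):
    """Hypothetical task labels; names are illustrative, not from the LongCat codebase."""
    TEXT_TO_VIDEO = "text-to-video"
    IMAGE_TO_VIDEO = "image-to-video"
    VIDEO_CONTINUATION = "video-continuation"


def infer_task(num_cond_frames: int) -> Task:
    """Map a count of conditioning frames to a task.

    Assumed thresholds: 0 -> text-to-video, 1 -> image-to-video,
    2 or more -> video continuation.
    """
    if num_cond_frames < 0:
        raise ValueError("num_cond_frames must be non-negative")
    if num_cond_frames == 0:
        return Task.TEXT_TO_VIDEO
    if num_cond_frames == 1:
        return Task.IMAGE_TO_VIDEO
    return Task.VIDEO_CONTINUATION


if __name__ == "__main__":
    # A single still image corresponds to the image-to-video case this entry covers.
    for n in (0, 1, 16):
        print(n, infer_task(n).value)
```

Under these assumptions, the image-to-video entry described here corresponds to the single-frame case.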

This entry is the distilled image-to-video configuration: it takes a still reference image and animates it into video. Per Meituan's model card, the distilled setup uses fewer sampling steps for faster inference. Its same-family image-to-video counterpart, Longcat Full Quality, runs the full-step pipeline for maximum fidelity, while the text-to-video siblings of the same names (Longcat Distilled and Longcat Full Quality) cover the prompt-to-video path.

According to Meituan's model card, the approach emphasizes cross-frame consistency, subject coherence, and temporal stability across long sequences.

The weights ship under the MIT License, allowing broad commercial and research use, though the license grants no rights to Meituan trademarks or patents.

Sources

meituan-longcat/LongCat-Video · Hugging Face (huggingface.co)

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv:2510.22200 · Oct 2025

LongCat-Video Technical Report (2025)

Meituan LongCat Team, Xunliang Cai, Qilong Huang et al.

Video generation is a critical pathway toward world models, with efficient long video inference as a key capability. Toward this end, we introduce LongCat-Video, a foundational video generation model with 13.6B parameters, delivering strong performance across multiple video…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 4d ago