Not currently listed in Venice's public API catalog — last listed Jun 17, 2026. Delisted models may still respond to direct API calls.

xAI · 🎬 Video Generation · VS Pick

Grok Imagine R2V

Quick reference

Grok Imagine R2V — TLDR

- 🆕 xAI's reference-to-video model, guided by uploaded images rather than text alone.
- 🎯 Treats reference images as creative direction, not just a first frame.
- 👁️ Uses reference images for style, character, and composition control.
- 🔧 Part of xAI's Imagine API spanning video, image, and audio generation.
- 🎨 Catalog describes its output as stylized rather than strictly photorealistic.
- 🏢 Part of xAI's expanding Grok Imagine creative media family.
- 🌐 Available via xAI's Imagine API and third-party platforms.
- 📏 Released March 2026 by xAI.

💰 Pricing: $0.320 – $0.890 per generation

📅 On Venice since: Apr 13, 2026 (96 days ago)

Provider

xAI

xAI is an American artificial intelligence company and wholly owned subsidiary of SpaceX. The company develops AI systems under the Grok brand, spanning language models, image generation, and video synthesis. xAI has quickly established itself as a multimodal…

16 models on Venice

5 text · 5 video · 2 image · 2 inpaint · 1 tts · 1 asr

Since Jan 29, 2026

About this model

Grok Imagine R2V is xAI's reference-to-video model, released in March 2026 as part of the broader Grok Imagine media family that also spans text-to-video, image-to-video, image generation, and inpainting. Where Grok Imagine text-to-video turns prompts into clips and Grok Imagine image-to-video uses a still image as the opening frame, R2V treats supplied images as creative direction, drawing on their visual style, subjects, and composition to synthesize a new clip.

The defining difference is the reference-image input: users pass reference imagery alongside a text prompt, enabling character consistency, style transfer, and creative remixing. This positions R2V as a distinct generation mode within the Imagine API, which xAI documents as a single endpoint for producing video, images, and audio.
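As a rough illustration of the "reference imagery alongside a text prompt" input described above, the sketch below assembles a request body in Python. Everything specific here is an assumption for illustration only: the function name `build_r2v_payload`, the model id `grok-imagine-r2v`, and the field names `prompt` and `reference_images` are hypothetical and are not taken from xAI's or Venice's documentation. The sketch makes no network call; check the sources listed below for the real request schema.

```python
import base64
import json


def build_r2v_payload(prompt, reference_images, model="grok-imagine-r2v"):
    """Build a JSON-serializable request body for a reference-to-video call.

    NOTE: the model id and field names are hypothetical placeholders,
    not the documented xAI or Venice schema.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not reference_images:
        raise ValueError("at least one reference image is required")
    return {
        "model": model,  # hypothetical model identifier
        "prompt": prompt,  # text direction for the clip
        # Raw image bytes are base64-encoded so they can travel inside JSON.
        "reference_images": [
            base64.b64encode(img).decode("ascii") for img in reference_images
        ],
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file read from disk.
    fake_image = b"placeholder image bytes"
    payload = build_r2v_payload(
        "A paper-cut fox running through a forest", [fake_image]
    )
    print(json.dumps(payload, indent=2))
```

The point of the sketch is only the shape of the input: one text prompt plus one or more images that act as style, character, or composition references rather than as a fixed first frame.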

In the Venice catalog, the Grok Imagine line is characterized as producing stylized, expressive scenes rather than strict photorealism. The family later added higher-quality image and editing variants and the Grok Imagine 1.5 Private video model, reflecting xAI's continued expansion of its creative toolkit.

R2V is reachable through xAI's Imagine API as well as third-party hosting platforms, where it sits alongside Grok's text, image, speech, and reasoning offerings.

Sources

Imagine Overview | xAI Docs (docs.x.ai)

Imagine API: Generate Videos, Images, and Audio | xAI (x.ai)

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 32d ago