Alibaba·🖌️ Inpainting

Qwen Image 2

anonymized

Try on Venice.ai ↗

Quick reference

Qwen Image 2 — TLDR

🖼️ High-fidelity inpainting and image editing model
✍️ Strong text rendering inside edited regions
🏢 Built on Alibaba's Qwen Image lineage
🎯 Editing variant of the Qwen Image 2 generator
🎨 Targeted region edits with prompt control

💰 Pricing

$0.050

per edit

📅 On Venice since

Mar 4, 2026

137 days ago

Provider

Alibaba

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research,…

Read full profile →

51 models on Venice

20 video · 18 text · 5 image · 4 inpaint · 2 embedding · 2 tts

Since Jan 11, 2025

Wikipedia ↗Official site ↗

See 50 other models from Alibaba →

About this model

Qwen Image 2 (editing) is the inpainting counterpart to Alibaba's Qwen Image 2 text-to-image model, released March 2026. Where the base model generates images from prompts, this variant specializes in altering existing ones — masking a region and regenerating it from a text instruction while preserving the surrounding composition. Its standout trait is high-fidelity text rendering, meaning words and typography placed into an edit hold up cleanly rather than dissolving into garbled glyphs, a common failure point for image editors.

Within Alibaba's image lineup it sits alongside a higher-tier Qwen Image 2 Pro editing variant for users who need more headroom, and it succeeds the earlier Qwen Edit 2511 inpainting release. For looser content boundaries, the later Qwen Edit Uncensored covers a different niche. The Qwen family on the platform spans text, vision, embeddings, and TTS, with this model anchoring the image-editing slot.

It is best suited for precise, localized edits — swapping objects, retouching scenes, adding or correcting on-image text, and compositing changes — where maintaining the original image's integrity and rendering legible text matter most.

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 4d ago