Alibaba·💬 Text Generation

Qwen 3 Coder 480B Turbo

CodeFunction CallingWeb Searchfp8private

🧠 Try in Intelligence →

Try on Venice.ai ↗

Quick reference

Qwen 3 Coder 480B Turbo — TLDR

🧠 Mixture-of-Experts coder: 480B total, 35B active parameters
⚡ Turbo, FP8-quantized build tuned for faster code inference
📏 256K native context, extendable toward 1M via extrapolation
🔧 Agentic coding with a purpose-built function-call format
💬 Instruct, non-thinking model — no reasoning-trace blocks
🌐 Works with Qwen Code, CLINE, and similar agent tools
🏢 Built by Alibaba's Qwen team (Alibaba Cloud)
🎯 Capabilities here include function calling and web search

💰 Pricing

$0.350 / $1.50

per 1M · input / output

📏 Context

256K tokens

📅 On Venice since

Jan 27, 2026

173 days ago

Provider

Alibaba

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research,…

Read full profile →

51 models on Venice

20 video · 18 text · 5 image · 4 inpaint · 2 embedding · 2 tts

Since Jan 11, 2025

Wikipedia ↗Official site ↗

See 50 other models from Alibaba →

About this model

Qwen 3 Coder 480B Turbo is a code-optimized large language model from Alibaba's Qwen team, served as a Turbo, FP8-quantized variant of the Qwen3-Coder-480B-A35B-Instruct base. The underlying model is a Mixture-of-Experts design with 480 billion total parameters and 35 billion active per inference, which the Qwen team frames as delivering high performance at lower compute cost than dense models of comparable scale. It supports a 256K-token context natively, with extrapolation methods reaching up to roughly 1M tokens.

The "Turbo" designation reflects an inference-optimized deployment: FP8 weights and provider-side serving aimed at faster, cheaper code workloads, which is the focus of this catalog entry. Functionally, it is an instruct, non-thinking model — it does not emit separate reasoning-trace blocks — and ships with a specially designed function-call format for agentic coding across tools like Qwen Code and CLINE.

Within Venice's broader Qwen lineup, it sits alongside general-purpose siblings such as Qwen 3 235B A22B Instruct 2507 and the efficiency-focused Qwen 3 Next 80b, but this checkpoint is specialized purely for coding and agentic tool use. As deployed here it adds function-calling and web-search capabilities for developer workflows.

Sources

qwen3-coder-480b-a35b-instruct Model by Qwenbuild.nvidia.com ↗

Qwen3-Coder: Agentic Coding in the World | Qwenqwenlm.github.io ↗

Qwen/Qwen3-Coder-480B-A35B-Instruct · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 4d ago