AlibabaAlibaba·📐 Embeddings

Qwen3 Embedding 8B

private
Try on Venice.ai ↗
Quick reference
Qwen3 Embedding 8B — TLDR
  • - 🧠 Largest variant of Alibaba's Qwen3 text-embedding series, eight billion parameters.
  • - 📏 Handles up to 32K-token context for long-document embedding.
  • - 🎯 Built for retrieval, clustering, classification, and reranking pipelines.
  • - 🌐 Multilingual coverage spanning over 100 languages.
  • - 🔧 Flexible output dimensions up to 4096.
  • - 🔒 Apache 2.0 license, permitting unrestricted commercial use.
  • - ⚡ Inherits Qwen3 long-text understanding for embeddings.
  • - 📚 Posts a reported 70.58 MTEB multilingual score.
💰 Pricing
$0.013
per 1M tokens
📅 On Venice since
Apr 17, 2026
48 days ago
Provider

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research,…

Read full profile →
46 models on Venice
17 text · 16 video · 5 image · 4 inpaint · 2 embedding · 2 tts
Since Jan 11, 2025

About this model

Qwen3 Embedding 8B is the high-capacity member of Alibaba's Qwen3 Embedding family, a series purpose-built for text embedding and ranking tasks. Built on the dense Qwen3 foundation models, it produces semantically rich vectors used for retrieval, clustering, classification, code search, and bitext mining, and it inherits the multilingual breadth and long-text understanding of the underlying Qwen3 base. It supports a 32K-token context window and configurable output dimensions up to 4096, and ships under an Apache 2.0 license.

Compared with its smaller same-family sibling Qwen3 Embedding 0.6B, the 8B model trades efficiency for raw representational capacity, scaling from 0.6B to 8B parameters while sharing the same architecture, instruction-formatting conventions, and multilingual training recipe. The two are designed to be combined or swapped depending on whether deployments prioritize throughput or embedding quality.

On the provider's reported figures, the 8B embedding model achieves a 70.58 score on the MTEB multilingual benchmark (as cited as of June 5, 2025). Across the lineup, the series spans 0.6B, 4B, and 8B sizes for both embedding and reranking, letting developers pick a point on the efficiency-versus-quality curve.

The model supports flexible vector dimensions, and tooling such as sentence-transformers, text-embeddings-inference, and llama.cpp can serve it, making it straightforward to slot into existing vector-database and RAG workflows.

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 4d ago