Alibaba·📐 Embeddings·↑ Newer: Qwen3 Embedding 0.6B

Qwen3 Embedding 8B

private

Try on Venice.ai ↗

Quick reference

Qwen3 Embedding 8B — TLDR

🧠 Largest of Alibaba's Qwen3 text-embedding series, 8B parameters
📏 Supports 32K-token context, embedding dimensions up to 4096
🌐 Multilingual coverage spanning over 100 languages
🎯 Built for retrieval, clustering, classification, and reranking
🔧 Customizable instructions and user-defined output dimensions
🔒 Apache 2.0 license permitting unrestricted commercial use
📚 On vendor-reported MTEB multilingual, scores 70.58 (June 2025)
🆕 Inherits long-text and reasoning skills from Qwen3 foundation models

💰 Pricing

$0.013

per 1M tokens

📅 On Venice since

Apr 17, 2026

93 days ago

Provider

Alibaba

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research,…

Read full profile →

51 models on Venice

20 video · 18 text · 5 image · 4 inpaint · 2 embedding · 2 tts

Since Jan 11, 2025

Wikipedia ↗Official site ↗

See 50 other models from Alibaba →

About this model

Qwen3 Embedding 8B is the flagship-size member of Alibaba's Qwen3 Embedding series, a family specifically designed for text embedding and ranking tasks. Built on the dense foundational models of the Qwen3 series, it inherits their multilingual capabilities, long-text understanding, and reasoning skills, and supports over 100 languages. The model handles a 32K-token context, produces vectors of up to 4096 dimensions with user-defined output sizes, and is released under the Apache 2.0 license for commercial use.

Within the same family, it sits alongside the much smaller Qwen3 Embedding 0.6B, plus a mid-tier 4B variant. The series spans 0.6B to 8B parameters so developers can trade efficiency against accuracy. The 8B model is the highest-capacity option, intended for workloads where retrieval quality matters most, while the 0.6B is suited to latency- or resource-constrained deployments.

On the provider's reported figures, the 8B embedding model scored 70.58 on MTEB multilingual as of June 5, 2025. The series targets text retrieval, code retrieval, classification, clustering, and bitext mining.

Technical details and training methodology are documented in the team's arXiv paper, "Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models" (2506.05176). The model is widely distributed, with millions of downloads on Hugging Face, and integrates through sentence-transformers and standard embedding-inference tooling.

🤗View model card on HuggingFace ↗View source on GitHub ↗

Sources

Qwen3 Embedding: Advancing Text ...arxiv.org ↗

Qwen/Qwen3-Embedding-8B · Hugging Facehuggingface.co ↗

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

arXiv2506.05176Jun 2025

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models(2025)

Yanzhao Zhang, Mingxin Li, Dingkun Long et al.

In this work, we introduce the Qwen3 Embedding series, a significant advancement over its predecessor, the GTE-Qwen series, in text embedding and reranking capabilities, built upon the Qwen3 foundation models. Leveraging the Qwen3 LLMs' robust capabilities in multilingual text…

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 13h ago