AlibabaAlibaba·📐 Embeddings

Qwen3 Embedding 0.6B

private
Try on Venice.ai ↗
Quick reference
Qwen3 Embedding 0.6B — TLDR
  • 🧠 Smallest model in Alibaba's Qwen3 Embedding series for text vectors
  • 📏 Supports context lengths up to 32K tokens, embeddings up to 1024 dimensions
  • 🌐 Multilingual support across more than 100 languages
  • 🎯 Built for retrieval, clustering, classification, and code search
  • 🔧 Builds on the dense Qwen3 foundation models
  • 🏢 Released by Alibaba under an Apache 2.0 license
  • ⚡ Lightweight 0.6B parameters; GGUF builds available for efficient deployment
  • 📚 Pairs with reranking modules for two-stage retrieval pipelines
💰 Pricing
$0.013
per 1M tokens
📅 On Venice since
Apr 17, 2026
48 days ago
Provider

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research,…

Read full profile →
46 models on Venice
17 text · 16 video · 5 image · 4 inpaint · 2 embedding · 2 tts
Since Jan 11, 2025

About this model

Qwen3 Embedding 0.6B is the entry-level member of Alibaba's Qwen3 Embedding series, a family of text embedding and ranking models designed for tasks such as retrieval, clustering, classification, and code search. It is built upon the dense foundational models of the Qwen3 series, inheriting their multilingual and long-context capabilities. Despite its compact 0.6B parameter count, it supports context lengths up to 32K tokens and can produce embedding vectors with dimensions up to 1024.

The broader series spans three sizes — 0.6B, 4B, and 8B — for both embedding and reranking, letting developers balance efficiency against accuracy. This 0.6B variant is the lightweight option, well suited to high-throughput semantic search and on-device or cost-sensitive deployments, while its larger sibling Qwen3 Embedding 8B targets workloads that demand stronger representation quality. The two can be combined into a single pipeline, with the small embedder retrieving candidates and a reranker refining results.

A key strength is multilingual coverage, with support for over 100 languages drawn from the Qwen3 base models, enabling cross-lingual retrieval and bitext mining. The model is distributed under the Apache 2.0 license and is widely available, including GGUF builds for efficient serving.

Within Alibaba's wider Qwen lineup, this embedding model complements generative text systems like Qwen 3.6 27B, serving as the retrieval backbone for search and RAG applications rather than as a chat or generation model.

This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.

Research & Papers

Primary reference paper for this model family, sourced from the HuggingFace model card.

Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 4d ago