intfloat
intfloat is the open-source research identity behind the widely used E5 family of text embedding models. The work centers on general-purpose, instruction-tuned embeddings designed for retrieval, semantic search, clustering, and classification — with a strong emphasis on multilingual coverage and strong performance on public benchmarks like MTEB.
On Venice, intfloat is represented by a single embedding model: Multilingual E5 Large Instruct. Released in 2026, it is a large-scale instruction-tuned embedding model supporting dozens of languages, making it suitable for cross-lingual retrieval, RAG pipelines, and similarity tasks where a unified vector space across languages matters.
What sets intfloat apart is its focus on openly released, research-driven embedding models rather than chat or generative systems. The E5 line has become a common default choice among developers building retrieval and search infrastructure, offering a practical, well-documented alternative to closed embedding APIs.
📐EMBEDDINGS(1)
Data from Venice API (live pricing + capabilities) and enrichment worker (HuggingFace + Wikipedia + arXiv, refreshed every 12h).