Qwen3 Embedding 8B
- - 🧠 Largest variant of Alibaba's Qwen3 text-embedding series, eight billion parameters.
- - 📏 Handles up to 32K-token context for long-document embedding.
- - 🎯 Built for retrieval, clustering, classification, and reranking pipelines.
- - 🌐 Multilingual coverage spanning over 100 languages.
- - 🔧 Flexible output dimensions up to 4096.
- - 🔒 Apache 2.0 license, permitting unrestricted commercial use.
- - ⚡ Inherits Qwen3 long-text understanding for embeddings.
- - 📚 Posts a reported 70.58 MTEB multilingual score.
Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research, developing the Qwen…
Explore 24 more models by Alibaba Group →Qwen3 Embedding 8B is the high-capacity member of Alibaba's Qwen3 Embedding family, a series purpose-built for text embedding and ranking tasks. Built on the dense Qwen3 foundation models, it produces semantically rich vectors used for retrieval, clustering, classification, code search, and bitext mining, and it inherits the multilingual breadth and long-text understanding of the underlying Qwen3 base. It supports a 32K-token context window and configurable output dimensions up to 4096, and ships under an Apache 2.0 license.
Compared with its smaller same-family sibling [[sibling:text-embedding-qwen3-0-6b|Qwen3 Embedding 0.6B]], the 8B model trades efficiency for raw representational capacity, scaling from 0.6B to 8B parameters while sharing the same architecture, instruction-formatting conventions, and multilingual training recipe. The two are designed to be combined or swapped depending on whether deployments prioritize throughput or embedding quality.
On the provider's reported figures, the 8B embedding model achieves a 70.58 score on the MTEB multilingual benchmark (as cited as of June 5, 2025). Across the lineup, the series spans 0.6B, 4B, and 8B sizes for both embedding and reranking, letting developers pick a point on the efficiency-versus-quality curve.
The model supports flexible vector dimensions, and tooling such as sentence-transformers, text-embeddings-inference, and llama.cpp can serve it, making it straightforward to slot into existing vector-database and RAG workflows.
This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.
| Seller | Reputation↓ | Input $/M | Cached $/M | Output $/M | Categories | API |
|---|---|---|---|---|---|---|
| ▲ Apex Ant 0x73b4…e736 | 40 | $0.0017 | $0.0003 | $0.00 | embeddings,open-source | openai-chat-completions |
"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.