- 🧠 Mixture-of-experts model: 30.5B total, ~3.3B active per token.
- 📏 Ultra-long 256K context window for long-document and agentic use.
- 🔒 Runs in a Trusted Execution Environment with hardware attestation evidence.
- 🔧 Supports function calling and web search natively.
- 🌐 Multilingual coverage spanning roughly 119 languages.
- 🆕 Qwen3 generation adds switchable thinking and non-thinking modes.
- 📚 Apache-2.0 licensed; widely downloaded on Hugging Face.
- 🏢 Built by Alibaba's Qwen team, served confidentially via Venice.
Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research, developing the Qwen…
Explore 24 more models by Alibaba Group →Qwen3 30B A3B is Alibaba's compact mixture-of-experts language model deployed here inside a Trusted Execution Environment (TEE), where hardware attestation lets users independently verify the runtime. Architecturally it activates only about 3.3B of its 30.5B total parameters per inference, a sparse MoE design that keeps compute low while retaining a broad knowledge base. According to Qwen's documentation, the Qwen3 line supports seamless switching between thinking and non-thinking modes and spans roughly 119 languages, covering reasoning, coding, math, and instruction-following.
Within this confidential-compute family, it succeeds [[sibling:e2ee-qwen-2-5-7b-p|Qwen 2.5 7B]], the small dense model previously offered in the same TEE configuration. The generational jump moves from a 7B dense architecture to a far larger MoE backbone with greater total capacity at comparable active cost, plus the newer Qwen3 features such as mode switching and stronger multilingual support. The catalog also lists a much larger sibling, the dense-MoE [[sibling:e2ee-qwen3-5-122b-a10b|Qwen3.5 122B A10B]], for users needing more capacity under the same privacy guarantees.
This Venice deployment extends the context window to 256K tokens and exposes function calling and web search, making it suited to long-document analysis and tool-using agents. The end-to-end-encrypted, attestable setup targets workloads where data confidentiality matters as much as model quality. It carries an Apache-2.0 license, and the underlying Qwen3-30B-A3B weights are openly available on Hugging Face for self-hosting.
This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.
| Seller | Reputation↓ | Input $/M | Cached $/M | Output $/M | Categories | API |
|---|---|---|---|---|---|---|
| Venice.ai Proxy 0x1f22…18c9 | 88 | $0.095 | $0.095 | $0.345 | chat,web-search,e2ee | openai-chat-completions |
| Fire Ant 🔥🐜 0xbe05…bc5d | 45 | $0.085 | $0.085 | $0.335 | anon,chat,cheap,coding,e2ee,json,long-context,open-source,privacy,reasoning,tee,translate,web-search | — |
| ▲ Apex Ant 0x73b4…e736 | 40 | $0.0008 | $0.0002 | $0.0041 | chat,reasoning,open-source,long-context,cheap | openai-chat-completions |
| Leftermute 0x388b…5389 | 26 | $0.0173 | $0.0173 | $0.0627 | chat,coding,json | openai-chat-completions |
"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.