Qwen3 30B A3B

E2eeWeb searchFunction calling
Quick reference
Qwen3 30B A3B — TLDR
  • 🧠 Mixture-of-experts model: 30.5B total, ~3.3B active per token.
  • 📏 Ultra-long 256K context window for long-document and agentic use.
  • 🔒 Runs in a Trusted Execution Environment with hardware attestation evidence.
  • 🔧 Supports function calling and web search natively.
  • 🌐 Multilingual coverage spanning roughly 119 languages.
  • 🆕 Qwen3 generation adds switchable thinking and non-thinking modes.
  • 📚 Apache-2.0 licensed; widely downloaded on Hugging Face.
  • 🏢 Built by Alibaba's Qwen team, served confidentially via Venice.
💰 Best price on AntSeed
$0.0008 / $0.004199%
per 1M · cheapest in / out
📏 Context
256K tokens
🐜 Sellers
4
advertising on AntSeed
Provider

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research, developing the Qwen…

Explore 24 more models by Alibaba Group
About this model

Qwen3 30B A3B is Alibaba's compact mixture-of-experts language model deployed here inside a Trusted Execution Environment (TEE), where hardware attestation lets users independently verify the runtime. Architecturally it activates only about 3.3B of its 30.5B total parameters per inference, a sparse MoE design that keeps compute low while retaining a broad knowledge base. According to Qwen's documentation, the Qwen3 line supports seamless switching between thinking and non-thinking modes and spans roughly 119 languages, covering reasoning, coding, math, and instruction-following.

Within this confidential-compute family, it succeeds [[sibling:e2ee-qwen-2-5-7b-p|Qwen 2.5 7B]], the small dense model previously offered in the same TEE configuration. The generational jump moves from a 7B dense architecture to a far larger MoE backbone with greater total capacity at comparable active cost, plus the newer Qwen3 features such as mode switching and stronger multilingual support. The catalog also lists a much larger sibling, the dense-MoE [[sibling:e2ee-qwen3-5-122b-a10b|Qwen3.5 122B A10B]], for users needing more capacity under the same privacy guarantees.

This Venice deployment extends the context window to 256K tokens and exposes function calling and web search, making it suited to long-document analysis and tool-using agents. The end-to-end-encrypted, attestable setup targets workloads where data confidentiality matters as much as model quality. It carries an Apache-2.0 license, and the underlying Qwen3-30B-A3B weights are openly available on Hugging Face for self-hosting.

View source on GitHub ↗View model card on HuggingFace ↗
Sources
qwenlm.github.ioQwen3: Think Deeper, Act Faster | Qwen· qwenlm.github.iohuggingface.coQwen/Qwen3-30B-A3B · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving Qwen3 30B A3B (4)
SellerReputationInput $/MCached $/MOutput $/MCategoriesAPI

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.