Alibaba Group·text

Qwen3 30B A3B

E2eeWeb searchFunction calling

Quick reference

Qwen3 30B A3B — TLDR

🧠 Mixture-of-experts model: 30.5B total, ~3.3B active per token.
📏 Ultra-long 256K context window for long-document and agentic use.
🔒 Runs in a Trusted Execution Environment with hardware attestation evidence.
🔧 Supports function calling and web search natively.
🌐 Multilingual coverage spanning roughly 119 languages.
🆕 Qwen3 generation adds switchable thinking and non-thinking modes.
📚 Apache-2.0 licensed; widely downloaded on Hugging Face.
🏢 Built by Alibaba's Qwen team, served confidentially via Venice.

💰 Best price on AntSeed

$0.0008 / $0.0041−99%

per 1M · cheapest in / out

📏 Context

256K tokens

🐜 Sellers

advertising on AntSeed

Provider

Alibaba Group

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research, developing the Qwen…

Site ↗X ↗Wikipedia ↗

Explore 24 more models by Alibaba Group →

About this model

Qwen3 30B A3B is Alibaba's compact mixture-of-experts language model deployed here inside a Trusted Execution Environment (TEE), where hardware attestation lets users independently verify the runtime. Architecturally it activates only about 3.3B of its 30.5B total parameters per inference, a sparse MoE design that keeps compute low while retaining a broad knowledge base. According to Qwen's documentation, the Qwen3 line supports seamless switching between thinking and non-thinking modes and spans roughly 119 languages, covering reasoning, coding, math, and instruction-following.

Within this confidential-compute family, it succeeds [[sibling:e2ee-qwen-2-5-7b-p|Qwen 2.5 7B]], the small dense model previously offered in the same TEE configuration. The generational jump moves from a 7B dense architecture to a far larger MoE backbone with greater total capacity at comparable active cost, plus the newer Qwen3 features such as mode switching and stronger multilingual support. The catalog also lists a much larger sibling, the dense-MoE [[sibling:e2ee-qwen3-5-122b-a10b|Qwen3.5 122B A10B]], for users needing more capacity under the same privacy guarantees.

This Venice deployment extends the context window to 256K tokens and exposes function calling and web search, making it suited to long-document analysis and tool-using agents. The end-to-end-encrypted, attestable setup targets workloads where data confidentiality matters as much as model quality. It carries an Apache-2.0 license, and the underlying Qwen3-30B-A3B weights are openly available on Hugging Face for self-hosting.

View source on GitHub ↗View model card on HuggingFace ↗

Sources

Qwen3: Think Deeper, Act Faster | Qwen· qwenlm.github.io

Qwen/Qwen3-30B-A3B · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving Qwen3 30B A3B (4)

Seller	Reputation↓	Input $/M	Cached $/M	Output $/M	Categories	API
Venice.ai Proxy 0x1f22…18c9	88	$0.095	$0.095	$0.345	chat,web-search,e2ee	openai-chat-completions
Fire Ant 🔥🐜 0xbe05…bc5d	45	$0.085	$0.085	$0.335	anon,chat,cheap,coding,e2ee,json,long-context,open-source,privacy,reasoning,tee,translate,web-search	—
▲ Apex Ant 0x73b4…e736	40	$0.0008	$0.0002	$0.0041	chat,reasoning,open-source,long-context,cheap	openai-chat-completions
Leftermute 0x388b…5389	26	$0.0173	$0.0173	$0.0627	chat,coding,json	openai-chat-completions

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.