Alibaba Group·text

Qwen 3 235B A22B Instruct 2507

Web searchFunction calling

Quick reference

Qwen 3 235B A22B Instruct 2507 — TLDR

🧠 Mixture-of-experts: 235B total parameters, 22B active per token
🆕 Updated "non-thinking" refresh of the original Qwen3-235B-A22B
📏 Natively supports 256K context, extendable toward 1M tokens
🎯 Gains in instruction following, math, science, coding, tool use
🌐 Multilingual coverage across many languages and dialects
🔧 Function calling and web search; Qwen-Agent tooling support
🔒 Apache 2.0 licensed open weights
💬 Instruct-only mode; does not emit reasoning traces

💰 Best price on AntSeed

FREE / FREE

per 1M · cheapest in / out

📏 Context

128K tokens

🐜 Sellers

advertising on AntSeed

Provider

Alibaba Group

Alibaba Group is a Chinese multinational technology company founded in 1999 and headquartered in Hangzhou, Zhejiang. Originally built around e-commerce and cloud computing, Alibaba has become one of the most prolific contributors to open-weight AI research, developing the Qwen…

Site ↗X ↗Wikipedia ↗

Explore 23 more models by Alibaba Group →

About this model

Qwen 3 235B A22B Instruct 2507 is a Mixture-of-Experts large language model from Alibaba's Qwen team, with 235 billion total parameters but only about 22 billion activated per forward pass. It is the "2507" refresh of the original Qwen3-235B-A22B non-thinking mode, released as part of the Qwen3 series. Distributed under the Apache 2.0 license, it targets long-document research, technical work, and high-precision tasks, and is served here in FP8 quantization.

Compared with its same-family predecessor, the original Qwen3-235B-A22B, Qwen reports significant improvements in general capabilities — instruction following, logical reasoning, text comprehension, mathematics, science, coding and tool usage — plus substantial gains in multilingual long-tail knowledge and better alignment on subjective, open-ended tasks. The update also adds enhanced 256K-token long-context understanding, with model-card instructions for extending toward one million tokens. Unlike the original dual-mode design, this Instruct variant operates in non-thinking mode only and does not generate reasoning-trace blocks.

For workloads needing explicit step-by-step reasoning, Qwen released a parallel [[sibling:qwen3-235b-a22b-thinking-2507|Qwen 3 235B A22B Thinking 2507]] sibling that uses extended reasoning chains. Other related Qwen text models in the catalog include the efficiency-focused [[sibling:qwen3-next-80b|Qwen 3 Next 80B]]. Note that Venice exposes a 128K context window for this deployment, below the model's full native 256K capacity.

View source on GitHub ↗View model card on HuggingFace ↗

Sources

Qwen/Qwen3-235B-A22B-Instruct-2507 · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving Qwen 3 235B A22B Instruct 2507 (8)

Seller	Reputation↓	Input $/M	Cached $/M	Output $/M	Categories	API
Venice.ai Proxy 0x1f22…18c9	99	$0.075	$0.075	$0.375	chat,web-search	openai-chat-completions
Open Forge 0x1d90…b0aa	87	$0.00	$0.00	$0.00	chat,fast,free	openai-chat-completions
surplusintelligence.ai 0x0e49…8927	79	$0.045	$0.045	$0.225	agents,anon,chat,cheap,code,coding,fast,frontier,function-calling,research,tasks,tools,translate,web-search	openai-chat-completions
▲ Apex Ant 0x73b4…e736	71	$0.108	$0.108	$0.66	chat,open-source,privacy,long-context,agents	openai-chat-completions
Fire Ant 🔥🐜 0xbe05…bc5d	54	$0.1169	$0.1169	$0.6246	agents,anon,chat,cheap,code,coding,fast,frontier,function-calling,long-context,open-source,privacy,research,tasks,tools,translate,web-search	—
D5V1N2 0xd5e7…7be0	41	$0.062	$0.062	$0.31	chat,reasoning,research,cheap,router,fallback,qwen	openai-chat-completions
antseed-neon-puma-944e 0x6650…944e	26	$0.075	$0.075	$0.375	chat	openai-chat-completions
➤Bullet Ant 🐜 0xe924…8936	13	$2.00	$2.00	$6.00	chat,coding,reasoning,anon	openai-chat-completions

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.