Mistral AI·text

Mistral Small 4

CodeVisionReasoningWeb searchFunction calling

Quick reference

Mistral Small 4 — TLDR

🆕 Unifies instruct, reasoning, and coding into one model
🧠 119B-parameter MoE, only ~6.5B active per token
🔧 128 experts, 4 active per token
📏 Native 256K-token context window
👁️ Accepts text and image input, text output
🎯 Configurable reasoning effort, toggle fast vs. thinking mode
🔒 Apache 2.0 license, open weights
💬 Native function calling and JSON output for agentic use

💰 Best price on AntSeed

$0.0017 / $0.0068−99%

per 1M · cheapest in / out

📏 Context

256K tokens

🐜 Sellers

advertising on AntSeed

Provider

Mistral AI

Mistral AI is a French artificial intelligence company headquartered in Paris, founded in 2023. The company focuses on developing large language models offered under both open-weight and proprietary licenses. Mistral AI has quickly risen to prominence in the AI landscape,…

Site ↗X ↗Wikipedia ↗

Explore 1 more model by Mistral AI →

About this model

Mistral Small 4, released March 16, 2026, is Mistral AI's open-weight hybrid model that consolidates three previously separate model families—Instruct, Reasoning (formerly Magistral), and Devstral coding—into a single checkpoint. It uses a Mixture-of-Experts architecture with 119 billion total parameters spread across 128 experts, activating only 4 experts (about 6.5 billion parameters) per token, giving it the inference profile of a much smaller dense model. It accepts text and image input, supports a 256K-token context window, and exposes a configurable reasoning-effort parameter that toggles between fast instant replies and a slower thinking mode.

This marks a substantial change from its same-family predecessor, [[sibling:mistral-small-3-2-24b-instruct|Mistral Small 3.2 24B Instruct]], a 24B dense instruction model from January 2026. Where Small 3.2 focused on text instruction following, Small 4 moves to a sparse MoE design, adds native vision, integrates dedicated reasoning and agentic-coding behavior, and expands deployment to enterprise-scale tasks.

Mistral distributes Small 4 under the Apache 2.0 license with multiple checkpoints, including an NVFP4 4-bit quantized version and a trained Eagle head for speculative decoding. Mistral positions it for chat assistants, coding, agentic workflows, and reasoning tasks, with native function calling and JSON output.

View source on GitHub ↗View model card on HuggingFace ↗

Sources

Mistral Small 4 - Mistral AI | Mistral Docs· docs.mistral.ai

mistralai / mistral-small-4-119b-2603· docs.api.nvidia.com

mistral-small-4-119b-2603 Model by Mistral AI· build.nvidia.com

mistralai/Mistral-Small-4-119B-2603 · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving Mistral Small 4 (8)

Seller	Reputation↓	Input $/M	Cached $/M	Output $/M	Categories	API
Venice.ai Proxy 0x1f22…18c9	88	$0.0938	$0.0938	$0.375	chat,reasoning,coding,vision,multimodal,web-search	openai-chat-completions
surplusintelligence.ai 0x0e49…8927	79	$0.1875	$0.1875	$0.75	anon,chat,code,coding,multimodal,reasoning,vision,web-search,research,translate	openai-chat-completions
Mistral Relay Node 0x057c…2a5f	65	$25.00	$0.10	$63.00	chat,coding,analysis	openai-chat-completions
Deepseek Compute Node 0x2d3f…22ec	47	$30.00	$0.10	$75.00	chat,coding,analysis	openai-chat-completions
Fire Ant 🔥🐜 0xbe05…bc5d	45	$0.1688	$0.1688	$0.675	anon,chat,code,coding,multimodal,open-source,reasoning,research,tasks,translate,vision,web-search	—
▲ Apex Ant 0x73b4…e736	40	$0.0017	$0.0003	$0.0068	chat,reasoning,vision,multimodal,tasks	openai-chat-completions
Leftermute 0x388b…5389	26	$0.0379	$0.0379	$0.1515	chat,coding,json,tools	openai-chat-completions
Meridian AI 0x8c8c…06f5	2	$0.0405	$0.0405	$0.162	chat,coding	openai-chat-completions

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.