Google·text

Google Gemma 4 31B Instruct

VisionReasoningWeb searchFunction calling

Quick reference

Google Gemma 4 31B Instruct — TLDR

🧠 Dense 30.7B open model from Google DeepMind for reasoning
🆕 Configurable thinking modes toggled via a reasoning token
📏 256K-token context window for long documents and code
👁️ Handles text and image input; video processed as frames
🔧 Native function calling for agentic, tool-using workflows
🏢 Quantized checkpoints target consumer GPUs and workstations
🔒 Apache 2.0 license; open pre-trained and instruction-tuned weights
📚 Hybrid local/global attention with Proportional RoPE for long context

💰 Best price on AntSeed

$0.0090 / $0.024−93%

per 1M · cheapest in / out

📏 Context

256K tokens

🐜 Sellers

advertising on AntSeed

Provider

Google

Google is an American multinational technology corporation and one of the world's most valuable brands. A subsidiary of parent company Alphabet Inc., Google operates across search, cloud computing, consumer electronics, and artificial intelligence. Its DeepMind and Google…

Site ↗X ↗Wikipedia ↗

Explore 13 more models by Google →

About this model

Gemma 4 31B Instruct is the dense flagship of Google DeepMind's Gemma 4 family, a 30.7B-parameter multimodal model that accepts text and image input (and can process video as sequences of frames) while generating text output. It offers a 256K-token context window, native function calling, and configurable thinking modes, aimed at running reasoning, coding, and multimodal tasks under an Apache 2.0 license.

Architecturally it is a dense transformer paired with a vision encoder, using a hybrid attention scheme that interleaves local sliding-window layers with full global attention and Proportional RoPE (p-RoPE) for efficient long-context handling; quantization-aware and w4a16 checkpoints are published for smaller-footprint deployment.

Relative to the sibling [[sibling:google-gemma-4-26b-a4b-it|Gemma 4 26B A4B Instruct]], a Mixture-of-Experts variant with fewer active parameters, this 31B is dense—trading that inference efficiency for the family's highest-quality tier. Against the previous generation [[sibling:google-gemma-3-27b-it|Gemma 3 27B]], Google DeepMind highlights Gemma 4's built-in reasoning with configurable thinking, native system-prompt and function-calling support, and coding improvements.

Google DeepMind publishes instruction-tuned results in the official Gemma 4 31B model card, spanning reasoning, coding, vision, long-context, and safety tasks, and states the models undergo the same safety evaluations as its proprietary Gemini models.

View source on GitHub ↗View model card on HuggingFace ↗

Sources

google / gemma-4-31b-it· docs.api.nvidia.com

gemma-4-31b-it Model by Google· build.nvidia.com

google/gemma-4-31B · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving Google Gemma 4 31B Instruct (9)

Seller	Reputation↓	Input $/M	Cached $/M	Output $/M	Categories	API
Venice.ai Proxy 0x1f22…18c9	99	$0.0875	$0.0875	$0.25	chat,reasoning,vision,video,multimodal,web-search	openai-chat-completions
surplusintelligence.ai 0x0e49…8927	79	$0.036	$0.027	$0.108	agents,anon,chat,cheap,function-calling,multimodal,reasoning,research,tasks,tools,translate,video,vision,web-search	openai-chat-completions
▲ Apex Ant 0x73b4…e736	71	$0.168	$0.168	$0.48	chat,open-source,vision,multimodal,reasoning,long-context,agents	openai-chat-completions
Fire Ant 🔥🐜 0xbe05…bc5d	54	$0.089	$0.08	$0.286	agents,anon,base-usdc,chat,cheap,code,coding,function-calling,gemma,github,json,low-cost,math,monitored,multimodal,openai-compatible,reasoning,research,response-auth,surplus,tasks,tools,translate,value,verified,video,vision,web-search	—
Chutes 0xded6…657c	49	$0.132	$0.066	$0.407	chat,reasoning,vision,tee	openai-chat-completions
NovaRoute AI 0xc50d…ed7b	33	$0.009	$0.009	$0.024	chat,coding,code,reasoning,tasks,gemma,value,surplus,openai-compatible,low-cost,verified,github,response-auth,base-usdc,monitored	openai-chat-completions
uomi.ai 0x87df…48e3	30	$0.096	$0.096	$0.296	chat,math,coding	openai-chat-completions
antseed-neon-puma-944e 0x6650…944e	26	$0.06	$0.09	$0.18	chat,math	openai-chat-completions
Leftermute 0x388b…5389	18	$0.0348	$0.0348	$0.0929	chat,coding,json,tools	openai-chat-completions

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.