OpenAI·text

GPT OSS 120B

E2eeReasoningWeb search

Quick reference

GPT OSS 120B — TLDR

🧠 OpenAI open-weight 117B Mixture-of-Experts, activating 5.1B parameters per token
🔒 Runs in a Trusted Execution Environment with hardware attestation evidence
🔧 Configurable reasoning effort (low, medium, high) and native tool use
📏 128K-token context window for long inputs
🆕 Permissive Apache 2.0 license for commercial deployment
👁️ Full chain-of-thought access for debugging and transparency
⚡ Designed to run efficiently on a single 80GB GPU
🏢 Released March 2026 as a privacy-focused confidential-compute deployment

💰 Best price on AntSeed

FREE / FREE

per 1M · cheapest in / out

📏 Context

128K tokens

🐜 Sellers

advertising on AntSeed

Provider

OpenAI

OpenAI is an American artificial intelligence research organization headquartered in San Francisco, structured as both a for-profit public benefit corporation and a nonprofit foundation. The lab developed the GPT family of large language models, the DALL-E image generation…

Site ↗X ↗Wikipedia ↗

Explore 16 more models by OpenAI →

About this model

GPT OSS 120B is the larger member of OpenAI's open-weight gpt-oss series, a Transformer using mixture-of-experts to keep only 5.1B of its 117B total parameters active per token. This catalog entry packages that model inside a Trusted Execution Environment (TEE), adding hardware attestation evidence so users can independently verify that inference runs in a confidential, tamper-resistant enclave. It carries forward the base model's permissive Apache 2.0 license, configurable reasoning depth, full chain-of-thought visibility, and native tool use including function calling, browsing, and structured output.

Within this confidential-compute family, the model is the higher-capacity counterpart to [[sibling:e2ee-gpt-oss-20b-p|GPT OSS 20B]]. The two share the same architecture and TEE wrapper, but the 120B variant activates 5.1B parameters per token versus the 20B model's 3.6B, and OpenAI positions the larger model for production, general-purpose, high-reasoning workloads while the smaller one targets lower-latency or memory-constrained deployment.

Compared to the standard non-enclave release, [[sibling:openai-gpt-oss-120b|OpenAI GPT OSS 120B]], the weights and capabilities are identical; the distinguishing feature here is end-to-end encryption and verifiable execution rather than any change to the model itself.

On capability, OpenAI reports that gpt-oss-120b reaches near-parity with its o4-mini model on core reasoning benchmarks while running efficiently on a single 80GB GPU. For broader chat, image, and embedding needs, the provider's same-period lineup also includes models such as [[sibling:openai-gpt-55|GPT-5.5]].

View source on GitHub ↗View model card on HuggingFace ↗

Sources

gpt-oss-120b Model | OpenAI API· developers.openai.com

Introducing gpt-oss | OpenAI· openai.com

openai/gpt-oss-120b · Hugging Face· huggingface.co

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving GPT OSS 120B (8)

Seller	Reputation↓	Input $/M	Cached $/M	Output $/M	Categories	API
Venice.ai Proxy 0x1f22…18c9	88	$0.065	$0.065	$0.325	chat,reasoning,web-search,e2ee	openai-chat-completions
surplusintelligence.ai 0x0e49…8927	79	$0.07	$0.07	$0.30	anon,chat,web-search,cheap	openai-chat-completions
Open Bird 0xc0f1…8183	57	$0.0195	$0.0195	$0.095	chat,open-source	openai-chat-completions
Fire Ant 🔥🐜 0xbe05…bc5d	45	$0.014	$0.014	$0.085	anon,chat,cheap,coding,e2ee,free,json,math,open-source,privacy,reasoning,research,tee,tools,web-search	—
▲ Apex Ant 0x73b4…e736	40	$0.00	$0.00	$0.00	chat,open-source,free	openai-chat-completions
AntFeed 0xddb6…1442	25	$0.0655	$0.0655	$0.2183	chat,cheap	openai-chat-completions
➤Bullet Ant 🐜 0xe924…8936	16	$0.10	$0.10	$0.30	chat,coding,reasoning,anon	openai-chat-completions
uomi.ai 0x87df…48e3	0	$0.053	$0.053	$0.473	chat,math,coding	openai-chat-completions

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.