OpenAI · text

GPT-5.4 Mini

Vision · Reasoning · Web search · Function calling
Quick reference
GPT-5.4 Mini — TLDR
  • 🆕 Compact GPT-5.4-class model for high-throughput, latency-sensitive workloads.
  • 📏 400,000-token context window with up to 128K output tokens.
  • 👁️ Accepts text and image inputs.
  • 🔧 Supports function calling, tool use, file search, and computer use.
  • 🌐 Built-in web search for grounded responses.
  • ⚡ Optimized for speed in coding assistants and parallel subagents.
  • 🎯 Recommended for classification, extraction, ranking, and coding subtasks.
💰 Best price on AntSeed
$0.0067 / $0.02799%
per 1M · cheapest in / out
📏 Context
400K tokens
🐜 Sellers
12
advertising on AntSeed
Provider

OpenAI is an American artificial intelligence research organization headquartered in San Francisco, structured as both a for-profit public benefit corporation and a nonprofit foundation. The lab developed the GPT family of large language models, the DALL-E image generation…

About this model

GPT-5.4 Mini, released in 2026 by OpenAI, is a smaller, faster sibling within the GPT-5.4 generation, bringing many of the strengths of GPT-5.4 to a model designed for high-volume, cost-sensitive deployments. It supports text and image inputs, tool use, function calling, web search, file search, computer use, and skills, alongside a 400,000-token context window. OpenAI positions it for workloads where latency directly shapes the product experience, such as responsive coding assistants and computer-using systems that interpret screenshots.

Within the catalog's mini lineage, it follows GPT-4o Mini. According to OpenAI, GPT-5.4 Mini and its companion, GPT-5.4 Nano, are the company's most capable small models yet, and OpenAI now recommends starting with GPT-5.4 Mini for most new low-latency, high-volume workloads in place of the earlier GPT-5 Mini.

In practice, OpenAI describes a delegation pattern in Codex where a larger model like GPT-5.4 handles planning, coordination, and final judgment, while GPT-5.4 Mini subagents tackle narrower subtasks in parallel: searching a codebase, reviewing a large file, or processing supporting documents.
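
The delegation pattern described above can be sketched in a few lines. This is a minimal illustration only: `plan`, `run_subagent`, and `delegate` are hypothetical names, and both model calls are replaced by local stubs, so nothing here reflects the actual Codex implementation or any real API.

```python
import asyncio


def plan(goal: str) -> list[str]:
    # Stub for the larger model's planning step (hypothetical):
    # split the goal into narrower, independent subtasks.
    return [
        f"{goal} :: search the codebase",
        f"{goal} :: review a large file",
        f"{goal} :: process supporting documents",
    ]


async def run_subagent(subtask: str) -> str:
    # Stub for a call to a small, fast model (hypothetical).
    # A real version would await a network request here.
    await asyncio.sleep(0)
    return f"result for: {subtask}"


async def delegate(goal: str) -> str:
    subtasks = plan(goal)
    # Fan out: run all subagents concurrently and collect results in order.
    results = await asyncio.gather(*(run_subagent(t) for t in subtasks))
    # Stub for the larger model's final judgment: here, just join the results.
    return "\n".join(results)


if __name__ == "__main__":
    print(asyncio.run(delegate("fix the login bug")))
```

The point of the structure is that the subtasks are independent, so total latency is closer to the slowest subagent than to the sum of all of them.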

It sits alongside other GPT-5.4 tier models, including GPT-5.4 Pro, and the later GPT-5.5 and GPT-5.5 Pro releases. OpenAI recommends it as a default starting point for new low-latency, high-volume agent and chat workloads.

Sources
GPT-5.4 mini Model | OpenAI API · developers.openai.com
Introducing GPT-5.4 mini and nano | OpenAI · openai.com

This About section is AI-generated from public sources via VeniceStats + Venice inference, with no human editing. It may contain inaccuracies.

Sellers serving GPT-5.4 Mini (12)
Seller | Reputation | Input $/M | Cached $/M | Output $/M | Categories | API

"Best price" and the seller table are live AntSeed catalog data (advertised $/1M tokens, not settled amounts). Reputation = on-chain trust (0-100). Model knowledge (TLDR, provider, About) via the VeniceStats enrichment layer. Advertised catalog, not the model used in any specific purchase.
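
Because prices are advertised per 1M tokens, the advertised cost of a request can be estimated by scaling its token counts. The sketch below assumes a hypothetical helper name, `estimate_cost`, and uses illustrative prices rather than values from the catalog; it ignores cached-token pricing and, like the table, reflects advertised rather than settled amounts.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimate advertised cost in dollars from per-1M-token prices."""
    per_million = 1_000_000
    input_cost = input_tokens / per_million * input_price_per_m
    output_cost = output_tokens / per_million * output_price_per_m
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative prices only (assumed, not taken from the catalog).
    cost = estimate_cost(2_000_000, 500_000,
                         input_price_per_m=0.01, output_price_per_m=0.03)
    print(f"${cost:.4f}")
```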