o4 Mini Deep Research

o4-mini-deep-research

o4-mini-deep-research is a faster, lower-cost version of OpenAI's deep-research model, designed for complex, multi-step investigations. It automatically relies on web_search for information gathering, which always adds extra usage cost.

Context: 200K tokens

Pricing

Input: $2.00 / 1M
Output: $8.00 / 1M
Cache Write: $0 / 1M
Cache Read: $0 / 1M
Web Search: $0 / 1M

Prompt cache writes and reads are included at no additional cost.
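As a rough illustration of how the listed rates translate into a per-request bill, the sketch below multiplies token counts by the input and output prices shown above. The function name estimate_cost is hypothetical, and the calculation covers token charges only: any extra web_search usage cost mentioned in the model description is not modeled here.

```python
# Per-million-token rates copied from the pricing list above (USD).
INPUT_USD_PER_M = 2.00
OUTPUT_USD_PER_M = 8.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated token cost in USD for one request.

    Hypothetical helper: covers input/output tokens only and ignores
    any additional web_search usage charges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 12,000-token prompt that produces a 3,000-token report.
    print(f"${estimate_cost(12_000, 3_000):.4f}")
```

Because deep-research runs can consume many tokens across multiple steps, an estimate like this is best treated as a lower bound.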

Quick Start

A working example for this model, using the OpenAI Python client against the chat completions endpoint:

python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.apertis.ai/v1"
)

response = client.chat.completions.create(
    model="o4-mini-deep-research",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

# Optional: Enable context compression to reduce token usage
# response = client.chat.completions.create(
#     model="o4-mini-deep-research",
#     messages=[{"role": "user", "content": "Hello!"}],
#     extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}
# )

Supported Parameters

Common (7 params): model, messages, max_tokens, temperature, top_p, stream, tools

Extended (4 params): reasoning_effort, stream_options, thinking, extra_body
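One way to guard against sending a parameter this model does not list is to check request arguments against the names above before calling the API. The following is a minimal sketch: build_request is a hypothetical helper, not part of any SDK, and treating model and messages as required is an assumption based on how chat completion requests are normally shaped.

```python
# Parameter names taken from the Common and Extended lists above.
COMMON_PARAMS = {
    "model", "messages", "max_tokens", "temperature",
    "top_p", "stream", "tools",
}
EXTENDED_PARAMS = {"reasoning_effort", "stream_options", "thinking", "extra_body"}
SUPPORTED_PARAMS = COMMON_PARAMS | EXTENDED_PARAMS


def build_request(**kwargs):
    """Return kwargs as a dict after checking them against the listed names.

    Hypothetical helper: raises ValueError for any unlisted parameter,
    and (by assumption) for a missing model or messages argument.
    """
    unknown = sorted(set(kwargs) - SUPPORTED_PARAMS)
    if unknown:
        raise ValueError(f"unsupported parameter(s): {', '.join(unknown)}")
    for required in ("model", "messages"):
        if required not in kwargs:
            raise ValueError(f"missing required parameter: {required}")
    return dict(kwargs)


if __name__ == "__main__":
    request = build_request(
        model="o4-mini-deep-research",
        messages=[{"role": "user", "content": "Hello!"}],
        max_tokens=1024,
    )
    print(sorted(request))
```

This page does not state whether each extended parameter is accepted as a direct keyword argument or only through extra_body, so the check validates names only and leaves that routing to the caller.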

Cursor IDE Model IDs

Use these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.

o4-mini-deep-research

Compare with Other Models

See how this model compares to others from the same provider.

GPT Image 1.5

gpt-image-1.5 is an upgraded OpenAI image model that produces more detailed, realistic visuals with better composition and prompt fidelity. It improves editing, variation, and style control over earlier versions, making it well suited for creative design, illustration, marketing assets, and visual prototyping.

Context: 32.0K
Input: $0/M
Output: $0/M

GPT Audio (2025-08-28)

gpt-4o-mini-audio-preview is a smaller preview version of OpenAI’s audio-capable GPT-4o mini model that supports both audio and text inputs and outputs via the API. It enables the model to understand nuances in audio recordings and incorporate them into responses, making it useful for audio-enabled applications like transcription, speech understanding, and voice-driven interactions.

Context: 128K
Input: $0.875/M
Output: $1.75/M

GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, built on the unified GPT-5.4 architecture with enhanced reasoning capabilities for complex and high-stakes tasks. It supports text and image inputs and features a 1M+ token context window (≈922K input, 128K output) for handling large-scale workflows and long-context analysis. Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels in agentic coding, long-context problem solving, and complex multi-step workflows, making it well suited for advanced engineering, research, and high-reliability applications.

Context: 1.1M
Input: $24.00/M
Output: $144.00/M

GPT-5 Codex Low

GPT-5 Codex (Low) is a coding-focused version of GPT-5 built for both interactive development and long autonomous engineering tasks. It can create projects, add features, debug, refactor, and review code, producing cleaner and more controllable outputs than GPT-5. It integrates with developer tools (CLI, IDEs, GitHub, cloud), supports adjustable reasoning effort, handles multimodal inputs, and uses tools for search and environment setup — making it purpose-built for agentic coding workflows.

Context: 400K
Input: $0.625/M
Output: $5.00/M

GPT-4o Mini TTS

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, designed to convert text into natural-sounding audio output. It supports a variety of voices and tones, enabling flexible and expressive speech generation. Optimized for scalability and low cost, it is well suited for real-time voice applications, content narration, and high-volume audio generation workflows.

Context: 4.1K
Input: $0.30/M
Output: $0/M

GPT-4o Mini Transcribe

GPT-4o Mini Transcribe is a smaller, cost-efficient speech-to-text model built on GPT-4o Mini's audio capabilities. It is designed for high-volume transcription workloads, delivering reliable performance with lower cost and latency. Priced per token (input and output), it provides transparent, fine-grained billing, making it well suited for scalable transcription pipelines, real-time applications, and cost-sensitive deployments.

Context: 128K
Input: $0.625/M
Output: $0.625/M

GPT-4o Transcribe

GPT-4o Transcribe is OpenAI's high-quality speech-to-text model built on GPT-4o's audio capabilities. It delivers accurate transcription with strong language understanding, making it suitable for a wide range of audio processing tasks. Priced per token (input and output), it offers transparent, fine-grained billing, making it well suited for workflows that require scalable transcription, integration with LLM pipelines, and cost-aware processing.

Context: 128K
Input: $1.25/M
Output: $0/M

Whisper Large V3 Turbo

Whisper Large V3 Turbo is an optimized version of OpenAI's Whisper Large V3 speech recognition model, designed for high-speed and cost-efficient transcription. It supports 99+ languages and accepts common audio formats including mp3, mp4, wav, webm, flac, and ogg. With a ~12% word error rate and real-time speed factors up to 216×, it delivers fast, scalable performance for latency-sensitive and high-throughput transcription workloads, making it ideal for real-time and large-scale speech processing applications.
