GPT-4o Mini Transcribe is a smaller, cost-efficient speech-to-text model built on GPT-4o Mini's audio capabilities. It is designed for high-volume transcription workloads, delivering reliable performance with lower cost and latency. Priced per token (input and output), it provides transparent, fine-grained billing, making it well suited for scalable transcription pipelines, real-time applications, and cost-sensitive deployments.
Use the Apertis AI SDK, the OpenAI SDK, or make direct HTTP requests to our API.
    from openai import OpenAI

    client = OpenAI(
        api_key="YOUR_API_KEY",
        base_url="https://api.apertis.ai/v1"
    )

    response = client.chat.completions.create(
        model="gpt-4o-mini-transcribe",
        messages=[
            {"role": "user", "content": "Hello!"}
        ],
        max_tokens=1024,
        temperature=0.7
    )

    print(response.choices[0].message.content)

    # Optional: Enable context compression to reduce token usage
    # response = client.chat.completions.create(
    #     model="gpt-4o-mini-transcribe",
    #     messages=[{"role": "user", "content": "Hello!"}],
    #     extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}
    # )

Common parameters: model, file, language, prompt, response_format
Extended parameters: temperature, timestamp_granularities
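The listed parameters (file, language, prompt, response_format) have the shape of an audio transcription request, so a transcription call may be the more natural fit for this model. The following is a minimal sketch, not a confirmed Apertis recipe: it assumes the API exposes the OpenAI-compatible audio transcriptions route under the same base URL, and the helper names `build_transcription_params` and `transcribe` are hypothetical, introduced only for illustration.

```python
from pathlib import Path
from typing import Any, Optional

# Assumption: the OpenAI-compatible audio transcriptions route is served
# under the same base URL shown elsewhere on this page.
BASE_URL = "https://api.apertis.ai/v1"
MODEL = "gpt-4o-mini-transcribe"


def build_transcription_params(
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    response_format: str = "json",
    temperature: Optional[float] = None,
) -> dict:
    """Collect request parameters, omitting optional ones left unset."""
    params: dict = {"model": MODEL, "response_format": response_format}
    optional = {"language": language, "prompt": prompt, "temperature": temperature}
    params.update({k: v for k, v in optional.items() if v is not None})
    return params


def transcribe(audio_path: str, api_key: str, **options: Any) -> str:
    """Upload one audio file and return the transcript text."""
    # Imported lazily so the parameter helper works without the SDK installed.
    from openai import OpenAI

    client = OpenAI(api_key=api_key, base_url=BASE_URL)
    with Path(audio_path).open("rb") as audio_file:
        result = client.audio.transcriptions.create(
            file=audio_file, **build_transcription_params(**options)
        )
    # response_format="text" yields a plain string; "json" yields an
    # object that exposes the transcript as .text.
    return result if isinstance(result, str) else result.text
```

A call might look like `transcribe("meeting.mp3", api_key="YOUR_API_KEY", language="en")`, where `meeting.mp3` is a placeholder file name. Keeping parameter assembly separate from the network call makes it easy to check what will be sent before uploading any audio.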
Use these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.
See how this model compares to others from the same provider.
gpt-4o-mini-audio-preview is a smaller preview version of OpenAI’s audio-capable GPT-4o mini model that supports both audio and text inputs and outputs via the API. It enables the model to understand nuances in audio recordings and incorporate them into responses, making it useful for audio-enabled applications like transcription, speech understanding, and voice-driven interactions.
GPT-5.1 is the full-capability successor to GPT-5, offering stronger general reasoning, better instruction following, and a more natural conversational style. It uses adaptive reasoning to stay fast on simple questions while thinking more deeply on complex tasks, producing clearer, more grounded explanations. It shows steady improvements across math, coding, and structured analysis, with more coherent long-form output and more reliable tool use.
GPT-5 Mini is a smaller, faster, lower-cost version of GPT-5 for lighter reasoning workloads. It retains GPT-5's instruction-following and safety tuning, and serves as the successor to o4-mini.
o4-mini-deep-research is a faster, lower-cost version of OpenAI's deep-research model, designed for complex, multi-step investigations. It automatically relies on web_search for information gathering, which always adds extra usage cost.