GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, designed to convert text into natural-sounding audio output. It supports a variety of voices and tones, enabling flexible and expressive speech generation. Optimized for scalability and low cost, it is well suited for real-time voice applications, content narration, and high-volume audio generation workflows.
Use the Apertis AI SDK, the OpenAI SDK, or make direct HTTP requests to our API.
from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.apertis.ai/v1") response = client.chat.completions.create( model="gpt-4o-mini-tts-2025-12-15", messages=[ {"role": "user", "content": "Hello!"} ], max_tokens=1024, temperature=0.7) print(response.choices[0].message.content) # Optional: Enable context compression to reduce token usage# response = client.chat.completions.create(# model="gpt-4o-mini-tts-2025-12-15",# messages=[{"role": "user", "content": "Hello!"}],# extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}# )Common parameters: modelinputvoiceresponse_formatspeed
Extended parameters: instructionsstream_format
Use these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.
See how this model compares to others from the same provider.
gpt-4o-audio-preview adds support for audio inputs, allowing the model to understand nuances in audio recordings and enrich responses. It currently does not generate audio outputs, and audio input is billed per million audio tokens.
GPT-5.1 is the full-capability successor to GPT-5, offering stronger general reasoning, better instruction following, and a more natural conversational style. It uses adaptive reasoning to stay fast on simple questions while thinking more deeply on complex tasks, producing clearer, more grounded explanations. It shows steady improvements across math, coding, and structured analysis, with more coherent long-form output and more reliable tool use.
gpt-image-1.5 is an upgraded OpenAI image model that produces more detailed, realistic visuals with better composition and prompt fidelity. It improves editing, variation, and style control over earlier versions, making it well suited for creative design, illustration, marketing assets, and visual prototyping.
GPT-5.2 Chat (Instant) is the fast, lightweight version of the 5.2 family, tuned for low-latency conversation while keeping strong general intelligence. It uses adaptive reasoning to think more on tough questions, boosting accuracy in math, coding, and multi-step tasks without slowing normal chats. It’s friendlier by default, follows instructions well, and is ideal for high-throughput interactive use where speed and consistency matter more than deep deliberation.