OpenAI / voice

GPT-4o Mini TTS

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, designed to convert text into natural-sounding audio output. It supports a variety of voices and tones, enabling flexible and expressive speech generation. Optimized for scalability and low cost, it is well suited for real-time voice applications, content narration, and high-volume audio generation workflows.


Pricing

Input: $0.30 / 1M

Output: $0 / 1M

Cache Write: $0 / 1M

Cache Read: $0 / 1M

Web Search: $0 / 1M
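As a rough illustration of the input rate above, the following sketch estimates the cost of a request. The page does not say what the "1M" unit is; the sketch assumes it means one million billed input units (for example, tokens). The function name and constant are hypothetical helpers, not part of any SDK.

```python
# Input rate taken from the pricing list above: $0.30 per 1M input units.
# ASSUMPTION: "1M" means one million billed input units (e.g. tokens);
# the page does not state the unit explicitly.
INPUT_PRICE_PER_MILLION_USD = 0.30


def estimate_input_cost(input_units: int) -> float:
    """Return the estimated input cost in USD for a given number of units."""
    if input_units < 0:
        raise ValueError("input_units must be non-negative")
    return input_units / 1_000_000 * INPUT_PRICE_PER_MILLION_USD


# Example: 250,000 input units at $0.30 per 1M.
print(f"${estimate_input_cost(250_000):.3f}")  # prints "$0.075"
```

Because every other listed rate is $0, the input rate is the only term in this estimate.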

Quick Start

Use the Apertis AI SDK, the OpenAI SDK, or make direct HTTP requests to our API.

Endpoint:

python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.apertis.ai/v1"
)

response = client.chat.completions.create(
    model="gpt-4o-mini-tts-2025-12-15",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
    max_tokens=1024,
    temperature=0.7
)

print(response.choices[0].message.content)

# Optional: Enable context compression to reduce token usage
# response = client.chat.completions.create(
#     model="gpt-4o-mini-tts-2025-12-15",
#     messages=[{"role": "user", "content": "Hello!"}],
#     extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}
# )

Supported Parameters

Common parameters: model, input, voice, response_format, speed

Extended parameters: instructions, stream_format
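The parameters listed above match those of the OpenAI SDK's audio.speech.create call, so a speech request built from them might look like the sketch below. It assumes the Apertis base URL exposes an OpenAI-compatible speech route, which this page does not confirm. The voice name "alloy", the helper names, and the output path are illustrative choices, not values taken from this page.

```python
from typing import Any, Optional

MODEL_ID = "gpt-4o-mini-tts-2025-12-15"  # model ID as listed on this page


def build_speech_request(
    text: str,
    voice: str = "alloy",          # ASSUMPTION: illustrative voice name
    response_format: str = "mp3",  # ASSUMPTION: illustrative output format
    speed: Optional[float] = None,
    instructions: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble keyword arguments for a text-to-speech request."""
    if not text:
        raise ValueError("text must not be empty")
    payload: dict[str, Any] = {
        "model": MODEL_ID,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    # Optional parameters are only sent when explicitly provided.
    if speed is not None:
        payload["speed"] = speed
    if instructions is not None:
        payload["instructions"] = instructions
    return payload


def synthesize(payload: dict[str, Any], out_path: str) -> None:
    """Send the request and write the audio to out_path (needs network access)."""
    # Imported here so the request builder can be used without the SDK installed.
    from openai import OpenAI

    # ASSUMPTION: the Apertis endpoint accepts OpenAI-style speech requests.
    client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.apertis.ai/v1")
    with client.audio.speech.with_streaming_response.create(**payload) as response:
        response.stream_to_file(out_path)
```

A call such as synthesize(build_speech_request("Hello!", instructions="Speak calmly."), "hello.mp3") would then write the generated audio to hello.mp3, provided the route is available. Keeping the payload builder separate from the network call makes the parameters easy to inspect before sending.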


Cursor IDE Model IDs

Use these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.

gpt-4o-mini-tts-2025-12-15

Compare with Other Models

See how this model compares to others from the same provider.

GPT-4o Audio Preview (2025-06-03)

gpt-4o-audio-preview adds support for audio inputs, allowing the model to understand nuances in audio recordings and enrich responses. It currently does not generate audio outputs, and audio input is billed per million audio tokens.

GPT-5.1 (Low)

GPT-5.1 is the full-capability successor to GPT-5, offering stronger general reasoning, better instruction following, and a more natural conversational style. It uses adaptive reasoning to stay fast on simple questions while thinking more deeply on complex tasks, producing clearer, more grounded explanations. It shows steady improvements across math, coding, and structured analysis, with more coherent long-form output and more reliable tool use.

GPT Image 1.5

gpt-image-1.5 is an upgraded OpenAI image model that produces more detailed, realistic visuals with better composition and prompt fidelity. It improves editing, variation, and style control over earlier versions, making it well suited for creative design, illustration, marketing assets, and visual prototyping.

GPT-5.2 Chat

GPT-5.2 Chat (Instant) is the fast, lightweight version of the 5.2 family, tuned for low-latency conversation while keeping strong general intelligence. It uses adaptive reasoning to think more on tough questions, boosting accuracy in math, coding, and multi-step tasks without slowing normal chats. It’s friendlier by default, follows instructions well, and is ideal for high-throughput interactive use where speed and consistency matter more than deep deliberation.