← Back to Changelog

System Update

Released: 2026-05-05

## Full Audio API Support Apertis now supports the OpenAI-compatible Audio API. Use a single API key to access leading TTS (text-to-speech) and STT (speech-to-text) models across providers. ### Supported Models **Text-to-Speech (TTS)** - `gemini-3.1-flash-tts-preview` — Google's latest Flash TTS preview - `gpt-4o-mini-tts` — OpenAI's lightweight real-time speech synthesis **Speech-to-Text (STT)** - `gpt-4o-transcribe` — Flagship high-accuracy transcription - `gpt-4o-mini-transcribe` — Cost-efficient real-time transcription - `whisper-large-v3-turbo` — Accelerated Whisper v3 - `whisper-large-v3` — Full-precision Whisper - `whisper-1` — The classic, battle-tested baseline ### Endpoints Drop-in compatible with the OpenAI SDK — no code changes required: - `POST /v1/audio/speech` — text → audio - `POST /v1/audio/transcriptions` — audio → text - `POST /v1/audio/translations` — audio → translated text ### Billing - **PAYG (pay-as-you-go)**: shares the same quota balance as chat/completions - **Per-dimension billing**: priced separately on input tokens / output tokens / audio seconds, with admin-tunable AudioRatio - **File limit**: 25 MB per multipart upload - **Subscriptions**: audio models are PAYG-only for now (not included in subscription plans) ### Example ```python from openai import OpenAI client = OpenAI( api_key="sk-your-apertis-key", base_url="https://api.apertis.ai/v1" ) # TTS speech = client.audio.speech.create( model="gpt-4o-mini-tts", voice="alloy", input="Hello from Apertis." ) speech.stream_to_file("hello.mp3") # STT with open("audio.mp3", "rb") as f: transcript = client.audio.transcriptions.create( model="whisper-large-v3-turbo", file=f ) print(transcript.text) ``` ### Model Detail Page Updates - Endpoint and code samples auto-switch based on the model's task - TTS models now emit ready-to-run OpenAI SDK Python snippets - Web Search pricing column hidden for voice models (:web is unsupported)