Feature
System Update
Audio APIs Now Live
Full Audio API Support
Apertis now supports the OpenAI-compatible Audio API. Use a single API key to access leading TTS (text-to-speech) and STT (speech-to-text) models across providers.
Supported Models
Text-to-Speech (TTS)
gemini-3.1-flash-tts-preview— Google's latest Flash TTS previewgpt-4o-mini-tts— OpenAI's lightweight real-time speech synthesis
Speech-to-Text (STT)
gpt-4o-transcribe— Flagship high-accuracy transcriptiongpt-4o-mini-transcribe— Cost-efficient real-time transcriptionwhisper-large-v3-turbo— Accelerated Whisper v3whisper-large-v3— Full-precision Whisperwhisper-1— The classic, battle-tested baseline
Endpoints
Drop-in compatible with the OpenAI SDK — no code changes required:
POST /v1/audio/speech— text → audioPOST /v1/audio/transcriptions— audio → textPOST /v1/audio/translations— audio → translated text
Billing
- PAYG (pay-as-you-go): shares the same quota balance as chat/completions
- Per-dimension billing: priced separately on input tokens / output
tokens / audio seconds, with admin-tunable AudioRatio
- File limit: 25 MB per multipart upload
- Subscriptions: audio models are PAYG-only for now (not included
in subscription plans)
Example
from openai import OpenAI
client = OpenAI(
api_key="sk-your-apertis-key",
base_url="https://api.apertis.ai/v1"
)
# TTS
speech = client.audio.speech.create(
model="gpt-4o-mini-tts",
voice="alloy",
input="Hello from Apertis."
)
speech.stream_to_file("hello.mp3")
# STT
with open("audio.mp3", "rb") as f:
transcript = client.audio.transcriptions.create(
model="whisper-large-v3-turbo",
file=f
)
print(transcript.text)Model Detail Page Updates
- Endpoint and code samples auto-switch based on the model's task
- TTS models now emit ready-to-run OpenAI SDK Python snippets
- Web Search pricing column hidden for voice models (:web is unsupported)