text-embedding-3-largetext-embedding-3-large is OpenAI's most powerful embedding model, producing numeric representations of text to measure similarity. It works well for both English and non-English content and is widely used for tasks like search, clustering, recommendations, anomaly detection, and classification.
Select an endpoint and copy a working example for this model.
from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.apertis.ai/v1") response = client.chat.completions.create( model="text-embedding-3-large", messages=[ {"role": "user", "content": "Hello!"} ], max_tokens=1024, temperature=0.7) print(response.choices[0].message.content) # Optional: Enable context compression to reduce token usage# response = client.chat.completions.create(# model="text-embedding-3-large",# messages=[{"role": "user", "content": "Hello!"}],# extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}# )modelinputencoding_formatdimensionsuserUse these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.
See how this model compares to others from the same provider.
GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, designed to convert text into natural-sounding audio output. It supports a variety of voices and tones, enabling flexible and expressive speech generation. Optimized for scalability and low cost, it is well suited for real-time voice applications, content narration, and high-volume audio generation workflows.
GPT-4o Mini Transcribe is a smaller, cost-efficient speech-to-text model built on GPT-4o Mini's audio capabilities. It is designed for high-volume transcription workloads, delivering reliable performance with lower cost and latency. Priced per token (input and output), it provides transparent, fine-grained billing, making it well suited for scalable transcription pipelines, real-time applications, and cost-sensitive deployments.
GPT-4o Transcribe is OpenAI's high-quality speech-to-text model built on GPT-4o's audio capabilities. It delivers accurate transcription with strong language understanding, making it suitable for a wide range of audio processing tasks. Priced per token (input and output), it offers transparent, fine-grained billing, making it well suited for workflows that require scalable transcription, integration with LLM pipelines, and cost-aware processing.
Whisper Large V3 Turbo is an optimized version of OpenAI's Whisper Large V3 speech recognition model, designed for high-speed and cost-efficient transcription. It supports 99+ languages and accepts common audio formats including mp3, mp4, wav, webm, flac, and ogg. With a ~12% word error rate and real-time speed factors up to 216×, it delivers fast, scalable performance for latency-sensitive and high-throughput transcription workloads, making it ideal for real-time and large-scale speech processing applications.
See how this model compares to others from the same provider.
GPT-5.5 Pro is OpenAI's high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It supports text and image inputs and features a 1M+ token context window (≈922K input, 128K output) for handling large-scale, long-context tasks. Designed for long-horizon problem solving, agentic coding, and precise multi-step execution, GPT-5.5 Pro delivers strong reliability and performance across advanced engineering, research, and complex workflow scenarios.
GPT-4o Transcribe is OpenAI's high-quality speech-to-text model built on GPT-4o's audio capabilities. It delivers accurate transcription with strong language understanding, making it suitable for a wide range of audio processing tasks. Priced per token (input and output), it offers transparent, fine-grained billing, making it well suited for workflows that require scalable transcription, integration with LLM pipelines, and cost-aware processing.
GPT-5.2-Codex is OpenAI's most advanced agentic coding model yet, built for complex, real-world software engineering and defensive cybersecurity. It’s a version of GPT-5.2 further optimized for Codex, with improvements in long-horizon coding tasks (like refactors and migrations), better handling of long contexts, stronger performance on large code changes, enhanced Windows support, and significantly stronger cybersecurity capabilities.
GPT-5.1 is the full-capability successor to GPT-5, offering stronger general reasoning, better instruction following, and a more natural conversational style. It uses adaptive reasoning to stay fast on simple questions while thinking more deeply on complex tasks, producing clearer, more grounded explanations. It shows steady improvements across math, coding, and structured analysis, with more coherent long-form output and more reliable tool use.
Initialized observational baseline with no recorded failures