OpenAIEmbedding

text-embedding-3-large

text-embedding-3-large

text-embedding-3-large is OpenAI's most powerful embedding model, producing numeric representations of text to measure similarity. It works well for both English and non-English content and is widely used for tasks like search, clustering, recommendations, anomaly detection, and classification.

Context: 8.2K tokens
Endpoint

Get API Key Compare

Pricing

Input$0.091 / 1M

Output$0.091 / 1M

Cache Write$0 / 1M

Cache Read$0 / 1M

Prompt cache writes and reads are included at no additional cost.

Quick Start

Select an endpoint and copy a working example for this model.

Endpoint:

python

from openai import OpenAI client = OpenAI(    api_key="YOUR_API_KEY",    base_url="https://api.apertis.ai/v1") response = client.chat.completions.create(    model="text-embedding-3-large",    messages=[        {"role": "user", "content": "Hello!"}    ],    max_tokens=1024,    temperature=0.7) print(response.choices[0].message.content) # Optional: Enable context compression to reduce token usage# response = client.chat.completions.create(#     model="text-embedding-3-large",#     messages=[{"role": "user", "content": "Hello!"}],#     extra_body={"compression": {"enabled": True, "model": "gpt-4.1-mini"}}# )

Supported Parameters

API docs

Common4 params

modelinputencoding_formatdimensions

Extended1 param

user

Cursor IDE Model IDs

Use these namespaced identifiers in Cursor IDE to avoid conflicts with built-in models.

text-embedding-3-large

Compare with Other Models

See how this model compares to others from the same provider.

GPT-5.5 Pro

GPT-5.5 Pro is OpenAI's high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It supports text and image inputs and features a 1M+ token context window (≈922K input, 128K output) for handling large-scale, long-context tasks. Designed for long-horizon problem solving, agentic coding, and precise multi-step execution, GPT-5.5 Pro delivers strong reliability and performance across advanced engineering, research, and complex workflow scenarios.

Context: 1.1M
Input: $30.00/M
Output: $180.00/M

GPT-4o Transcribe

GPT-4o Transcribe is OpenAI's high-quality speech-to-text model built on GPT-4o's audio capabilities. It delivers accurate transcription with strong language understanding, making it suitable for a wide range of audio processing tasks. Priced per token (input and output), it offers transparent, fine-grained billing, making it well suited for workflows that require scalable transcription, integration with LLM pipelines, and cost-aware processing.

Context: 128K
Input: $1.25/M
Output: $0/M

GPT-5.2 Codex

GPT-5.2-Codex is OpenAI's most advanced agentic coding model yet, built for complex, real-world software engineering and defensive cybersecurity. It’s a version of GPT-5.2 further optimized for Codex, with improvements in long-horizon coding tasks (like refactors and migrations), better handling of long contexts, stronger performance on large code changes, enhanced Windows support, and significantly stronger cybersecurity capabilities.

Context: 196K
Input: $0.50/M
Output: $4.00/M

GPT-5.1 (Thinking)

GPT-5.1 is the full-capability successor to GPT-5, offering stronger general reasoning, better instruction following, and a more natural conversational style. It uses adaptive reasoning to stay fast on simple questions while thinking more deeply on complex tasks, producing clearer, more grounded explanations. It shows steady improvements across math, coding, and structured analysis, with more coherent long-form output and more reliable tool use.

Context: 400K
Input: $0.875/M
Output: $7.00/M

Compare with Other Models

See how this model compares to others from the same provider.

GPT-4o Mini TTS

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, designed to convert text into natural-sounding audio output. It supports a variety of voices and tones, enabling flexible and expressive speech generation. Optimized for scalability and low cost, it is well suited for real-time voice applications, content narration, and high-volume audio generation workflows.

Context: 4.1K
Input: $0.30/M
Output: $0/M

GPT-4o Mini Transcribe

GPT-4o Mini Transcribe is a smaller, cost-efficient speech-to-text model built on GPT-4o Mini's audio capabilities. It is designed for high-volume transcription workloads, delivering reliable performance with lower cost and latency. Priced per token (input and output), it provides transparent, fine-grained billing, making it well suited for scalable transcription pipelines, real-time applications, and cost-sensitive deployments.

Context: 128K
Input: $0.625/M
Output: $0.625/M

GPT-4o Transcribe

Context: 128K
Input: $1.25/M
Output: $0/M

Whisper Large V3 Turbo

Whisper Large V3 Turbo is an optimized version of OpenAI's Whisper Large V3 speech recognition model, designed for high-speed and cost-efficient transcription. It supports 99+ languages and accepts common audio formats including mp3, mp4, wav, webm, flac, and ogg. With a ~12% word error rate and real-time speed factors up to 216×, it delivers fast, scalable performance for latency-sensitive and high-throughput transcription workloads, making it ideal for real-time and large-scale speech processing applications.

text-embedding-3-large

Pricing

Quick Start

Supported Parameters

Cursor IDE Model IDs

Compare with Other Models

GPT-5.5 Pro

GPT-4o Transcribe

GPT-5.2 Codex

GPT-5.1 (Thinking)

Compare with Other Models

GPT-4o Mini TTS

GPT-4o Mini Transcribe

GPT-4o Transcribe

Whisper Large V3 Turbo

Developers

Contact

Legal

Compare with Other Models

GPT-5.5 Pro

GPT-4o Transcribe

GPT-5.2 Codex

GPT-5.1 (Thinking)

Metadata

Observed Availability