28 AI Models — Browse, Compare & Use
Explore pricing, context windows, and capabilities across every major AI provider, all accessible through a single API.
28 AI Models — Browse, Compare & Use
Explore pricing, context windows, and capabilities across every major AI provider, all accessible through a single API.
| Provider | Model | Price | Context |
|---|---|---|---|
DeepSeek V4 Pro is a large-scale Mixture-of-Experts (MoE) model with 1.6T total parameters and 49B activated per token, supporting a 1M-token context window for advanced reasoning and long-horizon workflows. It delivers strong performance across knowledge, mathematics, and software engineering tasks, making it suitable for complex, real-world applications. Built on a hybrid attention architecture for efficient long-context processing, the model supports configurable reasoning modes to balance speed and depth. It is well suited for full codebase analysis, multi-step automation, and large-scale information synthesis, where both capability and efficiency are essential.https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro | IN:$0.000435OUT:$0.00087/1K Tks | Context:1.0M | |
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts (MoE) model with 284B total parameters and 13B activated per token, designed for fast inference and high-throughput workloads. It supports a 1M-token context window, enabling large-scale reasoning and long-context processing. Built with hybrid attention for efficiency, the model maintains strong performance in reasoning and coding while offering configurable reasoning modes. It is well suited for coding assistants, chat systems, and agent workflows where responsiveness and cost efficiency are critical. | IN:$0.00014OUT:$0.00028/1K Tks | Context:1.0M | |
DeepSeek-V3.2 is an efficiency-focused large model that combines strong reasoning with reliable tool use. It introduces DeepSeek Sparse Attention to lower compute costs for long contexts while preserving quality, and uses large-scale reinforcement learning to reach GPT-5-class reasoning (including top IMO/IOI results). An agentic task-synthesis pipeline improves how it reasons with tools in interactive settings — and developers can toggle reasoning on or off as needed. | IN:$0.00019OUT:$0.000275/1K Tks | Context:164K | |
DeepSeek-V3.2-Speciale is a high-compute edition of V3.2 built for top-tier reasoning and agent performance. Using DeepSeek Sparse Attention and extensive reinforcement learning, it surpasses GPT-5 on tough reasoning benchmarks and approaches Gemini 3 Pro–level capability, while still remaining strong at coding and tool use. It also draws on a large agent-training pipeline to boost reliability and generalization in interactive environments. | IN:$0.00028OUT:$0.0004/1K Tks | Context:164K | |
DeepSeek-V3.2 is an efficiency-focused large model that pairs strong reasoning with reliable tool use. It introduces DeepSeek Sparse Attention to cut long-context compute costs, and uses large-scale reinforcement learning to reach GPT-5-class performance (including top IMO/IOI results). An agentic task-synthesis pipeline further improves how it reasons with tools, and developers can toggle its reasoning mode on or off as needed. | IN:$0.00015OUT:$0.00024/1K Tks | Context:164K | |
DeepSeek-V3.2-Exp is an experimental model released between V3.1 and future DeepSeek architectures. It introduces DeepSeek Sparse Attention to improve efficiency on long-context tasks while preserving quality, and allows developers to toggle reasoning on or off. Trained to compare directly with V3.1, its performance is similar across reasoning, coding, and tool-use, making it mainly a research model for testing efficient transformer designs rather than pushing raw accuracy. | IN:$0.00015OUT:$0.00024/1K Tks | Context:164K | |
DeepSeek-V3.1 is a hybrid reasoning model (671B total / 37B active) that supports switchable thinking and non-thinking modes. It extends DeepSeek-V3 with long-context training up to 128K tokens and efficient FP8 inference. It delivers faster performance while matching DeepSeek-R1 on tough reasoning and coding tasks, and supports structured tool use and agent workflows. | IN:$0.0002OUT:$0.0008/1K Tks | Context:164K | |
| IN:$0.0001OUT:$0.0002/1K Tks | Context:131K | ||
| Free | Context:131K | ||
| IN:$0.00006OUT:$0.00009/1K Tks | Context:128K | ||
| IN:$0.0005OUT:$0.00218/1K Tks | Context:164K | ||
| IN:$0.0007OUT:$0.002/1K Tks | Context:131K | ||
| IN:$0OUT:$0/1K Tks | Context:164K | ||
| IN:$0.0028OUT:$0.0112/1K Tks | Context:164K | ||
| IN:$0.0003OUT:$0.00088/1K Tks | Context:164K | ||
| IN:$0.0056OUT:$0.0224/1K Tks | Context:164K | ||
| IN:$0OUT:$0/1K Tks | Context:164K | ||
| IN:$0.00004OUT:$0.00004/1K Tks | Context:32K | ||
| IN:$0.00018OUT:$0.00018/1K Tks | Context:131K | ||
| Free | Context:16K | ||
| IN:$0.00012OUT:$0.00018/1K Tks | Context:131K | ||
| Free | Context:64K | ||
| IN:$0.0016OUT:$0.0016/1K Tks | Context:64K | ||
| IN:$0.00023OUT:$0.00069/1K Tks | Context:131K | ||
| IN:$0.0005OUT:$0.00218/1K Tks | Context:164K | ||
| Free | Context:164K | ||
| IN:$0.00038OUT:$0.00089/1K Tks | Context:164K | ||
| IN:$0.006OUT:$0.006/1K Tks | - |