← Back to Changelog
Models Added
Released: 2026-04-24
## DeepSeek V4 Pro
DeepSeek V4 Pro is a large-scale Mixture-of-Experts (MoE) model with 1.6T total parameters and 49B activated per token, supporting a 1M-token context window for advanced reasoning and long-horizon workflows.
It delivers strong performance across knowledge, mathematics, and software engineering tasks, making it suitable for complex, real-world applications.
## DeepSeek V4 Flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts (MoE) model with 284B total parameters and 13B activated per token, designed for fast inference and high-throughput workloads.
It supports a 1M-token context window, enabling large-scale reasoning and long-context processing.
**Enjoy them.**