Feature

2.2.52Models Added

Add DeepSeek V4 Pro & DeepSeek V4 Flash

DeepSeek V4 ProDeepSeek V4 Flash

DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts (MoE) model with 1.6T total parameters and 49B activated per token, supporting a 1M-token context window for advanced reasoning and long-horizon workflows.

It delivers strong performance across knowledge, mathematics, and software engineering tasks, making it suitable for complex, real-world applications.

DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts (MoE) model with 284B total parameters and 13B activated per token, designed for fast inference and high-throughput workloads.

It supports a 1M-token context window, enabling large-scale reasoning and long-context processing.

Enjoy them.