Models Added

Released: 2026-04-23

## MiMo-V2.5 MiMo-V2.5 is Xiaomi's native omnimodal model, delivering pro-level agentic performance at roughly half the inference cost. It surpasses MiMo-V2-Omni in multimodal perception, particularly in image and video understanding. With a 1M-token context window, it can handle complete documents, extended conversations, and complex task contexts in a single pass. Combining strong reasoning, rich perception, and cost efficiency, MiMo-V2.5 is well suited for integration into advanced agent frameworks and real-world multimodal applications. ## MiMo-V2.5-Pro MiMo-V2.5-Pro is Xiaomi's flagship model, delivering top-tier performance in agentic capabilities, complex software engineering, and long-horizon tasks. It ranks highly on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro, demonstrating strong real-world reliability. The model can autonomously complete professional tasks that would take human experts days or weeks, executing thousands of tool calls within a single workflow. With a 1M-token context window, it is well suited for integration into advanced agent frameworks and large-scale task orchestration systems. **Enjoy it.**