Feature

2.2.50Models Added

Add Xiaomi MiMo-V2.5 & MiMo-V2.5-Pro

MiMo-V2.5MiMo-V2.5-Pro

MiMo-V2.5

MiMo-V2.5 is Xiaomi's native omnimodal model, delivering pro-level agentic performance at roughly half the inference cost. It surpasses MiMo-V2-Omni in multimodal perception, particularly in image and video understanding. With a 1M-token context window, it can handle complete documents, extended conversations, and complex task contexts in a single pass.

Combining strong reasoning, rich perception, and cost efficiency, MiMo-V2.5 is well suited for integration into advanced agent frameworks and real-world multimodal applications.

MiMo-V2.5-Pro

MiMo-V2.5-Pro is Xiaomi's flagship model, delivering top-tier performance in agentic capabilities, complex software engineering, and long-horizon tasks. It ranks highly on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro, demonstrating strong real-world reliability. The model can autonomously complete professional tasks that would take human experts days or weeks, executing thousands of tool calls within a single workflow.

With a 1M-token context window, it is well suited for integration into advanced agent frameworks and large-scale task orchestration systems.

Enjoy it.