Xiaomi
Models from Xiaomi
MiMo-V2-Flash
Lightweight, high-speed reasoning model with hybrid attention and multi-token prediction for low-cost inference and strong benchmark scores.
MiMo-V2-Omni
Omni-modal foundation model that natively understands text, images, audio, and video with deep reasoning, web search, and multi-step planning.
MiMo-V2-Pro
Reasoning and agentic foundation model on a trillion-parameter MoE (42B active) with up to 1M-token context for complex coding and agent work.
MiMo-V2.5
Multimodal model with native visual and audio understanding on a 1M context, designed to reason and act across modalities in agentic workflows.
MiMo-V2.5-Pro
Top-tier model for agentic workflows, complex software engineering, and long-horizon tasks, sustaining work across 1000+ tool calls on 1M context.
