← Back to Changelog
Models Added
Released: 2026-04-29
## Nemotron 3 Nano Omni (Free)
NVIDIA Nemotron 3 Nano Omni is an open 30B-A3B multimodal model designed as a perception and context sub-agent for enterprise agent systems. It supports text, image, video, and audio inputs with text output, enabling unified multimodal reasoning within a single inference loop. Built on a hybrid MoE Transformer–Mamba architecture with Conv3D video layers and Efficient Video Sampling (EVS), it delivers significantly improved efficiency for video reasoning—achieving ~2× higher throughput and 2.5× lower compute compared to separate pipelines.
With up to 300K context length and extended thinking support, it is well suited for scalable, multimodal agent workflows.
**Enjoy it.**