← Library · Frontier

NVIDIA Releases Nemotron 3 Ultra for Enhanced Agentic AI

NVIDIA launched Nemotron 3 Ultra on June 4, 2026, a 550B-parameter Mixture-of-Experts model with 55B active parameters, optimized for orchestrating complex, long-running agent workflows. It features architectural innovations such as hybrid Mamba-Transformer layers for efficient long-context handling, NVFP4 quantization for cross-GPU deployment, LatentMoE for expert routing, and multi-token prediction for improved generative speed. The model is open-source, including weights, data, and recipes, allowing for broad adoption and fine-tuning.

Why it matters

Nemotron 3 Ultra provides a robust and efficient foundation for building advanced AI agents, with features like open-source availability and hardware optimization that facilitate wider deployment.

Learn one new AI thing every day.

Daily Deck sends you seven plain-English cards like this every morning. Free.

Start free