NVIDIA's Open-Source "Nemotron 3 Super" AI model has topped the EnterpriseOps-Gym leaderboard, showcasing NVIDIA's software prowess.
NVIDIA Is Topping Both AI Hardware and Software Leaderboards With Its Open-Source Nemotron 3 Super, Leading The Pack
In March this year, NVIDIA introduced its Nemotron 3 Super, a 120B-parameter AI model with 12B active parameters. Based on a hybrid MoE architecture, the model is designed to deliver 5x the throughput of the previous Nemotron Super model, and it tackles large contexts with a native 1M-token context window that gives agents long-term memory for aligned, high-accuracy reasoning.
Some of the highlights of NVIDIA's Nemotron 3 Super model include:
- Latent MoE that calls 4x as many expert specialists for the same inference cost, by compressing tokens before they reach the experts.
- Multi-token prediction (MTP) that predicts multiple future tokens in one forward pass, dramatically reducing generation time for long sequences and enabling built-in speculative decoding.
- Hybrid Mamba-Transformer backbone integrating Mamba layers for sequence efficiency with Transformer layers for precision reasoning, delivering higher throughput with 4x improved memory and compute efficiency.
- Native NVFP4 pretraining optimized for NVIDIA Blackwell, significantly cutting memory requirements and speeding up inference by 4x on NVIDIA B200 compared to FP8 on NVIDIA H100, while maintaining accuracy.
- Multi-environment reinforcement learning (RL) post-training across 21 environment configurations using NVIDIA NeMo Gym and NVIDIA NeMo RL, with more than 1.2 million environment rollouts.
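For readers wondering how a 120B model can run with only 12B active parameters, the idea behind mixture-of-experts routing can be sketched in a few lines of Python. This is a generic top-k routing illustration, not NVIDIA's Latent MoE implementation: the function names, the toy experts, and the router scores below are all hypothetical, and only the 12B and 120B figures come from the article.

```python
import math

def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    # Pick the k experts with the largest router scores.
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax over the selected scores only, so the weights sum to 1.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def moe_forward(token, experts, router_logits, k):
    """Combine the outputs of the k selected experts; the rest never run."""
    out = [0.0] * len(token)
    for idx, weight in top_k_route(router_logits, k):
        expert_out = experts[idx](token)
        out = [o + weight * v for o, v in zip(out, expert_out)]
    return out

# Toy experts (hypothetical): each one scales the token by a constant.
experts = [lambda x, s=s: [s * v for v in x] for s in (1.0, 2.0, 3.0, 4.0)]
logits = [0.1, 2.0, -1.0, 2.0]  # made-up router scores for one token

routes = top_k_route(logits, k=2)
mixed = moe_forward([1.0, 2.0], experts, logits, k=2)

# Share of parameters touched per token, using the article's figures.
active_fraction = 12 / 120
```

Only the selected experts do any work for a given token, which is why the per-token compute tracks the active parameter count (about a tenth of the total here) rather than the full model size.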
Today, NVIDIA has shared new benchmark results for Nemotron 3 Super on EnterpriseOps-Gym. This benchmark evaluates the performance of various models across 1,150 tasks in fully interactive environments with 512 functional tools. The tasks require agents to coordinate across multiple enterprise systems and tools to complete a single workflow.
On the open-source model leaderboard, NVIDIA's Nemotron 3 Super now claims the top position with an average score of 27.3 points. The model leads in TEAMS, Email, and Hybrid workflows while being highly competitive in CSM, ITSM, and Drive workflows. With this performance, NVIDIA's AI model beats Kimi-K2.5, which now sits in 2nd place, and DeepSeek v3.2, which sits in 3rd place. The GPT-OSS-120B model sits in 5th place.
NVIDIA's Nemotron 3 lineup of open AI models includes Nano, Super, and Ultra. The company also introduced Nemotron 3 Nano Omni, offering a 9x boost in agentic AI throughput. With its latest AI models, NVIDIA continues to prove that it's a complete AI stack provider, with both its hardware and software capabilities outclassing the competition, which adds to the case that going all-in on NVIDIA's solutions is probably the best choice for AI.
