Nvidia’s Nemotron 3 Ultra Is Free, Open, and 5x Faster — Why That Matters for the Rest of Us

Nvidia released a free, open AI model that’s 5x faster — and handed over the data it was trained on. Here’s why open matters.

Nvidia’s Nemotron 3 Ultra Is Free, Open, and 5x Faster — Why That Matters for the Rest of Us

Most of the Nvidia keynote was about hardware you’ll never buy and AI factories you’ll never see. Nemotron 3 Ultra is the exception — a free, open AI model that anyone can download, take apart, and build on. And while “open model” gets thrown around loosely these days, the thing Nvidia did here is rarer than it sounds.

Here’s what it is and why it’s worth a moment of your attention even if you’ll never train an AI in your life.

What Nemotron 3 Ultra is

Nemotron 3 Ultra is Nvidia’s new flagship open model, and it’s built on an architecture the company says is a first: a hybrid of state space models (SSMs) and mixture-of-experts (MoE). The short version of the jargon — SSMs are a faster, more efficient alternative to the transformer design behind most chatbots, and mixture-of-experts means only the relevant slice of the model fires for any given task rather than the whole thing.

The practical upshot is a model designed to think quickly and cheaply, which matters when you’re running it thousands of times a day rather than chatting with it occasionally.

NVIDIA Nemotron 3 family graphic showcasing models: Nano, Super, and Ultra with availability details.

The genuinely unusual part: open data

Plenty of companies release “open” models, but what they usually mean is open weights — the finished brain, with no explanation of how it got that way. Nvidia went further. Alongside the model, it’s releasing the data it trained on and the training recipe itself.

That’s the difference between being handed a cake and being handed the cake, the recipe, and the shopping list. It means a company or a university can not only run Nemotron, but retrain it on their own material, audit what went into it, and turn it into something genuinely their own. Nvidia framed this as a coalition effort, with partners pooling data — and the stated goal is simple: take all of it, add to it, make it better, make it yours.

5x faster, 30% cheaper — with an asterisk

Nvidia’s headline claims are that Nemotron 3 Ultra runs five times faster and around 30% cheaper than the best open models available, measured on compute and inference time. Those are Nvidia’s own numbers, against rivals Nvidia chose, so treat them as a starting point rather than gospel until independent testing catches up.

Still, even discounted, the direction is clear: the model is built to be cheap to run, which is the whole point of open weights. A model is only useful to most businesses if they can afford to run it at scale.

Diagram illustrating a network of nodes and connections representing data flow in machine learning models.

Why this matters if you’ll never train a model

Here’s the part that reaches beyond developers. Right now, serious AI mostly means renting capacity from a handful of giant providers. Capable open models that ship with their training recipe chip away at that, lowering the barrier for smaller companies, research labs and entire regions to build AI they actually control rather than licence.

For the Gulf, where governments and businesses have been vocal about wanting home-grown, sovereign AI rather than dependence on foreign clouds, that’s not a small thing. A free, fast, properly open model is exactly the kind of foundation that makes “build your own” go from a slide deck to a project.

And Nvidia isn’t standing still — Huang noted the team is already working on Nemotron 4, with a roadmap stretching well beyond it.

The verdict

Nemotron 3 Ultra is one of the few announcements from the keynote with consequences you might actually feel, even indirectly. The speed and cost claims need independent verification, and “open” is only as useful as the community that picks it up. But releasing the data and the recipe, not just the weights, is a meaningfully more open gesture than the industry norm — and the more credible open models there are, the less the future of AI belongs to three companies. That’s worth rooting for, whoever’s name is on the model.

Subscribe to our newsletter

Subscribe to our newsletter to get the latest updates and news

Member discussion