The New Stack March 12, 2026 21:11

Nvidia releases a super model

On Wednesday, Nvidia released Nemotron 3 Super, the second model in its open-weight Nemotron 3 family — just before its GTC conference kicks off in San Jose next week.

The model is fast and efficient enough, reports TNS Senior Editor for AI Frederic Lardinois, to manage complex agentic AI systems at scale. As Lardinois points out in his story, the benchmarks appear to back up Nvidia’s claims. Here’s one from benchmarking firm Artificial Analysis: Nemotron 3 Super clocks in at 478 output tokens per second, which is almost twice as fast as OpenAI’s open-weight model.

The new model is now available on build.nvidia.com, Perplexity, OpenRouter, and Hugging Face. Enterprises will also be able to access it through Google Cloud’s Vertex AI and Oracle Cloud Infrastructure, with Amazon Bedrock and Microsoft Azure to follow soon, as well as on platforms like CoreWeave, Crusoe, Nebius, and Together AI.

There’s still no word on the debut of Nemotron 3 Ultra, the 500-billion-parameter sibling that Nvidia teased last year. (Maybe GTC will bring answers.)

Go deeper: Nvidia launches Nemotron 3 Super, a 120B open model for large-scale AI systems

TOP OF THE STACK

Tetrate launches open source marketplace to simplify Envoy adoption

by Adrian Bridgwater

Tetrate launches Built on Envoy, a free open source extensions marketplace to simplify Envoy proxy adoption for cloud-native development teams.

Agentic AI development company Tetrate has launched Built on Envoy, a free and open source extensions marketplace for Envoy. Envoy is an open source edge and service proxy for cloud-native applications. Tracing its origins back to ride-hailing service...

WHAT ELSE IS NEW?

AI Models
Nvidia launches Nemotron 3 Super, a 120B open model for large-scale AI systems

by Frederic Lardinois

Developer tools
Microsoft's VS Code team moved to weekly releases after 10 years of monthly — and credits AI for making it possible

by Darryl K. Taft

AI Agents
JetBrains names the debt AI agents leave behind

by Darryl K. Taft

AI Operations
"Self-healing" IT? HPE research explores how AI-trained models can catch silent infrastructure failures

by Jennifer Riggins

Security
The 2 failures with AI coding that are creating security bottlenecks

by Julie Davila

Tech Culture
Publish your data, AI techniques, and agentic engineering work on Towards Data Science

by Ludovic Benistant

AI
Amazon calls engineers for a “deep dive” internal meeting to discuss “GenAI”-related outages

by Meredith Shubel

AI Agents
With its latest Phi-4 reasoning model, Microsoft reckons bigger isn’t always better

by Paul Sawers

AI Agents
Nvidia plans NemoClaw launch, an open-source platform for AI agents

by Meredith Shubel

AI
How to deploy an AI server on your Debian/Ubuntu server

by Jack Wallen

How to create production-ready code with Claude Code

From our partner

Coding agents can quickly generate a lot of code. However, while Cursor and Claude Code allow you to build apps rapidly, their initial output often isn’t production-ready. Explore this article to learn how to generate robust, high-quality code with AI agents.

FLOW STATE

Trending story: OpenAI’s Codex is now on Windows

OpenAI's agentic coding app arrives on Windows with native sandboxing, PowerShell support, and a new WinUI skill — no Mac required.

Look: Why the “bible” of data systems is getting a massive rewrite for 2026

The data systems "bible" gets a 2026 rewrite. Martin Kleppmann and Chris Riccomini discuss updates for AI and cloud-native architectures.

Upcoming webinar: AI-powered Kubernetes observability best practices in 2026

Join us live on March 19 to explore how to leverage AI-powered insights to ensure the health, performance, and security of your increasingly complex K8s environments.

Read: Your AI strategy is built on layers of API sediment

AI protocols, such as MCP and Agent Skills, are agent-first, which risks bypassing the governance, security, and access controls that enterprises have spent years building around their APIs and data.
