NVIDIA has started shipping the very first Vera CPUs to major AI firms, marking the official beginning of production to accelerate Agentic AI forward.
NVIDIA Vera CPUs, Purpose-Built For Agentic AI, Are Now In Full Production As The Company Hands Out First Chips To Several AI Firms
The Vera CPU is the next chapter for NVIDIA's Agentic AI ecosystem, setting the stage for its next multi-billion dollar frontier.
- What it is — NVIDIA’s first custom CPU, designed for agentic AI
- What it handles — Orchestration, tool-calling, RL workloads, data analytics, agent sandboxing, long-context state management
- Who it’s for — AI labs, cloud providers, and enterprises running agentic AI at scale
- Core specs — 88 custom Olympus cores, 1.2 TB/s memory bandwidth, 50% faster per-core under full load
Today, NVIDIA's Vice President of Hyperscale and High-Performance Computing, Ian Buck, hand-delivered the first Vera CPUs to four major AI firms who are accelerating Agentic AI forward, these include OpenAI, SpaceX, Anthropic, and Oracle Cloud. This marks the first time that the Vera CPU left NVIDIA's labs and landed in the hands of its first customers.
Ian shares his journey in delivering the Vera CPUs. The first Vera CPUs landed at Anthropic's SoMa offices in San Francisco. Next stop, OpenAI's Mission Bay HQ, where the second rack featuring Vera CPUs was delivered. And lastly, SpaceXAI, where Elon Musk himself was handed Vera at the Palo Alto office.
All three Vera racks were delivered on Friday. On Monday, Ian made a visit to Oracle's AI Customer Excellence Center, where he delivered the last Vera CPU rack.
These deliveries are just a tiny portion of what NVIDIA will be shipping in the coming quarters. Agentic AI firms are showing massive interest in CPUs these days, and Vera, being built for it, is driving crazy traction. Just last week, we reported that Vera CPUs were going to be adopted by CoreWeave, Meta, and Alibaba as early buyers. Oracle was also mentioned, and as you can see, they have already received the first chips.
What To Expect From NVIDIA's Vera CPU, The Successor To Grace
According to NVIDIA, the Vera CPU offers extremely high single-threaded core performance, incredibly high data output, and extreme levels of energy efficiency. Vera is the world's first and only data center CPU to utilize LPDDR5 memory and offers unrivaled performance per watt. NVIDIA is not just integrating Vera CPUs into its Vera Rubin platform; these will also be shipped standalone, & the company expects this to open another multi-billion-dollar business front for it.
For the Vera CPU, NVIDIA has designed its next-gen custom Arm architecture codenamed Olympus, and the chip packs 88 cores, 176 threads (with NVIDIA Spatial Multi-Threading), 1.8 TB/s NVLink-C2C coherent memory interconnect, 1.5 TB of system memory (3x Grace), 1.2 TB/s of memory bandwidth with SOCAMM LPDDR5X, and Rack-scale confidential compute. These combine to offer 2x data processing, compression & CI/CD performance versus Grace.
With Vera, we will also see demand for LPDDR5X DRAM swell as the platform features support for up to 1.5 TB of memory, and given the amount of Vera CPUs that will be required to meet compute demands, the supply chain is going to see increased constraints. NVIDIA's Vera Rubin platform is set for launch in the second half of this week, with mass production of the first racks commencing real soon. Vera will be used in both standalone LPX servers and as a host processor in the Vera Rubin NVL72 racks.
With Vera now entering the full production phase, and Rubin all set for launch in the coming months, the extreme co-design platform that NVIDIA has come up with is going to propel agentic AI workflows to the next level.
Follow Wccftech on Google to get more of our news coverage in your feeds.
