ARM Takes Matters Into Its Own Hands, Unveiling the ‘AGI CPU’ as Its First-Ever Silicon for Agentic AI

Mar 24, 2026 at 01:19pm EDT
A presenter holds a device on stage in front of a slide displaying 'arm AGI CPU' and 'World's most efficient agentic CPU.'

ARM made a massive announcement at its ARM Everywhere keynote: according to a new blog post, the firm will sell its own 'AGI CPU' for the first time.

ARM's Pivot to Direct Silicon Sales Ends NVIDIA's Dominance Over High-End Server Core IP

With agentic AI workloads, CPU has started to become the next bottleneck for hyperscalers, which is why we have seen x86 solutions from Intel/AMD and ARM-based chips from NVIDIA gaining massive adoption among customers. In light of this, ARM has decided to capitalize on the hype by introducing its first-ever chip, called the ARM AGI CPU, marking a shift from an IP provider to an end-to-end silicon manufacturer. According to ARM's CEO, Rene Haas, this move is targeted at fulfilling enterprise demand driven by agentic AI workloads.

Related Story MSI Pushes NVIDIA’s RTX Spark Into The Mainstream With A Developer Mini PC And A Tandem OLED Flip Laptop

Today marks the next phase of the Arm compute platform and a defining moment for our company. With the expansion into delivering production silicon with our Arm AGI CPU, we are giving partners more choices all built on Arm’s foundation of high-performance, power-efficient computing, to support agentic AI infrastructure at global scale.

Diving into the specifics of the AGI CPU, we are looking at up to 136 Arm Neoverse V3 cores per CPU, offering 6GB/s memory bandwidth per core. The processor has 2 MB of L2 cache per core and runs at up to 3.7 GHz. As far as I/O specifications are concerned, you are looking at 96x PCIe Gen 6 lanes, along with CXL 3.0 memory expansion, allowing the processor to support "massively parallel, high-performance agentic workloads". Here's a complete rundown of the AGI CPU, based on the details disclosed:

CategorySpecification
Core ArchitectureArm Neoverse V3
Core CountUp to 136 cores
Clock FrequencyUp to 3.7GHz
CacheDedicated 2MB L2 cache per core
Manufacturing Process3nm
Thermal Design Power (TDP)300 Watts
Packaging / LayoutDual chiplet design (Memory and I/O on the same die)
Memory SupportedUp to DDR5-8800
Memory CapacityUp to 6TB per chip
Memory Performance6GB/s memory bandwidth per core; Sub-100ns memory latency
I/O & Connectivity96x lanes PCIe Gen6
Expansion & InterconnectsCXL 3.0 (for memory expansion) and AMBA CHI extension links

In terms of rack-scale deployment, ARM offers ultra-thin 1OU (Open Unit) nodes, which are basically a shift away from multi-unit servers. A single chassis can host up to two nodes, providing a total of 272 cores per blade. The physical rack layout can house up to 30 of these open unit nodes, delivering a total of 8160 cores. You are also looking at unified memory pools connected via the CXL 3.0 fabric, and each rack is rated to run at 36kW with air cooling. Given how prevalent CPU-only racks have become, ARM has designed its solution to meet market demand.

ARM says its AGI CPU delivers "two times higher" performance per rack compared to modern x86 solutions. While it doesn't compare its solution to Vera, we expect it to be closer to it, given similar microarchitectures. The AGI CPU also allows vendors to mix-and-match their rack-scale configurations, since ARM has opened up support for any accelerator (Cerebras, Groq, Meta MTIA) that fits into standard OCP server designs. This essentially means that the ARM IP benefits NVIDIA previously used exclusively have likely now ended.

It would be interesting to see how ARM's venture into the standalone CPU segment evolves going forward, since the firm has been discussing the prospects of its own chips for quite some time now. At the same time, NVIDIA, a core customer of ARM's IP technology, now has a formidable competitor to Vera.

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.