AMD Launches MI350P, Its First PCIe “Instinct” In Four Years – Packs CDNA 4 GPU With 4.6 PFLOPs AI Compute, 144 GB HBM3E at 600W

May 7, 2026 at 10:45am EDT
The image shows an AMD Instinct MI350P graphics card against a dark, abstract background.

AMD has announced its brand new Instinct MI350P PCIe GPU accelerator, which is the first PCIe design in years and is aimed at AI workloads.

The Instinct MI350P PCIe GPU Takes The MI350X Chips, Cuts It Into Half For 128 CUs, 144 GB HBM3E & 600W Power

With the Instinct MI350P PCIe GPU, AMD gives enterprise users an option to expand their AI computing capabilities without having to invest in expensive infrastructure. The PCIe design of the MI350P makes it an easy-to-use and drop-in solution that brings lots of performance in a standard dual-slot and server-focused design.

Related Story Intel’s Crescent Island PCB Leaks, Showing a Massive Xe3P GPU, 16-Pin Connector, 160GB LPDDR5X as Intel Sidesteps the HBM Shortage

Designed to help you prepare for the agentic AI era, AMD Instinct MI350P PCIe cards are dual-slot drop-in cards for standard air-cooled servers. They are built to deploy inference on premises within your current data center’s power, cooling, and rack infrastructure. AMD Instinct GPUs in cost-effective PCIe cards round out the AMD AI compute portfolio, providing a range of options for your enterprise as it navigates its unique AI adoption curve.

The following are some of the highlights of the Instinct MI350P PCIe GPU:

Looking at the specifications, the AMD Instinct MI350P features the CDNA 4 architecture, and is based on the same TSMC 3nm process technology in a 4 XCD configuration, half the amount of the MI350X. It also features a single IO die, which is based on TSMC's 6nm FinFET process. There are 128 compute units on the chip, which equal 8,192 Stream processors, and 512 Matrix cores. The cores are clocked at 2200 MHz at peak. The entire chip features 73 billion transistors.

For memory, the Instinct MI350P packs 128 MB of LLC in the form of Infinity Cache within the GPU, and 144 GB of fast HBM3E memory that operates across a 4096-bit wide bus, delivering 4 TB/s of memory. In comparison, the MI350X packs 288 GB of HBM3E memory across a 8192-bit bus interface. The PCIe card measures 10.5" (267mm) in length and features a passive-cooled design, which is ideal for servers. AMD also uses a 16-Pin connector to meet the 600W TBP of the card. It can also be configured down to 450W.

In terms of performance, the AMD Instinct MI350P offers:

As you can see, the AMD Instinct MI350 series, including the MI350P, offers native acceleration across various Enterprise AI precision formats such as MXFP6 and MXFP4.

The MI350P will be competing against the H200 NVL, which is NVIDIA's last PCIe-based GPU accelerator with 141 GB of HBM3E memory and running the Hopper H200 GPU. NVIDIA has released the RTX PRO 6000 Blackwell server edition, but that's based on the standard GB202 chip instead of the GB200, which is the true server option. The RTX PRO 6000 Blackwell packs 96 GB of GDDR7 memory. The H200 NVL GPUs cost anywhere around $30-$40K US.

The AMD Instinct MI350P PCIe GPUs are now available across various partners and offer a fully open ecosystem and Enterprise Ready AI software stack with ROCm support.

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.