AMD’s Next-Gen Instinct MI400 Accelerator Doubles The Compute To 40 PFLOPs, Equipped With 432 GB HBM4 Memory at 19.6 TB/s & Launches In 2026

Jun 12, 2025 at 02:11pm EDT
AMD's Next-Gen Instinct MI400 Accelerator Doubles The Compute To 40 PFLOPs, Equipped With 432 GB HBM4 Memory at 19.6 TB/s & Launches In 2026 1

In addition to its MI350 series, AMD is also giving us a glimpse of what to expect from its next-gen Instinct MI400 series, which launches in 2026.

AMD Instinct MI400 Features 2x More AI Compute Than MI350 Series, 50% More Memory, Almost 2.5x Bandwidth Increase With HBM4, 10x Faster Vs MI350

With new details shared for the Instinct MI400 accelerator, it looks like AMD is once again going to go big on the hardware side, essentially doubling the compute capability. The official metrics now list the MI400 as a 40 PFLOP (FP4) & 20 PFLOP (FP8) product, which doubles the compute capability of the MI350 series, which launched today.

Related Story AMD Says EPYC Turin Already Crushes NVIDIA Vera by 2.37x in Agentic AI, With Zen 6 Venice Pushing the Lead Past 3.3x

In addition to the compute capability, AMD is also going to leverage HBM4 memory for its Instinct MI400 series. The new chip will offer a 50% memory capacity uplift from 288GB HBM3e to 432GB HBM4. The HBM4 standard will offer a massive 19.6 TB/s bandwidth, more than double that of the 8 TB/s for the MI350 series. The GPU will also feature a 300 GB/s scale-out bandwidth/per GPU, so some big things are coming in the next generation of Instinct.

As per previous details, the Instinct MI400 accelerator will feature up to four XCDs (Accelerated Compute Dies), increasing the count from two XCDs per AID on the MI300. That said, there will be two AIDs (Active Interposer Dies) on the MI400 accelerator, and this time, there will be separate Multimedia and I/O dies as well.

Image Source: FreeDesktop.org

For each AID, there will be a dedicated MID tile, and this will offer efficient communication between the compute units and the I/O interfaces compared to what we had in previous generations. Even on the MI350, AMD uses Infinity Fabric for inter-die communication.

So, it's a big change to the MI400 accelerators, which are aimed at large-scale AI training and inference tasks and are going to be based on the CDNA-Next architecture, which is probably going to be rebranded to UDNA as part of the red team's unification strategy of the RDNA and CDNA architectures.

AMD Instinct AI Accelerators:

Accelerator NameAMD Instinct MI500AMD Instinct MI400AMD Instinct MI350XAMD Instinct MI325XAMD Instinct MI300XAMD Instinct MI250X
GPU ArchitectureCDNA 6CDNA 5CDNA 4Aqua Vanjaram (CDNA 3)Aqua Vanjaram (CDNA 3)Aldebaran (CDNA 2)
GPU Process Node2nm2nm+3nm3nm5nm+6nm5nm+6nm6nm
XCDs (Chiplets)TBD8 (MCM)8 (MCM)8 (MCM)8 (MCM)2 (MCM)
1 (Per Die)
GPU CoresTBDTBD16,38419,45619,45614,080
GPU Clock Speed (Max)TBDTBD2400 MHz2100 MHz2100 MHz1700 MHz
INT8 ComputeTBDTBD5200 TOPS2614 TOPS2614 TOPS383 TOPs
FP6/FP4 MatrixTBD40 PFLOPs20 PFLOPsN/AN/AN/A
FP8 MatrixTBD20 PFLOPs5 PFLOPs2.6 PFLOPs2.6 PFLOPsN/A
FP16 MatrixTBD10 PFLOPs2.5 PFLOPs1.3 PFLOPs1.3 PFLOPs383 TFLOPs
FP32 VectorTBDTBD157.3 TFLOPs163.4 TFLOPs163.4 TFLOPs95.7 TFLOPs
FP64 VectorTBDTBD78.6 TFLOPs81.7 TFLOPs81.7 TFLOPs47.9 TFLOPs
VRAMHBM4E432 GB HBM4288 GB HBM3e256 GB HBM3e192 GB HBM3128 GB HBM2e
Infinity CacheTBDTBD256 MB256 MB256 MBN/A
Memory ClockTBD19.6 TB/s8.0 Gbps5.9 Gbps5.2 Gbps3.2 Gbps
Memory BusTBDTBD8192-bit8192-bit8192-bit8192-bit
Memory BandwidthTBDTBD8 TB/s6.0 TB/s5.3 TB/s3.2 TB/s
Form FactorTBDTBDOAMOAMOAMOAM
CoolingTBDPassive / LiquidPassive / LiquidPassive CoolingPassive CoolingPassive Cooling
TDP (Max)TBDTBD1400W (355X)1000W750W560W

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.