AMD Instinct MI400 Spotted In Latest Patches, Will Feature Up To 8 Chiplets On Dual Interposer Dies

Sarfraz Khan
AMD's Next-Gen Instinct MI400 Accelerator Doubles The Compute To 40 PFLOPs, Equipped With 432 GB HBM4 Memory at 19.6 TB/s & Launches In 2026

AMD's next-gen Instinct MI400 accelerators will bring separate interposer dies for up to 8 chiplets, as spotted in the latest patches.

AMD Instinct MI400 Accelerators Will Utilize the CDNA Next Architecture and Will Have Eight XCDs and Dedicated Multimedia I/O Tiles

AMD's next-gen Instinct MI400 accelerator appears set for significant design changes, reportedly including new types of tiles. While AMD is still working on the MI350 for release this year, details are already emerging on the Instinct MI400, which is set to launch in 2026.


AMD has yet to officially detail the MI400's specs and design, but the latest patches reveal some design aspects and offer a sneak peek at what to expect from the upcoming accelerators.

As spotted by Coelacanth Dream, the new patches posted on FreeDesktop.org show that the MI400 will feature up to four XCDs (Accelerator Complex Dies) per AID (Active Interposer Die), up from two XCDs per AID on the MI300. The MI400 will carry two AIDs, for a total of up to eight XCDs, and this time there will be separate Multimedia and I/O dies as well.
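The chiplet arithmetic can be sketched in a few lines of Python. This is only an illustration: it reads the patches as four XCDs on each of two AIDs, which matches the eight chiplets in the headline, and the four-AID figure for the MI300 is inferred from its eight XCDs at two per AID rather than stated in the patches. The `Package` class and its field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Package:
    """Hypothetical model of an accelerator package built from AIDs and XCDs."""
    name: str
    aids: int          # Active Interposer Dies on the package
    xcds_per_aid: int  # compute chiplets stacked on each AID

    @property
    def total_xcds(self) -> int:
        # Total compute chiplets = interposer dies * chiplets per interposer die
        return self.aids * self.xcds_per_aid


# Assumption: MI300 uses 4 AIDs (inferred from 8 XCDs at 2 per AID).
mi300 = Package("MI300", aids=4, xcds_per_aid=2)
# Assumption: MI400 uses 2 AIDs with up to 4 XCDs each, per the reported patches.
mi400 = Package("MI400", aids=2, xcds_per_aid=4)

for pkg in (mi300, mi400):
    print(f"{pkg.name}: {pkg.aids} AIDs x {pkg.xcds_per_aid} XCDs = {pkg.total_xcds} XCDs")
```

Under these assumptions both generations end up with eight XCDs; what changes is how they are grouped, with fewer, larger interposer dies on the MI400.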


For each AID, there will be a dedicated MID (Multimedia I/O) tile, which should make communication between the compute units and the I/O interfaces more efficient than in previous generations. Even on the MI350, AMD uses Infinity Fabric for inter-die communication, so this is a big change for the MI400 accelerators, which are aimed at large-scale AI training and inference tasks. They are going to be based on the CDNA-Next architecture, which is probably going to be rebranded to UDNA as part of the red team's strategy to unify the RDNA and CDNA architectures.


Meanwhile, AMD is going to release the MI350 accelerators based on the CDNA 4 architecture this year, which already bring huge improvements over the MI300 and its predecessors. They move to the advanced 3nm process node, improve energy efficiency, and will offer up to a 35-fold increase in AI inference performance compared to the MI300. The details of the MI400's uplifts are still unknown, and we await AMD's announcement of its specs and features.

AMD Instinct AI Accelerators:

| Accelerator Name | AMD Instinct MI500 | AMD Instinct MI400 | AMD Instinct MI350X | AMD Instinct MI325X | AMD Instinct MI300X | AMD Instinct MI250X |
| --- | --- | --- | --- | --- | --- | --- |
| GPU Architecture | CDNA 6 | CDNA 5 | CDNA 4 | Aqua Vanjaram (CDNA 3) | Aqua Vanjaram (CDNA 3) | Aldebaran (CDNA 2) |
| GPU Process Node | 2nm | 2nm+3nm | 3nm | 5nm+6nm | 5nm+6nm | 6nm |
| XCDs (Chiplets) | TBD | 8 (MCM) | 8 (MCM) | 8 (MCM) | 8 (MCM) | 2 (MCM), 1 (Per Die) |
| GPU Cores | TBD | TBD | 16,384 | 19,456 | 19,456 | 14,080 |
| GPU Clock Speed (Max) | TBD | TBD | 2400 MHz | 2100 MHz | 2100 MHz | 1700 MHz |
| INT8 Compute | TBD | TBD | 5200 TOPS | 2614 TOPS | 2614 TOPS | 383 TOPS |
| FP6/FP4 Matrix | TBD | 40 PFLOPs | 20 PFLOPs | N/A | N/A | N/A |
| FP8 Matrix | TBD | 20 PFLOPs | 5 PFLOPs | 2.6 PFLOPs | 2.6 PFLOPs | N/A |
| FP16 Matrix | TBD | 10 PFLOPs | 2.5 PFLOPs | 1.3 PFLOPs | 1.3 PFLOPs | 383 TFLOPs |
| FP32 Vector | TBD | TBD | 157.3 TFLOPs | 163.4 TFLOPs | 163.4 TFLOPs | 95.7 TFLOPs |
| FP64 Vector | TBD | TBD | 78.6 TFLOPs | 81.7 TFLOPs | 81.7 TFLOPs | 47.9 TFLOPs |
| VRAM | HBM4E | 432 GB HBM4 | 288 GB HBM3e | 256 GB HBM3e | 192 GB HBM3 | 128 GB HBM2e |
| Infinity Cache | TBD | TBD | 256 MB | 256 MB | 256 MB | N/A |
| Memory Clock | TBD | TBD | 8.0 Gbps | 5.9 Gbps | 5.2 Gbps | 3.2 Gbps |
| Memory Bus | TBD | TBD | 8192-bit | 8192-bit | 8192-bit | 8192-bit |
| Memory Bandwidth | TBD | 19.6 TB/s | 8 TB/s | 6.0 TB/s | 5.3 TB/s | 3.2 TB/s |
| Form Factor | TBD | TBD | OAM | OAM | OAM | OAM |
| Cooling | TBD | Passive / Liquid | Passive / Liquid | Passive Cooling | Passive Cooling | Passive Cooling |
| TDP (Max) | TBD | TBD | 1400W (355X) | 1000W | 750W | 560W |
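
The memory rows in the table are related by simple arithmetic, and a short Python sketch can sanity-check them. It assumes the "Memory Clock" row is the per-pin data rate in Gbps and uses the standard relation peak bandwidth = bus width x per-pin rate / 8. The function name and the `LISTED` dictionary are illustrative, and only the generations with both figures listed are included.

```python
def peak_bandwidth_tbps(bus_width_bits: int, pin_rate_gbps: float) -> float:
    """Theoretical peak memory bandwidth in TB/s (decimal units).

    bits * Gb/s per pin gives Gb/s; divide by 8 for GB/s, then by 1000 for TB/s.
    """
    return bus_width_bits * pin_rate_gbps / 8 / 1000


# (bus width in bits, per-pin rate in Gbps, bandwidth listed in the table in TB/s)
LISTED = {
    "MI350X": (8192, 8.0, 8.0),
    "MI325X": (8192, 5.9, 6.0),
    "MI300X": (8192, 5.2, 5.3),
    "MI250X": (8192, 3.2, 3.2),
}

for name, (bus, rate, listed) in LISTED.items():
    computed = peak_bandwidth_tbps(bus, rate)
    print(f"{name}: computed {computed:.2f} TB/s vs listed {listed} TB/s")
```

The computed values land within about 0.2 TB/s of the listed figures, so the small gaps look like rounding in the quoted numbers rather than a different bus configuration.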

News Source: Videocardz


About the author: Sarfraz Khan is a hardware reporter with a focus on PC components and the builder community. With years of experience writing about PC hardware and laptops, his work has been featured on several reputable technology publications. Sarfraz's hands-on experience is demonstrated through his first-person accounts of using and comparing different hardware configurations, providing practical and relatable insights for everyday users. His technical analysis is respected by peers in the enthusiast community and has been cited by specialized hardware sites such as Germany's Igor's Lab.
