⋮  

AMD Aldebaran Rumored To Be Arcturus’s Successor, Could Be Featured on Instinct MI200 GPU Accelerator

Submit

It looks like AMD is accelerating the production of its next-generation Instinct accelerator, the MI200, which is expected to feature an MCM GPU design. According to the latest information dump, not only is the codename for the GPU unveiled but also a range of new specifications.

AMD Instinct MI200 Accelerator Rumored To Be Codenamed Aldebaran, Will Succeed Arcturus With MCM GPU Design & HBM2E Memory

The AMD Instinct family, starting in 2020, is all CDNA architecture-based. The first generation CDNA flagship, the Instinct MI100, was internally codenamed Arcturus. It was a follow-up to Vega and the GPUs are named after giant stars. The successor to the Instinct MI100, the MI200, is also seemingly going to be named after a huge star and this time, it is expected to be known as Aldebaran.

AMD RDNA 3 Flagship, Navi 31 GPU Rumored To Offer Up To 384-bit Bus & 24 GB GDDR6 Memory, Navi 32 Gets 256-bit For Up To 16 GB Memory

In the latest Linux patch support (via Phoronix), the AMD Instinct MI200 could be known as Aldebaran which is a giant star located within the constellation of Taurus and has a solar radius of 44.13 or 75% more than Arcturus. The naming convention seems to suggest that Aldebaran will be twice as powerful as Arcturus since the numbers in the MI accelerator's naming convention represent the theoretical Flops performance. This is just speculation at this point but given that the accelerator is expected to feature an MCM GPU design, it might be real.

The patches also reveal that the AMD Instinct MI200 'Aldebaran' GPU will feature HBM2E memory support. The brand new memory standard was first used by NVIDIA's Ampere GA100 GPUs & will offer a nice boost over the standard HBM2 configuration used on the Arcturus-based MI100 GPU accelerator. HBM2E allows up to 16 GB memory capacity per stack so we can expect up to 64 GB HBM2E memory at blisteringly fast speeds for Aldebaran.

Other features listed include SDMA (System Direct Memory Access) support which will allow data transfers over PCIe and XGMI/Infinity Cache subsystems. It looks like AMD will incorporate its new Infinity Cache design on upcoming Instinct accelerators too so we are looking for a very advanced version of the Vega GPU.

AMD & Qualcomm Join Forces To Tackle Intel’s vPRO With Faster WIFI & Enhanced FastConnect Solution on Ryzen CPUs

ARCTURUS ALDEBARAN
 .asic_family = CHIP_ARCTURUS,
.asic_name = “arcturus”,
.max_pasid_bits = 16,
.max_no_of_hqd = 24,
.doorbell_size = 8,
.ih_ring_entry_size = 8 * sizeof(uint32_t),
.event_interrupt_class = &event_interrupt_class_v9,
.num_of_watch_points = 4,
.mqd_size_aligned = MQD_SIZE_ALIGNED,
.supports_cwsr = true,
.needs_iommu_device = false,
.needs_pci_atomics = false,
.num_sdma_engines = 2,
.num_xgmi_sdma_engines = 6,
.num_sdma_queues_per_engine = 8,
.asic_family = CHIP_ALDEBARAN,
.asic_name = “aldebaran”,
.max_pasid_bits = 16,
.max_no_of_hqd = 24,
.doorbell_size = 8,
.ih_ring_entry_size = 8 * sizeof(uint32_t),
.event_interrupt_class = &event_interrupt_class_v9,
.num_of_watch_points = 4,
.mqd_size_aligned = MQD_SIZE_ALIGNED,
.supports_cwsr = true,
.needs_iommu_device = false,
.needs_pci_atomics = false,
.num_sdma_engines = 2,
.num_xgmi_sdma_engines = 3,
.num_sdma_queues_per_engine = 8,

There's also a hint at the MCM GPU design again for the AMD Instinct MI200 'Aldebaran GPU'. The patch states a new mode known as Performance Determinism in which the PMFW will maintain sustained performance level and can be enabled on a per-die basis. This would allow each GPU die to run this feature but a max graphics frequency needs to be specified so they don't exceed the power caps.

Do note that AMD's CDNA 2 GPU will be fabricated on a brand new process node & are confirmed to feature a 3rd Generation AMD Infinity architecture that extends to Exascale by allowing up to 8-Way coherent GPU connectivity.

AMD Radeon Instinct Accelerators 2020

Accelerator NameAMD Instinct MI300AMD Instinct MI250XAMD Instinct MI250AMD Instinct MI210AMD Instinct MI100AMD Radeon Instinct MI60AMD Radeon Instinct MI50AMD Radeon Instinct MI25AMD Radeon Instinct MI8AMD Radeon Instinct MI6
CPU ArchitectureZen 4 (Exascale APU)N/AN/AN/AN/AN/AN/AN/AN/AN/A
GPU ArchitectureTBA (CDNA 3)Aldebaran (CDNA 2)Aldebaran (CDNA 2)Aldebaran (CDNA 2)Arcturus (CDNA 1)Vega 20Vega 20Vega 10Fiji XTPolaris 10
GPU Process Node5nm+6nm6nm6nm6nm7nm FinFET7nm FinFET7nm FinFET14nm FinFET28nm14nm FinFET
GPU Chiplets4 (MCM / 3D Stacked)
1 (Per Die)
2 (MCM)
1 (Per Die)
2 (MCM)
1 (Per Die)
2 (MCM)
1 (Per Die)
1 (Monolithic)1 (Monolithic)1 (Monolithic)1 (Monolithic)1 (Monolithic)1 (Monolithic)
GPU Cores28,160?14,08013,3126656768040963840409640962304
GPU Clock SpeedTBA1700 MHz1700 MHz1700 MHz1500 MHz1800 MHz1725 MHz1500 MHz1000 MHz1237 MHz
FP16 ComputeTBA383 TOPs362 TOPs181 TOPs185 TFLOPs29.5 TFLOPs26.5 TFLOPs24.6 TFLOPs8.2 TFLOPs5.7 TFLOPs
FP32 ComputeTBA95.7 TFLOPs90.5 TFLOPs45.3 TFLOPs23.1 TFLOPs14.7 TFLOPs13.3 TFLOPs12.3 TFLOPs8.2 TFLOPs5.7 TFLOPs
FP64 ComputeTBA47.9 TFLOPs45.3 TFLOPs22.6 TFLOPs11.5 TFLOPs7.4 TFLOPs6.6 TFLOPs768 GFLOPs512 GFLOPs384 GFLOPs
VRAM192 GB HBM3?128 GB HBM2e128 GB HBM2e64 GB HBM2e32 GB HBM232 GB HBM216 GB HBM216 GB HBM24 GB HBM116 GB GDDR5
Memory ClockTBA3.2 Gbps3.2 Gbps3.2 Gbps1200 MHz1000 MHz1000 MHz945 MHz500 MHz1750 MHz
Memory Bus8192-bit8192-bit8192-bit4096-bit4096-bit bus4096-bit bus4096-bit bus2048-bit bus4096-bit bus256-bit bus
Memory BandwidthTBA3.2 TB/s3.2 TB/s1.6 TB/s1.23 TB/s1 TB/s1 TB/s484 GB/s512 GB/s224 GB/s
Form FactorOAMOAMOAMDual Slot CardDual Slot, Full LengthDual Slot, Full LengthDual Slot, Full LengthDual Slot, Full LengthDual Slot, Half LengthSingle Slot, Full Length
CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive CoolingPassive Cooling
TDP~600W560W500W300W300W300W300W300W175W150W

News Sources: Videocardz , Komachi , Coelacanth’s Dream

Submit