AMD Confirms Radeon Instinct MI100 ‘Arcturus’ Discrete GPU Accelerator With CDNA Architecture In 2H 2020

During Dell's EMC presentation, AMD's CTO, Mark Papermaster, confirmed that the company will introduce the next-generation CDNA architecture-based Radeon Instinct MI100 accelerator during the second half of 2020.

AMD's Radeon Instinct MI100 CDNA Architecture Based Discrete GPU Accelerator Arriving in 2H 2020

The AMD Radeon Instinct MI100, internally referred to as 'Arcturus', will be a next-gen HPC part featuring an enhanced version of the 7nm Vega architecture. AMD had never officially mentioned the accelerator until now. The GPU appears to be the top HPC part for 2020 in AMD's first-generation CDNA portfolio. Papermaster confirmed that the discrete GPU will be introduced in the second half of 2020.

The following is Papermaster's quote from the Q&A session:

Like our multi-generational commitment to the Zen roadmap in x86 CPU, we have done the same with our DNA architectures for GPU - rDNA for gaming and visualization, and cDNA for compute & AI. The rDNA is driving gain in AMD share for graphics and deployed in the upcoming Sony and Microsoft new game consoles, and for cDNA you will see the MI100 discrete GPU both 2nd half of 2020.

The ROCm software stack creates an alternative for GPU compute with easy portability and enabling competition. - AMD CTO, Mark Papermaster

Based on what we have learned from various prototype leaks, the Radeon Instinct MI100 'Arcturus' GPU will come in several variants. The flagship is the D34303 SKU, which uses the XL variant of the GPU. This information comes from a test board, so the final specifications will likely differ, but here are the key points:

  • Based on Arcturus XL GPU
  • Test Board has a TDP of 200W
  • Up To 32 GB HBM2 Memory
  • HBM2 Memory Clocks Reported Between 1000-1200 MHz

The Radeon Instinct MI100 test board has a TDP of 200W and is based on the XL variant of AMD's Arcturus GPU. The card also features 32 GB of HBM2 memory with memory clocks of 1.0 - 1.2 GHz. The MI60, in comparison, has 64 CUs and a TDP of 300W, with a reported base clock of 1200 MHz and memory running at 1.0 GHz across a 4096-bit bus interface for 1 TB/s of bandwidth. There is a good chance that the final Arcturus design will use Samsung's latest HBM2E 'Flashbolt' memory, which offers 3.2 Gbps pin speeds for up to 1.5 TB/s of bandwidth.
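The bandwidth figures quoted above follow from simple arithmetic, which the short Python sketch below reproduces. It assumes HBM2 transfers data twice per memory clock (double data rate), so a 1000 MHz clock corresponds to 2.0 Gbps per pin. The function name and parameters are our own illustrative choices, not anything published by AMD.

```python
def hbm_bandwidth_gbs(memory_clock_mhz: float,
                      bus_width_bits: int,
                      transfers_per_clock: int = 2) -> float:
    """Peak memory bandwidth in GB/s.

    Assumption: the memory is double data rate (2 transfers per clock),
    which is how HBM2 clocks are normally converted to per-pin rates.
    """
    # Per-pin data rate in Gbps, e.g. 1000 MHz * 2 = 2.0 Gbps
    pin_rate_gbps = memory_clock_mhz * transfers_per_clock / 1000.0
    # All pins together, converted from gigabits to gigabytes
    return pin_rate_gbps * bus_width_bits / 8.0


if __name__ == "__main__":
    # MI60-style configuration: 1000 MHz on a 4096-bit bus
    print(f"{hbm_bandwidth_gbs(1000, 4096):.1f} GB/s")  # 1024.0 GB/s
    # Upper end of the reported MI100 test-board clocks: 1200 MHz
    print(f"{hbm_bandwidth_gbs(1200, 4096):.1f} GB/s")  # 1228.8 GB/s
```

The first result is the roughly 1 TB/s quoted for the MI60, and the second shows what the upper end of the reported 1000-1200 MHz range would yield on the same 4096-bit bus.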

AMD Radeon Instinct Accelerators 2020

| Accelerator Name | AMD Instinct MI300 | AMD Instinct MI250X | AMD Instinct MI250 | AMD Instinct MI210 | AMD Instinct MI100 | AMD Radeon Instinct MI60 | AMD Radeon Instinct MI50 | AMD Radeon Instinct MI25 | AMD Radeon Instinct MI8 | AMD Radeon Instinct MI6 |
|---|---|---|---|---|---|---|---|---|---|---|
| CPU Architecture | Zen 4 (Exascale APU) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| GPU Architecture | TBA (CDNA 3) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Arcturus (CDNA 1) | Vega 20 | Vega 20 | Vega 10 | Fiji XT | Polaris 10 |
| GPU Process Node | 5nm+6nm | 6nm | 6nm | 6nm | 7nm FinFET | 7nm FinFET | 7nm FinFET | 14nm FinFET | 28nm | 14nm FinFET |
| GPU Chiplets | 4 (MCM / 3D Stacked), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) |
| GPU Cores | 28,160? | 14,080 | 13,312 | 6656 | 7680 | 4096 | 3840 | 4096 | 4096 | 2304 |
| GPU Clock Speed | TBA | 1700 MHz | 1700 MHz | 1700 MHz | 1500 MHz | 1800 MHz | 1725 MHz | 1500 MHz | 1000 MHz | 1237 MHz |
| FP16 Compute | TBA | 383 TOPs | 362 TOPs | 181 TOPs | 185 TFLOPs | 29.5 TFLOPs | 26.5 TFLOPs | 24.6 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP32 Compute | TBA | 95.7 TFLOPs | 90.5 TFLOPs | 45.3 TFLOPs | 23.1 TFLOPs | 14.7 TFLOPs | 13.3 TFLOPs | 12.3 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP64 Compute | TBA | 47.9 TFLOPs | 45.3 TFLOPs | 22.6 TFLOPs | 11.5 TFLOPs | 7.4 TFLOPs | 6.6 TFLOPs | 768 GFLOPs | 512 GFLOPs | 384 GFLOPs |
| VRAM | 192 GB HBM3? | 128 GB HBM2e | 128 GB HBM2e | 64 GB HBM2e | 32 GB HBM2 | 32 GB HBM2 | 16 GB HBM2 | 16 GB HBM2 | 4 GB HBM1 | 16 GB GDDR5 |
| Memory Clock | TBA | 3.2 Gbps | 3.2 Gbps | 3.2 Gbps | 1200 MHz | 1000 MHz | 1000 MHz | 945 MHz | 500 MHz | 1750 MHz |
| Memory Bus | 8192-bit | 8192-bit | 8192-bit | 4096-bit | 4096-bit | 4096-bit | 4096-bit | 2048-bit | 4096-bit | 256-bit |
| Memory Bandwidth | TBA | 3.2 TB/s | 3.2 TB/s | 1.6 TB/s | 1.23 TB/s | 1 TB/s | 1 TB/s | 484 GB/s | 512 GB/s | 224 GB/s |
| Form Factor | OAM | OAM | OAM | Dual Slot Card | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Half Length | Single Slot, Full Length |
| Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling |
| TDP | ~600W | 560W | 500W | 300W | 300W | 300W | 300W | 300W | 175W | 150W |
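
The FP32 figures listed for the older monolithic parts in the table above can be roughly cross-checked from the core counts and clock speeds. The sketch below assumes each stream processor retires one fused multiply-add (counted as 2 FLOPs) per clock, the usual convention for GCN-based GPUs. That assumption is ours and does not come from the article, and the function name is purely illustrative. The CDNA 2 entries are left out because their listed FP32 numbers do not follow this simple rule.

```python
def peak_fp32_tflops(stream_processors: int,
                     clock_mhz: float,
                     flops_per_clock: int = 2) -> float:
    """Theoretical peak FP32 throughput in TFLOPs.

    Assumption: one FMA (2 FLOPs) per stream processor per clock.
    """
    return stream_processors * clock_mhz * 1e6 * flops_per_clock / 1e12


if __name__ == "__main__":
    # (name, cores, clock in MHz) taken from the table rows above
    parts = [
        ("MI60", 4096, 1800),
        ("MI25", 4096, 1500),
        ("MI8", 4096, 1000),
        ("MI6", 2304, 1237),
    ]
    for name, cores, clock in parts:
        print(f"{name}: {peak_fp32_tflops(cores, clock):.1f} TFLOPs")
```

Under this assumption the results land on the table's 14.7, 12.3, 8.2 and 5.7 TFLOPs. Applying the same rule to the MI100's 7680 cores at 1500 MHz gives about 23.0 TFLOPs, slightly below the listed 23.1 TFLOPs, presumably because the listed clock is rounded.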

So far, AMD has revealed that the main focus of CDNA will be performance, efficiency, features, and scalability in the data center market. Until now, AMD's GCN architecture has served this segment, but with CDNA, AMD will be creating GPUs specifically optimized for high-performance compute, machine learning, and HPC. The 1st Gen CDNA GPUs will feature the 2nd Gen Infinity Architecture and will use ROCm (Radeon Open Compute Platform) to power the data center with key optimizations and enhanced scalability. The 2nd Gen Infinity Architecture will allow for 4-8 way GPU connectivity in a single node, allowing the new Radeon Instinct boards to work together.

AMD has proved that it can offer more FLOPs at a competitive price, so that may be the segment Arcturus is targeting. With 2H 2020 now set for the Radeon Instinct MI100, we will soon know how AMD's latest CDNA architecture competes against NVIDIA's Ampere-based A100 GPU accelerator in the HPC segment.
