
AMD CDNA Architecture Based Arcturus GPU ‘Radeon Instinct’ Test Board Spotted – 120 CUs With 7680 Cores, 1200 MHz HBM2 Clock, 878 MHz GPU Clock

Apr 21, 2020

AMD's Radeon Instinct Arcturus GPU, which will feature the CDNA architecture and is aimed at the server market, has been spotted by Rogame. Featured inside the next-generation Radeon Instinct graphics cards, the CDNA architecture will leverage its compute-optimized GPU design to deliver the highest-performance compute capabilities for data centers.

AMD's CDNA Architecture Based Arcturus GPU Test Board Leaks Out - Next-Gen Radeon Instinct With 120 CUs For A Total of 7680 Cores

The AMD Arcturus GPU first leaked back in 2018, before AMD had introduced any 7nm GPU. The Radeon VII and the Navi lineup launched in 2019 with 7nm GPUs, with Navi aimed at the mass consumer market.


It was later revealed that AMD's next-generation HPC and AI GPUs would be designed separately from the consumer chips. This meant that the Arcturus GPU would be kept exclusive to the data center market. AMD recently confirmed in its Radeon CDNA architecture roadmap that all CDNA-based GPUs will be designed exclusively for the HPC and data center markets, while Radeon RDNA GPUs will power the consumer segment.

Coming to the specifications, it was previously revealed that AMD's Arcturus GPU would feature an increased cache and twice as many CUs as Vega. That, along with a list of data center-specific features such as XDLOPs, Rapid Packed Math, a new Vector ALU and BFloat16, is to be expected in the Radeon Instinct cards that feature the new CDNA architecture. The previous Radeon Instinct MI100 prototype board, 'D34303', featured the Arcturus-XL die with a rated TDP of 200W and 32 GB of HBM2 VRAM clocked at around 1000-1200 MHz.

The information for this part is based on a prototype, so the final specifications will likely differ, but here are the key points:


  • Based on Arcturus XL GPU
  • Test Board has a TDP of 200W
  • Up To 32 GB HBM2 Memory
  • HBM2 Memory Clocks Reported Between 1000-1200 MHz

Once again, Rogame has spotted a test board based on the Arcturus CDNA GPU. From the looks of it, this variant offers 120 CUs for a total of 7680 stream processors and a GPU clock speed of 878 MHz (750 MHz SOC clock). It also features an unspecified amount of HBM2 memory clocked at 1200 MHz, so if we are looking at a 4096-bit bus, we should get around 1.2 TB/s of bandwidth, which is what Aquabolt is able to offer. It is very likely, however, that both NVIDIA and AMD will end up using the faster HBM2E 'Flashbolt' standard, which goes into production this year and will be capable of delivering up to 1.8 TB/s of bandwidth.
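The bandwidth figure can be checked with some quick arithmetic. The sketch below is illustrative only: the function name is made up for this example, and it assumes HBM2 transfers data twice per memory clock (double data rate) across the full bus width. The 4096-bit bus is the article's own assumption, not a confirmed spec.

```python
def hbm2_bandwidth_gbs(memory_clock_mhz: float, bus_width_bits: int) -> float:
    """Peak theoretical bandwidth in GB/s (hypothetical helper).

    Assumes double data rate: two transfers per memory clock.
    """
    transfers_per_second = memory_clock_mhz * 1e6 * 2
    bytes_per_transfer = bus_width_bits / 8
    return transfers_per_second * bytes_per_transfer / 1e9


# Arcturus test board: 1200 MHz HBM2 on an assumed 4096-bit bus
test_board = hbm2_bandwidth_gbs(1200, 4096)

# Radeon Instinct MI60 for comparison: 1000 MHz HBM2, 4096-bit bus
mi60 = hbm2_bandwidth_gbs(1000, 4096)

print(f"Test board: {test_board:.1f} GB/s")  # 1228.8 GB/s, i.e. ~1.2 TB/s
print(f"MI60:       {mi60:.1f} GB/s")        # 1024.0 GB/s, i.e. ~1 TB/s
```

The same formula reproduces the MI25 entry in the table further down (945 MHz on a 2048-bit bus gives roughly 484 GB/s). It does not apply to the GDDR5-based MI6, which uses a different data rate per clock.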

As for clock speeds, the test board's 878 MHz is rather slow, as we have seen variants going up to 1334 MHz in the past. At that speed, the chip would deliver around 13.5 TFLOPs of FP32 compute (7680 cores x 2 FLOPs per clock x 878 MHz), which is lower than the Radeon Instinct MI60's 14.7 TFLOPs and also the 21 TFLOPs that we got on the previous prototype sample. It is likely that the first iteration of CDNA GPUs will end up somewhere around 25 TFLOPs of FP32.
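The FP32 estimate follows from the usual peak-throughput formula. The sketch below is a rough check rather than an official AMD calculation: the function name is hypothetical, the 64 stream processors per CU is inferred from the leak's 120 CUs and 7680 cores, and each stream processor is assumed to retire one fused multiply-add (2 FLOPs) per clock.

```python
def peak_fp32_tflops(compute_units: int, clock_mhz: float,
                     lanes_per_cu: int = 64,
                     flops_per_lane_per_clock: int = 2) -> float:
    """Peak theoretical FP32 throughput in TFLOPs (hypothetical helper).

    lanes_per_cu=64 is inferred from 120 CUs -> 7680 stream processors.
    flops_per_lane_per_clock=2 assumes one fused multiply-add per clock.
    """
    cores = compute_units * lanes_per_cu
    return cores * flops_per_lane_per_clock * clock_mhz * 1e6 / 1e12


# Arcturus test board: 120 CUs at 878 MHz
print(f"{peak_fp32_tflops(120, 878):.2f} TFLOPs")   # 13.49

# Radeon Instinct MI60: 64 CUs (4096 cores) at 1800 MHz
print(f"{peak_fp32_tflops(64, 1800):.2f} TFLOPs")   # 14.75
```

These are theoretical peaks at the listed clocks; sustained throughput on real workloads will be lower.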

Based on leaks of Ampere GPUs, which are also expected to be announced later this year, it looks like NVIDIA might hold the upper hand in compute performance, as they are speculated to hit almost 36 TFLOPs of FP32 and 18 TFLOPs of FP64 compute with the next-generation 7nm Tesla GPU lineup.

AMD Radeon Instinct Accelerators 2020

| Accelerator Name | AMD Radeon Instinct MI6 | AMD Radeon Instinct MI8 | AMD Radeon Instinct MI25 | AMD Radeon Instinct MI50 | AMD Radeon Instinct MI60 | AMD Radeon Instinct MI100 |
| --- | --- | --- | --- | --- | --- | --- |
| GPU Architecture | Polaris 10 | Fiji XT | Vega 10 | Vega 20 | Vega 20 | Arcturus |
| GPU Process Node | 14nm FinFET | 28nm | 14nm FinFET | 7nm FinFET | 7nm FinFET | 7nm FinFET |
| GPU Cores | 2304 | 4096 | 4096 | 3840 | 4096 | 8192? |
| GPU Clock Speed | 1237 MHz | 1000 MHz | 1500 MHz | 1725 MHz | 1800 MHz | 1334 MHz? |
| FP16 Compute | 5.7 TFLOPs | 8.2 TFLOPs | 24.6 TFLOPs | 26.5 TFLOPs | 29.5 TFLOPs | ~50 TFLOPs |
| FP32 Compute | 5.7 TFLOPs | 8.2 TFLOPs | 12.3 TFLOPs | 13.3 TFLOPs | 14.7 TFLOPs | ~25 TFLOPs |
| FP64 Compute | 384 GFLOPs | 512 GFLOPs | 768 GFLOPs | 6.6 TFLOPs | 7.4 TFLOPs | ~12.5 TFLOPs |
| Memory Clock | 1750 MHz | 500 MHz | 945 MHz | 1000 MHz | 1000 MHz | TBD |
| Memory Bus | 256-bit | 4096-bit | 2048-bit | 4096-bit | 4096-bit | 4096-bit |
| Memory Bandwidth | 224 GB/s | 512 GB/s | 484 GB/s | 1 TB/s | 1 TB/s | TBD |
| Form Factor | Single Slot, Full Length | Dual Slot, Half Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length |
| Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling? |

TDP (MI100): ~200W (Test Board)

So far, AMD has revealed that the main focus of CDNA will be performance, efficiency, features, and scalability in the data center market. Until now, AMD's GCN architecture has served this segment, but with CDNA, AMD will be creating GPUs specifically optimized for high-performance compute, machine learning, and HPC. The 1st Gen CDNA GPUs will feature the 2nd Gen Infinity Architecture and will use ROCm (Radeon Open Compute Platform) to power the data center with key optimizations and enhanced scalability. The 2nd Gen Infinity Architecture will allow for 4-8 way GPU connectivity in a single node, allowing the new Radeon Instinct boards to work together.

AMD has proved that it can offer more FLOPs at a competitive price, so that may be what Arcturus is targeting. There's no word on when Arcturus will land, but AMD has hinted at a Radeon Instinct product later this year that will feature its 1st Gen CDNA architecture.
