AMD CDNA Architecture Based Arcturus GPU ‘Radeon Instinct’ Test Board Spotted – 120 CUs With 7680 Cores, 1200 MHz HBM2 Clock, 878 MHz GPU Clock

Hassan Mujtaba

AMD's Radeon Instinct Arcturus GPU, which will feature the CDNA architecture and target the server market, has been spotted by Rogame. Featured inside the next-generation Radeon Instinct graphics cards, the CDNA architecture will leverage its compute-optimized GPU design to deliver high-performance compute capabilities for data centers.

AMD's CDNA Architecture Based Arcturus GPU Test Board Leaks Out - Next-Gen Radeon Instinct With 120 CUs For A Total of 7680 Cores

The AMD Arcturus GPU first leaked all the way back in 2018, before AMD had introduced any 7nm GPU. The Radeon VII and the Navi lineup launched in 2019 with 7nm GPUs, with Navi aimed at the mass consumer market.


It was later revealed that AMD's next-generation HPC & AI GPUs would be designed separately from the consumer-end chips. This meant that the Arcturus GPU would be kept exclusive to the datacenter market. AMD just recently confirmed in its Radeon CDNA architecture roadmap that all CDNA based GPUs would be exclusively designed for the HPC & data center markets while Radeon RDNA GPUs will power the consumer segment.

Coming to the specifications, it was previously revealed that AMD's Arcturus GPU would feature a larger cache and twice as many CUs as Vega. That, along with a list of data-center-specific features such as XDLOPs, Rapid Packed Math, a new vector ALU and BFloat16, is to be expected in the Radeon Instinct cards that feature the new CDNA architecture. The previous Radeon Instinct MI100 prototype board, 'D34303', featured the Arcturus-XL die with a rated TDP of 200W and 32 GB of HBM2 VRAM clocked at around 1000-1200 MHz.

The information for this part is based on a prototype, so the final specifications will likely differ, but here are the key points:

  • Based on Arcturus XL GPU
  • Test Board has a TDP of 200W
  • Up To 32 GB HBM2 Memory
  • HBM2 Memory Clocks Reported Between 1000-1200 MHz

Once again, Rogame has spotted a test board based on the Arcturus CDNA GPU. From the looks of it, this variant offers 120 CUs for a total of 7680 stream processors and a GPU clock speed of 878 MHz (750 MHz SOC clock). It also features an undefined amount of HBM2 memory clocked at 1200 MHz, so if we are looking at a 4096-bit bus, we should get around 1.2 TB/s of bandwidth, which is what Aquabolt is able to offer. That said, it is very likely that both NVIDIA and AMD will end up utilizing the faster HBM2E 'Flashbolt' memory, which goes into production this year and will be capable of delivering up to 1.8 TB/s of bandwidth.
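The bandwidth estimate above can be reproduced with a few lines of Python. This is a minimal sketch that assumes HBM2 transfers data twice per memory clock (double data rate) and that the bus is 4096 bits wide, as speculated in the article. The function name `hbm_bandwidth_gbs` is made up for illustration.

```python
def hbm_bandwidth_gbs(mem_clock_mhz: float, bus_width_bits: int,
                      transfers_per_clock: int = 2) -> float:
    """Return peak memory bandwidth in GB/s (1 GB = 1e9 bytes).

    Assumes a double-data-rate interface by default, i.e. two
    transfers per memory clock, which is how HBM2 is usually rated.
    """
    transfers_per_second = mem_clock_mhz * 1e6 * transfers_per_clock
    bytes_per_transfer = bus_width_bits / 8
    return transfers_per_second * bytes_per_transfer / 1e9


# Test board from the leak: 1200 MHz HBM2 on an assumed 4096-bit bus.
test_board_bw = hbm_bandwidth_gbs(1200, 4096)
print(f"{test_board_bw:.1f} GB/s")
```

Under these assumptions the result is roughly 1.23 TB/s, which lines up with the "around 1.2 TB/s" figure quoted for the test board.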

Talking about the clock speeds, the 878 MHz of the test board is rather slow, as we have seen variants going up to 1334 MHz in the past. At the mentioned speed, the chip would deliver around 13.5 TFLOPs of FP32 compute, which is lower than the Radeon Instinct MI60 and also lower than the 21 TFLOPs that we got on the previous prototype sample. It is likely that the first iteration of CDNA GPUs will end up somewhere around 25 TFLOPs of FP32.
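The FP32 figures above follow from the usual peak-throughput rule of thumb: stream processors times two floating-point operations per clock (one fused multiply-add) times the clock speed. The sketch below assumes 64 stream processors per CU, which is consistent with the 120 CU / 7680 core count in the leak. The function name `peak_fp32_tflops` is hypothetical.

```python
def peak_fp32_tflops(compute_units: int, clock_mhz: float,
                     sp_per_cu: int = 64, flops_per_clock: int = 2) -> float:
    """Return theoretical peak FP32 throughput in TFLOPs.

    Assumes each stream processor retires one fused multiply-add
    (counted as 2 FLOPs) per clock; real workloads will fall short.
    """
    stream_processors = compute_units * sp_per_cu
    return stream_processors * flops_per_clock * clock_mhz * 1e6 / 1e12


# Clock speeds mentioned in the article for 120 CU Arcturus parts.
for clock in (878, 1334):
    print(f"{clock} MHz: {peak_fp32_tflops(120, clock):.2f} TFLOPs")
```

With these assumptions, 878 MHz works out to about 13.5 TFLOPs, and 1334 MHz to about 20.5 TFLOPs, in the same ballpark as the roughly 21 TFLOPs quoted for the earlier prototype.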

Based on leaks of Ampere GPUs, which are also expected to be announced later this year, it looks like NVIDIA might hold the upper hand in compute performance, as its next-generation 7nm Tesla GPU lineup is speculated to hit almost 36 TFLOPs of FP32 and 18 TFLOPs of FP64 compute.

AMD Radeon Instinct Accelerators

| Accelerator Name | AMD Instinct MI400 | AMD Instinct MI350X | AMD Instinct MI300X | AMD Instinct MI300A | AMD Instinct MI250X | AMD Instinct MI250 | AMD Instinct MI210 | AMD Instinct MI100 | AMD Radeon Instinct MI60 | AMD Radeon Instinct MI50 | AMD Radeon Instinct MI25 | AMD Radeon Instinct MI8 | AMD Radeon Instinct MI6 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CPU Architecture | Zen 5 (Exascale APU) | N/A | N/A | Zen 4 (Exascale APU) | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| GPU Architecture | CDNA 4 | CDNA 3+? | Aqua Vanjaram (CDNA 3) | Aqua Vanjaram (CDNA 3) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Aldebaran (CDNA 2) | Arcturus (CDNA 1) | Vega 20 | Vega 20 | Vega 10 | Fiji XT | Polaris 10 |
| GPU Process Node | 4nm | 4nm | 5nm+6nm | 5nm+6nm | 6nm | 6nm | 6nm | 7nm FinFET | 7nm FinFET | 7nm FinFET | 14nm FinFET | 28nm | 14nm FinFET |
| GPU Chiplets | TBD | TBD | 8 (MCM) | 8 (MCM) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 2 (MCM), 1 (Per Die) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) | 1 (Monolithic) |
| GPU Cores | TBD | TBD | 19,456 | 14,592 | 14,080 | 13,312 | 6656 | 7680 | 4096 | 3840 | 4096 | 4096 | 2304 |
| GPU Clock Speed | TBD | TBD | 2100 MHz | 2100 MHz | 1700 MHz | 1700 MHz | 1700 MHz | 1500 MHz | 1800 MHz | 1725 MHz | 1500 MHz | 1000 MHz | 1237 MHz |
| INT8 Compute | TBD | TBD | 2614 TOPS | 1961 TOPS | 383 TOPS | 362 TOPS | 181 TOPS | 92.3 TOPS | N/A | N/A | N/A | N/A | N/A |
| FP16 Compute | TBD | TBD | 1.3 PFLOPs | 980.6 TFLOPs | 383 TFLOPs | 362 TFLOPs | 181 TFLOPs | 185 TFLOPs | 29.5 TFLOPs | 26.5 TFLOPs | 24.6 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP32 Compute | TBD | TBD | 163.4 TFLOPs | 122.6 TFLOPs | 95.7 TFLOPs | 90.5 TFLOPs | 45.3 TFLOPs | 23.1 TFLOPs | 14.7 TFLOPs | 13.3 TFLOPs | 12.3 TFLOPs | 8.2 TFLOPs | 5.7 TFLOPs |
| FP64 Compute | TBD | TBD | 81.7 TFLOPs | 61.3 TFLOPs | 47.9 TFLOPs | 45.3 TFLOPs | 22.6 TFLOPs | 11.5 TFLOPs | 7.4 TFLOPs | 6.6 TFLOPs | 768 GFLOPs | 512 GFLOPs | 384 GFLOPs |
| VRAM | TBD | HBM3e | 192 GB HBM3 | 128 GB HBM3 | 128 GB HBM2e | 128 GB HBM2e | 64 GB HBM2e | 32 GB HBM2 | 32 GB HBM2 | 16 GB HBM2 | 16 GB HBM2 | 4 GB HBM1 | 16 GB GDDR5 |
| Infinity Cache | TBD | TBD | 256 MB | 256 MB | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| Memory Clock | TBD | TBD | 5.2 Gbps | 5.2 Gbps | 3.2 Gbps | 3.2 Gbps | 3.2 Gbps | 1200 MHz | 1000 MHz | 1000 MHz | 945 MHz | 500 MHz | 1750 MHz |
| Memory Bus | TBD | TBD | 8192-bit | 8192-bit | 8192-bit | 8192-bit | 4096-bit | 4096-bit | 4096-bit | 4096-bit | 2048-bit | 4096-bit | 256-bit |
| Memory Bandwidth | TBD | TBD | 5.3 TB/s | 5.3 TB/s | 3.2 TB/s | 3.2 TB/s | 1.6 TB/s | 1.23 TB/s | 1 TB/s | 1 TB/s | 484 GB/s | 512 GB/s | 224 GB/s |
| Form Factor | TBD | TBD | OAM | APU SH5 Socket | OAM | OAM | Dual Slot Card | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Full Length | Dual Slot, Half Length | Single Slot, Full Length |
| Cooling | TBD | TBD | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling | Passive Cooling |
| TDP (Max) | TBD | TBD | 750W | 760W | 560W | 500W | 300W | 300W | 300W | 300W | 300W | 175W | 150W |

So far, AMD has revealed that the main focus of CDNA will be performance, efficiency, features, and scalability in the data center market. Until now, AMD's GCN architecture has served this segment, but with CDNA, AMD will be creating GPUs specifically optimized for high-performance compute, machine learning, and HPC. The 1st Gen CDNA GPUs will feature the 2nd Gen Infinity Architecture and will utilize ROCm (Radeon Open Compute Platform) to power the data center with key optimizations and enhanced scalability. The 2nd Gen Infinity Architecture will allow for 4-8 way GPU connectivity in a single node, allowing the new Radeon Instinct boards to run in harmony.

AMD has proved that it can offer more FLOPs at a competitive price, so maybe that is what Arcturus will be targeting. There's no word on when Arcturus will land, but AMD has hinted at a Radeon Instinct product later this year that will feature its 1st Gen CDNA architecture.


About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.
