Cerebras Unveils Its 7nm Wafer Scale Engine 2: The Largest AI Chip Ever Built With 2.6 Trillion Transistors & Close To A Million Cores

Hassan Mujtaba • Apr 21, 2021 at 08:04am EDT

Cerebras CS-2 Wafer Scale Chip Outperforms Every Single GPU By Leaps & Bounds, Breaks Record of Largest AI Model Trained on A Single Device 2

Cerebras Systems has unveiled the largest AI chip based on the 7nm process node, the Wafer Scale Engine 2. Succeding the first generation WSE, the WSE2 is a singular monolithic chip that features a range of new features including the largest transistor count & most number of cores ever featured on a chip.

Cerebras Wafer Scale Engine Is A Massive AI Chip Featuring 2.6 Trillion Transistors & Nearly 1 Million Cores

The Cerberas Wafer Scale Engine 2 has two parts, the first is the chip itself and the second is the CS-2 system which it will actually be featured within. We will with the chip itself which has insane numbers to go by. For starters, the WSE-2 is the largest AI chip ever made. It is based on TSMC's 7nm process node and has a 100% yield rate which is quite impressive. For one, if you are using the entire wafer as a die, you are either going to get 100% yield if the design can absorb defects or 0% if it cannot.

The Cerebras Wafer Scale Engine chip is compared next to NVIDIA's A100 AI GPU.

Coming to the technical details, the Cerebras WSE-2 measures at 46,225mm2 which is 56 times larger than the largest GPU ever made, the NVIDIA Ampere A100. The chip features almost one million cores (850,000 to be precise), a 123x increase over the Ampere A100, 40 GB of on-chip memory (SRAM) vs 40 MB on the Ampere A100, 20 Petabytes per second of memory bandwidth and 220 Petabits per second interconnect fabric (2D Mesh) bandwidth.

Cerebras Wafer Scale Engine 2 Specs

Chip Name	Cerebras WSE-2	Cerebras WSE-1	NVIDIA A100	Cerebras Advantage (vs A100)
Process Node	TSMC 7nm	TSMC 16nm	TSMC 7nm	TSMC 7nm
Chip Size	46,225 mm2	46,225 mm2	826mm2	56 X
Cores	850,000	400,000	6,912 + 432	123 X
On-Chip Memory	40 GB	18 GB	40 MB	1,000 X
Memory Bandwidth	20 PB/s	9 PB/s	1.55 GB/s	12,862 X
Fabric Bandwidth	220 Pb/s	100 Pb/s	600 GB/s	45,833 X
Price	TBC	~$2 Million US	$11-$15K US	N/A

As for the cores on the Cerebras WSE-2, they are AI-optimized Sparse Linear Algebra Compute (SLAC) cores and are fine-tuned towards AI processing. Compared to the previous generation (Cerebras WSE-1), Cerebras's WSE-2 offers a 2.16x increase in transistor density and features more than twice the number of cores.

The WSE-2 will power the Cerebras CS-2, the company’s AI computer, which more than doubles the performance of Cerebras’ first-generation CS-1. Manufactured by Taiwan Semiconductor Manufacturing Company (TSMC) on its 7nm-node, the WSE-2 more than doubles all performance characteristics on the chip – the transistor count, core count, memory, memory bandwidth and fabric bandwidth – over the first generation WSE.

Cerebras

"TSMC has long partnered with the industry innovators to manufacture advanced processors with leading performance. We are pleased with the result of our continuous collaboration with Cerebras Systems in manufacturing the Cerebras WSE-2 on our 7nm process, another extraordinary accomplishment and milestone for wafer scale development after the introduction of the Cerebras 16nm WSE less than two years ago."

Sajiv Dalal, SVP of Business Management, TSMC North America

Moving over to the Cerebras WSE-2 powered CS-2 system, we are looking at a system that is 26" tall and composed of 15 rack units (15U). The system features 12 100GbE lanes, a custom water-cooled solution through a closed-loop setup, and twelve standard PSUs in a 9+3 redundant configuration that delivers up to 23,000 Watts of power to the CS-2 system. Some words from Cerebras on their cooling solution:

To provide the cooling horsepower the WSE-2 needs while keeping datacenter integration simple, the CS-2 is internally water-cooled. Water circulates through a closed-loop, fully self-contained within the system. Like a giant gaming PC, the CS-2 uses water to cool the WSE-2, and then air to cool the water.

The top right of the system is for the movement of water. Two hot-swappable pumps move water through a manifold across the back of the WSE-2, cooling the wafer and warming the water. Warm water is then pumped into a heat exchanger. This heat exchanger presents a large surface area for the cold air blown in by the four hot-swappable fans at the bottom of the CS-2. These fans move air from the cold aisle, cool the warm water via the heat exchanger and exhaust the warm air into the warm aisle.

Currently, there's no word on pricing for either the Cerberas CS-2 or the WSE-2 but the first generation Wafer Scale Engine had a price tag of around $2 million US.

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Cerebras Unveils Its 7nm Wafer Scale Engine 2: The Largest AI Chip Ever Built With 2.6 Trillion Transistors & Close To A Million Cores

Cerebras Unveils Its 7nm Wafer Scale Engine 2: The Largest AI Chip Ever Built With 2.6 Trillion Transistors & Close To A Million Cores

Cerebras Wafer Scale Engine Is A Massive AI Chip Featuring 2.6 Trillion Transistors & Nearly 1 Million Cores

Cerebras Wafer Scale Engine 2 Specs

Trending Stories

Intel CEO Lip-Bu Tan Warned Helium Could Choke AI Chips in June, and China’s Export Ban Might Prove Him Right

Samsung Will Take Three Generations To Unveil Its First 1.4nm Exynos SoC, But The Delay Could Prove Beneficial Despite TSMC Obtaining A Lead

Trump Mobile Wants To Entice You To Buy The “Yellow Plastic” T1 Phone By Offering A Free Charging Brick

Intel EMIB-T Breaks Past Existing AI & HPC Scaling Limits, Enabling Ultra-Large Die Complexes With Over 10x Reticle Dies & 12 Gb/s+ HBM4e DRAM

MRDIMM’s Allow DDR5 Memory To Keep Up With Next-Gen Servers, Achieving DDR6-Class Bandwidth & No Pin-Change

Popular Discussions

AMD Prepares For Zen 6 EPYC CPUs Launch For July 22nd-23rd, Confirms AMD’s Mark Papermaster

Intel’s Shot At Fabricating Apple’s A20 Chip For The Base iPhone 18 Collapses As A Credible Leaker Calls The Original Source A ‘Blowhard’

AMD’s Next-Gen Medusa Point “10-Core” CPU Beats Strix “10-Core” By 29% In Single-Core & 22% In Multi-Core While Running At Just 2.0 GHz

NVIDIA’s RTX 3060 12 GB Graphics Card Comeback Proves Just How Bad Things Are For The PC Gaming Market

AMD Ryzen Becomes The Top CPU Choice While Radeon Powers 1 In Every 3 Desktop Gaming GPUs Sold at Microcenter

Cerebras Unveils Its 7nm Wafer Scale Engine 2: The Largest AI Chip Ever Built With 2.6 Trillion Transistors & Close To A Million Cores

Cerebras Wafer Scale Engine Is A Massive AI Chip Featuring 2.6 Trillion Transistors & Nearly 1 Million Cores

Related Story Jim Keller Says Cerebras IPO Was Helpful As Tenstorrent Set To “Beat Them on Everything”, Confirms Meeting With Intel & Qualcomm CEOs “Hoping To Get A Big Deal”

Cerebras Wafer Scale Engine 2 Specs

Further Reading

Trending Stories

Popular Discussions