Qualcomm’s New AI Rack-Scale Solutions Actually Uses LPDDR Mobile Memory Onboard, Boldly Hoping to Take on NVIDIA and AMD

Oct 27, 2025 at 01:47pm EDT
Qualcomm server emphasizing Rack-scale performance and Low total cost of ownership, featuring AI200 and AI250 models.

Qualcomm has announced its latest AI chips, which are designed to scale up to a purpose-built rack-level AI inference solution, but interestingly, they employ mobile memory onboard.

Qualcomm's New AI Chips Take a 'Daring' Pivot Away From HBM To Target Efficient Inferencing Workloads

Qualcomm has come a long way from being a mobile-focused firm, and in recent years, the San Diego chipmaker has expanded into new segments, including consumer computing and AI infrastructure. Now, the firm has announced its newest AI200 and AI250 chip solutions, which are reportedly designed for rack-scale configurations. This not only marks the entry of a new player in a segment dominated by NVIDIA and AMD, but Qualcomm has managed to find a unique implementation by utilizing mobile-focused LPDDR memory.

Related Story Qualcomm Quietly Preps Snapdragon X2 CPU Refresh With Three Dies, Stalling X3 As Memory Shortages Bite

Before we delve into the specifics of the newly announced AI chips, let's examine the use of LPDDR memory compared to the more traditional HBM solution. Qualcomm's new products offer up to 768 GB of LPDDR on the accelerator package, which is significantly higher than the industry's HBM capacity. The main reason this venture looks attractive is that it reduces data-movement energy and cost, a key advantage that the firm calls a "near-memory" approach. Here are the traditional improvements the firm gets by employing LPPDR over HBM:

While this implementation sounds optimistic, Qualcomm's rack-scale solutions still fall short when compared to mainstream options from NVIDIA/AMD, simply because avoiding HBM use results in lower memory bandwidth, higher latency due to a narrow interface, and, most importantly, utilizing an immature memory stack in 24/7 high-heat server environments. However, the intention of the San Diego firm here is to provide companies with a capable inferencing option, and the use of LPDDR certainly achieves this goal, but it does limit these rack-scale configurations to a specific application.

Apart from this, the AI200 and AI250 chip solutions feature direct liquid cooling, PCIe/Ethernet protocols, and a rack-level power consumption of 160 kW, which is a pretty low figure for a modern-day solution. More importantly, the chips onboard employ the firm's Hexagon NPUs, which are widely expanding in terms of inferencing capabilities, supporting advanced data formats as well as inference-focused features.

Interestingly, the pivot towards bringing capable inferencing solutions to the market is being done by a lot of compute providers, with one of the more recent examples being Intel with its 'Crescent Island' solution and NVIDIA introducing a new Rubin CPX AI chip. Qualcomm apparently recognizes that the inferencing segment is gaining market spotlight, which is why the AI200 and AI250 chip solutions are a sensible approach here. However, for modern training or large workloads, these racks would probably be the last choice.

It's exciting to see competition emerging in the AI space, and by the looks of it, retailers took the announcements with quite some optimism.

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.