Turn Your Vacant M.2 Slot Into a 20B LLM Cruncher With This Dedicated AI Module: Packing 32 GB Memory & 60 TOPs

•

Apr 15, 2026 at 01:30am EDT

Turn Your Vacant M.2 Slot Into a 20B LLM Cruncher With This Dedicated AI Module: Packing 32 GB Memory & 60 TOPs 1

Unigen's latest AI module runs on a standard M.2 slot and offers up to 60 TOPS, 32 GB memory, & can run up to 20B parameter LLMs.

Unigen Amaretti AI Module Packs a 60 TOPS NPU, 32 GB Memory, & Consumes Just 10W Power On A M.2 or E1.S Slot

With the rise of local AI agents, many companies are putting out some unique AI products. Unigen is one of these manufacturers that has announced the Amaretti E1.S AI module, a tiny M.2-compatible module that looks like a regular SSD but houses some strong AI capabilities.

The Unigen Amaretti E1.S is based on the SAKURA-II AI accelerator from EdgeCortix, which was initially designed for low-power AI platforms, bringing these capabilities to Raspberry Pi5 & other ARM-based products. The accelerator chip features an NPU with 60 TOPS of INT8 and 30 TFLOPS of BF16 compute. It features a dual 64-bit LPDDR4x memory controller and packs 20MB of in-chip SRAM cache. The 19x19 BGA package consumes roughly 8-10W.

What Unigen has done is taken the SAKURA-II AI accelerator and put it on an E1.S board, along with an impressive memory capacity of up to 32 GB. The module is available in both 16 GB and 32 GB flavors, offering up to 68 GB/s bandwidth. The Amaretti module is rated at 10W, so you are getting 6 TOPS per Watt.

Now, in terms of performance, the 32 GB memory capacity enables the module to easily run AI LLMs with up to 20B parameters. This is ideal for low-power AI solutions that need to run GenAI & Agentic AI workflows. Furthermore, these modules can be stacked into multiple M.2 slots, further increasing their overall capabilities. EdgeCortix already offers a higher-end PCIe configuration which features two of these chips & additional capabilities, but the M.2 solution is definitely an interesting choice.

Many PCs, Desktops & Laptops, have idle sitting M.2 slots. If you are looking for localized AI and want to speed up your system, then these modules make a lot of sense.

According to Unigen, the AI module supports all the latest AI frameworks such as TensorFlow, PyTorch, ONNX, and Hugging Face. The main highlights of the module include:

E1.S AI Module
AI Accelerator: SAKURA-II
Up to 1920 TOPS of inference performance with air-cooled Dual CPU Servers
Use 20% of the Watts with TPUs compared to training GPUs
GenAI LLMs for up to 20B parameters
14-week lead times, significantly less than the typical for GPU servers
Up to 32GB per module

Unigen ships the Amaretti E1.S AI module with a pre-equipped heatsink. There's no information regarding the price, but the memory capacity should give a hint at what to expect.

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Turn Your Vacant M.2 Slot Into a 20B LLM Cruncher With This Dedicated AI Module: Packing 32 GB Memory & 60 TOPs

Unigen Amaretti AI Module Packs a 60 TOPS NPU, 32 GB Memory, & Consumes Just 10W Power On A M.2 or E1.S Slot

Related Story Hygon’s 128-Core & 512-Thread C86 CPU Targets Intel Xeon With 15% IPC Gain, As China Races to Cut Foreign Chip Reliance

Further Reading

InnoGrit Unveils IG5686 PCI Gen6 SSD Controller With Up To 256 TB Capacities At 28 Gbps, Aims Gen7 SSDs For 2028 With 100 Million IOPS

Phison Demos Its Next-Gen PCIe Gen6 "PS5303-X3" SSD Controller, Up To 28 GB/s Speeds, Full Gen6 Redriver/Retimer Stack Ready

QNAP Pairs a 6-Year-Old Zen 2 EPYC With NVIDIA's 96GB RTX PRO 6000 Blackwell in Its New Edge AI NAS

Intel Arc Pro B70 Outclasses NVIDIA's RTX Pro 4000 In AI At Half The Cost, 33% More Memory