Hardware

Intel Makes Its NPU Acceleration Library An Open-Source Asset, Allowing Devs To Optimize AI Applications

Muhammad Zuhair • Mar 2, 2024 at 02:50pm EST

Intel has finally "open-sourced" its NPU Acceleration library, allowing developers and enthusiasts to tune their applications to work best with Intel's AI engines.

Intel's Open-Sourcing of NPU Libraries Reveals That Dedicated AI Engines Have a Great Future Ahead

The news comes from Intel's Tech Evangelist Tony Mongkolsmai, who disclosed the firm's new open-source library in the first place.

With this step, the NPU acceleration library will help developers benefit from NPUs existing in CPU lineups such as the Meteor Lake "Core Ultra" series. It is based on Python, and it simplifies development by providing a high-level interface and supports popular frameworks like TensorFlow and PyTorch, giving developers the power to leverage the library's capabilities for making AI-related tasks more efficient.

For devs that have been asking, check out the newly open sourced Intel NPU Acceleration library. I just tried it out on my MSI Prestige 16 AI Evo machine (windows this time, but the library supports Linux as well) and following the GitHub documentation was able to run TinyLlama… pic.twitter.com/UPMujuKGGT

— Tony Mongkolsmai (@tonymongkolsmai) March 1, 2024

Tony had been running the NPU acceleration library on an MSI Prestige 16 AI Evo laptop, which features the Intel Core Ultra CPUs. He could run TinyLlama and Gemma-2b-it LLM models on the machine without performance disruptions, indicating the potential captivated in Intel's NPUs and how they promote an edge AI environment for developers. Here is how the Intel development team themselves describes the library:

The Intel NPU Acceleration Library is a Python library designed to boost the efficiency of your applications by leveraging the power of the Intel Neural Processing Unit (NPU) to perform high-speed computations on compatible hardware.

In our quest to significantly improve the library's performance, we are directing our efforts toward implementing a range of key features, including:

8-bit quantization

4-bit Quantization and GPTQ

NPU-Native mixed precision inference

Float16 support

BFloat16 (Brain Floating Point Format)

torch.compile support

LLM MLP horizontal fusion implementation

Static shape inference

MHA NPU inference

NPU/GPU hetero compute

Paper

via Github Intel

It is great to see the open-sourcing of the NPU acceleration library, as it would ultimately lead to an enhanced implementation of AI applications running on Intel's dedicated AI engines. It will be interesting to see what sort of developments we see on such engines moving ahead, since, as stated by Tony himself, there is a lot packed in for consumers and developers.

News Source: Tony Mongkolsmai

About the author: Muhammad Zuhair is a hardware and technology reporter for Wccftech, specializing in the semiconductor industry and the complex interplay between technology, manufacturing, and geopolitics. His coverage focuses on the corporate strategies and technological roadmaps of industry giants like TSMC, NVIDIA, Samsung, and Intel. Zuhair's expertise lies in deconstructing complex topics such as fabrication nodes (e.g., 2nm process), the economic impact of policies like the CHIPS Act, and the strategic development of AI infrastructure from NVIDIA, AMD and Intel.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on Intel Makes Its NPU Acceleration Library An Open-Source Asset, Allowing Devs To Optimize AI Applications

Intel Makes Its NPU Acceleration Library An Open-Source Asset, Allowing Devs To Optimize AI Applications

Intel's Open-Sourcing of NPU Libraries Reveals That Dedicated AI Engines Have a Great Future Ahead

Trending Stories

Nintendo Doubles Down on Switch 2 Security, But Developer Gezine Cracks a Universal Exploit That Works Entirely Offline

Xbox Studio Leaders Reportedly Detest Game Pass, Arguing it Destroyed the Value of Their $40+ Games Now Available for Pennies

A Modder Fits Entire Grand Theft Auto PS2 Trilogy Inside a Single Game, While Rockstar Continues to Prepare GTA 6

Some Newer GeForce RTX 5060 GPUs Transition To 16-pin Connector As Vendors Deploy Cut-Down GB205 Die

TSMC’s CFO Admits US Fabs Cost Four To Five Times More Than Taiwan, Yet Doubles Down With $100B Bet

Popular Discussions

AMD Medusa Point 10-Core “Zen 6” CPU Beats Strix Point 10-Core “Zen 5” By Nearly 35% While Operating at 5.4 GHz

AMD Ryzen 7 7700X3D 4.5 GHz “3D V-Cache” CPU Review: The Budget X3D Champ For AM5

NVIDIA GeForce RTX 50 SUPER GPUs Have Reportedly Arrived At AIBs, But Are On Hold Due To Undecided Memory Prices

AMD Ryzen 7 5800X3D Outsells Ryzen 7 7800X3D For The Same Price On Amazon Despite Being Weaker

AMD Ryzen 7 7800X3D CPU Drops To $299 A Day Ahead of 7700X3D’s Launch, Bringing 3D V-Cache Goodness To Mainstream Gamers

Intel Makes Its NPU Acceleration Library An Open-Source Asset, Allowing Devs To Optimize AI Applications

Intel's Open-Sourcing of NPU Libraries Reveals That Dedicated AI Engines Have a Great Future Ahead

Related Story Intel Xeon 6 Leaps To 8000 MT/s Memory Now, But The Real Payoff Waits For 8800 MT/s MRDIMM In 2027

Further Reading

Trending Stories

Popular Discussions