OpenAI’s First Custom Chip Is As Hot As A Jalapeño For AI, As The Firm Calls It The “Best Inference Platform” for LLMs

Jun 24, 2026 at 11:50am EDT
Two people holding a plaque with a wafer labeled 'Jalapeño Intelligence Processor'.

OpenAI has just announced its first custom chip called Jalapeño, which is aimed at AI Inference workloads & produced by Broadcom.

OpenAI Designed & Built Its First Custom AI Silicon, The Jalapeño Chip, For Agentic AI

Today, OpenAI announced that it has built and designed its first AI chip, which they are calling Jalapeño. Interesting choice of name for a chip that came fresh off the baking ovens at Broadcom, who are responsible for producing this chip.

Related Story Google Taps MediaTek to Build TPUv9 “Triggerfish,” Fusing CPU and Compute Die in One Package for Agentic AI

The Jalapeño Intelligence Processor and its first wafer were shown off by OpenAI's CEO, Sam Altman, and Broadcom's CEO, Hock Tan. This chip continues the trend of AI firms developing custom AI chips for the Agentic AI era. We have seen the likes of Anthropic exploring custom AI chips, while Google is already deep within its TPU strategy with a multi-front solution.

With the Jalapeño chip, OpenAI claims that it will mark the beginning of their vision of the future of LLM inference. The chip is designed to be OpenAI's first AI accelerator in a multi-generational compute platform that firms are building together to make AI faster, more reliable & accessible.

Jalapeño is designed from scratch and focuses solely on AI workloads. OpenAI states that the chip was co-developed from initial design to manufacturing tape-out in just nine months.

The chip will be backed by a robust ecosystem developed by its partners, Broadcom & Celestica. Together, the firms will help industrialize the platform through chip implementation, board, rack system integration, high-performance networking, and scalable production systems.

Jalapeño is a blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads. It is informed by the systems OpenAI runs every day across ChatGPT, Codex, the API, and future agentic products, while also being designed for current and future LLMs across the industry. The goal is to combine the power and throughput of today’s leading AI accelerators with latency closer to the fastest specialized inference systems, making Jalapeño well suited for interactive LLM products at scale.

OpenAI

In terms of applications, Jalapeño is designed to be flexible and works with all LLMs. The first engineering samples of the chip are already running ML workloads such as GPT-5.3-Codex-Spark at production target frequency and power. The company isn't sharing a whole lot of details at the moment, but we do see eight HBM sites and the compute die(s) in the middle of the chip.

As for launch, the first Jalapeño platforms are said to be deployed by the end of 2026 and will expand in the years ahead. The chip is designed to be a multi-generation effort with a multi-generation compute platform.

Just like the others, OpenAI diving into the custom silicon space shows that the demand for custom ASIC and AI accelerators is growing a lot. Last year, OpenAI signed a partnership to deploy 10GW of NVIDIA systems, but as supply issues persist, AI firms are now investing in custom silicon that doesn't lock them into NVIDIA's ecosystem or make them solely reliant on one chipmaker, but makes their compute platform more diverse, with room to expand their compute portfolio.

About the author: A Software Engineer by training and a PC enthusiast by passion, Hassan Mujtaba serves as Wccftech's Senior Editor for hardware section. With years of experience in the industry, he specializes in deep-dive technical analysis of next-generation CPU and GPU architectures, motherboards, and cooling solutions. His work involves not only breaking news on upcoming technologies but also extensive hands-on reviews and benchmarking.

Follow Wccftech on Google to get more of our news coverage in your feeds.