OpenAI and Broadcom Unveil Chip for Large-Scale LLM Inference


2026-06-25 | Source: Ars Technica

OpenAI and Broadcom unveil a custom chip for large language model inference. The chip is designed to improve LLM performance at scale.

OpenAI and Broadcom have announced a chip designed for large language model (LLM) inference at scale. The announcement follows OpenAI's unveiling of its first AI chip, Jalapeño, which could make ChatGPT faster and cheaper, as we reported earlier. The chip described here carries the same name, Jalapeño, and is a custom AI chip built from scratch to improve performance, efficiency, and scale across AI systems.

The partnership underscores the growing importance of specialized hardware for AI applications. By optimizing the chip for LLM inference, OpenAI and Broadcom aim to boost inference performance and energy efficiency significantly, potentially reducing hardware costs. Lower costs, in turn, could affect how widely AI technologies are adopted.

The partnership pairs OpenAI's deep understanding of LLM fundamentals with Broadcom's expertise in silicon implementation, and it could set a new standard for LLM inference. As the silicon race heats up, expect further innovations in AI hardware and increased competition as companies strive to keep up with demand for AI technologies.

