HeadlinesBriefing favicon HeadlinesBriefing.com

OpenAI and Broadcom Unveil Jalapeño Chip for LLM Inference

Ars Technica •
×

OpenAI and Broadcom have partnered to develop Jalapeño, a custom ASIC designed specifically for large language model inference in data centers. The chip represents the first generation of what both companies describe as a long-term collaboration to create specialized hardware for AI workloads. This marks OpenAI's push into custom silicon territory, moving beyond reliance on general-purpose processors.

Broadcom built the chip from the ground up using insights from OpenAI researchers, incorporating the AI company's roadmap for future models into the design process. Development took nine months, suggesting rapid execution for a custom silicon project. The ASIC targets the unique computational patterns of LLM inference rather than adapting existing chip architectures.

Early testing indicates Jalapeño delivers performance per watt significantly better than current state-of-the-art solutions, though OpenAI hasn't released specific benchmarks yet. The company plans to publish a detailed technical report in upcoming months. This could reshape how AI companies approach hardware procurement.

Custom inference chips like Jalapeño address growing demand for efficient AI processing as models scale. Major cloud providers and AI companies face mounting costs running inference workloads on traditional GPUs and CPUs. Specialized silicon could reduce operational expenses while improving performance, though adoption will depend on manufacturing costs and compatibility with existing infrastructure.