OpenAI and Broadcom have unveiled Jalapeño, OpenAI's first custom chip and what OpenAI is calling its first Intelligence Processor: a purpose-built inference ASIC architected around OpenAI's own understanding of how large-model inference behaves at scale. OpenAI is explicit that this is not a repurposed training accelerator or a general-purpose AI processor. The design targets the practical bottlenecks that dominate inference economics — costly data movement, the balance between compute and memory, and networking efficiency — with the stated goal of pushing realized utilization much closer to theoretical peak than today's leading hardware achieves.
Physically, the package appears to hold one large compute chiplet flanked by six high-bandwidth-memory stacks, plus a second chiplet carrying input/output interfaces and two structural dummy dies for mechanical support. The headline claim is performance per watt substantially better than current state-of-the-art, though OpenAI says a detailed technical report will follow in the coming months, so the efficiency numbers are not yet independently verifiable.
Two details stand out beyond the silicon itself. First, the development cadence: the chip reached tape-out in roughly nine months, an unusually fast turn for a reticle-scale ASIC, and the companies credit a tight software-hardware co-design loop that used OpenAI's own models to accelerate parts of the design work. Second, engineering samples are already running real machine-learning workloads in the lab at production target frequency and power, including GPT-5.3-Codex-Spark, which suggests the part is past the paper-design stage.
Strategically, Jalapeño is positioned as the first step in a multi-generation compute platform pairing OpenAI-designed accelerators with Broadcom's silicon implementation, networking, and connectivity, plus Celestica for board, rack, and system integration. Deployment is slated to begin by the end of 2026 and expand from there. The throughline is vertical integration: like the hyperscalers before it, OpenAI is moving to own the full inference stack rather than renting it, reducing exposure to Nvidia pricing and supply while tailoring the hardware to its specific serving patterns. The open questions are the ones the technical report will have to answer — real utilization on production traffic, total cost of ownership against Nvidia's latest parts, and whether a single-customer ASIC can keep pace with a rapidly moving model architecture.
- The Information frames it strategically — a step in OpenAI's effort to reduce reliance on Nvidia and control its own hardware, with Broadcom talks dating to 2024 and an earlier reported $18B financing snag.
- TechCrunch leads with the chip's codename, Jalapeño, and that it was designed specifically for the unique needs of OpenAI's inference systems rather than training.