ZTYLEZMAN – Men’s fashion trends, luxury cars and watches, electronic products and financial information website

    OpenAI Jalapeño chip targets data center LLM inference

    2026-06-25 By Michael Choi

    The OpenAI Jalapeño chip was announced on June 24 as a custom AI processor built for large language model (LLM) inference in data centers, OpenAI, Broadcom, and Celestica said in a joint announcement.

    OpenAI Jalapeño chip, designed for LLM inference

    OpenAI said the Jalapeño chip is an intelligence processor purpose-built to reduce data movement between compute engines, memory, and the network fabric, which the company called the main bottlenecks for LLM inference.

    Partners and roles

    OpenAI led the architecture and system requirements, Broadcom is responsible for silicon implementation and Tomahawk network technology, and Celestica will provide circuit boards, racks, and systems integration, the joint announcement said.

    Architecture focus, not general compute

    The design concentrates on inference for large language models, not on general-purpose compute. OpenAI said the chip reduces the movement of data during inference, a strategy the company says improves performance per watt for LLM workloads.

    Broadcom said its Tomahawk technology is used to stitch the chips into data center fabrics at scale.

    Lab samples, limited public metrics so far

    Engineering samples have completed lab runs at target frequency and power, and those tests included workloads based on GPT 5.3 Codex Spark, OpenAI said. The company reported that early per-watt performance in internal tests was markedly higher than that of existing solutions, but it did not publish full technical reports.

    OpenAI and Broadcom acknowledged there are no public, directly comparable benchmarks against NVIDIA Blackwell or Google TPU under identical conditions, and they said independent validation will be important to confirm any claimed advantages.
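
For readers wondering what a like-for-like per-watt comparison involves, the arithmetic can be sketched roughly as follows. This is only an illustration: the `InferenceRun` class, the accelerator names, and every number are hypothetical placeholders, not measurements of Jalapeño, Blackwell, TPU, or any other real chip. It assumes both runs use the same model, prompts, and output length, which is the "identical conditions" requirement the companies referred to.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InferenceRun:
    """One measured inference run. All values used below are made-up placeholders."""
    name: str
    tokens_generated: int    # total output tokens produced during the run
    wall_seconds: float      # elapsed wall-clock time of the run
    avg_power_watts: float   # average power drawn over the run

    def tokens_per_second(self) -> float:
        return self.tokens_generated / self.wall_seconds

    def tokens_per_joule(self) -> float:
        # Energy in joules = average power (W) * time (s).
        return self.tokens_generated / (self.avg_power_watts * self.wall_seconds)


def efficiency_ratio(candidate: InferenceRun, baseline: InferenceRun) -> float:
    """How many times more tokens per joule the candidate delivers than the baseline."""
    return candidate.tokens_per_joule() / baseline.tokens_per_joule()


# Hypothetical runs of the same workload on two unnamed accelerators.
baseline = InferenceRun("accelerator_a", 1_000_000, 500.0, 1000.0)
candidate = InferenceRun("accelerator_b", 1_000_000, 400.0, 800.0)

print(f"{baseline.name}: {baseline.tokens_per_joule():.3f} tokens/J")
print(f"{candidate.name}: {candidate.tokens_per_joule():.3f} tokens/J")
print(f"ratio: {efficiency_ratio(candidate, baseline):.4f}x")
```

A ratio like this only means something when the workload, batch size, and power measurement boundary (chip, board, or whole rack) match on both sides, which is why independent benchmarks matter.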

    Rapid design cycle, missing public details

    OpenAI said it took only nine months to go from design to production finalization, aided in part by internal AI models that accelerated certain design tasks. The company did not disclose the process node, HBM memory configuration, die size, actual inference latency figures, or per-token cost.

    Those omissions have prompted questions in the technical community about reproducibility and real-world operating costs, industry analysts said.

    Deployment timeline and what users might see

    OpenAI is targeting first deployments by late 2026, with the platform evolving through multiple generations afterward. If the efficiency gains hold up in production, users could see faster ChatGPT response times, shorter waits for multi-step Codex tasks, and potential improvements in API capacity during busy periods, the company said.

    Whether OpenAI can use the Jalapeño chip to reduce reliance on NVIDIA will depend on repeatable benchmark results and actual service performance after the systems go live, analysts at technology research firms said.

    OpenAI, Broadcom, and Celestica did not respond to requests for additional technical data beyond the joint announcement. Independent testing and published benchmarks remain the decisive evidence customers and cloud operators will expect.
