OpenAI

OpenAI unveils Jalapeño custom AI inference chip

TrendWatcher AI (Enhanced)·9m ago·neutral

Most covered nowLIVEsee all →

Layer 2 Scalingcrypto7 stories NFTcryptoHot20 stories AltcoinscryptoHot20 stories ChainlinkcryptoHot20 stories LitecoincryptoHot20 stories StablecoinscryptoHot20 stories Bitcoincrypto7 stories Dogecoincrypto9 stories

At a glance
Chip name	Jalapeño
Claim	~50% cheaper inference per token vs. current GPUs
Design cycle	9 months from concept to tape‑out
Delivery	Engineering samples shipped to OpenAI HQ

Custom silicon for LLM inference

The chip is an application‑specific integrated circuit (ASIC) optimized for the memory‑heavy, low‑precision workloads of large‑language‑model (LLM) inference. Broadcom’s CEO Hock Tan told Bloomberg the early lab tests show performance on par with Nvidia’s Blackwell GPUs and Google’s TPUs, while delivering the 50% cost reduction claim [1]. OpenAI’s own statement qualifies the claim, describing the chip’s performance‑per‑watt as “substantially better than current state‑of‑the‑art” and noting that full technical results will be published in the coming months [1].

Designing the ASIC in nine months—what the companies call the fastest high‑performance ASIC cycle ever—was enabled by OpenAI’s models themselves. President Greg Brockman said the company’s AI models accelerated the design process in a way that was “very surprising” [2]. The silicon was fabricated by TSMC and will be integrated with Broadcom’s Tomahawk networking chips and Celestica‑built racks, forming a full‑stack solution that OpenAI can control end‑to‑end.

Competitive context

Current inference workloads run on general‑purpose GPUs, which typically achieve only 60‑70% utilization because the bottleneck is memory traffic, not raw compute [1]. By tailoring the architecture to the specific kernels, memory movement, and serving patterns of transformer models, Jalapeño aims to push utilization closer to the chip’s theoretical peak, a key factor behind the cost‑saving claim. Independent analysts note that the exact baseline chips and test conditions have not been disclosed, leaving the 50% figure unverified outside OpenAI’s own labs [1].

If the cost advantage holds at scale, OpenAI could reduce its reliance on Nvidia GPUs—its biggest AI‑hardware expense since 2022—and lessen the pressure on its cloud partners. Broadcom, whose shares have risen 10% this year and are up nearly sevenfold since 2022, stands to gain a steady stream of high‑volume ASIC orders, while competitors such as AMD, Cerebras, and AWS (with its Trainium chips) may need to accelerate their own custom‑silicon programs to stay relevant [2].

What to watch

Production rollout – Prototype racks are slated to begin deployment in late 2026, with scaling through 2027‑28 [1].
Independent benchmarks – Third‑party performance and cost analyses of Jalapeño versus Nvidia Blackwell and Google TPU will clarify the 50% claim.
Broadcom share movement – Market reaction to the announcement may signal investor confidence in the custom‑chip strategy.

The Jalapeño debut marks OpenAI’s first foray into owning the hardware stack that powers its flagship services. Whether the promised cost savings translate into lower prices for end users or a competitive edge against GPU‑centric rivals will depend on real‑world deployment data and the pace at which other AI leaders roll out their own ASICs.

Keep reading

OpenAIOpenAI makes GPT‑5.5 Instant the default ChatGPT modelTrendWatcher AI (Enhanced) · 9m ago OpenAIOpenAI Unveils Jalapeño ChipTrendWatcher AI (Enhanced) · 9m ago OpenAIOpenAI launches Jalapeno AI chip, cuts inference cost 50%TrendWatcher AI (Enhanced) · 9m ago OpenAIAmazon MGM drops Sam Altman film “Artificial” after $50 B OpenAI dealTrendWatcher AI (Enhanced) · 1d ago OpenAIOpenAI Unveils Jalapeno Chip With BroadcomTrendWatcher AI (Enhanced) · 1d ago OpenAIOpenAI opens advanced AI models to all vetted government agenciesTrendWatcher AI (Enhanced) · 2d ago

Coming upLIVEsee all →

JUN 25 · 12:30 UTCmacroUS PCE (Fed's Gauge)JUN 30 · all day UTCcryptoCelestia Token Unlock JUN 30 · all day UTCcryptoOptimism Token Unlock JUL 1 · all day UTCcryptoSui Token Unlock JUL 1 · all day UTCcryptoWorldcoin Token Unlock

Across the coverage

Coverage is mostly measured — 103 of 125 reports stay neutral.

Bullish 12

Neutral 103

Bearish 10

The Catalyst Brief

Know what’s about to move the market.

Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.

Free · 3-min read · one-click unsubscribe

Synthesized from 2 sources

AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 2 outlets · Jun 25, 2026 · How we report

Published

Jun 25, 2026, 02:39 PM

Source

TrendWatcher AI (Enhanced)

Frequently asked · OpenAI

What improvements does GPT-5.5 Instant include?

It improves conversational quality, goal understanding, handling of complex instructions, and adapts to user feedback, according to OpenAI's release notes.

When will free users receive the GPT-5.5 Instant update?

Free users are expected to receive the update within a day of its rollout to paid users.

What is the purpose of the Jalapeño chip?

Jalapeño is a purpose‑built ASIC for LLM inference, intended to increase performance per watt and reduce dependence on GPU hardware.

Who is manufacturing the Jalapeño chip and its server hardware?

Broadcom will manufacture the chip and associated server hardware, with Celestica assembling the racks.

When does OpenAI plan to start deploying Jalapeño in data centers?

OpenAI aims to begin deployment by the end of 2026 and expand over several years.

Explore More

Ethereum Bitcoin Tesla Fed Rates Layer 2 Scaling Crypto Lending