Loading article…
Coinbase CEO Brian Armstrong cut AI spending by nearly half while boosting token usage by routing to cheaper Chinese models like GLM 5.2.
Coinbase cut internal AI spending by nearly half while pushing token usage to record highs by defaulting engineers to cheaper Chinese models and aggressive caching [2, 3]. The strategy, outlined by CEO Brian Armstrong, addresses a widening enterprise cost crisis where AI bills outpaced budgets, though it introduces new legal considerations regarding data sovereignty [3].
| At a glance | |
|---|---|
| AI Spend Change | Down nearly 50% from peak levels [2] |
| Token Usage | At one of highest levels in company history [2] |
| New Default Models | GLM 5.2, Kimi K2.7 Code [2, 3] |
| Cost Comparison | GLM 5.2 is ~6x cheaper than Opus 4.8 on output [3] |
Armstrong detailed a five-point strategy focused on infrastructure efficiency rather than usage caps, noting that 91% of engineers were not hitting their previous limits anyway [3]. The exchange now routes routine tasks like code reviews and summarization to open-weight models GLM 5.2 and Kimi K2.7 Code, developed by Zhipu AI and Moonshot AI respectively, instead of expensive frontier models from Anthropic or OpenAI [2, 3]. This switch leverages Mixture-of-Experts (MoE) architecture, which activates only a fraction of parameters per token, dropping costs to roughly $1.40 per million input tokens compared to $5 for Anthropic's Opus 4.8 [3]. Armstrong predicts 80% of workloads will eventually run on models that are 99% cheaper within 12 to 18 months [1].
The most significant driver of the cost reduction was not the model switch alone but a 12-fold improvement in caching efficiency [3]. By optimizing the open-source platform LibreChat, Coinbase raised its cache hit rate from 5% to 60%, meaning the majority of queries now return stored results at near-zero cost rather than triggering fresh, billable inference [3]. Additionally, the company implemented "intelligent routing" where AI automatically selects the appropriate model for a task, and increased visibility so engineers can track their spend against their impact [2]. This approach follows a May restructuring where Coinbase laid off 14% of its staff, partly attributed to AI increasing individual productivity [2].
The move signals a maturation in enterprise AI strategy from "tokenmaxxing"—or maximizing usage regardless of cost—to a focus on sustainable infrastructure and "intelligence allocation" [1, 2].
Coverage is mostly measured — 5 of 5 reports stay neutral.
Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.
Free · 3-min read · one-click unsubscribe
AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 3 outlets · Jul 4, 2026 · How we report
Coinbase reported earnings of $1.50 per share, beating the consensus estimate of $1.10 per share.
Retail trading volume rose 37% to $59 billion, and institutional trading volume increased 22% quarter‑over‑quarter to $236 billion.
The U.S. Securities and Exchange Commission sued Coinbase in June 2023 for operating as an unregistered exchange, broker, and clearing agency, and a federal judge denied Coinbase's motion to dismiss the case in March 2024.
The strategy involves expanding the number of tradable assets to about 40,000 and adding offerings such as prediction markets, tokenized equities, and other digital asset classes.
Being publicly listed on Nasdaq subjects Coinbase to stringent disclosure requirements and regular audits, which is viewed as providing greater transparency compared with private exchanges.