Loading article…
Coinbase engineers urged to adopt cheaper Chinese AI models; AI spend halved while token usage hits historic highs, signaling a new cost‑control play.
Coinbase announced that AI spending has dropped to roughly 50 % of its peak even as token consumption climbs to one of the highest levels in company history, a shift driven by new routing rules and a move to cheaper Chinese large‑language models (LLMs) [1][2].
| At a glance | |
|---|---|
| AI spend | ↓ ≈ 50 % of peak |
| Token usage | ↑ to historic high |
| Default LLMs | GLM 5.2 & Kimi 2.7 (Chinese) |
| Cache hit rate | ↑ 5 % → 60 % |
CEO Brian Armstrong outlined five tactics to keep AI costs low while encouraging engineers to “token‑max” responsibly. The first tactic replaces default frontier models with open‑weight Chinese alternatives—GLM 5.2 from Z.ai and Kimi 2.7 from Moonshot AI—whose per‑token fees are substantially lower than those of Anthropic or OpenAI [1]. A second tactic routes prompts automatically to the most appropriate model based on task difficulty, reserving expensive frontier models for planning‑heavy work and using cheaper models for execution [1].
Armstrong shared a graph (timeline unspecified) showing token usage at a near‑record level while AI spend has fallen to “nearly half its peak” [1]. The shift to Chinese LLMs, combined with an automatic routing system, boosted the cache‑hit rate from about 5 % to 60 % and cut overall AI costs in half, according to the company’s internal data [2]. The higher cache efficiency means fewer inference calls, directly reducing per‑token expense.
The AI‑cost reforms arrive less than two months after Coinbase trimmed roughly 14 % of its workforce—about 700 roles—in an AI‑driven restructuring [3]. Armstrong framed the layoffs as a move toward a “leaner, faster, and more efficient” operation, emphasizing a flatter hierarchy and “AI‑native” talent [3]. The broader industry trend is moving away from the “token‑maxxing” approach that allowed unlimited token consumption, toward accountability mechanisms that tie AI spend to measurable business output [1][2].
Coinbase’s pivot to cheaper, non‑Western LLMs illustrates how crypto‑focused firms are re‑engineering AI consumption to preserve margins, raising questions about the long‑term viability of domestic AI providers in high‑volume enterprise settings.
Coverage is mostly measured — 83 of 94 reports stay neutral.
Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.
Free · 3-min read · one-click unsubscribe
AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 3 outlets · Jul 2, 2026 · How we report
Coinbase announced on October 8, 2023 that staking is now available to New York residents.
Coinbase holds nearly 12 percent of all Bitcoin in existence.
Coinbase executives state that staking services are not securities.
Coinbase reports having over 100 million users.