Coinbase

Coinbase CEO outlines five tactics to halve AI spend while token use

TrendWatcher AI (Enhanced)·6h ago·neutral

Most covered nowLIVEsee all →

NFTcryptoHot20 stories AltcoinscryptoHot20 stories ChainlinkcryptoHot20 stories LitecoincryptoHot20 stories StablecoinscryptoHot20 stories Ethereumcrypto7 stories Microsofttech10 stories Fed Ratesfinance8 stories

At a glance
AI spend	↓ ≈ 50 % from peak
Token usage	Near‑record high
Default models	GLM 5.2, Kimi 2.7 (Chinese LLMs)
Cost‑saving tactic	Automated model routing, caching, lean context, spend visibility

Cost‑cutting tactics in detail

Armstrong’s first lever is swapping default LLMs for cheaper open‑weight Chinese models—GLM 5.2 from Z.ai and Kimi 2.7 from Moonshot AI—rather than defaulting to premium offerings from Anthropic or OpenAI【1】. The second step routes each prompt to the most appropriate model based on task difficulty, letting “frontier” models handle planning while cheaper models handle execution【1】. A third measure improves inference cost by using more aggressive caching, and a fourth keeps context lean by starting fresh sessions when switching tasks【1】. Finally, the company makes every engineer’s token consumption visible, tying higher spend to higher impact expectations rather than imposing hard caps【1】.

Impact on spend and usage trends

Armstrong attached a graph showing token usage climbing to historic levels while AI spend dropped sharply, though the exact timeline isn’t disclosed【1】. The Decoder reports that the same routing and caching upgrades lifted the hit‑rate from 5 % to 60 % and cut Coinbase’s AI bill in half as token usage kept rising【4】. In a separate Business Insider post, Armstrong said the firm has kept costs “roughly flat” despite exponential token growth, and he forecasts that within 12‑18 months, 80 % of workloads will run on models that are 99 % cheaper than today’s frontier options【2】.

Industry context

Coinbase’s strategy mirrors a broader shift away from the “tokenmaxxing” craze, where firms previously encouraged unrestricted token consumption to showcase raw AI power. Instead, companies now impose usage caps or visibility rules to curb runaway costs. Armstrong’s approach aligns with moves by other tech firms—Lindy’s adoption of Deepseek v4 and Snowflake’s testing of Chinese models—adding pricing pressure on Western AI labs as they prepare for potential IPOs【4】.

What to watch

AI spend vs. token usage: Monitor quarterly updates for any divergence between rising token counts and AI expenditure.
Model adoption: Track the proportion of prompts routed to GLM 5.2 and Kimi 2.7 versus frontier models, especially as the 12‑18 month cost‑efficiency target unfolds.
Caching efficiency: Watch for reported hit‑rate improvements (e.g., moving from 5 % toward the 60 % benchmark) in Coinbase’s internal AI performance reports.

By halving AI costs while allowing token usage to expand, Coinbase demonstrates a scalable model for crypto‑focused firms that need AI‑driven productivity without unsustainable spend. The open question remains whether the cost‑saving measures will sustain as AI workloads become more complex and demand higher‑end models.

Keep reading

CoinbaseCoinbase Q2 2024 earnings call live streamTrendWatcher AI (Enhanced) · 6h ago CoinbaseBase layer‑2 suffers two sequencer‑related outages in a weekTrendWatcher AI (Enhanced) · 1d ago CoinbaseCoinbase Base blockchain resumes after two‑hour outageTrendWatcher AI (Enhanced) · 1d ago CoinbaseCoinbase Earn faces SEC lawsuit threat, shares tumble 20%TrendWatcher AI (Enhanced) · 1d ago CoinbaseCoinbase launches stocks, perps and AI productsTrendWatcher AI (Enhanced) · 1d ago CoinbaseCoinbase latest news – no new market move reportedTrendWatcher AI (Enhanced) · 1d ago

Coming upLIVEsee all →

JUN 30 · all day UTCcryptoOptimism Token Unlock JUN 30 · all day UTCcryptoCelestia Token Unlock JUL 1 · all day UTCcryptoWorldcoin Token Unlock JUL 1 · all day UTCcryptoSui Token Unlock JUL 2 · 12:30 UTCmacroUS Jobs Report (NFP)

Across the coverage

Coverage is mostly measured — 73 of 84 reports stay neutral.

Bullish 5

Neutral 73

Bearish 6

The Catalyst Brief

Know what’s about to move the market.

Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.

Free · 3-min read · one-click unsubscribe

Synthesized from 5 sources

AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 5 outlets · Jun 29, 2026 · How we report

Published

Jun 29, 2026, 03:47 PM

Source

TrendWatcher AI (Enhanced)

Frequently asked · Coinbase

What specific AI models is Coinbase experimenting with to lower costs?

Coinbase is experimenting with open weight models including GLM 5.2 from Z.ai and Kimi 2.7 from Moonshot AI.

How does Coinbase manage AI spending among its engineers?

The company provides engineers with visibility into their token usage and expects higher impact from those who consume more AI resources.

Why did Coinbase lay off 14% of its staff?

The layoffs were partly attributed to AI changing how people work and enabling small teams to complete tasks more quickly.

Explore More

Ethereum Bitcoin OpenAI Tesla Fed Rates Layer 2 Scaling