Loading article…
OpenAI cuts token fees 50% and Anthropic 67% amid enterprise caps like Uber’s $1,500 limit, driving a shift to cheaper open‑source models.
OpenAI and Anthropic have announced token‑price reductions of roughly 50% and 67% respectively, a move forced by enterprises that are tightening AI budgets and demanding lower‑cost alternatives [1].
| At a glance | |
|---|---|
| OpenAI token price cut | ~50% |
| Anthropic token price cut | 67% (Claude Opus) |
| Uber AI spend cap | $1,500 per tool per month |
| Business token usage growth | +1,001% Jan 2025‑Apr 2026 |
Enterprises such as Uber have already capped AI spending, limiting usage of Claude Code and Cursor to $1,500 per month per tool after burning through its 2026 AI coding budget in just four months [1]. The pressure has pushed OpenAI to roll out “Flex” processing at a half‑price discount, while Anthropic’s latest Opus 4.5 launch slashed Claude Opus pricing by two‑thirds [1]. Companies are responding by routing simple tasks to far cheaper open‑source models from Chinese firms like Alibaba and DeepSeek, reserving the most expensive models for only the most complex work [1]. This cost‑optimisation is reflected in a 1,001% surge in business token consumption from January 2025 to April 2026, outpacing the 497% rise in overall spend [1].
The token‑price war could compress margins for AI chatbot providers, with Anthropic already lowering its gross‑margin outlook to 40% because of inference costs on Google and Amazon servers [1]. Nvidia’s dominance may erode as custom chips—Google TPUs, AWS Trainium, Microsoft Maia, and Meta MTIA—are projected to cut inference costs by 30‑50% and capture a growing share of AI servers, potentially reaching 40% of deployments by 2030 [1]. Meanwhile, neocloud operators such as CoreWeave, which carries $29 billion in liabilities and $9.7 billion due within twelve months, could face tighter cash flow if lower token prices depress revenue streams [1].
| Token cost comparison | |
|---|---|
| Anthropic Fable 5 (per‑token) | >50× DeepSeek V4 Pro |
| OpenAI Flex (per‑token) | 50% discount vs. standard rate |
The steep token‑price cuts underscore a shift from “token‑maxxing” to cost‑conscious AI deployment, raising the question of whether lower prices will revive enterprise demand or accelerate a broader pullback from high‑margin AI services.
Coverage is mostly measured — 108 of 130 reports stay neutral.
Every Monday — the token unlocks, Fed dates & catalysts set to move crypto and markets this week. So you’re never blindsided.
Free · 3-min read · one-click unsubscribe
AI-assisted synthesis by the TrendWatcher Editorial Desk · sourced from 2 outlets · Jun 26, 2026 · How we report
OpenAI's annualized run rate was reported to be around $25 billion earlier this year, up from $13.1 billion in 2025.
OpenAI filed a confidential IPO in early June, alongside Anthropic.
OpenAI introduced analytics and controls that let administrators track credit spend, set usage limits, and provide budget visibility for employees.
OpenAI teamed up with Broadcom to develop an inference‑optimised chip named Jalapeño.
Microsoft, Amazon, and Google are investing heavily in lower‑cost AI models to compete with OpenAI and Anthropic.