SHIFT · Infrastructure Software · May 27, 2026, 8:52 PM · Signal 85 · Short term

AI costs begin to bite as agents may increase token demand by 24 times, says Goldman Sachs report — Uber and Microsoft among companies feeling the bite of tokenized billing

Source: Tom's Hardware


Major tech companies are reconsidering their approaches to AI as rising token costs and growing token demand from AI agents make the spending harder to justify against limited returns.

Why this matters
Why now

The rapid adoption and scaling of AI agents are now translating into significant operational costs for major tech companies, prompting a re-evaluation of current models.

Why it’s important

This development highlights the economic constraints of current AI paradigms and forces a rethink of AI investment strategies, potentially slowing deployment or shifting spending toward more efficient architectures.

What changes

The earlier assumption that AI costs would scale linearly, or fall as efficiency improved, is being challenged. Attention is shifting to cost per token and to the return on investment of individual AI applications.
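The arithmetic behind that shift can be sketched in a few lines. This is an illustrative calculation only: the 24x demand multiplier is the figure attributed to the Goldman Sachs report in the headline, while the baseline token volume, the per-million-token price, and the function names are hypothetical values chosen for the example, not data from the report.

```python
def monthly_token_cost(tokens_per_month: float, price_per_million_tokens: float) -> float:
    """Spend for one month, given token volume and a per-million-token price."""
    return tokens_per_month / 1_000_000 * price_per_million_tokens


def break_even_price_cut(demand_multiplier: float) -> float:
    """Fractional per-token price cut needed to keep spend flat
    when token demand grows by `demand_multiplier`."""
    return 1 - 1 / demand_multiplier


# Figure from the headline: agents may raise token demand 24-fold.
AGENT_DEMAND_MULTIPLIER = 24

# Hypothetical inputs, not taken from the report.
baseline_tokens = 500_000_000      # tokens per month before agents
price_per_million = 2.00           # dollars per million tokens

baseline_cost = monthly_token_cost(baseline_tokens, price_per_million)
agent_cost = monthly_token_cost(
    baseline_tokens * AGENT_DEMAND_MULTIPLIER, price_per_million
)
required_cut = break_even_price_cut(AGENT_DEMAND_MULTIPLIER)

print(f"Baseline spend:   ${baseline_cost:,.0f}")
print(f"Spend with agents: ${agent_cost:,.0f}")
print(f"Price cut needed to hold spend flat: {required_cut:.1%}")
```

Under these assumed inputs, spend rises in direct proportion to demand, so a 24-fold increase in tokens would need a per-token price roughly 96% lower just to keep the bill unchanged. That gap is why per-token cost and measurable return become the deciding metrics.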

Winners
  • AI efficiency startups
  • On-device AI solutions
  • Companies with proprietary, optimized AI models
  • Cloud providers offering cost-effective compute
Losers
  • Uber
  • Microsoft
  • AI agent developers with high token usage models
  • Companies with undifferentiated AI offerings
Second-order effects
Direct

Companies will prioritize AI model optimization and the development of more bespoke, efficient agents to manage escalating token costs.

Second

There will be increased investment in AI hardware and software that reduces per-token cost, potentially shifting market share among compute providers.

Third

The economic viability of certain large-scale, general-purpose AI applications might be questioned, leading to a more specialized and targeted deployment of AI agents.

Editorial confidence: 95 / 100 · Structural impact: 70 / 100