Back to Wire
Google's AI Token Processing Grows 52x, Serving Costs Plummet
Business

Google's AI Token Processing Grows 52x, Serving Costs Plummet

Source: Tomtunguz Original Author: Tomasz Tunguz 2 min read Intelligence Analysis by Gemini

Sonic Intelligence

00:00 / 00:00
Signal Summary

Google's Gemini now processes over 10 billion tokens per minute, a 52x year-over-year increase, while serving costs dropped 78%.

Explain Like I'm Five

"Imagine Google's AI brain is getting super fast and cheap! It can now think 52 times faster than last year, and it costs much less to run."

Original Reporting
Tomtunguz

Read the original article for full context.

Read Article at Source

Deep Intelligence Analysis

Google's Q4 2025 earnings call revealed a substantial acceleration in its AI capabilities, with Gemini now processing over 10 billion tokens per minute, a 52x increase year-over-year. This growth is coupled with a 78% reduction in Gemini serving unit costs, indicating significant improvements in efficiency. The company's 2026 CapEx investments are projected to be in the range of $175 to $180 billion, signaling a strong commitment to expanding its AI infrastructure. This level of investment, if mirrored by other hyperscalers, could drive data center CapEx to between $500B and $750B this year. The growth in token processing and cost reduction has direct revenue implications, with Google Cloud revenue growing 48% to $17.7 billion. While Microsoft reports a higher number of customers processing over 1 trillion tokens annually, Google's growth rate and cost efficiencies suggest a competitive advantage. The comparison to historical infrastructure spending, such as the railroad era and the national highway system, puts the current AI data center buildout into perspective. The long-term implications of this rapid growth and investment include the potential for more affordable and accessible AI services, but also the risk of increased market concentration and potential misuse of AI technology.
AI-assisted intelligence report · EU AI Act Art. 50 compliant

Impact Assessment

Google's massive growth in AI token processing and cost reduction highlights the rapid advancement and increasing efficiency of AI infrastructure. This impacts the competitive landscape and the accessibility of AI services.

Key Details

  • Gemini processes over 10 billion tokens per minute.
  • Google lowered Gemini serving unit costs by 78%.
  • Google's 2026 CapEx investments are anticipated to be $175-180 billion.

Optimistic Outlook

The dramatic reduction in serving costs could lead to more affordable and accessible AI services for businesses and consumers. Google's significant CapEx investments signal a strong commitment to AI and could drive further innovation and growth in the field.

Pessimistic Outlook

The intense capital expenditure required for AI infrastructure could create a barrier to entry for smaller players. The concentration of AI power in the hands of a few hyperscalers raises concerns about market dominance and potential misuse of AI technology.

Stay on the wire

Get the next signal in your inbox.

One concise weekly briefing with direct source links, fast analysis, and no inbox clutter.

Free. Unsubscribe anytime.

Continue reading

More reporting around this signal.

Related coverage selected to keep the thread going without dropping you into another card wall.