Stanford MS&E435 Economics of the AI Supercycle | Spring 2026 | Infrastructure, Capstone Case
TL;DR
Sachin Katti, OpenAI's head of industrial compute, details the infrastructure economics driving the AI supercycle, explaining how the company plans to scale to 30 gigawatts by 2030 while navigating the shift from training to inference-heavy agentic workloads and managing massive energy and supply chain constraints.
📈 Compute Scaling & Economic Drivers 4 insights
30 GW target by 2030
OpenAI has set an aspirational goal to reach 30 gigawatts of compute capacity by decade's end, split between research and products, to ensure researchers remain unconstrained in exploring new models.
Revenue tracks compute capacity
At frontier labs, revenue functions as a lagging indicator of compute capacity and utilization, with OpenAI tripling both compute and revenue year-over-year for the past three years.
Inference dominates future workloads
Over 80% of compute will shift to inference workloads, encompassing not just product usage but also post-training RL and synthetic data generation as scaling laws evolve across the entire AI lifecycle.
Three-dimensional cost optimization
OpenAI simultaneously pushes to make tokens cheaper, make each token more intelligent, and reduce the number of tokens required per task to maximize accessibility while managing infrastructure costs.
⚡ Infrastructure & Supply Chain Bottlenecks 4 insights
Gigawatt-scale orchestration
Building compute at scale requires synchronizing an entire supply chain including chips, memory, networking, power generation, cooling systems, data center construction, and land acquisition to align simultaneously.
Operational fragility post-deployment
The hardest work begins after contracts are signed, as modern GPUs are extremely brittle and sensitive to power fluctuations and cooling variations, requiring constant engineering to maintain performance at scale.
Grid stability risks
Large synchronized training jobs can cause hundreds of megawatt fluctuations on regional grids within minutes, creating blackout risks that require redesigned infrastructure to prevent collateral damage to national energy systems.
Energy diversification imperative
OpenAI is actively derisking supply chains by relocating manufacturing, decoupling from grid dependency through natural gas and nuclear energy investments, and pioneering infrastructure innovations with broader societal applications.
🤖 The Evolution to Agentic AI 4 insights
From chatbots to agents
AI workloads have evolved from simple one-shot chatbot inference to reasoning models, and now to agentic systems that autonomously execute actions, spin up VMs, and iterate through tasks rather than merely suggesting answers.
Complex compute graphs
Agentic AI requires dramatically more complex compute architectures that close the loop between thinking and doing, involving multiple tool integrations and iterative processing rather than single-response generation.
CPU resurgence
CPUs are making a comeback specifically for agentic workloads, contributing to Intel's recent momentum alongside supply constraints that benefit the only remaining leading-edge American manufacturer with fabrication capabilities.
7 terawatt vision
The long-term vision involves universal GPU access for 7 billion humans, requiring approximately 7 terawatts of compute—two orders of magnitude beyond current targets—suggesting infrastructure scaling has barely begun.
Bottom Line
The AI industry's immediate constraint is not model capability but industrial infrastructure—success requires orchestrating unprecedented energy, supply chain, and hardware logistics to deliver compute at a scale that makes universal intelligence accessible.
More from Stanford Online
View all
Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 16: Post-Training - RLVR
This lecture explains why RLHF hits overoptimization limits with learned reward models, and how RLVR (Reinforcement Learning from Verifiable Rewards) enables unlimited compute scaling on verifiable tasks like math and coding through simpler algorithms like GRPO.
Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training
This lecture explains how post-training transforms raw pre-trained models like GPT-3 into instruction-following systems like ChatGPT through supervised fine-tuning and reinforcement learning, emphasizing that high-quality data curation matters more than algorithmic sophistication.
Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 14: Data
This lecture details the pre-training data pipeline, covering the transformation of raw HTML and PDFs into linear text and classifier-based filtering strategies to curate domain-specific datasets, while emphasizing the strategic trade-off between data quality and training duration.
Stanford CS25: Transformers United V6 I Advancing Science and Medicine with Collaborative AI Agents
Google DeepMind researcher Vivek Natarajan discusses the development of Co-Scientist, an AI system designed to act as a collaborative partner for scientific discovery by moving beyond fast System 1 thinking to rigorous System 2 reasoning, emphasizing that true scientific AI requires the generality of human cognition rather than narrow specialization.