Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence
TL;DR
Jensen Huang argues that computing is undergoing its first fundamental reinvention in 60 years, shifting from pre-recorded, general-purpose, on-demand processing to generated, accelerated, continuously running agentic systems. He reveals that NVIDIA achieved a 1-million-x speedup over the last decade through extreme 'co-design' of hardware, software, and algorithms, fundamentally outpacing Moore's Law.
🔄 The Reinvention of Computing
From Pre-recorded to Generated
Computing is shifting from retrieving static, pre-recorded content to generating contextually relevant responses in real time based on user intention rather than explicit instruction.
The End of General-Purpose Architecture
The 60-year era of general-purpose computing (modeled on the IBM System/360) is ending as AI demands specialized, accelerated architectures rather than one-size-fits-all processors.
The Rise of Agentic Systems
Future computing will be continuous and agentic rather than on-demand, requiring a complete rethink of cloud services, personal computers, and how software is deployed and utilized.
⚡ Co-Design & The Million-X Advantage
Co-Design Across the Stack
True performance gains require simultaneously co-designing algorithms, compilers, frameworks, and chip architecture rather than optimizing hardware and software layers independently.
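As a rough, hand-rolled illustration of the idea (not NVIDIA's actual stack), the sketch below computes the same matrix product two ways in Python with NumPy: once as scalar Python loops that ignore the hardware, and once as a single large matrix multiply handed to an optimized BLAS kernel, the form that vectorized and accelerated hardware is built to execute. All function and variable names here are illustrative only.

```python
# Toy illustration of "shaping the software to the hardware" (not NVIDIA's stack).
import time
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((512, 1024)).astype(np.float32)   # a batch of inputs
w = rng.standard_normal((1024, 1024)).astype(np.float32)  # a weight matrix

# Hardware-oblivious: triple-nested Python loops, one scalar multiply at a time.
def naive_matmul(a, b):
    out = np.zeros((a.shape[0], b.shape[1]), dtype=np.float32)
    for i in range(a.shape[0]):
        for j in range(b.shape[1]):
            s = 0.0
            for k in range(a.shape[1]):
                s += a[i, k] * b[k, j]
            out[i, j] = s
    return out

# Hardware-aware: hand the whole problem to an optimized BLAS kernel at once.
t0 = time.perf_counter()
y_fast = x @ w
t_fast = time.perf_counter() - t0

t0 = time.perf_counter()
y_slow = naive_matmul(x[:8], w)   # only 8 rows, or the loop version takes minutes
t_slow = time.perf_counter() - t0

print(f"BLAS matmul (512 rows): {t_fast * 1e3:.1f} ms")
print(f"Python loops (8 rows):  {t_slow * 1e3:.1f} ms")
print("Results agree:", np.allclose(y_fast[:8], y_slow, atol=1e-2))
```

The point of the sketch is that the math is identical in both paths; only the expression of the work changes, and co-design extends that same trade across compilers, frameworks, and the silicon itself.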
1 Million-X Speedup vs. Moore's Law
While traditional Moore's Law offered only about a 10x improvement over the last decade due to the end of Dennard scaling, NVIDIA's extreme co-design approach delivered approximately 1,000,000x acceleration over the same period.
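The gap is easier to see when the trajectories are annualized. In the back-of-the-envelope sketch below, the 10x and 1,000,000x decade-scale figures come from the talk; the classic Moore's Law baseline and the per-year rates are derived arithmetic added here for comparison.

```python
# Back-of-the-envelope: compound each decade-scale gain into a per-year factor.
decade_years = 10

moores_law_classic = 2 ** (decade_years / 2)   # doubling every ~2 years -> ~32x per decade
post_dennard = 10                              # roughly what general-purpose CPUs delivered
co_design = 1_000_000                          # the claimed full-stack speedup

for label, decade_gain in [
    ("Classic Moore's Law (~2x every 2 yrs)", moores_law_classic),
    ("Post-Dennard general-purpose CPUs", post_dennard),
    ("Extreme co-design (claimed)", co_design),
]:
    per_year = decade_gain ** (1 / decade_years)
    print(f"{label}: {decade_gain:,.0f}x per decade = {per_year:.2f}x per year")

# Approximate output:
#   Classic Moore's Law (~2x every 2 yrs): 32x per decade = 1.41x per year
#   Post-Dennard general-purpose CPUs: 10x per decade = 1.26x per year
#   Extreme co-design (claimed): 1,000,000x per decade = 3.98x per year
```

In other words, the claim amounts to roughly quadrupling effective performance every year, versus the ~1.3x per year that post-Dennard general-purpose hardware alone could sustain.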
Enabling Internet-Scale Training
This massive speedup made it practical for AI researchers to train models on all of the world's data rather than curating small datasets, unlocking capabilities like self-driving vehicles.
🎓 AI's Impact on Education & Development
AI as a Learning Infrastructure
Education must integrate AI not just as a subject to study but as a core tool within the curriculum: students can use it to read papers and summarize research, treating it as a personalized super-researcher.
First Principles Still Matter
Despite rapid technological change, foundational computer science principles remain essential for understanding how systems evolve, even as specific implementation details become obsolete.
Strategic Use of Open vs. Proprietary
Organizations should use frontier proprietary models from OpenAI and Anthropic for immediate productivity while supporting open-source models to advance the codification of human intelligence and information representation.
Bottom Line
To compete in the age of AI, abandon general-purpose computing assumptions and adopt extreme co-design across the entire hardware-software stack, while treating AI as continuous infrastructure rather than an on-demand tool.
More from Stanford Online
Stanford CS153 Frontier Systems | Scott Nolan from General Matter on Energy Bottlenecks
General Matter CEO Scott Nolan argues that energy has superseded compute as the primary bottleneck for AI scaling, requiring an urgent shift from exhausted 'stranded' power to massive new baseload generation—specifically nuclear energy—which is itself constrained by uranium enrichment supply chains.
Stanford Robotics Seminar ENGR319 | Spring 2026 | Unlocking Autonomous Medical Robotics
This seminar outlines a roadmap for autonomous surgical robotics to address critical healthcare labor shortages, proposing a physics-based approach built on four pillars—perception, modeling, planning, and control—that achieves sub-2mm precision through real-time digital twinning rather than relying on data-scarce foundation models.
Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 10: Inference
Inference now dominates AI economics, with OpenAI generating 8.6 trillion tokens daily—exceeding frontier model training compute in under four days. Unlike training, autoregressive inference cannot parallelize across sequences, making it fundamentally memory-bandwidth bound rather than compute bound, with batch sizes under 295 on H100s failing to saturate GPU capacity.
Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 5 - Architectures
This lecture transitions from theoretical foundations to practical architecture design for diffusion models, explaining how U-Net structures leverage convolutional inductive biases, hierarchical downsampling for global context, and skip connections to preserve local details while maintaining strict dimensional requirements for iterative denoising.