Stanford MS&E435 Economics of the AI Supercycle | Spring 2026 | Applications, Applied AI

Stanford Online

Podcasts | June 05, 2026 | 24.3 thousand views | 49:16

TL;DR

Baseten CEO Tuhin Srivastava explains why AI inference is shifting from frontier models to custom post-trained models as companies scale. The drivers are 70-90% cost savings, latency requirements, and the strategic need to own proprietary data rather than feed it to potential competitors.

💰 The Economic Imperative of Custom Models

The 90/5 spending reversal

While 90-95% of current inference spend flows to frontier models, successful application companies must shift to custom post-trained models to achieve viable gross margins (40-70%).

Cost-performance parity gap

Open-source models lag frontier models by roughly 90 days but cost 70-90% less to run, making them economically essential once companies reach scale.

Existential scaling pressure

High-volume AI applications currently run negative gross margins on frontier APIs, making the transition to optimized custom models a business-critical necessity for survival.
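The margin arithmetic behind these three points can be sketched in a few lines of Python. This is an illustration only: the revenue and frontier cost figures are hypothetical, the function names are invented for this example, and only the 70-90% savings range comes from the talk. It also treats inference as the only cost of revenue, which a real business would not.

```python
def gross_margin(revenue: float, inference_cost: float) -> float:
    """Gross margin as a fraction of revenue: (revenue - cost) / revenue."""
    if revenue <= 0:
        raise ValueError("revenue must be positive")
    return (revenue - inference_cost) / revenue


def custom_model_cost(frontier_cost: float, savings: float) -> float:
    """Inference cost after moving to a model that is `savings` cheaper to run."""
    if not 0.0 <= savings <= 1.0:
        raise ValueError("savings must be between 0 and 1")
    return frontier_cost * (1.0 - savings)


# Hypothetical figures: each $1.00 of revenue costs $1.20 to serve on a
# frontier API, i.e. the application runs a negative gross margin.
revenue = 1.00
frontier_cost = 1.20

print(f"frontier API: {gross_margin(revenue, frontier_cost):.0%}")
for savings in (0.70, 0.90):  # the 70-90% range cited in the talk
    cost = custom_model_cost(frontier_cost, savings)
    print(f"custom model at {savings:.0%} savings: {gross_margin(revenue, cost):.0%}")
```

Under these assumed numbers, the same product moves from a negative margin to a clearly positive one without any change in price, which is the scaling pressure the section describes.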

⚡ Infrastructure Differentiation

Optimization as managed service

Unlike raw cloud providers (AWS, GCP, CoreWeave), Baseten handles complex performance optimizations itself, reducing latency for multi-model workflows such as Wispr Flow's speech-to-text pipeline.

Multi-cloud resilience

The platform provides fault-tolerant inference across multiple cloud providers, ensuring reliability without customers managing infrastructure complexity themselves.

Developer experience layer

Baseten offers the integrated security, observability, and flexibility that application companies need but cannot easily build on top of commodity compute offerings.

🔒 Owning Your Intelligence

Defensibility through data ownership

Companies must 'own their intelligence' by post-training on proprietary workflow data to prevent frontier labs from capturing unique user signals and competing directly against them.

End-to-end post-training workflow

Baseten enables customers to define utility functions (e.g., minimizing medical transcription errors), provide proprietary datasets, and deploy specialized models without managing ML infrastructure.

Avoiding the East India Company trap

Relying on frontier APIs lets vendors 'post-train against' customer workflows using the data those customers share, whereas custom models keep that strategic advantage proprietary.
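As a rough illustration of what a utility function for the medical transcription example might look like, the sketch below scores a model by word error rate (WER) against reference transcripts. The choice of WER, the function names, and the sample sentences are all assumptions made for this example; the talk does not specify which metric or interface customers actually use.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance between the two strings, divided by reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def utility(pairs: list[tuple[str, str]]) -> float:
    """Higher is better: the negative mean WER over (reference, hypothesis) pairs."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    return -sum(word_error_rate(ref, hyp) for ref, hyp in pairs) / len(pairs)


# Hypothetical evaluation pair: one wrong word out of five.
pairs = [("patient takes ten milligrams daily", "patient takes two milligrams daily")]
print(utility(pairs))
```

A post-training run would then aim to raise this score on held-out proprietary transcripts. In a clinical setting a real utility function would likely weight errors in drug names and dosages more heavily than errors in filler words.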

🏢 Business Model & Market Position

Powering scaled AI applications

Baseten runs mission-critical infrastructure for companies like Wispr Flow and Abridge, operating 20+ specialized models per customer at once under strict reliability requirements.

Pricing model evolution

Baseten currently monetizes through a markup on GPU compute (H100/B200) that reflects the value of its software stack, and is transitioning toward token-based pricing to demonstrate clear savings versus frontier APIs.
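To show why token-based pricing makes savings easier to see, the sketch below converts a marked-up GPU-hour price into an effective price per million tokens and compares it with a per-token API price. Every number here is hypothetical, including the GPU-hour price, the throughput, and the API price, and the function names are invented for this example. None of them are Baseten's or any vendor's actual rates.

```python
def cost_per_million_tokens(gpu_hour_price: float, tokens_per_second: float) -> float:
    """Effective price per one million tokens for a GPU billed by the hour."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return gpu_hour_price / tokens_per_hour * 1_000_000


def savings_vs_api(custom_price: float, api_price: float) -> float:
    """Fractional savings of the custom deployment relative to the API price."""
    if api_price <= 0:
        raise ValueError("api_price must be positive")
    return 1.0 - custom_price / api_price


# Hypothetical inputs, chosen only to make the unit conversion concrete.
gpu_hour_price = 4.00       # USD per GPU-hour, markup included
tokens_per_second = 2500.0  # sustained throughput of the optimized model
api_price = 3.00            # USD per million tokens on a frontier API

custom_price = cost_per_million_tokens(gpu_hour_price, tokens_per_second)
print(f"custom: ${custom_price:.2f} per million tokens")
print(f"savings vs API: {savings_vs_api(custom_price, api_price):.0%}")
```

The conversion depends heavily on sustained throughput and utilization, so a GPU that sits partly idle has a higher effective per-token price than this calculation suggests.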

Bottom Line

To build a defensible, profitable AI business at scale, companies must transition from frontier APIs to owning their intelligence through post-trained custom models optimized for their specific workflows and data.
