Robots Are Finally Starting to Work

| Business & Entrepreneurship | April 16, 2026 | 67.7 Thousand views | 49:27

TL;DR

Physical Intelligence co-founder Quan Vang explains why robotics is approaching its 'GPT-1 moment,' where cross-embodiment AI models trained on diverse hardware are beginning to exhibit emergent zero-shot capabilities and scaling laws previously unseen in the field.

🤖 The Cross-Embodiment Breakthrough 3 insights

Generalist models outperform specialists

The Open X-Embodiment paper demonstrated that a single policy trained across 10 different robot platforms performed 50% better than individual models optimized for specific hardware, proving scaling laws apply to robotics.

Cross-embodiment solves hardware drift

Unlike single-robot approaches that suffer from hardware variations and software changes every few months, training on diverse platforms forces models to learn abstract 'robot control' rather than specific motor mappings.

Emergent zero-shot capabilities appearing

Current models can now perform complex precision tasks and multi-object reasoning zero-shot—capabilities that required hundreds of hours of data collection just last year.

🧠 From Language Models to Physical Control 3 insights

Vision-language models enable semantic control

Research like RT2 and PaLM-E showed that adapting powerful vision-language models to output robot actions transfers internet-scale knowledge to low-level control, allowing commands like 'move the can to Taylor Swift' without specific training.

SayCan bridged planning and semantics

SayCan was the first demonstration of using language models for robotic planning, leveraging common sense knowledge to reduce the need for task-specific robot data.

The three-pillar hierarchy

Robotics requires solving semantics (now unlocked via LLMs), planning, and real-time control—the latter being the final frontier where vision-language models are now making breakthroughs.

📊 Deployment Strategy & Data Reality 3 insights

The data capture problem vs. generation problem

While robotic data is constantly generated in labs and industry, there has been no incentive to capture it in standardized formats; Open X-Embodiment was the first effort to aggregate multi-robot data but remains 'a drop in the bucket' compared to what's needed.

Mixed autonomy enables immediate scaling

Current systems are deployable today using a 'peeling an onion' approach: deploy a strong base model with human oversight for edge cases, then improve through real-world exposure rather than waiting for full autonomy.

Real-world validation with YC companies

Physical Intelligence partnered with Weave to deploy laundry-folding robots in commercial laundromats, demonstrating that current models can handle deformable objects and unstructured environments with human assistance.

Bottom Line

Founders should start building vertical robotics applications now using mixed autonomy systems, as cross-embodiment AI models are rapidly generalizing across hardware and approaching the capability to perform complex physical tasks without task-specific training.

More from Y Combinator

View all
Inference, Diffusion, World Models, and More | YC Paper Club
1:07:19
Y Combinator Y Combinator

Inference, Diffusion, World Models, and More | YC Paper Club

At the inaugural YC Paper Club, Stanford researcher Tanishk presented Speculative Speculative Decoding (SSD), arguing that inference speed is becoming the primary constraint on AI capabilities rather than just a cost factor. The technique achieves 300 tokens per second on Llama 3 70B by parallelizing the drafting and verification steps of speculative decoding, effectively predicting verification outcomes to hide latency.

6 days ago · 7 points
How to Build Superintelligence Inside Your Company
46:30
Y Combinator Y Combinator

How to Build Superintelligence Inside Your Company

Y Combinator has transformed into an AI-native organization by building a shared 'organizational brain'—a centralized database and registry of 350+ internal tools that allows non-technical teams to encode workflows in English rather than code, moving beyond single-player coding agents to true organizational superintelligence.

7 days ago · 9 points
Why Good Companies Go Bad (And How to Stop It)
50:05
Y Combinator Y Combinator

Why Good Companies Go Bad (And How to Stop It)

Eric Ries exposes how standard corporate governance systematically destroys founder value, arguing that 'shareholder primacy' rules and expiring voting controls inevitably lead to mission drift, while offering frameworks for building enduring mission-controlled companies.

12 days ago · 6 points
Personal AI Is the New Personal Computer
41:30
Y Combinator Y Combinator

Personal AI Is the New Personal Computer

Y Combinator CEO Gary Tan details his return to software engineering after a 13-year hiatus, shipping hundreds of thousands of lines of code while running YC full-time by leveraging AI coding tools and developing "token maxing" methodologies that transform exhaustive research and development tasks into solo weekend projects.

26 days ago · 10 points