AI Research Breakthroughs from NVIDIA Research (Hosted by Karoly of Two Minute Papers) | NVIDIA GTC

| Podcasts | April 01, 2026 | 24.1 Thousand views | 40:14

TL;DR

NVIDIA Research unveils breakthroughs shifting AI from imitation to exploration through Reinforcement Learning as Pre-training (RLP), open-sources the Alpamayo reasoning platform for autonomous vehicles, and demonstrates real-time generative world models and neural physics simulators enabling zero-shot sim-to-real robotics transfer.

🧠 Reinventing AI Training Paradigms 2 insights

Reinforcement Learning as Pre-training

Yejin Choi introduced RLP, which injects exploration and reasoning into the pre-training phase rather than limiting it to post-training RLHF, allowing models to think before predicting tokens instead of merely imitating internet data.

Era of Explorative Learning

This approach hybridizes training stages and enables continual learning where models adapt during deployment time, blurring the line between training and testing to push AI frontiers beyond static imitation.

🚗 Reasoning-Based Autonomous Driving 3 insights

Alpamayo Open Platform Release

Marco Pavone announced the fully open-source Alpamayo ecosystem including Alpamayo-1.5, a 10-billion-parameter chain-of-thought visual-language-action model available on Hugging Face with post-training scripts for customization.

Reasoning-Action Alignment

The team solved embodiment misalignment through post-training coupling to ensure the model's explicit English reasoning traces faithfully reflect its physical driving actions, providing safety signals and trustworthy explainability.

Real-Time Generative Simulation

Alpamayo Dreams enables real-time, interactive simulation of multiple cameras from text prompts, allowing developers to edit weather, add objects like falling mattresses, and test policies in closed-loop generative environments.

🌐 Neural Simulation and World Models 2 insights

Omniverse Neural Reconstruction

Sanja Fidler detailed NewRec's shift from slow graphics engines to 3D Gaussian splatting, converting real driving videos into photorealistic digital environments that run 2 million tests daily for AV validation.

Data-Driven Simulation Future

World models like COSMOS learn simulation from massive visual data, eliminating the scalability ceiling of human-authored content and enabling generation of novel scenarios like pedestrians reacting realistically to near-misses never explicitly programmed.

🤖 Robotics and Sim-to-Real Transfer 2 insights

Neural Robot Dynamics

Yashraj Narang presented NeRD, which simulates 1,024 simultaneous nut-and-bolt contact interactions in real time on a single GPU by learning physics in robot-centric frames, enabling massive parallel reinforcement learning.

Zero-Shot Real World Deployment

Policies trained purely in NeRD transfer zero-shot to physical tasks including assembling NVIDIA GB300 superchips and managing cables, extending to industrial assembly and autonomous scientific lab automation.

Bottom Line

The future of AI development lies in open-source, reasoning-capable models combined with real-time generative simulation, enabling robots and autonomous systems to learn complex physical tasks through exploration rather than imitation and deploy safely with zero-shot transfer from simulation to reality.

More from NVIDIA AI Podcast

View all
Build Video Analytics AI Agents with Skills
59:53
NVIDIA AI Podcast NVIDIA AI Podcast

Build Video Analytics AI Agents with Skills

NVIDIA introduces the Video Search and Summarization (VSS) blueprint for building vision AI agents that process billions of camera streams using vision language models and a new 'skills' framework, enabling deep video search and summarization 60x faster than manual review.

8 days ago · 9 points
Ask the Experts: Nemotron 3 Nano Omni | Nemotron Labs
48:56
NVIDIA AI Podcast NVIDIA AI Podcast

Ask the Experts: Nemotron 3 Nano Omni | Nemotron Labs

NVIDIA researchers detail the development of Nemotron 3 Nano Omni, explaining how they evolved a text-only model into a multimodal system capable of processing vision, audio, and video through progressive training stages while maintaining the hybrid Mamba-Transformer architecture.

9 days ago · 10 points
Apr 14 - Jetson AI Lab Research Group Call - Tensor RT Edge LLM on Jetson & Culture
51:38
NVIDIA AI Podcast NVIDIA AI Podcast

Apr 14 - Jetson AI Lab Research Group Call - Tensor RT Edge LLM on Jetson & Culture

NVIDIA researchers Lynn Chai and Luc introduce TensorRT Edge LLM, a purpose-built inference engine for deploying large language models on Jetson edge devices, showcasing NVFP4 quantization and speculative decoding techniques that achieve up to 7x faster prefill speeds and 500 tokens per second generation while previewing a simplified vLLM-style Python API coming soon.

17 days ago · 10 points
March 10 - Jetson AI Lab Research Group Call - Lightning talks
55:28
NVIDIA AI Podcast NVIDIA AI Podcast

March 10 - Jetson AI Lab Research Group Call - Lightning talks

This Jetson AI Lab Research Group call features lightning talks on open-source hardware for remote Jetson access, a real-time emotional AI engine for robots running entirely on Jetson Nano, and updates to the Jetson AI Lab model repository with new performance benchmarks and deployment guides.

17 days ago · 8 points