Stanford Robotics Seminar ENGR319 | Winter 2026 | Autonomous Navigation in Outdoor Environments
TL;DR
This Stanford seminar presents autonomous navigation systems that use Gaussian splatting and vision-language models to address traversability estimation and social compliance in outdoor environments, and argues for a shift toward companion robots that assist aging populations with mobility and health monitoring.
🏞️ Terrain-Aware Navigation 3 insights
Robot-Specific Traversability Analysis
Navigation requirements vary significantly by platform: wheeled robots require flat surfaces, while legged robots can traverse curbs, stairs, and vegetation depending on their size and weight.
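The platform-dependent nature of traversability can be made concrete with a minimal sketch. The `Platform` type and its thresholds below are hypothetical, not from the talk; they simply encode that the same obstacle (a curb) is impassable for one platform and trivial for another.

```python
from dataclasses import dataclass

@dataclass
class Platform:
    """Hypothetical platform spec; fields and values are illustrative."""
    is_legged: bool
    max_step_height_m: float  # tallest ledge the platform can climb
    max_slope_deg: float      # steepest incline it can handle

def traversable(platform: Platform, step_height_m: float, slope_deg: float) -> bool:
    """A terrain cell is traversable only within the platform's limits."""
    if slope_deg > platform.max_slope_deg:
        return False
    return step_height_m <= platform.max_step_height_m

wheeled = Platform(is_legged=False, max_step_height_m=0.03, max_slope_deg=10)
legged = Platform(is_legged=True, max_step_height_m=0.18, max_slope_deg=30)

# A 15 cm curb: impassable for the wheeled base, fine for the legged one.
print(traversable(wheeled, 0.15, 5))  # False
print(traversable(legged, 0.15, 5))   # True
```

A real system would evaluate such checks per map cell against the semantic terrain map rather than per hand-entered obstacle.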
Gaussian Splatting with Physical Properties
The system renders semantic environments using Gaussian splatting fused with LiDAR Euclidean Signed Distance Fields, jointly estimating material types and physical properties including friction, hardness, stiffness, and density.
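Once per-cell physical properties are estimated, they can feed a traversal cost. The weighting below is an illustrative sketch, not the talk's actual cost function: it assumes properties normalized to [0, 1] and penalizes slippery (low friction) and soft (low stiffness) terrain.

```python
def traversal_cost(friction: float, stiffness: float, hardness: float,
                   w_f: float = 1.0, w_s: float = 0.5, w_h: float = 0.2) -> float:
    """Illustrative cost: higher for slippery or soft terrain.
    Inputs are assumed normalized to [0, 1]; weights are made up."""
    return (w_f * (1.0 - friction)
            + w_s * (1.0 - stiffness)
            + w_h * (1.0 - hardness))

pavement = traversal_cost(friction=0.9, stiffness=0.95, hardness=0.9)
mud = traversal_cost(friction=0.2, stiffness=0.3, hardness=0.2)
print(pavement < mud)  # True
```

A planner would then prefer low-cost cells, so the same map yields different routes for different weightings (e.g., a heavy robot might up-weight stiffness).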
VLM-Based Trajectory Selection
Multiple trajectory candidates are generated via an encoder-decoder mechanism with Gaussian projections, then ranked and selected by a Vision Language Model based on goal proximity, outperforming purely learning-based approaches such as NoMaD.
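The rank-and-select step can be sketched as scoring each candidate by a VLM preference minus a goal-distance penalty. The `vlm_score` callable below is a stub standing in for the real model query; the scoring formula is an assumption for illustration.

```python
import math

def goal_proximity(traj, goal):
    """Distance from a trajectory's final waypoint to the goal (lower is better)."""
    x, y = traj[-1]
    return math.hypot(goal[0] - x, goal[1] - y)

def select_trajectory(candidates, goal, vlm_score):
    """Rank candidates by a VLM preference score minus a goal-distance
    penalty, then return the best. `vlm_score` stubs the real VLM call."""
    return max(candidates, key=lambda t: vlm_score(t) - goal_proximity(t, goal))

goal = (10.0, 0.0)
candidates = [
    [(0, 0), (3, 0), (6, 0)],   # heads toward the goal
    [(0, 0), (0, 3), (0, 6)],   # veers away
]
# Stub: a real system would prompt the VLM with rendered trajectory overlays.
best = select_trajectory(candidates, goal, vlm_score=lambda t: 0.0)
print(best[-1])  # (6, 0)
```

With a non-trivial `vlm_score`, the model can veto geometrically shortest candidates that cross bad terrain or violate social norms.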
👥 Social & Traffic Compliance 3 insights
Three-Stage Social Navigation Framework
The task is decomposed into perception (identifying pedestrians/groups), prediction (forecasting motion), and action (planning compliant movements) to ensure pedestrian comfort and traffic rule adherence.
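The three-stage decomposition maps naturally onto a pipeline of pluggable functions. The stubs below are purely illustrative (a real system would use detectors and learned motion models); only the stage structure reflects the talk.

```python
def social_nav_step(frame, perceive, predict, plan):
    """One control step decomposed into the talk's three stages.
    Each stage is a pluggable function; the stubs below are illustrative."""
    pedestrians = perceive(frame)     # perception: find pedestrians/groups
    forecasts = predict(pedestrians)  # prediction: forecast their motion
    return plan(forecasts)            # action: choose a compliant move

# Illustrative stubs, not real models.
perceive = lambda frame: [d for d in frame if d["type"] == "person"]
predict = lambda people: [{**p, "next_x": p["x"] + p["vx"]} for p in people]
plan = lambda fc: "yield" if any(f["next_x"] < 1.0 for f in fc) else "proceed"

frame = [{"type": "person", "x": 1.5, "vx": -1.0},
         {"type": "car", "x": 5.0, "vx": 0.0}]
print(social_nav_step(frame, perceive, predict, plan))  # yield
```

The decomposition matters because each stage can be evaluated and swapped independently, e.g., replacing the prediction stub with a fine-tuned VLM.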
Fine-Tuned Vision Language Models
Fine-tuned LLaVA models outperform ChatGPT and Gemini on social navigation benchmarks, particularly in predicting pedestrian behavior and avoiding actions such as cutting through groups or racing against people.
Traffic Context Understanding
Robots must interpret traffic lights, right-of-way rules, and vehicle interactions to navigate urban environments compliantly rather than simply avoiding obstacles.
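Compliance goes beyond geometry: even an obstacle-free crossing is forbidden against the signal. A minimal rule check (entirely illustrative; the talk does not specify an implementation) might look like:

```python
def crossing_action(light: str, vehicle_approaching: bool) -> str:
    """Illustrative rule: cross only on a walk signal with no conflicting
    vehicle, regardless of whether the path is physically clear."""
    if light != "walk":
        return "wait"
    return "wait" if vehicle_approaching else "cross"

print(crossing_action("walk", vehicle_approaching=False))  # cross
print(crossing_action("walk", vehicle_approaching=True))   # wait
```

Real systems must also infer signal state and right-of-way from perception, which is where the VLM-based context understanding comes in.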
👴 Companion Robots for Aging Populations 3 insights
Demographic Imperative for Elder Care
With 55 million Americans aged 65+ in 2020 and projections that seniors will make up 20% of the U.S. population by 2030, companion robots aim to enhance independence through mobility assistance and health monitoring.
Adaptive Pace-Moving Navigation
Unlike traditional social navigation that avoids humans, companion robots must adaptively match walking pace with older adults, process language instructions, and provide verbal alerts about hazardous terrains.
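Pace matching can be sketched as a simple controller that tracks the human's speed while correcting the following gap. The proportional form and all gains below are assumptions for illustration, not the talk's controller.

```python
def pace_command(human_speed: float, desired_gap_m: float, actual_gap_m: float,
                 k_gap: float = 0.5, max_speed: float = 1.6) -> float:
    """Illustrative pace controller: follow the human's walking speed and
    close the gap error with a proportional term. Gains are made up."""
    cmd = human_speed + k_gap * (actual_gap_m - desired_gap_m)
    return max(0.0, min(max_speed, cmd))  # clamp to the robot's limits

# Human strolls at 0.8 m/s; robot has drifted 0.4 m behind its 1.0 m slot,
# so it briefly speeds up to 1.0 m/s to catch up.
print(pace_command(0.8, desired_gap_m=1.0, actual_gap_m=1.4))  # 1.0
```

Unlike avoidance-style social navigation, the setpoint here comes from the person being accompanied, which is the key behavioral difference the talk highlights.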
Behavior Analysis and Fall Prevention
Robots will monitor movement patterns to correlate motions with disease risks and predict falls while supporting independent exercise, addressing seniors' desire to avoid traditional walkers.
Bottom Line
The field must shift from obstacle-avoiding machines to terrain-aware, socially compliant companion systems that use real-time semantic mapping to adaptively assist aging populations in maintaining independent, active lifestyles.