NVIDIA's AI Engineers: Brev, Dynamo and Agent Inference at Planetary Scale and Speed of Light

Latent Space

| Podcasts | March 08, 2026 | 6.14 Thousand views | 1:26:00

TL;DR

NVIDIA engineers discuss securing AI agents through the 'two of three' capability rule, the evolution of Brev from startup to NVIDIA's developer experience layer, and how DGX Spark bridges local and cloud GPU workflows for a broader developer audience.

🔒 Agent Security Architecture 2 insights

The Two-of-Three Agent Capability Rule

AI agents should only be granted two of three capabilities—file access, internet access, and code execution—to prevent security vulnerabilities like malware injection.

Enforcement Points for Agent Access Control

Organizations must implement strict enforcement points that restrict agent permissions based on specific functional needs rather than providing blanket system access.

🚀 Democratizing GPU Infrastructure 2 insights

Brev's One-Click GPU Provisioning Philosophy

Brev was designed to replace complex multi-page cloud GPU forms with immediate SSH access, making hardware like A100s accessible with minimal friction.

Expanding Developer Access Beyond CUDA Experts

NVIDIA is reinventing developer experience for a broader AI audience—including those unfamiliar with CUDA—through tools like launchables that enable one-click software deployment.

🖥️ Unified Local-Cloud Workflows 2 insights

Remote Cloud Management for DGX Spark

Users can register local DGX Spark devices with Brev to enable remote cloud-like access from anywhere via NVIDIA Sync, turning home hardware into managed nodes.

Isolated Sandboxes for Experimental AI Agents

NVIDIA security teams recommend running experimental autonomous agents like OpenClaw on Brev's isolated cloud VMs rather than corporate networks to maintain security boundaries.

🎯 Developer-First Culture 2 insights

Executive Technical Engagement in Product Development

NVIDIA leadership maintains deep technical involvement with VPs actively using developer tools like Cursor and working closely with engineering on hardware-software integration.

Authentic Marketing Stunts Build Developer Trust

Brev's memorable GTC marketing stunts with surfboards and foil-printed GPU cards demonstrated authentic developer engagement that continues to influence NVIDIA's outreach strategy.

Bottom Line

Restrict AI agents to only two of three critical capabilities (file access, internet, code execution) and deploy them in isolated GPU sandboxes like Brev to safely experiment with autonomous tools while maintaining security

Watch on YouTube

More from Latent Space

The Agent Cloud: Databricks’ Bet on the Future of AI — Matei Zaharia and Reynold Xin

Latent Space

The Agent Cloud: Databricks’ Bet on the Future of AI — Matei Zaharia and Reynold Xin

Matei Zaharia and Reynold Xin detail Databricks' open-source 'Agent Cloud' platform (Omnigen), arguing that standardized protocols and persistent infrastructure—not just better models—will determine which enterprises successfully deploy collaborative, secure AI agents at scale.

about 5 hours ago · 9 points

AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan

Latent Space

AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan

Gray Swan co-founders Zico Kolter and Matt Fredrikson explain why AI systems require a fundamentally different security approach than traditional software, highlighting how their automated red teaming system 'Shade' has begun to outperform human experts at finding model vulnerabilities. They emphasize the urgent need to treat AI agents as inherently untrusted entities capable of correlated failures across the software ecosystem.

2 days ago · 8 points

⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai

Latent Space

⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai

Ronak Malde explains leaving DeepMind (and $2 billion in acquisition earnings) to found Trajectory.ai, arguing that AI products must evolve from static tools into "living systems" that continually learn from real-world user corrections across enterprise verticals like legal and finance.

3 days ago · 9 points

The AI Frontier: from FLOPs to Megawatts — Anjney Midha, AMP

Latent Space

The AI Frontier: from FLOPs to Megawatts — Anjney Midha, AMP

Anjney Midha argues that AI infrastructure is facing a crisis of inefficiency and cultural misalignment, proposing that compute be treated as a utility through an Independent System Operator model that pools multi-cloud resources while embedding community incentives directly into unit economics.

6 days ago · 10 points

Browse more: 🎙️ Podcasts All Videos All Categories