Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor
TL;DR
Cursor launched Cloud Agents that provide AI models with full virtual machine access to autonomously write, test, and demonstrate code through video recordings, shifting from simple code generation to end-to-end software engineering workflows.
🖥️ Full-Computer Architecture 3 insights
Brain in a box virtual machine approach
Agents now run on full VMs with complete computer use (pixels in, coordinates out) rather than just reading code, enabled by Autotab integration and proper DevX setup like a human developer.
Autonomous end-to-end testing on dev servers
Agents automatically start dev servers and test changes for 30+ minutes, returning with verified PRs rather than untested code suggestions, using default prompting calibrated to change complexity.
Multi-model synergistic base layer approach
The system leverages strengths from different model providers as base layers, creating outputs better than any single unified model tier could achieve alone.
🎥 Visual Verification System 3 insights
Video demonstrations accelerate code review process
Every agent session generates a video recording showing the implemented feature in action, serving as an entry point for review that is faster than reading large diffs.
Complete VNC remote desktop environment access
Developers get full remote control of the agent's VM to hover, type, and interact with the live environment via VNC before deciding to merge or request iterations.
Zero-prompt intelligent testing strategies
Agents autonomously determine how to test changes—such as opening Chrome DevTools to inject 5,000 characters to test error limits—without explicit human instructions.
🚀 Advanced Agentic Workflows 3 insights
Automated bug reproduction and fix verification
The /repro command enables agents to reproduce bugs on video, fix them, and demonstrate the fix, reducing complex bug resolution from hours to 90-second review cycles.
Parallel agent swarms increase development throughput
The next major unlock involves parallelizing work through swarms of agents to dramatically increase throughput rather than just making single agents faster.
Recursive agent debugging and Datadog integration
Cloud agents can spin up sub-agents to debug themselves using Datadog MCP integration and explore logs, though recursive agent spawning is currently disabled.
Bottom Line
Start using full-computer agents with visual verification workflows immediately, as models like Claude 3.5 Sonnet and Codex 53 have crossed the threshold to autonomously handle end-to-end development including testing and bug reproduction.
More from Latent Space
View all
🔬Top Black Holes Physicist: GPT5 can do Vibe Physics, here's what I found
Physicist Alex Lubyansky discusses how GPT-5 and reasoning models like o3 have achieved superhuman capabilities in theoretical physics, solving the year-long mystery of single minus gluon tree amplitudes and reproducing complex research in minutes rather than months.
The $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition
Applied Intuition is building the unified 'Android for physical machines' to solve OS fragmentation across vehicles and industrial equipment, enabling modern AI deployment through simulation tools, proprietary operating systems, and end-to-end autonomy models with a 1,000-engineer team.
CI/CD Breaks at AI Speed: Tangle, Graphite Stacks, Pro-Model PR Review — Mikhail Parakhin, Shopify
Shopify CTO Mikhail Parakhin reveals that AI agents have achieved nearly 100% daily adoption among developers, driving a 30% month-over-month surge in PR merges that is breaking traditional CI/CD pipelines, and argues that organizations must shift from parallel token-burning agents to high-latency, critique-loop architectures using expensive pro-level models for code review.
🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik
Noetik is tackling the 95% failure rate of cancer clinical trials by training transformers on proprietary multimodal patient tumor data to identify hidden biological subtypes and match therapies to responsive populations, moving beyond simplistic biomarkers and outdated cell lines.