Harness Engineering: How to Build Software When Humans Steer, Agents Execute — Ryan Lopopolo, OpenAI
TL;DR
OpenAI engineer Ryan Lopopolo explains how AI agents have made code generation abundant and free, requiring a fundamental shift from writing implementations to designing systems, documentation, and guardrails that enable agents to execute software engineering tasks autonomously.
💰 The Abundance Paradigm 3 insights
GPT-5.2 marked the "magic moment" for full agent autonomy
Lopopolo identifies GPT-5.2 as the capability inflection point where models became isomorphic to human software engineers in producing production-quality code.
Human engineers now command thousands of agents
Individual engineers can deploy five to five thousand agent-workers in parallel, constrained only by GPU capacity and token budgets exceeding one billion output tokens daily.
Parallel execution eliminates priority backlogs
Previously abandoned P3 tasks can now be attempted immediately across multiple parallel trajectories, with successful solutions merged and failures discarded.
🎯 Steering Through Documentation 3 insights
Documentation displaces code as the primary artifact
The valuable output is no longer the implementation but the prompts, ADRs, and guardrails that guide agents, as code has become free to produce, refactor, and delete.
Encode non-functional requirements explicitly
Teams must document quality standards for security and reliability as lint rules and review agent prompts to prevent agents from generating maintenance-burdened "slop."
Durable expertise scales through written standards
Documenting QA plans and coding standards once allows every agent trajectory to inherit the team's collective expertise without synchronous human review.
🤖 Operationalizing Agent Teams 3 insights
Deploy automated reviewer agents in CI
Security and reliability agents continuously check proposed patches for patterns like missing timeouts, retries, or unsafe interfaces before human involvement.
Engineer codebases for context efficiency
Adapt repositories to model constraints by enforcing file size limits and writing error messages that include specific remediation prompts to maximize limited context windows.
Compose stackable agent skills
Create reusable "skills" that encode best practices—such as prompt engineering—which agents invoke to maintain consistency and build compound leverage.
Bottom Line
Stop writing implementation code and instead build systems of documentation, automated guardrails, and agent skills that enable AI agents to execute the full software engineering lifecycle while humans focus exclusively on high-level steering.
More from AI Engineer
View all
AI Didn’t Kill the Web, It Moved in! — Olivier Leplus (AWS) & Yohan Lasorsa (Microsoft)
AI has permeated every stage of the web development lifecycle, from coding agents that use customizable "skills" to browser-native AI debugging tools, fundamentally shifting how developers build, test, and optimize applications.
How METR measures Long Tasks and Experienced Open Source Dev Productivity - Joel Becker, METR
Joel Becker from METR argues that slowing compute growth would proportionally delay AI capabilities milestones measured by task time horizons, while presenting findings that experienced open-source developers showed minimal productivity gains from AI coding assistants like Cursor, challenging optimistic adoption curves.
Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0
Auth0/Okta leaders Patrick Riley and Carlos Galan unveil new AI identity infrastructure including Token Vault for secure credential management and Async OAuth for human approvals, presenting a four-pillar framework to authenticate users and authorize autonomous agent actions across enterprise applications.
OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal
Cornelia Davis from Temporal demonstrates how integrating OpenAI's Agents SDK with Temporal's distributed systems platform creates production-ready AI agents that automatically handle crashes, retries, and state persistence without developers writing complex resilience code.