OpenAI + @Temporalio : Building Durable, Production Ready Agents - Cornelia Davis, Temporal

AI Engineer

| Podcasts | January 12, 2026 | 26.4 Thousand views | 1:18:30

TL;DR

Cornelia Davis from Temporal demonstrates how integrating OpenAI's Agents SDK with Temporal's distributed systems platform creates production-ready AI agents that automatically handle crashes, retries, and state persistence without developers writing complex resilience code.

🤖 OpenAI Agents SDK Fundamentals 2 insights

Agentic loops drive autonomous behavior

The SDK enables LLMs to control application flow through runner.run loops that continuously invoke the LLM, execute tools, and route outputs until the task completes.

Simple configuration with powerful defaults

Agents require only a name and instructions to start, but support advanced features like handoffs, guardrails, and tool integration in both Python and TypeScript.

🛡️ Temporal's Distributed Durability 3 insights

Durable execution as a backing service

Temporal provides distributed systems durability as a service, allowing developers to program only the 'happy path' while automatically handling crashes, retries, and state recovery.

Workflows and activities architecture

Activities wrap external calls or heavy computation, while workflows orchestrate them with built-in retries, exponential backoff, and event-sourced state management.

Proven at massive scale

Every Snapchat, Airbnb booking, and OpenAI CodeX/image generation runs on Temporal, which originally forked from Uber's Cadence workflow engine.

⚡ Production-Ready Agent Integration 2 insights

Token-preserving crash recovery

When Temporal powers agents, applications resume exactly where they left off after crashes without re-executing previous LLM calls, preventing token reburn even on the 1,350th turn.

Automatic resilience for AI workflows

The integration transparently handles rate limiting, downstream API failures, and infrastructure crashes, eliminating manual retry logic and queue management like Kafka.

⚠️ Current Platform Limitations 2 insights

Native streaming not yet available

Temporal currently does not natively support streaming data for agents, though workarounds exist at scale and native support is a top priority.

Large payload storage in development

The team is actively building large payload storage to efficiently handle big LLM context windows by passing data by reference rather than value.

Bottom Line

Developers should use Temporal with OpenAI Agents SDK to automatically handle failures and state management in production, allowing focus on business logic rather than building resilience infrastructure.

Watch on YouTube

More from AI Engineer

Agentic Search for Context Engineering — Leonie Monigatti, Elastic

AI Engineer

Agentic Search for Context Engineering — Leonie Monigatti, Elastic

Leonie Monigatti from Elastic argues that context engineering is fundamentally 80% agentic search, evolving from rigid RAG pipelines to dynamic agent-driven retrieval that must navigate diverse context sources through carefully curated, specialized search tools.

1 day ago · 9 points

Playground in Prod - Optimising Agents in Production Environments — Samuel Colvin, Pydantic

AI Engineer

Playground in Prod - Optimising Agents in Production Environments — Samuel Colvin, Pydantic

Samuel Colvin demonstrates optimizing AI agent prompts in production using Jepper, a genetic algorithm library that breeds high-performing prompt variations, combined with Logfire's managed variables for structured configuration and deterministic evaluation against golden datasets.

2 days ago · 8 points

Vibe Engineering Effect Apps — Michael Arnaldi, Effectful

AI Engineer

Vibe Engineering Effect Apps — Michael Arnaldi, Effectful

Michael Arnaldi demonstrates "vibe engineering" by building a TypeScript project with AI agents, revealing that cloning library repositories directly into your codebase—rather than using npm packages—enables AI to learn patterns from source code, while strict TypeScript and custom lint rules act as essential guardrails.

2 days ago · 8 points

Everything You Need To Know About Agent Observability — Danny Gollapalli and Ben Hylak, Raindrop

AI Engineer

Everything You Need To Know About Agent Observability — Danny Gollapalli and Ben Hylak, Raindrop

As AI agents grow more complex and autonomous, traditional pre-deployment testing fails to catch the infinite edge cases of production behavior. The video outlines a new observability paradigm combining explicit system metrics with implicit semantic signals and self-diagnostics to monitor agents in real-time.

2 days ago · 9 points

Browse more: 🎙️ Podcasts All Videos All Categories