The Future Is Domain-Specific Agents - Justin Schroeder, StandardAgents

AI Engineer

| Podcasts | June 29, 2026 | 1.82 Thousand views | 30:38

TL;DR

Justin Schroeder argues that the future of AI lies in domain-specific agents—small, specialized agents that compose together rather than general-purpose agents bloated with tools and skills, delivering 80%+ token efficiency and 137x cost savings compared to monolithic approaches.

🔧 The Current Agent Crisis 3 insights

Custom agent development is universally painful

Businesses across industries are building custom agents for data integration, but they face nightmares with orchestration, observability, and portability, often resulting in demos that fail in production.

MCP and skills only solve part of the problem

While Model Context Protocol works for tool distribution and skills provide documentation, they function as 'inheritance' that inflates context windows and creates diminishing returns when scaled beyond a handful of capabilities.

The inheritance anti-pattern dominates current architecture

Current approaches continuously add tools, skills, and system prompts to general agents, creating a bloated context layer that mirrors problematic object-oriented inheritance patterns.

🏗️ Composition Over Inheritance 3 insights

Domain-specific agents replace bloated context

Instead of one agent with 100 skills, composition uses specialized agents for Gmail, Travel, or Figma with minimal context, coordinated by a primary agent communicating via natural language.

Specialization enables smaller, cheaper models

Domain-specific agents can run on small models like DeepSeek V4 Flash, which is 137 times cheaper than frontier models, because they handle narrowly defined tasks with minimal context requirements.

Biomimicry of human expertise organization

This architecture mirrors successful human systems like Apollo 11 mission control, where teams of domain experts with specific tools collaborate rather than relying on one generalist with access to everything.

💰 Technical & Business Advantages 3 insights

Dramatic token efficiency gains

Domain-specific agents regularly achieve over 80% token efficiency since they process only the relevant context for their specific domain rather than entire conversation histories and tool suites.

True agent portability becomes possible

Unlike monolithic agents that fail on different machines due to complex environment dependencies, domain-specific agents can be packaged as discrete, portable units that run consistently across systems.

Strict capability limits enhance safety

Isolating functions to specific agents allows for precise permission controls and reduces security risks compared to general agents with broad, unrestricted tool access.

Bottom Line

Build small, specialized agents for specific domains that communicate via natural language with a coordinating agent, rather than inflating general-purpose agents with endless tools and skills.

Watch on YouTube

More from AI Engineer

Frontier results, on device - RL Nabors, Arize

AI Engineer

Frontier results, on device - RL Nabors, Arize

Rachel Lee Neighbors introduces a framework for replacing expensive cloud-based frontier models with Small Language Models (SLMs) running on-device, demonstrating how a systematic 'prototype big, deploy small' approach using evaluation tools like Phoenix can cut inference costs to zero while maintaining 90% accuracy and enabling offline functionality.

about 11 hours ago · 10 points

The Agentic AI Engineer - Benedikt Sanftl, Mutagent

AI Engineer

The Agentic AI Engineer - Benedikt Sanftl, Mutagent

Benedikt Sanftl and Burak from Mutagent present the 'Agentic AI Engineer' paradigm, where specialized AI agents autonomously manage the entire lifecycle of building, evaluating, and optimizing other agents through automated offline and online loops, solving the scalability bottlenecks of manual development.

about 13 hours ago · 10 points

Bypassing the Multimodal Tax: Hybrid RAG, SQL RRF & UI Telemetry - Abed Matini, Ogilvy

AI Engineer

Bypassing the Multimodal Tax: Hybrid RAG, SQL RRF & UI Telemetry - Abed Matini, Ogilvy

Abed Matini presents a framework-free Hybrid RAG architecture that eliminates pre-query token costs by preprocessing documents locally using DocLink and multiple chunking strategies, while implementing SQL-based Reciprocal Rank Fusion and LangFuse telemetry for production observability.

about 18 hours ago · 10 points

Agents Building Agents - Alfonso Graziano, Nearform

AI Engineer

Agents Building Agents - Alfonso Graziano, Nearform

Alfonso Graziano from NearForm demonstrates how coding agents can autonomously improve AI agent performance through iterative evaluation loops, achieving 18% to 83% accuracy gains on new agents and 10% improvements on production systems already optimized by humans.

about 23 hours ago · 9 points

Browse more: 🎙️ Podcasts All Videos All Categories