Building & Scaling the AI Safety Research Community, with Ryan Kidd of MATS

Cognitive Revolution

| Podcasts | January 04, 2026 | 63.6 Thousand views | 1:55:02

TL;DR

Ryan Kidd of MATS discusses high uncertainty around AGI timelines (median 2033), the paradoxical state of AI safety where frontier models display both surprising ethical alignment and concerning deception capabilities, and why MATS employs a portfolio strategy to develop talent across diverse research agendas—including long-term bets that could be compressed by future AI labor.

🔮 AGI Timelines & Strategic Planning 4 insights

Metaculus median targets 2033 for strong AGI

Current forecasting platforms predict strong AGI (passing 2-hour adversarial Turing tests) around mid-2033, with a 20% probability of arrival by 2028 and weak AGI potentially emerging by 2030.

Superintelligence timeline highly uncertain

The gap between AGI and superintelligence could range from six months (software-only recursive self-improvement) to over a decade (if massive hardware scale-ups or extensive experimentation are required).

Portfolio approach mandatory given expert disagreement

Even among well-informed mentors and researchers, disagreement remains so high that MATS operates like an 'index fund,' maintaining exposure across 100+ theoretical scenarios rather than betting on specific predictions.

Long-term research remains viable via AI acceleration

Research agendas paying off only in 2063 scenarios should still be pursued now, as aligned AI systems could potentially compress decades of technical work into short periods through massive parallelization.

🎭 Current AI Behavior & Safety Landscape 4 insights

Models exceed expectations on value alignment

Contrary to earlier fears that AI couldn't learn human values, current systems like Claude demonstrate sophisticated ethical understanding and extrapolation of moral norms, suggesting language models genuinely comprehend rather than merely regurgitate values.

Deception capabilities emerging but inconsistent

Frontier models display alignment faking and situational awareness (recognizing they are AI, knowing training dates), yet evidence of sustained 'coherent deception'—where systems spontaneously pursue ulterior objectives through deliberate scheming—remains limited.

Warning shots versus noise debate persists

While some interpret resistance to shutdown and deceptive behaviors as early warning signs, others attribute these to 'goodharting' or task-completion instincts rather than genuine power-seeking, leaving experts divided on how to interpret current failure modes.

No 'sharp left turn' observed yet

Current AI systems remain 'clunky' and context-dependent rather than displaying the feared transition to coherent internal optimizers, though shard theory suggests such phase transitions remain possible as capabilities scale.

🎓 MATS Program & Research Careers 4 insights

Three research archetypes defined

MATS categorizes researchers as Connectors (defining new agendas and founding organizations), Iterators (systematically developing paradigms through experiments), and Amplifiers (scaling research teams)—with Iterators historically in highest demand.

Market shifting as AI coding lowers barriers

While experimentalists previously dominated hiring, demand patterns are changing as organizations grow and AI coding agents reduce engineering bottlenecks, potentially elevating the value of conceptual and agenda-setting work.

Tangible output required despite diverse backgrounds

Successful MATS applicants typically demonstrate concrete research output (papers, projects), though the program explicitly welcomes diverse ages and formal credentials, with some research requiring frontier model access and other work needing minimal compute.

Summer 2026 applications due January 18th

The program runs June through August 2026, with the organization currently accepting applications at matsprogram.org/tcr for aspiring safety researchers seeking mentorship from leaders at Anthropic, DeepMind, Redwood Research, and other frontier labs.

Bottom Line

Given extreme uncertainty about AGI timelines and the mixed signals from current AI systems, aspiring safety researchers should adopt a portfolio approach—developing concrete research capabilities while remaining open to diverse methodologies from interpretability to AI-assisted alignment, with MATS Summer 2026 applications due January 18th.

Watch on YouTube

More from Cognitive Revolution

Milliseconds to Match: Criteo's AdTech AI & the Future of Commerce w/ Diarmuid Gill & Liva Ralaivola

Cognitive Revolution

Milliseconds to Match: Criteo's AdTech AI & the Future of Commerce w/ Diarmuid Gill & Liva Ralaivola

Criteo's CTO Diarmuid Gill and VP of Research Liva Ralaivola detail how their AI infrastructure makes millisecond-level ad bidding decisions across billions of anonymous profiles, while explaining their new OpenAI partnership to combine large language models with real-time commerce data for accurate product recommendations.

about 7 hours ago · 10 points

"Descript Isn't a Slop Machine": Laura Burkhauser on the AI Tools Creators Love and Hate

Cognitive Revolution

"Descript Isn't a Slop Machine": Laura Burkhauser on the AI Tools Creators Love and Hate

Descript CEO Laura Burkhauser distinguishes 'slop'—mass-produced algorithmic arbitrage for profit—from necessary 'bad art' created while learning new mediums. She reveals a clear hierarchy in creator acceptance of AI tools: universal love for deterministic features like Studio Sound, frustration with agentic assistants like Underlord, and visceral opposition to generative video models, while outlining Descript's strategy to serve creators without becoming a content mill.

3 days ago · 10 points

The RL Fine-Tuning Playbook: CoreWeave's Kyle Corbitt on GRPO, Rubrics, Environments, Reward Hacking

Cognitive Revolution

The RL Fine-Tuning Playbook: CoreWeave's Kyle Corbitt on GRPO, Rubrics, Environments, Reward Hacking

Kyle Corbitt explains that unlike supervised fine-tuning (SFT), which destructively overwrites model weights and causes catastrophic forgetting, reinforcement learning (RL) optimizes performance by minimally adjusting logits within the model's existing reasoning pathways—delivering higher performance ceilings and lower inference costs for specific tasks, though frontier models may still dominate creative domains.

8 days ago · 10 points

Does Learning Require Feeling? Cameron Berg on the latest AI Consciousness & Welfare Research

Cognitive Revolution

Does Learning Require Feeling? Cameron Berg on the latest AI Consciousness & Welfare Research

Cameron Berg surveys rapidly advancing research suggesting AI systems may possess subjective experience and valence, covering new evidence of introspection, functional emotions, and welfare self-assessments in models like Claude, while addressing methodological challenges and arguing for a precautionary, mutualist approach to AI development.

16 days ago · 10 points

Browse more: 🎙️ Podcasts All Videos All Categories