🔬 From Red Teaming GPT-4 to Automating Drug Discovery: The Future of AI in Science — Andrew White
TL;DR
Andrew White recounts his journey from molecular dynamics researcher to founder of Future House and Edison Scientific, revealing how AlphaFold disrupted specialized supercomputing approaches and why current LLMs have already compressed a decade-long vision for automated science into just two years—prompting urgent policy attention at the highest levels of government.
🧬 Computational Biology's Hardware-to-Software Revolution 2 insights
D. E. Shaw's custom silicon failed against general AI
David Shaw's research group burned molecular dynamics algorithms into specialized chips near Times Square, but AlphaFold solved protein folding on desktop GPUs via Google Colab without requiring government-funded supercomputers.
Geometric deep learning supplanted maximum entropy methods
White's early work fitting complex molecular dynamics to sparse data evolved into using transformers that naturally handle chemical graph structures and symmetry, enabling chemistry to be treated as a language modeling problem.
🚀 Automating Discovery: Organizations and Timelines 2 insights
Future House bridges academia and venture capital
White resigned his University of Rochester tenure in June to establish a nonprofit 'Focused Research Organization' exploring AI automation, then spun out Edison Scientific as a venture-backed startup when progress compressed a planned 10-year mission into two years.
Current LLMs already automate empirical biology
Unlike theoretical domains requiring perfect reasoning, biology's empirical nature means today's models can match top-quintile human experimental predictions, making immediate automation viable without waiting for more advanced AI systems.
⚠️ Red Teaming and Policy at the Frontier 2 insights
ChemCrow prompted unprecedented White House attention
White spent nine months red-teaming GPT-4's chemistry capabilities before its March release, creating an agentic system that operates cloud laboratories and became the first pre-print presented to the President in a 30-minute National Security Council briefing.
Immediate dual-use implications revealed expertise gaps
Three-letter agencies immediately questioned ChemCrow's impact on explosives synthesis and nuclear weapons research timelines, highlighting the dangerous scarcity of experts at the intersection of AI and hard sciences.
Bottom Line
Organizations should immediately deploy current-generation LLMs to automate empirical scientific research through hybrid academic-industry teams, as the technology has already matured beyond the capabilities of traditional institutional structures.
More from Latent Space
View all
The Agent Cloud: Databricks’ Bet on the Future of AI — Matei Zaharia and Reynold Xin
Matei Zaharia and Reynold Xin detail Databricks' open-source 'Agent Cloud' platform (Omnigen), arguing that standardized protocols and persistent infrastructure—not just better models—will determine which enterprises successfully deploy collaborative, secure AI agents at scale.
AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan
Gray Swan co-founders Zico Kolter and Matt Fredrikson explain why AI systems require a fundamentally different security approach than traditional software, highlighting how their automated red teaming system 'Shade' has begun to outperform human experts at finding model vulnerabilities. They emphasize the urgent need to treat AI agents as inherently untrusted entities capable of correlated failures across the software ecosystem.
⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai
Ronak Malde explains leaving DeepMind (and $2 billion in acquisition earnings) to found Trajectory.ai, arguing that AI products must evolve from static tools into "living systems" that continually learn from real-world user corrections across enterprise verticals like legal and finance.
The AI Frontier: from FLOPs to Megawatts — Anjney Midha, AMP
Anjney Midha argues that AI infrastructure is facing a crisis of inefficiency and cultural misalignment, proposing that compute be treated as a utility through an Independent System Operator model that pools multi-cloud resources while embedding community incentives directly into unit economics.