🔬 From Red Teaming GPT-4 to Automating Drug Discovery: The Future of AI in Science — Andrew White
TL;DR
Andrew White recounts his journey from molecular dynamics researcher to founder of Future House and Edison Scientific, revealing how AlphaFold disrupted specialized supercomputing approaches and why current LLMs have already compressed a decade-long vision for automated science into just two years—prompting urgent policy attention at the highest levels of government.
🧬 Computational Biology's Hardware-to-Software Revolution 2 insights
D. E. Shaw's custom silicon failed against general AI
David Shaw's research group burned molecular dynamics algorithms into specialized chips near Times Square, but AlphaFold solved protein folding on desktop GPUs via Google Colab without requiring government-funded supercomputers.
Geometric deep learning supplanted maximum entropy methods
White's early work fitting complex molecular dynamics to sparse data evolved into using transformers that naturally handle chemical graph structures and symmetry, enabling chemistry to be treated as a language modeling problem.
🚀 Automating Discovery: Organizations and Timelines 2 insights
Future House bridges academia and venture capital
White resigned his University of Rochester tenure in June to establish a nonprofit 'Focused Research Organization' exploring AI automation, then spun out Edison Scientific as a venture-backed startup when progress compressed a planned 10-year mission into two years.
Current LLMs already automate empirical biology
Unlike theoretical domains requiring perfect reasoning, biology's empirical nature means today's models can match top-quintile human experimental predictions, making immediate automation viable without waiting for more advanced AI systems.
⚠️ Red Teaming and Policy at the Frontier 2 insights
ChemCrow prompted unprecedented White House attention
White spent nine months red-teaming GPT-4's chemistry capabilities before its March release, creating an agentic system that operates cloud laboratories and became the first pre-print presented to the President in a 30-minute National Security Council briefing.
Immediate dual-use implications revealed expertise gaps
Three-letter agencies immediately questioned ChemCrow's impact on explosives synthesis and nuclear weapons research timelines, highlighting the dangerous scarcity of experts at the intersection of AI and hard sciences.
Bottom Line
Organizations should immediately deploy current-generation LLMs to automate empirical scientific research through hybrid academic-industry teams, as the technology has already matured beyond the capabilities of traditional institutional structures.
More from Latent Space
View all
🔬Top Black Holes Physicist: GPT5 can do Vibe Physics, here's what I found
Physicist Alex Lubyansky discusses how GPT-5 and reasoning models like o3 have achieved superhuman capabilities in theoretical physics, solving the year-long mystery of single minus gluon tree amplitudes and reproducing complex research in minutes rather than months.
The $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition
Applied Intuition is building the unified 'Android for physical machines' to solve OS fragmentation across vehicles and industrial equipment, enabling modern AI deployment through simulation tools, proprietary operating systems, and end-to-end autonomy models with a 1,000-engineer team.
CI/CD Breaks at AI Speed: Tangle, Graphite Stacks, Pro-Model PR Review — Mikhail Parakhin, Shopify
Shopify CTO Mikhail Parakhin reveals that AI agents have achieved nearly 100% daily adoption among developers, driving a 30% month-over-month surge in PR merges that is breaking traditional CI/CD pipelines, and argues that organizations must shift from parallel token-burning agents to high-latency, critique-loop architectures using expensive pro-level models for code review.
🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik
Noetik is tackling the 95% failure rate of cancer clinical trials by training transformers on proprietary multimodal patient tumor data to identify hidden biological subtypes and match therapies to responsive populations, moving beyond simplistic biomarkers and outdated cell lines.