🔬Generating Molecules, Not Just Models
TL;DR
AlphaFold2's 2020 breakthrough solved single-chain protein structure prediction using evolutionary sequence correlations, but the field still struggles with folding dynamics, protein complexes, and molecules lacking evolutionary data, while expanding to small molecules and nucleic acids.
🧬 AlphaFold2's Breakthrough Moment 2 insights
CASP 14 dominance
AlphaFold2 achieved unprecedented accuracy at the 2020 protein structure prediction competition, effectively solving the 50-year challenge of predicting single-chain protein structures when evolutionary data is available.
Structure vs. folding distinction
The breakthrough addressed predicting final static structures but not the dynamic folding process itself, leaving intermediate states and misfolding pathways poorly understood.
⚛️ The Molecular Landscape 2 insights
Proteins as cellular machinery
Proteins are sequences of 20 amino acid types that fold into functional machines, with their 3D shape determining biological function and disease mechanisms.
Small molecules and nucleic acids
Small molecules feature diverse atomic compositions distinct from proteins, while nucleic acids (DNA/RNA) use 4-base sequences similar to proteins but require different modeling approaches.
⚠️ Current Limitations 2 insights
Dynamic and complex systems
Models struggle with intrinsically disordered proteins, multi-chain complexes, and conformational switching where proteins change shape based on cellular environment.
The orphan protein problem
Structure prediction fails for proteins lacking evolutionary homologs, as current methods rely heavily on multiple sequence alignments (MSA) to infer spatial constraints.
đź§® The Evolutionary Hack 2 insights
Co-evolutionary signals
Spatially close amino acids show correlated mutations across species through compensatory changes, providing distance constraints that enable accurate geometric reconstruction.
MSA dependency bottleneck
This reliance on evolutionary history means the 'solved' problem only applies to well-characterized protein families, leaving many therapeutic targets structurally opaque.
Bottom Line
While AlphaFold2 cracked static structure prediction for single proteins using evolutionary patterns, advancing therapeutic design requires solving dynamic interactions, complexes, and molecules without evolutionary history.
More from Latent Space
View all
The Agent Cloud: Databricks’ Bet on the Future of AI — Matei Zaharia and Reynold Xin
Matei Zaharia and Reynold Xin detail Databricks' open-source 'Agent Cloud' platform (Omnigen), arguing that standardized protocols and persistent infrastructure—not just better models—will determine which enterprises successfully deploy collaborative, secure AI agents at scale.
AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan
Gray Swan co-founders Zico Kolter and Matt Fredrikson explain why AI systems require a fundamentally different security approach than traditional software, highlighting how their automated red teaming system 'Shade' has begun to outperform human experts at finding model vulnerabilities. They emphasize the urgent need to treat AI agents as inherently untrusted entities capable of correlated failures across the software ecosystem.
⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai
Ronak Malde explains leaving DeepMind (and $2 billion in acquisition earnings) to found Trajectory.ai, arguing that AI products must evolve from static tools into "living systems" that continually learn from real-world user corrections across enterprise verticals like legal and finance.
The AI Frontier: from FLOPs to Megawatts — Anjney Midha, AMP
Anjney Midha argues that AI infrastructure is facing a crisis of inefficiency and cultural misalignment, proposing that compute be treated as a utility through an Independent System Operator model that pools multi-cloud resources while embedding community incentives directly into unit economics.