🔬Generating Molecules, Not Just Models
TL;DR
AlphaFold2's 2020 breakthrough solved single-chain protein structure prediction using evolutionary sequence correlations, but the field still struggles with folding dynamics, protein complexes, and molecules lacking evolutionary data, while expanding to small molecules and nucleic acids.
🧬 AlphaFold2's Breakthrough Moment 2 insights
CASP 14 dominance
AlphaFold2 achieved unprecedented accuracy at the 2020 protein structure prediction competition, effectively solving the 50-year challenge of predicting single-chain protein structures when evolutionary data is available.
Structure vs. folding distinction
The breakthrough addressed predicting final static structures but not the dynamic folding process itself, leaving intermediate states and misfolding pathways poorly understood.
⚛️ The Molecular Landscape 2 insights
Proteins as cellular machinery
Proteins are sequences of 20 amino acid types that fold into functional machines, with their 3D shape determining biological function and disease mechanisms.
Small molecules and nucleic acids
Small molecules feature diverse atomic compositions distinct from proteins, while nucleic acids (DNA/RNA) use 4-base sequences similar to proteins but require different modeling approaches.
⚠️ Current Limitations 2 insights
Dynamic and complex systems
Models struggle with intrinsically disordered proteins, multi-chain complexes, and conformational switching where proteins change shape based on cellular environment.
The orphan protein problem
Structure prediction fails for proteins lacking evolutionary homologs, as current methods rely heavily on multiple sequence alignments (MSA) to infer spatial constraints.
🧮 The Evolutionary Hack 2 insights
Co-evolutionary signals
Spatially close amino acids show correlated mutations across species through compensatory changes, providing distance constraints that enable accurate geometric reconstruction.
MSA dependency bottleneck
This reliance on evolutionary history means the 'solved' problem only applies to well-characterized protein families, leaving many therapeutic targets structurally opaque.
Bottom Line
While AlphaFold2 cracked static structure prediction for single proteins using evolutionary patterns, advancing therapeutic design requires solving dynamic interactions, complexes, and molecules without evolutionary history.
More from Latent Space
View all
🔬There Is No AlphaFold for Materials — AI for Materials Discovery with Heather Kulik
MIT professor Heather Kulik explains how AI discovered quantum phenomena to create 4x tougher polymers and why materials science lacks an 'AlphaFold' equivalent due to missing experimental datasets, emphasizing that domain expertise remains essential to validate AI predictions in chemistry.
Dreamer: the Agent OS for Everyone — David Singleton
David Singleton introduces Dreamer as an 'Agent OS' that combines a personal AI Sidekick with a marketplace of tools and agents, enabling both non-technical users and engineers to build, customize, and deploy AI applications through natural language while maintaining privacy through centralized, OS-level architecture.
Why Anthropic Thinks AI Should Have Its Own Computer — Felix Rieseberg of Claude Cowork/Code
Anthropic's Felix Rieseberg explains why AI agents need their own virtual computers to be effective, arguing that confining Claude to chat interfaces severely limits capability. He details how this philosophy shaped Claude Cowork and why product development is shifting from lengthy planning to rapidly building multiple prototypes simultaneously.
⚡️Monty: the ultrafast Python interpreter by Agents for Agents — Samuel Colvin, Pydantic
Samuel Colvin from Pydantic introduces Monty, a Rust-based Python interpreter designed specifically for AI agents that achieves sub-microsecond execution latency by running in-process, bridging the gap between rigid tool calling and heavy containerized sandboxes.