🔬Generating Molecules, Not Just Models
TL;DR
AlphaFold2's 2020 breakthrough solved single-chain protein structure prediction using evolutionary sequence correlations, but the field still struggles with folding dynamics, protein complexes, and molecules lacking evolutionary data, while expanding to small molecules and nucleic acids.
🧬 AlphaFold2's Breakthrough Moment 2 insights
CASP 14 dominance
AlphaFold2 achieved unprecedented accuracy at the 2020 protein structure prediction competition, effectively solving the 50-year challenge of predicting single-chain protein structures when evolutionary data is available.
Structure vs. folding distinction
The breakthrough addressed predicting final static structures but not the dynamic folding process itself, leaving intermediate states and misfolding pathways poorly understood.
⚛️ The Molecular Landscape 2 insights
Proteins as cellular machinery
Proteins are sequences of 20 amino acid types that fold into functional machines, with their 3D shape determining biological function and disease mechanisms.
Small molecules and nucleic acids
Small molecules feature diverse atomic compositions distinct from proteins, while nucleic acids (DNA/RNA) use 4-base sequences similar to proteins but require different modeling approaches.
⚠️ Current Limitations 2 insights
Dynamic and complex systems
Models struggle with intrinsically disordered proteins, multi-chain complexes, and conformational switching where proteins change shape based on cellular environment.
The orphan protein problem
Structure prediction fails for proteins lacking evolutionary homologs, as current methods rely heavily on multiple sequence alignments (MSA) to infer spatial constraints.
🧮 The Evolutionary Hack 2 insights
Co-evolutionary signals
Spatially close amino acids show correlated mutations across species through compensatory changes, providing distance constraints that enable accurate geometric reconstruction.
MSA dependency bottleneck
This reliance on evolutionary history means the 'solved' problem only applies to well-characterized protein families, leaving many therapeutic targets structurally opaque.
Bottom Line
While AlphaFold2 cracked static structure prediction for single proteins using evolutionary patterns, advancing therapeutic design requires solving dynamic interactions, complexes, and molecules without evolutionary history.
More from Latent Space
View all
🔬Top Black Holes Physicist: GPT5 can do Vibe Physics, here's what I found
Physicist Alex Lubyansky discusses how GPT-5 and reasoning models like o3 have achieved superhuman capabilities in theoretical physics, solving the year-long mystery of single minus gluon tree amplitudes and reproducing complex research in minutes rather than months.
The $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition
Applied Intuition is building the unified 'Android for physical machines' to solve OS fragmentation across vehicles and industrial equipment, enabling modern AI deployment through simulation tools, proprietary operating systems, and end-to-end autonomy models with a 1,000-engineer team.
CI/CD Breaks at AI Speed: Tangle, Graphite Stacks, Pro-Model PR Review — Mikhail Parakhin, Shopify
Shopify CTO Mikhail Parakhin reveals that AI agents have achieved nearly 100% daily adoption among developers, driving a 30% month-over-month surge in PR merges that is breaking traditional CI/CD pipelines, and argues that organizations must shift from parallel token-burning agents to high-latency, critique-loop architectures using expensive pro-level models for code review.
🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik
Noetik is tackling the 95% failure rate of cancer clinical trials by training transformers on proprietary multimodal patient tumor data to identify hidden biological subtypes and match therapies to responsive populations, moving beyond simplistic biomarkers and outdated cell lines.