Grant Sanderson (@3blue1brown) – AI and the future of math
TL;DR
Grant Sanderson explains that AI progress in mathematics reveals a 'fractal frontier' with highly uneven capabilities; solving even Millennium Prize problems may not indicate full AGI if the solution relies on cross-domain pattern matching rather than sustained theory-building.
🎯 The Benchmark Paradox 3 insights
IMO gold was not AGI
Sanderson's prediction that AI achieving International Math Olympiad gold would merely mark another passed benchmark rather than general intelligence has proven correct.
Geometry brute force vs. combinatorics creativity
Current AI solves IMO geometry problems in 19 seconds through brute force but still struggles with combinatorics problems that appear to require more playful, creative reasoning.
Fractal capability spikiness
Mathematical capability itself has a fractal structure—while AI appears superhuman in some domains, zooming in reveals uneven progress where specific subfields remain resistant.
🧩 Two Paths to Breakthroughs 3 insights
Cross-domain connections
Solutions like the Riemann Hypothesis might emerge from linking distant fields (e.g., number theory and quantum physics via random matrix theory), playing to LLMs' strength of retaining broad, superhuman knowledge.
Building new theoretical mountains
Alternatively, solving hard problems may require constructing entirely new theoretical frameworks (like elliptic curves for Fermat's Last Theorem), a skill that would indicate intelligence sufficient to automate white-collar work.
The century-long verification loop
Unlike theorem proving, validating whether a new conceptual framework is productive can require hundred-year feedback loops, making it resistant to current reinforcement learning methods.
🔮 Beyond Solving: The Next Frontier 3 insights
From theorems to definitions
The next true benchmark is shifting from problem-solving to generating interesting conjectures and definitions that create new fields, as 'great mathematicians come up with conjectures, the greatest come up with definitions.'
Subjective progress metrics
Unlike clear-cut benchmarks, progress here will appear as a 'tone shift' where mathematicians report AI is useful for deciding what research questions are worth pursuing in the first place.
RLVR training limitations
These higher-level creative capabilities likely cannot be trained via current RLVR approaches because they lack immediate verifiable rewards, requiring instead long-term validation by the scientific community.
Bottom Line
True AI mastery of mathematics won't be measured by solving existing problems but by its ability to help mathematicians decide what is worth studying and to create conceptual frameworks that reshape entire fields.
More from Dwarkesh Patel
View all
How Machiavelli's Florence bargained with Cesare Borgia for survival – Ada Palmer
Ada Palmer explains that Machiavelli wrote *The Prince* during a crisis of institutional legitimacy in Italy, where constant papal interference and broken city-state continuity created chaos. His infamous advice was shaped by firsthand experience with Cesare Borgia, against whom Florence's only survival strategy was calculated submission—buying time through abject loyalty until fortune (in the form of a pope's death) intervened.
Sarah Paine - Why Russia and China can't escape geography
Sarah Paine argues that geography fundamentally constrains Russia and China to remain continental 'elephants' dependent on land armies and territorial expansion, lacking the geographic moats, sea access, and institutional stability required to become maritime 'whales' regardless of their ambitions.
What remains scarce after AGI? – Alex Imas and Phil Trammell
Alex Imas and Phil Trammell analyze what remains scarce after AGI, arguing that while a 'relational sector' where humans provide intrinsic value may persist, increasing variety in capital goods could cause labor share to collapse to zero unless we collect critical data on consumer preferences for human involvement.
Chip design from the bottom up – Reiner Pope
Reiner Pope explains how AI chips work from fundamental logic gates up, revealing that the physical cost of moving data between memory and compute units (via multiplexers) often exceeds the cost of the actual mathematical operations, and that circuit area scales quadratically with precision, making low-precision arithmetic exponentially more efficient than commonly assumed.