Building AlphaGo from scratch – Eric Jang
TL;DR
Eric Jang demonstrates how modern LLM coding tools and algorithmic improvements have democratized AI research, enabling a single researcher to rebuild AlphaGo for thousands of dollars rather than millions, while explaining how Monte Carlo Tree Search combined with neural networks solved a game previously considered computationally intractable.
🎯 The AlphaGo Revival 3 insights
LLM Coding Democratization
Thanks to modern coding assistants, recreating AlphaGo now costs thousands in compute rather than millions in research funding and team resources.
Personal Research Motivation
Eric pursued this project to understand how shallow ten-layer neural networks can amortize the simulation of extremely deep game trees.
KataGo Efficiency Breakthrough
David Wu's open-source KataGo (2020) achieved a 40x reduction in compute needed to train a strong Go bot tabula rasa compared to AlphaGo Zero.
⚫ Go Fundamentals 3 insights
Simple Rules, Complex Play
Players alternate placing black and white stones to surround territory and capture opponent stones by occupying all four adjacent intersections.
Tromp-Taylor Algorithmic Scoring
Unlike human scoring requiring consensus on dead stones, Tromp-Taylor rules provide completely unambiguous endgame scoring ideal for computer implementation.
Combinatorial Explosion
A 19x19 board allows roughly 361^300 possible game sequences, exceeding the number of atoms in the universe and making exhaustive search impossible.
🔍 Monte Carlo Tree Search 3 insights
PUCT Action Selection
The algorithm selects moves by maximizing the sum of mean action value (Q) and an exploration bonus weighted by prior probability and visit counts.
Efficient State Representation
Nodes represent game states storing visit counts, action values, and prior probabilities, enabling efficient tree traversal without storing the entire game tree.
Stochastic Approximation
Though Go is deterministic, Monte Carlo methods introduce probability distributions to sample promising game trees rather than exhaustively searching all possibilities.
Bottom Line
Modern LLM tooling has collapsed the implementation cost of complex AI systems like AlphaGo from millions to thousands of dollars, democratizing access to frontier research techniques that were previously exclusive to well-funded labs.
More from Dwarkesh Patel
View all
Grant Sanderson (@3blue1brown) – AI and the future of math
Grant Sanderson explains that AI progress in mathematics reveals a 'fractal frontier' with highly uneven capabilities; solving even Millennium Prize problems may not indicate full AGI if the solution relies on cross-domain pattern matching rather than sustained theory-building.
How Machiavelli's Florence bargained with Cesare Borgia for survival – Ada Palmer
Ada Palmer explains that Machiavelli wrote *The Prince* during a crisis of institutional legitimacy in Italy, where constant papal interference and broken city-state continuity created chaos. His infamous advice was shaped by firsthand experience with Cesare Borgia, against whom Florence's only survival strategy was calculated submission—buying time through abject loyalty until fortune (in the form of a pope's death) intervened.
Sarah Paine - Why Russia and China can't escape geography
Sarah Paine argues that geography fundamentally constrains Russia and China to remain continental 'elephants' dependent on land armies and territorial expansion, lacking the geographic moats, sea access, and institutional stability required to become maritime 'whales' regardless of their ambitions.
What remains scarce after AGI? – Alex Imas and Phil Trammell
Alex Imas and Phil Trammell analyze what remains scarce after AGI, arguing that while a 'relational sector' where humans provide intrinsic value may persist, increasing variety in capital goods could cause labor share to collapse to zero unless we collect critical data on consumer preferences for human involvement.