Building AlphaGo from scratch – Eric Jang

Dwarkesh Patel

| Podcasts | May 15, 2026 | 142 Thousand views | 2:37:18

TL;DR

Eric Jang demonstrates how modern LLM coding tools and algorithmic improvements have democratized AI research, enabling a single researcher to rebuild AlphaGo for thousands of dollars rather than millions, while explaining how Monte Carlo Tree Search combined with neural networks solved a game previously considered computationally intractable.

🎯 The AlphaGo Revival 3 insights

LLM Coding Democratization

Thanks to modern coding assistants, recreating AlphaGo now costs thousands in compute rather than millions in research funding and team resources.

Personal Research Motivation

Eric pursued this project to understand how shallow ten-layer neural networks can amortize the simulation of extremely deep game trees.

KataGo Efficiency Breakthrough

David Wu's open-source KataGo (2020) achieved a 40x reduction in compute needed to train a strong Go bot tabula rasa compared to AlphaGo Zero.

⚫ Go Fundamentals 3 insights

Simple Rules, Complex Play

Players alternate placing black and white stones to surround territory and capture opponent stones by occupying all four adjacent intersections.

Tromp-Taylor Algorithmic Scoring

Unlike human scoring requiring consensus on dead stones, Tromp-Taylor rules provide completely unambiguous endgame scoring ideal for computer implementation.

Combinatorial Explosion

A 19x19 board allows roughly 361^300 possible game sequences, exceeding the number of atoms in the universe and making exhaustive search impossible.

🔍 Monte Carlo Tree Search 3 insights

PUCT Action Selection

The algorithm selects moves by maximizing the sum of mean action value (Q) and an exploration bonus weighted by prior probability and visit counts.

Efficient State Representation

Nodes represent game states storing visit counts, action values, and prior probabilities, enabling efficient tree traversal without storing the entire game tree.

Stochastic Approximation

Though Go is deterministic, Monte Carlo methods introduce probability distributions to sample promising game trees rather than exhaustively searching all possibilities.

Bottom Line

Modern LLM tooling has collapsed the implementation cost of complex AI systems like AlphaGo from millions to thousands of dollars, democratizing access to frontier research techniques that were previously exclusive to well-funded labs.

Watch on YouTube

More from Dwarkesh Patel

Grant Sanderson (@3blue1brown) – AI and the future of math

Dwarkesh Patel

Grant Sanderson (@3blue1brown) – AI and the future of math

Grant Sanderson explains that AI progress in mathematics reveals a 'fractal frontier' with highly uneven capabilities; solving even Millennium Prize problems may not indicate full AGI if the solution relies on cross-domain pattern matching rather than sustained theory-building.

4 days ago · 9 points

How Machiavelli's Florence bargained with Cesare Borgia for survival – Ada Palmer

Dwarkesh Patel

How Machiavelli's Florence bargained with Cesare Borgia for survival – Ada Palmer

Ada Palmer explains that Machiavelli wrote *The Prince* during a crisis of institutional legitimacy in Italy, where constant papal interference and broken city-state continuity created chaos. His infamous advice was shaped by firsthand experience with Cesare Borgia, against whom Florence's only survival strategy was calculated submission—buying time through abject loyalty until fortune (in the form of a pope's death) intervened.

18 days ago · 10 points

Sarah Paine - Why Russia and China can't escape geography

Dwarkesh Patel

Sarah Paine - Why Russia and China can't escape geography

Sarah Paine argues that geography fundamentally constrains Russia and China to remain continental 'elephants' dependent on land armies and territorial expansion, lacking the geographic moats, sea access, and institutional stability required to become maritime 'whales' regardless of their ambitions.

25 days ago · 10 points

What remains scarce after AGI? – Alex Imas and Phil Trammell

Dwarkesh Patel

What remains scarce after AGI? – Alex Imas and Phil Trammell

Alex Imas and Phil Trammell analyze what remains scarce after AGI, arguing that while a 'relational sector' where humans provide intrinsic value may persist, increasing variety in capital goods could cause labor share to collapse to zero unless we collect critical data on consumer preferences for human involvement.

about 1 month ago · 10 points

Browse more: 🎙️ Podcasts All Videos All Categories