Why AI Chess Bots Are Virtually Unbeatable (ft. GothamChess)
The chart depicting minimax with alpha-beta pruning was created by Wikipedia user Maschelos and is licensed under the Creative Commons Attribution-Share Alike 3.0 license.
Director: Lisandro Perez-Rey
Director of Photography: Francis Bernal
Editor: Paul Isakson
Talent: Gary Linscott; Levy Rozman
Line Producer: Joseph Buscemi
Associate Producer: Paul Gulyas; Brandon White
Production Manager: D. Eric Martinez
Production Coordinator: Fernando Davila
Camera Operator: Brittany Berger
Gaffer: Mar Alfonso
Sound Mixer: Michael Guggino
Production Assistant: Albie Smith
Post Production Supervisor: Alexa Deutsch
Post Production Coordinator: Ian Bryant
Supervising Editor: Doug Larsen
Assistant Editor: Andy Morell
Released on 12/08/2023
I'm about to play Stockfish 16,
the strongest chess computer in the world.
And we are going to see how long I can survive.
What?
What the [beep] is that? Yeah, I'm fine. I'm not fine.
Queen takes Bishop, Checkmate.
That was horrible.
I never want to do that again. I got Checkmated in 34 moves.
The most important thing is I had a lot of fun. No I didn't.
My name is Levy Rozman.
I wanna find out a little bit more about
how Stockfish thinks so I can use its tricks
to beat other humans.
Today we'll talk to Gary Linscott, software engineer
who has worked in computer chess for over 20 years.
So Gary, you worked on Stockfish.
What exactly makes it so good?
Stockfish really takes advantage
of super powerful computers.
It searches tens of millions of positions per second.
So if you give it 30 seconds,
it's gonna be searching upwards of a billion positions
and that's more positions than a human
will see in their entire lifetime.
I could live a hundred lifetimes.
I would not be able to win this game.
Not only that, the evaluation function it uses,
which is a way of assessing the board,
is by itself probably stronger than almost all the
chess players in the world.
Your chess Elo is basically a number
that quantifies your skill level
and the highest ever Elo achieved
by a human is by Magnus Carlsen.
My Elo right now is 2,322.
What exactly is Stockfish's Elo right now?
It's probably greater than 3,500.
I never had a chance.
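To put those two numbers side by side, here is a minimal sketch of the standard Elo expected-score formula. It assumes both ratings sit on the same scale and treats "greater than 3,500" as exactly 3,500, which the interview does not claim, so the result is only a rough illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score for player A (win = 1, draw = 0.5, loss = 0)
    under the standard Elo formula."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


levy = 2322        # rating quoted in the interview
stockfish = 3500   # assumption: "probably greater than 3,500" taken as exactly 3,500

print(f"Expected score for Levy: {expected_score(levy, stockfish):.4f}")
```

Under these assumptions the expected score comes out on the order of one point per thousand games.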
To understand how Stockfish actually thinks,
we should start at the very beginning of the game.
Most chess openings are subjective for humans.
A human will choose an opening
because they get more fun out of it.
You might like something because it gives you a big center
or very active pieces.
That's how most players choose their openings.
In my game against Stockfish, I played Pawn to D4.
Let's see which opening Stockfish plays
and it responded Knight to F6.
Gary, so the way humans learn chess openings, we read books,
we create chess files and we try to memorize them
'cause we can't access them during tournament games.
That would be like cheating on a school test.
An opening book in chess is a gigantic chess database.
Does Stockfish have a chess opening opinion as early
as the second or even first move?
The way that it thinks about the opening is
really different than a human.
It is not using chess theory.
It very rarely assesses an opening position.
What it's assessing is the resulting structures
that might evolve from that opening,
but way, way down the game tree, it's already assessing
the endgame right from the get-go.
So on its second move, Stockfish played D5,
which is a very popular move.
And here I made a decision to capture this Knight.
Humans might see doubled pawns and feel unpleasant.
They might see a weakness in their position.
Does Stockfish have these little buzzwords like weakness
or inactive piece?
What actually goes into that?
The evaluation function is trained off of a huge library
of chess games that have been played.
And so it kind of absorbs that knowledge.
It's just all going into a giant neural net.
And then you can train your evaluation function
to learn what's good and what's bad.
And then that network runs when Stockfish is searching
the position, and Stockfish uses it to navigate the opening.
So to succeed in our quest
of understanding Stockfish a little bit better,
let's dive into middlegames.
Most middlegames begin around move 10 to 15.
It's the stage right
after the opening when you've gotten all your pieces out.
You can think of the middlegame in chess as a
gigantic ocean, this massive tree of possibility.
So in my middlegame against Stockfish,
Pawn to C4 is what I played.
Stockfish played a move
that I don't think any human being
would play here against me.
The absurd pawn to G5, which made me audibly gasp.
Ho, ho, ho, ho. What is that? Oh my God.
Is that actually the best move in the position?
That move violates most human chess principles
that we have been brought up with.
You shouldn't push pawns in front of your king.
You should not weaken an area of the board
where you can't claim any advantage.
How does Stockfish break these age-old principles?
In this case, it might just be that the move is too
far past kind of the human frontier to be a reasonable one.
It's gonna assess every possible
move and it's gonna rank them.
It all comes down to a single number,
which is what's the likelihood
that I'm gonna win from this position?
Stockfish doesn't have emotions
and opinions, it will just play the best move
that it thinks exists.
Yes, Stockfish definitely does not
emotionally consider it.
It is assessing with, like, grandmaster-level quality,
but it then searches 50 moves, 60 moves into the future
to evaluate what is the best possible move.
I'm curious, like as an incredibly strong human,
how many moves do you search into the future?
I didn't realize this was the roast of Levy Rozman.
Sometimes I can go 10 moves, if it's an endgame position
and I can kind of identify the forcing moves for both sides.
Sometimes I'm stuck in a middlegame
and I'm already indecisive.
It's like being at a restaurant.
You have three good options.
They're all very difficult to evaluate.
And I'll flip a coin,
but that doesn't work against Stockfish.
I counted, in this position, black has 41 legal moves.
How does Stockfish know which of those to scrub
and ultimately decide on one
if three or four look really, really good?
It builds a game tree
and that game tree is gonna have all the legal moves.
It's gonna rank all of those moves.
It takes one step forward at a time.
It'll search two moves ahead,
then three moves ahead, then four moves ahead.
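The step-by-step deepening Gary describes is usually called iterative deepening. The sketch below is a toy illustration, not Stockfish's code: the `Node` class, the move names, and the evaluation numbers are all made up for the example, and scores are from the point of view of the player choosing at the root.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    static_eval: int                      # hypothetical score, from the root player's view
    children: list["Node"] = field(default_factory=list)


def minimax(node: Node, depth: int, maximizing: bool) -> int:
    """Depth-limited minimax: fall back to the static score at the horizon."""
    if depth == 0 or not node.children:
        return node.static_eval
    scores = [minimax(c, depth - 1, not maximizing) for c in node.children]
    return max(scores) if maximizing else min(scores)


def iterative_deepening(root: Node, max_depth: int) -> list[tuple[int, str]]:
    """Search depth 1, then 2, ... and record the preferred move at each depth."""
    history = []
    for depth in range(1, max_depth + 1):
        # After our move it is the opponent's turn, so the child is a minimizing node.
        best = max(root.children, key=lambda c: minimax(c, depth - 1, False))
        history.append((depth, best.name))
    return history


# A capture that looks good one move deep but loses to a recapture.
root = Node("start", 0, [
    Node("capture", 3, [Node("recapture", -5)]),
    Node("develop", 1, [Node("reply", 1)]),
])

print(iterative_deepening(root, 2))
```

In this toy tree the shallow search prefers the capture and the deeper search switches to the quieter move, which is the kind of correction each extra step of depth is meant to provide.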
[Levy] Three or four moves speak to me
in about five seconds
and that's probably why I'm a good speed chess player.
But deciding on the best move,
sometimes I need to spend 5 minutes, 10 minutes
uncovering the truth about one of those moves.
Chess computers actually do a similar sort of process.
Stockfish actually only looks at about two moves per
position, and that's what's called the alpha-beta search
technique, which is what Stockfish
and most other engines are built on top of.
Alpha-beta allows the engine to eliminate many,
many moves from a position because it knows that they are
worse than the best move that it's found so far.
And that allows alpha-beta
to prune a huge amount of the search tree.
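A minimal sketch of alpha-beta pruning on a toy tree may make this concrete. The tree is a hypothetical nested list whose leaves are made-up evaluation scores; a real engine generates moves and evaluates positions instead.

```python
import math


def alphabeta(node, alpha, beta, maximizing, visited):
    """Minimax with alpha-beta pruning over a nested-list tree.

    Leaves are ints (scores for the maximizing player). Every leaf that
    actually gets evaluated is appended to `visited`.
    """
    if isinstance(node, int):
        visited.append(node)
        return node
    if maximizing:
        best = -math.inf
        for child in node:
            best = max(best, alphabeta(child, alpha, beta, False, visited))
            alpha = max(alpha, best)
            if alpha >= beta:   # opponent already has a better option elsewhere
                break
        return best
    best = math.inf
    for child in node:
        best = min(best, alphabeta(child, alpha, beta, True, visited))
        beta = min(beta, best)
        if alpha >= beta:       # we already have a better option elsewhere
            break
    return best


# Three candidate moves, each with two possible replies.
tree = [[3, 5], [2, 9], [0, 7]]
visited = []
value = alphabeta(tree, -math.inf, math.inf, True, visited)

print(value, visited)
```

Once the first move is known to guarantee a score of 3, the second and third moves are abandoned as soon as one reply scores lower than 3, so only four of the six leaves are ever evaluated.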
For example, if it was white's move,
one of the legal moves is Knight to E4 which is really,
really stupid because the Pawn would take the Knight
and white would be in a catastrophically worse position.
Some computers will just be able to discard this right away
and humans will too.
Alpha-beta by itself can take those 35 moves that you have
to look at on average down to about 15 or so,
which is a huge reduction.
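A rough back-of-the-envelope calculation shows why that reduction matters so much. The sketch assumes a uniform tree in which every position has the same number of moves, which real chess does not; the figures 35, 15, and 2 are the ones quoted in the interview, and the depth of 6 is an arbitrary choice for illustration.

```python
def positions_searched(moves_per_position: int, depth: int) -> int:
    """Leaf positions in a uniform tree: moves_per_position ** depth."""
    return moves_per_position ** depth


DEPTH = 6  # arbitrary illustrative depth, counted in single moves by either side

for label, moves in [("all legal moves", 35),
                     ("alpha-beta alone", 15),
                     ("about two moves per position", 2)]:
    print(f"{label:>30}: {positions_searched(moves, DEPTH):,}")
```

At depth 6 that is roughly 1.8 billion positions with 35 moves each, about 11 million with 15, and 64 with 2, and the gap widens with every additional move of depth.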
The last stage of chess is the endgame.
Is there any difference in the way
that Stockfish approaches the endgame
versus the middlegame or the opening?
Yes. Once the number of pieces goes
below a certain amount, seven,
then Stockfish can actually solve the game perfectly.
Chess is solved if there are seven pieces
remaining on a chessboard or less.
Exactly. Yeah.
There's what are called endgame tablebases.
And by solved we mean literally every combination
of moves possible.
If the Bishop goes here and the Knight goes here,
or if the Bishop goes to that square originally
and the Knight goes there or there or there
or any of these combinations,
all of that is solved.
Literally Stockfish just has
to look up the position in its database
and it knows exactly what the answer is.
Even for seven pieces, it's only about 10 to 20 terabytes
of data, which is a lot.
But you know, manageable.
There is no more evaluating.
A best move is the best move
and a position is either winning,
a draw, or losing, and that's it.
It's completely non-negotiable.
Exactly. Now the chess computers are capable
of playing without the tablebases as well,
and they will still play incredibly strong chess there.
But they could make a mistake.
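The idea of a solved position as a pure lookup can be sketched with a much smaller game. The example below is a toy stand-in, not a chess tablebase: it uses a hypothetical stone-taking game (take 1 to 3 stones, whoever takes the last stone wins), which has no draws, and builds the full table by working backward from the finished game.

```python
def build_table(max_stones: int, max_take: int = 3) -> dict:
    """Map each pile size to (outcome, best_move, plies_to_end) for the side to move."""
    # With no stones left, the side to move has already lost.
    table = {0: ("loss", None, 0)}
    for n in range(1, max_stones + 1):
        best = None
        for take in range(1, min(max_take, n) + 1):
            outcome, _, dist = table[n - take]
            if outcome == "loss":
                # Opponent is lost after this move: prefer the quickest win.
                cand = ("win", take, dist + 1)
                if best is None or best[0] != "win" or cand[2] < best[2]:
                    best = cand
            elif best is None or (best[0] == "loss" and dist + 1 > best[2]):
                # No win found so far: lose as slowly as possible.
                best = ("loss", take, dist + 1)
        table[n] = best
    return table


table = build_table(20)

# "Playing" a solved position is just a dictionary lookup.
outcome, move, plies = table[10]
print(f"10 stones: {outcome}, take {move}, game ends in {plies} plies")
```

Each entry stores the result, a best move, and the distance to the end of the game, which is the same kind of information behind an announcement like "forced checkmate in 35 moves."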
Gary, I lost to Stockfish.
I got checkmated in 34 moves,
so we didn't make it into an endgame.
All right.
But that's just as well.
Stockfish always knows how
to close out a game in the least amount of moves,
even when the world's greatest players don't.
I have a game here, Magnus Carlsen versus Fabiano Caruana
from the 2018 World Chess Championship.
Magnus was in an endgame where he was down a piece,
but neither one of them found the right technique.
Fabiano has a Bishop and a Knight,
and Magnus only has a Bishop.
Fabiano could not convert despite his extra material
because it looked like Magnus had a defensive fortress.
The game ended in a draw.
While the players were playing
and the entire world was watching,
Stockfish was screaming at them from the digital cyberspace
going, "You idiots, forced checkmate in 35 moves."
That is ridiculous.
Like, how could it have possibly known
that if both sides made a move
for 35 moves, black would win?
What Stockfish can do is look forward into the tablebases,
and on every path to the tablebases,
the best that you can possibly do is that mate in 35.
If Stockfish was playing black here instead of Fabiano,
it would've played Bishop to H4,
then bring its Knight here
and then trap its own Knight on the edge of the board.
So the Knight now cannot escape anywhere
because white's bishop would take it.
No human being would play chess like this.
You cannot trap your Knight on the edge of the board,
but apparently white actually runs out of moves first
and slowly but surely black would've won that endgame.
Stockfish saw that from a distance.
No human on the planet saw that.
So Stockfish is a very, very specialized AI.
How does it compare to some other AIs,
autonomous cars or something else?
Stockfish is only good at one thing
and it's super good at that one thing,
which is playing chess.
Chess engines have now borrowed the state
of the art research in artificial intelligence,
which is deep learning.
However, they're still fundamentally limited
to the domain of chess.
Chess has, I think, been opened up, in a way, by AI.
People can leverage the chess engines
to improve their own skills,
and that's kind of like the ideal use case
for AI technologies, is that they help humans,
you know, do things better.
I agree with Gary.
I think AI has been a net benefit for chess.
I think it's helping people improve at a rate
like we've never experienced.
We do obviously have to tackle the issue
of potential cheating,
but I'm gonna stay positive and optimistic.
Anyway, now that I know a little bit more about Stockfish,
I'm gonna go have my rematch.