Tic-Tac-Toe the Hard Way

Share on

A writer and a software engineer from Google's People + AI Research team explore the human choices that shape machine learning systems by building competing tic-tac-toe agents.

People + AI Research

Jul 22, 2020 LATEST EPISODE
infrequent NEW EPISODES
21m AVG DURATION
10 EPISODES

Search for episodes from Tic-Tac-Toe the Hard Way with a specific topic:

All Episodes

Latest episodes from Tic-Tac-Toe the Hard Way

Lessons learned

Play Episode Listen Later Jul 22, 2020 33:01

What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning — also known as the world — works?

google games ai lessons learned machine learning ml reinforcement learning tic tac toe supervised learning

Head to Head: The Even Bigger ML Smackdown!

Play Episode Listen Later Jul 22, 2020 24:26

Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results?

google games ai machine learning smackdown ml yannick head to head reinforcement learning tic tac toe supervised learning

Enter tic-tac-two

Play Episode Listen Later Jul 22, 2020 21:20

David’s variant of tic-tac-toe that we’re calling tic-tac-two is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn how to play, and how to represent that data.

google games ai machine learning ml tic tac reinforcement learning tic tac toe supervised learning

Head to Head: the Big ML Smackdown!

Play Episode Listen Later Jul 22, 2020 25:19

David and Yannick’s tic-tac-toe ML agents face-off against each other in tic-tac-toe!

google games ai machine learning smackdown ml yannick head to head reinforcement learning tic tac toe supervised learning

Give that model a treat! : Reinforcement learning explained

Play Episode Listen Later Jul 22, 2020 26:04

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

google games ai model treat machine learning switching ml yannick reinforcement learning tic tac toe supervised learning

Beating random: What it means to have trained a model

Play Episode Listen Later Jul 22, 2020 17:14

David did it! He trained a machine learning model to play tic-tac-toe! How did his model do against a player that makes random tic-tac-toe moves?

google games ai model beating machine learning trained ml reinforcement learning tic tac toe supervised learning

From tic-tac-toe moves to ML model

Play Episode Listen Later Jul 22, 2020 21:37

Once we have the data we need—thousands of sample games—how do we turn it into something the ML can train itself on? That means understanding how training works, and what a model is.

google games ai model machine learning ml reinforcement learning tic tac toe supervised learning

What does a tic-tac-toe board look like to machine learning?

Play Episode Listen Later Jul 22, 2020 23:26

David delves into questions around data and training for his model including: What does a tic-tac-toe board “look” like to ML? Plus, an intro to reinforcement learning, the approach Yannick will be taking.

google games ai board machine learning ml yannick reinforcement learning tic tac toe supervised learning

Howdy, and the myth of “pouring in data”

Play Episode Listen Later Jul 21, 2020 22:01

David and Yannick get started on their project to build competing machine learning systems that play tic-tac-toe. They discuss the human choices that will shape their systems along the way.

google games ai data myth machine learning ml howdy pouring yannick reinforcement learning tic tac toe supervised learning

Introducing Tic-Tac-Toe the Hard Way

Play Episode Listen Later Jul 21, 2020 2:09

Introducing the podcast where a writer and a software engineer explore the human choices that shape machine learning systems by building competing tic-tac-toe agents. Brought to you by Google's People + AI Research team.

google games ai machine learning ml hard way reinforcement learning tic tac toe supervised learning

Claim Tic-Tac-Toe the Hard Way

In order to claim this podcast we'll send an email to with a verification link. Simply click the link and you will be able to edit tags, request a refresh, and other features to take control of your podcast page!

Claim Cancel