David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero, and co-lead on AlphaStar and MuZero, along with a lot of other important work in reinforcement learning.
Support this podcast by signing up with these sponsors:
- MasterClass: https://masterclass.com/lex
- Cash App - use code "LexPodcast" and download:
- Cash App (App Store): https://apple.co/2sPrUHe
- Cash App (Google Play): https://bit.ly/2MlvP5w
EPISODE LINKS:
- Reinforcement learning (book): https://amzn.to/2Jwp5zG
This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.
Here's the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.
OUTLINE:
00:00 - Introduction
04:09 - First program
11:11 - AlphaGo
21:42 - Rules of the game of Go
25:37 - Reinforcement learning: personal journey
30:15 - What is reinforcement learning?
43:51 - AlphaGo (continued)
53:40 - Supervised learning and self play in AlphaGo
1:06:12 - Lee Sedol's retirement from Go play
1:08:57 - Garry Kasparov
1:14:10 - AlphaZero and self play
1:31:29 - Creativity in AlphaZero
1:35:21 - AlphaZero applications
1:37:59 - Reward functions
1:40:51 - Meaning of life

https://lexfridman.com/david-silver/?utm_source=rss&utm_medium=rss&utm_campaign=david-silver