32 min listen
LM101-062: How to Transform a Supervised Learning Machine into a Value Function Reinforcement Learning Machine
LM101-062: How to Transform a Supervised Learning Machine into a Value Function Reinforcement Learning Machine
ratings:
Length:
31 minutes
Released:
Mar 19, 2017
Format:
Podcast episode
Description
This 62nd episode of Learning Machines 101 (www.learningmachines101.com) discusses how to design reinforcement learning machines using your knowledge of how to build supervised learning machines! Specifically, we focus on Value Function Reinforcement Learning Machines which estimate the unobservable total penalty associated with an episode when only the beginning of the episode is observable. This estimated Value Function can then be used by the learning machine to select a particular action in a given situation to minimize the total future penalties that will be received. Applications include: building your own robot, building your own automatic aircraft lander, building your own automated stock market trading system, and building your own self-driving car!!
Released:
Mar 19, 2017
Format:
Podcast episode
Titles in the series (85)
LM101-005: How to Decide if a Machine is Artificially Intelligent (The Turing Test): Episode Summary: This episode we discuss the Turing Test for Artificial Intelligence which is designed to determine if the behavior of a computer is indistinguishable from the behavior of a thinking human being. The chatbot A.L.I.C.E. by Learning Machines 101