141 subscribers
Go offline with the Player FM app!
LM101-062: How to Transform a Supervised Learning Machine into a Value Function Reinforcement Learning Machine
Manage episode 230297540 series 2497400
This 62nd episode of Learning Machines 101 (www.learningmachines101.com) discusses how to design reinforcement learning machines using your knowledge of how to build supervised learning machines! Specifically, we focus on Value Function Reinforcement Learning Machines which estimate the unobservable total penalty associated with an episode when only the beginning of the episode is observable. This estimated Value Function can then be used by the learning machine to select a particular action in a given situation to minimize the total future penalties that will be received. Applications include: building your own robot, building your own automatic aircraft lander, building your own automated stock market trading system, and building your own self-driving car!!
85 episodes
Manage episode 230297540 series 2497400
This 62nd episode of Learning Machines 101 (www.learningmachines101.com) discusses how to design reinforcement learning machines using your knowledge of how to build supervised learning machines! Specifically, we focus on Value Function Reinforcement Learning Machines which estimate the unobservable total penalty associated with an episode when only the beginning of the episode is observable. This estimated Value Function can then be used by the learning machine to select a particular action in a given situation to minimize the total future penalties that will be received. Applications include: building your own robot, building your own automatic aircraft lander, building your own automated stock market trading system, and building your own self-driving car!!
85 episodes
All episodes
×

1 LM101-086: Ch8: How to Learn the Probability of Infinitely Many Outcomes 35:29


1 LM101-085:Ch7:How to Guarantee your Batch Learning Algorithm Converges 30:51


1 LM101-084: Ch6: How to Analyze the Behavior of Smart Dynamical Systems 33:13


1 LM101-083: Ch5: How to Use Calculus to Design Learning Machines 34:22


1 LM101-082: Ch4: How to Analyze and Design Linear Machines 29:05


1 LM101-081: Ch3: How to Define Machine Learning (or at Least Try) 37:20


1 LM101-080: Ch2: How to Represent Knowledge using Set Theory 31:43


1 LM101-079: Ch1: How to View Learning as Risk Minimization 26:07


1 LM101-078: Ch0: How to Become a Machine Learning Expert 39:18


1 LM101-077: How to Choose the Best Model using BIC 24:15


1 LM101-076: How to Choose the Best Model using AIC and GAIC 28:17


1 LM101-075: Can computers think? A Mathematician's Response (remix) 36:26


1 LM101-074: How to Represent Knowledge using Logical Rules (remix) 19:22


1 LM101-073: How to Build a Machine that Learns to Play Checkers (remix) 24:58


1 LM101-072: Welcome to the Big Artificial Intelligence Magic Show! (Remix of LM101-001 and LM101-002) 22:07
Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.