Home
Categories
EXPLORE
True Crime
Comedy
Society & Culture
Business
Sports
Technology
News
About Us
Contact Us
Copyright
© 2024 PodJoint
Podjoint Logo
US
00:00 / 00:00
Sign in

or

Don't have an account?
Sign up
Forgot password
https://is1-ssl.mzstatic.com/image/thumb/Podcasts211/v4/9e/1d/83/9e1d8392-fad9-ed9f-e5c4-b9298a984ca7/mza_14788877948767378709.jpg/600x600bb.jpg
TalkRL: The Reinforcement Learning Podcast
Robin Ranjit Singh Chauhan
72 episodes
21 hours ago
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
Show more...
Technology
RSS
All content for TalkRL: The Reinforcement Learning Podcast is the property of Robin Ranjit Singh Chauhan and is served directly from their servers with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
Show more...
Technology
https://img.transistor.fm/cEMx1HAZcVbookQ_dS9oK75DYmZFQrjN3IWBORuaqCo/rs:fill:3000:3000:1/q:60/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS8wNjE1/ZGUwNjgwNWI0ZWE5/OGQyNDQ5MjE4NTU1/MDEzZS5qcGVn.jpg
Abhishek Naik on Continuing RL & Average Reward
TalkRL: The Reinforcement Learning Podcast
1 hour 21 minutes
8 months ago
Abhishek Naik on Continuing RL & Average Reward

Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton.  Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications. 

Featured References 

Reinforcement Learning for Continuing Problems Using Average Reward
Abhishek Naik Ph.D. dissertation 2024 

Reward Centering
Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024   

Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan, Abhishek Naik, Richard S. Sutton 2020 

Discounted Reinforcement Learning Is Not an Optimization Problem 
Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019  


Additional References 

  • Explaining dopamine through prediction errors and beyond, Gershman et al 2024 (proposes Differential-TD-like learning mechanism in the brain around Box 4)  


TalkRL: The Reinforcement Learning Podcast
TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.