David Abel is a Senior Research Scientist at DeepMind on the Agency team, and an Honorary Fellow at the University of Edinburgh. His research blends computer science and philosophy, exploring foundational questions about reinforcement learning, definitions, and the nature of agency.
Featured References
Plasticity as the Mirror of Empowerment
David Abel, Michael Bowling, André Barreto, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh
A Definition of Continual RL
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh
Agency is Frame-Dependent
David Abel, André Barreto, Michael Bowling, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh
On the Expressivity of Markov Reward
David Abel, Will Dabney, Anna Harutyunyan, Mark Ho, Michael Littman, Doina Precup, Satinder Singh — Outstanding Paper Award, NeurIPS 2021
Additional References
Recorded at Reinforcement Learning Conference 2025 at University of Alberta, Edmonton Alberta Canada.
Featured References
Lecture on the Oak Architecture, Rich Sutton
Alberta Plan, Rich Sutton with Mike Bowling and Patrick Pilarski
Additional References
We caught up with the RLC Outstanding Paper award winners for your listening pleasure.
Recorded on location at Reinforcement Learning Conference 2025, at University of Alberta, in Edmonton Alberta Canada in August 2025.
Featured References
Empirical Reinforcement Learning Research
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain, Norio Kosaka, Xinhu Li, Kyung-Min Kim, Erdem Biyik, Joseph J Lim
Applications of Reinforcement Learning
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies
William Solow, Sandhya Saisubramanian, Alan Fern
Emerging Topics in Reinforcement Learning
Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners
Calarina Muslimani, Kerrick Johnstonbaugh, Suyog Chandramouli, Serena Booth, W. Bradley Knox, Matthew E. Taylor
Scientific Understanding in Reinforcement Learning
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean, Evangelos Chatzaroulas, J K Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro
We caught up with the RLC Outstanding Paper award winners for your listening pleasure.
Recorded on location at Reinforcement Learning Conference 2025, at University of Alberta, in Edmonton Alberta Canada in August 2025.
Featured References
Scientific Understanding in Reinforcement Learning
How Should We Meta-Learn Reinforcement Learning Algorithms?
Alexander David Goldie, Zilin Wang, Jakob Nicolaus Foerster, Shimon Whiteson
Tooling, Environments, and Evaluation for Reinforcement Learning
Syllabus: Portable Curricula for Reinforcement Learning Agents
Ryan Sullivan, Ryan Pégoud, Ameen Ur Rehman, Xinchen Yang, Junyun Huang, Aayush Verma, Nistha Mitra, John P Dickerson
Resourcefulness in Reinforcement Learning
PufferLib 2.0: Reinforcement Learning at 1M steps/s
Joseph Suarez
Theory of Reinforcement Learning
Deep Reinforcement Learning with Gradient Eligibility Traces
Esraa Elelimy, Brett Daley, Andrew Patterson, Marlos C. Machado, Adam White, Martha White
Prof Thomas Akam is a Neuroscientist at the Oxford University Department of Experimental Psychology. He is a Wellcome Career Development Fellow and Associate Professor at the University of Oxford, and leads the Cognitive Circuits research group.
Featured References
Brain Architecture for Adaptive Behaviour
Thomas Akam, RLDM 2025 Tutorial
Additional References
Stefano V. Albrecht was previously Associate Professor at the University of Edinburgh, and is currently serving as Director of AI at startup Deepflow. He is a Program Chair of RLDM 2025 and is co-author of the MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches".
Featured References
Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer
MIT Press, 2024
RLDM 2025: Reinforcement Learning and Decision Making Conference
Dublin, Ireland
EPyMARL: Extended Python MARL framework
https://github.com/uoe-agents/epymarl
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis and Filippos Christianos and Lukas Schäfer and Stefano V. Albrecht
Professor Satinder Singh of Google DeepMind and U of Michigan is co-founder of RLDM. Here he narrates the origin story of the Reinforcement Learning and Decision Making meeting (not conference).
Recorded on location at Trinity College Dublin, Ireland during RLDM 2025.
Featured References
RLDM 2025: Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)
June 11-14, 2025 at Trinity College Dublin, Ireland
Satinder Singh on Google Scholar
Posters and Hallway episodes are short interviews and poster summaries. Recorded at NeurIPS 2024 in Vancouver BC Canada.
Featuring
Posters and Hallway episodes are short interviews and poster summaries. Recorded at NeurIPS 2024 in Vancouver BC Canada.
Featuring
Posters and Hallway episodes are short interviews and poster summaries. Recorded at NeurIPS 2024 in Vancouver BC Canada.
Featuring
Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton. Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications.
Featured References
Reinforcement Learning for Continuing Problems Using Average Reward
Abhishek Naik Ph.D. dissertation 2024
Reward Centering
Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024
Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan, Abhishek Naik, Richard S. Sutton 2020
Discounted Reinforcement Learning Is Not an Optimization Problem
Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019
Additional References
What do RL researchers complain about after hours at the bar? In this "Hot takes" episode, we find out!
Recorded at The Pearl in downtown Vancouver, during the RL meetup after a day of Neurips 2024.
Special thanks to "David Beckham" for the inspiration :)
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA.
Featuring:
Finale Doshi-Velez is a Professor at the Harvard Paulson School of Engineering and Applied Sciences.
This off-the-cuff interview was recorded at UMass Amherst during the workshop day of RL Conference on August 9th 2024.
Host notes: I've been a fan of some of Prof Doshi-Velez' past work on clinical RL and hoped to feature her for some time now, so I jumped at the chance to get a few minutes of her thoughts -- even though you can tell I was not prepared and a bit flustered tbh. Thanks to Prof Doshi-Velez for taking a moment for this, and I hope to cross paths in future for a more in depth interview.
References
Thanks to Professor Silver for permission to record this discussion after his RLC 2024 keynote lecture.
Recorded at UMass Amherst during RCL 2024.
Due to the live recording environment, audio quality varies. We publish this audio in its raw form to preserve the authenticity and immediacy of the discussion.
References
David Silver is a principal research scientist at DeepMind and a professor at University College London.
This interview was recorded at UMass Amherst during RLC 2024.
References