Bayesian Reinforcement Learning: A Survey

Ghavamzadeh, Mohammad; Mannor, Shie; Pineau, Joelle; Tamar, Aviv

doi:10.1561/2200000049

Article navigation

Research Article| November 26 2015

Bayesian Reinforcement Learning: A Survey

Mohammad Ghavamzadeh;

Mohammad Ghavamzadeh

Adobe Research & INRIA

Search for other works by this author on:

This Site

PubMed

Google Scholar

Shie Mannor;

Shie Mannor

Technion

Search for other works by this author on:

This Site

PubMed

Google Scholar

Joelle Pineau;

Joelle Pineau

McGill University

Search for other works by this author on:

This Site

PubMed

Google Scholar

Aviv Tamar

University of California, Berkeley

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Online ISSN: 1935-8245

Print ISSN: 1935-8237

2015

M. Ghavamzadeh, S. Mannor, J. Pineau, and A. Tamar

Licensed re-use rights only

Foundations and Trends in Machine Learning (2015) 8 (5-6): 359–483.

https://doi.org/10.1561/2200000049

Bayesian methods for machine learning have been widely investigated, yielding principled methods for incorporating prior information into inference algorithms. In this survey, we provide an in-depth review of the role of Bayesian methods for the reinforcement learning (RL) paradigm. The major incentives for incorporating Bayesian reasoning in RL are: 1) it provides an elegant approach to action-selection (exploration/exploitation) as a function of the uncertainty in learning; and 2) it provides a machinery to incorporate prior knowledge into the algorithms. We first discuss models and methods for Bayesian inference in the simple single-step Bandit model. We then review the extensive recent literature on Bayesian methods for model-based RL, where prior information can be expressed on the parameters of the Markov model. We also present Bayesian methods for model-free RL, where priors are expressed over the value function or policy class. The objective of the paper is to provide a comprehensive survey on Bayesian RL algorithms and their theoretical and empirical properties.

2015

M. Ghavamzadeh, S. Mannor, J. Pineau, and A. Tamar

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

Bayesian Reinforcement Learning: A Survey

Email Alerts

Cited By

Bayesian Reinforcement Learning: A Survey

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable