Rollout, Policy Iteration, and Distributed Reinforcement Learning

Rollout, Policy Iteration, and Distributed Reinforcement Learning
Author :
Publisher : Athena Scientific
Total Pages : 498
Release :
ISBN-10 : 9781886529076
ISBN-13 : 1886529078
Rating : 4/5 (078 Downloads)

Book Synopsis Rollout, Policy Iteration, and Distributed Reinforcement Learning by : Dimitri Bertsekas

Download or read book Rollout, Policy Iteration, and Distributed Reinforcement Learning written by Dimitri Bertsekas and published by Athena Scientific. This book was released on 2021-08-20 with total page 498 pages. Available in PDF, EPUB and Kindle. Book excerpt: The purpose of this book is to develop in greater depth some of the methods from the author's Reinforcement Learning and Optimal Control recently published textbook (Athena Scientific, 2019). In particular, we present new research, relating to systems involving multiple agents, partitioned architectures, and distributed asynchronous computation. We pay special attention to the contexts of dynamic programming/policy iteration and control theory/model predictive control. We also discuss in some detail the application of the methodology to challenging discrete/combinatorial optimization problems, such as routing, scheduling, assignment, and mixed integer programming, including the use of neural network approximations within these contexts. The book focuses on the fundamental idea of policy iteration, i.e., start from some policy, and successively generate one or more improved policies. If just one improved policy is generated, this is called rollout, which, based on broad and consistent computational experience, appears to be one of the most versatile and reliable of all reinforcement learning methods. In this book, rollout algorithms are developed for both discrete deterministic and stochastic DP problems, and the development of distributed implementations in both multiagent and multiprocessor settings, aiming to take advantage of parallelism. Approximate policy iteration is more ambitious than rollout, but it is a strictly off-line method, and it is generally far more computationally intensive. This motivates the use of parallel and distributed computation. One of the purposes of the monograph is to discuss distributed (possibly asynchronous) methods that relate to rollout and policy iteration, both in the context of an exact and an approximate implementation involving neural networks or other approximation architectures. Much of the new research is inspired by the remarkable AlphaZero chess program, where policy iteration, value and policy networks, approximate lookahead minimization, and parallel computation all play an important role.


Rollout, Policy Iteration, and Distributed Reinforcement Learning Related Books

Rollout, Policy Iteration, and Distributed Reinforcement Learning
Language: en
Pages: 498
Authors: Dimitri Bertsekas
Categories: Computers
Type: BOOK - Published: 2021-08-20 - Publisher: Athena Scientific

DOWNLOAD EBOOK

The purpose of this book is to develop in greater depth some of the methods from the author's Reinforcement Learning and Optimal Control recently published text
Reinforcement Learning and Optimal Control
Language: zh-CN
Pages: 373
Authors: Dimitri P. Bertsekas
Categories: Artificial intelligence
Type: BOOK - Published: 2020 - Publisher:

DOWNLOAD EBOOK

Reinforcement Learning and Dynamic Programming Using Function Approximators
Language: en
Pages: 280
Authors: Lucian Busoniu
Categories: Computers
Type: BOOK - Published: 2017-07-28 - Publisher: CRC Press

DOWNLOAD EBOOK

From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control the
Efficient Reinforcement Learning Using Gaussian Processes
Language: en
Pages: 226
Authors: Marc Peter Deisenroth
Categories: Electronic computers. Computer science
Type: BOOK - Published: 2010 - Publisher: KIT Scientific Publishing

DOWNLOAD EBOOK

This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference in nonlinear dynamic systems.First, we introduce PILCO, a fu
Convex Optimization Theory
Language: en
Pages: 256
Authors: Dimitri Bertsekas
Categories: Mathematics
Type: BOOK - Published: 2009-06-01 - Publisher: Athena Scientific

DOWNLOAD EBOOK

An insightful, concise, and rigorous treatment of the basic theory of convex sets and functions in finite dimensions, and the analytical/geometrical foundations