Mahadaven | Learning Representation and Control in Markov Decision Processes | Buch | 978-1-60198-238-4 | www.sack.de

Buch, Englisch, Band 3, 176 Seiten, Format (B × H): 156 mm x 234 mm

Reihe: Foundations and Trends® in Machine Learning

Mahadaven

Learning Representation and Control in Markov Decision Processes

New Frontiers
1. Auflage 2009
ISBN: 978-1-60198-238-4
Verlag: Now Publishers

New Frontiers

Buch, Englisch, Band 3, 176 Seiten, Format (B × H): 156 mm x 234 mm

Reihe: Foundations and Trends® in Machine Learning

ISBN: 978-1-60198-238-4
Verlag: Now Publishers


Learning Representation and Control in Markov Decision Processes describes methods for automatically compressing Markov decision processes (MDPs) by learning a low-dimensional linear approximation defined by an orthogonal set of basis functions. A unique feature of the text is the use of Laplacian operators, whose matrix representations have non-positive off-diagonal elements and zero row sums. The generalized inverses of Laplacian operators, in particular the Drazin inverse, are shown to be useful in the exact and approximate solution of MDPs. The author goes on to describe a broad framework for solving MDPs, generically referred to as representation policy iteration (RPI), where both the basis function representations for approximation of value functions as well as the optimal policy within their linear span are simultaneously learned. Basis functions are constructed by diagonalizing a Laplacian operator or by dilating the reward function or an initial set of bases by powers of the operator. The idea of decomposing an operator by finding its invariant subspaces is shown to be an important principle in constructing low-dimensional representations of MDPs. Theoretical properties of these approaches are discussed, and they are also compared experimentally on a variety of discrete and continuous MDPs. Finally, challenges for further research are briefly outlined. Learning Representation and Control in Markov Decision Processes is a timely exposition of a topic with broad interest within machine learning and beyond.

Mahadaven Learning Representation and Control in Markov Decision Processes jetzt bestellen!

Autoren/Hrsg.


Weitere Infos & Material


1: Introduction 2: Sequential Decision Problems 3: Laplacian Operators and MDPs 4: Approximating Markov Decision Processes 5: Dimensionality Reduction Principles in MDPs 6: Basis Construction: Diagonalization Methods 7: Basis Construction: Dilation Methods 8: Model-Based Representation Policy Iteration 9: Basis Construction in Continuous MDPs 10: Model-Free Representation Policy Iteration 11: Related Work and Future Challenges. References.



Ihre Fragen, Wünsche oder Anmerkungen
Vorname*
Nachname*
Ihre E-Mail-Adresse*
Kundennr.
Ihre Nachricht*
Lediglich mit * gekennzeichnete Felder sind Pflichtfelder.
Wenn Sie die im Kontaktformular eingegebenen Daten durch Klick auf den nachfolgenden Button übersenden, erklären Sie sich damit einverstanden, dass wir Ihr Angaben für die Beantwortung Ihrer Anfrage verwenden. Selbstverständlich werden Ihre Daten vertraulich behandelt und nicht an Dritte weitergegeben. Sie können der Verwendung Ihrer Daten jederzeit widersprechen. Das Datenhandling bei Sack Fachmedien erklären wir Ihnen in unserer Datenschutzerklärung.