Reinforcement Learning in Action

From Foundations to Frontier AI
1. Auflage 2026
ISBN: 978-1-041-13141-0
Verlag: Taylor & Francis Ltd

Buch, Englisch, 352 Seiten, Format (B × H): 178 mm x 254 mm

Reinforcement Learning in Action
Erscheinungsjahr 2026, 978-1-040-58933-5, eBook, PDF, 0 - No protection
Reinforcement Learning in Action
Erscheinungsjahr 2026, 978-1-040-61751-9, eBook, EPUB, 0 - No protection

Reinforcement Learning in Action
1. Auflage 2026, 978-1-041-13081-9, Buch

From Foundations to Frontier AI

Buch, Englisch, 352 Seiten, Format (B × H): 178 mm x 254 mm

ISBN: 978-1-041-13141-0
Verlag: Taylor & Francis Ltd

148,50 €

(inkl. MwSt.)

versandkostenfreie Lieferung
vorbestellbar, Erscheinungstermin ca. Oktober 2026

Bücher versandkostenfrei

kostenlose Rücksendung

Reinforcement learning (RL) has become the engine behind some of the most significant advances in modern artificial intelligence, from defeating world champions in Go to aligning large language models with human preferences. Yet despite its central role, RL remains poorly understood by many practitioners who work with these systems daily. Reinforcement Learning in Action: From Foundations to Frontiers bridges the gap between classical RL theory and the cutting-edge techniques driving today’s AI breakthroughs. The book traces a complete path from Markov Decision Processes and Bellman equations through deep RL methods (DQN, REINFORCE, Actor-Critic, PPO) to the modern landscape of LLM alignment (RLHF, DPO, SimPO, KTO), reasoning optimization (GRPO, VinePPO, MCTS), and agentic systems with tool use, memory, and multi-turn planning. A distinguishing feature is the book’s consistent five-layer pedagogical structure: each algorithm is presented with its key characteristics, a full mathematical derivation, an honest assessment of its advantages and limitations, a complete from-scratch Python/PyTorch implementation in which variable names match the equations, and a hands-on case study with reproducible experiments. Case studies progress from Grid World navigation and CartPole control to fine-tuning language models with DPO on the HuggingFace ecosystem, training reasoning models with GRPO on mathematical benchmarks, and building a full agentic customer support system. Written for ML engineers, researchers, and advanced students, this book provides both the conceptual depth and implementation fluency needed to understand, build, and extend the RL systems shaping the future of AI.

Kamath / Vajre Reinforcement Learning in Action jetzt bestellen!

Zielgruppe

Academic, Postgraduate, and Professional Practice & Development

Autoren/Hrsg.

Kamath, Uday

Vajre, Vedant

Fachgebiete

Weitere Infos & Material

Inhaltsverzeichnis

List of Figures. List of Tables. Foreword. Preface. Author Bios. Contributors. Notation. Chapter 1: Introduction to Reinforcement Learning. Section I: Basics of Reinforcement Learning. Chapter 2: Fundamentals of Reinforcement Learning. Section II: Classical Reinforcement Learning. Chapter 3: Classical Reinforcement Learning Algorithms. Section III: Deep Reinforcement Learning. Chapter 4: Scaling Reinforcement Learning: Function Approximation and Deep Methods. Section IV: LLMs and Reinforcement Learning. Chapter 5: Preference-Based Alignment: Reward Modeling and Reinforcement Learning for LLMs. Chapter 6: Reinforcement Learning for Reasoning Models. Section V: Agentic and Reinforcement Learning. Chapter 7: Reinforcement Learning Enabled Agentic AI. Appendix A: Mathematical Proofs and Derivations. Bibliography. Index.

Über Autor(innen)

Uday Kamath has over 25 years of experience in AI product development with a Ph.D. in scalable machine learning. His significant contributions span numerous journals, conferences, books, and patents. Notable books include Large Language Models: A Deep Dive, Applied Causal Inference, Explainable Artificial Intelligence, Transformers for Machine Learning, Deep Learning for NLP and Speech Recognition, Mastering Java Machine Learning, and Machine Learning: End-to-End Guide for Java Developers. Currently serving as the Chief Analytics Officer at Smarsh, he spearheads data science and research in communications AI for regulated industries. He is also an active member of the Board of Advisors for entities, including commercial companies and academic institutions.

Vedant Vajre is an aspiring AI researcher with a strong interest in reinforcement learning and intelligent decision-making systems. He is graduating from Penn State University and intends to pursue doctoral research in artificial intelligence. He has authored two peer-reviewed research publications with IEEE and continues to pursue active research in machine learning. Having worked at organizations including NASA, IBM, and early-stage startups, he has gained experience applying machine learning and AI in both research and production settings. Outside of research, he loves spending time with his two Shih Tzus (Pinot and Buzz), playing tennis, and solving Sudoku puzzles.

Produktsicherheit

Fragen zum Artikel?

Ihre Fragen, Wünsche oder Anmerkungen

Vorname*

Nachname*

Ihre E-Mail-Adresse*

Kundennr.

Ihre Nachricht*

Lediglich mit * gekennzeichnete Felder sind Pflichtfelder.

Wenn Sie die im Kontaktformular eingegebenen Daten durch Klick auf den nachfolgenden Button übersenden, erklären Sie sich damit einverstanden, dass wir Ihr Angaben für die Beantwortung Ihrer Anfrage verwenden. Selbstverständlich werden Ihre Daten vertraulich behandelt und nicht an Dritte weitergegeben. Sie können der Verwendung Ihrer Daten jederzeit widersprechen. Das Datenhandling bei Sack Fachmedien erklären wir Ihnen in unserer Datenschutzerklärung.

148,50 € (inkl. MwSt.)

vorbestellbar, Erscheinungstermin ca. Oktober 2026

Bücher versandkostenfrei

kostenlose Rücksendung

Webcode: sack.de/8i4a6