Buch, Englisch, 352 Seiten, Format (B × H): 178 mm x 254 mm
From Classical Algorithms to LLM-Driven AI
Buch, Englisch, 352 Seiten, Format (B × H): 178 mm x 254 mm
ISBN: 978-1-041-13141-0
Verlag: Taylor & Francis Ltd
Reinforcement learning (RL) has become the engine behind some of the most significant advances in modern artificial intelligence, from defeating world champions in Go to aligning large language models with human preferences. Yet despite its central role, RL remains poorly understood by many practitioners who work with these systems daily. Reinforcement Learning in Action: From Foundations to Frontiers bridges the gap between classical RL theory and the cutting-edge techniques driving today’s AI breakthroughs. The book traces a complete path from Markov Decision Processes and Bellman equations through deep RL methods (DQN, REINFORCE, Actor-Critic, PPO) to the modern landscape of LLM alignment (RLHF, DPO, SimPO, KTO), reasoning optimization (GRPO, VinePPO, MCTS), and agentic systems with tool use, memory, and multi-turn planning. A distinguishing feature is the book’s consistent five-layer pedagogical structure: each algorithm is presented with its key characteristics, a full mathematical derivation, an honest assessment of its advantages and limitations, a complete from-scratch Python/PyTorch implementation in which variable names match the equations, and a hands-on case study with reproducible experiments. Case studies progress from Grid World navigation and CartPole control to fine-tuning language models with DPO on the HuggingFace ecosystem, training reasoning models with GRPO on mathematical benchmarks, and building a full agentic customer support system. Written for ML engineers, researchers, and advanced students, this book provides both the conceptual depth and implementation fluency needed to understand, build, and extend the RL systems shaping the future of AI.
Zielgruppe
Academic, Postgraduate, and Professional Practice & Development
Autoren/Hrsg.
Fachgebiete
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Neuronale Netzwerke
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Computer Vision
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Automatische Datenerfassung, Datenanalyse
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Mustererkennung, Biometrik
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Maschinelles Lernen
Weitere Infos & Material
List of Figures. List of Tables. Foreword. Preface. Author Bios. Contributors. Notation. Chapter 1: Introduction to Reinforcement Learning. Section I: Basics of Reinforcement Learning. Chapter 2: Fundamentals of Reinforcement Learning. Section II: Classical Reinforcement Learning. Chapter 3: Classical Reinforcement Learning Algorithms. Section III: Deep Reinforcement Learning. Chapter 4: Scaling Reinforcement Learning: Function Approximation and Deep Methods. Section IV: LLMs and Reinforcement Learning. Chapter 5: Preference-Based Alignment: Reward Modeling and Reinforcement Learning for LLMs. Chapter 6: Reinforcement Learning for Reasoning Models. Section V: Agentic and Reinforcement Learning. Chapter 7: Reinforcement Learning Enabled Agentic AI. Appendix A: Mathematical Proofs and Derivations. Bibliography. Index.




