- ホーム
- > 洋書
- > 英文書
- > Computer / General
Full Description
Bridging the gap between introductory texts and the specialized research literature, this is one of the first truly rigorous yet accessible treatments of modern reinforcement learning. Written by three leading researchers with over a decade of teaching experience, the book uniquely combines mathematical precision with practical insights. It progresses naturally from planning (dynamic programming, MDPs, value and policy iteration) to learning (model-based and model-free algorithms, function approximation, policy gradients, and regret minimization). Each concept is developed from first principles with complete proofs, making the material self-contained. The modular chapter organization enables flexible course design. The book's website offers battle-tested exercises refined through years of classroom use. Combining mathematical rigor with practical applications, this definitive text is ideal for advanced undergraduate and graduate students as well as practitioners seeking a deep understanding of sequential decision-making and intelligent agent design.
Contents
1. Introduction and overview; 2. Preface to the planning chapters; 3. Deterministic decision processes; 4. Markov chains; 5. Markov decision processes and finite horizon dynamic programming; 6. Discounted Markov decision processes; 7. Episodic Markov decision processes; 8. Linear programming solutions; 9. Preface to the learning chapters; 10. Reinforcement learning: model based; 11. Reinforcement learning: model free; 12. Large state spaces: value function approximation; 13. Large state space: policy gradient methods; 14. Regret minimization; A. Dynamic programming; B. Ordinary differential equations; References; Index.
-
- 電子書籍
- ぼくは化け物きみは怪物
-
- 電子書籍
- 俺の『全自動支援(フルオートバフ)』で…



