A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning
Bokus
A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such alg...
810.00 kr