A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning

A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning

Bokus

A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such alg...

810.00 kr

Liknande produkter