Publication | Closed Access
An analysis of reinforcement learning with function approximation
193
Citations
15
References
2008
Year
Unknown Venue
Artificial IntelligenceMathematical ProgrammingEngineeringMachine LearningStochastic OptimizationStochastic GameGame TheoryOptimal Q-functionSequential Decision MakingProbability TheoryFunction ApproximationLearning ControlApproximation TheoryMarkov Decision ProblemsMarkov Decision ProcessConstructive Approximation
We address the problem of computing the optimal Q-function in Markov decision problems with infinite state-space. We analyze the convergence properties of several variations of Q-learning when combined with function approximation, extending the analysis of TD-learning in (Tsitsiklis & Van Roy, 1996a) to stochastic control settings. We identify conditions under which such approximate methods converge with probability 1. We conclude with a brief discussion on the general applicability of our results and compare them with several related works.
| Year | Citations | |
|---|---|---|
Page 1
Page 1