Publication | Closed Access
Improving Generalization for Temporal Difference Learning: The Successor Representation
729
Citations
15
References
1993
Year
Artificial IntelligenceMathematical ProgrammingMarkov Decision ProcessEngineeringMachine LearningData ScienceAutomated ReasoningSequential LearningPredictive AnalyticsTemporal Difference LearningTemporal Pattern RecognitionSequential Decision MakingComputer ScienceRobot LearningAppropriate GeneralizationTd MachineryNonlinear Time SeriesTemporal Difference
Estimation of returns over time, the focus of temporal difference (TD) algorithms, imposes particular constraints on good function approximators or representations. Appropriate generalization between states is determined by how similar their successors are, and representations should follow suit. This paper shows how TD machinery can be used to learn such representations, and illustrates, using a navigation task, the appropriately distributed nature of the result.
| Year | Citations | |
|---|---|---|
Page 1
Page 1