Concepedia

Publication | Open Access

Temporal Difference Learning and TD-Gammon

764

Citations

6

References

1995

Year

Abstract

We provide an abstract, selectively u ing the author's formulations: "The article presents a game-learning program called TD-GAMMON. TD-GAMMON is a neural network that trains itself to be an evaluation function for the game of backgammon by playing against itself and learning from the outcome. It was not developed to surpass all previous computer programs in backgammon; rather, its purpose was to explore some new ideas and approaches to traditional problems in reinforcement learning.

References

YearCitations

Page 1