Concepedia

Publication | Closed Access

An investigation of guarding a territory problem in a grid world

14

Citations

8

References

2010

Year

Abstract

A game of guarding a territory in a grid world is proposed in this paper. A defender tries to intercept an invader before he reaches the territory. Two reinforcement learning algorithms are applied to make two players learn their optimal policies simultaneously. Minimax-Q learning algorithm and Win-or-Learn-Fast Policy Hill-Climbing learning algorithm are introduced and compared. Simulation results of two reinforcement learning algorithms are analyzed.

References

YearCitations

Page 1