Concepedia

Publication | Closed Access

Extending Q-Learning to General Adaptive Multi-Agent Systems

173

Citations

11

References

2003

Year

Gerald Tesauro

Unknown Venue

Abstract

Recent multi-agent extensions of Q-Learning require knowledge of other agents' payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This paper proposes a fundamentally different approach, dubbed "Hyper-Q" Learning, in which values of mixed strategies rather than base actions are learned, and in which other agents' strategies are estimated from observed actions via Bayesian inference. Hyper-Q

References

YearCitations

Page 1