Extending Q-Learning to General Adaptive Multi-Agent Systems

Abstract

Recent multi-agent extensions of Q-Learning require knowledge of other agents&apos; payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This paper proposes a fundamentally different approach, dubbed &quot;Hyper-Q&quot; Learning, in which values of mixed strategies rather than base actions are learned, and in which other agents&apos; strategies are estimated from observed actions via Bayesian inference. Hyper-Q

References

Page 1

	Year	Citations

Page 1