Publication | Closed Access
Decentralized learning in finite Markov chains
Citations: 101
References: 24
Year: 1986
Decentralized Machine Learning, Engineering, Stochastic Game, Game Theory, Business, Decentralized Decision Maker, Finite Markov Chains, Probability Theory, Computer Science, Distributed Learning, Games, Multi-agent Learning, Sequential Decision Making, Mechanism Design, Markov Decision Process, Decentralised System, Ergodic Markov Chains
The principal contribution of this paper is a new result on the decentralized control of finite Markov chains with unknown transition probabilities and rewards. One decentralized decision maker is associated with each state in which two or more actions (decisions) are available. Each decision maker uses a simple learning scheme, requiring minimal information, to update its action choice. It is shown that, if updating is done in sufficiently small steps, the group will converge to the policy that maximizes the long-term expected reward per step. The analysis is based on learning in sequential stochastic games and on certain properties, derived in this paper, of ergodic Markov chains. A new result on convergence in identical payoff games with a unique equilibrium point is also presented.
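The setup described in the abstract can be illustrated with a minimal sketch. The abstract does not name the learning scheme, so the code below assumes a linear reward-inaction style update and assumes that each decision maker's feedback is the average reward per step accrued since its state was last visited. The two-state chain, the reward values, and the names `lri_update`, `simulate`, `TRANSITIONS`, and `REWARDS` are hypothetical and chosen only for illustration; this is not presented as the paper's algorithm.

```python
import random


def lri_update(probs, chosen, beta, step):
    """Assumed linear reward-inaction update.

    Moves probability mass toward `chosen` in proportion to the feedback
    `beta` in [0, 1]; with beta == 0 the probabilities are left unchanged.
    """
    return [
        p + step * beta * (1.0 - p) if i == chosen else p - step * beta * p
        for i, p in enumerate(probs)
    ]


def simulate(transitions, rewards, steps=20000, step=0.01, seed=0):
    """Run one decision maker per state on a finite Markov chain.

    transitions[s][a] is the next-state distribution, rewards[s][a] is a
    reward in [0, 1]. Both are unknown to the decision makers, which only
    see the feedback computed when their own state is revisited.
    """
    rng = random.Random(seed)
    n_states = len(transitions)
    # Each decision maker starts with a uniform action distribution.
    probs = [[1.0 / len(transitions[s])] * len(transitions[s])
             for s in range(n_states)]
    pending = {}  # state -> (action taken, time, cumulative reward so far)
    state, total = 0, 0.0
    for t in range(steps):
        if state in pending:
            action, t0, r0 = pending[state]
            # Assumed feedback: average reward per step since the last visit.
            beta = (total - r0) / (t - t0)
            probs[state] = lri_update(probs[state], action, beta, step)
        action = rng.choices(range(len(probs[state])), weights=probs[state])[0]
        pending[state] = (action, t, total)
        total += rewards[state][action]
        state = rng.choices(range(n_states),
                            weights=transitions[state][action])[0]
    return probs


# Hypothetical two-state chain: action 1 tends to lead to the richer state 1.
TRANSITIONS = [
    [[0.9, 0.1], [0.1, 0.9]],
    [[0.9, 0.1], [0.1, 0.9]],
]
REWARDS = [[0.2, 0.2], [0.8, 0.8]]

if __name__ == "__main__":
    for s, row in enumerate(simulate(TRANSITIONS, REWARDS)):
        print(f"state {s}: action probabilities {row}")
```

The `step` parameter plays the role of the "sufficiently small steps" mentioned in the abstract: smaller values make each update more cautious at the cost of slower learning. Each decision maker uses only its own action and a scalar feedback signal, which is one way to read "requiring minimal information".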