Publication | Closed Access
A Generalized Bandit Problem
Citations: 36
References: 4
Year: 1980
Mathematical Programming, Engineering, Game Theory, Arm Depend, Generalized Bandit Problem, Operations Research, Stochastic Game, Combinatorial Optimization, Decision Theory, Mechanism Design, Dynamic Allocation Index, Online Algorithm, Strategy, Computer Science, Sequential Decision Making, Exploration V Exploitation, Multi-armed Bandit Problem, Business, Algorithmic Game Theory
Summary: A multi-armed bandit problem is investigated in which the reward obtained from a pull of any arm depends on the states of the other arms as well as on the state of the arm pulled. A Dynamic Allocation Index is defined for this class of problems, and it is shown that this index leads to optimal policies.