Publication | Closed Access
Multi-Armed Bandits and the Gittins Index
526
Citations
9
References
1980
Year
EngineeringGame TheoryOperations ResearchDynamic Programming MethodsStatisticsStochastic DynamicQuantitative ManagementGittins IndexSequential Decision MakingProbability TheoryGamesExploration V ExploitationIndex RuleContextual BanditReward MBusinessGame-theoretic ProbabilityDecision ScienceAlgorithmic Game Theory
Summary A plausible conjecture (C) has the implication that a relationship (12) holds between the maximal expected rewards for a multi-project process and for a one-project process (F and φi respectively), if the option of retirement with reward M is available. The validity of this relation and optimality of Gittins' index rule are verified simultaneously by dynamic programming methods. These results are partially extended to the case of so-called “bandit superprocesses”.
| Year | Citations | |
|---|---|---|
Page 1
Page 1