Publication | Open Access
On the Gittins Index for Multiarmed Bandits
271
Citations
0
References
1992
Year
Mathematical ProgrammingEngineeringGame TheoryMarket DesignOperations ResearchCombinatorial OptimizationDecision TheoryMechanism DesignStatisticsPortfolio OptimizationGittins Index PolicyGittins IndexStrategyProbability TheorySequential Decision MakingExploration V ExploitationMultiarmed Bandit ProblemOptimization ProblemBusinessGame-theoretic ProbabilityAvailable ProjectsAlgorithmic Game Theory
This paper considers the multiarmed bandit problem and presents a new proof of the optimality of the Gittins index policy. The proof is intuitive and does not require an interchange argument. The insight it affords is used to give a streamlined summary of previous research and to prove a new result: The optimal value function is a submodular set function of the available projects.