Publication | Closed Access
The knowledge gradient algorithm for online subset selection
Citations: 31
References: 14
Year: 2009
Keywords: Artificial Intelligence, Engineering, Machine Learning, Knowledge Gradient Algorithm, Game Theory, Algorithmic Learning, Other Subsets, Optimization-based Data Mining, Information Retrieval, Data Science, Data Mining, Online Problem, Management, Subset Selection Problem, Combinatorial Optimization, Decision Theory, Online Algorithm, Knowledge Discovery, Sequential Decision Making, Computer Science, Decision Rule, Exploration V Exploitation, Decision Science
We derive a one-period look-ahead policy for online subset selection problems, where learning about one subset also gives us information about other subsets. The subset selection problem is treated as a multi-armed bandit problem with correlated prior beliefs. We show that our decision rule is easily computable, and present experimental evidence that the policy is competitive against other online learning policies.
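The abstract does not state the decision rule itself, so the following is only a minimal sketch of how a one-period look-ahead rule with correlated normal beliefs might be computed. It rests on several assumptions that are not taken from the paper: a subset's value is the sum of independent item values (so two subsets are correlated through the items they share), the measurement noise variance `noise_var` is known, the online score has the undiscounted form `mu[x] + remaining * kg_factor`, and the expectation is approximated by a trapezoid rule rather than computed exactly. All function and variable names are hypothetical.

```python
import math
from itertools import combinations


def expected_max_linear(a, b, half_width=8.0, steps=4000):
    """Approximate E[max_i (a[i] + b[i] * Z)], Z ~ N(0, 1), by the trapezoid rule."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        z = -half_width + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        density = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
        total += weight * density * max(ai + bi * z for ai, bi in zip(a, b))
    return total * h


def kg_factor(mu, cov, noise_var, x):
    """Expected one-period gain in max_i mu[i] from measuring alternative x."""
    scale = math.sqrt(noise_var + cov[x][x])
    # Change in each posterior mean per unit of standardized surprise at x.
    sigma_tilde = [cov[i][x] / scale for i in range(len(mu))]
    return expected_max_linear(mu, sigma_tilde) - max(mu)


def online_kg_choice(mu, cov, noise_var, remaining):
    """Score = immediate reward + remaining measurements * value of information (assumed form)."""
    scores = [mu[x] + remaining * kg_factor(mu, cov, noise_var, x)
              for x in range(len(mu))]
    best = max(range(len(mu)), key=scores.__getitem__)
    return best, scores


def update_beliefs(mu, cov, noise_var, x, y):
    """Multivariate normal Bayesian update after observing reward y from alternative x."""
    n = len(mu)
    denom = noise_var + cov[x][x]
    col = [cov[i][x] for i in range(n)]
    new_mu = [mu[i] + (y - mu[x]) / denom * col[i] for i in range(n)]
    new_cov = [[cov[i][j] - col[i] * col[j] / denom for j in range(n)]
               for i in range(n)]
    return new_mu, new_cov


# Illustrative data (made up): 3 items, subsets of size 2, additive subset values.
item_mean = [1.0, 1.2, 0.8]
item_var = [1.0, 0.5, 2.0]
subsets = list(combinations(range(3), 2))
mu = [sum(item_mean[i] for i in s) for s in subsets]
# Covariance between two subsets = total prior variance of the items they share.
cov = [[sum(item_var[i] for i in set(s) & set(t)) for t in subsets]
       for s in subsets]

choice, scores = online_kg_choice(mu, cov, noise_var=1.0, remaining=10)
print("measure subset", subsets[choice])
```

Because the covariance matrix has nonzero off-diagonal entries, a single observation passed to `update_beliefs` shifts the means of every subset that shares an item with the measured one, which is the sense in which learning about one subset informs others. With `remaining=0` the rule reduces to pure exploitation of the current means.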