Publication | Closed Access
Incentivizing exploration
106
Citations
22
References
2014
Year
Unknown Venue
Posterior RewardReward HackingBayesian Multi-armed BanditBehavioral Decision MakingIncentive MechanismGame TheoryManagementExperimental EconomicsBusinessAlgorithmic Mechanism DesignInformation DiscoveryDecision ScienceDecision TheoryMechanism DesignExploration V ExploitationIncentive ModelBehavioral Economics
We study a Bayesian multi-armed bandit (MAB) setting in which a principal seeks to maximize the sum of expected time-discounted rewards obtained by pulling arms, when the arms are actually pulled by selfish and myopic individuals. Since such individuals pull the arm with highest expected posterior reward (i.e., they always exploit and never explore), the principal must incentivize them to explore by offering suitable payments. Among others, this setting models crowdsourced information discovery and funding agencies incentivizing scientists to perform high-risk, high-reward research.
| Year | Citations | |
|---|---|---|
Page 1
Page 1