Publication | Closed Access
Multi-armed recommendation bandits for selecting state machine policies for robotic systems
18
Citations
17
References
2013
Year
Unknown Venue
Artificial IntelligenceRobotic SystemsEngineeringMachine LearningIntelligent RoboticsState Machine PoliciesIntelligent SystemsState Space SearchData ScienceRobot LearningDriving State MachineSequential Decision MakingComputer ScienceCold-start ProblemMarkov Decision ProcessExploration V ExploitationMulti-armed Recommendation BanditsAutomationRoboticsCollaborative FilteringState Machine
We investigate the problem of selecting a state-machine from a library to control a robot. We are particularly interested in this problem when evaluating such state machines on a particular robotics task is expensive. As a motivating example, we consider a problem where a simulated vacuuming robot must select a driving state machine well-suited for a particular (unknown) room layout. By borrowing concepts from collaborative filtering (recommender systems such as Netflix and Amazon.com), we present a multi-armed bandit formulation that incorporates recommendation techniques to efficiently select state machines for individual room layouts. We show that this formulation outperforms the individual approaches (recommendation, multi-armed bandits) as well as the baseline of selecting the `average best' state machine across all rooms.
| Year | Citations | |
|---|---|---|
Page 1
Page 1