Publication | Open Access
Quantifying exploration in reward-based motor learning
46
Citations
35
References
2020
Year
Motor LearningMotor SkillCognitionMotor ControlReward-based Motor LearningSensorimotor NoiseSocial SciencesKinesiologyRobot LearningCognitive NeuroscienceMotor BehaviorHealth SciencesCognitive ScienceSensorimotor IntegrationExperimental PsychologyPerception-action LoopExploration V ExploitationSensorimotor TransformationIncreased Variability
Exploration in reward-based motor learning is observable in experimental data as increased variability. In order to quantify exploration, we compare three methods for estimating other sources of variability: sensorimotor noise. We use a task in which participants could receive stochastic binary reward feedback following a target-directed weight shift. Participants first performed six baseline blocks without feedback, and next twenty blocks alternating with and without feedback. Variability was assessed based on trial-to-trial changes in movement endpoint. We estimated sensorimotor noise by the median squared trial-to-trial change in movement endpoint for trials in which no exploration is expected. We identified three types of such trials: trials in baseline blocks, trials in the blocks without feedback, and rewarded trials in the blocks with feedback. We estimated exploration by the median squared trial-to-trial change following non-rewarded trials minus sensorimotor noise. As expected, variability was larger following non-rewarded trials than following rewarded trials. This indicates that our reward-based weight-shifting task successfully induced exploration. Most importantly, our three estimates of sensorimotor noise differed: the estimate based on rewarded trials was significantly lower than the estimates based on the two types of trials without feedback. Consequently, the estimates of exploration also differed. We conclude that the quantification of exploration depends critically on the type of trials used to estimate sensorimotor noise. We recommend the use of variability following rewarded trials.
| Year | Citations | |
|---|---|---|
Page 1
Page 1