Publication | Closed Access
Reinforcement learning for dialog management using least-squares Policy iteration and fast feature selection
57
Citations
12
References
2009
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningSequential LearningSpoken Dialog SystemIntelligent SystemsSpeech RecognitionNatural Language ProcessingData ScienceFast Feature SelectionLeast-squares Policy IterationConversation AnalysisRobot LearningRl OptimizationDialogue ManagementDialog SystemsAction Model LearningConversational Recommender SystemComputer ScienceSequential Decision MakingDialog ManagementBaseline Rl
Reinforcement learning (RL) is a promising technique for creating a dialog manager. RL accepts features of the current dialog state and seeks to find the best action given those features. Although it is often easy to posit a large set of potentially useful features, in practice, it is difficult to find the subset which is large enough to contain useful information yet compact enough to reliably learn a good policy. In this paper, we propose a method for RL optimization which automatically performs feature selection. The algorithm is based on least-squares policy iteration, a state-of-the-art RL algorithm which is highly sampleefficient and can learn from a static corpus or on-line. Experiments in dialog simulation show it is more stable than a baseline RL algorithm taken from a working dialog system.
| Year | Citations | |
|---|---|---|
Page 1
Page 1