Publication | Open Access
Learning Structured Representation for Text Classification via Reinforcement Learning
155
Citations
31
References
2018
Year
Structured PredictionEngineeringMachine LearningLanguage ProcessingText MiningRepresentation LearningNatural Language ProcessingData ScienceComputational LinguisticsDocument ClassificationLanguage StudiesNatural LanguageSequence ModellingNlp TaskKnowledge DiscoveryComputer ScienceStructure DiscoveryStructured DocumentLinguistics
Representation learning is a fundamental problem in natural language processing. This paper studies how to learn a structured representation for text classification. Unlike most existing representation models that either use no structure or rely on pre-specified structures, we propose a reinforcement learning (RL) method to learn sentence representation by discovering optimized structures automatically. We demonstrate two attempts to build structured representation: Information Distilled LSTM (ID-LSTM) and Hierarchically Structured LSTM (HS-LSTM). ID-LSTM selects only important, task-relevant words, and HS-LSTM discovers phrase structures in a sentence. Structure discovery in the two representation models is formulated as a sequential decision problem: current decision of structure discovery affects following decisions, which can be addressed by policy gradient RL. Results show that our method can learn task-friendly representations by identifying important words or task-relevant structures without explicit structure annotations, and thus yields competitive performance.
| Year | Citations | |
|---|---|---|
Page 1
Page 1