Publication | Open Access
Curriculum Learning for Natural Answer Generation
70
Citations
21
References
2018
Year
Artificial IntelligenceUneven QualityLlm Fine-tuningEngineeringMachine LearningCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceNatural AnswersComputational LinguisticsLanguage StudiesMachine TranslationNatural LanguageQuestion AnsweringNlp TaskKnowledge DiscoveryNatural Answer GenerationRetrieval Augmented GenerationCurriculum LearningLinguisticsLanguage Generation
By reason of being able to obtain natural language responses, natural answers are more favored in real-world Question Answering (QA) systems. Generative models learn to automatically generate natural answers from large-scale question answer pairs (QA-pairs). However, they are suffering from the uncontrollable and uneven quality of QA-pairs crawled from the Internet. To address this problem, we propose a curriculum learning based framework for natural answer generation (CL-NAG), which is able to take full advantage of the valuable learning data from a noisy and uneven-quality corpus. Specifically, we employ two practical measures to automatically measure the quality (complexity) of QA-pairs. Based on the measurements, CL-NAG firstly utilizes simple and low-quality QA-pairs to learn a basic model, and then gradually learns to produce better answers with richer contents and more complete syntaxes based on more complex and higher-quality QA-pairs. In this way, all valuable information in the noisy and uneven-quality corpus could be fully exploited. Experiments demonstrate that CL-NAG outperforms the state-of-the-arts, which increases 6.8% and 8.7% in the accuracy for simple and complex questions, respectively.
| Year | Citations | |
|---|---|---|
Page 1
Page 1