Publication | Open Access
Assessing The Factual Accuracy of Generated Text
149
Citations
43
References
2019
Year
Unknown Venue
EngineeringEntity SummarizationGenerated TextBilingual Evaluation UnderstudyCorpus LinguisticsText MiningAutomatic SummarizationNatural Language ProcessingInformation RetrievalData ScienceText SummarizationComputational LinguisticsLanguage StudiesContent AnalysisMachine TranslationNlp TaskFactual AccuracyInformation ExtractionFact CheckingRetrieval Augmented GenerationText ProcessingLinguisticsLanguage Generation
We propose a model-based metric to estimate the factual accuracy of generated text that is complementary to typical scoring schemes like ROUGE (Recall-Oriented Understudy for Gisting Evaluation) and BLEU (Bilingual Evaluation Understudy). We introduce and release a new large-scale dataset based on Wikipedia and Wikidata to train relation classifiers and end-to-end fact extraction models. The end-to-end models are shown to be able to extract complete sets of facts from datasets with full pages of text. We then analyse multiple models that estimate factual accuracy on a Wikipedia text summarization task, and show their efficacy compared to ROUGE and other model-free variants by conducting a human evaluation study.
| Year | Citations | |
|---|---|---|
Page 1
Page 1