Publication | Open Access
Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment
17
Citations
25
References
2016
Year
Unknown Venue
EngineeringKnowledge ExtractionIntelligent Information RetrievalOpen-domain Question AnsweringSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingAre Linked DatasetsInformation RetrievalData ScienceComputational LinguisticsData IntegrationLanguage StudiesQuality AssessmentLinked DataContent AnalysisMachine TranslationBenchmark DatasetsQuestion AnsweringEntity DisambiguationKnowledge DiscoveryComputer ScienceQuality MetricsRetrieval Augmented GenerationData Quality Metrics
The current decade is a witness to an enormous explosion of data being published on the Web as Linked Data to maximise its reusability. Answering questions that users speak or write in natural language is an increasingly popular application scenario for Web Data, especially when the domain of the questions is not limited to a domain where dedicated curated datasets exist, like in medicine. The increasing use of Web Data in this and other settings has highlighted the importance of assessing its quality. While quite some work has been done with regard to assessing the quality of Linked Data, only few efforts have been dedicated to quality assessment of linked data from the question answering domain's perspective. From the linked data quality metrics that have so far been well documented in the literature, we have identified those that are most relevant for QA. We apply these quality metrics, implemented in the Luzzu framework, to subsets of two datasets of crucial importance to open domain QA -- DBpedia and Wikidata -- and thus present the first assessment of the quality of these datasets for QA. From these datasets, we assess slices covering the specific domains of restaurants, politicians, films and soccer players. The results of our experiments suggest that for most of these domains, the quality of Wikidata with regard to the majority of relevant metrics is higher than that of DBpedia.
| Year | Citations | |
|---|---|---|
Page 1
Page 1