Publication | Closed Access
TaxoFinder: A Graph-Based Approach for Taxonomy Learning
27
Citations
33
References
2015
Year
EngineeringKnowledge ExtractionLearned TaxonomySemantic WebCorpus LinguisticsLanguage ProcessingText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsCorpus AnalysisLanguage StudiesKnowledge RepresentationGraph Analytic AlgorithmSemantic LearningKnowledge DiscoveryTaxonomy LearningTerminology ExtractionSemantic NetworkDomain Knowledge ModelingLinguisticsSemantic Similarity
Taxonomy learning is an important task for knowledge acquisition, sharing, and classification as well as application development and utilization in various domains. To reduce human effort to build a taxonomy from scratch and improve the quality of the learned taxonomy, we propose a new taxonomy learning approach, named <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">TaxoFinder</i> . TaxoFinder takes three steps to automatically build a taxonomy. First, it identifies domain-specific concepts from a domain text corpus. Second, it builds a graph representing how such concepts are associated together based on their co-occurrences. As the key method in TaxoFinder, we propose a method for measuring associative strengths among the concepts, which quantify how strongly they are associated in the graph, using similarities between sentences and spatial distances between sentences. Lastly, TaxoFinder induces a taxonomy from the graph using a graph analytic algorithm. TaxoFinder aims to build a taxonomy in such a way that it maximizes the overall associative strengths among the concepts in the graph to build a taxonomy. We evaluate TaxoFinder using gold-standard evaluation on three different domains: <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">emergency management for mass gatherings</i> , <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">autism research</i> , and <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">disease</i> domains. In our evaluation, we compare TaxoFinder with a state-of-the-art subsumption method and show that TaxoFinder is an effective approach significantly outperforming the subsumption method.
| Year | Citations | |
|---|---|---|
Page 1
Page 1