Publication | Open Access
Multiword expression filtering for building knowledge maps
13
Citations
4
References
2004
Year
Unknown Venue
Multiword ExpressionEngineeringKnowledge ExtractionSemantic WebSemanticsCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsLanguage StudiesMachine TranslationMultiword Expression QualityComputational LexicologyKnowledge DiscoveryTerminology ExtractionKeyword SearchKnowledge BaseMultiword ExpressionsLexical ResourceKeyword ExtractionLinguistics
This paper describes an algorithm that can be used to improve the quality of multiword expressions extracted from documents. We measure multiword expression quality by the "usefulness" of a multiword expression in helping ontologists build knowledge maps that allow users to search a large document corpus. Our stopword based algorithm takes n-grams extracted from documents, and cleans them up to make them more suitable for building knowledge maps. Running our algorithm on large corpora of documents has shown that it helps to increase the percentage of useful terms from 40% to 70% --- with an eight-fold improvement observed in some cases.
| Year | Citations | |
|---|---|---|
Page 1
Page 1