Publication | Open Access
Automatic retrieval and clustering of similar words
1.6K
Citations
16
References
1998
Year
Unknown Venue
Automatic RetrievalEngineeringSimilarity MeasureSemanticsSemantic WebCorpus LinguisticsText MiningApplied LinguisticsNatural Language ProcessingWord Similarity MeasureInformation RetrievalData ScienceComputational LinguisticsLanguage StudiesRoget ThesaurusMachine TranslationDocument ClusteringComputational LexicologySimilarity SearchKnowledge DiscoveryTerminology ExtractionDistributional SemanticsLexical ResourceLinguisticsSemantic Similarity
Bootstrapping semantics from text is one of the greatest challenges in natural language learning. We first define a word similarity measure based on the distributional pattern of words. The similarity measure allows us to construct a thesaurus using a parsed corpus. We then present a new evaluation methodology for the automatically constructed thesaurus. The evaluation results show that the thesaurus is significantly closer to WordNet than Roget Thesaurus is.
| Year | Citations | |
|---|---|---|
Page 1
Page 1