Publication | Closed Access
Corpus-based identification and refinement of semantic classes.
26
Citations
12
References
1997
Year
EngineeringLexical SemanticsSemanticsSnomed NomenclatureCorpus LinguisticsLanguage ProcessingText MiningNatural Language ProcessingApplied LinguisticsSemantic CategorizationComputational LinguisticsCorpus AnalysisLanguage StudiesBiomedical Text MiningLexiconMachine TranslationSemantic ClassesComputational LexicologyNlp TaskMedical Language ProcessingDistributional SemanticsLexical ResourceLinguistics
Medical Language Processing (MLP), especially in specific domains, requires fine-grained semantic lexica. We examine whether robust natural language processing tools used on a representative corpus of a domain help in building and refining a semantic categorization. We test this hypothesis with ZELLIG, a corpus analysis tool. The first clusters we obtain are consistent with a model of the domain, as found in the SNOMED nomenclature. They correspond to coarse-grained semantic categories, but isolate as well lexical idiosyncrasies belonging to the clinical sub-language. Moreover, they help categorize additional words.
| Year | Citations | |
|---|---|---|
Page 1
Page 1