Publication | Open Access
Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.
2K
Citations
27
References
2001
Year
EngineeringKnowledge ExtractionMetathesaurus ConceptsEffective MappingSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningBiostatisticsPublic HealthBiomedical Text MiningMedical OntologyBiomedical OntologyBiomedical LiteratureKnowledge DiscoveryTerminology ExtractionUmls MetathesaurusBiomedical Data IntegrationMedicineHealth InformaticsMetamap Program
The UMLS Metathesaurus is the largest biomedical thesaurus, offering concepts classified by semantic type and rich hierarchical and non‑hierarchical relationships that support decision support, patient record management, information retrieval, and data mining. The paper introduces MetaMap, a program developed at the National Library of Medicine to map biomedical text to the UMLS Metathesaurus and identify referenced concepts. MetaMap employs a knowledge‑intensive, symbolic natural‑language‑processing approach that leverages computational linguistic techniques. MetaMap is applied to information retrieval and data mining and underpins NLM’s Indexing Initiative System, which performs semi‑automatic and fully automatic indexing of biomedical literature.
The UMLS Metathesaurus, the largest thesaurus in the biomedical domain, provides a representation of biomedical knowledge consisting of concepts classified by semantic type and both hierarchical and non-hierarchical relationships among the concepts. This knowledge has proved useful for many applications including decision support systems, management of patient records, information retrieval (IR) and data mining. Gaining effective access to the knowledge is critical to the success of these applications. This paper describes MetaMap, a program developed at the National Library of Medicine (NLM) to map biomedical text to the Metathesaurus or, equivalently, to discover Metathesaurus concepts referred to in text. MetaMap uses a knowledge intensive approach based on symbolic, natural language processing (NLP) and computational linguistic techniques. Besides being applied for both IR and data mining applications, MetaMap is one of the foundations of NLM's Indexing Initiative System which is being applied to both semi-automatic and fully automatic indexing of the biomedical literature at the library.
| Year | Citations | |
|---|---|---|
Page 1
Page 1