Publication | Open Access
Using dependency parsing and probabilistic inference to extract relationships between genes, proteins and malignancies implicit among multiple biomedical research abstracts
27
Citations
25
References
2006
Year
Unknown Venue
EngineeringKnowledge ExtractionPathologyMultiomicsSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingComputational LinguisticsBiostatisticsBiomedical Text MiningMolecular DiagnosticsBiomedical OntologyTranslational BioinformaticsBiological DatabaseKnowledge DiscoveryOmicsProbabilistic InferencePathway AnalysisSemantic RelationshipsFunctional GenomicsBioinformaticsDependency ParsingRelationship ExtractionComputational BiologySimple Semantic RelationshipsPrototype Software SystemSystems BiologyMedicine
We describe BioLiterate, a prototype software system which infers relationships involving relationships between genes, proteins and malignancies from research abstracts, and has initially been tested in the domain of the molecular genetics of oncology. The architecture uses a natural language processing module to extract entities, dependencies and simple semantic relationships from texts, and then feeds these features into a probabilistic reasoning module which combines the semantic relationships extracted by the NLP module to form new semantic relationships. One application of this system is the discovery of relationships that are not contained in any individual abstract but are implicit in the combined knowledge contained in two or more abstracts.
| Year | Citations | |
|---|---|---|
Page 1
Page 1