Publication | Closed Access
DiscoveryLink: A system for integrated access to life sciences data sources
Citations: 270
References: 27
Year: 2001
Engineering, Vast Amounts, Science Gateway, Semantic Web, Bioinformatics Database, Information Retrieval, Data Science, Wrapper Architecture, Database System, Database Support, Management, Data Integration, Data Retrieval, Data Management, Biological Data, Data Modeling, Biological Database, Discoverylink Offering, Knowledge Discovery, Omics, Database Technology, Bioinformatics, Query Optimization, Biomedical Data Integration, Computational Biology, Systems Biology, Integrated Access, Big Data
Life sciences data is spread across many specialized sources, and answering a question often requires combining data from several of them. Database middleware systems address this by extracting data from multiple sources in response to a single query; IBM's DiscoveryLink is one such system, targeted at life sciences applications. It uses a wrapper architecture and a query optimizer to present a virtual database, so users can pose arbitrarily complex queries even when no single source can answer them on its own.
Vast amounts of life sciences data reside today in specialized data sources, each with specialized query processing capabilities. Data from one source often must be combined with data from other sources to give users the information they desire. There are database middleware systems that extract data from multiple sources in response to a single query. IBM's DiscoveryLink is one such system, targeted to applications from the life sciences industry. DiscoveryLink provides users with a virtual database to which they can pose arbitrarily complex queries, even though the actual data needed to answer the query may originate from several different sources, none of which, by itself, is capable of answering the query. We describe the DiscoveryLink offering, focusing on two key elements, the wrapper architecture and the query optimizer, and illustrate how it can be used to integrate access to life sciences data from heterogeneous data sources.
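The idea of a virtual database layered over wrappers can be illustrated with a minimal sketch. This is not DiscoveryLink's actual API or optimizer: the `Wrapper` class, the `federated_join` function, the source names, and all sample rows are hypothetical, chosen only to show how a middleware layer might answer a query that neither source can answer alone.

```python
from typing import Dict, List, Optional

Row = Dict[str, object]


class Wrapper:
    """Hypothetical wrapper: exposes one data source as filterable rows."""

    def __init__(self, name: str, rows: List[Row]) -> None:
        self.name = name
        self._rows = list(rows)

    def scan(self, **equals: object) -> List[Row]:
        # Push simple equality filters down to the source.
        return [
            r for r in self._rows
            if all(r.get(k) == v for k, v in equals.items())
        ]


def federated_join(
    left: Wrapper,
    right: Wrapper,
    key: str,
    left_filter: Optional[Row] = None,
    right_filter: Optional[Row] = None,
) -> List[Row]:
    """Join rows from two wrapped sources in the middleware (hash join)."""
    left_rows = left.scan(**(left_filter or {}))
    right_rows = right.scan(**(right_filter or {}))
    # Build a hash index on the left side, then probe it with the right side.
    index: Dict[object, List[Row]] = {}
    for row in left_rows:
        index.setdefault(row[key], []).append(row)
    return [{**l, **r} for r in right_rows for l in index.get(r[key], [])]


# Illustrative, made-up data: two sources that hold different facts.
proteins = Wrapper("protein_db", [
    {"protein_id": "P1", "family": "kinase"},
    {"protein_id": "P2", "family": "protease"},
])
assays = Wrapper("assay_db", [
    {"protein_id": "P1", "compound": "C7", "ic50_nm": 12.0},
    {"protein_id": "P2", "compound": "C9", "ic50_nm": 300.0},
    {"protein_id": "P1", "compound": "C3", "ic50_nm": 850.0},
])

# "Which compounds were assayed against kinases?" needs both sources.
result = federated_join(proteins, assays, "protein_id",
                        left_filter={"family": "kinase"})
```

In this sketch the filter is pushed down to the source and the join runs in the middleware. A real query optimizer would weigh alternatives like these by estimated cost; the sketch hard-codes one plan for brevity.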