Publication | Closed Access
A conceptual framework for managing very diverse data for complex, interdisciplinary science
127
Citations
46
References
2011
Year
EngineeringData Management IssueData CurationData VisualizationDiverse DataInterdisciplinary ScienceData PublishingData InfrastructureData EcosystemMassive Data VolumesCollaborative Data ManagementData ScienceData ResourcesManagementData IntegrationData Intensive ScienceCollaborative Data ScienceBig DataData ManagementResearch Data ManagementHealth Data ScienceCollaborative Data AnalysisConceptual FrameworkResearch DataData Modeling
Modern data‑intensive science faces challenges of massive volumes and, increasingly, the heterogeneity and distribution of interdisciplinary research data, requiring simpler, more flexible technologies and cultural adaptation. The paper proposes a conceptual framework for managing diverse interdisciplinary data to enable understanding of complex systemic problems, such as environmental change, by envisioning discoverable, open, linked, useful, and safe data collections curated with information‑science principles. Using the International Polar Year 2007–8 as a case study, the authors examine data‑management approaches and suggest a vision of linked, curated data collections that integrate best practices from information and library science. They find that technologies for exa‑scale data volumes differ from those needed for extremely distributed and heterogeneous data, highlighting the need for simpler, more flexible solutions.
Much attention has been given to the challenges of handling massive data volumes in modern data-intensive science. This paper examines an equally daunting challenge – the diversity of interdisciplinary data, notably research data, and the need to interrelate these data to understand complex systemic problems such as environmental change and its impact. We use the experience of the International Polar Year 2007–8 (IPY) as a case study to examine data management approaches seeking to address issues around complex interdisciplinary science. We find that, while technology is a critical factor in addressing the interdisciplinary dimension of the data intensive science, the technologies developing for exa-scale data volumes differ from those that are needed for extremely distributed and heterogeneous data. Research data will continue to be highly heterogeneous and distributed and will require technologies to be much simpler and more flexible. More importantly, there is a need for both technical and cultural adaptation. We describe a vision of discoverable, open, linked, useful, and safe collections of data, organized and curated using the best principles and practices of information and library science. This vision provides a framework for our discussion and leads us to suggest several short- and long-term strategies to facilitate a socio-technical evolution in the overall science data ecosystem.
| Year | Citations | |
|---|---|---|
Page 1
Page 1