Publication | Open Access
DCDB Wintermute: Enabling Online and Holistic Operational Data Analytics on HPC Systems
31
Citations
36
References
2020
Year
Unknown Venue
Cluster ComputingEngineeringHpc SystemsService MonitoringHigh Performance ComputingSoftware AnalysisData ScienceSystems EngineeringData IntegrationParallel ComputingData ManagementHigh-performance Data AnalyticsHybrid Hpc WorkloadOperations AnalyticsComputer EngineeringComputer ScienceExascale EraData-intensive ComputingOnline OdaPerformance MonitoringCloud ComputingEnabling OnlineParallel ProgrammingOperational SystemSystem MonitoringIndustrial InformaticsSystem SoftwareBig DataDcdb Wintermute
As we approach the exascale era, the size and complexity of HPC systems continues to increase, raising concerns about their manageability and sustainability. For this reason, more and more HPC centers are experimenting with fine-grained monitoring coupled with Operational Data Analytics (ODA) to optimize efficiency and effectiveness of system operations. However, while monitoring is a common reality in HPC, there is no well-stated and comprehensive list of requirements, nor matching frameworks, to support holistic and online ODA. This leads to insular ad-hoc solutions, each addressing only specific aspects of the problem.
| Year | Citations | |
|---|---|---|
2009 | 2.3K | |
2004 | 1.3K | |
2003 | 777 | |
1998 | 317 | |
2014 | 243 | |
2008 | 200 | |
2014 | 198 | |
2010 | 198 | |
2007 | 190 | |
2014 | 130 |
Page 1
Page 1