Publication | Open Access
Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling
17
Citations
32
References
2020
Year
Unknown Venue
EngineeringMachine LearningComputer ArchitectureSampling TechniqueCost-effective Sampling StrategiesData ScienceModeling And SimulationParallel ComputingStatisticsQuantitative ManagementPerformance PredictionEmpirical Performance ModelsHigh-performance Data AnalyticsPredictive AnalyticsComputer EngineeringSampling TheorySampling (Statistics)Computer SciencePerformance Analysis ToolParallel ApplicationsComputational ScienceScalability BottlenecksBenchmarking ToolParameter TuningStatistical InferenceParallel Programming
Identifying scalability bottlenecks in parallel applications is a vital but also laborious and expensive task. Empirical performance models have proven to be helpful to find such limitations, though they require a set of experiments in order to gain valuable insights. Therefore, the experiment design determines the quality and cost of the models. Extra-P is an empirical modeling tool that uses small-scale experiments to assess the scalability of applications. Its current version requires an exponential number of experiments per model parameter. This makes the creation of empirical performance models very expensive, and in some situations even impractical. In this paper, we propose a novel parameter-value selection heuristic, which functions as a guideline for the experiment design, leveraging sparse performance-modeling, a technique that only needs a polynomial number of experiments per model parameter. Using synthetic analysis and data from three different case studies, we show that our solution reduces the average modeling costs by about 85% while retaining 92% of the model accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1