Publication | Closed Access
Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application
24
Citations
11
References
2010
Year
Unknown Venue
Cluster ComputingHeterogeneous ComputingNew DesignEngineeringComputer ArchitectureStructural OptimizationStructural EngineeringHigh-performance ArchitectureSeismic AnalysisNumerical SimulationSystems EngineeringMpi Communication RoutinesModeling And SimulationParallel ComputingSeismic Modeling ApplicationMpi Communication CallsMassively-parallel ComputingEarthquake EngineeringHybrid ProgrammingComputer EngineeringPerformance BenefitsComputer ScienceSeismologyEdge ComputingSeismic Reflection ProfilingCivil EngineeringCloud ComputingSeismic IsolationParallel Performance EvaluationParallel Programming
AWM-Olsen is a widely used ground motion simulation code based on a parallel finite difference solution of the 3-D velocity-stress wave equation. This application runs on tens of thousands of cores and consumes several million CPU hours on the TeraGrid Clusters every year. A significant portion of its run-time (37% in a 4,096 process run), is spent in MPI communication routines. Hence, it demands an optimized communication design coupled with a low-latency, high-bandwidth network and an efficient communication subsystem for good performance. In this paper, we analyze the performance bottlenecks of the application with regard to the time spent in MPI communication calls. We find that much of this time can be overlapped with computation using MPI non-blocking calls. We use both two-sided and MPI-2 one-sided communication semantics to re-design the communication in AWM-Olsen. We find that with our new design, using MPI-2 one-sided communication semantics, the entire application can be sped up by 12% at 4K processes and by 10% at 8K processes on a state-of-the-art InfiniBand cluster, Ranger at the Texas Advanced Computing Center (TACC).
| Year | Citations | |
|---|---|---|
Page 1
Page 1