Publication | Closed Access
Flexpath: Type-Based Publish/Subscribe System for Large-Scale Science Analytics
73
Citations
28
References
2014
Year
Unknown Venue
Cluster ComputingEngineeringLarge-scale Science AnalyticsFlex Path SystemScience GatewayData ScienceData-intensive PlatformData IntegrationParallel ComputingData ManagementFlex PathModel DisseminationKnowledge DiscoveryComputer ScienceScientific InquiryData-intensive ComputingWorkflow ExecutionScientific Workflow SystemCloud ComputingParallel ProgrammingBig Data
As high-end systems move toward exascale sizes, a new model of scientific inquiry being developed is one in which online data analytics run concurrently with the high end simulations producing data outputs. Goals are to gain rapid insights into the ongoing scientific processes, assess their scientific validity, and/or initiate corrective or supplementary actions by launching additional computations when needed. The Flex path system presented in this paper addresses the fundamental problem of how to structure and efficiently implement the communications between high end simulations and concurrently running online data analytics, the latter comprised of componentized dynamic services and service pipelines. Using a type-based publish/subscribe approach, Flexpath encourages diversity by permitting analytics services to differ in their computational and scaling characteristics and even in their internal execution models. Flex path uses direct and MxN connections between interacting services to reduce data movements, to allow for runtime connectivity changes to accommodate component arrivals/departures, and to support the multiple underlying communication protocols used for analytics workflows in which simulation outputs are processed by analytics services residing on the same nodes where they are generated, on the same machine, and/or on attached or remote analytics engines. This paper describes the design and implementation of Flex path, and evaluates it with two widely used scientific applications and their associated data analytics methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1