Publication | Closed Access
Musketeer
82
Citations
38
References
2015
Year
Unknown Venue
Cluster ComputingWorkflow ExecutionEngineeringParallel ProcessingCloud ComputingData-intensive PlatformComputer ArchitectureExpress WorkflowsParallel ProgrammingComputer ScienceScalable ComputingParallel ComputingHigh-throughput ComputingData ManagementSystem SoftwareData-intensive ComputingBig Data
Many systems for the parallel processing of big data are available today. Yet, few users can tell by intuition which system, or combination of systems, is "best" for a given workflow. Porting workflows between systems is tedious. Hence, users become "locked in", despite faster or more efficient systems being available. This is a direct consequence of the tight coupling between user-facing front-ends that express workflows (e.g., Hive, SparkSQL, Lindi, GraphLINQ) and the back-end execution engines that run them (e.g., MapReduce, Spark, PowerGraph, Naiad).
| Year | Citations | |
|---|---|---|
Page 1
Page 1