Publication | Closed Access
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation
36
Citations
63
References
2020
Year
Unknown Venue
Cluster ComputingStream Processing EnginesEngineeringComputer ArchitectureData Streaming ArchitectureSoftware AnalysisData ScienceData IntegrationParallel ComputingData ManagementStream ProcessingHigh-performance Data AnalyticsRuntime OptimizationsStreaming EngineComputer EngineeringComputer ScienceData Stream ManagementProgram AnalysisParallel ProgrammingBig Data
Stream Processing Engines (SPEs) execute long-running queries on unbounded data streams. They follow an interpretation-based processing model and do not perform runtime optimizations. This limits the utilization of modern hardware and neglects changing data characteristics at runtime. In this paper, we present Grizzly, a novel adaptive query compilation-based SPE, to enable highly efficient query execution. We extend query compilation and task-based parallelization for the unique requirements of stream processing and apply adaptive compilation to enable runtime re-optimizations. The combination of light-weight statistic gathering with just-in-time compilation enables Grizzly to adjust to changing data-characteristics dynamically at runtime. Our experiments show that Grizzly outperforms state-of-the-art SPEs by up to an order of magnitude in throughput.
| Year | Citations | |
|---|---|---|
Page 1
Page 1