Publication | Closed Access
Distribution Based Workload Modelling of Continuous Queries in Clouds
46
Citations
26
References
2016
Year
Continuous QueriesCluster ComputingEngineeringCloud Load BalancingData Streaming ArchitectureStreaming DataCloud Resource ManagementOperations ResearchData ScienceSystems EngineeringResource UsageInternet Of ThingsData ManagementStreaming EngineMobile ComputingComputer ScienceData Stream ManagementResource Usage EstimationEdge ComputingCloud ComputingWorkload ManagementBig Data
Resource usage estimation for managing streaming workload in emerging applications domains such as enterprise computing, smart cities, remote healthcare, and astronomy, has emerged as a challenging research problem. Such resource estimation for processing continuous queries over streaming data is challenging due to: (i) uncertain stream arrival patterns, (ii) need to process different mixes of queries, and (iii) varying resource consumption. Existing techniques approximate resource usage for a query as a single point value which may not be sufficient because it is neither expressive enough nor does it capture the aforementioned nature of streaming workload. In this paper, we present a novel approach of using mixture density networks to estimate the whole spectrum of resource usage as probability density functions. We have evaluated our technique using the linear road benchmark and TPC-H in both private and public clouds. The efficiency and applicability of the proposed approach is demonstrated via two novel applications: i) predictable auto-scaling policy setting which highlights the potential of distribution prediction in consistent definition of cloud elasticity rules; and ii) a distribution based admission controller which is able to efficiently admit or reject incoming queries based on probabilistic service level agreements compliance goals.
| Year | Citations | |
|---|---|---|
Page 1
Page 1