Concepedia

Abstract

The Real-time Analytics Data Stack, colloquially referred to \ as the RADStack, is an open-source data analytics stack designed \ to provide fast, flexible queries over up-to-the-second \ data. It is designed to overcome the limitations of either \ a purely batch processing system (it takes too long to surface \ new events) or a purely real-time system (it’s difficult \ to ensure that no data is left behind and there is often no \ way to correct data after initial processing). It will seamlessly \ return best-effort results on very recent data combined \ with guaranteed-correct results on older data. In this paper, \ we introduce the architecture of the RADStack and discuss \ our methods of providing interactive analytics and a flexible \ data processing environment to handle a variety of real-world \ workloads.

References

YearCitations

2008

18.4K

2010

4.8K

2012

3.6K

2007

3.4K

2010

2.6K

2013

1.8K

2010

1.3K

1996

1.2K

2005

923

2010

702

Page 1