Publication | Closed Access
A performance evaluation of Apache Kafka in support of big data streaming applications
106
Citations
3
References
2017
Year
Unknown Venue
Cluster ComputingEngineeringData Streaming ArchitectureStreaming DataData ScienceManagementBig Data ArchitectureData IntegrationParallel ComputingData ManagementStream ProcessingStreaming EngineComputer ScienceData Stream ManagementEdge ComputingCloud ComputingParallel ProgrammingApache KafkaStream ComputingData StreamsBig Data
Stream computing is becoming a more and more popular paradigm as it enables the real-time promise of data analytics. Apache Kafka is currently the most popular framework used to ingest the data streams into the processing platforms. However, how to tune Kafka and how much resources to allocate for it remains a challenge for most users, who now rely mainly on empirical approaches to determine the best parameter settings for their deployments. In this poster, we make a through evaluation of several configurations and performance metrics of Kafka in order to allow users avoid bottlenecks, reach its full potential and avoid bottlenecks and eventually leverage some good practice for efficient stream processing.
| Year | Citations | |
|---|---|---|
Page 1
Page 1