Publication | Open Access
XGBoost
44.2K
Citations
15
References
2016
Year
Unknown Venue
Cluster ComputingEngineeringMachine LearningMachine Learning ToolInformation RetrievalData ScienceData MiningPattern RecognitionData ManagementHigh-performance Data AnalyticsKnowledge DiscoveryQuantile SketchComputer ScienceApproximate TreeScalable TreeData-intensive ComputingModel CompressionMassive Data ProcessingBig Data
Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results on many machine learning challenges. We propose a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning. More importantly, we provide insights on cache access patterns, data compression and sharding to build a scalable tree boosting system. By combining these insights, XGBoost scales beyond billions of examples using far fewer resources than existing systems.
| Year | Citations | |
|---|---|---|
2001 | 119.3K | |
2001 | 27.3K | |
2011 | 8K | |
2002 | 6.9K | |
2000 | 6.9K | |
2014 | 868 | |
2006 | 769 | |
2001 | 448 | |
2007 | 434 | |
2008 | 330 |
Page 1
Page 1