Publication | Closed Access
BOOSTED TREES FOR ECOLOGICAL MODELING AND PREDICTION
1.3K
Citations
23
References
2007
Year
EngineeringForest BiometricsMachine LearningComputer AnalysisEcological ModellingForestryData ScienceBiogeographyData MiningBoosted TreesDecision TreeManagementDecision Tree LearningStatisticsAccurate PredictionPrediction ModellingRegressionData ModelingPredictive AnalyticsKnowledge DiscoveryDeforestationData ClassificationClassificationForest InventorySoftware Package
Accurate prediction and explanation are fundamental objectives of statistical analysis, yet they seldom coincide. The study proposes aggregated boosted trees (ABT) to achieve accurate prediction and explanation, demonstrating reduced prediction error in simulations. ABT accommodates numeric, categorical, and censored responses, multiple loss functions, and predictors, quantifies interactions, and is implemented in an R package for comparison with boosted trees, bagged trees, random forests, and generalized additive models. In simulations and a regression data set, ABT reduces prediction error compared to boosted trees and other methods.
Accurate prediction and explanation are fundamental objectives of statistical analysis, yet they seldom coincide. Boosted trees are a statistical learning method that attains both of these objectives for regression and classification analyses. They can deal with many types of response variables (numeric, categorical, and censored), loss functions (Gaussian, binomial, Poisson, and robust), and predictors (numeric, categorical). Interactions between predictors can also be quantified and visualized. The theory underpinning boosted trees is presented, together with interpretive techniques. A new form of boosted trees, namely, "aggregated boosted trees" (ABT), is proposed and, in a simulation study, is shown to reduce prediction error relative to boosted trees. A regression data set is analyzed using ABT to illustrate the technique and to compare it with other methods, including boosted trees, bagged trees, random forests, and generalized additive models. A software package for ABT analysis using the R software environment is included in the Appendices together with worked examples.
| Year | Citations | |
|---|---|---|
Page 1
Page 1