Concepedia

Publication | Open Access

Decision tree methods: applications for classification and prediction.

1.1K

Citations

10

References

2015

Year

Abstract

Decision tree methodology is a commonly used data mining method for establishing classification systems based on multiple covariates or for developing prediction algorithms for a target variable. This method classifies a population into branch-like segments that construct an inverted tree with a root node, internal nodes, and leaf nodes. The algorithm is non-parametric and can efficiently deal with large, complicated datasets without imposing a complicated parametric structure. When the sample size is large enough, study data can be divided into training and validation datasets. Using the training dataset to build a decision tree model and a validation dataset to decide on the appropriate tree size needed to achieve the optimal final model. This paper introduces frequently used algorithms used to develop decision trees (including CART, C4.5, CHAID, and QUEST) and describes the SPSS and SAS programs that can be used to visualize tree structure.

References

YearCitations

1992

434

1989

102

2014

88

2009

59

2001

51

2013

40

2004

20

2004

19

2008

13

2012

13

Page 1