Publication | Closed Access
Decision Tree Based Predictive Models for Breast Cancer Survivability on Imbalanced Data
Citations: 105
References: 10
Year: 2009
Unknown Venue
Imbalanced Data, Engineering, Data Science, Data Mining, Prediction Modelling, Class Imbalance, Predictive Analytics, Decision Tree, Breast Cancer Survivability, Breast Imaging, Decision Tree Learning, Breast Cancer, Biostatistics, Cost-sensitive Learning, Public Health, Statistics, Health Informatics, Radiology
This work proposes decision-tree-based predictive models for the 5-year survivability of breast cancer, built on imbalanced data. After preprocessing of the SEER breast cancer datasets, the class distribution is clearly imbalanced. Under-sampling is applied to offset the loss in model performance caused by this imbalance. Model performance is evaluated by the area under the ROC curve (AUC), accuracy, specificity, and sensitivity, using 10-fold stratified cross-validation. The models perform best when the class distribution is approximately equal. A bagging algorithm is used to build an ensemble decision tree model for predicting breast cancer survivability.
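The under-sampling step and three of the evaluation measures named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the function names `undersample` and `binary_metrics`, the toy 90/10 class split, the `ratio` parameter, and the choice of label `1` as the positive class are all assumptions made for illustration. The decision tree, bagging, AUC, and cross-validation steps are omitted.

```python
import random
from collections import Counter


def undersample(rows, labels, majority_label, ratio=1.0, seed=0):
    """Randomly drop majority-class rows (hypothetical helper).

    Keeps about ratio * (minority count) majority rows and all other rows.
    """
    rng = random.Random(seed)
    majority = [i for i, y in enumerate(labels) if y == majority_label]
    minority = [i for i, y in enumerate(labels) if y != majority_label]
    keep_n = min(len(majority), int(round(ratio * len(minority))))
    kept = rng.sample(majority, keep_n) + minority
    rng.shuffle(kept)
    return [rows[i] for i in kept], [labels[i] for i in kept]


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity, and specificity from a 2x2 confusion matrix."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


# Toy data (assumed, not from SEER): 90 rows of class 1, 10 rows of class 0.
rows = [[i] for i in range(100)]
labels = [1] * 90 + [0] * 10

balanced_rows, balanced_labels = undersample(rows, labels, majority_label=1)
print(Counter(balanced_labels))  # 10 rows of each class when ratio=1.0

print(binary_metrics([1, 1, 1, 0, 0], [1, 1, 0, 0, 1]))
```

With `ratio=1.0` the sampled set has equal class counts, which corresponds to the "approximately equal" distribution the abstract reports as giving the best performance. Other ratios can be tried by changing `ratio`.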