Publication | Closed Access
Experimental perspectives on learning from imbalanced data
779
Citations
12
References
2007
Year
Unknown Venue
Comprehensive SuiteEngineeringMachine LearningSampling TechniqueDifferent Performance MetricsData ScienceData MiningPattern RecognitionClass ImbalanceBiasManagementStatisticsPredictive AnalyticsKnowledge DiscoveryComputer ScienceData ClassificationExperimental PerspectivesClassifier PerformanceClassifier SystemCost-sensitive LearningLimited Data Learning
When classes are imbalanced, many learning algorithms suffer reduced performance. The study investigates whether data sampling can improve learner performance on imbalanced data, how effectiveness depends on learner type, and whether results vary across performance metrics. The authors conduct a comprehensive suite of experiments on learning from imbalanced data. The results show that sampling frequently improves classifier performance.
We present a comprehensive suite of experimentation on the subject of learning from imbalanced data. When classes are imbalanced, many learning algorithms can suffer from the perspective of reduced performance. Can data sampling be used to improve the performance of learners built from imbalanced data? Is the effectiveness of sampling related to the type of learner? Do the results change if the objective is to optimize different performance metrics? We address these and other issues in this work, showing that sampling in many cases will improve classifier performance.
| Year | Citations | |
|---|---|---|
Page 1
Page 1