Concepedia

Abstract

Several aspects may influence the performance achieved by a classifier created by a Machine Learning system. One of these aspects is related to the dierence between the numbers of examples belonging to each class. When this dierence is large, the learning system may have diculties to learn the concept related to the minority class. In this work 1 , we discuss several issues related to learning with skewed class distributions, such as the relationship between cost-sensitive learning and class distributions, and the limitations of accuracy and error rate to measure the performance of classifiers. Also, we survey some methods proposed by the Machine Learning community to solve the problem of learning with imbalanced data sets, and discuss some limitations of these methods.

References

YearCitations

Page 1