Publication | Open Access
Understanding Data Augmentation for Classification: When to Warp?
130
Citations
14
References
2016
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningAutoencodersHuman AugmentationAdditional Training SamplesImage AnalysisData SciencePattern RecognitionGeneric AugmentationData AugmentationMachine VisionFeature LearningMachine Learning ModelKnowledge DiscoveryComputer ScienceDeep LearningMedical Image ComputingComputer Vision
In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating additional training samples are data warping, which generates additional samples through transformations applied in the data-space, and synthetic over-sampling, which creates additional samples in feature-space. We experimentally evaluate the benefits of data augmentation for a convolutional backpropagation-trained neural network, a convolutional support vector machine and a convolutional extreme learning machine classifier, using the standard MNIST handwritten digit dataset. We found that while it is possible to perform generic augmentation in feature-space, if plausible transforms for the data are known then augmentation in data-space provides a greater benefit for improving performance and reducing overfitting.
| Year | Citations | |
|---|---|---|
Page 1
Page 1