Publication | Closed Access
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach
1.4K
Citations
35
References
2017
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningAutoencodersLabel NoiseImage AnalysisData ScienceNoiseLoss CorrectionSemi-supervised LearningSupervised LearningLoss Correction ApproachData AugmentationMachine VisionFeature LearningLoss CurvatureNoisy DataComputer ScienceDeep LearningComputer VisionDeep Neural Networks
The paper proposes a theoretically grounded, domain‑agnostic approach to train deep neural networks under class‑dependent label noise by introducing two loss‑correction procedures. The method reduces to a matrix inversion and multiplication once the class‑corruption probabilities are known, and these probabilities can be estimated end‑to‑end by extending a recent noise‑estimation technique to multi‑class settings. Experiments on MNIST, IMDB, CIFAR‑10, CIFAR‑100, and a large clothing image dataset across diverse architectures confirm the noise robustness of the proposed methods, and the authors prove that with ReLU alone the loss curvature remains immune to class‑dependent label noise.
We present a theoretically grounded approach to train deep neural networks, including recurrent networks, subject to class-dependent label noise. We propose two procedures for loss correction that are agnostic to both application domain and network architecture. They simply amount to at most a matrix inversion and multiplication, provided that we know the probability of each class being corrupted into another. We further show how one can estimate these probabilities, adapting a recent technique for noise estimation to the multi-class setting, and thus providing an end-to-end framework. Extensive experiments on MNIST, IMDB, CIFAR-10, CIFAR-100 and a large scale dataset of clothing images employing a diversity of architectures - stacking dense, convolutional, pooling, dropout, batch normalization, word embedding, LSTM and residual layers - demonstrate the noise robustness of our proposals. Incidentally, we also prove that, when ReLU is the only non-linearity, the loss curvature is immune to class-dependent label noise.
| Year | Citations | |
|---|---|---|
Page 1
Page 1