Publication | Open Access
Mitigating Adversarial Effects Through Randomization
195
Citations
17
References
2017
Year
Artificial IntelligenceData AugmentationConvolutional Neural NetworkEngineeringMachine LearningRandomization MethodGenerative Adversarial NetworkDefense SystemsAttack ModelAdversarial EffectsAdversarial Machine LearningConvolutional Neural NetworksData PrivacyAi SafetyComputer ScienceRandomization OperationsDeep LearningData Security
Convolutional neural networks achieve high accuracy yet are highly vulnerable to imperceptible adversarial perturbations that can cause misclassification. This work proposes applying randomization during inference to mitigate such adversarial effects. The defense applies random resizing of input images to a random size followed by random zero padding around the image. Experiments show the method defends effectively against single‑step and iterative attacks, adds minimal computation, requires no retraining, and when combined with adversarial training achieves a normalized score of 0.924, ranking second among 107 teams, far surpassing adversarial training alone (0.773). Code is publicly available at https://github.com/cihangxie/NIPS2017_adv_challenge_defense.
Convolutional neural networks have demonstrated high accuracy on various tasks in recent years. However, they are extremely vulnerable to adversarial examples. For example, imperceptible perturbations added to clean images can cause convolutional neural networks to fail. In this paper, we propose to utilize randomization at inference time to mitigate adversarial effects. Specifically, we use two randomization operations: random resizing, which resizes the input images to a random size, and random padding, which pads zeros around the input images in a random manner. Extensive experiments demonstrate that the proposed randomization method is very effective at defending against both single-step and iterative attacks. Our method provides the following advantages: 1) no additional training or fine-tuning, 2) very few additional computations, 3) compatible with other adversarial defense methods. By combining the proposed randomization method with an adversarially trained model, it achieves a normalized score of 0.924 (ranked No.2 among 107 defense teams) in the NIPS 2017 adversarial examples defense challenge, which is far better than using adversarial training alone with a normalized score of 0.773 (ranked No.56). The code is public available at https://github.com/cihangxie/NIPS2017_adv_challenge_defense.
| Year | Citations | |
|---|---|---|
Page 1
Page 1