Publication | Closed Access
Boosting Adversarial Attacks with Momentum
2.8K
Citations
20
References
2018
Year
Unknown Venue
Artificial IntelligenceData AugmentationDeep Neural NetworksEngineeringMachine LearningData ScienceGenerative Adversarial NetworkMachine Learning ModelAdversarial Machine LearningAi SafetyGenerative ModelComputer ScienceDeep LearningDeep Learning ModelsAdversarial Attacks
Deep neural networks are vulnerable to adversarial examples, and existing attacks achieve only low success rates against black‑box models, making robustness evaluation a critical security concern. The authors propose momentum‑based iterative algorithms, including ensemble variants, to enhance the success of black‑box adversarial attacks and establish a benchmark for evaluating model robustness. By incorporating a momentum term into the iterative update, the methods stabilize direction, escape poor local maxima, and generate more transferable adversarial examples, especially when applied to ensembles of models. The approach secured first place in both the NIPS 2017 non‑targeted and targeted adversarial attack competitions.
Deep neural networks are vulnerable to adversarial examples, which poses security concerns on these algorithms due to the potentially severe consequences. Adversarial attacks serve as an important surrogate to evaluate the robustness of deep learning models before they are deployed. However, most of existing adversarial attacks can only fool a black-box model with a low success rate. To address this issue, we propose a broad class of momentum-based iterative algorithms to boost adversarial attacks. By integrating the momentum term into the iterative process for attacks, our methods can stabilize update directions and escape from poor local maxima during the iterations, resulting in more transferable adversarial examples. To further improve the success rates for black-box attacks, we apply momentum iterative algorithms to an ensemble of models, and show that the adversarially trained models with a strong defense ability are also vulnerable to our black-box attacks. We hope that the proposed methods will serve as a benchmark for evaluating the robustness of various deep models and defense methods. With this method, we won the first places in NIPS 2017 Non-targeted Adversarial Attack and Targeted Adversarial Attack competitions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1