Adversarial Examples Are a Natural Consequence of Test Error in Noise

- July 07, 2019

Over the last few years, the phenomenon of adversarial examples — maliciously constructed inputs that fool trained machine learning models — has captured the attention of the research community, especially when the adversary is restricted to small modifications of a correctly handled input. Less surprisingly, image classifiers also lack human-level performance on randomly corrupted images, such as images with additive Gaussian noise. In this paper, we provide both empirical and theoretical evidence that these are two manifestations of the same underlying phenomenon. We establish close connections between the adversarial robustness and corruption robustness research programs, with the strongest connection in the case of additive Gaussian noise. This suggests that improving adversarial robustness should go hand in hand with improving performance in the presence of more general and realistic image corruptions. Based on our results we recommend that future adversarial defenses consider evaluating the robustness of their methods to distributional shift with benchmarks such as ImageNet-C.

Ford, Nic, et al. "Adversarial examples are a natural consequence of test error in noise." arXiv preprint arXiv:1901.10513 (2019).

Search This Blog

XinLi AI Blog

Adversarial Examples Are a Natural Consequence of Test Error in Noise

Comments

Post a Comment

Popular posts from this blog

Reading notes: Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation

Reading notes: Degenerative Adversarial NeuroImage Nets: Generating Images that Mimic Disease Progression

Reading notes: Non-Local Context Encoder: Robust Biomedical Image Segmentation against Adversarial Attacks