Abstract
Adversarial attack is a technique that causes a malfunction of classification models by adding noise that cannot be distinguished by humans, which poses a threat to a deep learning model. In this paper, we propose an efficient method to detect adversarial images using Gaussian process regression. Existing deep learning-based adversarial detection methods require numerous adversarial images for their training. The proposed method overcomes this problem by performing classification based on the statistical features of adversarial images and clean images that are extracted by Gaussian process regression with a small number of images. This technique can determine whether the input image is an adversarial image by applying Gaussian process regression based on the intermediate output value of the classification model. Experimental results show that the proposed method achieves higher detection performance than the other deep learning-based adversarial detection methods for powerful attacks. In particular, the Gaussian process regression-based detector shows better detection performance than the baseline models for most attacks in the case with fewer adversarial examples.
| Original language | English |
|---|---|
| Pages (from-to) | 4285-4299 |
| Number of pages | 15 |
| Journal | KSII Transactions on Internet and Information Systems |
| Volume | 13 |
| Issue number | 8 |
| DOIs | |
| State | Published - 2019 |
Keywords
- Adversarial Attack
- Adversarial Defense
- Adversarial Detection
- Gaussian Process Regression
- Image Classification
Quacquarelli Symonds(QS) Subject Topics
- Computer Science & Information Systems
Fingerprint
Dive into the research topics of 'Adversarial detection with Gaussian process regression-based detector'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver