A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection - Laboratoire d'informatique de l'X (LIX) Accéder directement au contenu
Article Dans Une Revue Transactions on Machine Learning Research Journal Année : 2023

A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection

Résumé

Despite the widespread use of deep learning algorithms, vulnerability to adversarial attacks is still an issue limiting their use in critical applications. Detecting these attacks is thus crucial to build reliable algorithms and has received increasing attention in the last few years. In this paper, we introduce the HalfspAce Mass dePth dEtectoR (HAMPER), a new method to detect adversarial examples by leveraging the concept of data depths, a statistical notion that provides center-outward ordering of points with respect to (w.r.t.) a probability distribution. In particular, the halfspace-mass (HM) depth exhibits attractive properties which makes it a natural candidate for adversarial attack detection in high-dimensional spaces. Additionally, HM is non differentiable making it harder for attackers to directly attack HAMPER via gradient based-methods. We evaluate HAMPER in the context of supervised adversarial attacks detection across four benchmark datasets. Overall, we empirically show that HAMPER consistently outperforms SOTA methods. In particular, the gains are 13.1% (29.0%) in terms of AUROC↑ (resp. FPR ↓95%) on SVHN, 14.6% (25.7%) on CIFAR10 and 22.6% (49.0%) on CIFAR100 compared to the best performing method.
Fichier principal
Vignette du fichier
451_a_halfspace_mass_depth_based_m.pdf (3.27 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04575113 , version 1 (14-05-2024)

Identifiants

  • HAL Id : hal-04575113 , version 1

Citer

Marine Picot, Federica Granese, Guillaume Staerman, Marco Romanelli, Francisco Messina, et al.. A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection. Transactions on Machine Learning Research Journal, 2023. ⟨hal-04575113⟩
0 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More