Ανίχνευση επιθέσεων με αντίπαλη τεχνητή νοημοσύνη
Adversarial AI attack detection

Keywords
Adversarial AI attack detection ; FGSM ; PGD ; Deep learning detectors ; ANN CNN RNN detectors ; Unknown adversarial attacks ; Semi-supervised learning ; Lifelong learning ; Image-based adversarial attacks

Abstract
This thesis investigates the detection of adversarial attacks against deep learning systems used for image classification. Two datasets were examined: the LISA Traffic Light Dataset (traffic light recognition) and the FruitNet: Indian Fruits Quality Dataset (fruit quality recognition), allowing model performance to be evaluated both in-distribution and out-of-distribution. First, ANN, CNN, and RNN classifiers were trained, and FGSM and PGD adversarial attacks were then generated against them. From these data, adversarial-input detectors with corresponding architectures were developed, along with a semi-supervised and a lifelong-learning CNN-based detector. Evaluations were conducted on clean and perturbed images, as well as on unknown attacks. The results showed that CNN models were the most stable and accurate, both as classifiers and as detectors, maintaining high performance across both datasets. The semi-supervised detector improved generalization without additional labeling, while the lifelong-learning detector achieved near-complete detection of unknown attacks, at the cost of only a small increase in false alarms. The consistency of the findings across the two datasets suggests that adversarial patterns share common characteristics regardless of domain, and that the proposed detectors can operate reliably in changing environments.
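For readers unfamiliar with FGSM, the attack mentioned above perturbs each input pixel by a small step ε in the direction of the sign of the loss gradient with respect to the input. The following is a minimal, hedged sketch of that idea using a toy logistic-regression "model" in NumPy (the thesis itself attacks ANN/CNN/RNN classifiers; the model, function name, and parameters here are illustrative assumptions, not the author's code):

```python
import numpy as np

def fgsm_attack(x, w, b, y, eps):
    """One FGSM step against a toy logistic-regression model (hypothetical
    stand-in for the thesis classifiers). For binary cross-entropy loss,
    the gradient of the loss w.r.t. the input x is (sigmoid(w.x + b) - y) * w.
    """
    p = 1.0 / (1.0 + np.exp(-(x @ w + b)))  # model's predicted probability
    grad_x = (p - y) * w                     # dLoss/dx for BCE loss
    x_adv = x + eps * np.sign(grad_x)        # FGSM: step in sign of gradient
    return np.clip(x_adv, 0.0, 1.0)          # keep perturbed pixels in [0, 1]

# Example: a uniform "image" of 4 pixels, true label y = 1
x = np.full(4, 0.5)
w = np.array([1.0, -1.0, 1.0, -1.0])
x_adv = fgsm_attack(x, w, b=0.0, y=1.0, eps=0.1)
```

PGD, the other attack evaluated in the thesis, can be viewed as this same step applied iteratively with a smaller step size, projecting `x_adv` back into the ε-ball around `x` after each iteration.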


