izpis_h1_title_alt

Y-GAN : learning dual data representations for anomaly detection in images
ID Ivanovska, Marija (Author), ID Štruc, Vitomir (Author)

.pdfPDF - Presentation file, Download (4,99 MB)
MD5: 83C971776348434F3D3BF2C4CF9C37DD
URLURL - Source URL, Visit https://www.sciencedirect.com/science/article/pii/S0957417424002756 This link opens in a new window

Abstract
We propose a novel reconstruction-based model for anomaly detection in image data, called ’Y-GAN’. The model consists of a Y-shaped auto-encoder and represents images in two separate latent spaces. The first captures meaningful image semantics, which are key for representing (normal) training data, whereas the second encodes low-level residual image characteristics. To ensure the dual representations encode mutually exclusive information, a disentanglement procedure is designed around a latent (proxy) classifier. Additionally, a novel representation-consistency mechanism is proposed to prevent information leakage between the latent spaces. The model is trained in a one-class learning setting using only normal training data. Due to the separation of semantically-relevant and residual information, Y-GAN is able to derive informative data representations that allow for efficacious anomaly detection across a diverse set of anomaly detection tasks. The model is evaluated in comprehensive experiments with several recent anomaly detection models using four popular image datasets, i.e., MNIST, FMNIST, CIFAR10, and PlantVillage. Experimental results show that Y-GAN outperforms all tested models by a considerable margin and yields state-of-the-art results. The source code for the model is made publicly available at https://github.com/MIvanovska/Y-GAN.

Language:English
Keywords:anomaly detection, one-class learning, deep learning, disentangled data representations
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FE - Faculty of Electrical Engineering
Publication status:Published
Publication version:Version of Record
Year:2024
Number of pages:17 str.
Numbering:Vol. 248, art. 123410
PID:20.500.12556/RUL-156066 This link opens in a new window
UDC:004.93
ISSN on article:1873-6793
DOI:10.1016/j.eswa.2024.123410 This link opens in a new window
COBISS.SI-ID:184821507 This link opens in a new window
Publication date in RUL:07.05.2024
Views:314
Downloads:84
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a journal

Title:Expert systems with applications
Publisher:Elsevier
ISSN:1873-6793
COBISS.SI-ID:23001861 This link opens in a new window

Licences

License:CC BY-NC-ND 4.0, Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
Link:http://creativecommons.org/licenses/by-nc-nd/4.0/
Description:The most restrictive Creative Commons license. This only allows people to download and share the work for no commercial gain and for no other purposes.

Secondary language

Language:Slovenian
Keywords:detekcija anomalij, enorazredno učenje, globoko učenje

Projects

Funder:ARRS - Slovenian Research Agency
Project number:P2-0250
Name:Metrologija in biometrični sistemi

Funder:ARRS - Slovenian Research Agency
Project number:J2-50065
Name:Odkrivanje globokih ponaredkov z metodami zaznave anomalij (DeepFake DAD)

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back