izpis_h1_title_alt

FICE : text-conditioned fashion-image editing with guided GAN inversion
ID Pernuš, Martin (Author), ID Fookes, Clinton (Author), ID Štruc, Vitomir (Author), ID Dobrišek, Simon (Author)

.pdfPDF - Presentation file, Download (4,18 MB)
MD5: 159EEA326A40F18F0E79952FA1615B4F
URLURL - Source URL, Visit https://www.sciencedirect.com/science/article/pii/S0031320324007738 This link opens in a new window

Abstract
Fashion-image editing is a challenging computer-vision task where the goal is to incorporate selected apparel into a given input image. Most existing techniques, known as Virtual Try-On methods, deal with this task by first selecting an example image of the desired apparel and then transferring the clothing onto the target person. Conversely, in this paper, we consider editing fashion images with text descriptions. Such an approach has several advantages over example-based virtual try-on techniques: (i) it does not require an image of the target fashion item, and (ii) it allows the expression of a wide variety of visual concepts through the use of natural language. Existing image-editing methods that work with language inputs are heavily constrained by their requirement for training sets with rich attribute annotations or they are only able to handle simple text descriptions. We address these constraints by proposing a novel text-conditioned editing model called FICE (Fashion Image CLIP Editing) that is capable of handling a wide variety of diverse text descriptions to guide the editing procedure. Specifically, with FICE, we extend the common GAN-inversion process by including semantic, pose-related, and image-level constraints when generating images. We leverage the capabilities of the CLIP model to enforce the text-provided semantics, due to its impressive image–text association capabilities. We furthermore propose a latent-code regularization technique that provides the means to better control the fidelity of the synthesized images. We validate the FICE through rigorous experiments on a combination of VITON images and Fashion-Gen text descriptions and in comparison with several state-of-the-art, text-conditioned, image-editing approaches. Experimental results demonstrate that the FICE generates very realistic fashion images and leads to better editing than existing, competing approaches. The source code is publicly available from: https://github.com/MartinPernus/FICE.

Language:English
Keywords:text-conditioning, GAN inversion, image editing, generative artificial intelligence, generative adversarial networks, deep learning, multimodality
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FE - Faculty of Electrical Engineering
Publication status:Published
Publication version:Version of Record
Year:2025
Number of pages:18 str.
Numbering:Vol. 158, art. 111022
PID:20.500.12556/RUL-162586 This link opens in a new window
UDC:004.93
ISSN on article:0031-3203
DOI:10.1016/j.patcog.2024.111022 This link opens in a new window
COBISS.SI-ID:207863555 This link opens in a new window
Publication date in RUL:25.09.2024
Views:134
Downloads:67
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a journal

Title:Pattern recognition
Shortened title:Pattern recogn.
Publisher:Elsevier
ISSN:0031-3203
COBISS.SI-ID:26103040 This link opens in a new window

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Keywords:besedilno pogojevanje, invertiranje GAN modelov, urejanje slik, generativna umetna inteligenca

Projects

Funder:ARRS - Slovenian Research Agency
Project number:P2-0250
Name:Metrologija in biometrični sistemi

Funder:ARRS - Slovenian Research Agency
Project number:J2-2501
Name:Globoki generativni modeli za lepotno in modno industrijo (DeepBeauty)

Funder:ARRS - Slovenian Research Agency
Funding programme:Young researchers

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back