Izboljšanje ločljivosti slik obrazov z uporabo latentno sklopljenih samokodirnikov

LUKEK, MARK

Izboljšanje ločljivosti slik obrazov z uporabo latentno sklopljenih samokodirnikov
ID LUKEK, MARK (Author), ID Štruc, Vitomir (Mentor) More about this mentor... This link opens in a new window

, ID Grm, Klemen (Comentor)

PDF - Presentation file, Download (11,00 MB)
MD5: 5020F9E7A1E6CFAC25C4EE21749BD164

Abstract

V zadnjem času so konvolucijski modeli, ki temeljijo na nevronskih mrežah, do- segli velik uspeh pri superločljivosti z uporabo ene vhodne slike (ang. Single- Image-Super-Resolution). Takšni modeli so zelo prožni in učinkoviti pri neline- arni preslikavi slik nizke ločljivosti v slike visoke ločljivosti. V tem delu pred- stavljamo nov postopek za superločljivost, ki temelji na dveh samokodirnikih in sklopljenih latentnih prostorih. Prvi samokodirnik je zmožen rekonstrukcije nizkoločljivostnih slik, drugi pa visokoločljivostnih slik. Latentna prostora samo- kodirnikov povezuje povezovalna mreža, ki omogoča pretvorbo med nizko- in viso- koločljivostnim latentnim prostorom. Z uporabo nizkoločljivostnega samokodir- nika, povezovalne mreže in visokoločljivostnega samokodirnika je mogoče poljubno vhodno nizkoločljivostno sliko preslikati v sliko visoke ločljivosti. Rezultati ome- njene metode so testirani na štirih podatkovnih zbirkah, CASIA-WebFace, LFW, QMUL-TinyFace in QMUL-SurvFace. Del podatkovne zbirke CASIA-WebFace je bil uporabljen za učenje vseh modelov, preostali del za testiranje. Zbirki QMUL- TinyFace in QMUL-SurvFace sta uporabljeni za preverjanje delovanja sistema na realnih slikah, kjer nimamo visokoločljivostnih parov. Rezultati izvedbe su- perločljivosti se na koncu še primerjajo z že obstoječimi pristopi, kot so bikubična interpolacija, SRCNN in SRGAN. V primerih sprednjega dela obraza, naš pri- stop presega delovanje bikubične interpolacije in modela SRCNN. Obrazi so bolj izraziti in gladki, vendar ne vsebujejo dovolj visokoločljivostnih podrobnosti, kot jih proizvede sistem SRGAN.

Language:	Slovenian
Keywords:	globoko učenje, umetna inteligenca, konvolucijski sloj, su- perločljivost, samokodirnik
Work type:	Master's thesis/paper
Organization:	FE - Faculty of Electrical Engineering
Year:	2022
PID:	20.500.12556/RUL-136206
COBISS.SI-ID:	105855747
Publication date in RUL:	20.04.2022
Views:	1161
Downloads:	122
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Improving the resolution of facial images using latent-space coupled autoencoders
Recently, convolutional models based on neural networks have achieved great success in super-resolution using a single input image, called Single-Image-Super- Resolution or SISR. Such models are very flexible and efficient in non-linear map- ping of low-resolution images to high-resolution ones. In this work, we present a novel super-resolution procedure based on two autoencoders and coupled latent spaces. The first autoencoder is capable of reconstructiong low-resolution images, while the second one is capable od reconstructing high-resolution images. The latent spaces of the two autoencoders are connected by a linking network which allows conversion between the low- and high- resolution latent spaces. Using the low-resolution encoder, the linking network and the high-resolution decoder it is possible to efficiently upscale an arbitrary low-resolution inpout image. The re- sults of the above method area tested on four datasets, CASIA-WebFace, LFW, QMUL-TinyFace and QMUL-SurvFace. Part of the CASIA-WebFace database was used to train all models, the rest for testing. The QMUL-TinyFace and QMUL-SurvFace databases are used to verify the system performance on real images where we do not have high-resolution pairs. Finally, the results of the super-resolution model are further compared with existing approaches such as bicubic interploation, SRCNN and SRGAN. In the cases frontal face images are used as input, our approach outperforms bicubic interpolation and the SRCNN model. The faces are more pronounced and smoother, but do not contain less high-resolution details than faces produced by SRGAN.
Keywords:	deep learning, artificial intelligence, convolutional layers, super- resolution, autoencoders

Similar works from RUL:
Similar works from other Slovenian collections:

Secondary language

Similar documents