Globoke nevronske mreže za semantično segmentacijo za navidezna ozadja

TRATNIK, AMADEJ

Globoke nevronske mreže za semantično segmentacijo za navidezna ozadja
ID TRATNIK, AMADEJ (Author), ID Čehovin Zajc, Luka (Mentor) More about this mentor... This link opens in a new window

PDF - Presentation file, Download (11,91 MB)
MD5: 989D4EB4581210C6770C9B724045CC8A

Abstract

Zaznavanje objektov na slikah je aktualna tematika v industriji in raziskovanju, saj omogoča avtomatsko prepoznavanje posameznega objekta na sliki, pogosto hitreje in točneje od človeškega očesa. S porastom globokih nevronskih mrež je še posebej zanimivo področje semantične segmentacije, ki omogoča ekstrakcijo informacije do ravni posameznih slikovnih elementov. V okviru diplomske naloge smo se posvetili problemu prepoznavanja osebe v videu in zamenjave ozadja s poljubno vsebino. Zasnovali smo primerno točno in raznoliko podatkovno množico oseb in njihovih binarnih mask, implementirali in naučili dve konvolucijski nevronski mreži segmentacije, Fast-SCNN in UNet, ju primerjali in analizirali rezultate. Arhitekturo Fast-SCNN smo še dodatno optimizirali z orodjem ONNX Runtime, namenjenim produkciji, in ji omogočili izvajanje na CPE v realnem času. S primerno anotirano množico za učenje in optimizirano različico nevronske mreže Fast-SCNN smo dosegli v povprečju 27 sličic na sekundo pri prepoznavanju osebe v videu ter 29 sličic na sekundo pri prepoznavanju osebe v realnem času preko spletne kamere.

Language:	Slovenian
Keywords:	semantična segmentacija, globoko učenje, računalniški vid
Work type:	Bachelor thesis/paper
Typology:	2.11 - Undergraduate Thesis
Organization:	FRI - Faculty of Computer and Information Science
Year:	2022
PID:	20.500.12556/RUL-135746
COBISS.SI-ID:	104184067
Publication date in RUL:	30.03.2022
Views:	1318
Downloads:	177
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Virtual backgrounds as semantic segmentation using deep neural networks
Object detection is a current topic in industry and research. It enables automatic identification of an individual objects in an image, which is often faster and more accurate than that of the human eye. With the rise of deep neural networks, the process of semantic segmentation is particularly interesting, as it allows the extraction of information from an image on pixel level. As part of the BA thesis, we addressed the issue of identifying a person in a video and replacing their background with any given content. We designed a diverse and accurate set of data subjects and their binary masks, implemented and trained two convolutional neural networks for semantic segmentation, Fast-SCNN and UNet. We then compared the two networks and analyzed the results. The Fast-SCNN network was further optimized with ONNX Runtime to enable real-time execution on the CPU. On an appropriately annotated dataset combined with an optimized version of the Fast-SCNN neural network, we achieved an average of 27 FPS in videos and 29 FPS in real-time webcam segmentation.
Keywords:	semantic segmentation, deep learning, computer vision

Similar works from RUL:
Similar works from other Slovenian collections:

Secondary language

Similar documents