Podrobno

Sistem za interakcijo z virtualnimi liki na podlagi velikih jezikovnih modelov
ID Soršek, Matej (Avtor), ID Bešter, Janez (Mentor) Več o mentorju... Povezava se odpre v novem oknu, ID Mali, Luka (Komentor)

.pdfPDF - Predstavitvena datoteka, prenos (1,67 MB)
MD5: F4F23C31C736242D049393E8F6D0A073

Izvleček
Cilj diplomskega dela je bil razviti sistem za pogovor z virtualnimi liki na osnovi razpoznave govora, generacije odgovorov in sinteze govora. Pri izvedbi je bilo treba pregledati obstoječa orodja umetne inteligence (AI/Artifical Inteligence), orodja za generacijo besedila in govora ter jih preizkusiti in povezati v celoto, ki nam omogoča uporabo sistema, tj. govorno interakcijo z virtualnim likom. Izbor orodij je temeljil na kriterijih kakovosti, izvedljivosti, izhodnega rezultata, hitrosti odziva in združljivosti z načrtovano arhitekturo sistema. Ključna je bila storitev velikih jezikovnih modelov (LLM/Large Language Model), saj predstavlja osrednji del sistema in definira odzive na vhodna vprašanja. Sledila je implementacija za pretvorbo besedilnih odgovorov v govor, integracijo prepoznave govora in njegovo pretvorbo v besedilo. Rezultat diplomskega dela je delujoč sistem, ki združuje napredne storitve velikih jezikovnih modelov in omogoča govorno interakcijo z virtualnim likom. Kot dodatek smo razvili tudi predvajalnik, ki je sicer primarno namenjen predvajanju zvoka, lahko pa vanj vstavimo poljuben videoposnetek, da se lažje vključimo v interakcijo z likom. Sistem tako predstavlja osnovo za nadaljnji razvoj, ki bo speljan v smeri naprednega video prikazovanja.

Jezik:Slovenski jezik
Ključne besede:uporabniški vmesnik, veliki jezikovni modeli, analiza govora, sinteza govora, virtualni lik
Vrsta gradiva:Diplomsko delo/naloga
Tipologija:2.11 - Diplomsko delo
Organizacija:FE - Fakulteta za elektrotehniko
Leto izida:2025
PID:20.500.12556/RUL-172581 Povezava se odpre v novem oknu
COBISS.SI-ID:250354691 Povezava se odpre v novem oknu
Datum objave v RUL:09.09.2025
Število ogledov:141
Število prenosov:21
Metapodatki:XML DC-XML DC-RDF
:
Kopiraj citat
Objavi na:Bookmark and Share

Sekundarni jezik

Jezik:Angleški jezik
Naslov:Solution for Interaction with Virtual Characters Based on Large Language Models
Izvleček:
The goal of the thesis was to develop a system for conversation with virtual characters based on speech recognition, response generation, and speech synthesis. During implementation, it was necessary to review existing artificial intelligence (AI) tools, including those for text and speech generation, test them, and integrate them into a system that enables us to use the system, i.e., speech interaction with a virtual character. The selection of tools was based on the criteria of quality, feasibility, output result, response speed, and compatibility with the planned system architecture. The key was the service of large language models (LLMs/Large Language Models), as they represent the central part of the system and define responses to input questions. This was followed by the implementation of converting text responses into speech, integrating speech recognition, and its subsequent conversion into text. The result of the thesis is a working system that combines advanced services of large language models and enables speech interaction with a virtual character. As an addition, we have also developed a player that is primarily intended for playing audio, but can also be used to insert any video clip to facilitate interaction with the character. The system thus represents the basis for further development, which will be directed towards advanced video display.

Ključne besede:user interface, large language models, speech analysis, speech synthesis, virtual character

Podobna dela

Podobna dela v RUL:
Podobna dela v drugih slovenskih zbirkah:

Nazaj