Razpoznavanje slovenskega govora z metodami globokih nevronskih mrež
Ulčar, Matej (Author), Robnik Šikonja, Marko (Mentor), Dobrišek, Simon (Comentor)

PDF - Presentation file (700.11 KB)
MD5: CF5844140875D1ACCAD9A61E360FA590

Abstract
Ročno zapisovanje govora je počasen proces, ki ga čedalje bolj nadomešča avtomatsko razpoznavanje govora. Slednje se lahko uporablja tudi za glasovno upravljanje programov in naprav. V magistrski nalogi smo kot osnovo za razpoznavanje govorjene slovenščine uporabili uveljavljene metode GMM-HMM za akustični model in n-gramov za jezikovni model. Modela smo nadgradili z uporabo globokih nevronskih mrež, ki so se izkazale za zelo uspešne. Preizkusili smo različne arhitekture časovno zakasnjenih nevronskih mrež in nevronskih mrež z dolgim kratkoročnim spominom na akustičnem in jezikovnem modelu razpoznavalnika govora. Razpoznavalnik smo učili na širokem besednjaku, ki vsebuje približno milijon različnih besed. Najboljše rezultate dosegajo časovno zakasnjene nevronske mreže, kjer smo dosegli 72,84% pravilno prepoznanih besed pri tekočem govoru.

Language:Slovenian
Keywords:strojno učenje, globoke nevronske mreže, razpoznavanje govora
Work type:Master's thesis/paper
Organization:FRI - Faculty of Computer and Information Science
Year:2018
PID:20.500.12556/RUL-104850
Publication date in RUL:12.10.2018
Views:3494
Downloads:408
ULČAR, Matej, 2018, Razpoznavanje slovenskega govora z metodami globokih nevronskih mrež [online]. Master’s thesis. [Accessed 18 July 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=104850

Secondary language

Language:English
Title:Computer Speech Recognition in Slovene Language
Abstract:
Manual transcription of speech is slow and is increasingly being replaced by automatic speech recognition, which can also be used for voice control of programs and devices. In this thesis, we used established methods as a baseline for Slovene speech recognition: GMM-HMM for the acoustic model and n-grams for the language model. We then improved both models with deep neural networks, which have proven very successful. We tested several architectures of time-delay neural networks and long short-term memory neural networks for both the acoustic and the language model. The recognizer was trained with a large vocabulary of about one million distinct words. Time-delay neural networks achieved the best results, with 72.84% of words correctly recognized in continuous speech.

Keywords:machine learning, deep neural networks, speech recognition
