Repository of the University of Ljubljana
Avtomatsko postavljanje ločil v surovem tekstu
Rizvič, Mitja (Author), Bajec, Marko (Mentor), Lebar Bajec, Iztok (Comentor)
PDF - Presentation file (2,13 MB)
MD5: 8D102980466C30869A1C58226A5ED481
Abstract
A speech recognition system automatically converts speech into text. The output of such a system is raw text without capital letters, punctuation, or other formatting. Because such text is hard to read and manual editing requires a lot of work, various methods have become established that solve these problems automatically. Such systems can be based on different methods, but recently, mainly because of their good results, various types of neural networks have prevailed. As part of the master's thesis, we therefore implemented a system that relies on recurrent neural networks. We tested it with different vector embeddings, such as GloVe, ELMo, and BERT. We also implemented a web service that makes it easy to integrate the system into various services, such as the automatic speech recognition mentioned above.
Language: Slovenian
Keywords: machine learning, neural networks, punctuation restoration
Work type: Master's thesis/paper
Typology: 2.09 - Master's Thesis
Organization: FRI - Faculty of Computer and Information Science
Year: 2020
PID: 20.500.12556/RUL-117687
COBISS.SI-ID: 32307203
Publication date in RUL: 22.07.2020
Views: 3050
Downloads: 282
Cite this work
RIZVIČ, Mitja, 2020, Avtomatsko postavljanje ločil v surovem tekstu [online]. Master’s thesis. [Accessed 26 March 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=117687
Secondary language
Language: English
Title: Automatic punctuation in raw word sequences
Abstract:
A speech recognition system automatically converts speech into written text. Such systems typically return raw text without any formatting, such as capital letters or punctuation symbols. Because such text is hard to read and takes a lot of work to edit manually, various methods have been introduced that solve these problems automatically. These methods can be based on a variety of approaches; however, because of the good results they provide, different types of neural networks are mainly used nowadays. As part of the master's thesis, we implemented a system that uses a recurrent neural network to predict punctuation symbols in raw, unpunctuated text. We tested it with different word embeddings, such as GloVe, ELMo, and BERT. We also implemented a web service that makes it easy to integrate the system into other services, such as automatic speech recognition.
Keywords: machine learning, neural networks, punctuation restoration
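Illustrative note: the abstract describes predicting punctuation symbols in raw, unpunctuated text. One common way to frame this task is per-word labeling, where each word receives a label for the punctuation mark that follows it. The sketch below shows only that data framing, not the thesis's model; the label set, the function names `to_pairs` and `restore`, and the capitalization rule are assumptions made for illustration.

```python
import re

# Assumed label set for illustration; the thesis may use a different one.
PUNCT = {",", ".", "?", "!"}
NO_PUNCT = "O"  # label for "no punctuation after this word"


def to_pairs(text):
    """Turn punctuated text into (lowercased word, following-punctuation label) pairs."""
    tokens = re.findall(r"\w+|[,.?!]", text)
    pairs = []
    for tok in tokens:
        if tok in PUNCT:
            if pairs:  # attach the mark to the preceding word
                pairs[-1] = (pairs[-1][0], tok)
        else:
            pairs.append((tok.lower(), NO_PUNCT))
    return pairs


def restore(words, labels):
    """Rebuild readable text from raw words and one predicted label per word."""
    out = []
    capitalize = True  # capitalize the first word and any word after . ? !
    for word, label in zip(words, labels):
        token = word.capitalize() if capitalize else word
        if label != NO_PUNCT:
            token += label
        out.append(token)
        capitalize = label in {".", "?", "!"}
    return " ".join(out)


pairs = to_pairs("Hello, how are you? Fine.")
words = [w for w, _ in pairs]
labels = [l for _, l in pairs]
print(restore(words, labels))
```

In a trained system, a sequence model (for example a recurrent network over word embeddings) would supply `labels` for raw input; here they are taken from the reference text only to show the round trip.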