Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Repository of the University of Ljubljana
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Browse
New in RUL
About RUL
In numbers
Help
Sign in
Details
Razvoj govornega vmesnika za vnos podatkov pri terenskem delu
ID
SEVER, VID
(
Author
),
ID
Dobrišek, Simon
(
Mentor
)
More about this mentor...
PDF - Presentation file,
Download
(1,48 MB)
MD5: 5C7AA42581B52863EF068CDE2558A48C
PID:
20.500.12556/rul/2a16d64a-44ef-416c-a336-8e47cdfde282
Image galllery
Abstract
Cilj dela v diplomski nalogi je razviti govorni vmesnik, ki bo uspešno reševal probleme z vnašanjem podatkov v informacijske sisteme med terenskim delom. V prvem delu naloge smo raziskali področje razpoznavanja govora in pregledali možne govorne vmesnike ter orodja, katere bi lahko uporabili pri svojem delu V drugem delu naloge smo se osredotočili na samo izvedbo govornega vmesnika v programskem jeziku Python. Pri obdelavi posnetkov govora smo uporabili nekaj nestandardnih Python knjižnic. Za razpoznavanje govora smo uporabili Googlov govorni programski vmesnik Google Speech API. Razpoznano besedilo smo oblikovali v HTML formatu. Razvili smo tudi grafični vmesnik. Delovanje govornega vmesnika smo preizkusili v okoljih z različno ravnijo hrupa. Ugotovili smo, da zadovoljivo dobro deluje tudi pri posnetkih, narejenih v naravnem okolju, v katerem terensko delo navadno poteka.
Language:
Slovenian
Keywords:
razpoznavanje govora
,
govorni vmesnik
,
Google Speech API
Work type:
Undergraduate thesis
Organization:
FE - Faculty of Electrical Engineering
Year:
2016
PID:
20.500.12556/RUL-85659
Publication date in RUL:
20.09.2016
Views:
2021
Downloads:
603
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
SEVER, VID, 2016,
Razvoj govornega vmesnika za vnos podatkov pri terenskem delu
[online]. Bachelor’s thesis. [Accessed 31 March 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=85659
Copy citation
Share:
Secondary language
Language:
English
Title:
The development of a speech interface for data entry in fieldwork
Abstract:
Main goal of the thesis was to develop a speech interface for solving problems with data entry during fieldwork. In first part of the thesis we did an overview of speech recognition field, tools and speech interfaces which we cloud use in development of my own speech interface. In the second part of the thesis we focused on developing speech interface with python programing language. We used some nonstandard python libraries for audio processing. Speech recognition was performed by Google Speech API. We used HTML format to achieve the desired text structure of the output. We also developed a graphical user interface. We tested the speech interface in different environments with different noise volumes. We concluded that it performs well with voice recordings that were recorded in a natural environment, where fieldwork is usually performed. Performance drops only in environments with a really loud noise.
Keywords:
speech recognition
,
speech interface
,
Google Speech API
Similar documents
Similar works from RUL:
Antioxidant activity of germinated spelt
Germinated buckwheat
Podaljševanje trajnosti kruha z mikrovalovi
Determination of selected physico-chemical parameters of buckwheat honey
Antimicrobial activity of phenolic extracts from grape pomace
Similar works from other Slovenian collections:
Isolation of biologically active compounds from bilberries (Vaccinium myrtillus L.)
THE CONTENT OF PHENOLIC COMPOUNDS IN FRUIT AND HERBAL BEVERAGES
Extraction and analysis of bioactive compounds from oats
Antioksidativna kapaciteta rdečih vin
EXTRACTION OF PHENOLIC COMPOUNDS FROM MULBERRY FRUIT
Back