The diploma thesis is focused on the development of a prototype system for voice control of an application on an android device.
On internet majority of services for speech recognition offered by technological giants are payable and mostly works with usage of internet.
Open source speech recognition systems are not well documented and support only
world languages like english or german.
Therefore, the whole process of speech recognition development on android device for the Slovenian language was created and described on the basis of open source tools CMU Sphinx, which are operating without an internet connection, which has not yet been addressed.
Using the CMU Sphinx tools, an acoustic model was developed for a limited set of Slovene commands based on one speaker.
This acoustic model was then integrated into a simple demonstration android application where the recognition of commands in the Slovene language was implemented using the PocketSphinx library.
Test results were extremely successful and showed the fast and accurate operation of voice command recognition.
|