The thesis is focused on automatic speech recognition of Slovenian phonemes, based on the Sofes database. The speech and phoneme recognition toolkit Kaldi is used, which has thus far not been used for the Slovenian language.
The speech recognition process was implented with various acousting and lingustic models. The results, obtained by using both neural networks and classical HMM approaches, yielded promising results.
|