The main goal of the thesis was to develop a speech interface that addresses problems with data entry during fieldwork.
In the first part of the thesis, we gave an overview of the speech recognition field and of the tools and speech interfaces that we could use in developing our own speech interface.
In the second part of the thesis, we focused on developing the speech interface in the Python programming language. We used several non-standard Python libraries for audio processing. Speech recognition was performed by the Google Speech API. We used the HTML format to achieve the desired text structure of the output. We also developed a graphical user interface.
We tested the speech interface in different environments with different noise levels. We concluded that it performs well on voice recordings made in a natural environment, where fieldwork is usually performed. Performance drops only in environments with very loud noise.