The project Communication in Slovene includes the construction of a reference corpus of spoken Slovene, which will serve as a resource for certain language guides and research projects. Because the corpus has practical goals, its key aims are straightforward search options and easy-to-read transcriptions. This paper presents the method to be used for marking up recordings and for segmenting and transcribing speech samples.