In terms of material, Slovene corpus linguistics has taken huge strides forward in the last two decades in its awareness and treatment of written lexica, but at the same time Slovene orthography has been considerably neglected. Current corpora show certain orthographical phenomena in relation to the original versions in truncated form (imprecise punctuation, relations between spaces and punctuations marks, symbols) and at a level that is inaccessible to users with the current computer search tools (strings of punctuation marks, symbols).
|