izpis_h1_title_alt

Korpus učbenikov za učenje slovenščine kot drugega in tujega jezika
ID Klemen, Matej (Author), ID Arhar Holdt, Špela (Author), ID Pollak, Senja (Author), ID Kosem, Iztok (Author), ID Huber, Damjan (Author), ID Lutar, Mateja (Author)

.pdfPDF - Presentation file, Download (268,52 KB)
MD5: ABF36CDDA4B4DBB50D57976A720C9D67
URLURL - Source URL, Visit https://ebooks.uni-lj.si/ZalozbaUL/catalog/book/374 This link opens in a new window

Abstract
V prispevku prikažemo, kako je potekalo oblikovanje korpusa učbenikov za učenje slovenščine kot drugega in tujega jezika – KUUS, ki je nastal kot vzporedni projekt priprave stopenjskih beril na Centru za slovenščino kot drugi in tuji jezik. KUUS v trenutni različici vključuje 17 učbenikov, obsega 691.003 pojavnice oz. 491.022 besed in je skladno z načeli priprave tovrstnih jezikovnih virov opremljen z metapodatki in oznakami, ki omogočajo uporabo jezikovnih podatkov za različne namene. Predstavimo metodološke odločitve, ki smo jih sprejeli pri pripravi korpusa, trenutno različico korpusa in prvi primer uporabe korpusnih podatkov. Opišemo, kako smo podatke uporabili za pripravo pogostnostnih seznamov besed, ki so prvi korak do korpusno podprtega nabora jedrnega besedišča za slovenščino kot drugi ali tuji jezik in omogočajo primerjavo z drugimi seznami besed. Prispevek zaključimo z načrti za nadaljnji razvoj korpusa in seznamov.

Language:Slovenian
Keywords:slovenščina, slovenščina kot drugi jezik, slovenščina kot tuji jezik, korpus učbenikov, KUUS, seznam besed, Skupni evropski jezikovni okvir
Work type:Article
Typology:1.16 - Independent Scientific Component Part or a Chapter in a Monograph
Organization:FF - Faculty of Arts
Publication status:Published
Publication version:Version of Record
Year:2022
Number of pages:Str. 165-174
PID:20.500.12556/RUL-143990 This link opens in a new window
UDC:811.163.6'243:37.091.64
DOI:10.4312/Obdobja.41.165-174 This link opens in a new window
COBISS.SI-ID:129975811 This link opens in a new window
Copyright:
Licenca navedena na pristajalni strani zbornika.
Publication date in RUL:26.01.2023
Views:1497
Downloads:114
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a monograph

Title:Na stičišču svetov : slovenščina kot drugi in tuji jezik
Editors:Nataša Pirih Svetina, Ina Ferbežar
Place of publishing:Ljubljana
Publisher:Založba Univerze
Year:2022
ISBN:978-961-297-026-0
COBISS.SI-ID:128310531 This link opens in a new window
Collection title:Zbirka Obdobja
Collection numbering:41
Collection ISSN:1408-211X

Licences

License:CC BY-NC-SA 4.0, Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
Link:http://creativecommons.org/licenses/by-nc-sa/4.0/
Description:A Creative Commons license that bans commercial use and requires the user to release any modified works under this license.

Secondary language

Language:English
Abstract:
This article describes the creation of a corpus of textbooks for learning Slovenian as a second and foreign language. The KUUS corpus was created as a parallel project for developing graded readers at the Center for Slovenian as a Second and Foreign Language. In its current version, KUUS includes seventeen textbooks, comprises 691,003 tokens or 491,022 words, and, in line with the principles of preparing language resources of this kind, is equipped with metadata and annotations that allow the linguistic data to be used for various purposes. The methodological decisions made in preparing the corpus, the current version of the corpus, and a first example of the use of corpus data are presented. The paper describes how the data were used to compile word frequency lists, which are the first step toward a corpus-based core vocabulary for Slovenian as a second or foreign language and allow comparison with other word lists. The article concludes with plans for further development of the corpus and lists.

Keywords:Slovene, Slovene as a second language, Slovene as a foreign language, textbook corpus, KUUS, word list, Common European Framework of Reference for Languages

Projects

Funder:ARRS - Slovenian Research Agency
Project number:P6-0411-2019
Name:Jezikovni viri in tehnologije za slovenski jezik

Funder:ARRS - Slovenian Research Agency
Project number:J7-3159-2021
Name:Empirična podlaga za digitalno podprt razvoj pisne jezikovne zmožnosti

Funder:ARRS - Slovenian Research Agency
Project number:P2-0103-2022
Name:Tehnologije znanja

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back