izpis_h1_title_alt

Več glav več ve : uporaba množičenja za čiščenje sloWNeta
ID Fišer, Darja (Author), ID Tavčar, Aleš (Author)

.pdfPDF - Presentation file, Download (875,38 KB)
MD5: 1733FEBDBEB555A8E2081AB76FA602A7
URLURL - Source URL, Visit https://centerslo.si/simpozij-obdobja/zborniki/obdobja-32/ This link opens in a new window

Abstract
V prispevku predstavljamo projekt čiščenja avtomatsko generiranega semantičnega leksikona sloWNet. Napake, ki se v leksikonu pojavljajo zaradi napačne avtomatske disambiguacije večpomenskih besed, smo odpravili s pomočjo orodja sloWCrowd, ki je zasnovano tako, da odgovore za problematične literale zbira iz široke množice uporabnikov - prostovoljcev. Naloga je oblikovana kot spletna igra, v kateri uporabniki tekmujejo, kdo bo zbral več točk (prispeval več pravilnih odgovorov). Glede na to, da tekmovalci niso izurjeni leksikografi, njihovi odgovori niso nujno zanesljivi, zato orodje omogoča merjenje njihove natan~nosti in pri vsakem vprašanju upošteva večinski odgovor, s čimer zagotavlja, da posamezni napačni odgovori sicer zanesljivih uporabnikov ter vsi odgovori nezanesljivih uporabnikov ne vplivajo na dokončno odločitev, ali se določen literal iz leksikona izbriše ali ne.

Language:Slovenian
Keywords:slovenščina, množičenje, leksikalna semantika, večpomenskost, sloWNet
Work type:Article
Typology:1.16 - Independent Scientific Component Part or a Chapter in a Monograph
Organization:FF - Faculty of Arts
Year:2013
Number of pages:Str. 125-132
PID:20.500.12556/RUL-147646 This link opens in a new window
UDC:811.163.6'374'371'322
COBISS.SI-ID:53227362 This link opens in a new window
Publication date in RUL:10.07.2023
Views:974
Downloads:48
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a monograph

Title:Družbena funkcijskost jezika : (vidiki, merila, opredelitve)
Editors:Andreja Žele
Place of publishing:Ljubljana
Publisher:Znanstvena založba Filozofske fakultete
Year:2013
ISBN:978-961-237-609-3
COBISS.SI-ID:269357568 This link opens in a new window
Collection title:Obdobja
Collection numbering:32
Collection ISSN:1408-211X

Secondary language

Language:English
Abstract:
The paper presents the cleaning of the automatically generated semantic lexicon sloWNet. Errors that occurred due to inappropriate disambiguation of polysemous words were eliminated with a tool called sloWCrowd, which is designed in such a way that it collects multiple answers for problematic literals from a wide number of volunteer users. The task is designed as a web game in which users compete who will collect the highest number of points (contribute the most correct answers). Since the users are not trained lexicographers, the reliability of their answers is questionable, which is whythe tool has been designed to measure the usersʼ accuracy and relies on themajority vote for each literal. This means that the individual incorrect answers from otherwise reliable users and all the answers from unreliable users do not affect the final decision whether or not the literal is to be deleted from the lexicon.

Keywords:Slovenian language, crowdsourcing, lexical semantics, polysemy, sloWNet

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back