Enriching Slovene WordNet with domain-specific terms
Vintar, Špela (Author), Fišer, Darja (Author)

URLURL - Presentation file, Visit http://www.t-c3.org/index.php/t-c3/article/view/4 This link opens in a new window

The paper describes an innovative approach to expanding the domain coverage ofwordnet by exploiting multiple resources. In the experiment described here we are using a large monolingual Slovene corpus of texts from the domain of informatics to harvest terminology from, and a parallel English-Slovene corpusand an online dictionary as bilingual resources to facilitate the mapping of terms to the Slovene Wordnet. We first identify the core terms of the domain in English using the Princeton Wordnet, and then we translate them into Slovene using a bilingual lexicon produced from the parallel corpus. In the next step we extract multi-word terms from the Slovene domain-specific corpus using a hybrid approach, and finally match the term candidates to existing Wordnet synsets. The proposed method appears to be a successful way to improve the domain coverage of Wordnet as it yields abundant term candidates and exploits various multilingual resources.

Keywords:samodejna izdelava WordNeta, večbesedni termini, WordNet, luščenje terminologije, vzporedni korpusi, Slovenija, WordNet construction, parallel corpora, multi-word expressions, term extraction, Slovene WordNet
Work type:Not categorized (r6)
Tipology:1.01 - Original Scientific Article
Organization:FF - Faculty of Arts
Number of pages:str. 29-44
Numbering:Vol. 1, no. 1
ISSN on article:2193-6986
COBISS.SI-ID:48473698 Link is opened in a new window
Average score:(0 votes)
Your score:Voting is allowed only to logged in users.
AddThis uses cookies that require your consent. Edit consent...

Record is a part of a journal

Title:Translation: computation, corpora, cognition
Shortened title:Transl.: comput. corpora cogn.
Publisher:ICSI, RWTH Aachen University, Johannes-Gutenberg-Universität
COBISS.SI-ID:48463714 This link opens in a new window

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:


Leave comment

You have to log in to leave a comment.

Comments (0)
0 - 0 / 0
There are no comments!