Cross-lingual WSD for translation extraction from comparable corpora
Apidianaki, Marianne (Author), Ljubešić, Nikola (Author), Fišer, Darja (Author)

URL (presentation file): http://aclweb.org/anthology/W/W13/W13-2501.pdf

We propose a data-driven approach to enhance translation extraction from comparable corpora. Instead of resorting to an external dictionary, we translate source vector features by using a cross-lingual Word Sense Disambiguation method. The candidate senses for a feature correspond to sense clusters of its translations in a parallel corpus, and the context used for disambiguation consists of the vector that contains the feature. The translations found in the disambiguation output convey the sense of the features in the source vector, while the use of translation clusters makes it possible to expand their translation with several variants. As a consequence, the translated vectors are less noisy and richer, and allow for the extraction of higher-quality lexicons compared to simpler methods.
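
The vector translation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sense inventory, the overlap-based scoring, the function names (`select_cluster`, `translate_vector`) and the example words and weights are all assumptions made for illustration. Each source feature is mapped to the sense cluster whose assumed context words overlap most with the other features of the same vector, and the feature's weight is passed on to every translation in that cluster.

```python
from collections import defaultdict

# Hypothetical sense inventory (made-up data): for each source feature, a list
# of sense clusters. Each cluster pairs target-language translations with the
# source-language context words assumed to characterize that sense.
SENSE_CLUSTERS = {
    "bank": [
        {"translations": {"banka"}, "context": {"money", "loan", "account"}},
        {"translations": {"breg", "obala"}, "context": {"river", "water", "shore"}},
    ],
    "river": [
        {"translations": {"reka"}, "context": {"water", "bank", "flow"}},
    ],
}


def select_cluster(feature, vector, inventory):
    """Return the sense cluster that best matches the rest of the vector.

    The score is the summed weight of the other vector features that appear
    in the cluster's context set (a simple stand-in for a WSD score).
    Returns None when the feature has no known clusters.
    """
    clusters = inventory.get(feature)
    if not clusters:
        return None
    others = {f: w for f, w in vector.items() if f != feature}

    def score(cluster):
        return sum(w for f, w in others.items() if f in cluster["context"])

    return max(clusters, key=score)


def translate_vector(vector, inventory):
    """Translate a source context vector into a target-language vector."""
    translated = defaultdict(float)
    for feature, weight in vector.items():
        cluster = select_cluster(feature, vector, inventory)
        if cluster is None:
            continue  # features without sense clusters are dropped
        # Every translation in the selected cluster inherits the weight,
        # which expands the feature with several translation variants.
        for translation in cluster["translations"]:
            translated[translation] += weight
    return dict(translated)


source_vector = {"bank": 0.8, "river": 0.6, "water": 0.4}
target_vector = translate_vector(source_vector, SENSE_CLUSTERS)
```

In this toy example the feature "bank" is resolved to its riverside sense because "river" and "water" occur in the same vector, so both "breg" and "obala" enter the translated vector while "banka" does not.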

Keywords: comparable corpora, word sense disambiguation, sense clustering
Work type: Not categorized (r6)
Typology: 1.08 - Published Scientific Conference Contribution
Organization: FF - Faculty of Arts
Number of pages: pp. 1-10
COBISS.SI-ID: 52671842