This thesis aims to develop a classifier for antonym detection. A database of pre-made word embeddings for Slovene was used to create the solution. First we collected a learning set consisting of synonyms and antonyms. Then we searched for the most appropriate classification model. We observed some support vector machine models and some deep neural networks. We applied the learned model to groups of words closest to the selected words. Thus, we obtained candidates for pairs of synonyms and antonyms. The accuracy of the results set was evaluated on the test set. The top rated model reaches classification accuracy of 0.70.
|