Repository of the University of Ljubljana
Oblikoskladenjsko označevanje slovenskega jezika z globokimi nevronskimi mrežami
Belej, Primož (Author), Robnik Šikonja, Marko (Mentor), Krek, Simon (Co-mentor)
PDF - Presentation file (4.95 MB)
MD5: 93613CCF380E4F794BB8825859E333F3
Abstract
This master's thesis addresses morphosyntactic tagging of Slovene. In this natural language processing task, each sentence is assigned a sequence of tags that describe the morphosyntactic properties of its words. Unlike typical approaches, which process input sentences at the word level, our solution treats input sentences as sequences of characters. We solve the tagging task with a combination of convolutional and recurrent neural networks. Our approach is also distinctive in how it frames tagging: we treat it not as a multi-class classification problem but as multi-label classification, in which each example is assigned a set of labels. To improve the results, we combine our solution with two existing taggers for Slovene into an ensemble of three taggers. Comparing our solution with existing ones, we find that the proposed solution achieves the best results on the given problem.
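The difference between the multi-class and multi-label framings mentioned in the abstract can be illustrated with a small sketch. It assumes positional tags in the style of `Ncmsn`, where each character position encodes one property; the helper names (`tag_to_labels`, `build_label_index`, `encode_multi_label`, `encode_multi_class`) and the decomposition into (position, character) labels are illustrative assumptions, not the label scheme used in the thesis.

```python
def tag_to_labels(tag):
    """Split a positional tag into a set of (position, character) labels.

    Assumption: '-' marks an unspecified property and yields no label.
    """
    return {(i, ch) for i, ch in enumerate(tag) if ch != "-"}


def build_label_index(tags):
    """Map every (position, character) label seen in `tags` to a column."""
    labels = sorted(set().union(*(tag_to_labels(t) for t in tags)))
    return {label: k for k, label in enumerate(labels)}


def encode_multi_label(tag, label_index):
    """Multi-label target: one active bit per property of the tag."""
    vec = [0] * len(label_index)
    for label in tag_to_labels(tag):
        vec[label_index[label]] = 1
    return vec


def encode_multi_class(tag, tag_index):
    """Multi-class target: one active bit for the whole tag."""
    vec = [0] * len(tag_index)
    vec[tag_index[tag]] = 1
    return vec


if __name__ == "__main__":
    tags = ["Ncmsn", "Ncfsn"]  # hypothetical example tags
    label_index = build_label_index(tags)
    tag_index = {t: k for k, t in enumerate(sorted(set(tags)))}
    for t in tags:
        print(t, encode_multi_class(t, tag_index), encode_multi_label(t, label_index))
```

In the multi-class encoding the two example tags share nothing, while in the multi-label encoding they share every property except the one in which they differ, so related tags have overlapping targets.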
Language: Slovenian
Keywords: machine learning, morphosyntactic tagging, deep learning, convolutional neural networks, recurrent neural networks, classifier ensembles
Work type: Master's thesis/paper
Organization: FRI - Faculty of Computer and Information Science
Year: 2018
PID: 20.500.12556/RUL-105266
Publication date in RUL: 16.11.2018
Views: 3062
Downloads: 398
Cite this work
BELEJ, Primož, 2018, Oblikoskladenjsko označevanje slovenskega jezika z globokimi nevronskimi mrežami [online]. Master’s thesis. [Accessed 29 March 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=105266
Secondary language
Language: English
Title: Part of speech tagging of slovene language using deep neural networks
Abstract:
This thesis addresses part-of-speech tagging of the Slovene language. Part-of-speech tagging assigns to each natural-language sentence a sequence of tags that encode the parts of speech and morphological properties of its words. Unlike typical solutions, which process input sentences as sequences of words, our solution uses a character-level representation. Our part-of-speech tagger is implemented with convolutional and recurrent neural networks. Whereas common approaches treat the problem as multi-class classification, we propose a multi-label classification approach. To improve our results, we implement an ensemble of three part-of-speech taggers. Comparing our solution with existing ones, we find that the proposed solution achieves the best results.
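The abstract does not state how the outputs of the three taggers are combined. As a minimal sketch, assuming a simple per-token majority vote with a designated fallback tagger when no strict majority exists, the combination step could look like the following; the function name `majority_vote` and the `fallback` parameter are hypothetical.

```python
from collections import Counter


def majority_vote(predictions, fallback=0):
    """Combine per-token tag sequences from several taggers.

    predictions: list of tag sequences, one per tagger, all the same length.
    fallback: index of the tagger trusted when no tag has a strict majority
              (an assumption made for this sketch).
    """
    if len({len(p) for p in predictions}) != 1:
        raise ValueError("all taggers must tag the same number of tokens")
    combined = []
    for token_tags in zip(*predictions):
        tag, count = Counter(token_tags).most_common(1)[0]
        if count <= len(token_tags) // 2:
            # No strict majority: defer to the fallback tagger.
            tag = token_tags[fallback]
        combined.append(tag)
    return combined


if __name__ == "__main__":
    # Hypothetical outputs of three taggers for a three-token sentence.
    tagger_a = ["A", "B", "C"]
    tagger_b = ["A", "X", "D"]
    tagger_c = ["Z", "B", "E"]
    print(majority_vote([tagger_a, tagger_b, tagger_c]))
```

With three taggers, a tag wins whenever at least two agree; the fallback only matters when all three disagree.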
Keywords: machine learning, part-of-speech tagging, deep learning, convolutional neural networks, recurrent neural networks, ensemble classifiers