Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Repository of the University of Ljubljana
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Browse
New in RUL
About RUL
In numbers
Help
Sign in
Details
Izboljšava obravnave negacije v velikih jezikovnih modelih
ID
Kranjec, Matej
(
Author
),
ID
Robnik Šikonja, Marko
(
Mentor
)
More about this mentor...
PDF - Presentation file,
Download
(746,11 KB)
MD5: F60E1D57CFEDCBE322037E24A60AEB0B
Image galllery
Abstract
V diplomski nalogi smo preizkusili metodo za izboljšavo klasifikacije globokih nevronskih mrež s predznanjem o negaciji. Najuspešnejši jezikovni modeli, kot na primer BERT ali ELMo, so uspešni pri klasifikaciji besedil, a odpovejo pri negaciji. Prednaučene jezikovne modele smo prilagodili, da tudi v slovenščini bolje delujejo z negacijo. To smo dosegli z spreminjanjem funkcije izgube nevronske mreže ter prilagajanjem obstoječih modelov. Metodo smo preizkusili na prilagojenem korpusu z dodanimi negacijami osnovnih stavkov. Metoda je uspešno zmanjšala delež napačnih napovedi v negiranih stavkih pri maskiranem jezikovnem modelu, točnost na nalogah iz slovenske zbirke SuperGLUE pa je ponekod izboljšala, drugje pa poslabšala.
Language:
Slovenian
Keywords:
globoke nevronske mreže
,
klasifikacija
,
obravnava negacije
,
veliki vnaprej naučeni jezikovni modeli
Work type:
Bachelor thesis/paper
Typology:
2.11 - Undergraduate Thesis
Organization:
FRI - Faculty of Computer and Information Science
Year:
2022
PID:
20.500.12556/RUL-139701
COBISS.SI-ID:
121785859
Publication date in RUL:
06.09.2022
Views:
1509
Downloads:
1296
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
KRANJEC, Matej, 2022,
Izboljšava obravnave negacije v velikih jezikovnih modelih
[online]. Bachelor’s thesis. [Accessed 17 April 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=139701
Copy citation
Share:
Secondary language
Language:
English
Title:
Improving negation handling in large language models
Abstract:
In the thesis we have tested a method for improved classification of deep neural networks with prior knowledge of negation. State of the art language models, such as ELMo and BERT, are successful at text classification, but fail when there is negation involved. We adjusted pre-trained language models to work better with negation in Slovene. We modified the loss function of the neural networks and retrained the models. We have tested the method on a modified corpus with added negations of original sentences. The method successfully reduced the error in the negated sentences for masked language models, and it increased the accuracy for some tasks from the Slovene version of the SuperGLUE benchmark but decreased for others.
Keywords:
deep neural networks
,
classification
,
negation modeling
,
large pretrained language models
Similar documents
Similar works from RUL:
Comparison of genetic polymorphisms of glutathione S-transferase M1 and glutathione S-transferase T1 in patients with asbestosis and patients with pleural plaques
Influence of body composition and biochemical parameters on pharmacokinetics of ramipril in heart failure patients
Genetic variability of genes involved in regulation of serotonin pathway in patients with schizophrenia
Populacijski farmakokinetični model aktivnega metabolita leflunomida in vrednotenje vpliva genskih polimorfizmov v njegovi presnovi z randomizacijskim testom
Principles of antipsychotics use in patients with schizophrenia, schizotypal and delusional disorder in Psychiatric Hospital Idrija
Similar works from other Slovenian collections:
Coal in Slovenia and world
Underground coal gasification (UCG) - the Velenje coal mine experience
Clean coal technologies at Velenje coal mine
PREMOGOVNIK VELENJE D.D. COAL TRANSPORT CONTROL SYSTEM RENEWAL
Calculation of the excavation chain scraper conveyor in Premogovnik Velenje
Back