Strojno učenje na medicinskih podatkih z interpretabilnimi modeli

Srovin, Jure Vito

Srovin, Jure Vito (Author), Kukar, Matjaž (Mentor)

PDF presentation file (4,32 MB), MD5: 4895F50FD0E20EA62773CD9256902432

Abstract

Machine learning models are used in many domains where errors can have serious consequences for individuals and society. Misclassifications and their causes are often hard to detect, especially with complex models whose behaviour is incomprehensible to humans. The aim of this thesis is to present the importance of being able to interpret machine learning models and to evaluate the performance of simple models that are inherently interpretable. We tested RiskSLIM, an algorithm for building sparse integer linear models that are easy to use, and compared it with more popular machine learning methods. Results were obtained on binary and multiclass medical datasets of various sizes. On binary datasets, RiskSLIM models performed slightly worse than the other methods, but still very well. RiskSLIM offers an excellent trade-off between model interpretability and classification performance. However, it performs poorly on datasets where successful classification requires a large number of attributes, which RiskSLIM cannot accommodate because it is limited to a small number of features. It can also be applied to multiclass datasets with the help of meta-classifiers. Its major weakness is the lengthy model-building process, which grows exponentially with the number of attributes in the dataset. Manual data preparation is also time-consuming, since each feature has to be analysed and discretized individually.

Language:	Slovenian
Keywords:	interpretable models, RiskSLIM, machine learning in medicine
Work type:	Bachelor thesis/paper
Typology:	2.11 - Undergraduate Thesis
Organization:	FRI - Faculty of Computer and Information Science
Year:	2021
PID:	20.500.12556/RUL-125320
COBISS.SI-ID:	54697731
Publication date in RUL:	10.03.2021
Views:	1439
Downloads:	198

Secondary language

Language:	English
Title:	Machine learning in medicine with interpretable models
Abstract:	Machine learning models are used in many domains where wrong decisions can have severe consequences for the individual and society. Misclassifications and their causes are often difficult to detect, especially when using complex models whose decision-making behaviour is unintelligible to humans. The goal of the thesis is to present the importance of interpretability in machine learning models and to evaluate the performance of simple models that are inherently interpretable. We tested RiskSLIM, an algorithm for building sparse integer linear models that are easy to use, and compared it to more popular machine learning methods. Results were obtained on binary and multiclass medical datasets of different sizes. The performance of RiskSLIM models on binary datasets was slightly worse than that of the other methods, but very good nonetheless. RiskSLIM offers an excellent trade-off between model interpretability and classification accuracy. However, it performs poorly on datasets where a large number of features is required for successful classification, because RiskSLIM is limited to a small number of features per model. It can also be used on multiclass datasets by means of meta-classifiers. Its major drawback is the lengthy model-building process, whose duration grows exponentially with the number of features in the dataset. Manual data processing is also time-consuming, as each feature has to be analyzed and discretized individually.
Keywords:	interpretable models, RiskSLIM, machine learning in medicine
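
To illustrate what a sparse integer linear model of the kind the abstract describes looks like in use, the following is a minimal sketch, not the thesis's code and not the RiskSLIM library's API. It assumes a model with a handful of binary (discretized) features, small integer points, and a logistic link that maps the total score to a predicted risk. The feature names, thresholds, point values, and intercept are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical scoring system: integer points for binary (discretized) features.
# Names, thresholds, and point values are illustrative, not taken from the thesis.
POINTS = {"age_ge_60": 2, "systolic_bp_ge_140": 1, "smoker": 1}
INTERCEPT = -3


def discretize(age, systolic_bp, smoker):
    """Turn raw measurements into the binary features the score expects."""
    return {
        "age_ge_60": int(age >= 60),
        "systolic_bp_ge_140": int(systolic_bp >= 140),
        "smoker": int(smoker),
    }


def total_score(features):
    """Add the intercept and the integer points of every feature that is present."""
    return INTERCEPT + sum(POINTS[name] * value for name, value in features.items())


def predicted_risk(score):
    """Map the integer score to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-score))


patient = discretize(age=67, systolic_bp=150, smoker=False)
score = total_score(patient)  # -3 + 2 + 1 + 0
print(score, predicted_risk(score))
```

Because the model uses only a few features with small integer weights, the score can be computed by hand, which is the source of its interpretability. The same property is why each feature has to be discretized beforehand and why such a model struggles when many features are needed.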
