Details

Development of a chatbot for disease diagnosis
Pintar, Žiga (Author), Bajec, Marko (Mentor)

PDF - Presentation file, download (1.56 MB)
MD5: B1E9D183C565AF917C490B4E8E9348DF

Abstract
As the population ages, qualified primary-care healthcare staff are retiring, which in turn overloads the remaining personnel. Our master's thesis was aimed at reducing visits by non-urgent patients to public healthcare institutions. We created the chatbot ZdravBOT, which provides a tentative diagnosis of 24 medical conditions based on the user's clinical picture. Because medical datasets are hard to access, we decided to translate one of the publicly available English datasets into Slovenian. The resulting dataset, which contained about 88 thousand sentences, was used to train our BERT model, which was based on the SloBERT model. We then used the resulting model in a Rasa chatbot, which collected all the necessary information from the user over the course of a conversation and tried to find a tentative diagnosis with a sufficiently high level of confidence. The search for a diagnosis was based on the cosine similarity algorithm between the patient's clinical picture and the known diseases. For the graphical interface, we created an Android mobile application that connected to the aforementioned Rasa chatbot. The results of training the BERT model showed that translated datasets alone may not be sufficient, since the model overfitted the data during training. Regardless of these problems, our BERT model successfully recognised on average one to two symptoms from the user's opening message. During testing we made 87 diagnoses, of which 62 % were correct. We most often erred when diagnosing groups of diseases that shared many related symptoms, which was most common for pulmonary and infectious diseases. We also observed that the cosine similarity algorithm is not the best way of matching medical conditions because, owing to the differing numbers of symptoms, it favours diseases with fewer symptoms.
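The matching step described in the abstract can be illustrated with a minimal sketch. It assumes that the clinical picture and each disease are represented as binary symptom vectors (stored here as sets); the symptom names, disease profiles, the `diagnose` helper, and the 0.6 confidence threshold are all hypothetical and are not taken from the thesis. The sketch also shows the bias mentioned above: two diseases that both contain every reported symptom get different scores, and the one with fewer symptoms wins.

```python
from __future__ import annotations

import math

# Hypothetical disease profiles (illustrative only, not data from the thesis).
DISEASES = {
    "common_cold": {"cough", "runny_nose", "sore_throat"},
    "influenza": {"cough", "fever", "headache", "muscle_pain",
                  "fatigue", "sore_throat"},
    "migraine": {"headache", "nausea"},
}


def cosine_similarity(a: set[str], b: set[str]) -> float:
    """Cosine similarity of two binary symptom vectors given as sets.

    For 0/1 vectors the dot product is the number of shared symptoms and
    each norm is the square root of the number of symptoms in that set.
    """
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def rank_diseases(symptoms: set[str]) -> list[tuple[str, float]]:
    """Score every known disease against the reported symptoms, best first."""
    scores = {name: cosine_similarity(symptoms, profile)
              for name, profile in DISEASES.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def diagnose(symptoms: set[str], threshold: float = 0.6):
    """Return (disease, score), or (None, score) if confidence is too low.

    The threshold value is an assumption made for this sketch.
    """
    best_name, best_score = rank_diseases(symptoms)[0]
    if best_score >= threshold:
        return best_name, best_score
    return None, best_score


if __name__ == "__main__":
    reported = {"cough", "sore_throat"}
    for name, score in rank_diseases(reported):
        print(f"{name}: {score:.3f}")
    print(diagnose(reported))
```

With the reported symptoms `{"cough", "sore_throat"}`, both the hypothetical cold and influenza profiles contain every reported symptom, yet the cold scores 2/√6 while influenza scores only 2/√12, because the longer profile has a larger norm.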

Language: Slovenian
Keywords: chatbot, RASA, BERT, natural language processing
Type of material: Master's thesis
Organization: FRI - Faculty of Computer and Information Science
Year of publication: 2025
PID: 20.500.12556/RUL-177761
Date of publication in RUL: 06.01.2026
Number of views: 54
Number of downloads: 3

Secondary language

Language: English
Title: Development of a chatbot for disease diagnosis
Abstract:
As our society ages, a growing number of healthcare personnel are retiring, which in turn increases the workload of the remaining staff. Our master’s thesis was aimed at reducing non-emergency patient walk-ins in public health facilities. To this end, we created a chatbot named ZdravBOT, which can provide a rough diagnosis for 24 diseases based on the user’s current symptoms. Since medical datasets are not readily available to the public, especially in less widely spoken languages such as Slovene, we decided to translate one of the publicly available English collections into Slovene. The resulting dataset, which contained about 88 thousand sentences, was used to train our BERT model, which was based on the SloBERT model. We then used the created model inside a Rasa client, which gathered all the necessary information from the user, with which we then tried to find an approximate diagnosis with a sufficiently high level of confidence. The diagnosis was made using the cosine similarity algorithm between the user’s symptoms and the 24 known diseases. For the user interface, we created an Android application that connected to the previously mentioned Rasa client. During the BERT training phase we found that the translated dataset alone might not be enough, as the model seemed to overfit the data provided. Regardless of the aforementioned issues, our BERT model was able to identify one to two symptoms on average from the user’s messages. During testing we made 87 diagnoses, of which 62 % were correct. In most cases the incorrect diagnosis was chosen because many of the symptoms overlapped for diseases in the same group. This was most obvious for pulmonary and infectious diseases. We also noticed that the cosine similarity algorithm is not the best option for matching diseases with the user’s symptoms because, given that diseases have different numbers of symptoms, it favours diseases with fewer symptoms.

Keywords: chatbot, RASA, BERT, natural language processing
