Prepoznavanje motivov v pravljicah s pomočjo velikih jezikovnih modelov

Beden, Domen

Repository of the University of Ljubljana

Beden, Domen (Author), Robnik Šikonja, Marko (Mentor)

PDF - Presentation file, Download (3,11 MB)
MD5: B43AEA8AD59B37B35EFD71EFDE24335E

Abstract

This thesis investigates the use of large language models (LLMs) for the automatic recognition of narrative motifs in folktales. We first present the folkloristic theory of motifs, classification systems (ATU, Thompson), and modern digital collections. We cover the fundamental concepts of natural language processing and the architecture of large models, with an emphasis on instruction tuning. In the experimental part, we use the Gemma 7B model, which we train on structured examples of stories and motifs. We test several training strategies (full fine-tuning, LoRA, distillation with Gemini 2.5 Pro) and carry out quantitative and qualitative evaluation of the results. We find that LLMs are capable of effective motif classification, especially when the data are enriched with chains of thought. The work contributes to the development of tools for computational folkloristics and points to possibilities for further research.

Language:	Slovenian
Keywords:	motifs in folktales, large language models, natural language processing
Work type:	Bachelor thesis/paper
Typology:	2.11 - Undergraduate Thesis
Organization:	FRI - Faculty of Computer and Information Science
Year:	2025
PID:	20.500.12556/RUL-171270
COBISS.SI-ID:	247357187
Publication date in RUL:	21.08.2025
Views:	288
Downloads:	84

Secondary language

Language:	English
Title:	Detection of folkloristic motifs with large language models
Abstract:	This thesis explores the use of large language models (LLMs) for the automatic recognition of narrative motifs in folktales. We begin by presenting folkloristic theory on motifs, classification systems (ATU, Thompson), and modern digital corpora. We outline key natural language processing concepts and LLM architectures, with an emphasis on instruction-based learning. In the experimental part, we fine-tune the Gemma 7B model on structured examples linking stories and motifs. We evaluate several learning strategies (full fine-tuning, LoRA, distillation using Gemini 2.5 Pro) and perform both quantitative and qualitative evaluations. Our results show that LLMs can effectively classify motifs, especially when the dataset is enriched with chain-of-thought explanations. This work contributes to the field of computational folkloristics and opens pathways for further research.
Keywords:	motifs in folktales, large language models, natural language processing
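
The abstract mentions training on structured examples that link stories to motifs, optionally enriched with chain-of-thought explanations. The record does not give the actual data schema, so the following is only a minimal sketch of what such a training record might look like. The function `build_example`, the field names `prompt` and `response`, the sample story, and the motif labels are all hypothetical illustrations, not taken from the thesis and not entries from the ATU or Thompson indexes.

```python
import json


def build_example(story, motifs, rationale=None):
    """Format one story as an instruction-style training record.

    Hypothetical schema for illustration only; the thesis may use a
    different layout, prompt wording, and label vocabulary.
    """
    prompt = (
        "Identify the narrative motifs in the following folktale.\n\n"
        f"Story:\n{story.strip()}"
    )
    answer = "Motifs: " + "; ".join(motifs)
    if rationale:
        # Chain-of-thought variant: the reasoning precedes the final labels.
        answer = f"Reasoning: {rationale.strip()}\n{answer}"
    return {"prompt": prompt, "response": answer}


# Invented sample data (assumption, not from the thesis corpus).
story = "A poor girl is kind to an old woman at the well and is rewarded with gold."
motifs = ["kindness rewarded", "helper in disguise"]

plain = build_example(story, motifs)
with_cot = build_example(
    story,
    motifs,
    rationale="The old woman tests the girl, and the gold is a reward for her kindness.",
)

print(json.dumps(with_cot, ensure_ascii=False, indent=2))
```

Under these assumptions, the plain and the enriched record share the same prompt and differ only in whether the response carries a reasoning passage before the labels, which is one simple way to compare training with and without chain-of-thought data.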
