izpis_h1_title_alt

Gručenje novic glede na dogodke
ID CALCINA, ERIK (Author), ID Robnik Šikonja, Marko (Mentor) More about this mentor... This link opens in a new window, ID Erik, Novak (Comentor)

.pdfPDF - Presentation file, Download (594,88 KB)
MD5: CD62355874F89F02FC6B1BEB130787B8

Abstract
V sodobnem svetu se vsak dan prebijamo skozi poplavo novic. Za njihovo lažje iskanje je koristno, če so novice združene glede na pripadajoče dogodke. V diplomski nalogi predstavimo metodologijo za gručenje novic v dogodke. Metodologija kombinira uporabo tekstovnih vložitev, algoritma za gručenje in metod za filtriranje novic. Metodologijo smo preizkusili na naboru podatkov spletnih novic ter naredili statistično in ročno evalvacijo. Rezultati so pokazali, da gruče novice v večini opisujejo enake dogodke. Posledica višje natančnosti je veliko nerazporejenih novic.

Language:Slovenian
Keywords:strojno učenje, gručenje novic, detekcija dogodkov, jezikovni model
Work type:Bachelor thesis/paper
Typology:2.11 - Undergraduate Thesis
Organization:FRI - Faculty of Computer and Information Science
Year:2023
PID:20.500.12556/RUL-149491 This link opens in a new window
COBISS.SI-ID:163762435 This link opens in a new window
Publication date in RUL:07.09.2023
Views:576
Downloads:57
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Secondary language

Language:English
Title:Event-based news clustering
Abstract:
In the modern world, we daily face a flood of news. For easier searching, it is useful if the news are grouped according to related events. In the thesis, we present a methodology for clustering news by events. The methodology combines the use of text embeddings, a clustering algorithm and news filtering methods. We tested the methodology on a dataset of online news and evaluated it statisticaly and manualy. The results indicate that the news clusters primarily depict the same events. However, higher accuracy is accompanied by a substantial amount of non-clustered news.

Keywords:machine learning, news clustering, event detection, language model

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back