Analiza spletnih novic s tehnikami prikaza pojavitev besed in besednih zvez
Vouk, Paula (Author), Zupan, Blaž (Mentor)

PDF - Presentation file (10.35 MB)
MD5: E135F7DB1672893273464245AF30C7A3
PID: 20.500.12556/rul/e527cc1e-ce91-4c9f-8a62-a2dc6f446687

Abstract
Vast amounts of literature are available in the Slovenian language, and simple algorithms can draw from them a great deal about our society and its culture, science, politics and other fields. In this thesis we narrowed the selection to news articles published on the website of the newspaper Dnevnik between 1998 and 2006. Using graphs of the occurrence frequency of selected words and phrases, we aimed to show how certain important global and Slovenian events influenced reporting in the Slovenian media. We found that increases in word occurrence frequency coincide chronologically with the corresponding phenomena. We also examined co-occurrences of some well-known names with concepts, and in this way placed the names in a thematic context. On numerous examples we tested how suitable sieve and Circos diagrams are for presenting results of this kind. The resulting connections between words are meaningful and to some extent expected, yet the diagrams still show and highlight interesting and surprising relationships.

Language:Slovenian
Keywords:Circos, sieve diagram, n-gram, word co-occurrence, word frequency
Work type:Bachelor thesis/paper
Organization:FRI - Faculty of Computer and Information Science
Year:2016
PID:20.500.12556/RUL-84434
Publication date in RUL:23.08.2016

Secondary language

Language:English
Title:Online news analysis with the techniques of word occurrence visualization
Abstract:
There is an enormous amount of published material in the Slovenian language waiting to be analysed. With simple algorithms we can reveal interesting facts about our society and its culture, science, politics and many other aspects. In this thesis we focused on online articles published by the newspaper Dnevnik between 1998 and 2006. By evaluating word-usage frequency graphs we wanted to investigate the influence of some important phenomena on the Slovenian press. We found that higher usage frequencies of specific words match the associated phenomena chronologically. We also studied how the names of well-known people co-occur with words that pertain to a specific topic. Using several examples, we examined how well suited Sieve and Circos diagrams are to visualising these types of results. The word connections presented with the selected visualization tools are meaningful and to some extent expected, yet the diagrams also bring forward some interesting and unexpected relations.

Keywords:Circos, Sieve diagram, n-gram, word co-occurrences, word frequency
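
The two measurements named in the abstract, n-gram frequency over time and name/term co-occurrence, can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the function names, the toy `articles` list, the per-token normalisation and the article-level definition of co-occurrence are all assumptions made for illustration. The sketch also skips lemmatisation, which real Slovenian text would likely need because of its inflection.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens (Unicode-aware)."""
    return re.findall(r"\w+", text.lower())


def ngrams(tokens, n):
    """Return all consecutive n-token phrases as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def yearly_relative_frequency(articles, phrase):
    """Occurrences of `phrase` per token, grouped by publication year.

    `articles` is an iterable of (year, text) pairs (hypothetical format).
    """
    target = phrase.lower()
    n = len(target.split())
    hits = Counter()
    totals = Counter()
    for year, text in articles:
        tokens = tokenize(text)
        totals[year] += len(tokens)
        hits[year] += ngrams(tokens, n).count(target)
    return {year: hits[year] / totals[year] for year in sorted(totals) if totals[year]}


def cooccurrence_counts(articles, names, terms):
    """Count the articles in which a name and a term both appear.

    Assumption: co-occurrence is measured at article level, one count per article.
    """
    counts = defaultdict(int)
    for _, text in articles:
        present = set(tokenize(text))
        for name in names:
            if name in present:
                for term in terms:
                    if term in present:
                        counts[(name, term)] += 1
    return dict(counts)


# Toy data, invented for illustration only (not taken from the Dnevnik corpus).
articles = [
    (2003, "the euro debate continues"),
    (2004, "slovenia joins the european union"),
    (2004, "the european union welcomes slovenia and the euro"),
]

frequencies = yearly_relative_frequency(articles, "european union")
pairs = cooccurrence_counts(articles, ["slovenia"], ["euro", "union"])
```

Plotting `frequencies` by year gives a frequency graph of the kind the abstract describes, and a table of `pairs` counts is the sort of input a sieve or Circos diagram would visualise.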
