Vizualizacija imen novorojenčkov v povezavi s podatki iz baze IMDb in Wikipedije

KOBENTAR, ALEKS

Repository of the University of Ljubljana

Details

Vizualizacija imen novorojenčkov v povezavi s podatki iz baze IMDb in Wikipedije
ID KOBENTAR, ALEKS (Author), ID Kavčič, Alenka (Mentor) More about this mentor... This link opens in a new window

, ID Pesek, Matevž (Comentor)

PDF - Presentation file, Download (1,56 MB)
MD5: 0CCD2FF36F8085186848DC4D8F7A648C

Abstract

Diplomska naloga opisuje postopek pridobivanja in prikaza podatkov iz različnih virov. Vir podatkov predstavljajo tri podatkovne zbirke. S Statističnega urada Republike Slovenije so podatki o številu imen novorojenčkov od leta 1992 do 2017. IMDb-podatkovna zbirka nudi podatke o igralcih in filmih. Zadnji vir podatkov je spletna enciklopedija Wikipedija, ki nudi zanimive podatke o podrobnostih imen. Združitev vseh podatkov zahteva uporabo različnih orodij. Za uvoz podatkov je uporabljen programski jezik Python. Za hranjenje podatkov o številu imen novorojenčkov je uporabljena nerelacijska podatkovna zbirka Elasticsearch. Za potrebe vizualizacije podatkov, ki so shranjeni na spletnih straneh ali v lokalni podatkovni zbirki, so implementirani strežniki s programskima jezikoma Python in Node.js. Poleg osnovnih spletnih orodij sta JavaScript in knjižnica D3.js uporabljena kot glavno orodje za prikaz podatkov.

Language:	Slovenian
Keywords:	Spletni strežnik, podatkovna zbirka, vir podatkov, vizualizacija, imena otrok.
Work type:	Bachelor thesis/paper
Organization:	FRI - Faculty of Computer and Information Science
Year:	2019
PID:	20.500.12556/RUL-107347
Publication date in RUL:	02.04.2019
Views:	1623
Downloads:	361
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Visualization of baby names combined with IMDb and Wikipedia data
This Bachelor’s Thesis describes the process of collecting and visualizing data from different sources. There are three different data sources. The first source is from the Statistical office of Slovenia where there is data about the number of baby names occurring from 1992 to 2017. The second sourse is the IMDb database, which has data about actors and movies. The third data source is the free Wikipedia encyclopedia , which holds interesting data about names. To be able to merge all the datasources requires a great range of frameworks. For importing the data, the programming language Python is used. For data storage about the number of babynames, the unrelation database Elasticsearch is used. For the exchange of data which is stored on the inter- net or on local machine servers, either Python or Node.js. are implemented. In addition, the basic web technologies JavaScript and D3.js are the main tools for data visualization.
Keywords:	Web server, database, source of data, visualization, baby names.

Similar works from RUL:
Similar works from other Slovenian collections:

Details

Secondary language

Similar documents