Comparison of database performance when dealing with time series

LUKIĆ, FILIP

LUKIĆ, FILIP (Author), Zrnec, Aljaž (Mentor), Poženel, Marko (Co-mentor)

PDF - Presentation file, Download (1,23 MB)
MD5: 7D65AF13A1F0692B5B180B962B9BC12E

Abstract

We live in a world where everyday activities are increasingly digitalized. A large number of devices now connect to the network and generate large quantities of data. These devices include not only laptops and mobile phones but also the increasingly widespread sensors in cars, smart homes, and industrial and energy systems, which send streams of data into the network. Traditional databases cannot cope with the challenges posed by such volumes of data. This led to the emergence of time series databases, which timestamp sensor data and physically group it by time, enabling more efficient data processing. Many different time series databases, as well as databases that support working with time series, are currently on the market. This makes choosing the most suitable database for a given solution a very demanding challenge for developers. To choose a database, software developers rely on running or reviewing benchmarks. A benchmark is essentially a program that evaluates software in a defined and controlled environment using a set of specified measurements. For databases, it is a program that prepares data, executes a set of specified queries against the database, and measures performance criteria while doing so. Because time series databases are relatively new, few independent performance comparisons exist to draw on, and the existing ones cover a limited set of tools and scenarios. The goal of this thesis is to present a benchmark that, through a use case study, helps answer the question of which time series database is most suitable for a given scenario.

Language:	Slovenian
Keywords:	time series, time series database, Timescale, InfluxDB, Postgres, benchmark
Work type:	Bachelor thesis/paper
Typology:	2.11 - Undergraduate Thesis
Organization:	FRI - Faculty of Computer and Information Science
Year:	2021
PID:	20.500.12556/RUL-130505
COBISS.SI-ID:	78603523
Publication date in RUL:	15.09.2021

Secondary language

Language:	English
Title:	Comparison of database performance when dealing with time series
Abstract:
Today we live in a world where our everyday activities are becoming more and more digitalised. We are witnessing an age in which a large number of devices connect to the web and produce large quantities of data. These devices include not only laptops and mobile phones but also increasingly common sensors in cars, smart homes, industry and power plants, which send streams of data to the web. Traditional databases cannot cope with the challenges posed by such large amounts of data. Thus time series databases (TSDBs) emerged. Their main characteristic is that they add a timestamp to sensor data and use it as the basis for grouping, which allows them to handle data more efficiently. Currently, there are many different time series databases, as well as other databases that support working with time series. This leaves developers with the difficult task of choosing the right database for a given solution. To make this choice, developers rely on benchmarks. A benchmark is essentially a program that evaluates the performance of some software in a defined and controlled environment using a set of tests. In the case of databases, it is a program that prepares data, executes a set of given queries on a database, and measures performance against the chosen criteria. The problem is that only a few existing benchmarks can be used as a reference, as TSDBs are a relatively new concept, and these are usually limited to a few databases and scenarios. The goal of this thesis is to present a benchmark that will, through a use case study, help answer the question of which time series database is most suitable for a given scenario.
Keywords:	timeseries, timeseries database, Timescale, InfluxDB, Postgres, benchmark
