izpis_h1_title_alt

Napovedovanje porabe pomnilniških kapacitet pri rezervnem kopiranju
ID KONCILJA, BLAŽ (Author), ID Robnik Šikonja, Marko (Mentor) More about this mentor... This link opens in a new window

.pdfPDF - Presentation file, Download (491,54 KB)
MD5: E01B7612DA080BD607E5A0FD312D4A82
PID: 20.500.12556/rul/347b7733-1f8d-44bf-bad2-1276f98046d4

Abstract
Potreba po rezervnem kopiranju oziroma arhiviranju podatkov v svetu narašča. Podjetja potrebujejo za normalno delovanje hraniti vedno več informacij. Hranjenje teh podatkov lahko predstavlja velik strošek, zato hočemo pomnilniško kapaciteto držati na ravni, ki zadovolji naše potrebe in hkrati ni predimenzionirana. S pomočjo podatkovnega rudarjenja želimo napovedati trende porabe pomnilniških kapacitet. Najprej smo pridobili podatke iz dveh različnih okolij za arhiviranje in jih shranili v podatkovno bazo. To nam je omogočilo hitro združevanje in upravljanje s podatki. Podatke smo analizirali z metodami linearne regresije, linearne regresije po kosih in k-najbližjih sosedov. Za najbolj zanesljivo metodo za napovedovanje trendov se je izkazala linearna regresija po kosih. Čeprav so rezultati dovolj dobri za uvedbo metode v produkcijo, moramo biti previdni, saj sta se analizirani okolji izkazali za zelo različni, kar neposredno vpliva na zanesljivost napovedi.

Language:Slovenian
Keywords:podatkovno rudarjenje, postopek CRISP-DM, linearna regresija po kosih, priprava podatkov
Work type:Undergraduate thesis
Organization:FRI - Faculty of Computer and Information Science
Year:2016
PID:20.500.12556/RUL-84965 This link opens in a new window
Publication date in RUL:08.09.2016
Views:1678
Downloads:377
Metadata:XML RDF-CHPDL DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Secondary language

Language:English
Title:Forecasting backup storage consumption
Abstract:
Storage needs for archiving data are increasing. Companies need to store more and more data to function normally. Storing this data can be costly, that is why we want to provide sufficient storage capacity to meet the demands and not exceed them which brings additional costs. With the help of data mining we are trying to forecast trends in storage consumption. We acquired data from two environments for archiving and saved them to a database. We analysed data consumption trends with linear regression, piecewise linear regression and k-nearest neighbours. Piecewise linear regression proved to be the most accurate and reliable. Even though results are good enough to be implemented into production, we should be cautious as the two environments have different characteristics and this influences the forecasting.

Keywords:data mining, procedure CRISP-DM, piecewise linear regression, data preparation

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back