Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Browse
New in RUL
About RUL
In numbers
Help
Sign in
Analysis of machine learning algorithms for anomaly detection on edge devices
ID
Huč, Aleks
(
Author
),
ID
Šalej, Jakob
(
Author
),
ID
Trebar, Mira
(
Author
)
PDF - Presentation file,
Download
(10,89 MB)
MD5: A50D89A9038F2FC98157A8E8AF36BD98
URL - Source URL, Visit
https://www.mdpi.com/1424-8220/21/14/4946
Image galllery
Abstract
The Internet of Things (IoT) consists of small devices or a network of sensors, which permanently generate huge amounts of data. Usually, they have limited resources, either computing power or memory, which means that raw data are transferred to central systems or the cloud for analysis. Lately, the idea of moving intelligence to the IoT is becoming feasible, with machine learning (ML) moved to edge devices. The aim of this study is to provide an experimental analysis of processing a large imbalanced dataset (DS2OS), split into a training dataset (80%) and a test dataset (20%). The training dataset was reduced by randomly selecting a smaller number of samples to create new datasets Di (i = 1, 2, 5, 10, 15, 20, 40, 60, 80%). Afterwards, they were used with several machine learning algorithms to identify the size at which the performance metrics show saturation and classification results stop improving with an F1 score equal to 0.95 or higher, which happened at 20% of the training dataset. Further on, two solutions for the reduction of the number of samples to provide a balanced dataset are given. In the first, datasets DRi consist of all anomalous samples in seven classes and a reduced majority class (‘NL’) with i = 0.1, 0.2, 0.5, 1, 2, 5, 10, 15, 20 percent of randomly selected samples. In the second, datasets DCi are generated from the representative samples determined with clustering from the training dataset. All three dataset reduction methods showed comparable performance results. Further evaluation of training times and memory usage on Raspberry Pi 4 shows a possibility to run ML algorithms with limited sized datasets on edge devices.
Language:
English
Keywords:
machine learning
,
classification
,
edge computing
,
imbalanced dataset
,
training dataset
,
anomaly detection
,
clustering
Work type:
Article
Typology:
1.01 - Original Scientific Article
Organization:
FRI - Faculty of Computer and Information Science
Publication status:
Published
Publication version:
Version of Record
Year:
2021
Number of pages:
22 str.
Numbering:
Vol. 21, iss. 14, art. 4946
PID:
20.500.12556/RUL-135739
UDC:
004.85
ISSN on article:
1424-8220
DOI:
10.3390/s21144946
COBISS.SI-ID:
71000323
Publication date in RUL:
30.03.2022
Views:
760
Downloads:
175
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
Copy citation
Share:
Record is a part of a journal
Title:
Sensors
Shortened title:
Sensors
Publisher:
MDPI
ISSN:
1424-8220
COBISS.SI-ID:
10176278
Licences
License:
CC BY 4.0, Creative Commons Attribution 4.0 International
Link:
http://creativecommons.org/licenses/by/4.0/
Description:
This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:
20.07.2021
Secondary language
Language:
Slovenian
Keywords:
strojno učenje
,
klasifikacija
,
robno računanje
,
neuravnotežena podatkovna zbirka
,
učna zbirka
,
zaznavanje anomalij
,
gručenje
Projects
Funder:
ARRS - Slovenian Research Agency
Project number:
P2-0359
Name:
Vseprisotno računalništvo
Similar documents
Similar works from RUL:
Similar works from other Slovenian collections:
Back