izpis_h1_title_alt

Self-adaptive approximate mobile deep learning
ID Knez, Timotej (Author), ID Machidon, Octavian-Mihai (Author), ID Pejović, Veljko (Author)

.pdfPDF - Presentation file, Download (1,50 MB)
MD5: 63EC2A9E4F57BE3F9C37A4E9DCF9CAD8
URLURL - Source URL, Visit https://www.mdpi.com/2079-9292/10/23/2958 This link opens in a new window

Abstract
Edge intelligence is currently facing several important challenges hindering its performance, with the major drawback being meeting the high resource requirements of deep learning by the resource-constrained edge computing devices. The most recent adaptive neural network compression techniques demonstrated, in theory, the potential to facilitate the flexible deployment of deep learning models in real-world applications. However, their actual suitability and performance in ubiquitous or edge computing applications has not, to this date, been evaluated. In this context, our work aims to bridge the gap between the theoretical resource savings promised by such approaches and the requirements of a real-world mobile application by introducing algorithms that dynamically guide the compression rate of a neural network according to the continuously changing context in which the mobile computation is taking place. Through an in-depth trace-based investigation, we confirm the feasibility of our adaptation algorithms in offering a scalable trade-off between the inference accuracy and resource usage. We then implement our approach on real-world edge devices and, through a human activity recognition application, confirm that it offers efficient neural network compression adaptation in highly dynamic environments. The results of our experiment with 21 participants show that, compared to using static network compression, our approach uses 2.18× less energy with only a 1.5% drop in the average accuracy of the classification.

Language:English
Keywords:mobile sensing, neural networks, dynamic optimization, quantization, DNN slimming
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FRI - Faculty of Computer and Information Science
Publication status:Published
Publication version:Version of Record
Year:2021
Number of pages:23 str.
Numbering:Vol. 10, iss. 23, art. 2958
PID:20.500.12556/RUL-136700 This link opens in a new window
UDC:004
ISSN on article:2079-9292
DOI:10.3390/electronics10232958 This link opens in a new window
COBISS.SI-ID:89465859 This link opens in a new window
Publication date in RUL:16.05.2022
Views:939
Downloads:150
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a journal

Title:Electronics
Shortened title:Electronics
Publisher:MDPI
ISSN:2079-9292
COBISS.SI-ID:523068953 This link opens in a new window

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:01.12.2021

Secondary language

Language:Slovenian
Keywords:mobilno zaznavanje, nevronske mreže, dinamična optimizacija, kvantizacija, tanjšanje globokih mrež

Projects

Funder:ARRS - Slovenian Research Agency
Project number:N2-0136
Name:Povečanje učinkovitosti uporabe virov na pametnih telefonih s pomočjo približnega računanja

Funder:ARRS - Slovenian Research Agency
Project number:J2-3047
Name:Kontekstno odvisno približno računanje na mobilnih napravah

Funder:ARRS - Slovenian Research Agency
Project number:P2-0098
Name:Računalniške strukture in sistemi

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back