Offline Compression of Convolutional Neural Networks on Edge Devices

Abstract

Edge devices and artificial intelligence are important and rapidly growing fields in technology, yet they are rarely combined: the neural networks used in AI keep growing in size and complexity, while edge devices lack the resources to keep up with these developments. Model compression can overcome these memory constraints and allow edge devices to run such networks. This paper proposes using both singular value decomposition and canonical polyadic decomposition to reduce the size of convolutional neural networks at the cost of some accuracy. The resulting compression pipeline can run on an edge device itself and is configurable, so the trade-off between file size and accuracy can be adjusted. This makes it possible to run convolutional neural networks natively on edge devices.
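To illustrate the general idea (not the paper's actual pipeline), the sketch below shows rank-truncated singular value decomposition applied to a single layer's weight matrix, using only NumPy. The function name, layer dimensions, and chosen rank are illustrative assumptions; in the paper's setting, canonical polyadic decomposition would additionally be used for convolutional kernels, and the retained rank is the knob that trades file size against accuracy.

```python
# Minimal SVD-compression sketch for one layer's weight matrix (assumptions only).
import numpy as np

def compress_layer_svd(W: np.ndarray, rank: int):
    """Factor W (m x n) into U_r (m x r) and V_r (r x n) via truncated SVD."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    U_r = U[:, :rank] * s[:rank]   # absorb singular values into the left factor
    V_r = Vt[:rank, :]
    return U_r, V_r                # store the two small factors instead of W

# Hypothetical example: a 512x1024 dense layer truncated to rank 64.
W = np.random.randn(512, 1024).astype(np.float32)
U_r, V_r = compress_layer_svd(W, rank=64)
original_params = W.size                      # 524,288 parameters
compressed_params = U_r.size + V_r.size       # 98,304 parameters (~5.3x smaller)
relative_error = np.linalg.norm(W - U_r @ V_r) / np.linalg.norm(W)
print(original_params, compressed_params, relative_error)
```

Lowering the rank shrinks the stored factors further but increases the reconstruction error, which is the size-versus-accuracy trade-off the abstract describes.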
