Efficiency in Deep Learning

Image and Video Deep Model Efficiency

Doctoral thesis (2024)

Authors

X. Liu Pattern Recognition and Bioinformatics -

Research Group

Pattern Recognition and Bioinformatics () (TU Delft)

Efficiency Deep Learning Computer Vision

To reference this document use:

http://resolver.tudelft.nl/uuid:eb63fb5e-4ebe-4f50-963d-d56b03a45e25

More Info

expand_more

Published Date

2024

Language

English

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Pattern Recognition and Bioinformatics

Abstract

Deep learning is the core algorithmic tool for automatically processing large amounts of data. Deep learning models are defined as a stack of functions (called layers) with millions of parameters, that are updated during training by fitting them to data. Deep learning models have show remarkable accuracy gains on visual problems in video and images. Yet at the same time, this comes at a considerable computational cost that raises concerns about energy consumption. The escalation in the number of parameters and the surging demand for extensive data exacerbate these concerns. This thesis delves into the core of these concerns, proposing innovative techniques to enhance the efficiency of deep learning models. This thesis starts with exploring efficient deep learning models for video data, followed by efficient models for image data.....

Files

Xin_dissertation_15944.pdf

(pdf | 10.8 Mb)