A SYSTEMATIC STUDY OF SPARSE DEEP LEARNING WITH DIFFERENT PENALTIES

Deep learning has been the driving force behind many successful data science achievements. However, the deep neural network (DNN) that forms the basis of deep learning is
often over-parameterized, leading to training, prediction, and interpretation challenges. To
address this issue, it is common practice to apply an appropriate penalty to each connection
weight, limiting its magnitude. This approach is equivalent to imposing a prior distribution
on each connection weight from a Bayesian perspective. This project offers a systematic investigation into the selection of the penalty function or prior distribution. Specifically, under
the general theoretical framework of posterior consistency, we prove that consistent sparse
deep learning can be achieved with a variety of penalty functions or prior distributions.
Examples include amenable regularization penalties (such as MCP and SCAD), spike-and-slab priors (such as mixture Gaussian distribution and mixture Laplace distribution), and
polynomial decayed priors (such as the Student-t distribution). Our theory is supported by
numerical results.
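
To make the penalty families named in the abstract concrete, the following is a minimal sketch of the standard textbook forms of the MCP and SCAD penalties for a single connection weight, together with the negative log density of a two-component mixture Gaussian prior, which plays the role of a penalty under the Bayesian view. The function names, the default values of `lam`, `gamma`, `a`, `slab_weight`, `sigma_spike`, and `sigma_slab`, and the demo weights are illustrative assumptions, not the settings used in the thesis.

```python
import math


def mcp_penalty(w, lam=1.0, gamma=3.0):
    """Minimax concave penalty (MCP) for one weight; assumes gamma > 1."""
    t = abs(w)
    if t <= gamma * lam:
        return lam * t - t * t / (2.0 * gamma)
    # Beyond gamma * lam the penalty is constant, so large weights are not shrunk further.
    return 0.5 * gamma * lam * lam


def scad_penalty(w, lam=1.0, a=3.7):
    """Smoothly clipped absolute deviation (SCAD) penalty; assumes a > 2."""
    t = abs(w)
    if t <= lam:
        return lam * t  # behaves like the L1 penalty near zero
    if t <= a * lam:
        return (2.0 * a * lam * t - t * t - lam * lam) / (2.0 * (a - 1.0))
    return 0.5 * (a + 1.0) * lam * lam  # constant for large weights


def gaussian_pdf(w, sigma):
    """Density of N(0, sigma^2) evaluated at w."""
    return math.exp(-0.5 * (w / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def mixture_gaussian_neg_log_prior(w, slab_weight=0.01, sigma_spike=0.01, sigma_slab=1.0):
    """Negative log density of a spike-and-slab style mixture Gaussian prior.

    Hypothetical hyperparameters: a narrow "spike" component concentrates mass
    near zero, and a wide "slab" component allows large weights.
    """
    density = (slab_weight * gaussian_pdf(w, sigma_slab)
               + (1.0 - slab_weight) * gaussian_pdf(w, sigma_spike))
    return -math.log(density)


if __name__ == "__main__":
    # Illustrative weights only; not data from the thesis.
    for w in (0.0, 0.5, 2.0, 10.0):
        print(w, mcp_penalty(w), scad_penalty(w), mixture_gaussian_neg_log_prior(w))
```

Both MCP and SCAD grow like the L1 penalty near zero and flatten out for large weights, so small weights are pushed toward zero while large weights incur a bounded penalty. The mixture Gaussian prior has a similar qualitative effect through its narrow spike and wide slab components.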

10.25394/pgs.22693573.v1

Deep learning

Statistics not elsewhere classified

Network Compression

Sparse Deep Learning

Nonlinear feature selection

Posterior Consistency

Identifier	oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/22693573
Date	25 April 2023
Creators	Xinlin Tao (13143465)
Source Sets	Purdue University
Detected Language	English
Type	Text, Thesis
Rights	CC BY 4.0
Relation	https://figshare.com/articles/thesis/A_SYSTEMATIC_STUDY_OF_SPARSE_DEEP_LEARNING_WITH_DIFFERENT_PENALTIES/22693573
