Guided Layer-wise Learning for Deep Models using Side Information

Sulimov, Pavel; Sukmanova, Elena; Chereshnev, Roman; Kertesz-Farkas, Attila

arXiv:1911.02048 (cs)

[Submitted on 5 Nov 2019]


Abstract: Training deep models for classification is hindered by local minima and vanishing gradients, while unsupervised layer-wise pretraining does not exploit information from class labels. Here, we propose a new regularization technique, called diversifying regularization (DR), which penalizes hidden units at any layer if they produce similar features for different types of data. For generative models, DR is defined as a divergence over the variational posterior distributions and is included in the maximum likelihood estimation as a prior. DR thus brings class label information into the greedy pretraining of deep belief networks, which results in a better weight initialization for fine-tuning methods. For discriminative training of deep neural networks, on the other hand, DR is defined as a distance over the features and is included in the learning objective. Our experiments show that DR helps backpropagation cope with vanishing gradients, converge faster, and reach smaller generalization errors.
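To make the discriminative case concrete, the following is a minimal sketch of how a distance-based penalty over hidden features could be added to a learning objective. The abstract does not give the exact distance or weighting used in the paper, so the hinge on squared Euclidean distance and the names `diversifying_penalty`, `regularized_loss`, `margin`, and `strength` are all assumptions made for illustration, not the authors' formulation.

```python
import numpy as np


def diversifying_penalty(features, labels, margin=1.0):
    """Hypothetical DR-style penalty (an assumption, not the paper's formula).

    Penalizes pairs of samples from different classes whose hidden
    features lie closer than `margin` in squared Euclidean distance.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)

    # Pairwise squared Euclidean distances, shape (n, n).
    diff = features[:, None, :] - features[None, :, :]
    sq_dist = np.sum(diff ** 2, axis=-1)

    # Boolean mask selecting pairs whose labels differ.
    different = labels[:, None] != labels[None, :]
    if not different.any():
        return 0.0

    # Hinge: zero once two different-class features are far enough apart.
    hinge = np.maximum(0.0, margin - sq_dist)
    return float(hinge[different].mean())


def regularized_loss(task_loss, features, labels, strength=0.1, margin=1.0):
    """Adds the assumed penalty to a task loss, weighted by `strength`."""
    return task_loss + strength * diversifying_penalty(features, labels, margin)
```

In this sketch the penalty is largest when samples of different classes map to identical hidden features and vanishes once they are separated by at least `margin`. In a layer-wise setting, one such term could be computed per hidden layer, although how the paper combines layers is not stated in the abstract.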

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1911.02048 [cs.LG]
(or arXiv:1911.02048v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.1911.02048

Submission history

From: Pavel Sulimov
[v1] Tue, 5 Nov 2019 19:27:16 UTC (891 KB)

