Statistical mechanics of unsupervised feature learning in a restricted Boltzmann machine with binary synapses

Huang, Haiping

doi:10.1088/1742-5468/aa6ddc


arXiv:1612.01717 (cs)

[Submitted on 6 Dec 2016 (v1), last revised 7 Mar 2017 (this version, v2)]


Abstract: Revealing hidden features in unlabeled data is called unsupervised feature learning, which plays an important role in pretraining a deep neural network. Here we provide a statistical mechanics analysis of unsupervised learning in a restricted Boltzmann machine with binary synapses. A message passing equation to infer the hidden feature is derived, and variants of this equation are analyzed. A statistical analysis by replica theory describes the thermodynamic properties of the model. Our analysis confirms an entropy crisis preceding the non-convergence of the message passing equation, suggesting a discontinuous phase transition as a key characteristic of the restricted Boltzmann machine. A continuous phase transition is also confirmed, depending on the embedded feature strength in the data. The mean-field result under the replica symmetric assumption agrees with that obtained by running message passing algorithms on single instances of finite sizes. Interestingly, in an approximate Hopfield model, the entropy crisis is absent, and a continuous phase transition is observed instead. We also develop an iterative equation to infer the hyper-parameter (temperature) hidden in the data, which in physics corresponds to iteratively imposing the Nishimori condition. Our study provides insights towards understanding the thermodynamic properties of restricted Boltzmann machine learning, and moreover an important theoretical basis for building simplified deep networks.
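
The abstract does not state the model equations, so the following is only a toy sketch of what inferring a hidden feature with binary synapses can look like. It assumes a restricted Boltzmann machine with a single ±1 hidden unit, ±1 weights `xi`, couplings scaled by 1/sqrt(N), and a uniform prior over weights. Summing out the hidden unit then gives a posterior over `xi` proportional to the product over samples of cosh(beta * xi·sigma / sqrt(N)). It is not the message passing equation of the paper: the posterior is computed by brute-force enumeration, which is feasible only for very small N. The function names and the sample data are hypothetical.

```python
import itertools
import math


def log_posterior_unnormalized(xi, data, beta):
    """Sum over samples of log cosh(beta * overlap / sqrt(N)).

    Assumption: one +/-1 hidden unit has been summed out, which yields
    the cosh factor; the normalization over visible configurations does
    not depend on xi when every weight is +/-1, so it is dropped.
    """
    n = len(xi)
    total = 0.0
    for sigma in data:
        overlap = sum(w * s for w, s in zip(xi, sigma))
        total += math.log(math.cosh(beta * overlap / math.sqrt(n)))
    return total


def exact_posterior(data, beta):
    """Enumerate all 2^N binary weight vectors and normalize (small N only)."""
    n = len(data[0])
    candidates = list(itertools.product((-1, 1), repeat=n))
    log_w = [log_posterior_unnormalized(xi, data, beta) for xi in candidates]
    shift = max(log_w)  # subtract the maximum for numerical stability
    weights = [math.exp(v - shift) for v in log_w]
    z = sum(weights)
    return {xi: w / z for xi, w in zip(candidates, weights)}


# Hypothetical toy data: three samples of N = 4 binary (+/-1) visible units.
data = [(1, 1, -1, 1), (1, 1, -1, -1), (-1, -1, 1, -1)]
posterior = exact_posterior(data, beta=1.0)
best = max(posterior, key=posterior.get)
```

Because cosh is an even function, this toy posterior assigns equal probability to `xi` and `-xi`, so a feature can be recovered only up to a global sign. Enumeration costs 2^N evaluations, which is why approximate schemes such as message passing are needed at realistic sizes.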

Comments:	24 pages, 9 figures, results added
Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:1612.01717 [cs.LG]
	(or arXiv:1612.01717v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1612.01717
Journal reference:	J. Stat. Mech. (2017) 053302
Related DOI:	https://doi.org/10.1088/1742-5468/aa6ddc

Submission history

From: Haiping Huang
[v1] Tue, 6 Dec 2016 09:17:14 UTC (124 KB)
[v2] Tue, 7 Mar 2017 04:47:07 UTC (128 KB)
