Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines

Asakawa, Shin-ichi

doi:10.1186/1471-2202-15-S1-P223

Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines

Poster presentation
Open access
Published: 21 July 2014

Volume 15, article number P223, (2014)
Cite this article

Download PDF

You have full access to this open access article

BMC Neuroscience Aims and scope Submit manuscript

Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines

Download PDF

Shin-ichi Asakawa¹

1382 Accesses
Explore all metrics

This study was intended to describe multilayer perceptrons (MLP), Hopfield’s associative memories (HAM), and restricted Boltzmann machines (RBM) from a unified point of view. Despite of mutual relation between three models, for example, RBMs have been utilizing to construct deeper architectures than shallower MLPs. The energy function in HAM is analogous to the Ising model in statistical mechanics, and it connects microscopic physics to thermodynamics. The canonical partition function Z in the Boltzmann distribution is also utilized RBMs. Asynchronous updating and contrastive divergence (CD) based upon Gibbs sampling is also related. Therefore, it seems to be worth considering these three models within a common framework. This attempt might lead to “one algorithm hypothesis.”, which insists that our brains might rule a single but universal rule.

An algorithm, which someone could find out in a region, may be applicable to other regions. Multilayer perceptrons (henceforth, MLP) are feed forward models for pattern recognition and classification. Hopfield proposed another kind of neural network models for associative memory and optimization (HAM). Hiton adopted the restricted Boltzmann machines (RBM) in “Deep Learning” in order to construct deeper layered neural networks. The energy employed in RBMs are elicited the generalized EM algorithm, which was closely related to the energy employed by HAM. In spite of other various differences, see Table 1, it is worth considering to compare among them. At least, an attempt is worth attempting to explain all of them in a unified terminology.

Table 1 Summary of MLP, HAM, and RBM

Full size table

HAM and RBM have symmetrically weighted connections, w_ij = w_ji, although generalized Boltzmann machines can not satisfy this constraints. Similarly, there are no feedback connections in MLP in general. When we denote a connection weight from j-th unit to i-th unit as w_ij, w_ij ∈ R, w_ji = 0 in MLP. When we consider a merged weight matrix W, all the models can be considered as identical.

The construction methods adopted by Deep Learning are based upon RBMs. One of key concepts to success for constructing multilayer deep architecture is the non–linearity, because units in hidden layer in RBMs are binary. The non–linearity seems to play an important role to construct deep architecture. When we suppose to abandon CD and binary feature, multilayer architecture might replace one weight matrix W = W₁W₂… Wp. Also, we can consider a thought experiment with only one hidden unit in RBM. If h = 0, then there are no meanings at all. If h = 1, then it must be an identity mapping, or at least, it might be extract the eigenvector vector corresponded to the maximum eigenvalue value in data matrix X. This might be equivalent to the algorithm proposed by Oja (1985). Since Deep Learning architecture network models trained via RBMs have no within layer connections, we might not be able to reject a possibility that a hidden unit hi might be trained to detect exactly the same features as another hidden unit hj. In order to avoid these situations, we must prepare a large number of binary hidden units more than the entropy being involved in input data set. RBM has no assumptions about within layer connections, it might success to detect important features among data matrix via CD. However, this constraint might weaken slightly, when we would introduce the EM algorithm to be estimate the states of latent variables, and an online algorithm of HAM. This might bring us to an idea “semi restricted Boltzmann machines.”

Author information

Authors and Affiliations

Center for Information Sciences, Tokyo Woman’s Christian University, 2-6-1 Zempukuji, Suginami-ku, Tokyo, 167-8585, Japan
Shin-ichi Asakawa

Authors

Shin-ichi Asakawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shin-ichi Asakawa.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Asakawa, Si. Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines. BMC Neurosci 15 (Suppl 1), P223 (2014). https://doi.org/10.1186/1471-2202-15-S1-P223

Download citation

Published: 21 July 2014
DOI: https://doi.org/10.1186/1471-2202-15-S1-P223

Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multilayer perceptrons, Hopfield’s associative memories, and restricted Boltzmann machines

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation