A Beginner's Tutorial of Restricted Boltzmann Machines

Author(s): Yiping Cheng

Restricted Boltzmann machines (RBMs) are the building blocks of some deep learning networks. However, despite their importance, it is our perception that some very important derivations about the RBM are missing from the literature, and that a beginner may find the RBM very hard to understand. We provide these missing derivations here. We cover the classic Bernoulli-Bernoulli RBM and the Gaussian-Bernoulli RBM, but leave out the "continuous" RBM, as it is believed to be less mature than the former two. This tutorial can be used as a companion or complement to the famous RBM paper "Training restricted Boltzmann machines: An introduction" by Fischer and Igel.
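For concreteness, the two model families the tutorial covers are conventionally defined by the following energy functions, with joint distribution $p(v,h) = e^{-E(v,h)}/Z$. This is the standard formulation (as in Fischer and Igel), stated here for orientation rather than quoted from the tutorial; $v_i$ are visible units, $h_j$ hidden units, $a_i, b_j$ biases, $w_{ij}$ weights, and $\sigma_i$ the visible standard deviations in the Gaussian case.

```latex
% Bernoulli-Bernoulli RBM: binary visible and hidden units
E(v,h) = -\sum_i a_i v_i \;-\; \sum_j b_j h_j \;-\; \sum_{i,j} v_i w_{ij} h_j

% Gaussian-Bernoulli RBM: real-valued visible units, binary hidden units
E(v,h) = \sum_i \frac{(v_i - a_i)^2}{2\sigma_i^2} \;-\; \sum_j b_j h_j \;-\; \sum_{i,j} \frac{v_i}{\sigma_i}\, w_{ij} h_j
```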

2017, Vol. 29 (8), pp. 2123-2163
Author(s): Johan A. K. Suykens

The aim of this letter is to propose a theory of deep restricted kernel machines offering new foundations for deep learning with kernel machines. From the viewpoint of deep learning, it is partially related to restricted Boltzmann machines, which are characterized by visible and hidden units in a bipartite graph without hidden-to-hidden connections, and to their deep learning extensions such as deep belief networks and deep Boltzmann machines. From the viewpoint of kernel machines, it includes least squares support vector machines for classification and regression, kernel principal component analysis (PCA), matrix singular value decomposition, and Parzen-type models. A key element is to first characterize these kernel machines in terms of so-called conjugate feature duality, yielding a representation with visible and hidden units. It is shown how this is related to the energy form in restricted Boltzmann machines, with continuous variables in a nonprobabilistic setting. In this new framework of so-called restricted kernel machine (RKM) representations, the dual variables correspond to hidden features. Deep RKMs are obtained by coupling the RKMs. The method is illustrated for a deep RKM consisting of three levels: a least squares support vector machine regression level and two kernel PCA levels. In its primal form, deep feedforward neural networks can also be trained within this framework.
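The conjugate feature duality at the heart of this construction can be sketched with a Fenchel-Young bound on a quadratic; the following is a schematic reconstruction of the mechanism in my own notation, not the letter's full derivation. For error variables $e$ and a regularization constant $\lambda > 0$,

```latex
\frac{1}{2\lambda}\, e^\top e \;=\; \max_{h}\left( e^\top h - \frac{\lambda}{2}\, h^\top h \right)
\qquad\Longrightarrow\qquad
\frac{1}{2\lambda}\, e^\top e \;\ge\; e^\top h - \frac{\lambda}{2}\, h^\top h \quad \text{for all } h .
```

Substituting this bound into a least-squares objective replaces the quadratic error term by terms linear in newly introduced hidden features $h$, so visible and hidden variables appear in a product, which is what makes the resulting expression resemble the RBM energy form with continuous, nonprobabilistic variables.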


2017, Vol. 2017, pp. 1-9
Author(s): Xianchun Zou, Guijun Wang, Guoxian Yu

Accurately annotating the biological functions of proteins is one of the key tasks in the postgenome era. Many machine learning based methods have been applied to predict functional annotations of proteins, but this task is rarely solved by deep learning techniques. Deep learning techniques have recently been successfully applied to a wide range of problems, such as video, images, and natural language processing. Inspired by these successful applications, we investigate deep restricted Boltzmann machines (DRBM), a representative deep learning technique, to predict the missing functional annotations of partially annotated proteins. Experimental results on Homo sapiens, Saccharomyces cerevisiae, Mus musculus, and Drosophila show that DRBM achieves better performance than other related methods across different evaluation metrics, and it also runs faster than the competing methods.
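As a rough illustration of the general recipe, one can stack RBM feature extractors in front of a multi-label classifier. The sketch below is a hypothetical stand-in for the paper's DRBM, using scikit-learn's BernoulliRBM on synthetic data; the layer sizes, hyperparameters, and the logistic-regression output stage are illustrative assumptions, not the authors' configuration.

```python
# Hypothetical sketch: stacked-RBM features for multi-label protein
# function prediction (synthetic data stands in for real feature
# vectors and GO-style annotations).
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.pipeline import Pipeline
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier

rng = np.random.default_rng(0)
X = rng.random((200, 64))          # protein feature vectors scaled to [0, 1]
Y = rng.integers(0, 2, (200, 5))   # 5 binary functional labels per protein

# Two stacked RBMs learn a layered representation; a one-vs-rest
# logistic regression predicts each functional annotation.
model = Pipeline([
    ("rbm1", BernoulliRBM(n_components=32, learning_rate=0.05,
                          n_iter=20, random_state=0)),
    ("rbm2", BernoulliRBM(n_components=16, learning_rate=0.05,
                          n_iter=20, random_state=0)),
    ("clf", OneVsRestClassifier(LogisticRegression(max_iter=1000))),
])
model.fit(X, Y)
print(model.predict(X[:3]))        # predicted label vectors
```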


2020, Vol. 10 (4), pp. 1243
Author(s): Meng Li, Shuangxin Wang, Shanxiang Fang, Juchao Zhao

Accurate and efficient condition monitoring is key to enhancing the reliability and security of wind turbines. In recent years, intelligent anomaly detection methods based on deep learning networks have been receiving increasing attention. Since accurately labeled data are usually difficult to obtain in real industries, this paper proposes a novel Deep Small-World Neural Network (DSWNN), based on unsupervised learning, to detect early failures of wind turbines. During network construction, a regular auto-encoder network with multiple restricted Boltzmann machines is first constructed and pre-trained using unlabeled wind turbine data. The trained network is then transformed into a DSWNN model by a random edge-adding method, and the network parameters are fine-tuned using a minimal amount of labeled data. To guard against changes and disturbances in wind speed and to reduce false alarms, an adaptive threshold based on extreme value theory is presented as the criterion for anomaly judgment. The DSWNN model excels at deeply mining data characteristics and accurately measuring error. Finally, two wind turbine failure cases are given to demonstrate the validity and accuracy of the proposed methodology in comparison with a deep belief network and a deep neural network.
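The extreme-value-theory threshold can be sketched with a standard peaks-over-threshold procedure: fit a generalized Pareto distribution to the tail of the detector's error scores and extrapolate a far-tail quantile. The sketch below is a generic reconstruction under that assumption (the paper's exact estimator may differ), using scipy's genpareto.

```python
# Generic peaks-over-threshold (POT) anomaly threshold; a common
# EVT recipe, not necessarily the paper's exact estimator.
import numpy as np
from scipy.stats import genpareto

def evt_threshold(errors, u_quantile=0.95, tail_prob=1e-3):
    """Fit a GPD to errors above an initial threshold u, then
    extrapolate the threshold exceeded with probability tail_prob."""
    u = np.quantile(errors, u_quantile)
    excesses = errors[errors > u] - u
    # Fix the GPD location at 0 since excesses start at the threshold.
    c, _, scale = genpareto.fit(excesses, floc=0)
    # P(error > u + x) ~= (n_u / n) * (1 + c*x/scale)^(-1/c);
    # solve for x at the desired overall tail probability.
    q = tail_prob * len(errors) / len(excesses)
    if abs(c) > 1e-9:
        return u + (scale / c) * (q ** (-c) - 1.0)
    return u - scale * np.log(q)   # exponential-tail limit (c -> 0)

# Toy usage on synthetic reconstruction-error scores.
rng = np.random.default_rng(0)
scores = rng.gamma(shape=2.0, scale=1.0, size=10_000)
print(evt_threshold(scores))       # alarm when a new score exceeds this
```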


2008, Vol. 20 (6), pp. 1631-1649
Author(s): Nicolas Le Roux, Yoshua Bengio

Deep belief networks (DBN) are generative neural network models with many layers of hidden explanatory factors, recently introduced by Hinton, Osindero, and Teh (2006) along with a greedy layer-wise unsupervised learning algorithm. The building block of a DBN is a probabilistic model called a restricted Boltzmann machine (RBM), used to represent one layer of the model. Restricted Boltzmann machines are interesting because inference is easy in them and because they have been successfully used as building blocks for training deeper models. We first prove that adding hidden units yields strictly improved modeling power, while a second theorem shows that RBMs are universal approximators of discrete distributions. We then study the question of whether DBNs with more layers are strictly more powerful in terms of representational power. This suggests a new and less greedy criterion for training RBMs within DBNs.
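The lever behind both theorems can be seen in the RBM's marginal over the visible units, a standard identity (stated in my notation, not quoted from the paper): summing the Boltzmann distribution over binary hidden units factorizes, so each hidden unit contributes exactly one multiplicative factor to the unnormalized $p(v)$, and adding a hidden unit multiplies it by one new factor.

```latex
p(v) \;=\; \frac{1}{Z}\sum_{h \in \{0,1\}^m} e^{-E(v,h)}
     \;=\; \frac{1}{Z}\, e^{a^\top v} \prod_{j=1}^{m} \left( 1 + e^{\,b_j + v^\top w_{\cdot j}} \right),
\qquad E(v,h) = -a^\top v - b^\top h - v^\top W h .
```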


2013, Vol. 25 (3), pp. 805-831
Author(s): KyungHyun Cho, Tapani Raiko, Alexander Ilin

Restricted Boltzmann machines (RBMs) are often used as building blocks in greedy learning of deep networks. However, training this simple model can be laborious. Traditional learning algorithms often converge only with the right choice of metaparameters that specify, for example, the learning rate schedule and the scale of the initial weights. They are also sensitive to the specific data representation: an equivalent RBM can be obtained by flipping some bits and changing the weights and biases accordingly, but traditional learning rules are not invariant to such transformations. Without careful tuning of these training settings, traditional algorithms can easily get stuck or even diverge. In this letter, we present an enhanced gradient that is derived to be invariant to bit-flipping transformations. We experimentally show that the enhanced gradient yields more stable training of RBMs both with a fixed learning rate and with an adaptive one.
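For reference, the baseline these learning rules modify is plain contrastive divergence (CD-1). Below is a minimal NumPy sketch of textbook CD-1 for a Bernoulli-Bernoulli RBM; it shows the standard gradient estimate, not the letter's bit-flip-invariant enhanced gradient, which recombines these same data and model statistics differently.

```python
# Textbook CD-1 for a Bernoulli-Bernoulli RBM (baseline, not the
# enhanced gradient from the letter).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(W, a, b, v0, lr=0.05):
    """One CD-1 step on a batch v0 of binary visible vectors."""
    # Positive phase: hidden probabilities given the data.
    ph0 = sigmoid(v0 @ W + b)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Negative phase: one step of Gibbs sampling.
    pv1 = sigmoid(h0 @ W.T + a)
    v1 = (rng.random(pv1.shape) < pv1).astype(float)
    ph1 = sigmoid(v1 @ W + b)
    # Gradient estimate: data statistics minus model statistics.
    n = v0.shape[0]
    W += lr * (v0.T @ ph0 - v1.T @ ph1) / n
    a += lr * (v0 - v1).mean(axis=0)
    b += lr * (ph0 - ph1).mean(axis=0)
    return W, a, b

# Toy usage: 16 visible units, 8 hidden units, batch of 32.
W = 0.01 * rng.standard_normal((16, 8))
a, b = np.zeros(16), np.zeros(8)
v = (rng.random((32, 16)) < 0.5).astype(float)
W, a, b = cd1_update(W, a, b, v)
```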

