def. Restricted Boltzmann Machine.1 Let a partitioned neural network with visible layer and hidden (=latent) layer , with weight matrix and biases for visible layer and for hidden layer. Then we can define a probability distribution over the whole network:

where is called the “Energy function”

In most cases, each (Bernouilli–Bernouilli) or (Gaussian–Bernouilli) Conditional Inference. Given data we obtain the distribution of hidden layer

from which we can “sample” in the latent distribution to get . Alternatively, given a latent representation we obtain the distribution of generated “data”:

Training. We minimize the log-likelihood to model the data distribution to fit the data best:

Footnotes

  1. Neural networks [5.1] : Restricted Boltzmann machine - definition - YouTube ← this whole playlist is very useful.