Motivation. Principle Component Analysis was about transforming data into the explanatory variables (=principle components). PPCA is mathematically similar, but it thinks of each source variable as a random variable.

def. Probabilitic Principle Component Analysis (PPCA) model. We have observed data with and we want to extract latent features into . Naturally, .

where

  • i.e. multivariate standard normal distribution
  • noise, i.e. multivarite normal distribution with mean , variance for each
  • Observe that then where Objective is to find the optimal , noise variance and given observed data. We use MLEs for this. Steps are briefly outlined below. Objective function is the log-likelihood:

The analytic solution using eigendecompotision is:

where

  • in decreasing order, is from a spectral decomposition of sample covariance
  • is an arbitrary orthogonal rotation matrix