Motivation. Principle Component Analysis was about transforming data into the explanatory variables (=principle components). PPCA is mathematically similar, but it thinks of each source variable as a random variable.
def. Probabilitic Principle Component Analysis (PPCA) model. We have observed data with and we want to extract latent features into . Naturally, .
where
- i.e. multivariate standard normal distribution
- noise, i.e. multivarite normal distribution with mean , variance for each
- Observe that then where Objective is to find the optimal , noise variance and given observed data. We use MLEs for this. Steps are briefly outlined below. Objective function is the log-likelihood:
The analytic solution using eigendecompotision is:
where
- in decreasing order, is from a spectral decomposition of sample covariance
- is an arbitrary orthogonal rotation matrix