A trick in deep learning to speed up training computation.

Linear Normialization

For each

where are learnable parameters

Batch Normalization