Part III: What does Low Rank Factorization of a Convolutional Layer really do?
A Manual Implementation of Quantization in PyTorch - Single Layer
Part II: Shrinking Neural Networks for Embedded Systems Using Low Rank Approximations (LoRA)