Part I : Shrinking Neural Networks for Embedded Systems Using Low Rank Approximations (LoRA)
An elementary explanation of the problem with full rank matrices in neural networks and their solution via low rank approximations. Detailed explanation on how to set up the optimization problem and how to solve it, in possibly linear time.
Read more



