Tag: Optimization

Posted 2024-09-13Updated 2025-03-02artificial-intelligence22 minutes read (About 3302 words)

Part III : What does Low Rank Factorization of a Convolutional Layer really do?

In this post, we will explore the Low Rank Approximation (LoRA) technique for shrinking neural networks for embedded systems. We will focus on the Convolutional Neural Network (CNN) case and discuss the rank selection process.

Posted 2024-05-16Updated 2025-03-02artificial-intelligence9 minutes read (About 1372 words)

Are Values Passed Between Layers Float or Int in PyTorch Post Quantization?

In this article, we will discuss how values are passed between layers post quantization in PyTorch. We will also discuss why floating point operations are slower than integer operations.

Posted 2024-05-16Updated 2025-03-02artificial-intelligence27 minutes read (About 4019 words)

A Manual Implementation of Quantization in PyTorch - Single Layer

A manual implementation of quantization in PyTorch.

Posted 2024-04-24Updated 2025-03-02artificial-intelligence5 minutes read (About 757 words)

Part II : Shrinking Neural Networks for Embedded Systems Using Low Rank Approximations (LoRA)

In this post, we will explore the Low Rank Approximation (LoRA) technique for shrinking neural networks for embedded systems. We will focus on the Convolutional Neural Network (CNN) case and discuss the rank selection process.

Posted 2024-04-03Updated 2025-03-02artificial-intelligence19 minutes read (About 2888 words)

Part I : Shrinking Neural Networks for Embedded Systems Using Low Rank Approximations (LoRA)

An elementary explanation of the problem with full rank matrices in neural networks and their solution via low rank approximations. Detailed explanation on how to set up the optimization problem and how to solve it, in possibly linear time.

Links

Categories

Recents

Archives

Tags

Subscribe for updates

follow.it