Abstract: Low-Rank Approximation (LRA) is a commonly used method for compressing deep learning (DL) models by factorizing weight matrices into lower-dimensional components. Although LRA is most ...