
Showing 1–6 of 6 results for author: Zangrando, E

  1. arXiv:2410.18720  [pdf, other]

    cs.LG cs.AI math.NA

    GeoLoRA: Geometric integration for parameter efficient fine-tuning

    Authors: Steffen Schotthöfer, Emanuele Zangrando, Gianluca Ceruti, Francesco Tudisco, Jonas Kusch

    Abstract: Low-Rank Adaptation (LoRA) has become a widely used method for parameter-efficient fine-tuning of large-scale, pre-trained neural networks. However, LoRA and its extensions face several challenges, including the need for rank adaptivity, robustness, and computational efficiency during the fine-tuning process. We introduce GeoLoRA, a novel approach that addresses these limitations by leveraging dyn…

    Submitted 24 October, 2024; originally announced October 2024.

  2. arXiv:2410.12607  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    Low-Rank Adversarial PGD Attack

    Authors: Dayana Savostianova, Emanuele Zangrando, Francesco Tudisco

    Abstract: Adversarial attacks on deep neural network models have seen rapid development and are extensively used to study the stability of these networks. Among various adversarial strategies, Projected Gradient Descent (PGD) is a widely adopted method in computer vision due to its effectiveness and quick implementation, making it suitable for adversarial training. In this work, we observe that in many case…

    Submitted 16 October, 2024; originally announced October 2024.

  3. arXiv:2402.03991  [pdf, other]

    cs.LG math.NA stat.ML

    Neural Rank Collapse: Weight Decay and Small Within-Class Variability Yield Low-Rank Bias

    Authors: Emanuele Zangrando, Piero Deidda, Simone Brugiapaglia, Nicola Guglielmi, Francesco Tudisco

    Abstract: Recent work in deep learning has shown strong empirical and theoretical evidence of an implicit low-rank bias: weight matrices in deep networks tend to be approximately low-rank and removing relatively small singular values during training or from available trained models may significantly reduce model size while maintaining or even improving model performance. However, the majority of the theoret…

    Submitted 6 February, 2024; originally announced February 2024.

  4. arXiv:2306.01485  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    Robust low-rank training via approximate orthonormal constraints

    Authors: Dayana Savostianova, Emanuele Zangrando, Gianluca Ceruti, Francesco Tudisco

    Abstract: With the growth of model and data sizes, a broad effort has been made to design pruning techniques that reduce the resource demand of deep learning pipelines, while retaining model performance. In order to reduce both inference and training costs, a prominent line of work uses low-rank matrix factorizations to represent the network weights. Although able to retain accuracy, we observe that low-ran…

    Submitted 2 June, 2023; originally announced June 2023.

  5. arXiv:2305.19059  [pdf, other]

    cs.LG math.NA stat.ML

    Geometry-aware training of factorized layers in tensor Tucker format

    Authors: Emanuele Zangrando, Steffen Schotthöfer, Gianluca Ceruti, Jonas Kusch, Francesco Tudisco

    Abstract: Reducing parameter redundancies in neural network architectures is crucial for achieving feasible computational and memory requirements during training and inference phases. Given its easy implementation and flexibility, one promising approach is layer factorization, which reshapes weight tensors into a matrix format and parameterizes them as the product of two small rank matrices. However, this a…

    Submitted 14 October, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  6. arXiv:2205.13571  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    Low-rank lottery tickets: finding efficient low-rank neural networks via matrix differential equations

    Authors: Steffen Schotthöfer, Emanuele Zangrando, Jonas Kusch, Gianluca Ceruti, Francesco Tudisco

    Abstract: Neural networks have achieved tremendous success in a large variety of applications. However, their memory footprint and computational demand can render them impractical in application settings with limited hardware or energy resources. In this work, we propose a novel algorithm to find efficient low-rank subnetworks. Remarkably, these subnetworks are determined and adapted already during the trai…

    Submitted 18 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Journal ref: Proceedings NeurIPS 2022