Skip to main content

Showing 1–3 of 3 results for author: Sabot, A

.
  1. arXiv:2504.00254  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

    Authors: Huandong Chang, Zicheng Ma, Mingyuan Ma, Zhenting Qi, Andrew Sabot, Hong Jiang, H. T. Kung

    Abstract: Low-Rank Adaptation (LoRA) has become a widely adopted technique for fine-tuning large-scale pre-trained models with minimal parameter updates. However, existing methods rely on fixed ranks or focus solely on either rank pruning or expansion, failing to adapt ranks dynamically to match the importance of different layers during training. In this work, we propose ElaLoRA, an adaptive low-rank adapta… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  2. arXiv:2307.03930  [pdf, other

    cs.LG cs.AR cs.PF cs.PL

    Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels

    Authors: Vikas Natesh, Andrew Sabot, H. T. Kung, Mark Ting

    Abstract: We propose Rosko -- row skipping outer products -- for deriving sparse matrix multiplication (SpMM) kernels in reducing computation and memory access requirements of deep neural networks (DNNs). Rosko allows skipping of entire row computations during program execution with low sparsity-management overheads. We analytically derive sparse CPU kernels that adapt to given hardware characteristics to e… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Rosko's CPU implementation can be found at https://github.com/vnatesh/Rosko

  3. arXiv:2304.05544  [pdf, other

    cs.LG cs.AR cs.PF cs.PL

    MEMA Runtime Framework: Minimizing External Memory Accesses for TinyML on Microcontrollers

    Authors: Andrew Sabot, Vikas Natesh, H. T. Kung, Wei-Te Ting

    Abstract: We present the MEMA framework for the easy and quick derivation of efficient inference runtimes that minimize external memory accesses for matrix multiplication on TinyML systems. The framework accounts for hardware resource constraints and problem sizes in analytically determining optimized schedules and kernels that minimize memory accesses. MEMA provides a solution to a well-known problem in th… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023