Skip to main content

Showing 1–3 of 3 results for author: Ombuki-Berman, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.00046  [pdf, other

    cs.LG cs.CL

    Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models

    Authors: Tom Wallace, Naser Ezzati-Jivan, Beatrice Ombuki-Berman

    Abstract: Advancements in Natural Language Processing are heavily reliant on the Transformer architecture, whose improvements come at substantial resource costs due to ever-growing model sizes. This study explores optimization techniques, including Quantization, Knowledge Distillation, and Pruning, focusing on energy and computational efficiency while retaining performance. Among standalone methods, 4-bit Q… ▽ More

    Submitted 16 January, 2025; originally announced February 2025.

    Comments: Accepted for ACM's ICPE 2025 in Short Paper format

    MSC Class: 68T50

  2. arXiv:2501.03095  [pdf, other

    cs.CV cs.NE

    A Novel Structure-Agnostic Multi-Objective Approach for Weight-Sharing Compression in Deep Neural Networks

    Authors: Rasa Khosrowshahli, Shahryar Rahnamayan, Beatrice Ombuki-Berman

    Abstract: Deep neural networks suffer from storing millions and billions of weights in memory post-training, making challenging memory-intensive models to deploy on embedded devices. The weight-sharing technique is one of the popular compression approaches that use fewer weight values and share across specific connections in the network. In this paper, we propose a multi-objective evolutionary algorithm (MO… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 16 pages, 9 figures, submitted to IEEE Transactions on Neural Networks and Learning Systems

  3. arXiv:2408.07194  [pdf, other

    cs.NE cs.AI cs.LG

    Massive Dimensions Reduction and Hybridization with Meta-heuristics in Deep Learning

    Authors: Rasa Khosrowshahli, Shahryar Rahnamayan, Beatrice Ombuki-Berman

    Abstract: Deep learning is mainly based on utilizing gradient-based optimization for training Deep Neural Network (DNN) models. Although robust and widely used, gradient-based optimization algorithms are prone to getting stuck in local minima. In this modern deep learning era, the state-of-the-art DNN models have millions and billions of parameters, including weights and biases, making them huge-scale optim… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 8 pages, 5 figures, 3 tables, accepted at IEEE CCECE 2024 (updated Fig. 1 and conclusion remarks)