Skip to main content

Showing 1–1 of 1 results for author: Jami, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.14595  [pdf, other

    cs.CL cs.LG

    EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models

    Authors: Hossein Rajabzadeh, Aref Jafari, Aman Sharma, Benyamin Jami, Hyock Ju Kwon, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh

    Abstract: Large Language Models (LLMs), with their increasing depth and number of parameters, have demonstrated outstanding performance across a variety of natural language processing tasks. However, this growth in scale leads to increased computational demands, particularly during inference and fine-tuning. To address these challenges, we introduce EchoAtt, a novel framework aimed at optimizing transformer… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.