Skip to main content

Showing 1–1 of 1 results for author: Gorlla, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.04051  [pdf, other

    cs.LG cs.CL

    Empirical Analysis of Efficient Fine-Tuning Methods for Large Pre-Trained Language Models

    Authors: Nigel Doering, Cyril Gorlla, Trevor Tuttle, Adhvaith Vijay

    Abstract: Fine-tuning large pre-trained language models for downstream tasks remains a critical challenge in natural language processing. This paper presents an empirical analysis comparing two efficient fine-tuning methods - BitFit and adapter modules - to standard full model fine-tuning. Experiments conducted on GLUE benchmark datasets (MRPC, COLA, STS-B) reveal several key insights. The BitFit approach,… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.