Skip to main content

Showing 1–1 of 1 results for author: Vipparla, S

.
  1. arXiv:2501.06208  [pdf, other

    cs.CL

    Enhancing AI Safety Through the Fusion of Low Rank Adapters

    Authors: Satya Swaroop Gudipudi, Sreeram Vipparla, Harpreet Singh, Shashwat Goel, Ponnurangam Kumaraguru

    Abstract: Instruction fine-tuning of large language models (LLMs) is a powerful method for improving task-specific performance, but it can inadvertently lead to a phenomenon where models generate harmful responses when faced with malicious prompts. In this paper, we explore Low-Rank Adapter Fusion (LoRA) as a means to mitigate these risks while preserving the model's ability to handle diverse instructions e… ▽ More

    Submitted 30 December, 2024; originally announced January 2025.