Skip to main content

Showing 1–1 of 1 results for author: Bolandraftar, B

.
  1. arXiv:2406.13551  [pdf, other

    cs.CL cs.AI

    Mitigating Social Biases in Language Models through Unlearning

    Authors: Omkar Dige, Diljot Singh, Tsz Fung Yau, Qixuan Zhang, Borna Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak

    Abstract: Mitigating bias in language models (LMs) has become a critical problem due to the widespread deployment of LMs. Numerous approaches revolve around data pre-processing and fine-tuning of language models, tasks that can be both time-consuming and computationally demanding. Consequently, there is a growing interest in machine unlearning techniques given their capacity to induce the forgetting of unde… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.