Skip to main content

Showing 1–4 of 4 results for author: Dige, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17228  [pdf, ps, other

    cs.LG

    Automated Capability Evaluation of Foundation Models

    Authors: Arash Afkanpour, Omkar Dige, Fatemeh Tavakoli

    Abstract: Current evaluation frameworks for foundation models rely heavily on fixed, manually curated benchmarks, limiting their ability to capture the full breadth of model capabilities. This paper introduces Active learning for Capability Evaluation (ACE), a novel framework for scalable, automated, and fine-grained evaluation of foundation models. ACE leverages the knowledge embedded in powerful language… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  2. arXiv:2406.13551  [pdf, other

    cs.CL cs.AI

    Mitigating Social Biases in Language Models through Unlearning

    Authors: Omkar Dige, Diljot Singh, Tsz Fung Yau, Qixuan Zhang, Borna Bolandraftar, Xiaodan Zhu, Faiza Khan Khattak

    Abstract: Mitigating bias in language models (LMs) has become a critical problem due to the widespread deployment of LMs. Numerous approaches revolve around data pre-processing and fine-tuning of language models, tasks that can be both time-consuming and computationally demanding. Consequently, there is a growing interest in machine unlearning techniques given their capacity to induce the forgetting of unde… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2308.00071  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    On The Role of Reasoning in the Identification of Subtle Stereotypes in Natural Language

    Authors: Jacob-Junqi Tian, Omkar Dige, D. B. Emerson, Faiza Khan Khattak

    Abstract: Large language models (LLMs) are trained on vast, uncurated datasets that contain various forms of biases and language reinforcing harmful stereotypes that may be subsequently inherited by the models themselves. Therefore, it is essential to examine and address biases in language models, integrating fairness into their development to ensure that these models do not perpetuate social biases. In thi… ▽ More

    Submitted 28 September, 2024; v1 submitted 24 July, 2023; originally announced August 2023.

    Comments: 15 pages, 11 Figures, 3 Tables

    MSC Class: 68T50

  4. arXiv:2307.10472  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Can Instruction Fine-Tuned Language Models Identify Social Bias through Prompting?

    Authors: Omkar Dige, Jacob-Junqi Tian, David Emerson, Faiza Khan Khattak

    Abstract: As the breadth and depth of language model applications continue to expand rapidly, it is increasingly important to build efficient frameworks for measuring and mitigating the learned or inherited social biases of these models. In this paper, we present our work on evaluating instruction fine-tuned language models' ability to identify bias through zero-shot prompting, including Chain-of-Thought (C… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.