Skip to main content

Showing 1–10 of 10 results for author: Magooda, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.22037  [pdf, other

    cs.CL cs.CR cs.SE

    Jailbreak Distillation: Renewable Safety Benchmarking

    Authors: Jingyu Zhang, Ahmed Elgohary, Xiawei Wang, A S M Iftekhar, Ahmed Magooda, Benjamin Van Durme, Daniel Khashabi, Kyle Jackson

    Abstract: Large language models (LLMs) are rapidly deployed in critical applications, raising urgent needs for robust safety benchmarking. We propose Jailbreak Distillation (JBDistill), a novel benchmark construction framework that "distills" jailbreak attacks into high-quality and easily-updatable safety benchmarks. JBDistill utilizes a small set of development models and existing jailbreak attack algorith… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Project page: https://aka.ms/jailbreak-distillation

  2. arXiv:2410.08968  [pdf, other

    cs.CL cs.AI

    Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

    Authors: Jingyu Zhang, Ahmed Elgohary, Ahmed Magooda, Daniel Khashabi, Benjamin Van Durme

    Abstract: The current paradigm for safety alignment of large language models (LLMs) follows a one-size-fits-all approach: the model refuses to interact with any content deemed unsafe by the model provider. This approach lacks flexibility in the face of varying social norms across cultures and regions. In addition, users may have diverse safety needs, making a model with static safety standards too restricti… ▽ More

    Submitted 3 March, 2025; v1 submitted 11 October, 2024; originally announced October 2024.

    Comments: ICLR 2025 camera ready

  3. arXiv:2406.13905  [pdf, other

    cs.CL

    Persuasiveness of Generated Free-Text Rationales in Subjective Decisions: A Case Study on Pairwise Argument Ranking

    Authors: Mohamed Elaraby, Diane Litman, Xiang Lorraine Li, Ahmed Magooda

    Abstract: Generating free-text rationales is among the emergent capabilities of Large Language Models (LLMs). These rationales have been found to enhance LLM performance across various NLP tasks. Recently, there has been growing interest in using these rationales to provide insights for various important downstream tasks. In this paper, we analyze generated free-text rationales in tasks with subjective answ… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2404.01282  [pdf, other

    cs.CV

    LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization

    Authors: Akshita Gupta, Gaurav Mittal, Ahmed Magooda, Ye Yu, Graham W. Taylor, Mei Chen

    Abstract: Temporal Action Localization (TAL) involves localizing and classifying action snippets in an untrimmed video. The emergence of large video foundation models has led RGB-only video backbones to outperform previous methods needing both RGB and optical flow modalities. Leveraging these large models is often limited to training only the TAL head due to the prohibitively large GPU memory required to ad… ▽ More

    Submitted 5 December, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: WACV 2025 Accepted

  5. arXiv:2310.17750  [pdf, other

    cs.CL

    A Framework for Automated Measurement of Responsible AI Harms in Generative AI Applications

    Authors: Ahmed Magooda, Alec Helyar, Kyle Jackson, David Sullivan, Chad Atalla, Emily Sheng, Dan Vann, Richard Edgar, Hamid Palangi, Roman Lutz, Hongliang Kong, Vincent Yun, Eslam Kamal, Federico Zarfati, Hanna Wallach, Sarah Bird, Mei Chen

    Abstract: We present a framework for the automated measurement of responsible AI (RAI) metrics for large language models (LLMs) and associated products and services. Our framework for automatically measuring harms from LLMs builds on existing technical and sociotechnical expertise and leverages the capabilities of state-of-the-art LLMs, such as GPT-4. We use this framework to run through several case studie… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: This is a living document

  6. arXiv:2109.08569  [pdf, other

    cs.CL

    Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization

    Authors: Ahmed Magooda, Diane Litman

    Abstract: This paper explores three simple data manipulation techniques (synthesis, augmentation, curriculum) for improving abstractive summarization models without the need for any additional data. We introduce a method of data synthesis with paraphrasing, a data augmentation technique with sample mixing, and curriculum learning with two new difficulty metrics based on specificity and abstractiveness. We c… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: To appear in proceedings of EMNLP 2021 (https://2021.emnlp.org/)

  7. arXiv:2109.08565  [pdf, other

    cs.CL

    Exploring Multitask Learning for Low-Resource AbstractiveSummarization

    Authors: Ahmed Magooda, Mohamed Elaraby, Diane Litman

    Abstract: This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept detection, and paraphrase detection) both individually and in combination, with the goal of enhancing the target task of abstractive summarization via multitask lea… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: To appear in proceedings of EMNLP 2021 (https://2021.emnlp.org/)

  8. arXiv:2002.03407  [pdf, ps, other

    cs.CL cs.LG

    Abstractive Summarization for Low Resource Data using Domain Transfer and Data Synthesis

    Authors: Ahmed Magooda, Diane Litman

    Abstract: Training abstractive summarization models typically requires large amounts of data, which can be a limitation for many domains. In this paper we explore using domain transfer and data synthesis to improve the performance of recent abstractive summarization methods when applied to small corpora of student reflections. First, we explored whether tuning state of the art model trained on newspaper dat… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

    Comments: To be published in FLAIRS33 (https://www.flairs-33.info/) and appear in he proceedings of AAAI

  9. arXiv:2002.03405  [pdf, ps, other

    cs.CL cs.LG

    Attend to the beginning: A study on using bidirectional attention for extractive summarization

    Authors: Ahmed Magooda, Cezary Marcjan

    Abstract: Forum discussion data differ in both structure and properties from generic form of textual data such as news. Henceforth, summarization techniques should, in turn, make use of such differences, and craft models that can benefit from the structural nature of discussion data. In this work, we propose attending to the beginning of a document, to improve the performance of extractive summarization mod… ▽ More

    Submitted 8 May, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: To be published in FLAIRS33 (https://www.flairs-33.info/) and appear in he proceedings of AAAI

  10. eRevise: Using Natural Language Processing to Provide Formative Feedback on Text Evidence Usage in Student Writing

    Authors: Haoran Zhang, Ahmed Magooda, Diane Litman, Richard Correnti, Elaine Wang, Lindsay Clare Matsumura, Emily Howe, Rafael Quintana

    Abstract: Writing a good essay typically involves students revising an initial paper draft after receiving feedback. We present eRevise, a web-based writing and revising environment that uses natural language processing features generated for rubric-based essay scoring to trigger formative feedback messages regarding students' use of evidence in response-to-text writing. By helping students understand the c… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: Published in IAAI 19

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence (2019) vol. 33, 9619-9625