Skip to main content

Showing 1–6 of 6 results for author: Obadinma, S

.
  1. arXiv:2506.11083  [pdf, ps, other

    cs.CL

    RedDebate: Safer Responses through Multi-Agent Red Teaming Debates

    Authors: Ali Asad, Stephen Obadinma, Radin Shayanfar, Xiaodan Zhu

    Abstract: We propose RedDebate, a novel multi-agent debate framework that leverages adversarial argumentation among Large Language Models (LLMs) to proactively identify and mitigate their own unsafe behaviours. Existing AI safety methods often depend heavily on costly human evaluations or isolated single-model assessment, both subject to scalability constraints and oversight risks. RedDebate instead embrace… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  2. arXiv:2405.18553  [pdf, other

    cs.AI

    FAIIR: Building Toward A Conversational AI Agent Assistant for Youth Mental Health Service Provision

    Authors: Stephen Obadinma, Alia Lachana, Maia Norman, Jocelyn Rankin, Joanna Yu, Xiaodan Zhu, Darren Mastropaolo, Deval Pandya, Roxana Sultan, Elham Dolatabadi

    Abstract: The world's healthcare systems and mental health agencies face both a growing demand for youth mental health services, alongside a simultaneous challenge of limited resources. Here, we focus on frontline crisis support, where Crisis Responders (CRs) engage in conversations for youth mental health support and assign an issue tag to each conversation. In this study, we develop FAIIR (Frontline Assis… ▽ More

    Submitted 11 February, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2401.02718  [pdf, other

    cs.LG cs.CR

    Calibration Attacks: A Comprehensive Study of Adversarial Attacks on Model Confidence

    Authors: Stephen Obadinma, Xiaodan Zhu, Hongyu Guo

    Abstract: In this work, we highlight and perform a comprehensive study on calibration attacks, a form of adversarial attacks that aim to trap victim models to be heavily miscalibrated without altering their predicted labels, hence endangering the trustworthiness of the models and follow-up decision making based on their confidence. We propose four typical forms of calibration attacks: underconfidence, overc… ▽ More

    Submitted 29 November, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted at Transactions on Machine Learning Research

  4. arXiv:2303.02577  [pdf, other

    cs.CL

    Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data

    Authors: Stephen Obadinma, Hongyu Guo, Xiaodan Zhu

    Abstract: Recent work has demonstrated that using parameter efficient tuning techniques such as prefix tuning (or P-tuning) on pretrained language models can yield performance that is comparable or superior to fine-tuning while dramatically reducing trainable parameters. Nevertheless, the effectiveness of such methods under the context of data augmentation, a common strategy to improve learning under low da… ▽ More

    Submitted 29 June, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Published at the 8th Workshop on Representation Learning for NLP (RepL4NLP 2023) at ACL 2023

  5. arXiv:2302.03222  [pdf, other

    cs.CL

    Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support

    Authors: Stephen Obadinma, Faiza Khan Khattak, Shirley Wang, Tania Sidhom, Elaine Lau, Sean Robertson, Jingcheng Niu, Winnie Au, Alif Munim, Karthik Raja K. Bhaskar, Bencheng Wei, Iris Ren, Waqar Muhammad, Erin Li, Bukola Ishola, Michael Wang, Griffin Tanner, Yu-Jia Shiah, Sean X. Zhang, Kwesi P. Apponsah, Kanishk Patel, Jaswinder Narain, Deval Pandya, Xiaodan Zhu, Frank Rudzicz , et al. (1 additional authors not shown)

    Abstract: Building Agent Assistants that can help improve customer service support requires inputs from industry users and their customers, as well as knowledge about state-of-the-art Natural Language Processing (NLP) technology. We combine expertise from academia and industry to bridge the gap and build task/domain-specific Neural Agent Assistants (NAA) with three high-level components for: (1) Intent Iden… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Camera Ready Version of Paper Published in EMNLP 2022 Industry Track

  6. arXiv:2008.00563  [pdf, other

    cs.CL

    SemEval-2020 Task 5: Counterfactual Recognition

    Authors: Xiaoyu Yang, Stephen Obadinma, Huasha Zhao, Qiong Zhang, Stan Matwin, Xiaodan Zhu

    Abstract: We present a counterfactual recognition (CR) task, the shared Task 5 of SemEval-2020. Counterfactuals describe potential outcomes (consequents) produced by actions or circumstances that did not happen or cannot happen and are counter to the facts (antecedent). Counterfactual thinking is an important characteristic of the human cognitive system; it connects antecedents and consequents with causal r… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

    Comments: Task description paper of SemEval-2020 Task 5: Modelling Causal Reasoning in Language: Detecting Counterfactuals