Skip to main content

Showing 1–20 of 20 results for author: Sicilia, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.17348  [pdf, other

    cs.CL cs.HC

    Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems

    Authors: Mert İnan, Anthony Sicilia, Suvodip Dey, Vardhan Dongre, Tejas Srinivasan, Jesse Thomason, Gökhan Tür, Dilek Hakkani-Tür, Malihe Alikhani

    Abstract: While theories of discourse and cognitive science have long recognized the value of unhurried pacing, recent dialogue research tends to minimize friction in conversational systems. Yet, frictionless dialogue risks fostering uncritical reliance on AI outputs, which can obscure implicit assumptions and lead to unintended consequences. To meet this challenge, we propose integrating positive friction… ▽ More

    Submitted 31 January, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  2. arXiv:2501.06129  [pdf, other

    cs.CL cs.AI

    Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI

    Authors: Yuya Asano, Sabit Hassan, Paras Sharma, Anthony Sicilia, Katherine Atwell, Diane Litman, Malihe Alikhani

    Abstract: General-purpose automatic speech recognition (ASR) systems do not always perform well in goal-oriented dialogue. Existing ASR correction methods rely on prior user data or named entities. We extend correction to tasks that have no prior user data and exhibit linguistic flexibility such as lexical and syntactic variations. We propose a novel context augmentation with a large language model and a ra… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: Accepted to COLING 2025 Industry Track

  3. arXiv:2410.14746  [pdf, other

    cs.CL cs.AI cs.HC

    Accounting for Sycophancy in Language Model Uncertainty Estimation

    Authors: Anthony Sicilia, Mert Inan, Malihe Alikhani

    Abstract: Effective human-machine collaboration requires machine learning models to externalize uncertainty, so users can reflect and intervene when necessary. For language models, these representations of uncertainty may be impacted by sycophancy bias: proclivity to agree with users, even if they are wrong. For instance, models may be over-confident in (incorrect) problem solutions suggested by a user. We… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  4. arXiv:2410.14744  [pdf, other

    cs.CL cs.AI

    Eliciting Uncertainty in Chain-of-Thought to Mitigate Bias against Forecasting Harmful User Behaviors

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: Conversation forecasting tasks a model with predicting the outcome of an unfolding conversation. For instance, it can be applied in social media moderation to predict harmful user behaviors before they occur, allowing for preventative interventions. While large language models (LLMs) have recently been proposed as an effective tool for conversation forecasting, it's unclear what biases they may ha… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  5. arXiv:2410.14026  [pdf, other

    cs.CL cs.AI cs.CY cs.HC

    Generating Signed Language Instructions in Large-Scale Dialogue Systems

    Authors: Mert İnan, Katherine Atwell, Anthony Sicilia, Lorna Quandt, Malihe Alikhani

    Abstract: We introduce a goal-oriented conversational AI system enhanced with American Sign Language (ASL) instructions, presenting the first implementation of such a system on a worldwide multimodal conversational AI platform. Accessible through a touch-based interface, our system receives input from users and seamlessly generates ASL instructions by leveraging retrieval methods and cognitively based gloss… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Industry Track

  6. arXiv:2410.13641  [pdf, other

    cs.CL

    An Active Learning Framework for Inclusive Generation by Large Language Models

    Authors: Sabit Hassan, Anthony Sicilia, Malihe Alikhani

    Abstract: Ensuring that Large Language Models (LLMs) generate text representative of diverse sub-populations is essential, particularly when key concepts related to under-represented groups are scarce in the training data. We address this challenge with a novel clustering-based active learning framework, enhanced with knowledge distillation. The proposed framework transforms the intermediate outputs of the… ▽ More

    Submitted 14 December, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: COLING, 2025

  7. arXiv:2410.11114  [pdf, other

    cs.CL

    Active Learning for Robust and Representative LLM Generation in Safety-Critical Scenarios

    Authors: Sabit Hassan, Anthony Sicilia, Malihe Alikhani

    Abstract: Ensuring robust safety measures across a wide range of scenarios is crucial for user-facing systems. While Large Language Models (LLMs) can generate valuable data for safety measures, they often exhibit distributional biases, focusing on common scenarios and neglecting rare but critical cases. This can undermine the effectiveness of safety protocols developed using such data. To address this, we p… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  8. arXiv:2409.14986  [pdf, other

    cs.CL cs.AI

    Evaluating Theory of (an uncertain) Mind: Predicting the Uncertain Beliefs of Others in Conversation Forecasting

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: Typically, when evaluating Theory of Mind, we consider the beliefs of others to be binary: held or not held. But what if someone is unsure about their own beliefs? How can we quantify this uncertainty? We propose a new suite of tasks, challenging language models (LMs) to model the uncertainty of others in dialogue. We design these tasks around conversation forecasting, wherein an agent forecasts a… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  9. arXiv:2402.03284  [pdf, other

    cs.CL cs.AI cs.LG

    Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

    Authors: Anthony Sicilia, Hyunwoo Kim, Khyathi Raghavi Chandu, Malihe Alikhani, Jack Hessel

    Abstract: Effective interlocutors account for the uncertain goals, beliefs, and emotions of others. But even the best human conversationalist cannot perfectly anticipate the trajectory of a dialogue. How well can language models represent inherent uncertainty in conversations? We propose FortUne Dial, an expansion of the long-standing "conversation forecasting" task: instead of just accuracy, evaluation is… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 2 Figures; 7 Tables; 27 pages

  10. arXiv:2307.04303  [pdf, other

    cs.CL cs.AI

    Learning to Generate Equitable Text in Dialogue from Biased Training Data

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: The ingrained principles of fairness in a dialogue system's decision-making process and generated responses are crucial for user engagement, satisfaction, and task achievement. Absence of equitable and inclusive principles can hinder the formation of common ground, which in turn negatively impacts the overall performance of the system. For example, misusing pronouns in a user interaction may cause… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  11. arXiv:2305.14195  [pdf, other

    cs.CL cs.AI

    HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations

    Authors: Anthony Sicilia, Jennifer C. Gates, Malihe Alikhani

    Abstract: While demographic factors like age and gender change the way people talk, and in particular, the way people talk to machines, there is little investigation into how large pre-trained language models (LMs) can adapt to these changes. To remedy this gap, we consider how demographic factors in LM language skills can be measured to determine compatibility with a target demographic. We suggest clinical… ▽ More

    Submitted 5 February, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 9 figures, 5 tables

  12. arXiv:2210.07777  [pdf, other

    cs.CL cs.LG

    LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue

    Authors: Anthony Sicilia, Malihe Alikhani

    Abstract: Algorithms for text-generation in dialogue can be misguided. For example, in task-oriented settings, reinforcement learning that optimizes only task-success can lead to abysmal lexical diversity. We hypothesize this is due to poor theoretical understanding of the objectives in text-generation and their relation to the learning process (i.e., model training). To this end, we propose a new theoretic… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  13. arXiv:2207.07255  [pdf, other

    cs.CL cs.LG

    Modeling Non-Cooperative Dialogue: Theoretical and Empirical Insights

    Authors: Anthony Sicilia, Tristan Maidment, Pat Healy, Malihe Alikhani

    Abstract: Investigating cooperativity of interlocutors is central in studying pragmatics of dialogue. Models of conversation that only assume cooperative agents fail to explain the dynamics of strategic conversations. Thus, we investigate the ability of agents to identify non-cooperative interlocutors while completing a concurrent visual-dialogue task. Within this novel setting, we study the optimality of c… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  14. arXiv:2207.05685  [pdf, other

    cs.LG

    PAC-Bayesian Domain Adaptation Bounds for Multiclass Learners

    Authors: Anthony Sicilia, Katherine Atwell, Malihe Alikhani, Seong Jae Hwang

    Abstract: Multiclass neural networks are a common tool in modern unsupervised domain adaptation, yet an appropriate theoretical description for their non-uniform sample complexity is lacking in the adaptation literature. To fill this gap, we propose the first PAC-Bayesian adaptation bounds for multiclass learners. We facilitate practical use of our bounds by also proposing the first approximation techniques… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  15. arXiv:2205.06427  [pdf, other

    cs.CV cs.AI cs.LG

    Test-time Fourier Style Calibration for Domain Generalization

    Authors: Xingchen Zhao, Chang Liu, Anthony Sicilia, Seong Jae Hwang, Yun Fu

    Abstract: The topic of generalizing machine learning models learned on a collection of source domains to unknown target domains is challenging. While many domain generalization (DG) methods have achieved promising results, they primarily rely on the source domains at train-time without manipulating the target domains at test-time. Thus, it is still possible that those methods can overfit to source domains a… ▽ More

    Submitted 18 May, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: 31st International Joint Conference on Artificial Intelligence (IJCAI) 2022

  16. arXiv:2203.11317  [pdf, other

    cs.CL cs.LG

    The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser Error

    Authors: Katherine Atwell, Anthony Sicilia, Seong Jae Hwang, Malihe Alikhani

    Abstract: Discourse analysis allows us to attain inferences of a text document that extend beyond the sentence-level. The current performance of discourse models is very low on texts outside of the training distribution's coverage, diminishing the practical utility of existing models. There is need for a measure that can inform us to what extent our model generalizes from the training to the test sample whe… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  17. arXiv:2104.05600  [pdf, other

    cs.LG cs.CV stat.ML

    PAC Bayesian Performance Guarantees for Deep (Stochastic) Networks in Medical Imaging

    Authors: Anthony Sicilia, Xingchen Zhao, Anastasia Sosnovskikh, Seong Jae Hwang

    Abstract: Application of deep neural networks to medical imaging tasks has in some sense become commonplace. Still, a "thorn in the side" of the deep learning movement is the argument that deep networks are prone to overfitting and are thus unable to generalize well when datasets are small (as is common in medical imaging tasks). One way to bolster confidence is to provide mathematical guarantees, or bounds… ▽ More

    Submitted 8 July, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: MICCAI 2021

  18. arXiv:2102.13147  [pdf, other

    cs.CV cs.LG eess.IV

    Multi-Domain Learning by Meta-Learning: Taking Optimal Steps in Multi-Domain Loss Landscapes by Inner-Loop Learning

    Authors: Anthony Sicilia, Xingchen Zhao, Davneet Minhas, Erin O'Connor, Howard Aizenstein, William Klunk, Dana Tudorascu, Seong Jae Hwang

    Abstract: We consider a model-agnostic solution to the problem of Multi-Domain Learning (MDL) for multi-modal applications. Many existing MDL techniques are model-dependent solutions which explicitly require nontrivial architectural changes to construct domain-specific modules. Thus, properly applying these MDL techniques for new problems with well-established models, e.g. U-Net for semantic segmentation, m… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: IEEE International Symposium on Biomedical Imaging 2021

  19. arXiv:2102.06650  [pdf, other

    cs.CV

    Robust White Matter Hyperintensity Segmentation on Unseen Domain

    Authors: Xingchen Zhao, Anthony Sicilia, Davneet Minhas, Erin O'Connor, Howard Aizenstein, William Klunk, Dana Tudorascu, Seong Jae Hwang

    Abstract: Typical machine learning frameworks heavily rely on an underlying assumption that training and test data follow the same distribution. In medical imaging which increasingly begun acquiring datasets from multiple sites or scanners, this identical distribution assumption often fails to hold due to systematic variability induced by site or scanner dependent factors. Therefore, we cannot simply expect… ▽ More

    Submitted 16 February, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: IEEE International Symposium on Biomedical Imaging 2021

  20. arXiv:2102.03924  [pdf, other

    cs.LG cs.CV

    Domain Adversarial Neural Networks for Domain Generalization: When It Works and How to Improve

    Authors: Anthony Sicilia, Xingchen Zhao, Seong Jae Hwang

    Abstract: Theoretically, domain adaptation is a well-researched problem. Further, this theory has been well-used in practice. In particular, we note the bound on target error given by Ben-David et al. (2010) and the well-known domain-aligning algorithm based on this work using Domain Adversarial Neural Networks (DANN) presented by Ganin and Lempitsky (2015). Recently, multiple variants of DANN have been pro… ▽ More

    Submitted 18 March, 2022; v1 submitted 7 February, 2021; originally announced February 2021.