Skip to main content

Showing 1–14 of 14 results for author: Bonial, C

.
  1. arXiv:2505.23323  [pdf, ps, other

    cs.CL

    Neither Stochastic Parroting nor AGI: LLMs Solve Tasks through Context-Directed Extrapolation from Training Data Priors

    Authors: Harish Tayyar Madabushi, Melissa Torgbi, Claire Bonial

    Abstract: In this position paper we raise critical awareness of a realistic view of LLM capabilities that eschews extreme alternative views that LLMs are either "stochastic parrots" or in possession of "emergent" advanced reasoning capabilities, which, due to their unpredictable emergence, constitute an existential threat. Our middle-ground view is that LLMs extrapolate from priors from their training data,… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2503.12370  [pdf, other

    cs.CL

    Understanding Common Ground Misalignment in Goal-Oriented Dialog: A Case-Study with Ubuntu Chat Logs

    Authors: Rupak Sarkar, Neha Srikanth, Taylor Hudson, Rachel Rudinger, Claire Bonial, Philip Resnik

    Abstract: While it is commonly accepted that maintaining common ground plays a role in conversational success, little prior research exists connecting conversational grounding to success in task-oriented conversations. We study failures of grounding in the Ubuntu IRC dataset, where participants use text-only communication to resolve technical issues. We find that disruptions in conversational flow often ste… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 8 pages

  3. arXiv:2502.18452  [pdf, other

    cs.CL cs.AI

    FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response

    Authors: Mollie Shichman, Claire Bonial, Austin Blodgett, Taylor Hudson, Francis Ferraro, Rachel Rudinger

    Abstract: Large Language Models (LLMs) have the potential for substantial common sense reasoning. However, these capabilities are often emergent in larger models. This means smaller models that can be run locally are less helpful and capable with respect to certain reasoning tasks. To meet our problem space requirements, we fine-tune smaller LLMs to disaster domains, as these domains involve complex and low… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: 8 pages, 3 figures, 5 tables

  4. arXiv:2501.04661  [pdf, other

    cs.CL cs.AI

    Assessing Language Comprehension in Large Language Models Using Construction Grammar

    Authors: Wesley Scivetti, Melissa Torgbi, Austin Blodgett, Mollie Shichman, Taylor Hudson, Claire Bonial, Harish Tayyar Madabushi

    Abstract: Large Language Models, despite their significant capabilities, are known to fail in surprising and unpredictable ways. Evaluating their true `understanding' of language is particularly challenging due to the extensive web-scale data they are trained on. Therefore, we construct an evaluation to systematically assess natural language understanding (NLU) in LLMs by leveraging Construction Grammar (Cx… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  5. arXiv:2411.12844  [pdf, other

    cs.HC cs.CL cs.RO

    SCOUT: A Situated and Multi-Modal Human-Robot Dialogue Corpus

    Authors: Stephanie M. Lukin, Claire Bonial, Matthew Marge, Taylor Hudson, Cory J. Hayes, Kimberly A. Pollard, Anthony Baker, Ashley N. Foots, Ron Artstein, Felix Gervits, Mitchell Abrams, Cassidy Henry, Lucia Donatelli, Anton Leuski, Susan G. Hill, David Traum, Clare R. Voss

    Abstract: We introduce the Situated Corpus Of Understanding Transactions (SCOUT), a multi-modal collection of human-robot dialogue in the task domain of collaborative exploration. The corpus was constructed from multiple Wizard-of-Oz experiments where human participants gave verbal instructions to a remotely-located robot to move and gather information about its surroundings. SCOUT contains 89,056 utterance… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 14 pages, 7 figures

    ACM Class: I.2.7; I.2.9; I.2.10; H.5.2; J.7

    Journal ref: 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) https://aclanthology.org/2024.lrec-main.1259/

  6. Human-Robot Dialogue Annotation for Multi-Modal Common Ground

    Authors: Claire Bonial, Stephanie M. Lukin, Mitchell Abrams, Anthony Baker, Lucia Donatelli, Ashley Foots, Cory J. Hayes, Cassidy Henry, Taylor Hudson, Matthew Marge, Kimberly A. Pollard, Ron Artstein, David Traum, Clare R. Voss

    Abstract: In this paper, we describe the development of symbolic representations annotated on human-robot dialogue data to make dimensions of meaning accessible to autonomous systems participating in collaborative, natural language dialogue, and to enable common ground with human partners. A particular challenge for establishing common ground arises in remote dialogue (occurring in disaster relief or search… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 52 pages, 14 figures

    ACM Class: I.2.7; I.2.9; I.2.10; H.5.2; J.7

    Journal ref: Language Resources and Evaluation 2024

  7. arXiv:2310.17568  [pdf, other

    cs.HC cs.CL cs.RO

    Navigating to Success in Multi-Modal Human-Robot Collaboration: Analysis and Corpus Release

    Authors: Stephanie M. Lukin, Kimberly A. Pollard, Claire Bonial, Taylor Hudson, Ron Arstein, Clare Voss, David Traum

    Abstract: Human-guided robotic exploration is a useful approach to gathering information at remote locations, especially those that might be too risky, inhospitable, or inaccessible for humans. Maintaining common ground between the remotely-located partners is a challenge, one that can be facilitated by multi-modal communication. In this paper, we explore how participants utilized multiple modalities to inv… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures

    Journal ref: Proceedings of the 2023 IEEE Robot and Human Interactive Communication Conference

  8. arXiv:2305.14331  [pdf, other

    cs.CL cs.AI

    What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems

    Authors: Navita Goyal, Eleftheria Briakou, Amanda Liu, Connor Baumler, Claire Bonial, Jeffrey Micher, Clare R. Voss, Marine Carpuat, Hal Daumé III

    Abstract: NLP systems have shown impressive performance at answering questions by retrieving relevant context. However, with the increasingly large models, it is impossible and often undesirable to constrain models' knowledge or reasoning to only the retrieved context. This leads to a mismatch between the information that the models access to derive the answer and the information that is available to the us… ▽ More

    Submitted 25 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  9. arXiv:1906.00038  [pdf, other

    cs.CL cs.CV

    Visual Understanding and Narration: A Deeper Understanding and Explanation of Visual Scenes

    Authors: Stephanie M. Lukin, Claire Bonial, Clare R. Voss

    Abstract: We describe the task of Visual Understanding and Narration, in which a robot (or agent) generates text for the images that it collects when navigating its environment, by answering open-ended questions, such as 'what happens, or might have happened, here?'

    Submitted 23 September, 2019; v1 submitted 31 May, 2019; originally announced June 2019.

    Comments: 2-page extended abstract, presented at the Workshop on Shortcomings in Vision and Language (SiVL), 2019, at the North American Association for Computational Linguistics (NAACL)

  10. arXiv:1810.02017  [pdf, other

    cs.RO cs.HC

    Balancing Efficiency and Coverage in Human-Robot Dialogue Collection

    Authors: Matthew Marge, Claire Bonial, Stephanie Lukin, Cory Hayes, Ashley Foots, Ron Artstein, Cassidy Henry, Kimberly Pollard, Carla Gordon, Felix Gervits, Anton Leuski, Susan Hill, Clare Voss, David Traum

    Abstract: We describe a multi-phased Wizard-of-Oz approach to collecting human-robot dialogue in a collaborative search and navigation task. The data is being used to train an initial automated robot dialogue system to support collaborative exploration tasks. In the first phase, a wizard freely typed robot utterances to human participants. For the second phase, this data was used to design a GUI that includ… ▽ More

    Submitted 7 October, 2018; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

    Report number: AI-HRI/2018/01

  11. arXiv:1807.08076  [pdf, ps, other

    cs.CL cs.HC cs.RO

    Consequences and Factors of Stylistic Differences in Human-Robot Dialogue

    Authors: Stephanie M. Lukin, Kimberly A. Pollard, Claire Bonial, Matthew Marge, Cassidy Henry, Ron Arstein, David Traum, Clare R. Voss

    Abstract: This paper identifies stylistic differences in instruction-giving observed in a corpus of human-robot dialogue. Differences in verbosity and structure (i.e., single-intent vs. multi-intent instructions) arose naturally without restrictions or prior guidance on how users should speak with the robot. Different styles were found to produce different rates of miscommunication, and correlations were fo… ▽ More

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: Originally published in the Proceedings of the 19th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2018

  12. arXiv:1805.01818  [pdf, other

    cs.CV

    Object and Text-guided Semantics for CNN-based Activity Recognition

    Authors: Sungmin Eum, Christopher Reale, Heesung Kwon, Claire Bonial, Clare Voss

    Abstract: Many previous methods have demonstrated the importance of considering semantically relevant objects for carrying out video-based human activity recognition, yet none of the methods have harvested the power of large text corpora to relate the objects and the activities to be transferred into learning a unified deep convolutional neural network. We present a novel activity recognition CNN which co-l… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: Submitted to ICIP 2018

  13. arXiv:1710.06406  [pdf, other

    cs.CL cs.AI cs.HC cs.RO

    Laying Down the Yellow Brick Road: Development of a Wizard-of-Oz Interface for Collecting Human-Robot Dialogue

    Authors: Claire Bonial, Matthew Marge, Ron artstein, Ashley Foots, Felix Gervits, Cory J. Hayes, Cassidy Henry, Susan G. Hill, Anton Leuski, Stephanie M. Lukin, Pooja Moolchandani, Kimberly A. Pollard, David Traum, Clare R. Voss

    Abstract: We describe the adaptation and refinement of a graphical user interface designed to facilitate a Wizard-of-Oz (WoZ) approach to collecting human-robot dialogue data. The data collected will be used to develop a dialogue system for robot navigation. Building on an interface previously used in the development of dialogue systems for virtual agents and video playback, we add templates with open param… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

    Comments: 7 pages, 2 figures, accepted for oral presentation at the Symposium on Natural Communication for Human-Robot Collaboration, AAAI Fall Symposium Series, November 9-11, 2017, https://www.aaai.org/ocs/index.php/FSS/FSS17

  14. arXiv:1703.03714  [pdf

    cs.CL cs.AI cs.HC cs.RO

    Applying the Wizard-of-Oz Technique to Multimodal Human-Robot Dialogue

    Authors: Matthew Marge, Claire Bonial, Brendan Byrne, Taylor Cassidy, A. William Evans, Susan G. Hill, Clare Voss

    Abstract: Our overall program objective is to provide more natural ways for soldiers to interact and communicate with robots, much like how soldiers communicate with other soldiers today. We describe how the Wizard-of-Oz (WOz) method can be applied to multimodal human-robot dialogue in a collaborative exploration task. While the WOz method can help design robot behaviors, traditional approaches place the bu… ▽ More

    Submitted 10 March, 2017; originally announced March 2017.

    Comments: Presented at the 2016 IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), Interactive Session, August 26-31, 2016