Showing 1–27 of 27 results for author: Nachman, L

  1. arXiv:2505.00875  [pdf, other]

    cs.AI

    Thoughts without Thinking: Reconsidering the Explanatory Value of Chain-of-Thought Reasoning in LLMs through Agentic Pipelines

    Authors: Ramesh Manuvinakurike, Emanuel Moss, Elizabeth Anne Watkins, Saurav Sahay, Giuseppe Raffa, Lama Nachman

    Abstract: Agentic pipelines present novel challenges and opportunities for human-centered explainability. The HCXAI community is still grappling with how best to make the inner workings of LLMs transparent in actionable ways. Agentic pipelines consist of multiple LLMs working in cooperation with minimal human control. In this research paper, we present early findings from an agentic pipeline implementation…

    Submitted 1 May, 2025; originally announced May 2025.

  2. arXiv:2503.05926  [pdf, ps, other]

    cs.HC cs.CY

    What's So Human about Human-AI Collaboration, Anyway? Generative AI and Human-Computer Interaction

    Authors: Elizabeth Anne Watkins, Emanuel Moss, Giuseppe Raffa, Lama Nachman

    Abstract: While human-AI collaboration has been a longstanding goal and topic of study for computational research, the emergence of increasingly naturalistic generative AI language models has greatly inflected the trajectory of such research. In this paper we identify how, given the language capabilities of generative AI, common features of human-human collaboration derived from the social sciences can be a…

    Submitted 7 March, 2025; originally announced March 2025.

  3. arXiv:2412.02638  [pdf, other]

    cs.CL cs.AI

    QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing

    Authors: Ramesh Manuvinakurike, Elizabeth Watkins, Celal Savur, Anthony Rhodes, Sovan Biswas, Gesem Gudino Mejia, Richard Beckwith, Saurav Sahay, Giuseppe Raffa, Lama Nachman

    Abstract: In this work we explore utilizing LLMs for data augmentation for manufacturing task guidance system. The dataset consists of representative samples of interactions with technicians working in an advanced manufacturing setting. The purpose of this work to explore the task, data augmentation for the supported tasks and evaluating the performance of the existing LLMs. We observe that that task is com…

    Submitted 3 December, 2024; originally announced December 2024.

  4. arXiv:2408.03907  [pdf, other]

    cs.CL cs.AI

    Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models

    Authors: Shachi H Kumar, Saurav Sahay, Sahisnu Mazumder, Eda Okur, Ramesh Manuvinakurike, Nicole Beckage, Hsuan Su, Hung-yi Lee, Lama Nachman

    Abstract: Large Language Models (LLMs) have excelled at language understanding and generating human-level text. However, even with supervised training and human alignment, these LLMs are susceptible to adversarial attacks where malicious users can prompt the model to generate undesirable text. LLMs also inherently encode potential biases that can cause various harmful effects during interactions. Bias evalu…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 6 pages paper content, 17 pages of appendix

  5. arXiv:2404.12241  [pdf, other]

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu…

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2311.18041  [pdf, other]

    cs.CL

    Zero-shot Conversational Summarization Evaluations with small Large Language Models

    Authors: Ramesh Manuvinakurike, Saurav Sahay, Sangeeta Manepalli, Lama Nachman

    Abstract: Large Language Models (LLMs) exhibit powerful summarization abilities. However, their capabilities on conversational summarization remains under explored. In this work we evaluate LLMs (approx. 10 billion parameters) on conversational summarization and showcase their performance on various prompts. We show that the summaries generated by models depend on the instructions and the performance of LLM…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at RoF0Mo workshop at NeurIPS 2023

  7. arXiv:2306.00482  [pdf, other]

    cs.CY cs.CL cs.SD eess.AS math.HO

    Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home

    Authors: Eda Okur, Roddy Fuentes Alba, Saurav Sahay, Lama Nachman

    Abstract: Enriching the quality of early childhood education with interactive math learning at home systems, empowered by recent advances in conversational AI technologies, is slowly becoming a reality. With this motivation, we implement a multimodal dialogue system to support play-based learning experiences at home, guiding kids to master basic math concepts. This work explores Spoken Language Understandin…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA) at ACL 2023

  8. arXiv:2302.05888  [pdf, other]

    cs.CL cs.AI cs.LG

    Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

    Authors: Hsuan Su, Shachi H Kumar, Sahisnu Mazumder, Wenda Chen, Ramesh Manuvinakurike, Eda Okur, Saurav Sahay, Lama Nachman, Shang-Tse Chen, Hung-yi Lee

    Abstract: With the power of large pretrained language models, various research works have integrated knowledge into dialogue systems. The traditional techniques treat knowledge as part of the input sequence for the dialogue system, prepending a set of knowledge statements in front of dialogue history. However, such a mechanism forces knowledge sets to be concatenated in an ordered manner, making models impl…

    Submitted 12 February, 2023; originally announced February 2023.

  9. arXiv:2211.03511  [pdf, other]

    cs.CL

    End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

    Authors: Eda Okur, Saurav Sahay, Roddy Fuentes Alba, Lama Nachman

    Abstract: The advances in language-based Artificial Intelligence (AI) technologies applied to build educational applications can present AI for social-good opportunities with a broader positive impact. Across many disciplines, enhancing the quality of mathematics education is crucial in building critical thinking and problem-solving skills at younger ages. Conversational AI systems have started maturing to…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Proceedings of the 1st Workshop on Mathematical Natural Language Processing (MathNLP) at EMNLP 2022

  10. arXiv:2211.01824  [pdf, other]

    cs.CL

    Human in the loop approaches in multi-modal conversational task guidance system development

    Authors: Ramesh Manuvinakurike, Sovan Biswas, Giuseppe Raffa, Richard Beckwith, Anthony Rhodes, Meng Shi, Gesem Gudino Mejia, Saurav Sahay, Lama Nachman

    Abstract: Development of task guidance systems for aiding humans in a situated task remains a challenging problem. The role of search (information retrieval) and conversational systems for task guidance has immense potential to help the task performers achieve various goals. However, there are several technical challenges that need to be addressed to deliver such conversational systems, where common supervi…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: SCAI @ SIGIR

  11. arXiv:2205.13754  [pdf, other]

    cs.CL cs.HC

    NLU for Game-based Learning in Real: Initial Evaluations

    Authors: Eda Okur, Saurav Sahay, Lama Nachman

    Abstract: Intelligent systems designed for play-based interactions should be contextually aware of the users and their surroundings. Spoken Dialogue Systems (SDS) are critical for these interactive agents to carry out effective goal-oriented communication with users in real-time. For the real-world (i.e., in-the-wild) deployment of such conversational agents, improving the Natural Language Understanding (NL…

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Proceedings of the Games and Natural Language Processing Workshop at LREC 2022

  12. arXiv:2205.08657  [pdf, other]

    cs.RO cs.AI cs.HC

    Intuitive and Efficient Human-robot Collaboration via Real-time Approximate Bayesian Inference

    Authors: Javier Felip Leon, David Gonzalez-Aguirre, Lama Nachman

    Abstract: The combination of collaborative robots and end-to-end AI, promises flexible automation of human tasks in factories and warehouses. However, such promise seems a few breakthroughs away. In the meantime, humans and cobots will collaborate helping each other. For these collaborations to be effective and safe, robots need to model, predict and exploit human's intents for responsive decision making pr…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 7 pages

  13. arXiv:2205.04006  [pdf, other]

    cs.CL cs.AI

    Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System

    Authors: Eda Okur, Saurav Sahay, Lama Nachman

    Abstract: Contextually aware intelligent agents are often required to understand the users and their surroundings in real-time. Our goal is to build Artificial Intelligence (AI) systems that can assist children in their learning process. Within such complex frameworks, Spoken Dialogue Systems (SDS) are crucial building blocks to handle efficient task-oriented communication with children in game-based learni…

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022)

  14. arXiv:2112.02246  [pdf, other]

    cs.CL

    Controllable Response Generation for Assistive Use-cases

    Authors: Shachi H Kumar, Hsuan Su, Ramesh Manuvinakurike, Saurav Sahay, Lama Nachman

    Abstract: Conversational agents have become an integral part of the general population for simple task enabling situations. However, these systems are yet to have any social impact on the diverse and minority population, for example, helping people with neurological disorders, for example ALS, and people with speech, language and social communication disorders. Language model technology can play a huge role…

    Submitted 4 December, 2021; originally announced December 2021.

  15. arXiv:2104.13406  [pdf, other]

    cs.CL cs.HC

    Semi-supervised Interactive Intent Labeling

    Authors: Saurav Sahay, Eda Okur, Nagib Hakim, Lama Nachman

    Abstract: Building the Natural Language Understanding (NLU) modules of task-oriented Spoken Dialogue Systems (SDS) involves a definition of intents and entities, collection of task-relevant data, annotating the data with intents and entities, and then repeating the same process over and over again for adding any functionality/enhancement to the SDS. In this work, we showcase an Intent Bulk Labeling system w…

    Submitted 11 May, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: NAACL 2021 - Workshop on Data Science with Human-in-the-loop: Language Advances (DaSH-LA)

  16. arXiv:2011.07586  [pdf, other]

    cs.CY cs.HC cs.LG

    Uncertainty as a Form of Transparency: Measuring, Communicating, and Using Uncertainty

    Authors: Umang Bhatt, Javier Antorán, Yunfeng Zhang, Q. Vera Liao, Prasanna Sattigeri, Riccardo Fogliato, Gabrielle Gauthier Melançon, Ranganath Krishnan, Jason Stanley, Omesh Tickoo, Lama Nachman, Rumi Chunara, Madhulika Srikumar, Adrian Weller, Alice Xiang

    Abstract: Algorithmic transparency entails exposing system properties to various stakeholders for purposes that include understanding, improving, and contesting predictions. Until now, most research into algorithmic transparency has predominantly focused on explainability. Explainability attempts to provide reasons for a machine learning model's behavior to stakeholders. However, understanding a model's spe…

    Submitted 4 May, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (AIES) 2021

  17. arXiv:2007.03876  [pdf, other]

    cs.CL

    Audio-Visual Understanding of Passenger Intents for In-Cabin Conversational Agents

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Lama Nachman

    Abstract: Building multimodal dialogue understanding capabilities situated in the in-cabin context is crucial to enhance passenger comfort in autonomous vehicle (AV) interaction systems. To this end, understanding passenger intents from spoken interactions and vehicle vision systems is a crucial component for developing contextual and visually grounded conversational agents for AV. Towards this goal, we exp…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: ACL 2020 - Second Grand-Challenge and Workshop on Multimodal Language (Challenge-HML)

  18. arXiv:2007.02038  [pdf, other]

    cs.CL

    Low Rank Fusion based Transformers for Multimodal Sequences

    Authors: Saurav Sahay, Eda Okur, Shachi H Kumar, Lama Nachman

    Abstract: Our senses individually work in a coordinated fashion to express our emotional intentions. In this work, we experiment with modeling modality-specific sensory signals to attend to our latent multimodal emotional intentions and vice versa expressed via low-rank multimodal fusion and multimodal transformers. The low-rank factorization of multimodal fusion amongst the modalities helps represent appro…

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: ACL 2020 workshop on Second Grand Challenge and Workshop on Multimodal Language

  19. Optimizing User Interface Layouts via Gradient Descent

    Authors: Peitong Duan, Casimir Wierzynski, Lama Nachman

    Abstract: Automating parts of the user interface (UI) design process has been a longstanding challenge. We present an automated technique for optimizing the layouts of mobile UIs. Our method uses gradient descent on a neural network model of task performance with respect to the model's inputs to make layout modifications that result in improved predicted error rates and task completion times. We start by ex…

    Submitted 25 February, 2020; originally announced February 2020.

  20. arXiv:1912.10132  [pdf, ps, other]

    cs.CL

    Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog

    Authors: Shachi H Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman

    Abstract: We are witnessing a confluence of vision, speech and dialog system technologies that are enabling the IVAs to learn audio-visual groundings of utterances and have conversations with users about the objects, activities and events surrounding them. Recent progress in visual grounding techniques and Audio Understanding are enabling machines to understand shared semantic concepts and listen to the var…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented at the Visual Question Answering and Dialog Workshop, CVPR 2019, Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1912.10131

  21. arXiv:1912.10131  [pdf, other]

    cs.MM cs.CL cs.SD eess.AS

    Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

    Authors: Shachi H Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman

    Abstract: With the recent advancements in Artificial Intelligence (AI), Intelligent Virtual Assistants (IVA) such as Alexa, Google Home, etc., have become a ubiquitous part of many homes. Currently, such IVAs are mostly audio-based, but going forward, we are witnessing a confluence of vision, speech and dialog system technologies that are enabling the IVAs to learn audio-visual groundings of utterances. Thi…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented at the 3rd Visually Grounded Interaction and Language (ViGIL) Workshop, NeurIPS 2019, Vancouver, Canada. arXiv admin note: substantial text overlap with arXiv:1812.08407, arXiv:1912.10132

  22. arXiv:1912.10130  [pdf, other]

    cs.CL

    Modeling Intent, Dialog Policies and Response Adaptation for Goal-Oriented Interactions

    Authors: Saurav Sahay, Shachi H Kumar, Eda Okur, Haroon Syed, Lama Nachman

    Abstract: Building a machine learning driven spoken dialog system for goal-oriented interactions involves careful design of intents and data collection along with development of intent recognition models and dialog policy learning algorithms. The models should be robust enough to handle various user distractions during the interaction flow and should steer the user back into an engaging interaction for succ…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented as a full-paper at the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2019 - LondonLogue), Sep 4-6, 2019, London, UK

    Journal ref: Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SEMDIAL), pp. 146-155, London, United Kingdom, September 2019

  23. arXiv:1909.13714  [pdf, ps, other]

    cs.MM cs.CL cs.CV

    Towards Multimodal Understanding of Passenger-Vehicle Interactions in Autonomous Vehicles: Intent/Slot Recognition Utilizing Audio-Visual Data

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Lama Nachman

    Abstract: Understanding passenger intents from spoken interactions and car's vision (both inside and outside the vehicle) are important building blocks towards developing contextual dialog systems for natural interactions in autonomous vehicles (AV). In this study, we continued exploring AMIE (Automated-vehicle Multimodal In-cabin Experience), the in-cabin agent responsible for handling certain multimodal p…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Presented as a short-paper at the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2019 - LondonLogue), Sep 4-6, 2019, London, UK

    Journal ref: Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SEMDIAL), pp. 213-215, London, United Kingdom, September 2019

  24. arXiv:1904.10500  [pdf, other]

    cs.CL cs.HC cs.LG cs.SD eess.AS

    Natural Language Interactions in Autonomous Vehicles: Intent Detection and Slot Filling from Passenger Utterances

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Asli Arslan Esme, Lama Nachman

    Abstract: Understanding passenger intents and extracting relevant slots are important building blocks towards developing contextual dialogue systems for natural interactions in autonomous vehicles (AV). In this work, we explored AMIE (Automated-vehicle Multi-modal In-cabin Experience), the in-cabin agent responsible for handling certain passenger-vehicle interactions. When the passengers give instructions t…

    Submitted 23 April, 2019; originally announced April 2019.

    Comments: Accepted and presented as a full paper at 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2019), April 7-13, 2019, La Rochelle, France

    Journal ref: Springer LNCS Proceedings for CICLing 2019

  25. arXiv:1901.04899  [pdf, ps, other]

    cs.CL

    Conversational Intent Understanding for Passengers in Autonomous Vehicles

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Asli Arslan Esme, Lama Nachman

    Abstract: Understanding passenger intents and extracting relevant slots are important building blocks towards developing a contextual dialogue system responsible for handling certain vehicle-passenger interactions in autonomous vehicles (AV). When the passengers give instructions to AMIE (Automated-vehicle Multimodal In-cabin Experience), the agent should parse such commands properly and trigger the appropr…

    Submitted 13 December, 2018; originally announced January 2019.

  26. arXiv:1812.08407  [pdf, other]

    cs.CL

    Context, Attention and Audio Feature Explorations for Audio Visual Scene-Aware Dialog

    Authors: Shachi H Kumar, Eda Okur, Saurav Sahay, Juan Jose Alvarado Leanos, Jonathan Huang, Lama Nachman

    Abstract: With the recent advancements in AI, Intelligent Virtual Assistants (IVA) have become a ubiquitous part of every home. Going forward, we are witnessing a confluence of vision, speech and dialog system technologies that are enabling the IVAs to learn audio-visual groundings of utterances and have conversations with users about the objects, activities and events surrounding them. As a part of the 7th…

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: 7 pages, 2 figures, DSTC7 workshop at AAAI 2019

  27. arXiv:1806.02923  [pdf, other]

    cs.CL

    Multimodal Relational Tensor Network for Sentiment and Emotion Classification

    Authors: Saurav Sahay, Shachi H Kumar, Rui Xia, Jonathan Huang, Lama Nachman

    Abstract: Understanding Affect from video segments has brought researchers from the language, audio and video domains together. Most of the current multimodal research in this area deals with various techniques to fuse the modalities, and mostly treat the segments of a video independently. Motivated by the work of (Zadeh et al., 2017) and (Poria et al., 2017), we present our architecture, Relational Tensor…

    Submitted 7 June, 2018; originally announced June 2018.