Showing 1–50 of 63 results for author: Mooney, R

Searching in archive cs.
  1. arXiv:2502.10886  [pdf, other]

    cs.CL

    MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models

    Authors: Vanya Cohen, Raymond Mooney

    Abstract: Entity tracking is a fundamental challenge in natural language understanding, requiring models to maintain coherent representations of entities. Previous work has benchmarked entity tracking performance in purely text-based tasks. We introduce MET-Bench, a multimodal entity tracking benchmark designed to evaluate the ability of vision-language models to track entity states across modalities. Using…

    Submitted 15 February, 2025; originally announced February 2025.

  2. arXiv:2501.12539  [pdf, other]

    cs.LG cs.CL

    Compositional Instruction Following with Language Models and Reinforcement Learning

    Authors: Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay, Ray Mooney, Benjamin Rosman

    Abstract: Combining reinforcement learning with language grounding is challenging as the agent needs to explore the environment while simultaneously learning multiple language-conditioned tasks. To address this, we introduce a novel method: the compositionally-enabled reinforcement learning language agent (CERLLA). Our method reduces the sample complexity of tasks specified with language by leveraging compo…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: TMLR 2024

  3. arXiv:2409.12306  [pdf, other]

    cs.CL cs.CV cs.SD eess.AS

    Measuring Sound Symbolism in Audio-visual Models

    Authors: Wei-Cheng Tseng, Yi-Jen Shih, David Harwath, Raymond Mooney

    Abstract: Audio-visual pre-trained models have gained substantial attention recently and demonstrated superior performance on various audio-visual tasks. This study investigates whether pre-trained audio-visual models demonstrate non-arbitrary associations between sounds and visual representations–known as sound symbolism–which is also observed in humans. We developed a speci…

    Submitted 11 November, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: SLT 2024

  4. arXiv:2406.15823  [pdf, other]

    cs.CL

    CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans

    Authors: Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Raymond Mooney

    Abstract: Understanding the abilities of LLMs to reason about natural language plans, such as instructional text and recipes, is critical to reliably using them in decision-making systems. A fundamental aspect of plans is the temporal order in which their steps need to be executed, which reflects the underlying causal dependencies between them. We introduce CaT-Bench, a benchmark of Step Order Prediction q…

    Submitted 7 January, 2025; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted to EMNLP 2024 Main Conference

  5. arXiv:2406.06438  [pdf, other]

    cs.CL cs.CV cs.HC cs.LG cs.SD eess.AS

    Multimodal Contextualized Semantic Parsing from Speech

    Authors: Jordan Voas, Raymond Mooney, David Harwath

    Abstract: We introduce Semantic Parsing in Contextual Environments (SPICE), a task designed to enhance artificial agents' contextual awareness by integrating multimodal inputs with prior contexts. SPICE goes beyond traditional semantic parsing by offering a structured, interpretable framework for dynamically updating an agent's knowledge with new information, mirroring the complexity of human communication.…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 10 Pages, 3 figures, ACL 2024 Main

  6. arXiv:2405.13245  [pdf, other]

    cs.RO cs.AI cs.CL

    A Survey of Robotic Language Grounding: Tradeoffs between Symbols and Embeddings

    Authors: Vanya Cohen, Jason Xinyu Liu, Raymond Mooney, Stefanie Tellex, David Watkins

    Abstract: With large language models, robots can understand language more flexibly and more capably than ever before. This survey reviews and situates recent literature into a spectrum with two poles: 1) mapping between language and some manually defined formal representation of meaning, and 2) mapping between language and high-dimensional vector spaces that translate directly to low-level robot policy. Usi…

    Submitted 22 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024 Survey Track

  7. arXiv:2405.10020  [pdf, other]

    cs.RO cs.CL cs.CV cs.LG

    Natural Language Can Help Bridge the Sim2Real Gap

    Authors: Albert Yu, Adeline Foote, Raymond Mooney, Roberto Martín-Martín

    Abstract: The main challenge in learning image-conditioned robotic policies is acquiring a visual representation conducive to low-level control. Due to the high dimensionality of the image space, learning a good visual representation requires a considerable amount of visual data. However, when learning in the real world, data is expensive. Sim2Real is a promising paradigm for overcoming data scarcity in the…

    Submitted 2 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: To appear in RSS 2024. Project website at https://robin-lab.cs.utexas.edu/lang4sim2real/

    ACM Class: I.2.9; I.2.7; I.2.6

  8. arXiv:2404.08148  [pdf, other]

    cs.CL

    Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs

    Authors: Jierui Li, Raymond Mooney

    Abstract: Distilling explicit chain-of-thought reasoning paths has emerged as an effective method for improving the reasoning abilities of large language models (LLMs) across various tasks. However, when tackling complex tasks that pose significant challenges for state-of-the-art models, this technique often struggles to produce effective chains of thought that lead to correct answers. In this work, we prop…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: pre-print

  9. arXiv:2404.01158  [pdf, other]

    cs.CL cs.RO

    Dialogue with Robots: Proposals for Broadening Participation and Research in the SLIVAR Community

    Authors: Casey Kennington, Malihe Alikhani, Heather Pon-Barry, Katherine Atwell, Yonatan Bisk, Daniel Fried, Felix Gervits, Zhao Han, Mert Inan, Michael Johnston, Raj Korpan, Diane Litman, Matthew Marge, Cynthia Matuszek, Ross Mead, Shiwali Mohan, Raymond Mooney, Natalie Parde, Jivko Sinapov, Angela Stewart, Matthew Stone, Stefanie Tellex, Tom Williams

    Abstract: The ability to interact with machines using natural human language is becoming not just commonplace, but expected. The next step is not just text interfaces, but speech interfaces and not just with computers, but with all machines including robots. In this paper, we chronicle the recent history of this growing field of spoken dialogue with robots and offer the community three proposals, the first…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: NSF Report on the "Dialogue with Robots" Workshop held in Pittsburgh, PA, April 2023

  10. arXiv:2402.10890  [pdf, other]

    cs.CL cs.AI cs.LG

    When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

    Authors: Ziru Chen, Michael White, Raymond Mooney, Ali Payani, Yu Su, Huan Sun

    Abstract: In this paper, we examine how large language models (LLMs) solve multi-step problems under a language agent framework with three components: a generator, a discriminator, and a planning method. We investigate the practical utility of two advanced planning methods, iterative correction and tree search. We present a comprehensive analysis of how discrimination accuracy affects the overall performanc…

    Submitted 6 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 main

  11. arXiv:2401.04055  [pdf, other]

    cs.IR

    Sparse Meets Dense: A Hybrid Approach to Enhance Scientific Document Retrieval

    Authors: Priyanka Mandikal, Raymond Mooney

    Abstract: Traditional information retrieval is based on sparse bag-of-words vector representations of documents and queries. More recent deep-learning approaches have used dense embeddings learned using a transformer-based large language model. We show that, on a classic benchmark for scientific document retrieval in the medical domain of cystic fibrosis, both of these models perform roughly equivalently…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted at SDU-AAAI 2024

  12. arXiv:2309.10248  [pdf, other]

    cs.CL cs.GR cs.LG

    What is the Best Automated Metric for Text to Motion Generation?

    Authors: Jordan Voas, Yili Wang, Qixing Huang, Raymond Mooney

    Abstract: There is growing interest in generating skeleton-based human motions from natural language descriptions. While most efforts have focused on developing better neural architectures for this task, there has been no significant work on determining the proper evaluation metric. Human evaluation is the ultimate accuracy measure for this task, and automated metrics should correlate well with human qualit…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, SIGGRAPH Asia 2023 Conference

  13. arXiv:2307.05337  [pdf, other]

    cs.CL

    Explaining Competitive-Level Programming Solutions using LLMs

    Authors: Jierui Li, Szymon Tworkowski, Yingying Wu, Raymond Mooney

    Abstract: In this paper, we approach competitive-level programming problem-solving as a composite task of reasoning and code generation. We propose a novel method to automatically annotate natural language explanations to <problem, solution> pairs. We show that despite poor performance in solving competitive-level programming problems, state-of-the-art LLMs exhibit a strong capacity in describing a…

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 14 pages, presented at the 1st NLRSE workshop

  14. arXiv:2305.13073  [pdf, other]

    cs.CL cs.AI cs.DB cs.LG

    Text-to-SQL Error Correction with Language Models of Code

    Authors: Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su, Huan Sun

    Abstract: Despite recent progress in text-to-SQL parsing, current semantic parsers are still not accurate enough for practical use. In this paper, we investigate how to build automatic text-to-SQL error correction models. Noticing that token-level edits are out of context and sometimes ambiguous, we propose building clause-level edit models instead. Besides, while most language models of code are not specif…

    Submitted 28 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Short Paper

  15. arXiv:2302.10166  [pdf, other]

    cs.SE cs.CL cs.LG

    Learning Deep Semantics for Test Completion

    Authors: Pengyu Nie, Rahul Banerjee, Junyi Jessy Li, Raymond J. Mooney, Milos Gligoric

    Abstract: Writing tests is a time-consuming yet essential task during software development. We propose to leverage recent advances in deep learning for text and code generation to assist developers in writing tests. We formalize the novel task of test completion to automatically complete the next statement in a test method based on the context of prior statements and the code under test. We develop TeCo --…

    Submitted 7 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted as a conference paper in ICSE 2023

  16. arXiv:2301.09770  [pdf, other]

    cs.AI

    Language-guided Task Adaptation for Imitation Learning

    Authors: Prasoon Goyal, Raymond J. Mooney, Scott Niekum

    Abstract: We introduce a novel setting, wherein an agent needs to learn a task from a demonstration of a related task with the difference between the tasks communicated in natural language. The proposed setting allows reusing demonstrations from other tasks, by providing low effort language descriptions, and can also be used to provide feedback to correct agent errors, which are both important desiderata fo…

    Submitted 23 January, 2023; originally announced January 2023.

  17. arXiv:2211.09935  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    CAPE: Corrective Actions from Precondition Errors using Large Language Models

    Authors: Shreyas Sundara Raman, Vanya Cohen, Ifrah Idrees, Eric Rosen, Ray Mooney, Stefanie Tellex, David Paulius

    Abstract: Extracting commonsense knowledge from a large language model (LLM) offers a path to designing intelligent robots. Existing approaches that leverage LLMs for planning are unable to recover when an action fails and often resort to retrying failed actions, without resolving the error's underlying cause. We propose a novel approach (CAPE) that attempts to propose corrective actions to resolve precondi…

    Submitted 9 March, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 17 pages, 6 figures, accepted at ICRA 2024

    MSC Class: 68T20; 68T50 ACM Class: I.2.7; I.2.8; I.2.2; I.2.4

  18. arXiv:2211.06335  [pdf]

    cs.SE cs.CL

    Using Developer Discussions to Guide Fixing Bugs in Software

    Authors: Sheena Panthaplackel, Milos Gligoric, Junyi Jessy Li, Raymond J. Mooney

    Abstract: Automatically fixing software bugs is a challenging task. While recent work showed that natural language context is useful in guiding bug-fixing models, the approach required prompting developers to provide this context, which was simulated through commit messages written after the bug-fixing code changes were made. We instead propose using bug report discussions, which are available before the ta…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Accepted in the Findings of EMNLP 2022

  19. arXiv:2211.02178  [pdf, other]

    cs.CV cs.CL

    Zero-shot Video Moment Retrieval With Off-the-Shelf Models

    Authors: Anuj Diwan, Puyuan Peng, Raymond J. Mooney

    Abstract: For the majority of the machine learning community, the expensive nature of collecting high-quality human-annotated data and the inability to efficiently finetune very large state-of-the-art pretrained models on limited compute are major bottlenecks for building models for new tasks. We propose a zero-shot simple approach for one such task, Video Moment Retrieval (VMR), that does not perform any a…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: Accepted to the NeurIPS 2022 Workshop on Transfer Learning for NLP (TL4NLP). 12 pages, 5 figures

  20. arXiv:2210.10176  [pdf, other]

    cs.CL cs.CV

    Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

    Authors: Jialin Wu, Raymond J. Mooney

    Abstract: Most Outside-Knowledge Visual Question Answering (OK-VQA) systems employ a two-stage framework that first retrieves external knowledge given the visual question and then predicts the answer based on the retrieved content. However, the retrieved knowledge is often inadequate. Retrievals are frequently too general and fail to cover specific knowledge needed to answer the question. Also, the naturall…

    Submitted 20 October, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  21. arXiv:2210.04476  [pdf, other]

    cs.RO cs.CL cs.LG

    Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks

    Authors: Albert Yu, Raymond J. Mooney

    Abstract: Demonstrations and natural language instructions are two common ways to specify and teach robots novel tasks. However, for many complex tasks, a demonstration or language instruction alone contains ambiguities, preventing tasks from being specified clearly. In such cases, a combination of both a demonstration and an instruction more concisely and effectively conveys the task to the robot than eith…

    Submitted 28 April, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 24 pages, 10 figures. Project website at https://deltaco-robot.github.io/

    ACM Class: I.2.9; I.2.7; I.2.6

  22. arXiv:2201.05017  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Automated Error Analysis: Learning to Characterize Errors

    Authors: Tong Gao, Shivang Singh, Raymond J. Mooney

    Abstract: Characterizing the patterns of errors that a system makes helps researchers focus future development on increasing its accuracy and robustness. We propose a novel form of "meta learning" that automatically learns interpretable rules that characterize the types of errors that a system makes, and demonstrate these rules' ability to help understand and improve two NLP systems. Our approach works by c…

    Submitted 13 February, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

    Comments: 12 pages, 11 figures

  23. arXiv:2110.09935  [pdf, ps, other]

    cs.LG eess.SP

    Random Feature Approximation for Online Nonlinear Graph Topology Identification

    Authors: Rohan Money, Joshin Krishnan, Baltasar Beferull-Lozano

    Abstract: Online topology estimation of graph-connected time series is challenging, especially since the causal dependencies in many real-world networks are nonlinear. In this paper, we propose a kernel-based algorithm for graph topology estimation. The algorithm uses a Fourier-based Random feature approximation to tackle the curse of dimensionality associated with the kernel representations. Exploiting the…

    Submitted 19 October, 2021; originally announced October 2021.

  24. arXiv:2110.04353  [pdf, other]

    cs.CL cs.SE

    Learning to Describe Solutions for Bug Reports Based on Developer Discussions

    Authors: Sheena Panthaplackel, Junyi Jessy Li, Milos Gligoric, Raymond J. Mooney

    Abstract: When a software bug is reported, developers engage in a discussion to collaboratively resolve it. While the solution is likely formulated within the discussion, it is often buried in a large amount of text, making it difficult to comprehend and delaying its implementation. To expedite bug resolution, we propose generating a concise natural language description of the solution by synthesizing relev…

    Submitted 30 March, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Accepted in Findings of ACL 2022

  25. arXiv:2108.09619  [pdf, other]

    cs.SE cs.LG

    Impact of Evaluation Methodologies on Code Summarization

    Authors: Pengyu Nie, Jiyang Zhang, Junyi Jessy Li, Raymond J. Mooney, Milos Gligoric

    Abstract: There has been a growing interest in developing machine learning (ML) models for code summarization tasks, e.g., comment generation and method naming. Despite a substantial increase in the effectiveness of ML models, the evaluation methodologies, i.e., the way people split datasets into training, validation, and test sets, were not well studied. Specifically, no prior work on code summarization cons…

    Submitted 5 April, 2022; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: Accepted as a conference paper in ACL 2022

  26. TellMeWhy: A Dataset for Answering Why-Questions in Narratives

    Authors: Yash Kumar Lal, Nathanael Chambers, Raymond Mooney, Niranjan Balasubramanian

    Abstract: Answering questions about why characters perform certain actions is central to understanding and reasoning about narratives. Despite recent progress in QA, it is not clear if existing models have the ability to answer "why" questions that may require commonsense knowledge external to the input narrative. In this work, we introduce TellMeWhy, a new crowd-sourced dataset that consists of more than 3…

    Submitted 17 August, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted to Findings of ACL, 2021. Data and evaluation suite available at http://lunr.cs.stonybrook.edu/tellmewhy

  27. arXiv:2106.02972  [pdf, other]

    cs.AI cs.CL cs.LG

    Zero-shot Task Adaptation using Natural Language

    Authors: Prasoon Goyal, Raymond J. Mooney, Scott Niekum

    Abstract: Imitation learning and instruction-following are two common approaches to communicate a user's intent to a learning agent. However, as the complexity of tasks grows, it could be beneficial to use both demonstrations and language to communicate with an agent. In this work, we propose a novel setting where an agent is given both a demonstration and a description, and must combine information from bo…

    Submitted 5 June, 2021; originally announced June 2021.

  28. arXiv:2103.13426  [pdf, other]

    cs.CL cs.LG cs.SE

    Learning to Generate Code Comments from Class Hierarchies

    Authors: Jiyang Zhang, Sheena Panthaplackel, Pengyu Nie, Raymond J. Mooney, Junyi Jessy Li, Milos Gligoric

    Abstract: Descriptive code comments are essential for supporting code comprehension and maintenance. We propose the task of automatically generating comments for overriding methods. We formulate a novel framework which accommodates the unique contextual and linguistic reasoning that is required for performing this task. Our approach features: (1) incorporating context from the class hierarchy; (2) condition…

    Submitted 17 April, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  29. arXiv:2010.01625  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    Deep Just-In-Time Inconsistency Detection Between Comments and Source Code

    Authors: Sheena Panthaplackel, Junyi Jessy Li, Milos Gligoric, Raymond J. Mooney

    Abstract: Natural language comments convey key aspects of source code such as implementation, usage, and pre- and post-conditions. Failure to update comments accordingly when the corresponding code is modified introduces inconsistencies, which is known to lead to confusion and software bugs. In this paper, we aim to detect whether a comment becomes inconsistent as a result of changes to the corresponding bo…

    Submitted 26 December, 2020; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: Accepted in AAAI 2021

  30. arXiv:2009.05552  [pdf, other]

    cs.AI cs.CL

    Systematic Generalization on gSCAN with Language Conditioned Embedding

    Authors: Tong Gao, Qi Huang, Raymond J. Mooney

    Abstract: Systematic Generalization refers to a learning algorithm's ability to extrapolate learned behavior to unseen situations that are distinct but semantically similar to its training data. As shown in recent work, state-of-the-art deep learning models fail dramatically even on tasks for which they are designed when the test set is systematically different from the training data. We hypothesize that ex…

    Submitted 4 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Accepted by AACL-IJCNLP 2020. Huang and Gao share co-first authorship, authors contribute equally and are listed in alphabetical order

  31. arXiv:2007.15543  [pdf, other]

    cs.LG cs.AI stat.ML

    PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards

    Authors: Prasoon Goyal, Scott Niekum, Raymond J. Mooney

    Abstract: Reinforcement learning (RL), particularly in sparse reward settings, often requires prohibitively large numbers of interactions with the environment, thereby limiting its applicability to complex problems. To address this, several prior approaches have used natural language to guide the agent's exploration. However, these approaches typically operate on structured representations of the environmen…

    Submitted 19 November, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Conference on Robot Learning (CoRL), 2020

  32. arXiv:2006.15631  [pdf, other]

    cs.CV

    Improving VQA and its Explanations by Comparing Competing Explanations

    Authors: Jialin Wu, Liyan Chen, Raymond J. Mooney

    Abstract: Most recent state-of-the-art Visual Question Answering (VQA) systems are opaque black boxes that are only trained to fit the answer distribution given the question and visual content. As a result, these systems frequently take shortcuts, focusing on simple visual concepts or question priors. This phenomenon becomes more problematic as the questions become more complex and require more reasoning and c…

    Submitted 28 June, 2020; originally announced June 2020.

  33. arXiv:2006.14767  [pdf, ps, other]

    cs.CL

    Dialog as a Vehicle for Lifelong Learning

    Authors: Aishwarya Padmakumar, Raymond J. Mooney

    Abstract: Dialog systems research has primarily been focused around two main types of applications - task-oriented dialog systems that learn to use clarification to aid in understanding a goal, and open-ended dialog systems that are expected to carry out unconstrained "chit chat" conversations. However, dialog interactions can also be used to obtain various types of knowledge that can be used to improve an…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Position Paper Track at the SIGDIAL Special Session on Physically Situated Dialogue (RoboDial 2.0) - Camera Ready Version

  34. arXiv:2006.05456  [pdf, other]

    cs.CV cs.CL cs.LG

    Dialog Policy Learning for Joint Clarification and Active Learning Queries

    Authors: Aishwarya Padmakumar, Raymond J. Mooney

    Abstract: Intelligent systems need to be able to recover from mistakes, resolve uncertainty, and adapt to novel concepts not seen during training. Dialog interaction can enable this by the use of clarifications for correction and resolving uncertainty, and active learning queries to learn new concepts encountered during operation. Prior work on dialog systems has either focused on exclusively learning how t…

    Submitted 13 December, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: AAAI 2020 Camera Ready

    Journal ref: Proceedings of 2021 AAAI Conference on Artificial Intelligence (AAAI-2021)

  35. arXiv:2004.12169  [pdf, other]

    cs.CL cs.LG cs.SE

    Learning to Update Natural Language Comments Based on Code Changes

    Authors: Sheena Panthaplackel, Pengyu Nie, Milos Gligoric, Junyi Jessy Li, Raymond J. Mooney

    Abstract: We formulate the novel task of automatically updating an existing natural language comment based on changes in the body of code it accompanies. We propose an approach that learns to correlate changes across two distinct language representations, to generate a sequence of edits that are applied to the existing comment to reflect the source code modifications. We train and evaluate our model using a…

    Submitted 27 April, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: Accepted in Association for Computational Linguistics (ACL) 2020

  36. arXiv:1912.06728  [pdf]

    cs.CL cs.LG cs.SE

    Associating Natural Language Comment and Source Code Entities

    Authors: Sheena Panthaplackel, Milos Gligoric, Raymond J. Mooney, Junyi Jessy Li

    Abstract: Comments are an integral part of software development; they are natural language descriptions associated with source code elements. Understanding explicit associations can be useful in improving code comprehensibility and maintaining the consistency between code and comments. As an initial step towards this larger goal, we address the task of associating entities in Javadoc comments with elements…

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: Accepted in AAAI 2020

  37. arXiv:1910.14208  [pdf, other]

    cs.CV cs.CL

    Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder

    Authors: Jialin Wu, Raymond J. Mooney

    Abstract: Most RNN-based image captioning models receive supervision on the output words to mimic human captions. Therefore, the hidden states can only receive noisy gradient signals via layers of back-propagation through time, leading to less accurate generated captions. Consequently, we propose a novel framework, Hidden State Guidance (HSG), that matches the hidden states in the caption decoder to those i…

    Submitted 14 January, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

  38. arXiv:1908.02308  [pdf]

    cs.MM

    Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps

    Authors: Shih-Fu Chang, Alex Hauptmann, Louis-Philippe Morency, Sameer Antani, Dick Bulterman, Carlos Busso, Joyce Chai, Julia Hirschberg, Ramesh Jain, Ketan Mayer-Patel, Reuven Meth, Raymond Mooney, Klara Nahrstedt, Shri Narayanan, Prem Natarajan, Sharon Oviatt, Balakrishnan Prabhakaran, Arnold Smeulders, Hari Sundaram, Zhengyou Zhang, Michelle Zhou

    Abstract: With the transformative technologies and the rapidly changing global R&D landscape, the multimedia and multimodal community is now faced with many new opportunities and uncertainties. With the open source dissemination platform and pervasive computing resources, new research results are being discovered at an unprecedented pace. In addition, the rapid exchange and influence of ideas across traditi…

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: Long Report of NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps, held in March 2017, Washington DC. Short report available separately

  39. arXiv:1906.00513  [pdf, other]

    cs.CV cs.CL

    Generating Question Relevant Captions to Aid Visual Question Answering

    Authors: Jialin Wu, Zeyuan Hu, Raymond J. Mooney

    Abstract: Visual question answering (VQA) and image captioning require a shared body of general knowledge connecting language and vision. We present a novel approach to improve VQA performance that exploits this connection by jointly generating captions that are targeted to help answer a specific visual question. The model is trained using an existing caption dataset by automatically determining question-re…

    Submitted 3 January, 2020; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: ACL 2019 camera-ready

  40. arXiv:1905.13714  [pdf, other]

    cs.CL

    Do Human Rationales Improve Machine Explanations?

    Authors: Julia Strout, Ye Zhang, Raymond J. Mooney

    Abstract: Work on "learning with rationales" shows that humans providing explanations to a machine learning system can improve the system's predictive accuracy. However, this work has not been connected to work in "explainable AI" which concerns machines explaining their reasoning to humans. In this work, we show that learning with rationales can also improve the quality of the machine's explanations as eva…

    Submitted 31 May, 2019; originally announced May 2019.

  41. arXiv:1905.09998  [pdf, other]

    cs.CV cs.CL

    Self-Critical Reasoning for Robust Visual Question Answering

    Authors: Jialin Wu, Raymond J. Mooney

    Abstract: Visual Question Answering (VQA) deep-learning systems tend to capture superficial statistical correlations in the training data because of strong language priors and fail to generalize to test data with a significantly different question-answer (QA) distribution. To address this issue, we introduce a self-critical training objective that ensures that visual explanations of correct answers match th…

    Submitted 30 December, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: In NeurIPS 2019

  42. arXiv:1903.02020  [pdf, other]

    cs.LG cs.AI stat.ML

    Using Natural Language for Reward Shaping in Reinforcement Learning

    Authors: Prasoon Goyal, Scott Niekum, Raymond J. Mooney

    Abstract: Recent reinforcement learning (RL) approaches have shown strong performance in complex domains such as Atari games, but are often highly sample inefficient. A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal. However, designing appro…

    Submitted 31 May, 2019; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: IJCAI 2019

  43. Improving Grounded Natural Language Understanding through Human-Robot Dialog

    Authors: Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, Raymond J. Mooney

    Abstract: Natural language understanding for robotics can require substantial domain- and platform-specific engineering. For example, for mobile robots to pick-and-place objects in an environment to satisfy human commands, we can specify the language humans use to issue such commands, and connect concept words like "red can" to physical object properties. One way to alleviate this engineering for a new domain…

    Submitted 28 February, 2019; originally announced March 2019.

  44. arXiv:1810.02919  [pdf, other]

    cs.RO

    Interaction and Autonomy in RoboCup@Home and Building-Wide Intelligence

    Authors: Justin Hart, Harel Yedidsion, Yuqian Jiang, Nick Walker, Rishi Shah, Jesse Thomason, Aishwarya Padmakumar, Rolando Fernandez, Jivko Sinapov, Raymond Mooney, Peter Stone

    Abstract: Efforts are underway at UT Austin to build autonomous robot systems that address the challenges of long-term deployments in office environments and of the more prescribed domestic service tasks of the RoboCup@Home competition. We discuss the contrasts and synergies of these efforts, highlighting how our work to build a RoboCup@Home Domestic Standard Platform League entry led us to identify an inte…

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

    Report number: AI-HRI/2018/10

  45. arXiv:1809.02805  [pdf, other]

    cs.CL cs.CV

    Faithful Multimodal Explanation for Visual Question Answering

    Authors: Jialin Wu, Raymond J. Mooney

    Abstract: AI systems' ability to explain their reasoning is critical to their utility and trustworthiness. Deep neural networks have enabled significant progress on many challenging problems such as visual question answering (VQA). However, most of them are opaque black boxes with limited explanatory capability. This paper presents a novel approach to developing a high-performing VQA system that can elucida…

    Submitted 3 June, 2019; v1 submitted 8 September, 2018; originally announced September 2018.

    Comments: In ACL 2019 BlackboxNLP workshop

  46. arXiv:1808.10009  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Learning a Policy for Opportunistic Active Learning

    Authors: Aishwarya Padmakumar, Peter Stone, Raymond J. Mooney

    Abstract: Active learning identifies data points to label that are expected to be the most useful in improving a supervised model. Opportunistic active learning incorporates active learning into interactive tasks that constrain possible queries during interactions. Prior work has shown that opportunistic active learning can be used to improve grounding of natural language descriptions in an interactive obje…

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018 Camera Ready

    Journal ref: EMNLP 2018

  47. arXiv:1808.01729  [pdf, other]

    cs.SE

    Executable Trigger-Action Comments

    Authors: Pengyu Nie, Rishabh Rai, Junyi Jessy Li, Sarfraz Khurshid, Raymond J. Mooney, Milos Gligoric

    Abstract: Natural language elements, e.g., todo comments, are frequently used to communicate among the developers and to describe tasks that need to be performed (actions) when specific conditions hold in the code repository (triggers). As projects evolve, development processes change, and development teams reorganize, these comments, because of their informal nature, frequently become irrelevant or forgott…

    Submitted 6 August, 2018; originally announced August 2018.

  48. arXiv:1805.08389  [pdf, other]

    cs.CL cs.CV

    Joint Image Captioning and Question Answering

    Authors: Jialin Wu, Zeyuan Hu, Raymond J. Mooney

    Abstract: Answering visual questions requires acquiring everyday common knowledge and modeling the semantic connections among different parts of images, which is too difficult for VQA systems to learn from images with answers as the only supervision. Meanwhile, image captioning systems that use beam search tend to generate similar captions and fail to describe images diversely. To address the aforementioned issue…

    Submitted 22 May, 2018; originally announced May 2018.

  49. arXiv:1709.02271  [pdf, other]

    cs.CL

    Leveraging Discourse Information Effectively for Authorship Attribution

    Authors: Su Wang, Elisa Ferracane, Raymond J. Mooney

    Abstract: We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a substantial margin. We empirically investigate several featurization methods to understand the conditions under which discourse featu…

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: Accepted at IJCNLP 2017 as a conference paper

    Journal ref: The 8th International Joint Conference on Natural Language Processing (IJCNLP 2017)

  50. arXiv:1606.07770  [pdf, other]

    cs.CV cs.CL

    Captioning Images with Diverse Objects

    Authors: Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko

    Abstract: Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources -- labeled images from object recognition dat…

    Submitted 20 July, 2017; v1 submitted 24 June, 2016; originally announced June 2016.

    Comments: CVPR 2017 Camera ready version. 17 pages (8 + 9 supplement), 12 figures, 8 tables. Includes project page http://vsubhashini.github.io/noc.html