Showing 1–9 of 9 results for author: Burachas, G

  1. arXiv:2207.02972  [pdf, other]

    cs.LG cs.NE

    Learning Invariant World State Representations with Predictive Coding

    Authors: Avi Ziskind, Sujeong Kim, Giedrius T. Burachas

    Abstract: Self-supervised learning methods overcome the key bottleneck for building more capable AI: limited availability of labeled data. However, one of the drawbacks of self-supervised architectures is that the representations that they learn are implicit and it is hard to extract meaningful information about the encoded world states, such as 3D structure of the visual scene encoded in a depth map. Moreo…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 11 pages, 5 figures, submitted

  2. arXiv:2110.06863  [pdf, other]

    cs.CV cs.AI cs.HC

    Improving Users' Mental Model with Attention-directed Counterfactual Edits

    Authors: Kamran Alipour, Arijit Ray, Xiao Lin, Michael Cogswell, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

    Abstract: In the domain of Visual Question Answering (VQA), studies have shown improvement in users' mental model of the VQA system when they are exposed to examples of how these systems answer certain Image-Question (IQ) pairs. In this work, we show that showing controlled counterfactual image-question examples are more effective at improving the mental model of users as compared to simply showing random e…

    Submitted 15 October, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted for publication in Applied AI Letters

  3. arXiv:2103.14712  [pdf, other]

    cs.CV cs.AI cs.CY cs.HC

    Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models

    Authors: Arijit Ray, Michael Cogswell, Xiao Lin, Kamran Alipour, Ajay Divakaran, Yi Yao, Giedrius Burachas

    Abstract: Attention maps, a popular heatmap-based explanation method for Visual Question Answering (VQA), are supposed to help users understand the model by highlighting portions of the image/question used by the model to infer answers. However, we see that users are often misled by current attention map visualizations that point to relevant regions despite the model producing an incorrect answer. Hence, we…

    Submitted 25 October, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Applied AI Letters, Wiley, 25 October 2021

  4. arXiv:2011.10082  [pdf, other]

    cs.CV

    Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning

    Authors: Meng Ye, Xiao Lin, Giedrius Burachas, Ajay Divakaran, Yi Yao

    Abstract: Few-Shot Learning (FSL) aims to improve a model's generalization capability in low data regimes. Recent FSL works have made steady progress via metric learning, meta learning, representation learning, etc. However, FSL remains challenging due to the following longstanding difficulties. 1) The seen and unseen classes are disjoint, resulting in a distribution shift between training and testing. 2) D…

    Submitted 19 November, 2020; originally announced November 2020.

  5. arXiv:2007.00900  [pdf, other]

    cs.CV cs.AI cs.HC

    The Impact of Explanations on AI Competency Prediction in VQA

    Authors: Kamran Alipour, Arijit Ray, Xiao Lin, Jurgen P. Schulze, Yi Yao, Giedrius T. Burachas

    Abstract: Explainability is one of the key elements for building trust in AI systems. Among numerous attempts to make AI explainable, quantifying the effect of explanations remains a challenge in conducting human-AI collaborative tasks. Aside from the ability to predict the overall behavior of AI, in many applications, users need to understand an AI agent's competency in different aspects of the task domain…

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: Submitted to HCCAI 2020

  6. arXiv:2003.00431  [pdf, other]

    cs.AI

    A Study on Multimodal and Interactive Explanations for Visual Question Answering

    Authors: Kamran Alipour, Jurgen P. Schulze, Yi Yao, Avi Ziskind, Giedrius Burachas

    Abstract: Explainability and interpretability of AI models is an essential factor affecting the safety of AI. While various explainable AI (XAI) approaches aim at mitigating the lack of transparency in deep networks, the evidence of the effectiveness of these approaches in improving usability, trust, and understanding of AI systems are still missing. We evaluate multimodal explanations in the setting of a V…

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: http://ceur-ws.org/Vol-2560/paper44.pdf

    Journal ref: Proceedings of the Workshop on Artificial Intelligence Safety (SafeAI 2020) co-located with 34th AAAI Conference on Artificial Intelligence (AAAI 2020), New York, USA, Feb 7, 2020

  7. arXiv:1909.04696  [pdf, other]

    cs.CV cs.AI

    Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

    Authors: Arijit Ray, Karan Sikka, Ajay Divakaran, Stefan Lee, Giedrius Burachas

    Abstract: While models for Visual Question Answering (VQA) have steadily improved over the years, interacting with one quickly reveals that these models lack consistency. For instance, if a model answers "red" to "What color is the balloon?", it might answer "no" if asked, "Is the balloon red?". These responses violate simple notions of entailment and raise questions about how effectively VQA models ground…

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP 2019)

  8. arXiv:1904.03285  [pdf, other]

    cs.CY cs.CV cs.HC

    Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval

    Authors: Arijit Ray, Yi Yao, Rakesh Kumar, Ajay Divakaran, Giedrius Burachas

    Abstract: While there have been many proposals on making AI algorithms explainable, few have attempted to evaluate the impact of AI-generated explanations on human performance in conducting human-AI collaborative tasks. To bridge the gap, we propose a Twenty-Questions style collaborative image retrieval game, Explanation-assisted Guess Which (ExAG), as a method of evaluating the efficacy of explanations (vi…

    Submitted 21 September, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: 2019 AAAI Conference on Human Computation and Crowdsourcing

    Journal ref: 2019 AAAI Conference on Human Computation and Crowdsourcing

  9. arXiv:1902.05715  [pdf, other]

    cs.CL cs.AI

    Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention

    Authors: Shalini Ghosh, Giedrius Burachas, Arijit Ray, Avi Ziskind

    Abstract: In this paper, we present a novel approach for the task of eXplainable Question Answering (XQA), i.e., generating natural language (NL) explanations for the Visual Question Answering (VQA) problem. We generate NL explanations comprising of the evidence to support the answer to a question asked to an image using two sources of information: (a) annotations of entities in an image (e.g., object label…

    Submitted 15 February, 2019; originally announced February 2019.