Skip to main content

Showing 1–1 of 1 results for author: Hovsepian, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2105.02626  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations

    Authors: Varun Nagaraj Rao, Xingjian Zhen, Karen Hovsepian, Mingwei Shen

    Abstract: Explainable deep learning models are advantageous in many situations. Prior work mostly provide unimodal explanations through post-hoc approaches not part of the original system design. Explanation mechanisms also ignore useful textual information present in images. In this paper, we propose MTXNet, an end-to-end trainable multimodal architecture to generate multimodal explanations, which focuses… ▽ More

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: This paper is done when Xingjian was an intern in Amazon PARS group, summer 2020. This paper is accepted by NAACL-MAI-Workshop, 2021