Skip to main content

Showing 1–4 of 4 results for author: Dao, C D

Searching in archive cs. Search in all archives.
.
  1. arXiv:1808.03766  [pdf, ps, other

    cs.CV

    The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary

    Authors: Bernard Ghanem, Juan Carlos Niebles, Cees Snoek, Fabian Caba Heilbron, Humam Alwassel, Victor Escorcia, Ranjay Krishna, Shyamal Buch, Cuong Duc Dao

    Abstract: The 3rd annual installment of the ActivityNet Large- Scale Activity Recognition Challenge, held as a full-day workshop in CVPR 2018, focused on the recognition of daily life, high-level, goal-oriented activities from user-generated videos as those found in internet video portals. The 2018 challenge hosted six diverse tasks which aimed to push the limits of semantic visual understanding of videos a… ▽ More

    Submitted 23 August, 2018; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: CVPR Workshop 2018 challenge summary

  2. arXiv:1804.01824  [pdf, other

    cs.CV

    Guess Where? Actor-Supervision for Spatiotemporal Action Localization

    Authors: Victor Escorcia, Cuong D. Dao, Mihir Jain, Bernard Ghanem, Cees Snoek

    Abstract: This paper addresses the problem of spatiotemporal localization of actions in videos. Compared to leading approaches, which all learn to localize based on carefully annotated boxes on training video frames, we adhere to a weakly-supervised solution that only requires a video class label. We introduce an actor-supervised architecture that exploits the inherent compositionality of actions in terms o… ▽ More

    Submitted 5 April, 2018; originally announced April 2018.

    Comments: cvpr version

  3. arXiv:1711.06232  [pdf, other

    cs.CV cs.CL

    A Novel Framework for Robustness Analysis of Visual QA Models

    Authors: Jia-Hong Huang, Cuong Duc Dao, Modar Alfadly, Bernard Ghanem

    Abstract: Deep neural networks have been playing an essential role in many computer vision tasks including Visual Question Answering (VQA). Until recently, the study of their accuracy was the main focus of research but now there is a trend toward assessing the robustness of these models against adversarial attacks by evaluating their tolerance to varying noise levels. In VQA, adversarial attacks can target… ▽ More

    Submitted 24 December, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: Accepted by the Thirty-Third AAAI Conference on Artificial Intelligence, (AAAI-19), as an oral paper

  4. arXiv:1709.04625  [pdf, other

    cs.CV cs.CL

    Robustness Analysis of Visual QA Models by Basic Questions

    Authors: Jia-Hong Huang, Cuong Duc Dao, Modar Alfadly, C. Huck Yang, Bernard Ghanem

    Abstract: Visual Question Answering (VQA) models should have both high robustness and accuracy. Unfortunately, most of the current VQA research only focuses on accuracy because there is a lack of proper methods to measure the robustness of VQA models. There are two main modules in our algorithm. Given a natural language question about an image, the first module takes the question as input and then outputs t… ▽ More

    Submitted 26 May, 2018; v1 submitted 14 September, 2017; originally announced September 2017.

    Comments: Accepted by CVPR 2018 VQA Challenge and Visual Dialog Workshop. (Acknowledgement updating)