Skip to main content

Showing 1–3 of 3 results for author: Quinn, M H

Searching in archive cs. Search in all archives.
.
  1. arXiv:1711.00088  [pdf, other

    cs.CV

    Semantic Image Retrieval via Active Grounding of Visual Situations

    Authors: Max H. Quinn, Erik Conser, Jordan M. Witte, Melanie Mitchell

    Abstract: We describe a novel architecture for semantic image retrieval---in particular, retrieval of instances of visual situations. Visual situations are concepts such as "a boxing match," "walking the dog," "a crowd waiting for a bus," or "a game of ping-pong," whose instantiations in images are linked more by their common spatial and semantic structure than by low-level visual similarity. Given a query… ▽ More

    Submitted 31 October, 2017; originally announced November 2017.

  2. arXiv:1611.05369  [pdf, other

    cs.CV cs.LG

    Fast On-Line Kernel Density Estimation for Active Object Localization

    Authors: Anthony D. Rhodes, Max H. Quinn, Melanie Mitchell

    Abstract: A major goal of computer vision is to enable computers to interpret visual situations---abstract concepts (e.g., "a person walking a dog," "a crowd waiting for a bus," "a picnic") whose image instantiations are linked more by their common spatial and semantic structure than by low-level visual similarity. In this paper, we propose a novel method for prior learning and active object localization fo… ▽ More

    Submitted 16 November, 2016; originally announced November 2016.

    Comments: arXiv admin note: text overlap with arXiv:1607.00548

  3. arXiv:1607.00548  [pdf, other

    cs.CV

    Active Object Localization in Visual Situations

    Authors: Max H. Quinn, Anthony D. Rhodes, Melanie Mitchell

    Abstract: We describe a method for performing active localization of objects in instances of visual situations. A visual situation is an abstract concept---e.g., "a boxing match", "a birthday party", "walking the dog", "waiting for a bus"---whose image instantiations are linked more by their common spatial and semantic structure than by low-level visual similarity. Our system combines given and learned know… ▽ More

    Submitted 2 July, 2016; originally announced July 2016.

    Comments: 14 pages