Skip to main content

Showing 1–6 of 6 results for author: Jain, S D

Searching in archive cs. Search in all archives.
.
  1. arXiv:1905.00060  [pdf, other

    cs.CV

    Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

    Authors: Danna Gurari, Yinan Zhao, Suyog Dutt Jain, Margrit Betke, Kristen Grauman

    Abstract: Foreground object segmentation is a critical step for many image analysis tasks. While automated methods can produce high-quality results, their failures disappoint users in need of practical solutions. We propose a resource allocation framework for predicting how best to allocate a fixed budget of human annotation effort in order to collect higher quality segmentations for a given batch of images… ▽ More

    Submitted 30 April, 2019; originally announced May 2019.

  2. arXiv:1808.04702  [pdf, other

    cs.CV

    Pixel Objectness: Learning to Segment Generic Objects Automatically in Images and Videos

    Authors: Bo Xiong, Suyog Dutt Jain, Kristen Grauman

    Abstract: We propose an end-to-end learning framework for segmenting generic objects in both images and videos. Given a novel image or video, our approach produces a pixel-level mask for all "object-like" regions---even for object categories never seen during training. We formulate the task as a structured prediction problem of assigning an object/background label to each pixel, implemented using a deep ful… ▽ More

    Submitted 17 December, 2018; v1 submitted 11 August, 2018; originally announced August 2018.

    Comments: To appear in PAMI. arXiv admin note: text overlap with arXiv:1701.05349, arXiv:1701.05384

  3. arXiv:1705.00366  [pdf, other

    cs.CV

    Predicting Foreground Object Ambiguity and Efficiently Crowdsourcing the Segmentation(s)

    Authors: Danna Gurari, Kun He, Bo Xiong, Jianming Zhang, Mehrnoosh Sameki, Suyog Dutt Jain, Stan Sclaroff, Margrit Betke, Kristen Grauman

    Abstract: We propose the ambiguity problem for the foreground object segmentation task and motivate the importance of estimating and accounting for this ambiguity when designing vision systems. Specifically, we distinguish between images which lead multiple annotators to segment different foreground objects (ambiguous) versus minor inter-annotator differences of the same object. Taking images from eight wid… ▽ More

    Submitted 30 April, 2017; originally announced May 2017.

  4. arXiv:1701.05384  [pdf, other

    cs.CV

    FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos

    Authors: Suyog Dutt Jain, Bo Xiong, Kristen Grauman

    Abstract: We propose an end-to-end learning framework for segmenting generic objects in videos. Our method learns to combine appearance and motion information to produce pixel level segmentation masks for all prominent objects in videos. We formulate this task as a structured prediction problem and design a two-stream fully convolutional neural network which fuses together motion and appearance in a unified… ▽ More

    Submitted 12 April, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: CVPR 2017

  5. arXiv:1701.05349  [pdf, other

    cs.CV

    Pixel Objectness

    Authors: Suyog Dutt Jain, Bo Xiong, Kristen Grauman

    Abstract: We propose an end-to-end learning framework for generating foreground object segmentations. Given a single novel image, our approach produces pixel-level masks for all "object-like" regions---even for object categories never seen during training. We formulate the task as a structured prediction problem of assigning foreground/background labels to all pixels, implemented using a deep fully convolut… ▽ More

    Submitted 12 April, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

  6. arXiv:1607.01115  [pdf, other

    cs.CV cs.AI cs.HC

    Click Carving: Segmenting Objects in Video with Point Clicks

    Authors: Suyog Dutt Jain, Kristen Grauman

    Abstract: We present a novel form of interactive video object segmentation where a few clicks by the user helps the system produce a full spatio-temporal segmentation of the object of interest. Whereas conventional interactive pipelines take the user's initialization as a starting point, we show the value in the system taking the lead even in initialization. In particular, for a given video frame, the syste… ▽ More

    Submitted 5 July, 2016; originally announced July 2016.

    Comments: A preliminary version of the material in this document was filed as University of Texas technical report no. UT AI16-01

    Report number: University of Texas Technical Report UT AI16-01