Skip to main content

Showing 1–3 of 3 results for author: Dogan, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.14682  [pdf, other

    cs.CV cs.CL

    Enriching Video Captions With Contextual Text

    Authors: Philipp Rimle, Pelin Dogan, Markus Gross

    Abstract: Understanding video content and generating caption with context is an important and challenging task. Unlike prior methods that typically attempt to generate generic video captions without context, our architecture contextualizes captioning by infusing extracted information from relevant text data. We propose an end-to-end sequence-to-sequence model which generates video captions based on visual i… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Accepted at ICPR 2020

    MSC Class: I.2.10; I.2.7

  2. arXiv:1903.07669  [pdf, other

    cs.CV

    Neural Sequential Phrase Grounding (SeqGROUND)

    Authors: Pelin Dogan, Leonid Sigal, Markus Gross

    Abstract: We propose an end-to-end approach for phrase grounding in images. Unlike prior methods that typically attempt to ground each phrase independently by building an image-text embedding, our architecture formulates grounding of multiple phrases as a sequential and contextual process. Specifically, we encode region proposals and all phrases into two stacks of LSTM cells, along with so-far grounded phra… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted at CVPR 2019

  3. arXiv:1803.00057  [pdf, other

    cs.CV cs.CL cs.LG

    A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)

    Authors: Pelin Dogan, Boyang Li, Leonid Sigal, Markus Gross

    Abstract: The alignment of heterogeneous sequential data (video to text) is an important and challenging problem. Standard techniques for this task, including Dynamic Time Warping (DTW) and Conditional Random Fields (CRFs), suffer from inherent drawbacks. Mainly, the Markov assumption implies that, given the immediate past, future alignment decisions are independent of further history. The separation betwee… ▽ More

    Submitted 9 April, 2018; v1 submitted 19 February, 2018; originally announced March 2018.

    Comments: Accepted at CVPR 2018 (Spotlight). arXiv file includes the paper and the supplemental material