Skip to main content

Showing 51–59 of 59 results for author: Shakhnarovich, G

.
  1. arXiv:1703.04044  [pdf, other

    cs.CV

    Colorization as a Proxy Task for Visual Understanding

    Authors: Gustav Larsson, Michael Maire, Gregory Shakhnarovich

    Abstract: We investigate and improve self-supervision as a drop-in replacement for ImageNet pretraining, focusing on automatic colorization as the proxy task. Self-supervised training has been shown to be more promising for utilizing unlabeled data than other, traditional unsupervised learning methods. We build on this success and evaluate the ability of our self-supervised network in several contexts. On V… ▽ More

    Submitted 13 August, 2017; v1 submitted 11 March, 2017; originally announced March 2017.

    Comments: CVPR 2017 (Project page: http://people.cs.uchicago.edu/~larsson/color-proxy/)

  2. arXiv:1701.03439  [pdf, other

    cs.CV

    Comprehension-guided referring expressions

    Authors: Ruotian Luo, Gregory Shakhnarovich

    Abstract: We consider generation and comprehension of natural language referring expression for objects in an image. Unlike generic "image captioning" which lacks natural standard evaluation criteria, quality of a referring expression may be measured by the receiver's ability to correctly infer which object is being described. Following this intuition, we propose two approaches to utilize models trained for… ▽ More

    Submitted 12 January, 2017; originally announced January 2017.

  3. arXiv:1612.01991  [pdf, other

    cs.CV

    Diverse Sampling for Self-Supervised Learning of Semantic Segmentation

    Authors: Mohammadreza Mostajabi, Nicholas Kolkin, Gregory Shakhnarovich

    Abstract: We propose an approach for learning category-level semantic segmentation purely from image-level classification tags indicating presence of categories. It exploits localization cues that emerge from training classification-tasked convolutional networks, to drive a "self-supervision" process that automatically labels a sparse, diverse training set of points likely to belong to classes of interest.… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

  4. arXiv:1611.05760  [pdf, other

    cs.CV

    Examining the Impact of Blur on Recognition by Convolutional Networks

    Authors: Igor Vasiljevic, Ayan Chakrabarti, Gregory Shakhnarovich

    Abstract: State-of-the-art algorithms for many semantic visual tasks are based on the use of convolutional neural networks. These networks are commonly trained, and evaluated, on large annotated datasets of artifact-free high-quality images. In this paper, we investigate the effect of one such artifact that is quite common in natural capture settings: optical blur. We show that standard network models, trai… ▽ More

    Submitted 30 May, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

  5. arXiv:1609.07876  [pdf, other

    cs.CL cs.CV

    Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation

    Authors: Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari, Karen Livescu

    Abstract: We study the problem of recognizing video sequences of fingerspelled letters in American Sign Language (ASL). Fingerspelling comprises a significant but relatively understudied part of ASL. Recognizing fingerspelling is challenging for a number of reasons: It involves quick, small motions that are often highly coarticulated; it exhibits significant variation between signers; and there has been a d… ▽ More

    Submitted 26 September, 2016; originally announced September 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1608.08339

  6. arXiv:1605.07648  [pdf, other

    cs.CV

    FractalNet: Ultra-Deep Neural Networks without Residuals

    Authors: Gustav Larsson, Michael Maire, Gregory Shakhnarovich

    Abstract: We introduce a design strategy for neural network macro-architecture based on self-similarity. Repeated application of a simple expansion rule generates deep networks whose structural layouts are precisely truncated fractals. These networks contain interacting subpaths of different lengths, but do not include any pass-through or residual connections; every internal signal is transformed by a filte… ▽ More

    Submitted 26 May, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: updated with ImageNet results; published as a conference paper at ICLR 2017; project page at http://people.cs.uchicago.edu/~larsson/fractalnet/

  7. arXiv:1605.07081  [pdf, other

    cs.CV

    Depth from a Single Image by Harmonizing Overcomplete Local Network Predictions

    Authors: Ayan Chakrabarti, Jingyu Shao, Gregory Shakhnarovich

    Abstract: A single color image can contain many cues informative towards different aspects of local geometric structure. We approach the problem of monocular depth estimation by using a neural network to produce a mid-level representation that summarizes these cues. This network is trained to characterize local scene geometry by predicting, at every image location, depth derivatives of different orders, ori… ▽ More

    Submitted 7 September, 2016; v1 submitted 23 May, 2016; originally announced May 2016.

    Comments: NIPS 2016. Project page at http://www.ttic.edu/chakrabarti/mdepth/

  8. arXiv:1603.06668  [pdf, other

    cs.CV

    Learning Representations for Automatic Colorization

    Authors: Gustav Larsson, Michael Maire, Gregory Shakhnarovich

    Abstract: We develop a fully automatic image colorization system. Our approach leverages recent advances in deep networks, exploiting both low-level and semantic representations. As many scene elements naturally appear according to multimodal color distributions, we train our model to predict per-pixel color histograms. This intermediate output can be used to automatically generate a color image, or further… ▽ More

    Submitted 13 August, 2017; v1 submitted 22 March, 2016; originally announced March 2016.

    Comments: ECCV 2016 (Project page: http://people.cs.uchicago.edu/~larsson/colorization/)

  9. arXiv:1412.0774  [pdf, other

    cs.CV

    Feedforward semantic segmentation with zoom-out features

    Authors: Mohammadreza Mostajabi, Payman Yadollahpour, Gregory Shakhnarovich

    Abstract: We introduce a purely feed-forward architecture for semantic segmentation. We map small image elements (superpixels) to rich feature representations extracted from a sequence of nested regions of increasing extent. These regions are obtained by "zooming out" from the superpixel all the way to scene-level resolution. This approach exploits statistical structure in the image and in the label space w… ▽ More

    Submitted 1 December, 2014; originally announced December 2014.