Skip to main content

Showing 1–7 of 7 results for author: Gabbay, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.15610  [pdf, other

    cs.CV cs.LG

    An Image is Worth More Than a Thousand Words: Towards Disentanglement in the Wild

    Authors: Aviv Gabbay, Niv Cohen, Yedid Hoshen

    Abstract: Unsupervised disentanglement has been shown to be theoretically impossible without inductive biases on the models and the data. As an alternative approach, recent methods rely on limited supervision to disentangle the factors of variation and allow their identifiability. While annotating the true generative factors is only required for a limited number of observations, we argue that it is infeasib… ▽ More

    Submitted 25 October, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021. Project page: http://www.vision.huji.ac.il/zerodim

  2. arXiv:2103.14017  [pdf, other

    cs.CV cs.LG

    Scaling-up Disentanglement for Image Translation

    Authors: Aviv Gabbay, Yedid Hoshen

    Abstract: Image translation methods typically aim to manipulate a set of labeled attributes (given as supervision at training time e.g. domain label) while leaving the unlabeled attributes intact. Current methods achieve either: (i) disentanglement, which exhibits low visual fidelity and can only be satisfied where the attributes are perfectly uncorrelated. (ii) visually-plausible translations, which are cl… ▽ More

    Submitted 8 September, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: ICCV 2021. Project page: http://www.vision.huji.ac.il/overlord

  3. arXiv:2007.04964  [pdf, other

    cs.CV cs.LG

    Improving Style-Content Disentanglement in Image-to-Image Translation

    Authors: Aviv Gabbay, Yedid Hoshen

    Abstract: Unsupervised image-to-image translation methods have achieved tremendous success in recent years. However, it can be easily observed that their models contain significant entanglement which often hurts the translation performance. In this work, we propose a principled approach for improving style-content disentanglement in image-to-image translation. By considering the information flow into each o… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Project page: http://www.vision.huji.ac.il/style-content-disentanglement

  4. arXiv:1906.11880  [pdf, other

    cs.CV cs.LG stat.ML

    Style Generator Inversion for Image Enhancement and Animation

    Authors: Aviv Gabbay, Yedid Hoshen

    Abstract: One of the main motivations for training high quality image generative models is their potential use as tools for image manipulation. Recently, generative adversarial networks (GANs) have been able to generate images of remarkable quality. Unfortunately, adversarially-trained unconditional generator networks have not been successful as image priors. One of the main requirements for a network to ac… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Project page: http://www.vision.huji.ac.il/style-image-prior

  5. arXiv:1906.11796  [pdf, other

    cs.LG cs.CV stat.ML

    Demystifying Inter-Class Disentanglement

    Authors: Aviv Gabbay, Yedid Hoshen

    Abstract: Learning to disentangle the hidden factors of variations within a set of observations is a key task for artificial intelligence. We present a unified formulation for class and content disentanglement and use it to illustrate the limitations of current methods. We therefore introduce LORD, a novel method based on Latent Optimization for Representation Disentanglement. We find that latent optimizati… ▽ More

    Submitted 18 February, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: ICLR 2020. Project page: http://www.vision.huji.ac.il/lord

  6. arXiv:1711.08789  [pdf, other

    cs.CV cs.SD eess.AS

    Visual Speech Enhancement

    Authors: Aviv Gabbay, Asaph Shamir, Shmuel Peleg

    Abstract: When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise. While most existing methods use audio-only inputs, improved performance is obtained with our visual speech enhancement, based on an audio-visual neural network. We include in the training data videos to which we added the voice of the targe… ▽ More

    Submitted 13 June, 2018; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Accepted to Interspeech 2018. Supplementary video: https://www.youtube.com/watch?v=nyYarDGpcYA

  7. arXiv:1708.06767  [pdf, other

    cs.CV cs.SD

    Seeing Through Noise: Visually Driven Speaker Separation and Enhancement

    Authors: Aviv Gabbay, Ariel Ephrat, Tavi Halperin, Shmuel Peleg

    Abstract: Isolating the voice of a specific person while filtering out other voices or background noises is challenging when video is shot in noisy environments. We propose audio-visual methods to isolate the voice of a single speaker and eliminate unrelated sounds. First, face motions captured in the video are used to estimate the speaker's voice, by passing the silent video frames through a video-to-speec… ▽ More

    Submitted 9 February, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: Supplementary video: https://www.youtube.com/watch?v=qmsyj7vAzoI