Skip to main content

Showing 1–3 of 3 results for author: Stephenson, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2105.14602  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    On the geometry of generalization and memorization in deep neural networks

    Authors: Cory Stephenson, Suchismita Padhy, Abhinav Ganesh, Yue Hui, Hanlin Tang, SueYeon Chung

    Abstract: Understanding how large neural networks avoid memorizing training data is key to explaining their high generalization performance. To examine the structure of when and where memorization occurs in a deep network, we use a recently developed replica-based mean field theoretic geometric analysis method. We find that all layers preferentially learn from examples which share features, and link this be… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: ICLR 2021

  2. arXiv:1910.00067  [pdf, other

    stat.ML cs.LG eess.AS

    Semi-supervised voice conversion with amortized variational inference

    Authors: Cory Stephenson, Gokce Keskin, Anil Thomas, Oguz H. Elibol

    Abstract: In this work we introduce a semi-supervised approach to the voice conversion problem, in which speech from a source speaker is converted into speech of a target speaker. The proposed method makes use of both parallel and non-parallel utterances from the source and target simultaneously during training. This approach can be used to extend existing parallel data voice conversion systems such that th… ▽ More

    Submitted 30 September, 2019; originally announced October 2019.

    Comments: Accepted for publication at Interspeech 2019

    Journal ref: Proc. Interspeech 2019 (2019): 729-733

  3. arXiv:1705.04662  [pdf, other

    cs.SD cs.AI cs.LG stat.ML

    Monaural Audio Speaker Separation with Source Contrastive Estimation

    Authors: Cory Stephenson, Patrick Callier, Abhinav Ganesh, Karl Ni

    Abstract: We propose an algorithm to separate simultaneously speaking persons from each other, the "cocktail party problem", using a single microphone. Our approach involves a deep recurrent neural networks regression to a vector space that is descriptive of independent speakers. Such a vector space can embed empirically determined speaker characteristics and is optimized by distinguishing between speaker m… ▽ More

    Submitted 12 May, 2017; originally announced May 2017.