Skip to main content

Showing 1–3 of 3 results for author: Seybold, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:1910.09588  [pdf, other

    cs.LG stat.ML

    Collapsed Amortized Variational Inference for Switching Nonlinear Dynamical Systems

    Authors: Zhe Dong, Bryan A. Seybold, Kevin P. Murphy, Hung H. Bui

    Abstract: We propose an efficient inference method for switching nonlinear dynamical systems. The key idea is to learn an inference network which can be used as a proposal distribution for the continuous latent variables, while performing exact marginalization of the discrete latent variables. This allows us to use the reparameterization trick, and apply end-to-end training with stochastic gradient descent.… ▽ More

    Submitted 10 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

  2. arXiv:1905.07478  [pdf, other

    cs.LG stat.ML

    Dueling Decoders: Regularizing Variational Autoencoder Latent Spaces

    Authors: Bryan Seybold, Emily Fertig, Alex Alemi, Ian Fischer

    Abstract: Variational autoencoders learn unsupervised data representations, but these models frequently converge to minima that fail to preserve meaningful semantic information. For example, variational autoencoders with autoregressive decoders often collapse into autodecoders, where they learn to ignore the encoder input. In this work, we demonstrate that adding an auxiliary decoder to regularize the laten… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 16 pages, 9 figures, supplemental

  3. arXiv:1609.09430  [pdf, other

    cs.SD cs.LG stat.ML

    CNN Architectures for Large-Scale Audio Classification

    Authors: Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

    Abstract: Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 video-level labels. We examine fully connected Deep Neural Networks (DNNs), AlexNet [1], VGG [2], Inception [3], and ResNet [4]. We investigate varying th… ▽ More

    Submitted 10 January, 2017; v1 submitted 29 September, 2016; originally announced September 2016.

    Comments: Accepted for publication at ICASSP 2017 Changes: Added definitions of mAP, AUC, and d-prime. Updated mAP/AUC/d-prime numbers for Audio Set based on changes of latest Audio Set revision. Changed wording to fit 4 page limit with new additions