Skip to main content

Showing 1–6 of 6 results for author: Livne, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2003.02645  [pdf, other

    cs.CL cs.LG stat.ML

    SentenceMIM: A Latent Variable Language Model

    Authors: Micha Livne, Kevin Swersky, David J. Fleet

    Abstract: SentenceMIM is a probabilistic auto-encoder for language data, trained with Mutual Information Machine (MIM) learning to provide a fixed length representation of variable length language observations (i.e., similar to VAE). Previous attempts to learn VAEs for language data faced challenges due to posterior collapse. MIM learning encourages high mutual information between observations and latent va… ▽ More

    Submitted 21 April, 2021; v1 submitted 18 February, 2020; originally announced March 2020.

    Comments: Preprint. Demo: https://github.com/seraphlabs-ca/SentenceMIM-demo

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:1910.04153  [pdf, other

    stat.ML cs.IT cs.LG

    High Mutual Information in Representation Learning with Symmetric Variational Inference

    Authors: Micha Livne, Kevin Swersky, David J. Fleet

    Abstract: We introduce the Mutual Information Machine (MIM), a novel formulation of representation learning, using a joint distribution over the observations and latent state in an encoder/decoder framework. Our key principles are symmetry and mutual information, where symmetry encourages the encoder and decoder to learn different factorizations of the same underlying distribution, and mutual information, t… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: Bayesian Deep Learning Workshop (NeurIPS 2019). arXiv admin note: substantial text overlap with arXiv:1910.03175

  3. arXiv:1910.03175  [pdf, other

    cs.LG cs.IT stat.ML

    MIM: Mutual Information Machine

    Authors: Micha Livne, Kevin Swersky, David J. Fleet

    Abstract: We introduce the Mutual Information Machine (MIM), a probabilistic auto-encoder for learning joint distributions over observations and latent variables. MIM reflects three design principles: 1) low divergence, to encourage the encoder and decoder to learn consistent factorizations of the same underlying distribution; 2) high mutual information, to encourage an informative relation between data and… ▽ More

    Submitted 21 February, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Pre-print. Project webpage: https://research.seraphlabs.ca/projects/mim/

    MSC Class: 62F15 ACM Class: G.3; I.2.6

  4. arXiv:1902.01893  [pdf, other

    cs.LG stat.ML

    TzK: Flow-Based Conditional Generative Model

    Authors: Micha Livne, David Fleet

    Abstract: We formulate a new class of conditional generative models based on probability flows. Trained with maximum likelihood, it provides efficient inference and sampling from class-conditionals or the joint distribution, and does not require a priori knowledge of the number of classes or the relationships between classes. This allows one to train generative models from multiple, heterogeneous datasets,… ▽ More

    Submitted 22 April, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: 8 pages, 9 figures, 2 tables, preprint

    MSC Class: 68T30

  5. arXiv:1811.01837  [pdf, other

    cs.LG stat.ML

    TzK Flow - Conditional Generative Model

    Authors: Micha Livne, David J. Fleet

    Abstract: We introduce TzK (pronounced "task"), a conditional probability flow-based model that exploits attributes (e.g., style, class membership, or other side information) in order to learn tight conditional prior around manifolds of the target observations. The model is trained via approximated ML, and offers efficient approximation of arbitrary data sample distributions (similar to GAN and flow-based M… ▽ More

    Submitted 19 February, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: 5 pages, 4 figures, Accepted to Bayesian Deep Learning Workshop NIPS 2018, camera ready NOTE: This workshop paper has been replaced. Please refer to the following work: arXiv:1902.01893

    MSC Class: 68T05 ACM Class: F.1.1; F.1.2; G.3

  6. arXiv:1809.04430  [pdf, other

    cs.CV cs.LG cs.NE physics.med-ph stat.ML

    Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy

    Authors: Stanislav Nikolov, Sam Blackwell, Alexei Zverovitch, Ruheena Mendes, Michelle Livne, Jeffrey De Fauw, Yojan Patel, Clemens Meyer, Harry Askham, Bernardino Romera-Paredes, Christopher Kelly, Alan Karthikesalingam, Carlton Chu, Dawn Carnell, Cheng Boon, Derek D'Souza, Syed Ali Moinuddin, Bethany Garie, Yasmin McQuinlan, Sarah Ireland, Kiarna Hampton, Krystle Fuller, Hugh Montgomery, Geraint Rees, Mustafa Suleyman , et al. (4 additional authors not shown)

    Abstract: Over half a million individuals are diagnosed with head and neck cancer each year worldwide. Radiotherapy is an important curative treatment for this disease, but it requires manual time consuming delineation of radio-sensitive organs at risk (OARs). This planning process can delay treatment, while also introducing inter-operator variability with resulting downstream radiation dose differences. Wh… ▽ More

    Submitted 13 January, 2021; v1 submitted 12 September, 2018; originally announced September 2018.