Skip to main content

Showing 1–4 of 4 results for author: Kemp, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:1909.09712  [pdf, other

    cs.LG stat.ML

    Learning an Adaptive Learning Rate Schedule

    Authors: Zhen Xu, Andrew M. Dai, Jonas Kemp, Luke Metz

    Abstract: The learning rate is one of the most important hyper-parameters for model training and generalization. However, current hand-designed parametric learning rate schedules offer limited flexibility and the predefined schedule may not match the training dynamics of high dimensional and non-convex optimization problems. In this paper, we propose a reinforcement learning based framework that can automat… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

  2. arXiv:1909.03039  [pdf, other

    cs.LG cs.CL stat.ML

    Improved Hierarchical Patient Classification with Language Model Pretraining over Clinical Notes

    Authors: Jonas Kemp, Alvin Rajkomar, Andrew M. Dai

    Abstract: Clinical notes in electronic health records contain highly heterogeneous writing styles, including non-standard terminology or abbreviations. Using these notes in predictive modeling has traditionally required preprocessing (e.g. taking frequent terms or topic modeling) that removes much of the richness of the source data. We propose a pretrained hierarchical recurrent neural network model that pa… ▽ More

    Submitted 14 November, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - extended abstract

  3. Analyzing the Role of Model Uncertainty for Electronic Health Records

    Authors: Michael W. Dusenberry, Dustin Tran, Edward Choi, Jonas Kemp, Jeremy Nixon, Ghassen Jerfel, Katherine Heller, Andrew M. Dai

    Abstract: In medicine, both ethical and monetary costs of incorrect predictions can be significant, and the complexity of the problems often necessitates increasingly complex models. Recent work has shown that changing just the random seed is enough for otherwise well-tuned deep neural networks to vary in their individual predicted probabilities. In light of this, we investigate the role of model uncertaint… ▽ More

    Submitted 25 March, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Published in the ACM Conference on Health, Inference, and Learning (CHIL) 2020. Code available at https://github.com/Google-Health/records-research

  4. arXiv:1905.08547  [pdf

    cs.LG stat.ML

    Benchmarking Deep Learning Architectures for Predicting Readmission to the ICU and Describing Patients-at-Risk

    Authors: Sebastiano Barbieri, James Kemp, Oscar Perez-Concha, Sradha Kotwal, Martin Gallagher, Angus Ritchie, Louisa Jorm

    Abstract: Objective: To compare different deep learning architectures for predicting the risk of readmission within 30 days of discharge from the intensive care unit (ICU). The interpretability of attention-based models is leveraged to describe patients-at-risk. Methods: Several deep learning architectures making use of attention mechanisms, recurrent layers, neural ordinary differential equations (ODEs), a… ▽ More

    Submitted 6 January, 2020; v1 submitted 21 May, 2019; originally announced May 2019.