Skip to main content

Showing 1–6 of 6 results for author: Kleinman, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.07279  [pdf, other

    cs.LG stat.ML

    Towards a theory of learning dynamics in deep state space models

    Authors: Jakub Smékal, Jimmy T. H. Smith, Michael Kleinman, Dan Biderman, Scott W. Linderman

    Abstract: State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2404.01620  [pdf

    cs.SD cs.AI cs.CY eess.AS

    Voice EHR: Introducing Multimodal Audio Data for Health

    Authors: James Anibal, Hannah Huth, Ming Li, Lindsey Hazen, Veronica Daoud, Dominique Ebedes, Yen Minh Lam, Hang Nguyen, Phuc Hong, Michael Kleinman, Shelley Ost, Christopher Jackson, Laura Sprabery, Cheran Elangovan, Balaji Krishnaiah, Lee Akst, Ioan Lina, Iqbal Elyazar, Lenny Ekwati, Stefan Jansen, Richard Nduwayezu, Charisse Garcia, Jeffrey Plum, Jacqueline Brenner, Miranda Song , et al. (5 additional authors not shown)

    Abstract: Artificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets collected with expensive recording equipment in high-income countries, which challenges deployment in resource-constrained, high-volume setti… ▽ More

    Submitted 9 November, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 21 pages, 5 figures, 6 tables

  3. arXiv:2308.12221  [pdf, other

    cs.LG cs.AI q-bio.NC stat.ML

    Critical Learning Periods Emerge Even in Deep Linear Networks

    Authors: Michael Kleinman, Alessandro Achille, Stefano Soatto

    Abstract: Critical learning periods are periods early in development where temporary sensory deficits can have a permanent effect on behavior and learned representations. Despite the radical differences between biological and artificial networks, critical learning periods have been empirically observed in both systems. This suggests that critical periods may be fundamental to learning and not an accident of… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 (Spotlight)

  4. arXiv:2210.04643  [pdf, other

    cs.LG cs.AI cs.CV q-bio.NC

    Critical Learning Periods for Multisensory Integration in Deep Networks

    Authors: Michael Kleinman, Alessandro Achille, Stefano Soatto

    Abstract: We show that the ability of a neural network to integrate information from diverse sources hinges critically on being exposed to properly correlated signals during the early phases of training. Interfering with the learning process during this initial stage can permanently impair the development of a skill, both in artificial and biological systems where the phenomenon is known as a critical learn… ▽ More

    Submitted 14 September, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: CVPR 2023 (Highlighted Paper)

  5. arXiv:2205.12239  [pdf, other

    cs.LG cs.CV cs.IT

    Gacs-Korner Common Information Variational Autoencoder

    Authors: Michael Kleinman, Alessandro Achille, Stefano Soatto, Jonathan Kao

    Abstract: We propose a notion of common information that allows one to quantify and separate the information that is shared between two random variables from the information that is unique to each. Our notion of common information is defined by an optimization problem over a family of functions and recovers the Gács-Körner common information as a special case. Importantly, our notion can be approximated emp… ▽ More

    Submitted 5 November, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS 2023

  6. arXiv:2010.02459  [pdf, other

    cs.LG cs.IT stat.ML

    Usable Information and Evolution of Optimal Representations During Training

    Authors: Michael Kleinman, Alessandro Achille, Daksh Idnani, Jonathan C. Kao

    Abstract: We introduce a notion of usable information contained in the representation learned by a deep network, and use it to study how optimal representations for the task emerge during training. We show that the implicit regularization coming from training with Stochastic Gradient Descent with a high learning-rate and small batch size plays an important role in learning minimal sufficient representations… ▽ More

    Submitted 28 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: ICLR 2021