Skip to main content

Showing 1–8 of 8 results for author: Harutyunyan, H

Searching in archive stat. Search in all archives.
.
  1. Formal limitations of sample-wise information-theoretic generalization bounds

    Authors: Hrayr Harutyunyan, Greg Ver Steeg, Aram Galstyan

    Abstract: Some of the tightest information-theoretic generalization bounds depend on the average information between the learned hypothesis and a single training example. However, these sample-wise bounds were derived only for expected generalization gap. We show that even for expected squared generalization gap no such sample-wise information-theoretic bounds exist. The same is true for PAC-Bayes and singl… ▽ More

    Submitted 13 December, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: 2022 IEEE Information Theory Workshop

  2. arXiv:2110.01584  [pdf, other

    cs.LG stat.ML

    Information-theoretic generalization bounds for black-box learning algorithms

    Authors: Hrayr Harutyunyan, Maxim Raginsky, Greg Ver Steeg, Aram Galstyan

    Abstract: We derive information-theoretic generalization bounds for supervised learning algorithms based on the information contained in predictions rather than in the output of the training algorithm. These bounds improve over the existing information-theoretic bounds, are applicable to a wider range of algorithms, and solve two key challenges: (a) they give meaningful results for deterministic algorithms… ▽ More

    Submitted 5 October, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  3. arXiv:2101.06640  [pdf, other

    cs.LG stat.ML

    Estimating informativeness of samples with Smooth Unique Information

    Authors: Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: We define a notion of information that an individual sample provides to the training of a neural network, and we specialize it to measure both how much a sample informs the final weights and how much it informs the function computed by the weights. Though related, we show that these quantities have a qualitatively different behavior. We give efficient approximations of these quantities using a lin… ▽ More

    Submitted 28 March, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: ICLR 2021, 22 pages

  4. arXiv:2002.07933  [pdf, other

    cs.LG stat.ML

    Improving Generalization by Controlling Label-Noise Information in Neural Network Weights

    Authors: Hrayr Harutyunyan, Kyle Reing, Greg Ver Steeg, Aram Galstyan

    Abstract: In the presence of noisy or incorrect labels, neural networks have the undesirable tendency to memorize information about the noise. Standard regularization techniques such as dropout, weight decay or data augmentation sometimes help, but do not prevent this behavior. If one considers neural network weights as random variables that depend on the data and stochasticity of training, the amount of me… ▽ More

    Submitted 20 November, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: ICML, 2020

  5. arXiv:1905.13276  [pdf, other

    cs.LG stat.ML

    Efficient Covariance Estimation from Temporal Data

    Authors: Hrayr Harutyunyan, Daniel Moyer, Hrant Khachatrian, Greg Ver Steeg, Aram Galstyan

    Abstract: Estimating the covariance structure of multivariate time series is a fundamental problem with a wide-range of real-world applications -- from financial modeling to fMRI analysis. Despite significant recent advances, current state-of-the-art methods are still severely limited in terms of scalability, and do not work well in high-dimensional undersampled regimes. In this work we propose a novel meth… ▽ More

    Submitted 11 February, 2021; v1 submitted 30 May, 2019; originally announced May 2019.

  6. arXiv:1905.00067  [pdf, other

    cs.LG cs.SI stat.ML

    MixHop: Higher-Order Graph Convolutional Architectures via Sparsified Neighborhood Mixing

    Authors: Sami Abu-El-Haija, Bryan Perozzi, Amol Kapoor, Nazanin Alipourfard, Kristina Lerman, Hrayr Harutyunyan, Greg Ver Steeg, Aram Galstyan

    Abstract: Existing popular methods for semi-supervised learning with Graph Neural Networks (such as the Graph Convolutional Network) provably cannot learn a general class of neighborhood mixing relationships. To address this weakness, we propose a new model, MixHop, that can learn these relationships, including difference operators, by repeatedly mixing feature representations of neighbors at various distan… ▽ More

    Submitted 19 June, 2019; v1 submitted 30 April, 2019; originally announced May 2019.

  7. arXiv:1706.03353  [pdf, other

    stat.ML cs.IT

    Fast structure learning with modular regularization

    Authors: Greg Ver Steeg, Hrayr Harutyunyan, Daniel Moyer, Aram Galstyan

    Abstract: Estimating graphical model structure from high-dimensional and undersampled data is a fundamental problem in many scientific fields. Existing approaches, such as GLASSO, latent variable GLASSO, and latent tree models, suffer from high computational complexity and may impose unrealistic sparsity priors in some cases. We introduce a novel method that leverages a newly discovered connection between i… ▽ More

    Submitted 6 September, 2019; v1 submitted 11 June, 2017; originally announced June 2017.

    Comments: 22 pages, accepted to NeurIPS 2019

  8. Multitask learning and benchmarking with clinical time series data

    Authors: Hrayr Harutyunyan, Hrant Khachatrian, David C. Kale, Greg Ver Steeg, Aram Galstyan

    Abstract: Health care is one of the most exciting frontiers in data mining and machine learning. Successful adoption of electronic health records (EHRs) created an explosion in digital clinical data available for analysis, but progress in machine learning for healthcare research has been difficult to measure because of the absence of publicly available benchmark data sets. To address this problem, we propos… ▽ More

    Submitted 9 August, 2019; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: This version of the paper adds details about the generation of the benchmark tasks and describes improved neural baselines

    Journal ref: Scientific Data 6 (2019) 96