Skip to main content

Showing 1–4 of 4 results for author: Hafez-Kolahi, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.07537  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Information-Theoretic Analysis of Minimax Excess Risk

    Authors: Hassan Hafez-Kolahi, Behrad Moniri, Shohreh Kasaei

    Abstract: Two main concepts studied in machine learning theory are generalization gap (difference between train and test error) and excess risk (difference between test error and the minimum possible error). While information-theoretic tools have been used extensively to study the generalization gap of learning algorithms, the information-theoretic nature of excess risk has not yet been fully investigated.… ▽ More

    Submitted 28 February, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Published in the IEEE Transactions on Information Theory

  2. arXiv:2105.04180  [pdf, other

    cs.LG cs.IT

    Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning

    Authors: Hassan Hafez-Kolahi, Behrad Moniri, Shohreh Kasaei, Mahdieh Soleymani Baghshah

    Abstract: In parametric Bayesian learning, a prior is assumed on the parameter $W$ which determines the distribution of samples. In this setting, Minimum Excess Risk (MER) is defined as the difference between the minimum expected loss achievable when learning from data and the minimum expected loss that could be achieved if $W$ was observed. In this paper, we build upon and extend the recent results of (Xu… ▽ More

    Submitted 17 July, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Accepted at ICML 2021

  3. arXiv:1909.09706  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Do Compressed Representations Generalize Better?

    Authors: Hassan Hafez-Kolahi, Shohreh Kasaei, Mahdiyeh Soleymani-Baghshah

    Abstract: One of the most studied problems in machine learning is finding reasonable constraints that guarantee the generalization of a learning algorithm. These constraints are usually expressed as some simplicity assumptions on the target. For instance, in the Vapnik-Chervonenkis (VC) theory the space of possible hypotheses is considered to have a limited VC dimension. In this paper, the constraint on the… ▽ More

    Submitted 2 January, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

  4. arXiv:1904.03743  [pdf, other

    cs.LG cs.IT stat.ML

    Information Bottleneck and its Applications in Deep Learning

    Authors: Hassan Hafez-Kolahi, Shohreh Kasaei

    Abstract: Information Theory (IT) has been used in Machine Learning (ML) from early days of this field. In the last decade, advances in Deep Neural Networks (DNNs) have led to surprising improvements in many applications of ML. The result has been a paradigm shift in the community toward revisiting previous ideas and applications in this new framework. Ideas from IT are no exception. One of the ideas which… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.