Skip to main content

Showing 1–6 of 6 results for author: Fogel, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07661  [pdf, ps, other

    cs.LG cs.IT stat.ML

    The Universality Lens: Why Even Highly Over-Parametrized Models Learn Well

    Authors: Meir Feder, Ruediger Urbanke, Yaniv Fogel

    Abstract: A fundamental question in modern machine learning is why large, over-parameterized models, such as deep neural networks and transformers, tend to generalize well, even when their number of parameters far exceeds the number of training samples. We investigate this phenomenon through the lens of information theory, grounded in universal learning theory. Specifically, we study a Bayesian mixture le… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2209.08764  [pdf, other

    cs.DC cs.CC cs.DS

    An Optimal Level-synchronous Shared-memory Parallel BFS Algorithm with Optimal parallel Prefix-sum Algorithm and its Implications for Energy Consumption

    Authors: Jesmin Jahan Tithi, Yonatan Fogel, Rezaul Chowdhury

    Abstract: We present a work-efficient parallel level-synchronous Breadth First Search (BFS) algorithm for shared-memory architectures which achieves the theoretical lower bound on parallel running time. The optimality holds regardless of the shape of the graph. We also demonstrate the implication of this optimality for the energy consumption of the program empirically. The key idea is never to use more proc… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 2 pages, brief announcement

  3. arXiv:2011.10334  [pdf, other

    cs.LG stat.ML

    Efficient Data-Dependent Learnability

    Authors: Yaniv Fogel, Tal Shapira, Meir Feder

    Abstract: The predictive normalized maximum likelihood (pNML) approach has recently been proposed as the min-max optimal solution to the batch learning problem where both the training set and the test data feature are individuals, known sequences. This approach has yields a learnability measure that can also be interpreted as a stability measure. This measure has shown some potential in detecting out-of-dis… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 12 pages, 10 figures

  4. arXiv:1905.04708  [pdf, other

    cs.LG cs.IT stat.ML

    A New Look at an Old Problem: A Universal Learning Approach to Linear Regression

    Authors: Koby Bibas, Yaniv Fogel, Meir Feder

    Abstract: Linear regression is a classical paradigm in statistics. A new look at it is provided via the lens of universal learning. In applying universal learning to linear regression the hypotheses class represents the label $y\in {\cal R}$ as a linear combination of the feature vector $x^Tθ$ where $x\in {\cal R}^M$, within a Gaussian error. The Predictive Normalized Maximum Likelihood (pNML) solution for… ▽ More

    Submitted 12 May, 2019; originally announced May 2019.

  5. arXiv:1904.12286  [pdf, other

    cs.LG stat.ML

    Deep pNML: Predictive Normalized Maximum Likelihood for Deep Neural Networks

    Authors: Koby Bibas, Yaniv Fogel, Meir Feder

    Abstract: The Predictive Normalized Maximum Likelihood (pNML) scheme has been recently suggested for universal learning in the individual setting, where both the training and test samples are individual data. The goal of universal learning is to compete with a ``genie'' or reference learner that knows the data values, but is restricted to use a learner from a given model class. The pNML minimizes the associ… ▽ More

    Submitted 8 January, 2020; v1 submitted 28 April, 2019; originally announced April 2019.

  6. arXiv:1812.09520  [pdf, other

    cs.IT cs.LG stat.ML

    Universal Supervised Learning for Individual Data

    Authors: Yaniv Fogel, Meir Feder

    Abstract: Universal supervised learning is considered from an information theoretic point of view following the universal prediction approach, see Merhav and Feder (1998). We consider the standard supervised "batch" learning where prediction is done on a test sample once the entire training data is observed, and the individual setting where the features and labels, both in the training and test, are specifi… ▽ More

    Submitted 22 December, 2018; originally announced December 2018.