Skip to main content

Showing 1–6 of 6 results for author: Weinberger, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.19031  [pdf, ps, other

    stat.ML cs.LG

    When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

    Authors: Chen Zeno, Hila Manor, Greg Ongie, Nir Weinberger, Tomer Michaeli, Daniel Soudry

    Abstract: While diffusion models generate high-quality images via probability flow, the theoretical understanding of this process remains incomplete. A key question is when probability flow converges to training samples or more general points on the data manifold. We analyze this by studying the probability flow of shallow ReLU neural network denoisers trained with minimal $\ell^2$ norm. For intuition, we i… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Accepted to the Forty-second International Conference on Machine Learning (ICML 2025)

  2. arXiv:2502.00206  [pdf, other

    cs.LG cs.DC cs.IT stat.ML

    BICompFL: Stochastic Federated Learning with Bi-Directional Compression

    Authors: Maximilian Egger, Rawad Bitar, Antonia Wachter-Zeh, Nir Weinberger, Deniz Gündüz

    Abstract: We address the prominent communication bottleneck in federated learning (FL). We specifically consider stochastic FL, in which models or compressed model updates are specified by distributions rather than deterministic parameters. Stochastic FL offers a principled approach to compression, and has been shown to reduce the communication load under perfect downlink transmission from the federator to… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

  3. arXiv:2402.13366  [pdf, other

    cs.LG stat.ML

    Statistical curriculum learning: An elimination algorithm achieving an oracle risk

    Authors: Omer Cohen, Ron Meir, Nir Weinberger

    Abstract: We consider a statistical version of curriculum learning (CL) in a parametric prediction setting. The learner is required to estimate a target parameter vector, and can adaptively collect samples from either the target model, or other source models that are similar to the target model, but less noisy. We consider three types of learners, depending on the level of side-information they receive. The… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  4. arXiv:2402.02265  [pdf, other

    cs.IT eess.SP stat.ML

    Characterization of the Distortion-Perception Tradeoff for Finite Channels with Arbitrary Metrics

    Authors: Dror Freirich, Nir Weinberger, Ron Meir

    Abstract: Whenever inspected by humans, reconstructed signals should not be distinguished from real ones. Typically, such a high perceptual quality comes at the price of high reconstruction error, and vice versa. We study this distortion-perception (DP) tradeoff over finite-alphabet channels, for the Wasserstein-$1$ distance induced by a general metric as the perception index, and an arbitrary distortion ma… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  5. arXiv:2401.10204  [pdf, ps, other

    cs.IT stat.ML

    Maximal-Capacity Discrete Memoryless Channel Identification

    Authors: Maximilian Egger, Rawad Bitar, Antonia Wachter-Zeh, Deniz Gündüz, Nir Weinberger

    Abstract: The problem of identifying the channel with the highest capacity among several discrete memoryless channels (DMCs) is considered. The problem is cast as a pure-exploration multi-armed bandit problem, which follows the practical use of training sequences to sense the communication channel statistics. A capacity estimator is proposed and tight confidence bounds on the estimator error are derived. Ba… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  6. arXiv:2311.06748  [pdf, other

    stat.ML cs.LG

    How do Minimum-Norm Shallow Denoisers Look in Function Space?

    Authors: Chen Zeno, Greg Ongie, Yaniv Blumenfeld, Nir Weinberger, Daniel Soudry

    Abstract: Neural network (NN) denoisers are an essential building block in many common tasks, ranging from image reconstruction to image generation. However, the success of these models is not well understood from a theoretical perspective. In this paper, we aim to characterize the functions realized by shallow ReLU NN denoisers -- in the common theoretical setting of interpolation (i.e., zero training loss… ▽ More

    Submitted 16 January, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems