Showing 1–15 of 15 results for author: Weissman, T

Searching in archive stat.
  1. arXiv:2306.12625  [pdf, other]

    cs.LG cs.DC stat.ML

    Adaptive Compression in Federated Learning via Side Information

    Authors: Berivan Isik, Francesco Pase, Deniz Gunduz, Sanmi Koyejo, Tsachy Weissman, Michele Zorzi

    Abstract: The high communication cost of sending model updates from the clients to the server is a significant bottleneck for scalable federated learning (FL). Among existing approaches, state-of-the-art bitrate-accuracy tradeoffs have been achieved using stochastic compression methods -- in which the client $n$ sends a sample from a client-only probability distribution $q_{φ^{(n)}}$, and the server estimat…

    Submitted 21 April, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Published at the International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

  2. arXiv:2306.04924  [pdf, other]

    cs.LG cs.CR cs.DC cs.IT stat.ML

    Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation

    Authors: Berivan Isik, Wei-Ning Chen, Ayfer Ozgur, Tsachy Weissman, Albert No

    Abstract: We study the mean estimation problem under communication and local differential privacy constraints. While previous work has proposed \emph{order}-optimal algorithms for the same problem (i.e., asymptotically optimal as we spend more bits), \emph{exact} optimality (in the non-asymptotic setting) still has not been achieved. In this work, we take a step towards characterizing the \emph{exact}-optim…

    Submitted 28 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Published at the Conference on Neural Information Processing Systems (NeurIPS), 2023

  3. arXiv:2209.15328  [pdf, other]

    cs.LG stat.AP stat.ML

    Sparse Random Networks for Communication-Efficient Federated Learning

    Authors: Berivan Isik, Francesco Pase, Deniz Gunduz, Tsachy Weissman, Michele Zorzi

    Abstract: One main challenge in federated learning is the large communication cost of exchanging weight updates from clients to the server at each round. While prior work has made great progress in compressing the weight updates through gradient compression methods, we propose a radically different approach that does not update the weights at all. Instead, our method freezes the weights at their initial \em…

    Submitted 8 February, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2023

  4. arXiv:2102.08329  [pdf, other]

    cs.LG cs.IT eess.SP stat.ML

    An Information-Theoretic Justification for Model Pruning

    Authors: Berivan Isik, Tsachy Weissman, Albert No

    Abstract: We study the neural network (NN) compression problem, viewing the tension between the compression ratio and NN performance through the lens of rate-distortion theory. We choose a distortion metric that reflects the effect of NN compression on the model output and derive the tradeoff between rate (compression) and distortion. In addition to characterizing theoretical limits of NN compression, this…

    Submitted 9 February, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Published in the International Conference on Artificial Intelligence and Statistics (AISTATS) 2022. Previous titles: 1) Rate-Distortion Theoretic Model Compression: Successive Refinement for Pruning, 2) Successive pruning for model compression via rate distortion theory

  5. arXiv:2007.04568  [pdf, ps, other]

    cs.LG cs.GT cs.IT stat.ML

    Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions

    Authors: Yanjun Han, Zhengyuan Zhou, Aaron Flores, Erik Ordentlich, Tsachy Weissman

    Abstract: First-price auctions have very recently swept the online advertising industry, replacing second-price auctions as the predominant auction mechanism on many platforms. This shift has brought forth important challenges for a bidder: how should one bid in a first-price auction, where unlike in second-price auctions, it is no longer optimal to bid one's private value truthfully and hard to know the ot…

    Submitted 9 July, 2020; originally announced July 2020.

  6. arXiv:2003.09795  [pdf, other]

    cs.LG cs.GT cs.IT stat.ME stat.ML

    Optimal No-regret Learning in Repeated First-price Auctions

    Authors: Yanjun Han, Zhengyuan Zhou, Tsachy Weissman

    Abstract: We study online learning in repeated first-price auctions where a bidder, only observing the winning bid at the end of each auction, learns to adaptively bid in order to maximize her cumulative payoff. To achieve this goal, the bidder faces censored feedback: if she wins the bid, then she is not able to observe the highest bid of the other bidders, which we assume is \textit{iid} drawn from an unk…

    Submitted 4 March, 2024; v1 submitted 21 March, 2020; originally announced March 2020.

    Comments: To appear in Operations Research

  7. arXiv:1811.07557  [pdf, other]

    cs.LG cs.IT stat.ML

    Neural Joint Source-Channel Coding

    Authors: Kristy Choi, Kedar Tatwawadi, Aditya Grover, Tsachy Weissman, Stefano Ermon

    Abstract: For reliable transmission across a noisy communication channel, classical results from information theory show that it is asymptotically optimal to separate out the source and channel coding processes. However, this decomposition can fall short in the finite bit-length regime, as it requires non-trivial tuning of hand-crafted codes and assumes infinite computational power for decoding. In this wor…

    Submitted 14 May, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

  8. arXiv:1802.08417  [pdf, ps, other]

    cs.DC cs.IT stat.ME

    Geometric Lower Bounds for Distributed Parameter Estimation under Communication Constraints

    Authors: Yanjun Han, Ayfer Özgür, Tsachy Weissman

    Abstract: We consider parameter estimation in distributed networks, where each sensor in the network observes an independent sample from an underlying distribution and has $k$ bits to communicate its sample to a centralized processor which computes an estimate of a desired parameter. We develop lower bounds for the minimax risk of estimating the underlying parameter for a large class of losses and distribut…

    Submitted 22 July, 2021; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: This version (v4) added a new corollary on logistic regression, as well as more discussions on sparse Gaussian mean estimation, compared to v3

    Journal ref: published in COLT 2018

  9. arXiv:1802.08405  [pdf, ps, other]

    stat.ME cs.IT cs.LG

    Local moment matching: A unified methodology for symmetric functional estimation and distribution estimation under Wasserstein distance

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman

    Abstract: We present \emph{Local Moment Matching (LMM)}, a unified methodology for symmetric functional estimation and distribution estimation under Wasserstein distance. We construct an efficiently computable estimator that achieves the minimax rates in estimating the distribution up to permutation, and show that the plug-in approach of our unlabeled distribution estimator is "universal" in estimating symm…

    Submitted 26 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

  10. arXiv:1802.07889  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Entropy Rate Estimation for Markov Chains with Large State Space

    Authors: Yanjun Han, Jiantao Jiao, Chuan-Zheng Lee, Tsachy Weissman, Yihong Wu, Tiancheng Yu

    Abstract: Estimating the entropy based on data is one of the prototypical problems in distribution property testing and estimation. For estimating the Shannon entropy of a distribution on $S$ elements with independent samples, [Paninski2004] showed that the sample complexity is sublinear in $S$, and [Valiant--Valiant2011] showed that consistent estimation of Shannon entropy is possible if and only if the sa…

    Submitted 24 September, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at NIPS 2018

  11. arXiv:1712.07177  [pdf, other]

    cs.LG stat.ML

    Approximate Profile Maximum Likelihood

    Authors: Dmitri S. Pavlichin, Jiantao Jiao, Tsachy Weissman

    Abstract: We propose an efficient algorithm for approximate computation of the profile maximum likelihood (PML), a variant of maximum likelihood maximizing the probability of observing a sufficient statistic rather than the empirical sample. The PML has appealing theoretical properties, but is difficult to compute exactly. Inspired by observations gleaned from exactly solvable cases, we look for an approxim…

    Submitted 19 December, 2017; originally announced December 2017.

  12. arXiv:1711.02141  [pdf, ps, other]

    math.ST cs.IT stat.ME

    Optimal rates of entropy estimation over Lipschitz balls

    Authors: Yanjun Han, Jiantao Jiao, Tsachy Weissman, Yihong Wu

    Abstract: We consider the problem of minimax estimation of the entropy of a density over Lipschitz balls. Dropping the usual assumption that the density is bounded away from zero, we obtain the minimax rates $(n\ln n)^{-s/(s+d)} + n^{-1/2}$ for $0<s\leq 2$ for densities supported on $[0,1]^d$, where $s$ is the smoothness parameter and $n$ is the number of independent samples. We generalize the results to de…

    Submitted 10 November, 2019; v1 submitted 6 November, 2017; originally announced November 2017.

  13. arXiv:1707.01203  [pdf, ps, other]

    cs.IT stat.ML

    Estimating the Fundamental Limits is Easier than Achieving the Fundamental Limits

    Authors: Jiantao Jiao, Yanjun Han, Irena Fischer-Hwang, Tsachy Weissman

    Abstract: We show through case studies that it is easier to estimate the fundamental limits of data processing than to construct explicit algorithms to achieve those limits. Focusing on binary classification, data compression, and prediction under logarithmic loss, we show that in the finite space setting, when it is possible to construct an estimator of the limits with vanishing error with $n$ samples, it…

    Submitted 1 October, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

  14. arXiv:1611.01186  [pdf, other]

    cs.NE cs.LG stat.ML

    Demystifying ResNet

    Authors: Sihan Li, Jiantao Jiao, Yanjun Han, Tsachy Weissman

    Abstract: The Residual Network (ResNet), proposed in He et al. (2015), utilized shortcut connections to significantly reduce the difficulty of training, which resulted in great performance boosts in terms of both training and generalization error. It was empirically observed in He et al. (2015) that stacking more layers of residual blocks with shortcut 2 results in smaller training error, while it is not…

    Submitted 20 May, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  15. arXiv:1409.7458  [pdf, ps, other]

    stat.ME cs.DS cs.IT stat.ML

    Beyond Maximum Likelihood: from Theory to Practice

    Authors: Jiantao Jiao, Kartik Venkat, Yanjun Han, Tsachy Weissman

    Abstract: Maximum likelihood is the most widely used statistical estimation technique. Recent work by the authors introduced a general methodology for the construction of estimators for functionals in parametric models, and demonstrated improvements - both in theory and in practice - over the maximum likelihood estimator (MLE), particularly in high dimensional scenarios involving parameter dimension compara…

    Submitted 25 September, 2014; originally announced September 2014.