-
Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation
Abstract: We study the mean estimation problem under communication and local differential privacy constraints. While previous work has proposed \emph{order}-optimal algorithms for the same problem (i.e., asymptotically optimal as we spend more bits), \emph{exact} optimality (in the non-asymptotic setting) still has not been achieved. In this work, we take a step towards characterizing the \emph{exact}-optim… ▽ More
Submitted 28 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.
Comments: Published at the Conference on Neural Information Processing Systems (NeurIPS), 2023
-
Neural Tangent Kernel Analysis of Deep Narrow Neural Networks
Abstract: The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we present the first trainability guarantee of infinitely deep but narrow neural networks. We study the infinite-depth limit of a multilayer perceptron (MLP) with a… ▽ More
Submitted 27 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.
Journal ref: Published in International Conference on Machine Learning, 2022
-
An Information-Theoretic Justification for Model Pruning
Abstract: We study the neural network (NN) compression problem, viewing the tension between the compression ratio and NN performance through the lens of rate-distortion theory. We choose a distortion metric that reflects the effect of NN compression on the model output and derive the tradeoff between rate (compression) and distortion. In addition to characterizing theoretical limits of NN compression, this… ▽ More
Submitted 9 February, 2022; v1 submitted 16 February, 2021; originally announced February 2021.
Comments: Published in the International Conference on Artificial Intelligence and Statistics (AISTATS) 2022. Previous titles: 1) Rate-Distortion Theoretic Model Compression: Successive Refinement for Pruning, 2) Successive pruning for model compression via rate distortion theory