Skip to main content

Showing 1–8 of 8 results for author: Stollenwerk, A

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.11095  [pdf, other

    cs.LG math.NA math.OC

    Flattened one-bit stochastic gradient descent: compressed distributed optimization with controlled variance

    Authors: Alexander Stollenwerk, Laurent Jacques

    Abstract: We propose a novel algorithm for distributed stochastic gradient descent (SGD) with compressed gradient communication in the parameter-server framework. Our gradient compression technique, named flattened one-bit stochastic gradient descent (FO-SGD), relies on two simple algorithmic ideas: (i) a one-bit quantization procedure leveraging the technique of dithering, and (ii) a randomized fast Walsh-… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 20 pages

    Report number: TR.2024.01

  2. arXiv:2204.04109  [pdf, ps, other

    math.PR cs.IT

    Fast metric embedding into the Hamming cube

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We consider the problem of embedding a subset of $\mathbb{R}^n$ into a low-dimensional Hamming cube in an almost isometric way. We construct a simple, data-oblivious, and computationally efficient map that achieves this task with high probability: we first apply a specific structured random matrix, which we call the double circulant matrix; using that matrix requires linear storage and matrix-vect… ▽ More

    Submitted 6 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Added new, near-optimal result on fast near-isometric embedding of $\ell_2^n$ into $\ell_1^m$

  3. arXiv:2201.05204  [pdf, ps, other

    math.PR cs.IT

    Sharp estimates on random hyperplane tessellations

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We study the problem of generating a hyperplane tessellation of an arbitrary set $T$ in $\mathbb{R}^n$, ensuring that the Euclidean distance between any two points corresponds to the fraction of hyperplanes separating them up to a pre-specified error $δ$. We focus on random gaussian tessellations with uniformly distributed shifts and derive sharp bounds on the number of hyperplanes $m$ that are re… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  4. arXiv:2108.00207  [pdf, other

    cs.LG math.ST

    The Separation Capacity of Random Neural Networks

    Authors: Sjoerd Dirksen, Martin Genzel, Laurent Jacques, Alexander Stollenwerk

    Abstract: Neural networks with random weights appear in a variety of machine learning applications, most prominently as the initialization of many deep learning algorithms and as a computationally cheap alternative to fully learned neural networks. In the present article, we enhance the theoretical understanding of random neural networks by addressing the following data separation problem: under what condit… ▽ More

    Submitted 28 November, 2022; v1 submitted 31 July, 2021; originally announced August 2021.

    Comments: The current version of the manuscript has been accepted to Journal of Machine Learning Research

    Journal ref: J. Mach. Learn. Res. 23:309 (2022) 1-47

  5. A Unified Approach to Uniform Signal Recovery From Non-Linear Observations

    Authors: Martin Genzel, Alexander Stollenwerk

    Abstract: Recent advances in quantized compressed sensing and high-dimensional estimation have shown that signal recovery is even feasible under strong non-linear distortions in the observation process. An important characteristic of associated guarantees is uniformity, i.e., recovery succeeds for an entire class of structured signals with a fixed measurement ensemble. However, despite significant results i… ▽ More

    Submitted 6 January, 2022; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: to be published in Foundations of Computational Mathematics (accepted version)

    Journal ref: Found. Comut. Math. 23:3 (2023) 899-972

  6. arXiv:2009.08320  [pdf, ps, other

    cs.IT cs.DS math.MG

    Binarized Johnson-Lindenstrauss embeddings

    Authors: Sjoerd Dirksen, Alexander Stollenwerk

    Abstract: We consider the problem of encoding a set of vectors into a minimal number of bits while preserving information on their Euclidean geometry. We show that this task can be accomplished by applying a Johnson-Lindenstrauss embedding and subsequently binarizing each vector by comparing each entry of the vector to a uniformly random threshold. Using this simple construction we produce two encodings of… ▽ More

    Submitted 11 April, 2022; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: The results of this preprint have been strongly improved and expanded. The current preprint is no longer intended for publication and has been replaced by two new preprints, posted as arXiv:2201.05204 and arXiv:2204.04109

  7. arXiv:1911.07816  [pdf, other

    cs.IT math.PR

    Quantized Compressed Sensing by Rectified Linear Units

    Authors: Hans Christian Jung, Johannes Maly, Lars Palzer, Alexander Stollenwerk

    Abstract: This work is concerned with the problem of recovering high-dimensional signals $\mathbf{x} \in \mathbb{R}^n$ which belong to a convex set of low-complexity from a small number of quantized measurements. We propose to estimate the signals via a convex program based on rectified linear units (ReLUs) for two different quantization schemes, namely one-bit and uniform multi-bit quantization. Assuming t… ▽ More

    Submitted 26 March, 2021; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 40 pages, 5 figures

    MSC Class: 62B10 ACM Class: G.3

  8. Robust 1-Bit Compressed Sensing via Hinge Loss Minimization

    Authors: Martin Genzel, Alexander Stollenwerk

    Abstract: This work theoretically studies the problem of estimating a structured high-dimensional signal $x_0 \in \mathbb{R}^n$ from noisy $1$-bit Gaussian measurements. Our recovery approach is based on a simple convex program which uses the hinge loss function as data fidelity term. While such a risk minimization strategy is very natural to learn binary output models, such as in classification, its capaci… ▽ More

    Submitted 30 May, 2020; v1 submitted 13 April, 2018; originally announced April 2018.

    MSC Class: 94A12; 60D05; 90C25

    Journal ref: Inf. Inference 9.2 (2020), 361-422