Skip to main content

Showing 1–24 of 24 results for author: Dirksen, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.19695  [pdf, ps, other

    stat.ML cs.LG math.PR

    Near-optimal estimates for the $\ell^p$-Lipschitz constants of deep random ReLU neural networks

    Authors: Sjoerd Dirksen, Patrick Finke, Paul Geuchen, Dominik Stöger, Felix Voigtlaender

    Abstract: This paper studies the $\ell^p$-Lipschitz constants of ReLU neural networks $Φ: \mathbb{R}^d \to \mathbb{R}$ with random parameters for $p \in [1,\infty]$. The distribution of the weights follows a variant of the He initialization and the biases are drawn from symmetric distributions. We derive high probability upper and lower bounds for wide networks that differ at most by a factor that is logari… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: The introduction will still be expanded with additional references

    MSC Class: 68T07; 26A16; 60B20; 60G15

  2. arXiv:2505.15351  [pdf, other

    cs.IT math.OC

    Phasebook: A Survey of Selected Open Problems in Phase Retrieval

    Authors: Marc Allain, Selin Aslan, Wim Coene, Sjoerd Dirksen, Jonathan Dong, Julien Flamant, Mark Iwen, Felix Krahmer, Tristan van Leeuwen, Oleh Melnyk, Andreas Menzel, Allard P. Mosk, Viktor Nikitin, Gerlind Plonka, Palina Salanevich, Matthias Wellershoff

    Abstract: Phase retrieval is an inverse problem that, on one hand, is crucial in many applications across imaging and physics, and, on the other hand, leads to deep research questions in theoretical signal processing and applied harmonic analysis. This survey paper is an outcome of the recent workshop Phase Retrieval in Mathematics and Applications (PRiMA) (held on August 5--9 2024 at the Lorentz Center in… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2502.17037  [pdf, ps, other

    cs.IT

    Subspace and DOA estimation under coarse quantization

    Authors: Sjoerd Dirksen, Weilin Li, Johannes Maly

    Abstract: We study direction-of-arrival (DOA) estimation from coarsely quantized data. We focus on a two-step approach which first estimates the signal subspace via covariance estimation and then extracts DOA angles by the ESPRIT algorithm. In particular, we analyze two stochastic quantization schemes which use dithering: a one-bit quantizer combined with rectangular dither and a multi-bit quantizer with tr… ▽ More

    Submitted 11 August, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  4. arXiv:2502.13263  [pdf, ps, other

    cs.IT math.PR

    Spectral method for low-dose Poisson and Bernoulli phase retrieval

    Authors: Sjoerd Dirksen, Felix Krahmer, Patricia Römer, Palina Salanevich

    Abstract: We consider the problem of phaseless reconstruction from measurements with Poisson or Bernoulli distributed noise. This is of particular interest in biological imaging experiments where a low dose of radiation has to be used to mitigate potential damage of the specimen, resulting in low observed particle counts. We derive recovery guarantees for the spectral method for these noise models in the ca… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  5. arXiv:2310.00327  [pdf, other

    stat.ML cs.LG math.ST

    Memorization With Neural Nets: Going Beyond the Worst Case

    Authors: Sjoerd Dirksen, Patrick Finke, Martin Genzel

    Abstract: In practice, deep neural networks are often able to easily interpolate their training data. To understand this phenomenon, many works have aimed to quantify the memorization capacity of a neural network architecture: the largest number of points such that the architecture can interpolate any placement of these points with any assignment of labels. For real-world data, however, one intuitively expe… ▽ More

    Submitted 6 December, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: The current version of the manuscript has been accepted to Journal of Machine Learning Research

    Journal ref: J. Mach. Learn. Res. 25:347 (2024) 1-38

  6. arXiv:2307.12613  [pdf, other

    math.ST cs.IT

    Tuning-free one-bit covariance estimation using data-driven dithering

    Authors: Sjoerd Dirksen, Johannes Maly

    Abstract: We consider covariance estimation of any subgaussian distribution from finitely many i.i.d. samples that are quantized to one bit of information per entry. Recent work has shown that a reliable estimator can be constructed if uniformly distributed dithers on $[-λ,λ]$ are used in the one-bit quantizer. This estimator enjoys near-minimax optimal, non-asymptotic error estimates in the operator and Fr… ▽ More

    Submitted 12 January, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

  7. arXiv:2301.04641  [pdf, other

    cs.IT eess.SP

    Plug-in Channel Estimation with Dithered Quantized Signals in Spatially Non-Stationary Massive MIMO Systems

    Authors: Tianyu Yang, Johannes Maly, Sjoerd Dirksen, Giuseppe Caire

    Abstract: As the array dimension of massive MIMO systems increases to unprecedented levels, two problems occur. First, the spatial stationarity assumption along the antenna elements is no longer valid. Second, the large array size results in an unacceptably high power consumption if high-resolution analog-to-digital converters are used. To address these two challenges, we consider a Bussgang linear minimum… ▽ More

    Submitted 24 January, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: submitted to IEEE Transactions on Communications

  8. arXiv:2204.04109  [pdf, ps, other

    math.PR cs.IT

    Fast metric embedding into the Hamming cube

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We consider the problem of embedding a subset of $\mathbb{R}^n$ into a low-dimensional Hamming cube in an almost isometric way. We construct a simple, data-oblivious, and computationally efficient map that achieves this task with high probability: we first apply a specific structured random matrix, which we call the double circulant matrix; using that matrix requires linear storage and matrix-vect… ▽ More

    Submitted 6 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Added new, near-optimal result on fast near-isometric embedding of $\ell_2^n$ into $\ell_1^m$

  9. arXiv:2201.05204  [pdf, ps, other

    math.PR cs.IT

    Sharp estimates on random hyperplane tessellations

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We study the problem of generating a hyperplane tessellation of an arbitrary set $T$ in $\mathbb{R}^n$, ensuring that the Euclidean distance between any two points corresponds to the fraction of hyperplanes separating them up to a pre-specified error $δ$. We focus on random gaussian tessellations with uniformly distributed shifts and derive sharp bounds on the number of hyperplanes $m$ that are re… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  10. arXiv:2108.00207  [pdf, other

    cs.LG math.ST

    The Separation Capacity of Random Neural Networks

    Authors: Sjoerd Dirksen, Martin Genzel, Laurent Jacques, Alexander Stollenwerk

    Abstract: Neural networks with random weights appear in a variety of machine learning applications, most prominently as the initialization of many deep learning algorithms and as a computationally cheap alternative to fully learned neural networks. In the present article, we enhance the theoretical understanding of random neural networks by addressing the following data separation problem: under what condit… ▽ More

    Submitted 28 November, 2022; v1 submitted 31 July, 2021; originally announced August 2021.

    Comments: The current version of the manuscript has been accepted to Journal of Machine Learning Research

    Journal ref: J. Mach. Learn. Res. 23:309 (2022) 1-47

  11. arXiv:2104.01280  [pdf, other

    cs.IT math.ST

    Covariance estimation under one-bit quantization

    Authors: Sjoerd Dirksen, Johannes Maly, Holger Rauhut

    Abstract: We consider the classical problem of estimating the covariance matrix of a subgaussian distribution from i.i.d. samples in the novel context of coarse quantization, i.e., instead of having full knowledge of the samples, they are quantized to one or two bits per entry. This problem occurs naturally in signal processing applications. We introduce new estimators in two different quantization scenario… ▽ More

    Submitted 22 April, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

  12. arXiv:2009.08320  [pdf, ps, other

    cs.IT cs.DS math.MG

    Binarized Johnson-Lindenstrauss embeddings

    Authors: Sjoerd Dirksen, Alexander Stollenwerk

    Abstract: We consider the problem of encoding a set of vectors into a minimal number of bits while preserving information on their Euclidean geometry. We show that this task can be accomplished by applying a Johnson-Lindenstrauss embedding and subsequently binarizing each vector by comparing each entry of the vector to a uniformly random threshold. Using this simple construction we produce two encodings of… ▽ More

    Submitted 11 April, 2022; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: The results of this preprint have been strongly improved and expanded. The current preprint is no longer intended for publication and has been replaced by two new preprints, posted as arXiv:2201.05204 and arXiv:2204.04109

  13. arXiv:2007.04005  [pdf, other

    stat.ML cs.LG physics.ao-ph stat.AP

    Statistical post-processing of wind speed forecasts using convolutional neural networks

    Authors: Simon Veldkamp, Kirien Whan, Sjoerd Dirksen, Maurice Schmeits

    Abstract: Current statistical post-processing methods for probabilistic weather forecasting are not capable of using full spatial patterns from the numerical weather prediction (NWP) model. In this paper we incorporate spatial wind speed information by using convolutional neural networks (CNNs) and obtain probabilistic wind speed forecasts in the Netherlands for 48 hours ahead, based on KNMI's deterministic… ▽ More

    Submitted 8 January, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 44 pages, 5 figures

  14. arXiv:2005.06994  [pdf, ps, other

    cs.IT math.NA

    Sparse recovery in bounded Riesz systems with applications to numerical methods for PDEs

    Authors: Simone Brugiapaglia, Sjoerd Dirksen, Hans Christian Jung, Holger Rauhut

    Abstract: We study sparse recovery with structured random measurement matrices having independent, identically distributed, and uniformly bounded rows and with a nontrivial covariance structure. This class of matrices arises from random sampling of bounded Riesz systems and generalizes random partial Fourier matrices. Our main result improves the currently available results for the null space and restricted… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

  15. arXiv:1812.06719  [pdf, ps, other

    cs.IT eess.SP math.PR

    Robust one-bit compressed sensing with partial circulant matrices

    Authors: Sjoerd Dirksen, Shahar Mendelson

    Abstract: We present optimal sample complexity estimates for one-bit compressed sensing problems in a realistic scenario: the procedure uses a structured matrix (a randomly sub-sampled circulant matrix) and is robust to analog pre-quantization noise as well as to adversarial bit corruptions in the quantization process. Our results imply that quantization is not a statistically expensive procedure in the pre… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

  16. arXiv:1805.09409  [pdf, other

    cs.IT math.PR

    Non-Gaussian Hyperplane Tessellations and Robust One-Bit Compressed Sensing

    Authors: Sjoerd Dirksen, Shahar Mendelson

    Abstract: We show that a tessellation generated by a small number of random affine hyperplanes can be used to approximate Euclidean distances between any two points in an arbitrary bounded set $T$, where the random hyperplanes are generated by subgaussian or heavy-tailed normal vectors and uniformly distributed shifts. We derive quantitative bounds on the number of hyperplanes needed for constructing such t… ▽ More

    Submitted 13 August, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: Title and presentation changed, typos corrected

  17. arXiv:1710.03287  [pdf, ps, other

    cs.IT math.PR

    One-bit compressed sensing with partial Gaussian circulant matrices

    Authors: Sjoerd Dirksen, Hans Christian Jung, Holger Rauhut

    Abstract: In this paper we consider memoryless one-bit compressed sensing with randomly subsampled Gaussian circulant matrices. We show that in a small sparsity regime and for small enough accuracy $δ$, $m\sim δ^{-4} s\log(N/sδ)$ measurements suffice to reconstruct the direction of any $s$-sparse vector up to accuracy $δ$ via an efficient program. We derive this result by proving that partial Gaussian circu… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

    Comments: 20 pages

    MSC Class: 94A20; 60B20

  18. arXiv:1702.06781  [pdf, ps, other

    cs.IT math.FA

    Gelfand numbers related to structured sparsity and Besov space embeddings with small mixed smoothness

    Authors: Sjoerd Dirksen, Tino Ullrich

    Abstract: We consider the problem of determining the asymptotic order of the Gelfand numbers of mixed-(quasi-)norm embeddings $\ell^b_p(\ell^d_q) \hookrightarrow \ell^b_r(\ell^d_u)$ given that $p \leq r$ and $q \leq u$, with emphasis on cases with $p\leq 1$ and/or $q\leq 1$. These cases turn out to be related to structured sparsity. We obtain sharp bounds in a number of interesting parameter constellations.… ▽ More

    Submitted 28 February, 2020; v1 submitted 22 February, 2017; originally announced February 2017.

    Journal ref: Journal of Complexity, Volume 48, October 2018, Pages 69-102

  19. arXiv:1608.06498  [pdf, ps, other

    cs.IT cs.DS

    Fast binary embeddings with Gaussian circulant matrices: improved bounds

    Authors: Sjoerd Dirksen, Alexander Stollenwerk

    Abstract: We consider the problem of encoding a finite set of vectors into a small number of bits while approximately retaining information on the angular distances between the vectors. By deriving improved variance bounds related to binary Gaussian circulant embeddings, we largely fix a gap in the proof of the best known fast binary embedding method. Our bounds also show that well-spreadness assumptions on… ▽ More

    Submitted 26 December, 2017; v1 submitted 23 August, 2016; originally announced August 2016.

    MSC Class: 60B20 (Primary) 68Q87 (Secondary)

  20. arXiv:1504.05073  [pdf, ps, other

    cs.IT math.ST

    On the gap between RIP-properties and sparse recovery conditions

    Authors: Sjoerd Dirksen, Guillaume Lecué, Holger Rauhut

    Abstract: We consider the problem of recovering sparse vectors from underdetermined linear measurements via $\ell_p$-constrained basis pursuit. Previous analyses of this problem based on generalized restricted isometry properties have suggested that two phenomena occur if $p\neq 2$. First, one may need substantially more than $s \log(en/s)$ measurements (optimal for $p=2$) for uniform recovery of all $s$-sp… ▽ More

    Submitted 20 April, 2015; originally announced April 2015.

  21. arXiv:1407.7680  [pdf, ps, other

    cs.IT

    Uniform recovery of fusion frame structured sparse signals

    Authors: Ulaş Ayaz, Sjoerd Dirksen, Holger Rauhut

    Abstract: We consider the problem of recovering fusion frame sparse signals from incomplete measurements. These signals are composed of a small number of nonzero blocks taken from a family of subspaces. First, we show that, by using a-priori knowledge of a coherence parameter associated with the angles between the subspaces, one can uniformly recover fusion frame sparse signals with a significantly reduced… ▽ More

    Submitted 29 July, 2014; originally announced July 2014.

  22. arXiv:1402.3973  [pdf, ps, other

    cs.IT cs.DS stat.ML

    Dimensionality reduction with subgaussian matrices: a unified theory

    Authors: Sjoerd Dirksen

    Abstract: We present a theory for Euclidean dimensionality reduction with subgaussian matrices which unifies several restricted isometry property and Johnson-Lindenstrauss type results obtained earlier for specific data sets. In particular, we recover and, in several cases, improve results for sets of sparse and structured sparse vectors, low-rank matrices and tensors, and smooth manifolds. In addition, we… ▽ More

    Submitted 17 February, 2014; originally announced February 2014.

  23. arXiv:1311.2542  [pdf, ps, other

    cs.DS cs.CG cs.IT math.PR stat.ML

    Toward a unified theory of sparse dimensionality reduction in Euclidean space

    Authors: Jean Bourgain, Sjoerd Dirksen, Jelani Nelson

    Abstract: Let $Φ\in\mathbb{R}^{m\times n}$ be a sparse Johnson-Lindenstrauss transform [KN14] with $s$ non-zeroes per column. For a subset $T$ of the unit sphere, $\varepsilon\in(0,1/2)$ given, we study settings for $m,s$ required to ensure $$ \mathop{\mathbb{E}}_Φ\sup_{x\in T} \left|\|Φx\|_2^2 - 1 \right| < \varepsilon , $$ i.e. so that $Φ$ preserves the norm of every $x\in T$ simultaneously and multiplica… ▽ More

    Submitted 25 August, 2015; v1 submitted 11 November, 2013; originally announced November 2013.

    Journal ref: Geometric and Functional Analysis 25 (2015), no. 4, 1009-1088

  24. arXiv:1309.3522  [pdf, ps, other

    math.PR cs.IT

    Tail bounds via generic chaining

    Authors: Sjoerd Dirksen

    Abstract: We modify Talagrand's generic chaining method to obtain upper bounds for all p-th moments of the supremum of a stochastic process. These bounds lead to an estimate for the upper tail of the supremum with optimal deviation parameters. We apply our procedure to improve and extend some known deviation inequalities for suprema of unbounded empirical processes and chaos processes. As an application we… ▽ More

    Submitted 24 March, 2014; v1 submitted 13 September, 2013; originally announced September 2013.

    Comments: Added detailed proof of Theorem 3.5; Application to dimensionality reduction expanded and moved to separate note arXiv:1402.3973