Search | arXiv e-print repository

The CLT in high dimensions: quantitative bounds via martingale embedding

Authors: Ronen Eldan, Dan Mikulincer, Alex Zhai

Abstract: We introduce a new method for obtaining quantitative convergence rates for the central limit theorem (CLT) in a high dimensional setting. Using our method, we obtain several new bounds for convergence in transportation distance and entropy, and in particular: (a) We improve the best known bound, obtained by the third named author, for convergence in quadratic Wasserstein transportation distance fo… ▽ More We introduce a new method for obtaining quantitative convergence rates for the central limit theorem (CLT) in a high dimensional setting. Using our method, we obtain several new bounds for convergence in transportation distance and entropy, and in particular: (a) We improve the best known bound, obtained by the third named author, for convergence in quadratic Wasserstein transportation distance for bounded random vectors; (b) We derive the first non-asymptotic convergence rate for the entropic CLT in arbitrary dimension, for general log-concave random vectors; (c) We give an improved bound for convergence in transportation distance under a log-concavity assumption and improvements for both metrics under the assumption of strong log-concavity. Our method is based on martingale embeddings and specifically on the Skorokhod embedding constructed by the first named author. △ Less

Submitted 7 September, 2020; v1 submitted 24 June, 2018; originally announced June 2018.

Comments: 37 pages

arXiv:1805.05025 [pdf, other]

Cutoff for product replacement on finite groups

Authors: Yuval Peres, Ryokichi Tanaka, Alex Zhai

Abstract: We analyze a Markov chain, known as the product replacement chain, on the set of generating $n$-tuples of a fixed finite group $G$. We show that as $n \rightarrow \infty$, the total-variation mixing time of the chain has a cutoff at time $\frac{3}{2} n \log n$ with window of order $n$. This generalizes a result of Ben-Hamou and Peres (who established the result for $G = \mathbb{Z}/2$) and confirms… ▽ More We analyze a Markov chain, known as the product replacement chain, on the set of generating $n$-tuples of a fixed finite group $G$. We show that as $n \rightarrow \infty$, the total-variation mixing time of the chain has a cutoff at time $\frac{3}{2} n \log n$ with window of order $n$. This generalizes a result of Ben-Hamou and Peres (who established the result for $G = \mathbb{Z}/2$) and confirms a conjecture of Diaconis and Saloff-Coste that for an arbitrary but fixed finite group, the mixing time of the product replacement chain is $O(n \log n)$. △ Less

Submitted 14 May, 2018; originally announced May 2018.

Comments: 26 pages, 1 figure

MSC Class: 60J10

arXiv:1801.04783 [pdf, other]

Subpolynomial trace reconstruction for random strings and arbitrary deletion probability

Authors: Nina Holden, Robin Pemantle, Yuval Peres, Alex Zhai

Abstract: The insertion-deletion channel takes as input a bit string ${\bf x}\in\{0,1\}^{n}$, and outputs a string where bits have been deleted and inserted independently at random. The trace reconstruction problem is to recover $\bf x$ from many independent outputs (called "traces") of the insertion-deletion channel applied to $\bf x$. We show that if $\bf x$ is chosen uniformly at random, then… ▽ More The insertion-deletion channel takes as input a bit string ${\bf x}\in\{0,1\}^{n}$, and outputs a string where bits have been deleted and inserted independently at random. The trace reconstruction problem is to recover $\bf x$ from many independent outputs (called "traces") of the insertion-deletion channel applied to $\bf x$. We show that if $\bf x$ is chosen uniformly at random, then $\exp(O(\log^{1/3} n))$ traces suffice to reconstruct $\bf x$ with high probability. For the deletion channel with deletion probability $q < 1/2$ the earlier upper bound was $\exp(O(\log^{1/2} n))$. The case of $q\geq 1/2$ or the case where insertions are allowed has not been previously analyzed, and therefore the earlier upper bound was as for worst-case strings, i.e., $\exp(O( n^{1/3}))$. We also show that our reconstruction algorithm runs in $n^{1+o(1)}$ time. A key ingredient in our proof is a delicate two-step alignment procedure where we estimate the location in each trace corresponding to a given bit of $\bf x$. The alignment is done by viewing the strings as random walks and comparing the increments in the walk associated with the input string and the trace, respectively. △ Less

Submitted 26 April, 2020; v1 submitted 15 January, 2018; originally announced January 2018.

Comments: Analysis of running time added and proof simplified. Alex Zhai added as author. 37 pages, 7 figures

arXiv:1708.00854 [pdf, other]

Average-case reconstruction for the deletion channel: subpolynomially many traces suffice

Authors: Yuval Peres, Alex Zhai

Abstract: The deletion channel takes as input a bit string $\mathbf{x} \in \{0,1\}^n$, and deletes each bit independently with probability $q$, yielding a shorter string. The trace reconstruction problem is to recover an unknown string $\mathbf{x}$ from many independent outputs (called "traces") of the deletion channel applied to $\mathbf{x}$. We show that if $\mathbf{x}$ is drawn uniformly at random and… ▽ More The deletion channel takes as input a bit string $\mathbf{x} \in \{0,1\}^n$, and deletes each bit independently with probability $q$, yielding a shorter string. The trace reconstruction problem is to recover an unknown string $\mathbf{x}$ from many independent outputs (called "traces") of the deletion channel applied to $\mathbf{x}$. We show that if $\mathbf{x}$ is drawn uniformly at random and $q < 1/2$, then $e^{O(\log^{1/2} n)}$ traces suffice to reconstruct $\mathbf{x}$ with high probability. The previous best bound, established in 2008 by Holenstein-Mitzenmacher-Panigrahy-Wieder, uses $n^{O(1)}$ traces and only applies for $q$ less than a smaller threshold (it seems that $q < 0.07$ is needed). Our algorithm combines several ideas: 1) an alignment scheme for "greedily" fitting the output of the deletion channel as a subsequence of the input; 2) a version of the idea of "anchoring" used by Holenstein-Mitzenmacher-Panigrahy-Wieder; and 3) complex analysis techniques from recent work of Nazarov-Peres and De-O'Donnell-Servedio. △ Less

Submitted 1 August, 2017; originally announced August 2017.

Comments: 28 pages, 4 figures

MSC Class: 62B10

arXiv:1705.03914 [pdf, ps, other]

Zero Sets for Spaces of Analytic Functions

Authors: Russell Lyons, Alex Zhai

Abstract: We show that under mild conditions, a Gaussian analytic function $\boldsymbol F$ that a.s. does not belong to a given weighted Bergman space or Bargmann-Fock space has the property that a.s. no non-zero function in that space vanishes where $\boldsymbol F$ does. This establishes a conjecture of Shapiro (1979) on Bergman spaces and allows us to resolve a question of Zhu (1993) on Bargmann-Fock spac… ▽ More We show that under mild conditions, a Gaussian analytic function $\boldsymbol F$ that a.s. does not belong to a given weighted Bergman space or Bargmann-Fock space has the property that a.s. no non-zero function in that space vanishes where $\boldsymbol F$ does. This establishes a conjecture of Shapiro (1979) on Bergman spaces and allows us to resolve a question of Zhu (1993) on Bargmann-Fock spaces. We also give a similar result on the union of two (or more) such zero sets, thereby establishing another conjecture of Shapiro (1979) on Bergman spaces and allowing us to strengthen a result of Zhu (1993) on Bargmann-Fock spaces. △ Less

Submitted 29 June, 2018; v1 submitted 10 May, 2017; originally announced May 2017.

Comments: 17 pp

MSC Class: 30H20; 60G15

Journal ref: Ann. Inst. Fourier 68, no. 6 (2018), 2311--2328

arXiv:1704.08238 [pdf, other]

Gravitational allocation for uniform points on the sphere

Authors: Nina Holden, Yuval Peres, Alex Zhai

Abstract: Given a collection $\mathcal L$ of $n$ points on a sphere $\mathbf{S}^2_n$ of surface area $n$, a fair allocation is a partition of the sphere into $n$ parts each of area $1$, and each associated with a distinct point of $\mathcal L$. We show that if the $n$ points are chosen uniformly at random and the partition is defined by considering the gravitational field defined by the $n$ points, then the… ▽ More Given a collection $\mathcal L$ of $n$ points on a sphere $\mathbf{S}^2_n$ of surface area $n$, a fair allocation is a partition of the sphere into $n$ parts each of area $1$, and each associated with a distinct point of $\mathcal L$. We show that if the $n$ points are chosen uniformly at random and the partition is defined by considering the gravitational field defined by the $n$ points, then the expected distance between a point on the sphere and the associated point of $\mathcal L$ is $O(\sqrt{\log n})$. We use our result to define a matching between two collections of $n$ independent and uniform points on the sphere, and prove that the expected distance between a pair of matched points is $O(\sqrt{\log n})$, which is optimal by a result of Ajtai, Komlós, and Tusnády. △ Less

Submitted 26 February, 2019; v1 submitted 26 April, 2017; originally announced April 2017.

Comments: 26 pages, 5 figures

MSC Class: 60A99

arXiv:1612.03239 [pdf, other]

When multiplicative noise stymies control

Authors: Jian Ding, Yuval Peres, Gireeja Ranade, Alex Zhai

Abstract: We consider the stabilization of an unstable discrete-time linear system that is observed over a channel corrupted by continuous multiplicative noise. Our main result shows that if the system growth is large enough, then the system cannot be stabilized in a second-moment sense. This is done by showing that the probability that the state magnitude remains bounded must go to zero with time. Our proo… ▽ More We consider the stabilization of an unstable discrete-time linear system that is observed over a channel corrupted by continuous multiplicative noise. Our main result shows that if the system growth is large enough, then the system cannot be stabilized in a second-moment sense. This is done by showing that the probability that the state magnitude remains bounded must go to zero with time. Our proof technique recursively bounds the conditional density of the system state (instead of focusing on the second moment) to bound the progress the controller can make. This sidesteps the difficulty encountered in using the standard data-rate theorem style approach; that approach does not work because the mutual information per round between the system state and the observation is potentially unbounded. It was known that a system with multiplicative observation noise can be stabilized using a simple memoryless linear strategy if the system growth is suitably bounded. In this paper, we show that while memory cannot improve the performance of a linear scheme, a simple non-linear scheme that uses one-step memory can do better than the best linear scheme. △ Less

Submitted 20 December, 2016; v1 submitted 9 December, 2016; originally announced December 2016.

arXiv:1602.05565 [pdf, ps, other]

A high-dimensional CLT in $\mathcal{W}_2$ distance with near optimal convergence rate

Authors: Alex Zhai

Abstract: Let $X_1, \ldots , X_n$ be i.i.d. random vectors in $\mathbb{R}^d$ with $\|X_1\| \le β$. Then, we show that $\frac{1}{\sqrt{n}}(X_1 + \ldots + X_n)$ converges to a Gaussian in quadratic transportation (also known as "Kantorovich" or "Wasserstein") distance at a rate of $O\left( \frac{\sqrt{d} β\log n}{\sqrt{n}} \right)$, improving a result of Valiant and Valiant. The main feature of our theorem is… ▽ More Let $X_1, \ldots , X_n$ be i.i.d. random vectors in $\mathbb{R}^d$ with $\|X_1\| \le β$. Then, we show that $\frac{1}{\sqrt{n}}(X_1 + \ldots + X_n)$ converges to a Gaussian in quadratic transportation (also known as "Kantorovich" or "Wasserstein") distance at a rate of $O\left( \frac{\sqrt{d} β\log n}{\sqrt{n}} \right)$, improving a result of Valiant and Valiant. The main feature of our theorem is that the rate of convergence is within $\log n$ of optimal for $n, d \rightarrow \infty$. △ Less

Submitted 23 July, 2017; v1 submitted 17 February, 2016; originally announced February 2016.

Comments: Updated introduction and various minor revisions, to appear in Probability Theory and Related Fields

arXiv:1408.0822 [pdf, ps, other]

Surprise probabilities in Markov chains

Authors: James Norris, Yuval Peres, Alex Zhai

Abstract: In a Markov chain started at a state $x$, the hitting time $τ(y)$ is the first time that the chain reaches another state $y$. We study the probability $\mathbf{P}_x(τ(y) = t)$ that the first visit to $y$ occurs precisely at a given time $t$. Informally speaking, the event that a new state is visited at a large time $t$ may be considered a "surprise". We prove the following three bounds: 1) In an… ▽ More In a Markov chain started at a state $x$, the hitting time $τ(y)$ is the first time that the chain reaches another state $y$. We study the probability $\mathbf{P}_x(τ(y) = t)$ that the first visit to $y$ occurs precisely at a given time $t$. Informally speaking, the event that a new state is visited at a large time $t$ may be considered a "surprise". We prove the following three bounds: 1) In any Markov chain with $n$ states, $\mathbf{P}_x(τ(y) = t) \le \frac{n}{t}$. 2) In a reversible chain with $n$ states, $\mathbf{P}_x(τ(y) = t) \le \frac{\sqrt{2n}}{t}$ for $t \ge 4n + 4$. 3) For random walk on a simple graph with $n \ge 2$ vertices, $\mathbf{P}_x(τ(y) = t) \le \frac{4e \log n}{t}$. We construct examples showing that these bounds are close to optimal. The main feature of our bounds is that they require very little knowledge of the structure of the Markov chain. To prove the bound for random walk on graphs, we establish the following estimate conjectured by Aldous, Ding and Oveis-Gharan (private communication): For random walk on an $n$-vertex graph, for every initial vertex $x$, \[ \sum_y \left( \sup_{t \ge 0} p^t(x, y) \right) = O(\log n). \] △ Less

Submitted 4 August, 2014; originally announced August 2014.

arXiv:1407.7617 [pdf, ps, other]

Exponential concentration of cover times

Authors: Alex Zhai

Abstract: We prove an exponential concentration bound for cover times of general graphs in terms of the Gaussian free field, extending the work of Ding-Lee-Peres and Ding. The estimate is asymptotically sharp as the ratio of hitting time to cover time goes to zero. The bounds are obtained by showing a stochastic domination in the generalized second Ray-Knight theorem, which was shown to imply exponential… ▽ More We prove an exponential concentration bound for cover times of general graphs in terms of the Gaussian free field, extending the work of Ding-Lee-Peres and Ding. The estimate is asymptotically sharp as the ratio of hitting time to cover time goes to zero. The bounds are obtained by showing a stochastic domination in the generalized second Ray-Knight theorem, which was shown to imply exponential concentration of cover times by Ding. This stochastic domination result appeared earlier in a preprint of Lupu, but the connection to cover times was not mentioned. △ Less

Submitted 28 July, 2014; originally announced July 2014.

arXiv:1311.5592 [pdf, ps, other]

On multiple peaks and moderate deviations for supremum of Gaussian field

Authors: Jian Ding, Ronen Eldan, Alex Zhai

Abstract: We prove two theorems concerning extreme values of general Gaussian fields. Our first theorem concerns with the concept of multiple peaks. A theorem of Chatterjee states that when a centered Gaussian field admits the so-called superconcentration property, it typically attains values near its maximum on multiple near-orthogonal sites, known as multiple peaks. We improve his theorem in two aspects:… ▽ More We prove two theorems concerning extreme values of general Gaussian fields. Our first theorem concerns with the concept of multiple peaks. A theorem of Chatterjee states that when a centered Gaussian field admits the so-called superconcentration property, it typically attains values near its maximum on multiple near-orthogonal sites, known as multiple peaks. We improve his theorem in two aspects: (i) the number of peaks attained by our bound is of the order $\exp(c / σ^2)$ (as opposed to Chatterjee's polynomial bound in $1/σ$), where $σ$ is the standard deviation of the supremum of the Gaussian field, which is assumed to have variance at most $1$ and (ii) our bound need not assume that the correlations are non-negative. We also prove a similar result based on the superconcentration of the free energy. As primary applications, we infer that for the S-K spin glass model on the $n$-hypercube and directed polymers on $\mathbb{Z}_n^2$, there are polynomially (in $n$) many near-orthogonal sites that achieve values near their respective maxima. Our second theorem gives an upper bound on moderate deviation for the supremum of a general Gaussian field. While the Gaussian isoperimetric inequality implies a sub-Gaussian concentration bound for the supremum, we show that the exponent in that bound can be improved under the assumption that the expectation of the supremum is of the same order as that of the independent case. △ Less

Submitted 24 November, 2013; v1 submitted 21 November, 2013; originally announced November 2013.

Comments: 25 pages; The title of the paper is revised

MSC Class: 60G15; 60G70

arXiv:1111.3142 [pdf, ps, other]

Fibonacci-like growth of numerical semigroups of a given genus

Authors: Alex Zhai

Abstract: We give an asymptotic estimate of the number of numerical semigroups of a given genus. In particular, if $n_g$ is the number of numerical semigroups of genus $g$, we prove that $n_g$ tends to $S φ^g$, where $φ$ is the golden ratio, and $S$ is a constant, resolving several related conjectures concerning the growth of $n_g$. In addition, we show that the proportion of numerical semigroups of genus… ▽ More We give an asymptotic estimate of the number of numerical semigroups of a given genus. In particular, if $n_g$ is the number of numerical semigroups of genus $g$, we prove that $n_g$ tends to $S φ^g$, where $φ$ is the golden ratio, and $S$ is a constant, resolving several related conjectures concerning the growth of $n_g$. In addition, we show that the proportion of numerical semigroups of genus $g$ satisfying $f < 3m$ approaches 1 as $g \rightarrow \infty$, where $m$ is the multiplicity and $f$ is the Frobenius number. △ Less

Submitted 14 November, 2011; originally announced November 2011.

Comments: 30 pages

arXiv:1111.2779 [pdf, ps, other]

An asymptotic result concerning a question of Wilf

Authors: Alex Zhai

Abstract: Let $Λ$ be a numerical semigroup with embedding dimension $e(Λ)$. Define $c(Λ)$ to be one plus the largest integer not in $Λ$, and define $c'(Λ)$ to be the number of elements in $Λ$ less than $c(Λ)$. It was asked by Wilf whether $\frac{c'(Λ)}{c(Λ)} \ge \frac{1}{e(Λ)}$ always holds. We prove an asymptotic version of this conjecture: we show that for a fixed positive integer $k$ and any $ε> 0$, the… ▽ More Let $Λ$ be a numerical semigroup with embedding dimension $e(Λ)$. Define $c(Λ)$ to be one plus the largest integer not in $Λ$, and define $c'(Λ)$ to be the number of elements in $Λ$ less than $c(Λ)$. It was asked by Wilf whether $\frac{c'(Λ)}{c(Λ)} \ge \frac{1}{e(Λ)}$ always holds. We prove an asymptotic version of this conjecture: we show that for a fixed positive integer $k$ and any $ε> 0$, the inequality $\frac{c'(Λ)}{c(Λ)} \ge \frac{1}{k} - ε$ holds for all but finitely many numerical semigroups $Λ$ satisfying $e(Λ) = k$. △ Less

Submitted 11 November, 2011; originally announced November 2011.

Comments: 9 pages, submitted to Semigroup Forum

Showing 1–13 of 13 results for author: Zhai, A