-
The CLT in high dimensions: quantitative bounds via martingale embedding
Authors:
Ronen Eldan,
Dan Mikulincer,
Alex Zhai
Abstract:
We introduce a new method for obtaining quantitative convergence rates for the central limit theorem (CLT) in a high dimensional setting. Using our method, we obtain several new bounds for convergence in transportation distance and entropy, and in particular: (a) We improve the best known bound, obtained by the third named author, for convergence in quadratic Wasserstein transportation distance for bounded random vectors; (b) We derive the first non-asymptotic convergence rate for the entropic CLT in arbitrary dimension, for general log-concave random vectors; (c) We give an improved bound for convergence in transportation distance under a log-concavity assumption and improvements for both metrics under the assumption of strong log-concavity. Our method is based on martingale embeddings and specifically on the Skorokhod embedding constructed by the first named author.
Submitted 7 September, 2020; v1 submitted 24 June, 2018;
originally announced June 2018.
-
Cutoff for product replacement on finite groups
Authors:
Yuval Peres,
Ryokichi Tanaka,
Alex Zhai
Abstract:
We analyze a Markov chain, known as the product replacement chain, on the set of generating $n$-tuples of a fixed finite group $G$. We show that as $n \rightarrow \infty$, the total-variation mixing time of the chain has a cutoff at time $\frac{3}{2} n \log n$ with window of order $n$. This generalizes a result of Ben-Hamou and Peres (who established the result for $G = \mathbb{Z}/2$) and confirms a conjecture of Diaconis and Saloff-Coste that for an arbitrary but fixed finite group, the mixing time of the product replacement chain is $O(n \log n)$.
Submitted 14 May, 2018;
originally announced May 2018.
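The abstract above does not spell out the transition rule of the product replacement chain. The sketch below assumes the commonly used version (pick an ordered pair $i \ne j$ and replace $x_i$ by $x_i x_j^{\pm 1}$) and specializes it to the cyclic group $\mathbb{Z}/m$ written additively. The names `step`, `generates`, and the values of `m` and `n` are illustrative choices, not taken from the paper.

```python
import random
from functools import reduce
from math import gcd

def generates(state, m):
    """An n-tuple generates Z/m exactly when gcd(x_1, ..., x_n, m) = 1."""
    return reduce(gcd, state, m) == 1

def step(state, m, rng):
    """One move of the (assumed) chain: x_i <- x_i +/- x_j mod m for a random i != j."""
    n = len(state)
    i, j = rng.sample(range(n), 2)      # ordered pair of distinct coordinates
    sign = rng.choice((1, -1))
    new_state = list(state)
    new_state[i] = (state[i] + sign * state[j]) % m
    return tuple(new_state)

rng = random.Random(0)                  # fixed seed so runs are repeatable
m, n = 6, 8                             # illustrative group order and tuple length
state = (1,) + (0,) * (n - 1)           # a generating n-tuple to start from
for _ in range(1000):
    state = step(state, m, rng)
print(state, generates(state, m))
```

Each move is invertible and leaves the generated subgroup unchanged, so the chain stays on the set of generating $n$-tuples; the final `generates` check illustrates this.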
-
Subpolynomial trace reconstruction for random strings and arbitrary deletion probability
Authors:
Nina Holden,
Robin Pemantle,
Yuval Peres,
Alex Zhai
Abstract:
The insertion-deletion channel takes as input a bit string ${\bf x}\in\{0,1\}^{n}$, and outputs a string where bits have been deleted and inserted independently at random. The trace reconstruction problem is to recover $\bf x$ from many independent outputs (called "traces") of the insertion-deletion channel applied to $\bf x$. We show that if $\bf x$ is chosen uniformly at random, then $\exp(O(\log^{1/3} n))$ traces suffice to reconstruct $\bf x$ with high probability. For the deletion channel with deletion probability $q < 1/2$ the earlier upper bound was $\exp(O(\log^{1/2} n))$. The case of $q\geq 1/2$ or the case where insertions are allowed has not been previously analyzed, and therefore the earlier upper bound was as for worst-case strings, i.e., $\exp(O( n^{1/3}))$. We also show that our reconstruction algorithm runs in $n^{1+o(1)}$ time.
A key ingredient in our proof is a delicate two-step alignment procedure where we estimate the location in each trace corresponding to a given bit of $\bf x$. The alignment is done by viewing the strings as random walks and comparing the increments in the walk associated with the input string and the trace, respectively.
Submitted 26 April, 2020; v1 submitted 15 January, 2018;
originally announced January 2018.
-
Average-case reconstruction for the deletion channel: subpolynomially many traces suffice
Authors:
Yuval Peres,
Alex Zhai
Abstract:
The deletion channel takes as input a bit string $\mathbf{x} \in \{0,1\}^n$, and deletes each bit independently with probability $q$, yielding a shorter string. The trace reconstruction problem is to recover an unknown string $\mathbf{x}$ from many independent outputs (called "traces") of the deletion channel applied to $\mathbf{x}$. We show that if $\mathbf{x}$ is drawn uniformly at random and $q < 1/2$, then $e^{O(\log^{1/2} n)}$ traces suffice to reconstruct $\mathbf{x}$ with high probability. The previous best bound, established in 2008 by Holenstein-Mitzenmacher-Panigrahy-Wieder, uses $n^{O(1)}$ traces and only applies for $q$ less than a smaller threshold (it seems that $q < 0.07$ is needed). Our algorithm combines several ideas: 1) an alignment scheme for "greedily" fitting the output of the deletion channel as a subsequence of the input; 2) a version of the idea of "anchoring" used by Holenstein-Mitzenmacher-Panigrahy-Wieder; and 3) complex analysis techniques from recent work of Nazarov-Peres and De-O'Donnell-Servedio.
Submitted 1 August, 2017;
originally announced August 2017.
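To make the deletion channel in the abstract above concrete, here is a minimal sketch that draws a uniformly random input string and produces a few traces. It only simulates the channel; it is not the reconstruction algorithm of the paper. The function names and the values of `n` and `q` are illustrative assumptions.

```python
import random

def deletion_channel(x, q, rng):
    """Return one trace of x: each bit is deleted independently with probability q."""
    return [bit for bit in x if rng.random() >= q]

def is_subsequence(short, long):
    """Check that `short` can be obtained from `long` by deleting entries."""
    it = iter(long)
    return all(any(b == c for c in it) for b in short)

rng = random.Random(0)                       # fixed seed so runs are repeatable
n, q = 20, 0.3                               # illustrative values, not from the paper
x = [rng.randrange(2) for _ in range(n)]     # uniformly random input string
traces = [deletion_channel(x, q, rng) for _ in range(5)]
for trace in traces:
    print("".join(map(str, trace)))
```

Every trace is a subsequence of the input, and the positions of the surviving bits are not revealed; that loss of alignment is what makes reconstruction from traces hard.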
-
Zero Sets for Spaces of Analytic Functions
Authors:
Russell Lyons,
Alex Zhai
Abstract:
We show that under mild conditions, a Gaussian analytic function $\boldsymbol F$ that a.s. does not belong to a given weighted Bergman space or Bargmann-Fock space has the property that a.s. no non-zero function in that space vanishes where $\boldsymbol F$ does. This establishes a conjecture of Shapiro (1979) on Bergman spaces and allows us to resolve a question of Zhu (1993) on Bargmann-Fock spaces. We also give a similar result on the union of two (or more) such zero sets, thereby establishing another conjecture of Shapiro (1979) on Bergman spaces and allowing us to strengthen a result of Zhu (1993) on Bargmann-Fock spaces.
Submitted 29 June, 2018; v1 submitted 10 May, 2017;
originally announced May 2017.
-
Gravitational allocation for uniform points on the sphere
Authors:
Nina Holden,
Yuval Peres,
Alex Zhai
Abstract:
Given a collection $\mathcal L$ of $n$ points on a sphere $\mathbf{S}^2_n$ of surface area $n$, a fair allocation is a partition of the sphere into $n$ parts each of area $1$, and each associated with a distinct point of $\mathcal L$. We show that if the $n$ points are chosen uniformly at random and the partition is defined by considering the gravitational field defined by the $n$ points, then the expected distance between a point on the sphere and the associated point of $\mathcal L$ is $O(\sqrt{\log n})$. We use our result to define a matching between two collections of $n$ independent and uniform points on the sphere, and prove that the expected distance between a pair of matched points is $O(\sqrt{\log n})$, which is optimal by a result of Ajtai, Komlós, and Tusnády.
Submitted 26 February, 2019; v1 submitted 26 April, 2017;
originally announced April 2017.
-
When multiplicative noise stymies control
Authors:
Jian Ding,
Yuval Peres,
Gireeja Ranade,
Alex Zhai
Abstract:
We consider the stabilization of an unstable discrete-time linear system that is observed over a channel corrupted by continuous multiplicative noise. Our main result shows that if the system growth is large enough, then the system cannot be stabilized in a second-moment sense. This is done by showing that the probability that the state magnitude remains bounded must go to zero with time. Our proof technique recursively bounds the conditional density of the system state (instead of focusing on the second moment) to bound the progress the controller can make. This sidesteps the difficulty encountered in using the standard data-rate theorem style approach; that approach does not work because the mutual information per round between the system state and the observation is potentially unbounded.
It was known that a system with multiplicative observation noise can be stabilized using a simple memoryless linear strategy if the system growth is suitably bounded. In this paper, we show that while memory cannot improve the performance of a linear scheme, a simple non-linear scheme that uses one-step memory can do better than the best linear scheme.
Submitted 20 December, 2016; v1 submitted 9 December, 2016;
originally announced December 2016.
-
A high-dimensional CLT in $\mathcal{W}_2$ distance with near optimal convergence rate
Authors:
Alex Zhai
Abstract:
Let $X_1, \ldots , X_n$ be i.i.d. random vectors in $\mathbb{R}^d$ with $\|X_1\| \le β$. Then, we show that $\frac{1}{\sqrt{n}}(X_1 + \ldots + X_n)$ converges to a Gaussian in quadratic transportation (also known as "Kantorovich" or "Wasserstein") distance at a rate of $O\left( \frac{\sqrt{d} β\log n}{\sqrt{n}} \right)$, improving a result of Valiant and Valiant. The main feature of our theorem is that the rate of convergence is within $\log n$ of optimal for $n, d \rightarrow \infty$.
Submitted 23 July, 2017; v1 submitted 17 February, 2016;
originally announced February 2016.
-
Surprise probabilities in Markov chains
Authors:
James Norris,
Yuval Peres,
Alex Zhai
Abstract:
In a Markov chain started at a state $x$, the hitting time $τ(y)$ is the first time that the chain reaches another state $y$. We study the probability $\mathbf{P}_x(τ(y) = t)$ that the first visit to $y$ occurs precisely at a given time $t$. Informally speaking, the event that a new state is visited at a large time $t$ may be considered a "surprise". We prove the following three bounds:
1) In any Markov chain with $n$ states, $\mathbf{P}_x(τ(y) = t) \le \frac{n}{t}$.
2) In a reversible chain with $n$ states, $\mathbf{P}_x(τ(y) = t) \le \frac{\sqrt{2n}}{t}$ for $t \ge 4n + 4$.
3) For random walk on a simple graph with $n \ge 2$ vertices, $\mathbf{P}_x(τ(y) = t) \le \frac{4e \log n}{t}$.
We construct examples showing that these bounds are close to optimal. The main feature of our bounds is that they require very little knowledge of the structure of the Markov chain.
To prove the bound for random walk on graphs, we establish the following estimate conjectured by Aldous, Ding and Oveis-Gharan (private communication): For random walk on an $n$-vertex graph, for every initial vertex $x$,
\[ \sum_y \left( \sup_{t \ge 0} p^t(x, y) \right) = O(\log n). \]
Submitted 4 August, 2014;
originally announced August 2014.
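As a small numerical illustration of bound 1) in the abstract above, the sketch below computes $\mathbf{P}_x(τ(y) = t)$ exactly (up to floating point) for simple random walk on a cycle and compares it with $n/t$. The chain, the states `x` and `y`, and the helper name `first_hit_probabilities` are illustrative assumptions; the method is plain forward propagation with the target state made absorbing, not a technique from the paper.

```python
def first_hit_probabilities(P, x, y, t_max):
    """Return [P_x(tau(y) = t) for t = 1..t_max] for transition matrix P, with x != y."""
    n = len(P)
    mass = [0.0] * n
    mass[x] = 1.0                      # all mass starts at x and has not yet hit y
    probs = []
    for _ in range(t_max):
        new = [sum(mass[i] * P[i][j] for i in range(n)) for j in range(n)]
        probs.append(new[y])           # mass arriving at y for the first time
        new[y] = 0.0                   # discard it so that later visits are not counted
        mass = new
    return probs

n = 6                                  # simple random walk on a 6-cycle
P = [[0.0] * n for _ in range(n)]
for i in range(n):
    P[i][(i + 1) % n] = 0.5
    P[i][(i - 1) % n] = 0.5

probs = first_hit_probabilities(P, x=0, y=3, t_max=50)
for t, p in enumerate(probs, start=1):
    assert p <= n / t + 1e-12          # bound 1): P_x(tau(y) = t) <= n / t
print(probs[:6])
```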
-
Exponential concentration of cover times
Authors:
Alex Zhai
Abstract:
We prove an exponential concentration bound for cover times of general graphs in terms of the Gaussian free field, extending the work of Ding-Lee-Peres and Ding. The estimate is asymptotically sharp as the ratio of hitting time to cover time goes to zero.
The bounds are obtained by showing a stochastic domination in the generalized second Ray-Knight theorem, which was shown to imply exponential concentration of cover times by Ding. This stochastic domination result appeared earlier in a preprint of Lupu, but the connection to cover times was not mentioned.
Submitted 28 July, 2014;
originally announced July 2014.
-
On multiple peaks and moderate deviations for supremum of Gaussian field
Authors:
Jian Ding,
Ronen Eldan,
Alex Zhai
Abstract:
We prove two theorems concerning extreme values of general Gaussian fields. Our first theorem concerns the concept of multiple peaks. A theorem of Chatterjee states that when a centered Gaussian field admits the so-called superconcentration property, it typically attains values near its maximum on multiple near-orthogonal sites, known as multiple peaks. We improve his theorem in two aspects: (i) the number of peaks attained by our bound is of the order $\exp(c / σ^2)$ (as opposed to Chatterjee's polynomial bound in $1/σ$), where $σ$ is the standard deviation of the supremum of the Gaussian field, which is assumed to have variance at most $1$ and (ii) our bound need not assume that the correlations are non-negative. We also prove a similar result based on the superconcentration of the free energy. As primary applications, we infer that for the S-K spin glass model on the $n$-hypercube and directed polymers on $\mathbb{Z}_n^2$, there are polynomially (in $n$) many near-orthogonal sites that achieve values near their respective maxima.
Our second theorem gives an upper bound on moderate deviation for the supremum of a general Gaussian field. While the Gaussian isoperimetric inequality implies a sub-Gaussian concentration bound for the supremum, we show that the exponent in that bound can be improved under the assumption that the expectation of the supremum is of the same order as that of the independent case.
Submitted 24 November, 2013; v1 submitted 21 November, 2013;
originally announced November 2013.
-
Fibonacci-like growth of numerical semigroups of a given genus
Authors:
Alex Zhai
Abstract:
We give an asymptotic estimate of the number of numerical semigroups of a given genus. In particular, if $n_g$ is the number of numerical semigroups of genus $g$, we prove that $n_g$ tends to $S φ^g$, where $φ$ is the golden ratio, and $S$ is a constant, resolving several related conjectures concerning the growth of $n_g$. In addition, we show that the proportion of numerical semigroups of genus $g$ satisfying $f < 3m$ approaches 1 as $g \rightarrow \infty$, where $m$ is the multiplicity and $f$ is the Frobenius number.
Submitted 14 November, 2011;
originally announced November 2011.
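The quantity $n_g$ in the abstract above can be computed for small $g$ by brute force. The sketch below walks the standard tree of numerical semigroups, in which the children of a semigroup are obtained by removing one minimal generator larger than its Frobenius number. This is only an enumeration for small genus, not the asymptotic argument of the paper, and the function names are illustrative.

```python
from math import sqrt

def is_minimal_generator(x, gaps):
    """x (a non-gap, x >= 1) is a minimal generator if it is not a sum of two nonzero elements."""
    return not any(a not in gaps and (x - a) not in gaps for a in range(1, x))

def count_by_genus(max_genus):
    """Return [n_0, ..., n_max_genus], where n_g counts numerical semigroups with g gaps."""
    counts = []
    level = [frozenset()]              # genus 0: all non-negative integers, no gaps
    for g in range(max_genus + 1):
        counts.append(len(level))
        if g == max_genus:
            break
        next_level = []
        for gaps in level:
            frob = max(gaps, default=-1)       # Frobenius number (largest gap)
            mult = 1
            while mult in gaps:                # multiplicity: smallest nonzero element
                mult += 1
            # Every minimal generator above frob lies in this range.
            for x in range(max(frob + 1, 1), frob + mult + 2):
                if is_minimal_generator(x, gaps):
                    next_level.append(gaps | {x})
        level = next_level
    return counts

counts = count_by_genus(10)
print(counts)                          # n_0 .. n_10
phi = (1 + sqrt(5)) / 2
print([round(b / a, 3) for a, b in zip(counts[5:], counts[6:])], "vs phi =", round(phi, 3))
```

At such small genus the successive ratios are still visibly away from the golden ratio; the statement in the abstract concerns the limit $g \rightarrow \infty$.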
-
An asymptotic result concerning a question of Wilf
Authors:
Alex Zhai
Abstract:
Let $Λ$ be a numerical semigroup with embedding dimension $e(Λ)$. Define $c(Λ)$ to be one plus the largest integer not in $Λ$, and define $c'(Λ)$ to be the number of elements in $Λ$ less than $c(Λ)$. It was asked by Wilf whether $\frac{c'(Λ)}{c(Λ)} \ge \frac{1}{e(Λ)}$ always holds. We prove an asymptotic version of this conjecture: we show that for a fixed positive integer $k$ and any $ε > 0$, the inequality $\frac{c'(Λ)}{c(Λ)} \ge \frac{1}{k} - ε$ holds for all but finitely many numerical semigroups $Λ$ satisfying $e(Λ) = k$.
Submitted 11 November, 2011;
originally announced November 2011.
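The three quantities in the abstract above are easy to compute for a concrete semigroup. The sketch below takes a list of positive generators with greatest common divisor 1, computes $c(Λ)$, $c'(Λ)$ and $e(Λ)$, and checks Wilf's inequality for that single example. The function name `wilf_quantities` and the example generators are illustrative assumptions; the membership table is cut off at the product of the smallest and largest generator, which exceeds the largest gap by Schur's bound on the Frobenius number.

```python
from fractions import Fraction
from functools import reduce
from math import gcd

def wilf_quantities(generators):
    """Return (c, c_prime, e) for the numerical semigroup spanned by `generators`."""
    gens = sorted(set(generators))
    if gens[0] < 1 or reduce(gcd, gens) != 1:
        raise ValueError("generators must be positive with gcd 1")
    limit = gens[0] * gens[-1]                 # every integer >= limit lies in the semigroup
    member = [False] * (limit + 1)
    member[0] = True
    for v in range(1, limit + 1):
        member[v] = any(v >= g and member[v - g] for g in gens)
    frobenius = max((v for v in range(limit + 1) if not member[v]), default=-1)
    c = frobenius + 1                          # one plus the largest integer not in the semigroup
    c_prime = sum(member[:c])                  # number of elements below c
    # Embedding dimension: generators that are not a sum of two nonzero elements.
    e = sum(1 for g in gens
            if not any(member[a] and member[g - a] for a in range(1, g)))
    return c, c_prime, e

c, c_prime, e = wilf_quantities([3, 5])
print(c, c_prime, e)                           # 8 4 2
print(Fraction(c_prime, c) >= Fraction(1, e))  # True: Wilf's inequality holds here
```

For the semigroup generated by 3 and 5 the gaps are 1, 2, 4, 7, so the inequality holds with equality.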