-
Time-Biased Random Walks and Robustness of Expanders
Authors:
Sam Olesker-Taylor,
Thomas Sauerwald,
John Sylvester
Abstract:
Random walks on expanders play a crucial role in Markov Chain Monte Carlo algorithms, derandomization, graph theory, and distributed computing. A desirable property is that they are rapidly mixing, which is equivalent to having a spectral gap $γ$ (asymptotically) bounded away from $0$. Our work has two main strands.
First, we establish a dichotomy for the robustness of mixing times on edge-weighted $d$-regular graphs (i.e., reversible Markov chains) subject to a Lipschitz condition, which bounds the ratio of adjacent weights by $β\geq 1$. If $β\ge 1$ is sufficiently small, then $γ\asymp 1$ and the mixing time is logarithmic in $n$. On the other hand, if $β\geq 2d$, there is an edge-weighting such that $γ$ is polynomially small in $1/n$.
Second, we apply our robustness result to a time-dependent version of the so-called $\varepsilon$-biased random walk, as introduced in Azar et al. [Combinatorica 1996]. We show that, for any constant $\varepsilon>0$, a bias strategy can be chosen adaptively so that the $\varepsilon$-biased random walk covers any bounded-degree regular expander in $Θ(n)$ expected time, improving the previous-best bound of $O(n \log \log n)$. We prove the first non-trivial lower bound on the cover time of the $\varepsilon$-biased random walk, showing that, on bounded-degree regular expanders, it is $ω(n)$ whenever $\varepsilon = o(1)$. We establish this by controlling how much the probability of arbitrary events can be ``boosted'' by using a time-dependent bias strategy.
Submitted 17 December, 2024;
originally announced December 2024.
-
Rumors with Changing Credibility
Authors:
Charlotte Out,
Nicolás Rivera,
Thomas Sauerwald,
John Sylvester
Abstract:
Randomized rumor spreading processes diffuse information on an undirected graph and have been widely studied. In this work, we present a generic framework for analyzing a broad class of such processes on regular graphs. Our analysis is protocol-agnostic, as it only requires the expected proportion of newly informed vertices in each round to be bounded, and a natural negative correlation property.
This framework allows us to analyze various protocols, including PUSH, PULL, and PUSH-PULL, thereby extending prior research. Unlike previous work, our framework accommodates message failures at any time $t\geq 0$ with a probability of $1-q(t)$, where the credibility $q(t)$ is any function of time. This enables us to model real-world scenarios in which the transmissibility of rumors may fluctuate, as seen in the spread of ``fake news'' and viruses. Additionally, our framework is sufficiently broad to cover dynamic graphs.
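To make the role of the credibility function concrete, here is a minimal sketch of PUSH in which every transmission in round $t$ succeeds with probability $q(t)$. The name `push_rounds`, the adjacency-list input and the round cap are assumptions made for illustration and are not taken from the paper.

```python
import random

def push_rounds(adj, q, source=0, max_rounds=10_000, seed=0):
    """Simulate PUSH with message failures (illustrative sketch).

    adj: list of neighbour lists; q: function t -> success probability.
    Returns (rounds elapsed, number of informed vertices).
    """
    rng = random.Random(seed)
    informed = {source}
    t = 0
    while len(informed) < len(adj) and t < max_rounds:
        newly = set()
        for u in informed:
            v = rng.choice(adj[u])      # u pushes to a uniform random neighbour
            if rng.random() < q(t):     # the message survives with probability q(t)
                newly.add(v)
        informed |= newly
        t += 1
    return t, len(informed)
```

With a constant `q` this reduces to PUSH with a fixed failure rate; a decaying `q` models a rumor that loses credibility over time.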
Submitted 28 November, 2023;
originally announced November 2023.
-
An Improved Drift Theorem for Balanced Allocations
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
In the balanced allocations framework, there are $m$ jobs (balls) to be allocated to $n$ servers (bins). The goal is to minimize the gap, the difference between the maximum and the average load.
Peres, Talwar and Wieder (RSA 2015) used the hyperbolic cosine potential function to analyze a large family of allocation processes including the $(1+β)$-process and graphical balanced allocations. The key ingredient was to prove that the potential drops in every step, i.e., a drift inequality.
In this work we improve the drift inequality so that (i) it is asymptotically tighter, (ii) it assumes weaker preconditions, (iii) it also applies to processes allocating to more than one bin in a single step, and (iv) it applies to processes allocating a varying number of balls depending on the sampled bin. Our applications include the processes of (RSA 2015), but also several new processes, and we believe that our techniques may lead to further results in future work.
Submitted 21 August, 2023;
originally announced August 2023.
-
Mean-Biased Processes for Balanced Allocations
Authors:
Dimitrios Los,
Thomas Sauerwald,
John Sylvester
Abstract:
We introduce a new class of balanced allocation processes which bias towards underloaded bins (those with load below the mean load) either by skewing the probability by which a bin is chosen for an allocation (probability bias), or alternatively, by adding more balls to an underloaded bin (weight bias). A prototypical process satisfying the probability bias condition is Mean-Thinning: At each round, we sample one bin and if it is underloaded, we allocate one ball; otherwise, we allocate one ball to a second bin sample. Versions of this process have been in use since at least 1986. An example of a process, introduced by us, which satisfies the weight bias condition is Twinning: At each round, we only sample one bin. If the bin is underloaded, then we allocate two balls; otherwise, we allocate only one ball.
Our main result is that for any process with a probability or weight bias, with high probability the gap between maximum and minimum load is logarithmic in the number of bins. This result holds for any number of allocated balls (heavily loaded case), covers many natural processes that relax the Two-Choice process, and we also prove it is tight for many such processes, including Mean-Thinning and Twinning.
Our analysis employs a delicate interplay between linear, quadratic and exponential potential functions. It also hinges on a phenomenon we call "mean quantile stabilization", which holds in greater generality than our framework and may be of independent interest.
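The two prototypical processes described above can be sketched as follows. This is a simulation under stated assumptions: "underloaded" is taken to mean strictly below the current mean load, Twinning may overshoot `m` by one ball, and the function names are illustrative rather than the authors'.

```python
import random

def mean_thinning(n, m, rng):
    """Probability bias: keep the first sample only if it is underloaded."""
    loads = [0] * n
    for t in range(m):                 # t balls have been allocated so far
        i = rng.randrange(n)
        if loads[i] * n >= t:          # not below the mean t/n: take a second sample
            i = rng.randrange(n)
        loads[i] += 1
    return loads

def twinning(n, m, rng):
    """Weight bias: an underloaded sample receives two balls, otherwise one."""
    loads = [0] * n
    placed = 0
    while placed < m:
        i = rng.randrange(n)
        k = 2 if loads[i] * n < placed else 1
        loads[i] += k
        placed += k
    return loads

def max_min_gap(loads):
    """Gap between maximum and minimum load, as in the main result."""
    return max(loads) - min(loads)
```

Running both for `m` much larger than `n` and printing `max_min_gap` gives a quick empirical feel for the logarithmic gap stated in the abstract.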
Submitted 10 January, 2024; v1 submitted 9 August, 2023;
originally announced August 2023.
-
Balanced Allocations in Batches: The Tower of Two Choices
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
In balanced allocations, the goal is to place $m$ balls into $n$ bins, so as to minimize the gap (difference of max to average load). The One-Choice process places each ball to a bin sampled independently and uniformly at random. The Two-Choice process places balls in the least loaded of two sampled bins. Finally, the $(1+β)$-process mixes these processes, meaning each ball is allocated using Two-Choice with probability $β\in(0,1)$, and using One-Choice otherwise.
Despite Two-Choice being optimal in the sequential setting, it has been observed in practice that it does not perform well in a parallel environment, where load information may be outdated. Following [BCEFN12], we study such a parallel setting where balls are allocated in batches of size $b$, and balls within the same batch are allocated with the same strategy and based on the same load information.
For small batch sizes $b\in[n,n\log n]$, it was shown in [LS22a] that Two-Choice achieves an asymptotically optimal gap among all processes with a constant number of samples.
In this work, we focus on larger batch sizes $b\in[n\log n,n^3]$. It was proved in [LS22c] that Two-Choice leads to a gap of $Θ(b/n)$. As our main result, we prove that the gap reduces to $O(\sqrt{(b/n)\cdot\log n})$, if one runs the $(1+β)$-process with an appropriately chosen $β$ (in fact this result holds for a larger class of processes). This not only proves the phenomenon that Two-Choice is not the best (leading to the formation of "towers" over previously light bins), but also that mixing two processes (One-Choice and Two-Choice) leads to a process which achieves a gap that is asymptotically smaller than both. We also derive a matching lower bound of $Ω(\sqrt{(b/n)\cdot\log n})$ for any allocation process, which demonstrates that the above $(1+β)$-process is asymptotically optimal.
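The batched setting can be sketched as below: all balls of a batch see the load vector from the start of that batch, and each ball uses Two-Choice with probability $β$ and One-Choice otherwise. The function name and the tie-breaking rule are assumptions for illustration; choosing a good $β$ as a function of $b$ and $n$ is the subject of the paper and is not attempted here.

```python
import random

def batched_one_plus_beta(n, m, b, beta, rng):
    """(1+beta)-process with batches of size b using stale load information."""
    loads = [0] * n
    snapshot = list(loads)
    for t in range(m):
        if t % b == 0:
            snapshot = list(loads)     # loads are only reported at batch boundaries
        i = rng.randrange(n)
        if rng.random() < beta:        # Two-Choice step, decided on the stale snapshot
            j = rng.randrange(n)
            if snapshot[j] < snapshot[i]:
                i = j
        loads[i] += 1                  # otherwise a plain One-Choice step
    return loads
```

Setting `beta=1.0` gives batched Two-Choice and `beta=0.0` gives One-Choice, so the three processes can be compared with the same code.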
Submitted 21 April, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
Balanced Allocations with Heterogeneous Bins: The Power of Memory
Authors:
Dimitrios Los,
Thomas Sauerwald,
John Sylvester
Abstract:
We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the standard Two-Choice process, at each step $t=1,2,\ldots,m$ we first sample two bins uniformly at random and place a ball in the least loaded bin. It is well-known that for any $m \geq n$, this results in a gap (difference between the maximum and average load) of $\log_2 \log n + Θ(1)$ (with high probability). In this work, we consider the Memory process [Mitzenmacher, Prabhakar and Shah 2002] where instead of two choices, we only sample one bin per step but we have access to a cache which can store the location of one bin. Mitzenmacher, Prabhakar and Shah showed that in the lightly loaded case ($m = n$), the Memory process achieves a gap of $\mathcal{O}(\log \log n)$.
Extending the setting of Mitzenmacher et al. in two ways, we first allow the number of balls $m$ to be arbitrary, which includes the challenging heavily loaded case where $m \geq n$. Secondly, we follow the heterogeneous bins model of Wieder [Wieder 2007], where the sampling distribution of bins can be biased up to some arbitrary multiplicative constant. Somewhat surprisingly, we prove that even in this setting, the Memory process still achieves an $\mathcal{O}(\log \log n)$ gap bound. This is in stark contrast with the Two-Choice (or any $d$-Choice with $d=\mathcal{O}(1)$) process, where it is known that the gap diverges as $m \rightarrow \infty$ [Wieder 2007]. Further, we show that for any sampling distribution independent of $m$ (but possibly dependent on $n$) the Memory process has a gap that can be bounded independently of $m$. Finally, we prove a tight gap bound of $\mathcal{O}(\log n)$ for Memory in another relaxed setting with heterogeneous (weighted) balls and a cache which can only be maintained for two steps.
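A minimal sketch of a Memory-style process with a possibly biased sampling distribution follows. The update rule used here (place the ball in the lighter of the sampled and cached bin, then cache the bin that received it) is an assumption based on the informal description; the paper's precise definition may differ, and the names are hypothetical.

```python
import random

def memory_process(n, m, rng, probs=None):
    """One sample per step plus a one-bin cache (illustrative sketch).

    probs: optional sampling weights for heterogeneous bins (uniform if None).
    """
    loads = [0] * n
    bins = range(n)
    cache = rng.randrange(n)           # arbitrary initial cache content
    for _ in range(m):
        if probs is None:
            i = rng.randrange(n)
        else:
            i = rng.choices(bins, weights=probs)[0]
        if loads[i] < loads[cache]:    # the sample beats the cached bin
            cache = i
        loads[cache] += 1              # the ball goes to the bin now in the cache
    return loads
```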
Submitted 23 January, 2023;
originally announced January 2023.
-
Balanced Allocations with the Choice of Noise
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the standard Two-Choice process, at each step $t=1,2,\ldots,m$ we first sample two randomly chosen bins, compare their two loads and then place a ball in the least loaded bin. It is well-known that for any $m\geq n$, this results in a gap (difference between the maximum and average load) of $\log_2\log n+Θ(1)$ (with high probability).
In this work, we consider Two-Choice in different settings with noisy load comparisons. One key setting involves an adaptive adversary whose power is limited by some threshold $g\in\mathbb{N}$. In each step, such an adversary can determine the result of any load comparison between two bins whose loads differ by at most $g$, while if the load difference is greater than $g$, the comparison is correct.
For this adversarial setting, we first prove that for any $m \geq n$ the gap is $O(g+\log n)$ with high probability. Then through a refined analysis we prove that if $g\leq\log n$, then for any $m \geq n$ the gap is $O(\frac{g}{\log g}\cdot\log\log n)$. For constant values of $g$, this generalizes the heavily loaded analysis of [BCSV06, TW14] for the Two-Choice process, and establishes that asymptotically the same gap bound holds even if load comparisons among "similarly loaded" bins are wrong. Finally, we complement these upper bounds with tight lower bounds, which establish an interesting phase transition on how the parameter $g$ impacts the gap.
The analysis also applies to settings with outdated and delayed information. For example, for the setting of [BCEFN12] where balls are allocated in consecutive batches of size $b=n$, we present an improved and tight gap bound of $Θ(\frac{\log n}{\log\log n})$. This bound also extends for a range of values of $b$ and applies to a relaxed setting where the reported load of a bin can be any load value from the last $b$ steps.
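The adversarial comparison model can be illustrated with one fixed adversary. In this sketch the adversary always sends the ball to the heavier bin whenever the two loads differ by at most $g$; this greedy choice is an assumption for illustration and is only one of many strategies an adaptive adversary could use.

```python
import random

def two_choice_with_adversary(n, m, g, rng):
    """Two-Choice where comparisons within load difference g are adversarial."""
    loads = [0] * n
    for _ in range(m):
        i, j = rng.randrange(n), rng.randrange(n)
        lo, hi = (i, j) if loads[i] <= loads[j] else (j, i)
        if loads[hi] - loads[lo] <= g:
            target = hi                # adversary decides: pick the heavier bin
        else:
            target = lo                # comparison is correct: pick the lighter bin
        loads[target] += 1
    return loads
```

Varying `g` between `0` and roughly `log n` and recording `max(loads) - m / n` gives an empirical view of how the gap depends on the adversary's threshold.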
Submitted 14 January, 2023; v1 submitted 15 June, 2022;
originally announced June 2022.
-
The Power of Filling in Balanced Allocations
Authors:
Dimitrios Los,
Thomas Sauerwald,
John Sylvester
Abstract:
We introduce a new class of balanced allocation processes which are primarily characterized by ``filling'' underloaded bins. A prototypical example is the Packing process: At each round we only take one bin sample; if the load is below the average load, then we place balls until the average load is reached; otherwise, we place only one ball. We prove that for any process in this class the gap between the maximum and average load is $\mathcal{O}(\log n)$ w.h.p. for any number of balls $m\geq 1$. For the Packing process, we also provide a matching lower bound. Additionally, we prove that the Packing process is sample-efficient in the sense that the expected number of balls allocated per sample is strictly greater than one. Finally, we also demonstrate that the upper bound of $\mathcal{O}(\log n)$ on the gap can be extended to the Memory process studied by Mitzenmacher, Prabhakar and Shah (2002).
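The Packing process can be sketched as follows. Filling an underloaded bin up to the ceiling of the current average is an assumption about how "until the average load is reached" is rounded, and the function name is illustrative. The sketch also counts samples, which relates to the sample-efficiency claim.

```python
import random

def packing(n, m, rng):
    """Packing sketch: returns (loads, number of bin samples used)."""
    loads = [0] * n
    placed = 0
    samples = 0
    while placed < m:
        i = rng.randrange(n)
        samples += 1
        if loads[i] * n < placed:          # strictly below the average placed/n
            target = -(-placed // n)       # ceiling of the average load
            k = target - loads[i]          # at least 1, since loads[i] < average
        else:
            k = 1
        loads[i] += k
        placed += k
    return loads, samples
```

The ratio `sum(loads) / samples` is the empirical number of balls allocated per sample.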
Submitted 28 August, 2023; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Balanced Allocations in Batches: Simplified and Generalized
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
We consider the allocation of $m$ balls (jobs) into $n$ bins (servers). In the Two-Choice process, for each of $m$ sequentially arriving balls, two randomly chosen bins are sampled and the ball is placed in the least loaded bin. It is well-known that the maximum load is $m/n+\log_2 \log n + O(1)$ w.h.p.
Berenbrink, Czumaj, Englert, Friedetzky and Nagel (2012) introduced a parallel version of this process, where $m$ balls arrive in consecutive batches of size $b=n$ each. Balls within the same batch are allocated in parallel, using the load information of the bins at the beginning of the batch. They proved that the gap of this process is $O(\log n)$ with high probability.
In this work, we present a new analysis of this setting, which is based on exponential potential functions. This allows us to both simplify and generalize the analysis of [BCE12] in different ways:
$\quad 1.$ Our analysis covers a broad class of processes. This includes not only Two-Choice, but also processes with fewer bin samples like $(1+β)$, processes which can only receive one bit of information from each bin sample and graphical allocation, where bins correspond to vertices in a graph.
$\quad 2.$ Balls may be of different weights, as long as their weights are independent samples from a distribution satisfying a technical condition on its moment generating function.
$\quad 3.$ For arbitrary batch sizes $b \geq n$, we prove a gap of $O(b/n \cdot \log n)$. For any $b \in [n , n^3]$, we improve this to $O(b/n + \log n)$ and show that it is tight for a family of processes. This implies the unexpected result that for e.g. $(1+β)$ with constant $β\in (0, 1]$, the gap is $Θ(\log n)$ for all $b \in [n,n \log n]$.
We also conduct experiments which support our theoretical results, and even hint at a superiority of less powerful processes like $(1+β)$ for large batch sizes.
Submitted 25 March, 2022;
originally announced March 2022.
-
Tight Bounds for Repeated Balls-into-Bins
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
We study the repeated balls-into-bins process introduced by Becchetti, Clementi, Natale, Pasquale and Posta (2019). This process starts with $m$ balls arbitrarily distributed across $n$ bins. At each round $t=1,2,\ldots$, one ball is selected from each non-empty bin, and then placed into a bin chosen independently and uniformly at random. We prove the following results:
$\quad \bullet$ For any $n \leq m \leq \mathrm{poly}(n)$, we prove a lower bound of $Ω(m/n \cdot \log n)$ on the maximum load. For the special case $m=n$, this matches the upper bound of $O(\log n)$, as shown in [BCNPP19]. It also provides a positive answer to the conjecture in [BCNPP19] that for $m=n$ the maximum load is $ω(\log n/ \log \log n)$ at least once in a polynomially large time interval. For $m\in [ω(n),n\log n]$, our new lower bound disproves the conjecture in [BCNPP19] that the maximum load remains $O(\log n)$.
$\quad \bullet$ For any $n\leq m\leq\mathrm{poly}(n)$, we prove an upper bound of $O(m/n\cdot\log n)$ on the maximum load for all steps of a polynomially large time interval. This matches our lower bound up to multiplicative constants.
$\quad \bullet$ For any $m\geq n$, our analysis also implies an $O(m^2/n)$ waiting time to reach a configuration with a $O(m/n\cdot\log m)$ maximum load, even for worst-case initial distributions.
$\quad \bullet$ For any $m \geq n$, we show that every ball visits every bin in $O(m\log m)$ rounds. For $m = n$, this improves the previous upper bound of $O(n \log^2 n)$ in [BCNPP19]. We also prove that the upper bound is tight up to multiplicative constants for any $n \leq m \leq \mathrm{poly}(n)$.
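One round of the process described above can be sketched as follows. The function names are illustrative, and the simulation only tracks loads, not individual balls, so it is suited to observing the maximum load rather than the traversal time of a ball.

```python
import random

def rbb_round(loads, rng):
    """One round of repeated balls-into-bins, updating loads in place."""
    n = len(loads)
    senders = [i for i, x in enumerate(loads) if x > 0]
    for i in senders:
        loads[i] -= 1                  # each non-empty bin releases one ball
    for _ in senders:
        loads[rng.randrange(n)] += 1   # each released ball picks a uniform bin

def max_load_over_time(loads, rounds, rng):
    """Largest maximum load seen during the given number of rounds."""
    worst = max(loads)
    for _ in range(rounds):
        rbb_round(loads, rng)
        worst = max(worst, max(loads))
    return worst
```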
Submitted 30 January, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Balanced Allocations: Caching and Packing, Twinning and Thinning
Authors:
Dimitrios Los,
Thomas Sauerwald,
John Sylvester
Abstract:
We consider the sequential allocation of $m$ balls (jobs) into $n$ bins (servers) by allowing each ball to choose from some bins sampled uniformly at random. The goal is to maintain a small gap between the maximum load and the average load. In this paper, we present a general framework that allows us to analyze various allocation processes that slightly prefer allocating into underloaded, as opposed to overloaded bins. Our analysis covers several natural instances of processes, including:
$\quad \bullet$ The Caching process (a.k.a. memory protocol) as studied by Mitzenmacher, Prabhakar and Shah (2002): At each round we only take one bin sample, but we also have access to a cache in which the most recently used bin is stored. We place the ball into the least loaded of the two.
$\quad \bullet$ The Packing process: At each round we only take one bin sample. If the load is below some threshold (e.g., the average load), then we place balls until the threshold is reached; otherwise, we place only one ball.
$\quad \bullet$ The Twinning process: At each round, we only take one bin sample. If the load is below some threshold, then we place two balls; otherwise, we place only one ball.
$\quad \bullet$ The Thinning process as recently studied by Feldheim and Gurel-Gurevich (2021): At each round, we first take one bin sample. If its load is below some threshold, we place one ball; otherwise, we place one ball into a $\textit{second}$ bin sample.
As we demonstrate, our general framework implies for all these processes a gap of $\mathcal{O}(\log n)$ between the maximum load and average load, even when an arbitrary number of balls $m \geq n$ are allocated (heavily loaded case). Our analysis is inspired by a previous work of Peres, Talwar and Wieder (2010) for the $(1+β)$-process, however here we rely on the interplay between different potential functions to prove stabilization.
Submitted 10 August, 2023; v1 submitted 20 October, 2021;
originally announced October 2021.
-
Balanced Allocations with Incomplete Information: The Power of Two Queries
Authors:
Dimitrios Los,
Thomas Sauerwald
Abstract:
We consider the allocation of $m$ balls into $n$ bins with incomplete information. In the classical Two-Choice process a ball first queries the load of two randomly chosen bins and is then placed in the least loaded bin. In our setting, each ball also samples two random bins but can only estimate a bin's load by sending binary queries of the form "Is the load at least the median?" or "Is the load at least 100?".
For the lightly loaded case $m=O(n)$, Feldheim and Gurel-Gurevich (2021) showed that with one query it is possible to achieve a maximum load of $O(\sqrt{\log n/\log \log n})$, and posed the question whether a maximum load of $m/n+O(\sqrt{\log n/\log \log n})$ is possible for any $m = Ω(n)$. In this work, we resolve this open problem by proving a lower bound of $m/n+Ω( \sqrt{\log n})$ for a fixed $m=Θ(n \sqrt{\log n})$, and a lower bound of $m/n+Ω(\log n/\log \log n)$ for some $m$ depending on the used strategy.
We complement this negative result by proving a positive result for multiple queries. In particular, we show that with only two binary queries per chosen bin, there is an oblivious strategy which ensures a maximum load of $m/n+O(\sqrt{\log n})$ for any $m \geq 1$. Further, for any number of $k = O(\log \log n)$ binary queries, the upper bound on the maximum load improves to $m/n + O(k(\log n)^{1/k})$ for any $m \geq 1$.
Further, this result for $k$ queries implies (i) new bounds for the $(1+β)$-process introduced by Peres et al. (2015), (ii) new bounds for the graphical balanced allocation process on dense expander graphs, and (iii) the bound of $m/n+O(\log \log n)$ on the maximum load achieved by the Two-Choice process, including the heavily loaded case $m=Ω(n)$ derived by Berenbrink et al. (2006). One novel aspect of our proofs is the use of multiple super-exponential potential functions, which might be of use in future work.
Submitted 26 January, 2022; v1 submitted 8 July, 2021;
originally announced July 2021.
-
Multiple Random Walks on Graphs: Mixing Few to Cover Many
Authors:
Nicolás Rivera,
Thomas Sauerwald,
John Sylvester
Abstract:
Random walks on graphs are an essential primitive for many randomised algorithms and stochastic processes. It is natural to ask how much can be gained by running $k$ multiple random walks independently and in parallel. Although the cover time of multiple walks has been investigated for many natural networks, the problem of finding a general characterisation of multiple cover times for worst-case start vertices (posed by Alon, Avin, Koucký, Kozma, Lotker, and Tuttle in 2008) remains an open problem.
First, we improve and tighten various bounds on the stationary cover time when $k$ random walks start from vertices sampled from the stationary distribution. For example, we prove an unconditional lower bound of $Ω((n/k) \log n)$ on the stationary cover time, holding for any $n$-vertex graph $G$ and any $1 \leq k =o(n\log n )$. Secondly, we establish the stationary cover times of multiple walks on several fundamental networks up to constant factors. Thirdly, we present a framework characterising worst-case cover times in terms of stationary cover times and a novel, relaxed notion of mixing time for multiple walks called the partial mixing time. Roughly speaking, the partial mixing time only requires a specific portion of all random walks to be mixed. Using these new concepts, we can establish (or recover) the worst-case cover times for many networks including expanders, preferential attachment graphs, grids, binary trees and hypercubes.
Submitted 29 December, 2022; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Random walks on randomly evolving graphs
Authors:
Leran Cai,
Thomas Sauerwald,
Luca Zanetti
Abstract:
A random walk is a basic stochastic process on graphs and a key primitive in the design of distributed algorithms. One of the most important features of random walks is that, under mild conditions, they converge to a stationary distribution in time that is at most polynomial in the size of the graph. This fundamental property, however, only holds if the graph does not change over time, while on the other hand many distributed networks are inherently dynamic, and their topology is subjected to potentially drastic changes.
In this work we study the mixing (i.e., converging) properties of random walks on graphs subjected to random changes over time. Specifically, we consider the edge-Markovian random graph model: for each edge slot, there is a two-state Markov chain with transition probabilities $p$ (add a non-existing edge) and $q$ (remove an existing edge). We derive several positive and negative results that depend on both the density of the graph and the speed by which the graph changes. We show that if $p$ is very small (i.e., below the connectivity threshold of Erdős-Rényi random graphs), random walks do not mix (fast). When $p$ is larger, instead, we observe the following behavior: if the graph changes slowly over time (i.e., $q$ is small), random walks enjoy strong mixing properties that are comparable to the ones possessed by random walks on static graphs; however, if the graph changes too fast (i.e., $q$ is large), only coarse mixing properties are preserved.
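The edge-Markovian model and a walk on it can be sketched as below. Starting from the empty graph, updating the graph before each walk step, and letting the walk stay put at an isolated vertex are all assumptions made for illustration; the function names are hypothetical.

```python
import random

def edge_markovian_step(edges, n, p, q, rng):
    """Advance every edge slot (u, v), u < v, by one step of its two-state chain."""
    new_edges = set()
    for u in range(n):
        for v in range(u + 1, n):
            if (u, v) in edges:
                if rng.random() >= q:      # an existing edge survives w.p. 1 - q
                    new_edges.add((u, v))
            elif rng.random() < p:         # a missing edge appears w.p. p
                new_edges.add((u, v))
    return new_edges

def walk_on_evolving_graph(n, p, q, steps, rng, start=0):
    """Random walk that moves on the current graph after each graph update."""
    edges, pos = set(), start
    for _ in range(steps):
        edges = edge_markovian_step(edges, n, p, q, rng)
        nbrs = [v for v in range(n) if (min(pos, v), max(pos, v)) in edges]
        if nbrs:                           # stay put if the vertex is isolated
            pos = rng.choice(nbrs)
    return pos
```

Comparing the empirical distribution of the end position for small and large `q` is one way to observe the difference between slowly and quickly changing graphs.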
Submitted 18 June, 2020;
originally announced June 2020.
-
Time Dependent Biased Random Walks
Authors:
John Haslegrave,
Thomas Sauerwald,
John Sylvester
Abstract:
We study the biased random walk where at each step of a random walk a "controller" can, with a certain small probability, move the walk to an arbitrary neighbour. This model was introduced by Azar et al. [STOC'1992]; we extend their work to the time dependent setting and consider cover times of this walk. We obtain new bounds on the cover and hitting times. Azar et al. conjectured that the controller can increase the stationary probability of a vertex from $p$ to $p^{1-ε}$; while this conjecture is not true in full generality, we propose a best-possible amended version of this conjecture and confirm it for a broad class of graphs.
We also consider the problem of computing an optimal strategy for the controller to minimise the cover time and show that for directed graphs determining the cover time is PSPACE-complete.
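The time-dependent biased walk can be sketched as follows: with probability $ε$ the controller picks the next vertex using a strategy that may depend on the time and the history, and otherwise the walk moves to a uniform random neighbour. The strategy interface and the simple `prefer_unvisited` heuristic are assumptions for illustration and are not the optimal strategies studied in the paper.

```python
import random

def eps_biased_cover_time(adj, eps, strategy, rng, start=0, max_steps=10**6):
    """Steps until all vertices are visited (capped at max_steps)."""
    visited, pos, t = {start}, start, 0
    while len(visited) < len(adj) and t < max_steps:
        if rng.random() < eps:
            pos = strategy(t, pos, visited, adj[pos])   # controller's move
        else:
            pos = rng.choice(adj[pos])                  # ordinary random-walk move
        visited.add(pos)
        t += 1
    return t

def prefer_unvisited(t, pos, visited, nbrs):
    """Heuristic controller: first unvisited neighbour, else the first neighbour."""
    for v in nbrs:
        if v not in visited:
            return v
    return nbrs[0]
```

On a cycle with `eps=1.0` the controller simply walks around once, which gives a quick sanity check of the code.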
Submitted 4 August, 2021; v1 submitted 3 June, 2020;
originally announced June 2020.
-
The Power of Two Choices for Random Walks
Authors:
Agelos Georgakopoulos,
John Haslegrave,
Thomas Sauerwald,
John Sylvester
Abstract:
We apply the power-of-two-choices paradigm to a random walk on a graph: rather than moving to a uniform random neighbour at each step, a controller is allowed to choose from two independent uniform random neighbours. We prove that this allows the controller to significantly accelerate the hitting and cover times in several natural graph classes. In particular, we show that the cover time becomes linear in the number $n$ of vertices on discrete tori and bounded degree trees, of order $\mathcal{O}(n \log \log n)$ on bounded degree expanders, and of order $\mathcal{O}(n (\log \log n)^2)$ on the Erdős-Rényi random graph in a certain sparsely connected regime. We also consider the algorithmic question of computing an optimal strategy, and prove a dichotomy in efficiency between computing strategies for hitting and cover times.
Submitted 27 April, 2021; v1 submitted 12 November, 2019;
originally announced November 2019.
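The two-choice walk described in the abstract above can be illustrated with a minimal sketch. The controller strategy used here (always take whichever of the two sampled neighbours is closer to the target in graph distance) is an assumption chosen for simplicity, not necessarily the strategy analysed in the paper; the function names and the adjacency-list representation are likewise hypothetical. The graph is assumed to be connected.

```python
import random
from collections import deque
from typing import Dict, List


def bfs_distances(adj: Dict[int, List[int]], target: int) -> Dict[int, int]:
    """Graph distance from every vertex to `target` (connected graph assumed)."""
    dist = {target: 0}
    queue = deque([target])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist


def two_choice_hitting_time(adj: Dict[int, List[int]], start: int,
                            target: int, rng: random.Random) -> int:
    """Number of steps a two-choice walk needs to reach `target` from `start`.

    At each step two independent uniform neighbours are sampled and the
    controller greedily keeps the one closer to the target (an assumed,
    illustrative strategy).
    """
    dist = bfs_distances(adj, target)
    current, steps = start, 0
    while current != target:
        a = rng.choice(adj[current])
        b = rng.choice(adj[current])
        current = a if dist[a] <= dist[b] else b
        steps += 1
    return steps
```

On a cycle, for instance, `two_choice_hitting_time(cycle, 0, 4, random.Random(0))` can be compared against a walk that ignores the second sample to see the speed-up empirically.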
-
Random Walks on Dynamic Graphs: Mixing Times, Hitting Times, and Return Probabilities
Authors:
Thomas Sauerwald,
Luca Zanetti
Abstract:
We establish and generalise several bounds for various random walk quantities including the mixing time and the maximum hitting time. Unlike previous analyses, our derivations are based on rather intuitive notions of local expansion properties which allow us to capture the progress the random walk makes through $t$-step probabilities.
We apply our framework to dynamically changing graphs, where the set of vertices is fixed while the set of edges changes in each round. For random walks on dynamic connected graphs for which the stationary distribution does not change over time, we show that their behaviour is in a certain sense similar to static graphs. For example, we show that the mixing and hitting times of any sequence of $d$-regular connected graphs is $O(n^2)$, generalising a well-known result for static graphs. We also provide refined bounds depending on the isoperimetric dimension of the graph, matching again known results for static graphs. Finally, we investigate properties of random walks on dynamic graphs that are not always connected: we relate their convergence to stationarity to the spectral properties of an average of transition matrices and provide some examples that demonstrate strong discrepancies between static and dynamic graphs.
Submitted 4 March, 2019;
originally announced March 2019.
-
The dispersion time of random walks on finite graphs
Authors:
Nicolas Rivera,
Alexandre Stauffer,
Thomas Sauerwald,
John Sylvester
Abstract:
We study two random processes on an $n$-vertex graph inspired by the internal diffusion limited aggregation (IDLA) model. In both processes $n$ particles start from an arbitrary but fixed origin. Each particle performs a simple random walk until first encountering an unoccupied vertex, at which point the vertex becomes occupied and the random walk terminates. In one of the processes, called \textit{Sequential-IDLA}, only one particle moves until settling and only then does the next particle start, whereas in the second process, called \textit{Parallel-IDLA}, all unsettled particles move simultaneously. Our main goal is to analyze the so-called dispersion time of these processes, which is the maximum number of steps performed by any of the $n$ particles.
In order to compare the two processes, we develop a coupling which shows that the dispersion time of the Parallel-IDLA stochastically dominates that of the Sequential-IDLA; however, the total number of steps performed by all particles has the same distribution in both processes. This coupling also shows that the dispersion time of Parallel-IDLA is bounded in expectation by the dispersion time of the Sequential-IDLA up to a multiplicative $\log n$ factor. Moreover, we derive asymptotic upper and lower bounds on the dispersion time for several graph classes, such as cliques, cycles, binary trees, $d$-dimensional grids, hypercubes and expanders. Most of our bounds are tight up to a multiplicative constant.
Submitted 26 November, 2019; v1 submitted 28 August, 2018;
originally announced August 2018.
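The Sequential-IDLA process from the abstract above can be simulated in a few lines. This is only an illustrative sketch: the function name and adjacency-list input are hypothetical, the graph is assumed to be connected, and the first particle is taken to settle at the origin after zero steps because the origin starts unoccupied.

```python
import random
from typing import Dict, List


def sequential_idla_dispersion(adj: Dict[int, List[int]], origin: int,
                               rng: random.Random) -> int:
    """Dispersion time of one Sequential-IDLA run on a connected graph.

    One particle per vertex is released from `origin`, one after another.
    Each performs a simple random walk until it stands on an unoccupied
    vertex, which it then occupies. Returns the largest number of steps
    taken by any single particle.
    """
    occupied = set()
    longest = 0
    for _ in range(len(adj)):
        v, steps = origin, 0
        while v in occupied:
            v = rng.choice(adj[v])   # one step of the simple random walk
            steps += 1
        occupied.add(v)              # the particle settles here
        longest = max(longest, steps)
    return longest
```

Averaging the returned value over many runs gives an empirical estimate of the expected dispersion time for a given graph.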
-
On coalescence time in graphs--When is coalescing as fast as meeting?
Authors:
Varun Kanade,
Frederik Mallmann-Trenn,
Thomas Sauerwald
Abstract:
Coalescing random walks is a fundamental stochastic process, where a set of particles perform independent discrete-time random walks on an undirected graph. Whenever two or more particles meet at a given node, they merge and continue as a single random walk. The coalescence time is defined as the expected time until only one particle remains, starting from one particle at every node. Despite recent progress the coalescence time for graphs such as binary trees, d-dimensional tori, hypercubes and more generally, vertex-transitive graphs, remains unresolved. We provide a powerful toolkit that results in tight bounds for various topologies including the aforementioned ones. The meeting time is defined as the worst-case expected time required for two random walks to arrive at the same node at the same time. As a general result, we establish that for graphs whose meeting time is only marginally larger than the mixing time (a factor of log^2 n), the coalescence time of n random walks equals the meeting time up to constant factors. This upper bound is complemented by the construction of a graph family demonstrating that this result is the best possible up to constant factors. For almost-regular graphs, we bound the coalescence time by the hitting time, resolving the discrete-time variant of a conjecture by Aldous for this class of graphs. Finally, we prove that for any graph the coalescence time is bounded by O(n^3) (which is tight for the Barbell graph); surprisingly even such a basic question about the coalescing time was not answered before this work. By duality, our results give bounds on the voter model and therefore give bounds on the consensus time in arbitrary undirected graphs. We also establish a new bound on the hitting time and cover time of regular graphs, improving and tightening previous results by Broder and Karlin, as well as those by Aldous and Fill.
Submitted 1 November, 2018; v1 submitted 8 November, 2016;
originally announced November 2016.
-
Intersection and mixing times for reversible chains
Authors:
Yuval Peres,
Thomas Sauerwald,
Perla Sousi,
Alexandre Stauffer
Abstract:
Suppose X and Y are two independent irreducible Markov chains on n states. We consider the intersection time, which is the first time their trajectories intersect. We show for reversible and lazy chains that the total variation mixing time is always upper bounded by the expected intersection time taken over the worst starting states. For random walks on trees we show the two quantities are equivalent. We obtain an expression for the expected intersection time in terms of the eigenvalues for reversible and transitive chains. For such chains we also show that it is up to constants the geometric mean of n and E[I], where I is the number of intersections up to the uniform mixing time. Finally for random walks on regular graphs we obtain sharp inequalities that relate the expected intersection time to maximum hitting time and mixing time.
Submitted 29 December, 2014;
originally announced December 2014.
-
Asymptotic bounds on the equilateral dimension of hypercubes
Authors:
Lorenz Minder,
Thomas Sauerwald,
Sven-Ake Wegner
Abstract:
A subset of the finite dimensional hypercube is said to be equilateral if the distance of any two distinct points equals a fixed value. The equilateral dimension of the hypercube is defined as the maximal size of its equilateral subsets. We study asymptotic bounds on the latter quantity considered as a function of two variables, namely dimension and distance.
Submitted 21 August, 2014;
originally announced August 2014.
-
Cutoff Phenomenon for Random Walks on Kneser Graphs
Authors:
Ali Pourmiri,
Thomas Sauerwald
Abstract:
The cutoff phenomenon for an ergodic Markov chain describes a sharp transition in the convergence to its stationary distribution, over a negligible period of time, known as the cutoff window. We study the cutoff phenomenon for simple random walks on Kneser graphs, which form a family of ergodic Markov chains. Given two integers $n$ and $k$, the Kneser graph $K(2n+k,n)$ is defined as the graph with vertex set being all subsets of $\{1,\ldots,2n+k\}$ of size $n$ and two vertices $A$ and $B$ being connected by an edge if $A\cap B =\emptyset$. We show that for any $k=O(n)$, the random walk on $K(2n+k,n)$ exhibits a cutoff at $\frac{1}{2}\log_{1+k/n}{(2n+k)}$ with a window of size $O(\frac{n}{k})$.
Submitted 17 April, 2014;
originally announced April 2014.
-
Balls into bins via local search: cover time and maximum load
Authors:
Karl Bringmann,
Thomas Sauerwald,
Alexandre Stauffer,
He Sun
Abstract:
We study a natural process for allocating m balls into n bins that are organized as the vertices of an undirected graph G. Balls arrive one at a time. When a ball arrives, it first chooses a vertex u in G uniformly at random. Then the ball performs a local search in G starting from u until it reaches a vertex with local minimum load, where the ball is finally placed. Then the next ball arrives and this procedure is repeated. For the case m = n, we give an upper bound for the maximum load on graphs with bounded degrees. We also propose the study of the cover time of this process, which is defined as the smallest m so that every bin has at least one ball allocated to it. We establish an upper bound for the cover time on graphs with bounded degrees. Our bounds for the maximum load and the cover time are tight when the graph is transitive or sufficiently homogeneous. We also give upper bounds for the maximum load when m > n.
Submitted 15 February, 2014; v1 submitted 2 October, 2013;
originally announced October 2013.
-
Balls into Bins via Local Search
Authors:
Paul Bogdan,
Thomas Sauerwald,
Alexandre Stauffer,
He Sun
Abstract:
We propose a natural process for allocating n balls into n bins that are organized as the vertices of an undirected graph G. Each ball first chooses a vertex u in G uniformly at random. Then the ball performs a local search in G starting from u until it reaches a vertex with local minimum load, where the ball is finally placed. In our main result, we prove that this process yields a maximum load of only $Θ(\log \log n)$ on expander graphs. In addition, we show that for d-dimensional grids the maximum load is $Θ\Big(\big(\frac{\log n}{\log \log n}\big)^{\frac{1}{d+1}}\Big)$. Finally, for almost regular graphs with minimum degree $Ω(\log n)$, we prove that the maximum load is constant and also reveal a fundamental difference between random and arbitrary tie-breaking rules.
Submitted 9 July, 2012;
originally announced July 2012.
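The allocation process described in the abstract above can be sketched as follows. The details of the local search are assumptions made for illustration: here a ball moves to a least-loaded neighbour only while that neighbour has strictly smaller load, and ties are broken by adjacency-list order. The abstract notes that the tie-breaking rule matters, so this is one possible variant rather than the paper's definition. Every vertex is assumed to have at least one neighbour.

```python
import random
from typing import Dict, List


def local_search_allocation(adj: Dict[int, List[int]], n_balls: int,
                            rng: random.Random) -> Dict[int, int]:
    """Allocate `n_balls` balls to the vertices of a graph by local search.

    Each ball starts at a uniformly random vertex and repeatedly moves to a
    least-loaded neighbour while that neighbour's load is strictly smaller
    (assumed rule; ties broken by list order). It is placed at the local
    minimum it reaches. Returns the final load of every vertex.
    """
    vertices = list(adj)
    load = {v: 0 for v in vertices}
    for _ in range(n_balls):
        u = rng.choice(vertices)
        while True:
            best = min(adj[u], key=lambda w: load[w])
            if load[best] >= load[u]:
                break                # u is a local minimum
            u = best                 # load strictly decreases, so this terminates
        load[u] += 1
    return load
```

The quantity studied in the abstract is then `max(load.values())` after `n_balls = len(adj)` balls have been placed.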
-
Tight Bounds for Randomized Load Balancing on Arbitrary Network Topologies
Authors:
Thomas Sauerwald,
He Sun
Abstract:
We consider the problem of balancing load items (tokens) in networks. Starting with an arbitrary load distribution, we allow nodes to exchange tokens with their neighbors in each round. The goal is to achieve a distribution where all nodes have nearly the same number of tokens.
For the continuous case where tokens are arbitrarily divisible, most load balancing schemes correspond to Markov chains, whose convergence is fairly well-understood in terms of their spectral gap. However, in many applications, load items cannot be divided arbitrarily, and we need to deal with the discrete case where the load is composed of indivisible tokens. This discretization entails a non-linear behavior due to its rounding errors, which makes this analysis much harder than in the continuous case.
We investigate several randomized protocols for different communication models in the discrete case. As our main result, we prove that for any regular network in the matching model, all nodes have the same load up to an additive constant in (asymptotically) the same number of rounds as required in the continuous case. This generalizes and tightens the previous best result, which only holds for expander graphs, and demonstrates that there is almost no difference between the discrete and continuous cases. Our results also provide a positive answer to the question of how well discrete load balancing can be approximated by (continuous) Markov chains, which has been posed by many researchers.
Submitted 10 November, 2014; v1 submitted 12 January, 2012;
originally announced January 2012.
-
The Cover Time of Deterministic Random Walks
Authors:
Tobias Friedrich,
Thomas Sauerwald
Abstract:
The rotor router model is a popular deterministic analogue of a random walk on a graph. Instead of moving to a random neighbor, the neighbors are served in a fixed order. We examine how fast this "deterministic random walk" covers all vertices (or all edges). We present general techniques to derive upper bounds for the vertex and edge cover time and derive matching lower bounds for several important graph classes. Depending on the topology, the deterministic random walk can be asymptotically faster, slower or equally fast as the classic random walk. We also examine the short term behavior of deterministic random walks, that is, the time to visit a fixed small number of vertices or edges.
Submitted 17 June, 2010;
originally announced June 2010.
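The rotor-router model from the abstract above is simple enough to simulate directly. The sketch below is illustrative only: the function name and input format are hypothetical, each vertex's fixed service order is taken to be the order of its adjacency list, all rotors are assumed to start at the first neighbour, and the graph is assumed to be connected and undirected (otherwise the loop may not terminate).

```python
from typing import Dict, List


def rotor_router_cover_time(adj: Dict[int, List[int]], start: int) -> int:
    """Steps until a rotor-router walk from `start` has visited every vertex.

    Each vertex serves its neighbours in the fixed cyclic order given by its
    adjacency list; the rotor at a vertex advances by one position every
    time the walk leaves that vertex.
    """
    rotor = {v: 0 for v in adj}      # index of the next neighbour to serve
    visited = {start}
    current, steps = start, 0
    while len(visited) < len(adj):
        nxt = adj[current][rotor[current]]
        rotor[current] = (rotor[current] + 1) % len(adj[current])
        current = nxt
        visited.add(current)
        steps += 1
    return steps
```

Because the walk is deterministic, the result depends only on the graph, the start vertex, the neighbour orders and the initial rotor positions, so worst-case behaviour can be explored by varying those inputs.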
-
Cover Time and Broadcast Time
Authors:
Robert Elsässer,
Thomas Sauerwald
Abstract:
We introduce a new technique for bounding the cover time of random walks by relating it to the runtime of randomized broadcast. In particular, we strongly confirm for dense graphs the intuition of Chandra et al. [CRRST97] that "the cover time of the graph is an appropriate metric for the performance of certain kinds of randomized broadcast algorithms". In more detail, our results are as follows: For any graph $G=(V,E)$ of size $n$ and minimum degree $δ$, we have $\mathcal{R}(G)= \mathcal{O}(\frac{|E|}{δ} \cdot \log n)$, where $\mathcal{R}(G)$ denotes the quotient of the cover time and broadcast time. This bound is tight for binary trees and tight up to logarithmic factors for many graphs including hypercubes, expanders and lollipop graphs. For any $δ$-regular (or almost $δ$-regular) graph $G$ it holds that $\mathcal{R}(G) = Ω(\frac{δ^2}{n} \cdot \frac{1}{\log n})$. Together with our upper bound on $\mathcal{R}(G)$, this lower bound strongly confirms the intuition of Chandra et al. for graphs with minimum degree $Θ(n)$, since then the cover time equals the broadcast time multiplied by $n$ (neglecting logarithmic factors). Conversely, for any $δ$ we construct almost $δ$-regular graphs that satisfy $\mathcal{R}(G) = \mathcal{O}(\max \{\sqrt{n},δ\} \cdot \log^2 n)$. Since any regular expander satisfies $\mathcal{R}(G) = Θ(n)$, the strong relationship given above does not hold if $δ$ is polynomially smaller than $n$. Our bounds also demonstrate that the relationship between cover time and broadcast time is much stronger than the known relationships between any of them and the mixing time (or the closely related spectral gap).
Submitted 10 February, 2009;
originally announced February 2009.