-
Distributed Triangle Detection is Hard in Few Rounds
Authors:
Sepehr Assadi,
Janani Sundaresan
Abstract:
In the distributed triangle detection problem, we have an $n$-vertex network $G=(V,E)$ with one player for each vertex of the graph who sees the edges incident on the vertex. The players communicate in synchronous rounds using the edges of this network and have a limited bandwidth of $O(\log{n})$ bits over each edge. The goal is to detect whether or not $G$ contains a triangle as a subgraph in a m…
▽ More
In the distributed triangle detection problem, we have an $n$-vertex network $G=(V,E)$ with one player for each vertex of the graph who sees the edges incident on the vertex. The players communicate in synchronous rounds using the edges of this network and have a limited bandwidth of $O(\log{n})$ bits over each edge. The goal is to detect whether or not $G$ contains a triangle as a subgraph in a minimal number of rounds.
We prove that any protocol (deterministic or randomized) for distributed triangle detection requires $Ω(\log\log{n})$ rounds of communication. Prior to our work, only one-round lower bounds were known for this problem.
The primary technique for proving these types of distributed lower bounds is via reductions from two-party communication complexity. However, it has been known for a while that this approach is provably incapable of establishing any meaningful lower bounds for distributed triangle detection. Our main technical contribution is a new information theoretic argument which combines recent advances on multi-pass graph streaming lower bounds with the point-to-point communication aspects of distributed models, and can be of independent interest.
△ Less
Submitted 2 April, 2025;
originally announced April 2025.
-
Settling the Pass Complexity of Approximate Matchings in Dynamic Graph Streams
Authors:
Sepehr Assadi,
Soheil Behnezhad,
Christian Konrad,
Kheeran K. Naidu,
Janani Sundaresan
Abstract:
A semi-streaming algorithm in dynamic graph streams processes any $n$-vertex graph by making one or multiple passes over a stream of insertions and deletions to edges of the graph and using $O(n \cdot \mbox{polylog}(n))$ space. Semi-streaming algorithms for dynamic streams were first obtained in the seminal work of Ahn, Guha, and McGregor in 2012, alongside the introduction of the graph sketching…
▽ More
A semi-streaming algorithm in dynamic graph streams processes any $n$-vertex graph by making one or multiple passes over a stream of insertions and deletions to edges of the graph and using $O(n \cdot \mbox{polylog}(n))$ space. Semi-streaming algorithms for dynamic streams were first obtained in the seminal work of Ahn, Guha, and McGregor in 2012, alongside the introduction of the graph sketching technique, which remains the de facto way of designing algorithms in this model and a highly popular technique for designing graph algorithms in general.
We settle the pass complexity of approximating maximum matchings in dynamic streams via semi-streaming algorithms by improving the state-of-the-art in both upper and lower bounds.
We present a randomized sketching based semi-streaming algorithm for $O(1)$-approximation of maximum matching in dynamic streams using $O(\log\log{n})$ passes. The approximation ratio of this algorithm can be improved to $(1+ε)$ for any fixed $ε> 0$ even on weighted graphs using standard techniques. This exponentially improves upon several $O(\log{n})$ pass algorithms developed for this problem since the introduction of the dynamic graph streaming model.
In addition, we prove that any semi-streaming algorithm (not only sketching based) for $O(1)$-approximation of maximum matching in dynamic streams requires $Ω(\log\log{n})$ passes. This presents the first multi-pass lower bound for this problem, which is already also optimal, settling a longstanding open question in this area.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Optimal Communication Complexity of Chained Index
Authors:
Janani Sundaresan
Abstract:
We study the CHAIN communication problem introduced by Cormode et al. [ICALP 2019]. It is a generalization of the well-studied INDEX problem. For $k\geq 1$, in CHAIN$_{n,k}$, there are $k$ instances of INDEX, all with the same answer. They are shared between $k+1$ players as follows. Player 1 has the first string $X^1 \in \{0,1\}^n$, player 2 has the first index $σ^1 \in [n]$ and the second string…
▽ More
We study the CHAIN communication problem introduced by Cormode et al. [ICALP 2019]. It is a generalization of the well-studied INDEX problem. For $k\geq 1$, in CHAIN$_{n,k}$, there are $k$ instances of INDEX, all with the same answer. They are shared between $k+1$ players as follows. Player 1 has the first string $X^1 \in \{0,1\}^n$, player 2 has the first index $σ^1 \in [n]$ and the second string $X^2 \in \{0,1\}^n$, player 3 has the second index $σ^2 \in [n]$ along with the third string $X^3 \in \{0,1\}^n$, and so on. Player $k+1$ has the last index $σ^k \in [n]$. The communication is one way from each player to the next, starting from player 1 to player 2, then from player 2 to player 3 and so on. Player $k+1$, after receiving the message from player $k$, has to output a single bit which is the answer to all $k$ instances of INDEX.
It was proved that the CHAIN$_{n,k}$ problem requires $Ω(n/k^2)$ communication by Cormode et al., and they used it to prove streaming lower bounds for approximation of maximum independent sets. Subsequently, it was used by Feldman et al. [STOC 2020] to prove lower bounds for streaming submodular maximization. However, these works do not get optimal bounds on the communication complexity of CHAIN$_{n,k}$, and in fact, it was conjectured by Cormode et al. that $Ω(n)$ bits are necessary, for any $k$.
As our main result, we prove the optimal lower bound of $Ω(n)$ for CHAIN$_{n,k}$. This settles the open conjecture of Cormode et al. in the affirmative. The key technique is to use information theoretic tools to analyze protocols over the Jensen-Shannon divergence measure, as opposed to total variation distance. As a corollary, we get an improved lower bound for approximation of maximum independent set in vertex arrival streams through a reduction from CHAIN directly.
△ Less
Submitted 30 May, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
$\mathcal{O}(\log\log{n})$ Passes is Optimal for Semi-Streaming Maximal Independent Set
Authors:
Sepehr Assadi,
Christian Konrad,
Kheeran K. Naidu,
Janani Sundaresan
Abstract:
In the semi-streaming model for processing massive graphs, an algorithm makes multiple passes over the edges of a given $n$-vertex graph and is tasked with computing the solution to a problem using $O(n \cdot \text{polylog}(n))$ space. Semi-streaming algorithms for Maximal Independent Set (MIS) that run in $O(\log\log{n})$ passes have been known for almost a decade, however, the best lower bounds…
▽ More
In the semi-streaming model for processing massive graphs, an algorithm makes multiple passes over the edges of a given $n$-vertex graph and is tasked with computing the solution to a problem using $O(n \cdot \text{polylog}(n))$ space. Semi-streaming algorithms for Maximal Independent Set (MIS) that run in $O(\log\log{n})$ passes have been known for almost a decade, however, the best lower bounds can only rule out single-pass algorithms. We close this large gap by proving that the current algorithms are optimal: Any semi-streaming algorithm for finding an MIS with constant probability of success requires $Ω(\log\log{n})$ passes. This settles the complexity of this fundamental problem in the semi-streaming model, and constitutes one of the first optimal multi-pass lower bounds in this model.
We establish our result by proving an optimal round vs communication tradeoff for the (multi-party) communication complexity of MIS. The key ingredient of this result is a new technique, called hierarchical embedding, for performing round elimination: we show how to pack many but small hard $(r-1)$-round instances of the problem into a single $r$-round instance, in a way that enforces any $r$-round protocol to effectively solve all these $(r-1)$-round instances also. These embeddings are obtained via a novel application of results from extremal graph theory -- in particular dense graphs with many disjoint unique shortest paths -- together with a newly designed graph product, and are analyzed via information-theoretic tools such as direct-sum and message compression arguments.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Hidden Permutations to the Rescue: Multi-Pass Semi-Streaming Lower Bounds for Approximate Matchings
Authors:
Sepehr Assadi,
Janani Sundaresan
Abstract:
We prove that any semi-streaming algorithm for $(1-ε)$-approximation of maximum bipartite matching requires \[ Ω(\frac{\log{(1/ε)}}{\log{(1/β)}}) \] passes, where $β\in (0,1)$ is the largest parameter so that an $n$-vertex graph with $n^β$ edge-disjoint induced matchings of size $Θ(n)$ exist (such graphs are referred to as RS graphs). Currently, it is known that \[ Ω(\frac{1}{\log\log{n}}) \leqsla…
▽ More
We prove that any semi-streaming algorithm for $(1-ε)$-approximation of maximum bipartite matching requires \[ Ω(\frac{\log{(1/ε)}}{\log{(1/β)}}) \] passes, where $β\in (0,1)$ is the largest parameter so that an $n$-vertex graph with $n^β$ edge-disjoint induced matchings of size $Θ(n)$ exist (such graphs are referred to as RS graphs). Currently, it is known that \[ Ω(\frac{1}{\log\log{n}}) \leqslant β\leqslant 1-Θ(\frac{\log^*{n}}{\log{n}}) \] and closing this huge gap between upper and lower bounds has remained a notoriously difficult problem in combinatorics.
Under the plausible hypothesis that $β= Ω(1)$, our lower bound result provides the first pass-approximation lower bound for (small) constant approximation of matchings in the semi-streaming model, a longstanding open question in the graph streaming literature.
Our techniques are based on analyzing communication protocols for compressing (hidden) permutations. Prior work in this context relied on reducing such problems to Boolean domain and analyzing them via tools like XOR Lemmas and Fourier analysis on Boolean hypercube. In contrast, our main technical contribution is a hardness amplification result for permutations through concatenation in place of prior XOR Lemmas. This result is proven by analyzing permutations directly via simple tools from group representation theory combined with detailed information-theoretic arguments, and can be of independent interest.
△ Less
Submitted 11 October, 2023; v1 submitted 9 October, 2023;
originally announced October 2023.
-
(Noisy) Gap Cycle Counting Strikes Back: Random Order Streaming Lower Bounds for Connected Components and Beyond
Authors:
Sepehr Assadi,
Janani Sundaresan
Abstract:
We continue the study of the communication complexity of gap cycle counting problems. These problems have been introduced by Verbin and Yu [SODA 2011] and have found numerous applications in proving streaming lower bounds. In the noisy gap cycle counting problem (NGC), there is a small integer $k \geq 1$ and an $n$-vertex graph consisted of vertex-disjoint union of either $k$-cycles or $2k$-cycles…
▽ More
We continue the study of the communication complexity of gap cycle counting problems. These problems have been introduced by Verbin and Yu [SODA 2011] and have found numerous applications in proving streaming lower bounds. In the noisy gap cycle counting problem (NGC), there is a small integer $k \geq 1$ and an $n$-vertex graph consisted of vertex-disjoint union of either $k$-cycles or $2k$-cycles, plus $O(n/k)$ disjoint paths of length $k-1$ in both cases (``noise''). The edges of this graph are partitioned between Alice and Bob whose goal is to decide which case the graph belongs to with minimal communication from Alice to Bob.
We study the robust communication complexity -- `a la Chakrabarti, Cormode, and McGregor [STOC 2008] -- of NGC, namely, when edges are partitioned randomly between the players. This is in contrast to all prior work on gap cycle counting problems in adversarial partitions. While NGC can be solved trivially with zero communication when $k < \log{n}$, we prove that when $k$ is a constant factor larger than $\log{n}$, the robust (one-way) communication complexity of NGC is $Ω(n)$ bits.
As a corollary of this result, we can prove several new graph streaming lower bounds for random order streams. In particular, we show that any streaming algorithm that for every $\varepsilon > 0$ estimates the number of connected components of a graph presented in a random order stream to within an $\varepsilon \cdot n$ additive factor requires $2^{Ω(1/\varepsilon)}$ space, settling a conjecture of Peng and Sohler [SODA 2018]. We further discuss new implications of our lower bounds to other problems such as estimating size of maximum matchings and independent sets on planar graphs, random walks, as well as to stochastic streams.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.