Search | arXiv e-print repository

Achieving Tighter Finite-Time Rates for Heterogeneous Federated Stochastic Approximation under Markovian Sampling

Authors: Feng Zhu, Aritra Mitra, Robert W. Heath

Abstract: Motivated by collaborative reinforcement learning (RL) and optimization with time-correlated data, we study a generic federated stochastic approximation problem involving $M$ agents, where each agent is characterized by an agent-specific (potentially nonlinear) local operator. The goal is for the agents to communicate intermittently via a server to find the root of the average of the agents' local… ▽ More Motivated by collaborative reinforcement learning (RL) and optimization with time-correlated data, we study a generic federated stochastic approximation problem involving $M$ agents, where each agent is characterized by an agent-specific (potentially nonlinear) local operator. The goal is for the agents to communicate intermittently via a server to find the root of the average of the agents' local operators. The generality of our setting stems from allowing for (i) Markovian data at each agent and (ii) heterogeneity in the roots of the agents' local operators. The limited recent work that has accounted for both these features in a federated setting fails to guarantee convergence to the desired point or to show any benefit of collaboration; furthermore, they rely on projection steps in their algorithms to guarantee bounded iterates. Our work overcomes each of these limitations. We develop a novel algorithm titled \texttt{FedHSA}, and prove that it guarantees convergence to the correct point, while enjoying an $M$-fold linear speedup in sample-complexity due to collaboration. To our knowledge, \emph{this is the first finite-time result of its kind}, and establishing it (without relying on a projection step) entails a fairly intricate argument that accounts for the interplay between complex temporal correlations due to Markovian sampling, multiple local steps to save communication, and the drift-effects induced by heterogeneous local operators. Our results have implications for a broad class of heterogeneous federated RL problems (e.g., policy evaluation and control) with function approximation, where the agents' Markov decision processes can differ in their probability transition kernels and reward functions. △ Less

Submitted 15 April, 2025; originally announced April 2025.

arXiv:2409.05291 [pdf, ps, other]

Towards Fast Rates for Federated and Multi-Task Reinforcement Learning

Authors: Feng Zhu, Robert W. Heath Jr., Aritra Mitra

Abstract: We consider a setting involving $N$ agents, where each agent interacts with an environment modeled as a Markov Decision Process (MDP). The agents' MDPs differ in their reward functions, capturing heterogeneous objectives/tasks. The collective goal of the agents is to communicate intermittently via a central server to find a policy that maximizes the average of long-term cumulative rewards across e… ▽ More We consider a setting involving $N$ agents, where each agent interacts with an environment modeled as a Markov Decision Process (MDP). The agents' MDPs differ in their reward functions, capturing heterogeneous objectives/tasks. The collective goal of the agents is to communicate intermittently via a central server to find a policy that maximizes the average of long-term cumulative rewards across environments. The limited existing work on this topic either only provide asymptotic rates, or generate biased policies, or fail to establish any benefits of collaboration. In response, we propose Fast-FedPG - a novel federated policy gradient algorithm with a carefully designed bias-correction mechanism. Under a gradient-domination condition, we prove that our algorithm guarantees (i) fast linear convergence with exact gradients, and (ii) sub-linear rates that enjoy a linear speedup w.r.t. the number of agents with noisy, truncated policy gradients. Notably, in each case, the convergence is to a globally optimal policy with no heterogeneity-induced bias. In the absence of gradient-domination, we establish convergence to a first-order stationary point at a rate that continues to benefit from collaboration. △ Less

Submitted 8 September, 2024; originally announced September 2024.

Comments: Accepted to the Decision and Control Conference (CDC), 2024

arXiv:2304.00593 [pdf, other]

Online variable-length source coding for minimum bitrate LQG control

Authors: Travis C. Cuvelier, Takashi Tanaka, Robert W. Heath Jr

Abstract: We propose an adaptive coding approach to achieve linear-quadratic-Gaussian (LQG) control with near-minimum bitrate prefix-free feedback. Our approach combines a recent analysis of a quantizer design for minimum rate LQG control with work on universal lossless source coding for sources on countable alphabets. In the aforementioned quantizer design, it was established that the quantizer outputs are… ▽ More We propose an adaptive coding approach to achieve linear-quadratic-Gaussian (LQG) control with near-minimum bitrate prefix-free feedback. Our approach combines a recent analysis of a quantizer design for minimum rate LQG control with work on universal lossless source coding for sources on countable alphabets. In the aforementioned quantizer design, it was established that the quantizer outputs are an asymptotically stationary, ergodic process. To enable LQG control with provably near-minimum bitrate, the quantizer outputs must be encoded into binary codewords efficiently. This is possible given knowledge of the probability distributions of the quantizer outputs, or of their limiting distribution. Obtaining such knowledge is challenging; the distributions do not readily admit closed form descriptions. This motivates the application of universal source coding. Our main theoretical contribution in this work is a proof that (after an invertible transformation), the quantizer outputs are random variables that fall within an exponential or power-law envelope class (depending on the plant dimension). Using ideas from universal coding on envelope classes, we develop a practical, zero-delay version of these algorithms that operates with fixed precision arithmetic. We evaluate the performance of this algorithm numerically, and demonstrate competitive results with respect to fundamental tradeoffs between bitrate and LQG control performance. △ Less

Submitted 2 April, 2023; originally announced April 2023.

Comments: 8 pages 5 figures, under submission to the 2023 IEEE Conference on Decision and Control

arXiv:2204.00588 [pdf, other]

doi 10.1109/JSAIT.2022.3232060

Time-invariant Prefix Coding for LQG Control

Authors: Travis C. Cuvelier, Takashi Tanaka, Robert W. Heath Jr

Abstract: Motivated by control with communication constraints, in this work we develop a time-invariant data compression architecture for linear-quadratic-Gaussian (LQG) control with minimum bitrate prefix-free feedback. For any fixed control performance, the approach we propose nearly achieves known directed information (DI) lower bounds on the time-average expected codeword length. We refine the analysis… ▽ More Motivated by control with communication constraints, in this work we develop a time-invariant data compression architecture for linear-quadratic-Gaussian (LQG) control with minimum bitrate prefix-free feedback. For any fixed control performance, the approach we propose nearly achieves known directed information (DI) lower bounds on the time-average expected codeword length. We refine the analysis of a classical achievability approach, which required quantized plant measurements to be encoded via a time-varying lossless source code. We prove that the sequence of random variables describing the quantizations has a limiting distribution and that the quantizations may be encoded with a fixed source code optimized for this distribution without added time-asymptotic redundancy. Our result follows from analyzing the long-term stochastic behavior of the system, and permits us to additionally guarantee that the time-average codeword length (as opposed to expected length) is almost surely within a few bits of the minimum DI. To our knowledge, this time-invariant achievability result is the first in the literature. The originally published version of the supplementary material included a proof that contained an error that turned out to be inconsequential. This updated preprint corrects this error, which originally appeared under Lemma A.7. △ Less

Submitted 6 July, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

Comments: Version as accepted to the IEEE Journal on Selected Areas in Information Theory (Special Issue on Modern Compression), modulo an additional correction to the proof of Lemma A.7. Official version: https://ieeexplore.ieee.org/document/10002900. 14 page main paper, 4 pages appendix, 3 figures

Journal ref: IEEE Journal on Selected Areas in Information Theory 2022

arXiv:2203.12467 [pdf, ps, other]

doi 10.1109/LCSYS.2022.3180402

A Lower-bound for Variable-length Source Coding in Linear-Quadratic-Gaussian Control with Shared Randomness

Authors: Travis C. Cuvelier, Takashi Tanaka, Robert W. Heath Jr

Abstract: In this letter, we consider a Linear Quadratic Gaussian (LQG) control system where feedback occurs over a noiseless binary channel and derive lower bounds on the minimum communication cost (quantified via the channel bitrate) required to attain a given control performance. We assume that at every time step an encoder can convey a packet containing a variable number of bits over the channel to a de… ▽ More In this letter, we consider a Linear Quadratic Gaussian (LQG) control system where feedback occurs over a noiseless binary channel and derive lower bounds on the minimum communication cost (quantified via the channel bitrate) required to attain a given control performance. We assume that at every time step an encoder can convey a packet containing a variable number of bits over the channel to a decoder at the controller. Our system model provides for the possibility that the encoder and decoder have shared randomness, as is the case in systems using dithered quantizers. We define two extremal prefix-free requirements that may be imposed on the message packets; such constraints are useful in that they allow the decoder, and potentially other agents to uniquely identify the end of a transmission in an online fashion. We then derive a lower bound on the rate of prefix-free coding in terms of directed information; in particular we show that a previously known bound still holds in the case with shared randomness. We generalize the bound for when prefix constraints are relaxed, and conclude with a rate-distortion formulation. △ Less

Submitted 2 June, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

arXiv:1107.5510 [pdf, ps, other]

A Nielsen theory for coincidences of iterates

Authors: Philip R. Heath, P. Christopher Staecker

Abstract: As the title suggests, this paper gives a Nielsen theory of coincidences of iterates of two self maps f, g of a closed manifold. The ideas is, as much as possible, to generalize Nielsen type periodic point theory, but there are many obstacles. Many times we get similar results to the "classical ones" in Nielsen periodic point theory, but with stronger hypotheses. As the title suggests, this paper gives a Nielsen theory of coincidences of iterates of two self maps f, g of a closed manifold. The ideas is, as much as possible, to generalize Nielsen type periodic point theory, but there are many obstacles. Many times we get similar results to the "classical ones" in Nielsen periodic point theory, but with stronger hypotheses. △ Less

Submitted 27 July, 2011; originally announced July 2011.

Comments: 30 pages

MSC Class: 55M20; 37C25

arXiv:1009.3046 [pdf, other]

A discontinuous Galerkin method for the Vlasov-Poisson system

Authors: R. E. Heath, I. M. Gamba, P. J. Morrison, C. Michler

Abstract: A discontinuous Galerkin method for approximating the Vlasov-Poisson system of equations describing the time evolution of a collisionless plasma is proposed. The method is mass conservative and, in the case that piecewise constant functions are used as a basis, the method preserves the positivity of the electron distribution function and weakly enforces continuity of the electric field through mes… ▽ More A discontinuous Galerkin method for approximating the Vlasov-Poisson system of equations describing the time evolution of a collisionless plasma is proposed. The method is mass conservative and, in the case that piecewise constant functions are used as a basis, the method preserves the positivity of the electron distribution function and weakly enforces continuity of the electric field through mesh interfaces and boundary conditions. The performance of the method is investigated by computing several examples and error estimates associated system's approximation are stated. In particular, computed results are benchmarked against established theoretical results for linear advection and the phenomenon of linear Landau damping for both the Maxwell and Lorentz distributions. Moreover, two nonlinear problems are considered: nonlinear Landau damping and a version of the two-stream instability are computed. For the latter, fine scale details of the resulting long-time BGK-like state are presented. Conservation laws are examined and various comparisons to theory are made. The results obtained demonstrate that the discontinuous Galerkin method is a viable option for integrating the Vlasov-Poisson system. △ Less

Submitted 1 October, 2011; v1 submitted 15 September, 2010; originally announced September 2010.

Comments: To appear in Journal for Computational Physics, 2011. 63 pages, 86 figures

arXiv:0709.0535 [pdf, ps, other]

Constructing packings in Grassmannian manifolds via alternating projection

Authors: I. S. Dhillon, R. W. Heath Jr, T. Strohmer, J. A. Tropp

Abstract: This paper describes a numerical method for finding good packings in Grassmannian manifolds equipped with various metrics. This investigation also encompasses packing in projective spaces. In each case, producing a good packing is equivalent to constructing a matrix that has certain structural and spectral properties. By alternately enforcing the structural condition and then the spectral condit… ▽ More This paper describes a numerical method for finding good packings in Grassmannian manifolds equipped with various metrics. This investigation also encompasses packing in projective spaces. In each case, producing a good packing is equivalent to constructing a matrix that has certain structural and spectral properties. By alternately enforcing the structural condition and then the spectral condition, it is often possible to reach a matrix that satisfies both. One may then extract a packing from this matrix. This approach is both powerful and versatile. In cases where experiments have been performed, the alternating projection method yields packings that compete with the best packings recorded. It also extends to problems that have not been studied numerically. For example, it can be used to produce packings of subspaces in real and complex Grassmannian spaces equipped with the Fubini--Study distance; these packings are valuable in wireless communications. One can prove that some of the novel configurations constructed by the algorithm have packing diameters that are nearly optimal. △ Less

Submitted 4 September, 2007; originally announced September 2007.

Comments: 41 pages, 7 tables, 4 figures

MSC Class: 51N15; 52C17

Journal ref: Exper. Math., Vol. 17, num. 1, pp. 9--35, 2008

arXiv:math/0301135 [pdf, ps, other]

Grassmannian Frames with Applications to Coding and Communication

Authors: Thomas Strohmer, Robert Heath

Abstract: For a given class ${\cal F}$ of uniform frames of fixed redundancy we define a Grassmannian frame as one that minimizes the maximal correlation $|< f_k,f_l >|$ among all frames $\{f_k\}_{k \in {\cal I}} \in {\cal F}$. We first analyze finite-dimensional Grassmannian frames. Using links to packings in Grassmannian spaces and antipodal spherical codes we derive bounds on the minimal achievable cor… ▽ More For a given class ${\cal F}$ of uniform frames of fixed redundancy we define a Grassmannian frame as one that minimizes the maximal correlation $|< f_k,f_l >|$ among all frames $\{f_k\}_{k \in {\cal I}} \in {\cal F}$. We first analyze finite-dimensional Grassmannian frames. Using links to packings in Grassmannian spaces and antipodal spherical codes we derive bounds on the minimal achievable correlation for Grassmannian frames. These bounds yield a simple condition under which Grassmannian frames coincide with uniform tight frames. We exploit connections to graph theory, equiangular line sets, and coding theory in order to derive explicit constructions of Grassmannian frames. Our findings extend recent results on uniform tight frames. We then introduce infinite-dimensional Grassmannian frames and analyze their connection to uniform tight frames for frames which are generated by group-like unitary systems. We derive an example of a Grassmannian Gabor frame by using connections to sphere packing theory. Finally we discuss the application of Grassmannian frames to wireless communication and to multiple description coding. △ Less

Submitted 13 January, 2003; originally announced January 2003.

Comments: Submitted in June 2002 to Appl. Comp. Harm. Anal

Showing 1–9 of 9 results for author: Heath, R