-
A new look at unitarity in quantization commutes with reduction for toric manifolds
Authors:
José M. Mourão,
João P. Nunes,
Augusto Pereira,
Dan Wang
Abstract:
For a symplectic toric manifold we consider half-form quantization in mixed polarizations $\mathcal{P}_\infty$, associated to the action of a subtorus $T^p\subset T^n$. The real directions in these polarizations are generated by components of the $T^p$ moment map.
Polarizations of this type can be obtained by starting at a toric Kähler polarization $\mathcal{P}_0$ and then following
Mabuchi rays of toric Kähler polarizations generated by the norm square of the moment map of the torus subgroup. These geodesic rays are lifted to the quantum bundle via a generalized coherent state transform (gCST) and define equivariant isomorphisms between Hilbert spaces for the Kähler polarizations and the Hilbert space for the mixed polarization.
The polarizations $\mathcal{P}_\infty$ give a new way of looking at the problem of unitarity in the quantization commutes with reduction with respect to the $T^p$-action, as follows. The prequantum operators for the components of the moment map of the $T^p$-action act diagonally with discrete spectrum corresponding to the integral points of the moment polytope. The Hilbert space for the quantization with respect to $\mathcal{P}_\infty$ then naturally decomposes as a direct sum of the Hilbert spaces for all its quantizable coisotropic reductions which, in fact, are the Kähler reductions of the initial Kähler polarization $\mathcal{P}_0$. This will be shown to imply that, for the polarization $\mathcal{P}_\infty$, quantization commutes unitarily with reduction. The problem of unitarity in quantization commutes with reduction for $\mathcal{P}_0$ is then equivalent to the question of whether quantization in the polarization $\mathcal{P}_0$ is unitarily equivalent with quantization in the polarization $\mathcal{P}_\infty$. In fact, this does not hold in general in the toric case.
Submitted 4 March, 2025; v1 submitted 22 December, 2024;
originally announced December 2024.
-
Solving Functional Optimization with Deep Networks and Variational Principles
Authors:
Kawisorn Kamtue,
Jose M. F. Moura,
Orathai Sangpetch
Abstract:
Can neural networks solve math problems using first principles alone? This paper shows how to leverage the fundamental theorem of the calculus of variations to design deep neural networks to solve functional optimization without requiring training data (e.g., ground-truth optimal solutions). Our approach is particularly crucial when the solution is a function defined over an unknown interval or support, such as in minimum-time control problems. By incorporating the necessary conditions satisfied by the optimal function solution, as derived from the calculus of variations, in the design of the deep architecture, CalVNet leverages overparameterized neural networks to learn these optimal functions directly. We validate CalVNet by showing that, without relying on ground-truth data and simply incorporating first principles, it successfully derives the Kalman filter for linear filtering, the bang-bang optimal control for minimum-time problems, and finds geodesics on manifolds. Our results demonstrate that CalVNet can be trained in an unsupervised manner, without relying on ground-truth data, establishing a promising framework for addressing general, potentially unsolved functional optimization problems that still lack analytical solutions.
Submitted 11 March, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Mabuchi rays, test configurations and quantization for toric manifolds
Authors:
António Gouveia,
José M. Mourão,
João P. Nunes
Abstract:
We consider Mabuchi rays of toric Kähler structures on symplectic toric manifolds which are associated to toric test configurations and that are generated by convex functions on the moment polytope, $P$, whose second derivative has support given by a compact subset $K\subset P$. Associated to the test configuration there is a polyhedral decomposition of $P$ whose components are approximated by the components of $P \setminus K$. Along such Mabuchi rays, the toric complex structure remains unchanged on the inverse image under the moment map of $(P \setminus \check {K})$, where $\check {K}$ denotes the interior of $K$. At infinite geodesic time, the Kähler polarizations along the ray converge to interesting new toric mixed polarizations. The quantization in these limit polarizations is given by restrictions of the monomial holomorphic sections of the Kähler quantization, for monomials corresponding to integral points in $P \setminus \check {K}$, and by sections on the fibers of the moment map over the integral points contained in $\check {K}$, which, along the directions parallel to $K$ are holomorphic and which along the directions transverse to $K$ are distributional. These quantizations correspond to quantizations of the central fiber of the test family, in the symplectic picture. We present the case of $S^2$ in detail and then generalize to higher dimensional symplectic toric manifolds. Metrically, at infinite Mabuchi geodesic time, the sphere decomposes into two discs and a collection of cylinders, separated by infinitely long lines. Correspondingly, the quantization in the limit polarization decomposes into a direct sum of the contributions from the quantizations of each of these components.
Submitted 8 July, 2024;
originally announced July 2024.
-
Fibering polarizations and Mabuchi rays on symmetric spaces of compact type
Authors:
Thomas Baier,
Ana Cristina Ferreira,
Joachim Hilgert,
José M. Mourão,
João P. Nunes
Abstract:
In this paper, we describe holomorphic quantizations of the cotangent bundle of a symmetric space of compact type $T^*(U/K)\cong U_\mathbb{C}/K_\mathbb{C}$, along Mabuchi rays of $U$-invariant Kähler structures. At infinite geodesic time, the Kähler polarizations converge to a mixed polarization $\mathcal{P}_\infty$. We show how a generalized coherent state transform relates the quantizations along the Mabuchi geodesics such that holomorphic sections converge, as geodesic time goes to infinity, to distributional $\mathcal{P}_\infty$-polarized sections. Unlike in the case of $T^*U$, the gCST mapping from the Hilbert space of vertically polarized sections is not asymptotically unitary due to the appearance of representation-dependent factors associated to the isotypical decomposition for the $U$-action. In agreement with the general program outlined in [Bai+23], we also describe how the quantization in the limit polarization $\mathcal{P}_\infty$ is given by the direct sum of the quantizations for all the symplectic reductions relative to the invariant torus action associated to the Hamiltonian action of $U$.
Submitted 30 April, 2024;
originally announced April 2024.
-
Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
Directly parameterizing and learning gradients of functions has widespread significance, with specific applications in inverse problems, generative modeling, and optimal transport. This paper introduces gradient networks (GradNets): novel neural network architectures that parameterize gradients of various function classes. GradNets exhibit specialized architectural constraints that ensure correspondence to gradient functions. We provide a comprehensive GradNet design framework that includes methods for transforming GradNets into monotone gradient networks (mGradNets), which are guaranteed to represent gradients of convex functions. Our results establish that our proposed GradNet (and mGradNet) universally approximate the gradients of (convex) functions. Furthermore, these networks can be customized to correspond to specific spaces of potential functions, including transformed sums of (convex) ridge functions. Our analysis leads to two distinct GradNet architectures, GradNet-C and GradNet-M, and we describe the corresponding monotone versions, mGradNet-C and mGradNet-M. Our empirical results demonstrate that these architectures provide efficient parameterizations and outperform existing methods by up to 15 dB in gradient field tasks and by up to 11 dB in Hamiltonian dynamics learning tasks.
Submitted 24 January, 2025; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Learning Gradients of Convex Functions with Monotone Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions. We show that, compared to state-of-the-art methods, our networks are easier to train, learn monotone gradient fields more accurately, and use significantly fewer parameters. We further demonstrate their ability to learn optimal transport mappings to augment driving image data.
Submitted 17 March, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Quantization in fibering polarizations, Mabuchi rays and geometric Peter--Weyl theorem
Authors:
Thomas Baier,
Joachim Hilgert,
Oğuzhan Kaya,
José M. Mourão,
João P. Nunes
Abstract:
In this paper we use techniques of geometric quantization to give a geometric interpretation of the Peter--Weyl theorem. We present a novel approach to half-form corrected geometric quantization in a specific type of non-Kähler polarizations and study one important class of examples, namely cotangent bundles of compact semi-simple groups $K$. Our main results state that this canonically defined polarization occurs in the geodesic boundary of the space of $K\times K$-invariant Kähler polarizations equipped with Mabuchi's metric, and that its half-form corrected quantization is isomorphic to the Kähler case. An important role is played by invariance of the limit polarization under a torus action.
Unitary parallel transport on the bundle of quantum states along a specific Mabuchi geodesic, given by the coherent state transform of Hall, relates the non-commutative Fourier transform for $K$ with the Borel--Weil description of irreducible representations of $K$.
Submitted 25 January, 2023;
originally announced January 2023.
-
Networked Signal and Information Processing
Authors:
Stefan Vlaski,
Soummya Kar,
Ali H. Sayed,
José M. F. Moura
Abstract:
The article reviews significant advances in networked signal and information processing, which, in the last 25 years, have enabled the extension of decision making and inference, optimization, control, and learning to the increasingly ubiquitous environments of distributed agents. As these interacting agents cooperate, new collective behaviors emerge from local decisions and actions. Moreover, and significantly, theory and applications show that networked agents, through cooperation and sharing, are able to match the performance of cloud or federated solutions, while offering the potential for improved privacy, increasing resilience, and saving resources.
Submitted 18 April, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Finite-Time In-Network Computation of Linear Transforms
Authors:
Soummya Kar,
Markus Püschel,
José M. F. Moura
Abstract:
This paper focuses on finite-time in-network computation of linear transforms of distributed graph data. Finite-time transform computation problems are of interest in graph-based computing and signal processing applications in which the objective is to compute, by means of distributed iterative methods, various (linear) transforms of the data distributed at the agents or nodes of the graph. While finite-time computation of consensus-type or, more generally, rank-one transforms has been studied, systematic approaches toward scalable computing of general linear transforms, specifically in the case of heterogeneous agent objectives in which each agent is interested in obtaining a different linear combination of the network data, are relatively less explored. In this paper, by employing ideas from algebraic geometry, we develop a systematic characterization of linear transforms that are amenable to distributed in-network computation in finite time using linear iterations. Further, we consider the general case of directed inter-agent communication graphs. Specifically, it is shown that \emph{almost all} linear transformations of data distributed on the nodes of a digraph containing a Hamiltonian cycle may be computed using at most $N$ linear distributed iterations. Finally, by studying an associated matrix factorization based reformulation of the transform computation problem, we obtain, as a by-product, certain results and characterizations on sparsity-constrained matrix factorization that are of independent interest.
Submitted 3 April, 2021;
originally announced April 2021.
-
Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura,
Aaron Jaech
Abstract:
The article discusses distributed gradient-descent algorithms for computing local and global minima in nonconvex optimization. For local optimization, we focus on distributed stochastic gradient descent (D-SGD)--a simple network-based variant of classical SGD. We discuss local minima convergence guarantees and explore the simple but critical role of the stable-manifold theorem in analyzing saddle-point avoidance. For global optimization, we discuss annealing-based methods in which slowly decaying noise is added to D-SGD. Conditions are discussed under which convergence to global minima is guaranteed. Numerical examples illustrate the key concepts in the paper.
Submitted 16 September, 2020; v1 submitted 23 March, 2020;
originally announced March 2020.
-
Primal-dual methods for large-scale and distributed convex optimization and data analytics
Authors:
Dusan Jakovetic,
Dragana Bajovic,
Joao Xavier,
Jose M. F. Moura
Abstract:
The augmented Lagrangian method (ALM) is a classical optimization tool that solves a given "difficult" (constrained) problem via finding solutions of a sequence of "easier" (often unconstrained) sub-problems with respect to the original (primal) variable, wherein constraint satisfaction is controlled via the so-called dual variables. ALM is highly flexible with respect to how primal sub-problems can be solved, giving rise to a plethora of different primal-dual methods. The powerful ALM mechanism has recently proved to be very successful in various large scale and distributed applications. In addition, several significant advances have appeared, primarily on precise complexity results with respect to computational and communication costs in the presence of inexact updates and design and analysis of novel optimal methods for distributed consensus optimization. We provide a tutorial-style introduction to ALM and its variants for solving convex optimization problems in large scale and distributed settings. We describe control-theoretic tools for the algorithms' analysis and design, survey recent results, and provide novel insights in the context of two emerging applications: federated learning and distributed energy trading.
Submitted 14 April, 2020; v1 submitted 18 December, 2019;
originally announced December 2019.
-
Resilient Distributed Recovery of Large Fields
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies the resilient distributed recovery of large fields under measurement attacks, by a team of agents, where each measures a small subset of the components of a large spatially distributed field. An adversary corrupts some of the measurements. The agents collaborate to process their measurements, and each is interested in recovering only a fraction of the field. We present a field recovery consensus+innovations type distributed algorithm that is resilient to measurement attacks, where an agent maintains and updates a local state based on its neighbors' states and its own measurement. Under sufficient conditions on the attacker and the connectivity of the communication network, each agent's state, even those with compromised measurements, converges to the true value of the field components that it is interested in recovering. Finally, we illustrate the performance of our algorithm through numerical examples.
Submitted 19 October, 2019;
originally announced October 2019.
-
Distributed Global Optimization by Annealing
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura
Abstract:
The paper considers a distributed algorithm for global minimization of a nonconvex function. The algorithm is a first-order consensus + innovations type algorithm that incorporates decaying additive Gaussian noise for annealing, converging to the set of global minima under certain technical assumptions. The paper presents simple methods for verifying that the required technical assumptions hold and illustrates them with a distributed target-localization application.
Submitted 20 July, 2019;
originally announced July 2019.
-
Partial coherent state transforms, $G \times T$-invariant Kähler structures and geometric quantization of cotangent bundles of compact Lie groups
Authors:
José M. Mourão,
João P. Nunes,
Miguel B. Pereira
Abstract:
In this paper, we study the analytic continuation to complex time of the Hamiltonian flow of certain $G\times T$-invariant functions on the cotangent bundle of a compact connected Lie group $G$ with maximal torus $T$. Namely, we will take the Hamiltonian flows of one $G\times G$-invariant function, $h$, and one $G\times T$-invariant function, $f$. Acting with these complex time Hamiltonian flows on $G\times G$-invariant Kähler structures gives new $G\times T$-invariant, but not $G\times G$-invariant, Kähler structures on $T^*G$. We study the Hilbert spaces ${\mathcal H}_{\tau,\sigma}$ corresponding to the quantization of $T^*G$ with respect to these non-invariant Kähler structures. On the other hand, by taking the vertical Schrödinger polarization as a starting point, the above $G\times T$-invariant Hamiltonian flows also generate families of mixed polarizations $\mathcal{P}_{0,\sigma}, \sigma\in {\mathbb C}, {\rm Im}(\sigma) >0$. Each of these mixed polarizations is globally given by a direct sum of an integrable real distribution and of a complex distribution that defines a Kähler structure on the leaves of a foliation of $T^*G$. The geometric quantization of $T^*G$ with respect to these mixed polarizations gives rise to unitary partial coherent state transforms, corresponding to KSH maps as defined in [KMN1,KMN2].
Submitted 9 September, 2019; v1 submitted 11 July, 2019;
originally announced July 2019.
-
Resilient Distributed Field Estimation
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
We study resilient distributed field estimation under measurement attacks. A network of agents or devices measures a large, spatially distributed physical field parameter. An adversary arbitrarily manipulates the measurements of some of the agents. Each agent's goal is to process its measurements and information received from its neighbors to estimate only a few specific components of the field. We present $\mathbf{SAFE}$, the Saturating Adaptive Field Estimator, a consensus+innovations distributed field estimator that is resilient to measurement attacks. Under sufficient conditions on the compromised measurement streams, the physical coupling between the field and the agents' measurements, and the connectivity of the cyber communication network, $\mathbf{SAFE}$ guarantees that each agent's estimate converges almost surely to the true value of the components of the parameter in which the agent is interested. Finally, we illustrate the performance of $\mathbf{SAFE}$ through numerical examples.
Submitted 26 March, 2020; v1 submitted 18 April, 2019;
originally announced April 2019.
-
Annealing for Distributed Global Optimization
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura
Abstract:
The paper proves convergence to global optima for a class of distributed algorithms for nonconvex optimization in network-based multi-agent settings. Agents are permitted to communicate over a time-varying undirected graph. Each agent is assumed to possess a local objective function (assumed to be smooth, but possibly nonconvex). The paper considers algorithms for optimizing the sum function. A distributed algorithm of the consensus+innovations type is proposed which relies on first-order information at the agent level. Under appropriate conditions on network connectivity and the cost objective, convergence to the set of global optima is achieved by an annealing-type approach, with decaying Gaussian noise independently added into each agent's update step. It is shown that the proposed algorithm converges in probability to the set of global minima of the sum function.
Submitted 18 March, 2019;
originally announced March 2019.
-
Resilient Distributed Parameter Estimation with Heterogeneous Data
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies resilient distributed estimation under measurement attacks. A set of agents each makes successive local, linear, noisy measurements of an unknown vector field collected in a vector parameter. The local measurement models are heterogeneous across agents and may be locally unobservable for the unknown parameter. An adversary compromises some of the measurement streams and changes their values arbitrarily. The agents' goal is to cooperate over a peer-to-peer communication network to process their (possibly compromised) local measurements and estimate the value of the unknown vector parameter. We present SAGE, the Saturating Adaptive Gain Estimator, a distributed, recursive, consensus+innovations estimator that is resilient to measurement attacks. We demonstrate that, as long as the number of compromised measurement streams is below a particular bound, SAGE guarantees that all of the agents' local estimates converge almost surely to the value of the parameter. The resilience of the estimator -- i.e., the number of compromised measurement streams it can tolerate -- does not depend on the topology of the inter-agent communication network. Finally, we illustrate the performance of SAGE through numerical examples.
Submitted 30 May, 2019; v1 submitted 20 December, 2018;
originally announced December 2018.
-
Resilient Distributed Estimation: Sensor Attacks
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies multi-agent distributed estimation under sensor attacks. Individual agents make sensor measurements of an unknown parameter belonging to a compact set, and, at every time step, a fraction of the agents' sensor measurements may fall under attack and take arbitrary values. We present the Saturated Innovation Update ($\mathcal{SIU}$) algorithm for distributed estimation resilient to sensor attacks. Under the iterative $\mathcal{SIU}$ algorithm, if fewer than one half of the agent sensors fall under attack, then all of the agents' estimates converge at a polynomial rate (with respect to the number of iterations) to the true parameter. The resilience of $\mathcal{SIU}$ to sensor attacks does not depend on the topology of the inter-agent communication network, as long as it remains connected. We demonstrate the performance of $\mathcal{SIU}$ with numerical examples.
Submitted 24 June, 2018; v1 submitted 18 September, 2017;
originally announced September 2017.
-
Preorder Construct on Simple Undirected Graphs
Authors:
Augusto Almeida Santos,
José M. F. Moura,
João Xavier
Abstract:
We construct a novel preorder on the set of nodes of a simple undirected graph. We prove that the preorder (induced by the topology of the graph) is preserved, e.g., by the logistic dynamical system (both in discrete and continuous time). Moreover, the underlying equivalence relation of the preorder corresponds to the coarsest equitable partition (CEP). This will further imply that the logistic dynamical system on a graph preserves its coarsest equitable partition. As we show, the results provide a nontrivial invariant set for the logistic and similar dynamical systems. We note that our construct provides a functional characterization for the CEP as an alternative to the pure set theoretical iterated degree sequences characterization. The construct and results presented might have independent interest for analysis on graphs or qualitative analysis of dynamical systems over networks.
Submitted 9 March, 2017;
originally announced March 2017.
-
Thermodynamic Limit of Interacting Particle Systems over Time-varying Sparse Random Networks
Authors:
Augusto Almeida Santos,
Soummya Kar,
José M. F. Moura,
João Xavier
Abstract:
We establish a functional weak law of large numbers for observable macroscopic state variables of interacting particle systems (e.g., voter and contact processes) over fast time-varying sparse random networks of interactions. We show that, as the number of agents $N$ grows large, the proportion of agents $\left(\overline{Y}_{k}^{N}(t)\right)$ at a certain state $k$ converges in distribution -- or, more precisely, weakly with respect to the uniform topology on the space of \emph{càdlàg} sample paths -- to the solution of an ordinary differential equation over any compact interval $\left[0,T\right]$. Although the limiting process is Markov, the prelimit processes, i.e., the normalized macrostate vector processes $\left(\mathbf{\overline{Y}}^{N}(t)\right)=\left(\overline{Y}_{1}^{N}(t),\ldots,\overline{Y}_{K}^{N}(t)\right)$, are non-Markov as they are tied to the \emph{high-dimensional} microscopic state of the system, which precludes the direct application of standard arguments for establishing weak convergence. The techniques developed in the paper for establishing weak convergence might be of independent interest.
Submitted 26 February, 2017;
originally announced February 2017.
-
Picard group and quantization of toric orbifolds
Authors:
Thomas Baier,
José M. Mourão,
João P. Nunes
Abstract:
In the classical theory of toric manifolds polytopes appear in two guises -- as Newton polytopes of line bundles on the complex, and as moment polytopes on the symplectic side, the link between the two being established by the prequantizability condition on the cohomology class of the symplectic form.
Here we give a combinatorial description of the orbifold Picard group for complete toric orbifolds, with the aim of detailing the relation between complex and symplectic aspects in the orbifold setting. In particular, this allows us to illustrate the breakdown of the identification of (orbifold) line bundles by their Chern class (or moment polytope up to translations in $\mathfrak{t}^\ast$), and the non-constancy of $h^0$ on representatives of the same Chern class. As an application, we discuss symplectic reduction with respect to restrictions of the action to sub-tori, and the associated Bohr--Sommerfeld conditions in mixed polarizations.
Submitted 2 July, 2018; v1 submitted 8 February, 2017;
originally announced February 2017.
-
Spectral Statistics of Lattice Graph Structured, Non-uniform Percolations
Authors:
Stephen Kruzick,
José M. F. Moura
Abstract:
Design of filters for graph signal processing benefits from knowledge of the spectral decomposition of matrices that encode graphs, such as the adjacency matrix and the Laplacian matrix, used to define the shift operator. For shift matrices with real eigenvalues, which arise for symmetric graphs, the empirical spectral distribution captures the eigenvalue locations. Under realistic circumstances, stochastic influences often affect the network structure and, consequently, the shift matrix empirical spectral distribution. Nevertheless, deterministic functions may often be found to approximate the asymptotic behavior of empirical spectral distributions of random matrices. This paper uses stochastic canonical equation methods developed by Girko to derive such deterministic equivalent distributions for the empirical spectral distributions of random graphs formed by structured, non-uniform percolation of a D-dimensional lattice supergraph. Included simulations demonstrate the results for sample parameters.
Submitted 6 January, 2017;
originally announced January 2017.
-
Resilient Distributed Estimation Through Adversary Detection
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies resilient multi-agent distributed estimation of an unknown vector parameter when a subset of the agents is adversarial. We present and analyze a Flag Raising Distributed Estimator ($\mathcal{FRDE}$) that allows the agents under attack to perform accurate parameter estimation and detect the adversarial agents. The $\mathcal{FRDE}$ algorithm is a consensus+innovations estimator in which agents combine estimates of neighboring agents (consensus) with local sensing information (innovations). We establish that, under $\mathcal{FRDE}$, either the uncompromised agents' estimates are almost surely consistent or the uncompromised agents detect compromised agents if and only if the network of uncompromised agents is connected and globally observable. Numerical examples illustrate the performance of $\mathcal{FRDE}$.
Submitted 12 January, 2018; v1 submitted 3 January, 2017;
originally announced January 2017.
-
Spectral Statistics of Lattice Graph Percolation Models
Authors:
Stephen Kruzick,
Jose M. F. Moura
Abstract:
In graph signal processing, the graph adjacency matrix or the graph Laplacian commonly defines the shift operator. The spectral decomposition of the shift operator plays an important role in that the eigenvalues represent frequencies and the eigenvectors provide a spectral basis. This is useful, for example, in the design of filters. However, the graph or network may be uncertain due to stochastic influences in construction and maintenance, and, under such conditions, the eigenvalues of the shift matrix become random variables. This paper examines the spectral distribution of the eigenvalues of random networks formed by including each link of a D-dimensional lattice supergraph independently with identical probability, a percolation model. Using the stochastic canonical equation methods developed by Girko for symmetric matrices with independent upper triangular entries, a deterministic distribution is found that asymptotically approximates the empirical spectral distribution of the scaled adjacency matrix for a model with arbitrary parameters. The main results characterize the form of the solution to an important system of equations that leads to this deterministic distribution function and significantly reduce the number of equations that must be solved to find the solution for a given set of model parameters. Simulations comparing the expected empirical spectral distributions and the computed deterministic distributions are provided for sample parameters.
Submitted 26 September, 2016;
originally announced November 2016.
-
Optimal Attack Strategies Subject to Detection Constraints Against Cyber-Physical Systems
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies an attacker against a cyber-physical system (CPS) whose goal is to move the state of a CPS to a target state while ensuring that his or her probability of being detected does not exceed a given bound. The attacker's probability of being detected is related to the nonnegative bias induced by his or her attack on the CPS' detection statistic. We formulate a linear quadratic cost function that captures the attacker's control goal and establish constraints on the induced bias that reflect the attacker's detection-avoidance objectives. When the attacker is constrained to be detected at the false-alarm rate of the detector, we show that the optimal attack strategy reduces to a linear feedback of the attacker's state estimate. In the case that the attacker's bias is upper bounded by a positive constant, we provide two algorithms -- an optimal algorithm and a sub-optimal, less computationally intensive algorithm -- to find suitable attack sequences. Finally, we illustrate our attack strategies in numerical examples based on a remotely-controlled helicopter under attack.
Submitted 30 March, 2017; v1 submitted 11 October, 2016;
originally announced October 2016.
-
Cyber Physical Attacks with Control Objectives
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies attackers with control objectives against cyber-physical systems (CPS). The system is equipped with its own controller and attack detector, and the goal of the attacker is to move the system to a target state while altering the system's actuator input and sensor output to avoid detection. We formulate a cost function that reflects the attacker's goals, and, using dynamic programming, we show that the optimal attack strategy reduces to a linear feedback of the attacker's state estimate. By changing the parameters of the cost function, we show how an attacker can design optimal attacks to balance the control objective and the detection avoidance objective. Finally, we provide a numerical illustration based on a remotely-controlled helicopter under attack.
Submitted 20 July, 2016;
originally announced July 2016.
-
Consensus+Innovations Distributed Kalman Filter with Optimized Gains
Authors:
Subhro Das,
José M. F. Moura
Abstract:
In this paper, we address the distributed filtering and prediction of time-varying random fields represented by linear time-invariant (LTI) dynamical systems. The field is observed by a sparsely connected network of agents/sensors collaborating among themselves. We develop a Kalman-filter-type consensus+innovations distributed linear estimator of the dynamic field, termed the Consensus+Innovations Kalman Filter. We analyze the convergence properties of this distributed estimator. We prove that the mean-squared error of the estimator asymptotically converges if the degree of instability of the field dynamics is within a pre-specified threshold defined as the tracking capacity of the estimator. The tracking capacity is a function of the local observation models and the agent communication network. We design the optimal consensus and innovation gain matrices yielding distributed estimates with minimized mean-squared error. Through numerical evaluations, we show that the distributed estimator with optimal gains converges faster and with approximately 3 dB better mean-squared error performance than previous distributed estimators.
Submitted 13 October, 2016; v1 submitted 19 May, 2016;
originally announced May 2016.
-
Distributed Constrained Recursive Nonlinear Least-Squares Estimation: Algorithms and Asymptotics
Authors:
Anit Kumar Sahu,
Soummya Kar,
José M. F. Moura,
H. Vincent Poor
Abstract:
This paper focuses on the problem of recursive nonlinear least squares parameter estimation in multi-agent networks, in which the individual agents observe sequentially over time an independent and identically distributed (i.i.d.) time-series consisting of a nonlinear function of the true but unknown parameter corrupted by noise. A distributed recursive estimator of the \emph{consensus} + \emph{innovations} type, namely $\mathcal{CIWNLS}$, is proposed, in which the agents update their parameter estimates at each observation sampling epoch in a collaborative way by simultaneously processing the latest locally sensed information~(\emph{innovations}) and the parameter estimates from other agents~(\emph{consensus}) in the local neighborhood conforming to a pre-specified inter-agent communication topology. Under rather weak conditions on the connectivity of the inter-agent communication and a \emph{global observability} criterion, it is shown that at every network agent, the proposed algorithm leads to consistent parameter estimates. Furthermore, under standard smoothness assumptions on the local observation functions, the distributed estimator is shown to yield order-optimal convergence rates, i.e., as far as the order of pathwise convergence is concerned, the local parameter estimates at each agent are as good as the optimal centralized nonlinear least squares estimator which would require access to all the observations across all the agents at all times. In order to benchmark the performance of the proposed distributed $\mathcal{CIWNLS}$ estimator with that of the centralized nonlinear least squares estimator, the asymptotic normality of the estimate sequence is established and the asymptotic covariance of the distributed estimator is evaluated. Finally, simulation results are presented which illustrate and verify the analytical findings.
Submitted 19 October, 2016; v1 submitted 31 January, 2016;
originally announced February 2016.
-
Dynamic Attack Detection in Cyber-Physical Systems with Side Initial State Information
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies the impact of side initial state information on the detectability of data deception attacks against cyber-physical systems. We assume the attack detector has access to a linear function of the initial system state that cannot be altered by an attacker. First, we provide a necessary and sufficient condition for an attack to be undetectable by any dynamic attack detector under each specific side information pattern. Second, we characterize attacks that can be sustained for arbitrarily long periods without being detected. Third, we define the zero state inducing attack, the only type of attack that remains dynamically undetectable regardless of the side initial state information available to the attack detector. Finally, we design a dynamic attack detector that detects detectable attacks.
Submitted 16 June, 2016; v1 submitted 24 March, 2015;
originally announced March 2015.
-
Quantization in singular real polarizations: Kähler regularization, Maslov correction and pairings
Authors:
João N. Esteves,
José M. Mourão,
João P. Nunes
Abstract:
We study the Maslov correction to semiclassical states by using a Kähler regularized BKS pairing map from the energy representation to the Schrödinger representation. For general semiclassical states, the existence of this regularization is based on recently found families of Kähler polarizations degenerating to singular real polarizations and corresponding to special geodesic rays in the space of Kähler metrics. In the case of the one-dimensional harmonic oscillator, we show that the phases associated with caustic points of the projection of the Lagrangian curves to the configuration space are correctly reproduced.
Submitted 20 January, 2015; v1 submitted 31 December, 2014;
originally announced January 2015.
-
Complex symplectomorphisms and pseudo-Kähler islands in the quantization of toric manifolds
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
Let $P$ be a Delzant polytope. We show that the quantization of the corresponding toric manifold $X_{P}$ in toric Kähler polarizations and in the toric real polarization are related by analytic continuation of Hamiltonian flows evaluated at time $t = \sqrt{-1} s$. We relate the quantization of $X_{P}$ in two different toric Kähler polarizations by taking the time-$\sqrt{-1} s$ Hamiltonian "flow" of strongly convex functions on the moment polytope $P$. By taking $s$ to infinity, we obtain the quantization of $X_{P}$ in the (singular) real toric polarization.
Recall that $X_{P}$ has an open dense subset which is biholomorphic to $({\mathbb{C}}^{*})^{n}$. The quantization of $X_{P}$ in a toric Kähler polarization can also be described by applying the complexified Hamiltonian flow of the Abreu--Guillemin symplectic potential $g$, at time $t=\sqrt{-1}$, to an appropriate finite-dimensional subspace of quantum states in the quantization of $T^{*}{\mathbb{T}}^{n}$ in the vertical polarization. By taking other imaginary times, $t= k \sqrt{-1}, k\in {\mathbb{R}}$, we describe toric Kähler metrics with cone singularities along the toric divisors in $X_{P}$.
For convex Hamiltonian functions and sufficiently negative imaginary part of the complex time, we obtain degenerate Kähler structures which are negative definite in some regions of $X_{P}$. We show that the pointwise and $L^2$-norms of quantum states are asymptotically vanishing on negative-definite regions.
Submitted 11 November, 2014;
originally announced November 2014.
-
On complexified analytic Hamiltonian flows and geodesics on the space of Kähler metrics
Authors:
Jose M. Mourao,
Joao P. Nunes
Abstract:
In the case of a compact real analytic symplectic manifold M we describe an approach to the complexification of Hamiltonian flows [Se, Do1, Th1] and corresponding geodesics on the space of Kähler metrics. In this approach, motivated by recent work on quantization, the complexified Hamiltonian flows act, through the Gröbner theory of Lie series, on the sheaf of complex valued real analytic functions, changing the sheaves of holomorphic functions. This defines an action on the space of (equivalent) complex structures on M and also a direct action on M. This description is related to the approach of [BLU] where one has an action on a complexification M_C of M followed by projection to M. Our approach allows for the study of some Hamiltonian functions which are not real analytic. It also leads naturally to the consideration of continuous degenerations of diffeomorphisms and of Kähler structures of M. Hence, one can link continuously (geometric quantization) real, and more general non-Kähler, polarizations with Kähler polarizations. This corresponds to the extension of the geodesics to the boundary of the space of Kähler metrics. Three illustrative examples are considered. We find an explicit formula for the complex time evolution of the Kähler potential under the flow. For integral symplectic forms, this formula corresponds to the complexification of the prequantization of Hamiltonian symplectomorphisms. We verify that certain families of Kähler structures, which have been studied in geometric quantization, are geodesic families.
Submitted 6 January, 2015; v1 submitted 15 October, 2013;
originally announced October 2013.
-
Discrete Signal Processing on Graphs: Frequency Analysis
Authors:
Aliaksei Sandryhaila,
Jose M. F. Moura
Abstract:
Signals and datasets that arise in physical and engineering applications, as well as social, genetics, biomolecular, and many other domains, are becoming increasingly large and complex. In contrast to traditional time and image signals, data in these domains are supported by arbitrary graphs. Signal processing on graphs extends concepts and techniques from traditional signal processing to data indexed by generic graphs. This paper studies the concepts of low and high frequencies on graphs, and low-, high-, and band-pass graph filters. In traditional signal processing, these concepts are easily defined because of a natural frequency ordering that has a physical interpretation. For signals residing on graphs, in general, there is no obvious frequency ordering. We propose a definition of total variation for graph signals that naturally leads to a frequency ordering on graphs and defines low-, high-, and band-pass graph signals and filters. We study the design of graph filters with specified frequency response, and illustrate our approach with applications to sensor malfunction detection and data classification.
Submitted 18 November, 2013; v1 submitted 1 July, 2013;
originally announced July 2013.
-
Eigendecomposition of Block Tridiagonal Matrices
Authors:
Aliaksei Sandryhaila,
Jose M. F. Moura
Abstract:
Block tridiagonal matrices arise in applied mathematics, physics, and signal processing. Many applications require knowledge of eigenvalues and eigenvectors of block tridiagonal matrices, which can be prohibitively expensive for large matrix sizes. In this paper, we address the problem of the eigendecomposition of block tridiagonal matrices by studying a connection between their eigenvalues and zeros of appropriate matrix polynomials. We use this connection with matrix polynomials to derive a closed-form expression for the eigenvectors of block tridiagonal matrices, which eliminates the need for their direct calculation and can lead to a faster calculation of eigenvalues. We also demonstrate with an example that our work can lead to fast algorithms for the eigenvector expansion for block tridiagonal matrices.
Submitted 2 June, 2013;
originally announced June 2013.
-
Coherent state transforms and the Mackey-Stone-Von Neumann theorem
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
Mackey showed that for a compact Lie group $K$, the pair $(K,C^{0}(K))$ has a unique non-trivial irreducible covariant pair of representations. We study the relevance of this result to the unitary equivalence of quantizations for an infinite-dimensional family of $K\times K$ invariant polarizations on $T^{\ast}K$. The Kähler polarizations in the family are generated by (complex) time-$\tau$ Hamiltonian flows applied to the (Schrödinger) vertical real polarization. The unitary equivalence of the corresponding quantizations of $T^{\ast}K$ is then studied by considering covariant pairs of representations of $K$ defined by geometric prequantization and of representations of $C^0(K)$ defined via Heisenberg time-$(-\tau)$ evolution followed by time-$(+\tau)$ geometric-quantization-induced evolution. We show that in the semiclassical and large imaginary time limits, the unitary transform whose existence is guaranteed by Mackey's theorem can be approximated by composition of the time-$(+\tau)$ geometric-quantization-induced evolution with the time-$(-\tau)$ evolution associated with the momentum space [W. D. Kirwin and S. Wu, Momentum space for compact Lie groups and the Peter-Weyl theorem, to appear] quantization of the Hamiltonian function generating the flow. In the case of quadratic Hamiltonians, this asymptotic result is exact and unitary equivalence between quantizations is achieved by identifying the Heisenberg imaginary time evolution with heat operator evolution, in accordance with the coherent state transform of Hall.
Submitted 9 November, 2012;
originally announced November 2012.
-
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
Authors:
Soummya Kar,
José M. F. Moura,
H. Vincent Poor
Abstract:
The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. The paper investigates a distributed reinforcement learning setup with no prior information on the global state transition and local agent cost statistics. Specifically, with the agents' objective consisting of minimizing a network-averaged infinite horizon discounted cost, the paper proposes a distributed version of $Q$-learning, $\mathcal{QD}$-learning, in which the network agents collaborate by means of local processing and mutual information exchange over a sparse (possibly stochastic) communication network to achieve the network goal. Under the assumption that each agent is only aware of its local online cost data and the inter-agent communication network is \emph{weakly} connected, the proposed distributed scheme is shown to yield, almost surely (a.s.) and asymptotically, the desired value function and the optimal stationary control policy at each network agent. The analytical techniques developed in the paper to address the mixed time-scale stochastic dynamics of the \emph{consensus + innovations} form, which arise as a result of the proposed interactive distributed scheme, are of independent interest.
Submitted 24 October, 2012; v1 submitted 30 April, 2012;
originally announced May 2012.
-
Complex time evolution in geometric quantization and generalized coherent state transforms
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
For the cotangent bundle $T^{*}K$ of a compact Lie group $K$, we study the complex-time evolution of the vertical tangent bundle and the associated geometric quantization Hilbert space $L^{2}(K)$ under an infinite-dimensional family of Hamiltonian flows. For each such flow, we construct a generalized coherent state transform (CST), which is a unitary isomorphism between $L^{2}(K)$ and a certain weighted $L^{2}$-space of holomorphic functions. For a particular set of choices, we show that this isomorphism is naturally decomposed as a product of a Heisenberg-type evolution (for complex time $-\tau$) within $L^{2}(K)$, followed by a polarization-changing geometric quantization evolution (for complex time $+\tau$). In this case, our construction yields the usual generalized Segal--Bargmann transform of Hall. We show that the infinite-dimensional family of Hamiltonian flows can also be understood in terms of Thiemann's "complexifier" method (which generalizes the construction of adapted complex structures). We also investigate some properties of the generalized CSTs, and discuss how their existence can be understood in terms of Mackey's generalization of the Stone-von Neumann theorem.
Submitted 21 March, 2012;
originally announced March 2012.
-
Consensus and Products of Random Stochastic Matrices: Exact Rate for Convergence in Probability
Authors:
Dragana Bajovic,
Joao Xavier,
Jose M. F. Moura,
Bruno Sinopoli
Abstract:
Distributed consensus and other linear systems with system stochastic matrices $W_k$ emerge in various settings, like opinion formation in social networks, rendezvous of robots, and distributed inference in sensor networks. The matrices $W_k$ are often random, due to, e.g., random packet dropouts in wireless sensor networks. Key in analyzing the performance of such systems is studying convergence of matrix products $W_k W_{k-1} \cdots W_1$. In this paper, we find the exact exponential rate $I$ for the convergence in probability of the product of such matrices when time $k$ grows large, under the assumption that the $W_k$'s are symmetric and independent identically distributed in time. Further, for commonly used random models such as gossip and link failure, we show that the rate $I$ is found by solving a min-cut problem and, hence, is easily computable. Finally, we apply our results to optimally allocate the sensors' transmission power in consensus+innovations distributed detection.
Submitted 28 February, 2012;
originally announced February 2012.
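As an illustration of the gossip model mentioned in the abstract above, the following is a minimal sketch of pairwise gossip averaging, in which each step applies one random symmetric stochastic matrix to the state. The function name `gossip_consensus`, the five-node ring, and the initial values are assumptions made for this example only; the sketch does not compute the rate $I$ studied in the paper.

```python
import random


def gossip_consensus(values, edges, steps, seed=0):
    """Run pairwise gossip: at each step one random edge averages its endpoints.

    Each step multiplies the state by a symmetric stochastic matrix, so the
    network average is preserved while the spread of the values shrinks.
    """
    rng = random.Random(seed)
    x = list(values)
    for _ in range(steps):
        i, j = rng.choice(edges)           # link that is active at this step
        x[i] = x[j] = 0.5 * (x[i] + x[j])  # both endpoints take their mean
    return x


# Hypothetical example: a ring of five sensors.
ring_edges = [(k, (k + 1) % 5) for k in range(5)]
initial = [1.0, 4.0, 2.0, 8.0, 5.0]
final = gossip_consensus(initial, ring_edges, steps=2000)
print(final)  # every entry should be close to the initial average, 4.0
```

Because every step is an averaging of two entries, the sum of the state never changes, which is why the common limit is the initial average.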
-
Distributed Linear Parameter Estimation: Asymptotically Efficient Adaptive Strategies
Authors:
Soummya Kar,
José M. F. Moura,
H. Vincent Poor
Abstract:
The paper considers the problem of distributed adaptive linear parameter estimation in multi-agent inference networks. Local sensing model information is only partially available at the agents and inter-agent communication is assumed to be unpredictable. The paper develops a generic mixed time-scale stochastic procedure consisting of simultaneous distributed learning and estimation, in which the agents adaptively assess their relative observation quality over time and fuse the innovations accordingly. Under rather weak assumptions on the statistical model and the inter-agent communication, it is shown that, by properly tuning the consensus potential with respect to the innovation potential, the asymptotic information rate loss incurred in the learning process may be made negligible. As such, it is shown that the agent estimates are asymptotically efficient, in that their asymptotic covariance coincides with that of a centralized estimator (the inverse of the centralized Fisher information rate for Gaussian systems) with perfect global model information and having access to all observations at all times. The proof techniques are mainly based on convergence arguments for non-Markovian mixed time scale stochastic approximation procedures. Several approximation results developed in the process are of independent interest.
Submitted 6 August, 2012; v1 submitted 22 September, 2011;
originally announced September 2011.
-
Degeneration of Kaehler structures and half-form quantization of toric varieties
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
We study the half-form Kaehler quantization of a smooth symplectic toric manifold $(X,ω)$, such that $[ω/2π]-c_{1}(X)/2 \in H^{2}(X,{\mathbb{Z}})$ and is nonnegative. We define the half-form corrected quantization of $(X,ω)$ to be given by holomorphic sections of a certain hermitian line bundle $L\rightarrow X$ with Chern class $[ω/ 2π]-c_{1}(X)/2$. These sections then correspond to integral points of a "corrected" polytope $P_{L}$ with integral vertices. For a suitably translated moment polytope $P_{X}$ for $(X,ω)$, we have that $P_{L}\subset P_{X}$ is obtained from $P_{X}$ by a one-half inward-pointing normal shift along the boundary.
We use our results on the Kaehler quantization to motivate a definition of half-form corrected quantization in the singular real toric polarization. Using families of complex structures studied in [Baier-Florentino-Mourao-Nunes:arXiv/0806.0606], which include the degeneration of Kaehler polarizations to the vertical polarization, we show that, under this degeneration, the half-form corrected $L^{2}$-normalized monomial holomorphic sections converge to Dirac-delta-distributional sections supported on the fibers over the integral points of $P_{L}$, which correspond to corrected Bohr-Sommerfeld fibers. This result and the limit of the corrected connection, with curvature singularities along the boundary of $P_X$, justify the direct definition we give for the corrected quantization in the singular real toric polarization. We show that the space of quantum states for this definition coincides with the space obtained via degeneration of the Kaehler quantization.
We also show that the BKS pairing between Kaehler polarizations is not unitary in general. On the other hand, the unitary connection induced by this pairing is flat.
Submitted 10 December, 2012; v1 submitted 15 November, 2010;
originally announced November 2010.
-
Convergence Rate Analysis of Distributed Gossip (Linear Parameter) Estimation: Fundamental Limits and Tradeoffs
Authors:
Soummya Kar,
José M. F. Moura
Abstract:
The paper considers gossip distributed estimation of a (static) distributed random field (a.k.a., large scale unknown parameter vector) observed by sparsely interconnected sensors, each of which only observes a small fraction of the field. We consider linear distributed estimators whose structure combines the information \emph{flow} among sensors (the \emph{consensus} term resulting from the local gossiping exchange among sensors when they are able to communicate) and the information \emph{gathering} measured by the sensors (the \emph{sensing} or \emph{innovations} term). This leads to mixed time scale algorithms--one time scale associated with the consensus and the other with the innovations. The paper establishes a distributed observability condition (global observability plus mean connectedness) under which the distributed estimates are consistent and asymptotically normal. We introduce the distributed notion equivalent to the (centralized) Fisher information rate, which is a bound on the mean square error reduction rate of any distributed estimator; we show that under the appropriate modeling and structural network communication conditions (gossip protocol) the distributed gossip estimator attains this distributed Fisher information rate, asymptotically achieving the performance of the optimal centralized estimator. Finally, we study the behavior of the distributed gossip estimator when the measurements fade (noise variance grows) with time; in particular, we consider the maximum rate at which the noise variance can grow while the distributed estimator remains consistent, and show that, as long as the centralized estimator is consistent, the distributed estimator remains consistent.
Submitted 7 November, 2010;
originally announced November 2010.
-
Graphical Models as Block-Tree Graphs
Authors:
Divyanshu Vats,
Jose M. F. Moura
Abstract:
We introduce block-tree graphs as a framework for deriving efficient algorithms on graphical models. We define a block-tree graph as a tree-structured graph in which each node is a cluster of nodes and the clusters are disjoint. This differs from junction-trees, where two clusters connected by an edge always have at least one common node. When compared to junction-trees, we show that constructing block-tree graphs is faster, and finding optimal block-tree graphs has a much smaller search space. Applying our block-tree graph framework to graphical models, we show that, for some graphs, e.g., grid graphs, using block-tree graphs for inference is computationally more efficient than using junction-trees. For graphical models with boundary conditions, the block-tree graph framework transforms the boundary value problem into an initial value problem. For Gaussian graphical models, the block-tree graph framework leads to a linear state-space representation. Since exact inference in graphical models can be computationally intractable, we propose to use spanning block-trees to derive approximate inference algorithms. Experimental results show the improved performance in using spanning block-trees versus using spanning trees for approximate estimation over Gaussian graphical models.
Submitted 13 November, 2010; v1 submitted 4 July, 2010;
originally announced July 2010.
-
Gossip and Distributed Kalman Filtering: Weak Consensus under Weak Detectability
Authors:
Soummya Kar,
José M. F. Moura
Abstract:
The paper presents the gossip interactive Kalman filter (GIKF) for distributed Kalman filtering for networked systems and sensor networks, where inter-sensor communication and observations occur at the same time-scale. The communication among sensors is random; each sensor occasionally exchanges its filtering state information with a neighbor depending on the availability of the appropriate network link. We show that under a weak distributed detectability condition:
1. the GIKF error process remains stochastically bounded, irrespective of the instability properties of the random process dynamics; and
2. the network achieves \emph{weak consensus}, i.e., the conditional estimation error covariance at a (uniformly) randomly selected sensor converges in distribution to a unique invariant measure on the space of positive semi-definite matrices (independent of the initial state).
To prove these results, we interpret the filtered states (estimates and error covariances) at each node in the GIKF as stochastic particles with local interactions. We analyze the asymptotic properties of the error process by studying as a random dynamical system the associated switched (random) Riccati equation, the switching being dictated by a non-stationary Markov chain on the network graph.
Submitted 2 April, 2010;
originally announced April 2010.
-
A Random Dynamical Systems Approach to Filtering in Large-scale Networks
Authors:
S. Kar,
B. Sinopoli,
J. M. F. Moura
Abstract:
The paper studies the problem of filtering a discrete-time linear system observed by a network of sensors. The sensors share a common communication medium to the estimator and transmission is bit and power budgeted. Under the assumption of conditional Gaussianity of the signal process at the estimator (which may be ensured by observation packet acknowledgements), the conditional prediction error covariance of the optimum mean-squared error filter is shown to evolve according to a random dynamical system (RDS) on the space of non-negative definite matrices. Our RDS formalism does not depend on the particular medium access protocol (randomized) and, under a minimal distributed observability assumption, we show that the sequence of random conditional prediction error covariance matrices converges in distribution to a unique invariant distribution (independent of the initial filter state), i.e., the conditional error process is shown to be ergodic. Under broad assumptions on the medium access protocol, we show that the conditional error covariance sequence satisfies a Markov-Feller property, leading to an explicit characterization of the support of its invariant measure. The methodology adopted in this work is sufficiently general to envision its application to sample path analysis of more general hybrid or switched systems, where existing analysis is mostly moment-based.
Submitted 5 October, 2009;
originally announced October 2009.
-
Telescoping Recursive Representations and Estimation of Gauss-Markov Random Fields
Authors:
Divyanshu Vats,
Jose M. F. Moura
Abstract:
We present \emph{telescoping} recursive representations for both continuous and discrete indexed noncausal Gauss-Markov random fields. Our recursions start at the boundary (a hypersurface in $\R^d$, $d \ge 1$) and telescope inwards. For example, for images, the telescoping representation reduces recursions from $d = 2$ to $d = 1$, i.e., to recursions on a single dimension. Under appropriate conditions, the recursions for the random field are linear stochastic differential/difference equations driven by white noise, for which we derive recursive estimation algorithms that extend standard algorithms, like the Kalman-Bucy filter and the Rauch-Tung-Striebel smoother, to noncausal Markov random fields.
Submitted 19 December, 2010; v1 submitted 30 July, 2009;
originally announced July 2009.
-
Quantization of Abelian Varieties: distributional sections and the transition from Kähler to real polarizations
Authors:
Thomas Baier,
José M. Mourão,
João P. Nunes
Abstract:
We study the dependence of geometric quantization of the standard symplectic torus on the choice of invariant polarization. Real and mixed polarizations are interpreted as degenerate complex structures. Using a weak version of the equations of covariant constancy, and the Weil-Brezin expansion to describe distributional sections, we give a unified analytical description of the quantization spaces for all nonnegative polarizations.
The Blattner-Kostant-Sternberg (BKS) pairing maps between half-form corrected quantization spaces for different polarizations are shown to be transitive and related to an action of $Sp(2g,\R)$. Moreover, these maps are shown to be unitary.
Submitted 26 January, 2010; v1 submitted 30 July, 2009;
originally announced July 2009.
-
Higher Dimensional Consensus: Learning in Large-Scale Networks
Authors:
Usman A. Khan,
Soummya Kar,
Jose M. F. Moura
Abstract:
The paper presents higher dimension consensus (HDC) for large-scale networks. HDC generalizes the well-known average-consensus algorithm. It divides the nodes of the large-scale network into anchors and sensors. Anchors are nodes whose states are fixed over the HDC iterations, whereas sensors are nodes that update their states as a linear combination of the neighboring states. Under appropriate conditions, we show that the sensor states converge to a linear combination of the anchor states. Through the concept of anchors, HDC captures in a unified framework several interesting network tasks, including distributed sensor localization, leader-follower, distributed Jacobi to solve linear systems of algebraic equations, and, of course, average-consensus. In many network applications, it is of interest to learn the weights of the distributed linear algorithm so that the sensors converge to a desired state. We term this inverse problem the HDC learning problem. We pose learning in HDC as a constrained non-convex optimization problem, which we cast in the framework of multi-objective optimization (MOP) and to which we apply Pareto optimality. We prove analytically relevant properties of the MOP solutions and of the Pareto front from which we derive the solution to learning in HDC. Finally, the paper shows how the MOP approach resolves interesting tradeoffs (speed of convergence versus quality of the final state) arising in learning in HDC in resource constrained networks.
Submitted 12 April, 2009;
originally announced April 2009.
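The anchor/sensor update described in the abstract above can be sketched as follows. This is only an illustrative toy, assuming a four-node line network with two anchors and two sensors and equal weights of 0.5; the names `hdc_iterate`, `a0`, `a1`, `s1`, `s2` and all numerical values are hypothetical and not taken from the paper.

```python
def hdc_iterate(anchor_states, sensor_states, weights, num_iters):
    """Iterate the sensor updates while the anchor states stay fixed.

    weights[s] maps each neighbour of sensor s (anchor or sensor) to the
    weight used in the linear combination that defines the new state of s.
    """
    x = dict(sensor_states)
    for _ in range(num_iters):
        new_x = {}
        for sensor, neighbours in weights.items():
            total = 0.0
            for node, w in neighbours.items():
                # Anchors contribute their fixed state, sensors their current one.
                value = anchor_states[node] if node in anchor_states else x[node]
                total += w * value
            new_x[sensor] = total
        x = new_x  # synchronous update of all sensors
    return x


# Hypothetical line network: a0 - s1 - s2 - a1.
anchors = {"a0": 0.0, "a1": 1.0}
sensors = {"s1": 5.0, "s2": -3.0}
weights = {
    "s1": {"a0": 0.5, "s2": 0.5},
    "s2": {"s1": 0.5, "a1": 0.5},
}
final = hdc_iterate(anchors, sensors, weights, num_iters=200)
print(final)
```

In this toy network the fixed point solves $x_1 = 0.5\,u_0 + 0.5\,x_2$ and $x_2 = 0.5\,x_1 + 0.5\,u_1$, giving $x_1 = \tfrac{2}{3}u_0 + \tfrac{1}{3}u_1$ and $x_2 = \tfrac{1}{3}u_0 + \tfrac{2}{3}u_1$, so each sensor ends at a linear combination of the anchor states regardless of its initial value.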
-
Kalman Filtering with Intermittent Observations: Weak Convergence to a Stationary Distribution
Authors:
Soummya Kar,
Bruno Sinopoli,
Jose M. F. Moura
Abstract:
The paper studies the asymptotic behavior of Random Algebraic Riccati Equations (RARE) arising in Kalman filtering when the arrival of the observations is described by a Bernoulli i.i.d. process. We model the RARE as an order-preserving, strongly sublinear random dynamical system (RDS). Under a sufficient condition, stochastic boundedness, and using a limit-set dichotomy result for order-preserving, strongly sublinear RDS, we establish the asymptotic properties of the RARE: the sequence of random prediction error covariance matrices converges weakly to a unique invariant distribution, whose support exhibits fractal behavior. In particular, this weak convergence holds under broad conditions and even when the observations arrival rate is below the critical probability for mean stability. We apply the weak-Feller property of the Markov process governing the RARE to characterize the support of the limiting invariant distribution as the topological closure of a countable set of points, which, in general, is not dense in the set of positive semi-definite matrices. We use the explicit characterization of the support of the invariant distribution and the almost sure ergodicity of the sample paths to easily compute the moments of the invariant distribution. A one dimensional example illustrates that the support is a fractured subset of the non-negative reals with self-similarity properties.
Submitted 28 May, 2010; v1 submitted 16 March, 2009;
originally announced March 2009.
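To make the random Riccati recursion in the abstract above concrete, here is a minimal one-dimensional sketch. It assumes the standard scalar model $x_{k+1} = a x_k + w_k$, $y_k = c x_k + v_k$ with noise variances $q$ and $r$, and an observation that arrives independently with probability `gamma` at each step; the function names and default parameter values are assumptions for illustration, not the paper's notation.

```python
import random


def riccati_step(p, arrived, a=1.0, c=1.0, q=1.0, r=1.0):
    """One step of the scalar prediction error covariance recursion."""
    predicted = a * a * p + q
    if arrived:
        # The measurement update is applied only when the observation arrives.
        predicted -= (a * c * p) ** 2 / (c * c * p + r)
    return predicted


def simulate(p0, gamma, steps, seed=0, **model):
    """Sample one covariance path with Bernoulli(gamma) observation arrivals."""
    rng = random.Random(seed)
    p = p0
    path = [p]
    for _ in range(steps):
        p = riccati_step(p, rng.random() < gamma, **model)
        path.append(p)
    return path


# With gamma = 1.0 every observation arrives and the recursion is deterministic.
always = simulate(p0=1.0, gamma=1.0, steps=100)
# With gamma = 0.5 and an unstable system (a = 1.2) the path is random.
lossy = simulate(p0=1.0, gamma=0.5, steps=1000, a=1.2)
print(always[-1], max(lossy))
```

With the default parameters and `gamma = 1.0`, the recursion reduces to $P \mapsto (2P+1)/(P+1)$, whose positive fixed point is $(1+\sqrt{5})/2$; with lossy arrivals the path keeps fluctuating, which is the regime whose invariant distribution the paper studies.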
-
Toric Kähler metrics seen from infinity, quantization and compact tropical amoebas
Authors:
Thomas Baier,
Carlos Florentino,
José M. Mourão,
João P. Nunes
Abstract:
We consider the metric space of all toric Kähler metrics on a compact toric manifold; when "looking at it from infinity" (following Gromov), we obtain the tangent cone at infinity, which is parametrized by equivalence classes of complete geodesics. In the present paper, we study the associated limit for the family of metrics on the toric variety, its quantization, and degeneration of generic divisors.
The limits of the corresponding Kähler polarizations become degenerate along the Lagrangian fibration defined by the moment map. This allows us to interpolate continuously between geometric quantizations in the holomorphic and real polarizations and show that the monomial holomorphic sections of the prequantum bundle converge to Dirac delta distributions supported on Bohr-Sommerfeld fibers.
In the second part, we use these families of toric metric degenerations to study the limit of compact hypersurface amoebas and show that in Legendre transformed variables they are described by tropical amoebas. We believe that our approach gives a different, complementary, perspective on the relation between complex algebraic geometry and tropical geometry.
Submitted 16 December, 2011; v1 submitted 3 June, 2008;
originally announced June 2008.
-
Distributed Consensus Algorithms in Sensor Networks: Link Failures and Channel Noise
Authors:
Soummya Kar,
José M. F. Moura
Abstract:
The paper studies average consensus with random topologies (intermittent links) \emph{and} noisy channels. Consensus with noise in the network links leads to the bias-variance dilemma--running consensus for a long time reduces the bias of the final average estimate but increases its variance. We present two different compromises to this tradeoff: the $\mathcal{A-ND}$ algorithm, which modifies conventional consensus by forcing the weights to satisfy a \emph{persistence} condition (slowly decaying to zero); and the $\mathcal{A-NC}$ algorithm, in which the weights are constant but consensus is run for a fixed number of iterations $\hat{\imath}$, then restarted and rerun for a total of $\hat{p}$ runs, with the final states of the $\hat{p}$ runs averaged at the end (Monte Carlo averaging). We use controlled Markov processes and stochastic approximation arguments to prove almost sure convergence of $\mathcal{A-ND}$ to the desired average (asymptotic unbiasedness) and compute explicitly the m.s.e. (variance) of the consensus limit. We show that $\mathcal{A-ND}$ represents the best of both worlds--low bias and low variance--at the cost of a slow convergence rate; rescaling the weights...
Submitted 7 September, 2008; v1 submitted 25 November, 2007;
originally announced November 2007.