-
A ZeNN architecture to avoid the Gaussian trap
Authors:
Luís Carvalho,
João L. Costa,
José Mourão,
Gonçalo Oliveira
Abstract:
We propose a new simple architecture, Zeta Neural Networks (ZeNNs), in order to overcome several shortcomings of standard multi-layer perceptrons (MLPs). Namely, in the large width limit, MLPs are non-parametric, they do not have a well-defined pointwise limit, they lose non-Gaussian attributes and become unable to perform feature learning; moreover, finite width MLPs perform poorly in learning high frequencies. The new ZeNN architecture is inspired by three simple principles from harmonic analysis:
i) Enumerate the perceptrons and introduce a non-learnable weight to enforce convergence;
ii) Introduce a scaling (or frequency) factor;
iii) Choose activation functions that lead to near orthogonal systems.
We will show that these ideas allow us to fix the referred shortcomings of MLPs. In fact, in the infinite width limit, ZeNNs converge pointwise, they exhibit a rich asymptotic structure beyond Gaussianity, and perform feature learning. Moreover, when appropriate activation functions are chosen, (finite width) ZeNNs excel at learning high-frequency features of functions with low dimensional domains.
Submitted 26 May, 2025;
originally announced May 2025.
-
A new look at unitarity in quantization commutes with reduction for toric manifolds
Authors:
José M. Mourão,
João P. Nunes,
Augusto Pereira,
Dan Wang
Abstract:
For a symplectic toric manifold we consider half-form quantization in mixed polarizations $\mathcal{P}_\infty$, associated to the action of a subtorus $T^p\subset T^n$. The real directions in these polarizations are generated by components of the $T^p$ moment map.
Polarizations of this type can be obtained by starting at a toric Kähler polarization $\mathcal{P}_0$ and then following
Mabuchi rays of toric Kähler polarizations generated by the norm square of the moment map of the torus subgroup. These geodesic rays are lifted to the quantum bundle via a generalized coherent state transform (gCST) and define equivariant isomorphisms between Hilbert spaces for the Kähler polarizations and the Hilbert space for the mixed polarization.
The polarizations $\mathcal{P}_\infty$ give a new way of looking at the problem of unitarity in the quantization commutes with reduction with respect to the $T^p$-action, as follows. The prequantum operators for the components of the moment map of the $T^p$-action act diagonally with discrete spectrum corresponding to the integral points of the moment polytope. The Hilbert space for the quantization with respect to $\mathcal{P}_\infty$ then naturally decomposes as a direct sum of the Hilbert spaces for all its quantizable coisotropic reductions which, in fact, are the Kähler reductions of the initial Kähler polarization $\mathcal{P}_0$. This will be shown to imply that, for the polarization $\mathcal{P}_\infty$, quantization commutes unitarily with reduction. The problem of unitarity in quantization commutes with reduction for $\mathcal{P}_0$ is then equivalent to the question of whether quantization in the polarization $\mathcal{P}_0$ is unitarily equivalent with quantization in the polarization $\mathcal{P}_\infty$. In fact, this does not hold in general in the toric case.
Submitted 4 March, 2025; v1 submitted 22 December, 2024;
originally announced December 2024.
-
Solving Functional Optimization with Deep Networks and Variational Principles
Authors:
Kawisorn Kamtue,
Jose M. F. Moura,
Orathai Sangpetch
Abstract:
Can neural networks solve math problems using first principles alone? This paper shows how to leverage the fundamental theorem of the calculus of variations to design deep neural networks to solve functional optimization without requiring training data (e.g., ground-truth optimal solutions). Our approach is particularly crucial when the solution is a function defined over an unknown interval or support, such as in minimum-time control problems. By incorporating the necessary conditions satisfied by the optimal function solution, as derived from the calculus of variations, in the design of the deep architecture, CalVNet leverages overparameterized neural networks to learn these optimal functions directly. We validate CalVNet by showing that, without relying on ground-truth data and simply incorporating first principles, it successfully derives the Kalman filter for linear filtering, the bang-bang optimal control for minimum-time problems, and finds geodesics on manifolds. Our results demonstrate that CalVNet can be trained in an unsupervised manner, without relying on ground-truth data, establishing a promising framework for addressing general, potentially unsolved functional optimization problems that still lack analytical solutions.
Submitted 11 March, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
Mabuchi rays, test configurations and quantization for toric manifolds
Authors:
António Gouveia,
José M. Mourão,
João P. Nunes
Abstract:
We consider Mabuchi rays of toric Kähler structures on symplectic toric manifolds which are associated to toric test configurations and that are generated by convex functions on the moment polytope, $P$, whose second derivative has support given by a compact subset $K \subset P$. Associated to the test configuration there is a polyhedral decomposition of $P$ whose components are approximated by the components of $P \setminus K$. Along such Mabuchi rays, the toric complex structure remains unchanged on the inverse image under the moment map of $(P \setminus \check {K})$, where $\check {K}$ denotes the interior of $K$. At infinite geodesic time, the Kähler polarizations along the ray converge to interesting new toric mixed polarizations. The quantization in these limit polarizations is given by restrictions of the monomial holomorphic sections of the Kähler quantization, for monomials corresponding to integral points in $P \setminus \check {K}$, and by sections on the fibers of the moment map over the integral points contained in $\check {K}$, which, along the directions parallel to $K$, are holomorphic and which, along the directions transverse to $K$, are distributional. These quantizations correspond to quantizations of the central fiber of the test family, in the symplectic picture. We present the case of $S^2$ in detail and then generalize to higher dimensional symplectic toric manifolds. Metrically, at infinite Mabuchi geodesic time, the sphere decomposes into two discs and a collection of cylinders, separated by infinitely long lines. Correspondingly, the quantization in the limit polarization decomposes into a direct sum of the contributions from the quantizations of each of these components.
Submitted 8 July, 2024;
originally announced July 2024.
-
Fibering polarizations and Mabuchi rays on symmetric spaces of compact type
Authors:
Thomas Baier,
Ana Cristina Ferreira,
Joachim Hilgert,
José M. Mourão,
João P. Nunes
Abstract:
In this paper, we describe holomorphic quantizations of the cotangent bundle of a symmetric space of compact type $T^*(U/K)\cong U_\mathbb{C}/K_\mathbb{C}$, along Mabuchi rays of $U$-invariant Kähler structures. At infinite geodesic time, the Kähler polarizations converge to a mixed polarization $\mathcal{P}_\infty$. We show how a generalized coherent state transform relates the quantizations along the Mabuchi geodesics such that holomorphic sections converge, as geodesic time goes to infinity, to distributional $\mathcal{P}_\infty$-polarized sections. Unlike in the case of $T^*U$, the gCST mapping from the Hilbert space of vertically polarized sections is not asymptotically unitary due to the appearance of representation dependent factors associated to the isotypical decomposition for the $U$-action. In agreement with the general program outlined in [Bai+23], we also describe how the quantization in the limit polarization $\mathcal{P}_\infty$ is given by the direct sum of the quantizations for all the symplectic reductions relative to the invariant torus action associated to the Hamiltonian action of $U$.
Submitted 30 April, 2024;
originally announced April 2024.
-
The Positivity of the Neural Tangent Kernel
Authors:
Luís Carvalho,
João L. Costa,
José Mourão,
Gonçalo Oliveira
Abstract:
The Neural Tangent Kernel (NTK) has emerged as a fundamental concept in the study of wide Neural Networks. In particular, it is known that the positivity of the NTK is directly related to the memorization capacity of sufficiently wide networks, i.e., to the possibility of reaching zero loss in training, via gradient descent. Here we will improve on previous works and obtain a sharp result concerning the positivity of the NTK of feedforward networks of any depth. More precisely, we will show that, for any non-polynomial activation function, the NTK is strictly positive definite. Our results are based on a novel characterization of polynomial functions which is of independent interest.
Submitted 19 April, 2024;
originally announced April 2024.
-
Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
Directly parameterizing and learning gradients of functions has widespread significance, with specific applications in inverse problems, generative modeling, and optimal transport. This paper introduces gradient networks (GradNets): novel neural network architectures that parameterize gradients of various function classes. GradNets exhibit specialized architectural constraints that ensure correspondence to gradient functions. We provide a comprehensive GradNet design framework that includes methods for transforming GradNets into monotone gradient networks (mGradNets), which are guaranteed to represent gradients of convex functions. Our results establish that our proposed GradNet (and mGradNet) universally approximate the gradients of (convex) functions. Furthermore, these networks can be customized to correspond to specific spaces of potential functions, including transformed sums of (convex) ridge functions. Our analysis leads to two distinct GradNet architectures, GradNet-C and GradNet-M, and we describe the corresponding monotone versions, mGradNet-C and mGradNet-M. Our empirical results demonstrate that these architectures provide efficient parameterizations and outperform existing methods by up to 15 dB in gradient field tasks and by up to 11 dB in Hamiltonian dynamics learning tasks.
Submitted 24 January, 2025; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Wide neural networks: From non-gaussian random fields at initialization to the NTK geometry of training
Authors:
Luís Carvalho,
João Lopes Costa,
José Mourão,
Gonçalo Oliveira
Abstract:
Recent developments in applications of artificial neural networks with over $n=10^{14}$ parameters make it extremely important to study the large $n$ behaviour of such networks. Most works studying wide neural networks have focused on the infinite width $n \to +\infty$ limit of such networks and have shown that, at initialization, they correspond to Gaussian processes. In this work we will study their behavior for large, but finite $n$. Our main contributions are the following:
(1) The computation of the corrections to Gaussianity in terms of an asymptotic series in $n^{-\frac{1}{2}}$. The coefficients in this expansion are determined by the statistics of parameter initialization and by the activation function.
(2) Controlling the evolution of the outputs of finite width $n$ networks, during training, by computing deviations from the limiting infinite width case (in which the network evolves through a linear flow). This improves previous estimates and yields sharper decay rates for the (finite width) NTK in terms of $n$, valid during the entire training procedure. As a corollary, we also prove that, with arbitrarily high probability, the training of sufficiently wide neural networks converges to a global minimum of the corresponding quadratic loss function.
(3) Estimating how the deviations from Gaussianity evolve with training in terms of $n$. In particular, using a certain metric in the space of measures we find that, along training, the resulting measure is within $n^{-\frac{1}{2}}(\log n)^{1+}$ of the time dependent Gaussian process corresponding to the infinite width network (which is explicitly given by precomposing the initial Gaussian process with the linear flow corresponding to training in the infinite width limit).
Submitted 6 April, 2023;
originally announced April 2023.
-
Learning Gradients of Convex Functions with Monotone Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions. We show that, compared to state of the art methods, our networks are easier to train, learn monotone gradient fields more accurately, and use significantly fewer parameters. We further demonstrate their ability to learn optimal transport mappings to augment driving image data.
Submitted 17 March, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Quantization in fibering polarizations, Mabuchi rays and geometric Peter--Weyl theorem
Authors:
Thomas Baier,
Joachim Hilgert,
Oğuzhan Kaya,
José M. Mourão,
João P. Nunes
Abstract:
In this paper we use techniques of geometric quantization to give a geometric interpretation of the Peter--Weyl theorem. We present a novel approach to half-form corrected geometric quantization in a specific type of non-Kähler polarizations and study one important class of examples, namely cotangent bundles of compact semi-simple groups $K$. Our main results state that this canonically defined polarization occurs in the geodesic boundary of the space of $K\times K$-invariant Kähler polarizations equipped with Mabuchi's metric, and that its half-form corrected quantization is isomorphic to the Kähler case. An important role is played by invariance of the limit polarization under a torus action.
Unitary parallel transport on the bundle of quantum states along a specific Mabuchi geodesic, given by the coherent state transform of Hall, relates the non-commutative Fourier transform for $K$ with the Borel--Weil description of irreducible representations of $K$.
Submitted 25 January, 2023;
originally announced January 2023.
-
Networked Signal and Information Processing
Authors:
Stefan Vlaski,
Soummya Kar,
Ali H. Sayed,
José M. F. Moura
Abstract:
The article reviews significant advances in networked signal and information processing, which have enabled in the last 25 years extending decision making and inference, optimization, control, and learning to the increasingly ubiquitous environments of distributed agents. As these interacting agents cooperate, new collective behaviors emerge from local decisions and actions. Moreover, and significantly, theory and applications show that networked agents, through cooperation and sharing, are able to match the performance of cloud or federated solutions, while offering the potential for improved privacy, increasing resilience, and saving resources.
Submitted 18 April, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Finite-Time In-Network Computation of Linear Transforms
Authors:
Soummya Kar,
Markus Püschel,
José M. F. Moura
Abstract:
This paper focuses on finite-time in-network computation of linear transforms of distributed graph data. Finite-time transform computation problems are of interest in graph-based computing and signal processing applications in which the objective is to compute, by means of distributed iterative methods, various (linear) transforms of the data distributed at the agents or nodes of the graph. While finite-time computation of consensus-type or, more generally, rank-one transforms has been studied, systematic approaches toward scalable computing of general linear transforms, specifically in the case of heterogeneous agent objectives in which each agent is interested in obtaining a different linear combination of the network data, are relatively less explored. In this paper, by employing ideas from algebraic geometry, we develop a systematic characterization of linear transforms that are amenable to distributed in-network computation in finite time using linear iterations. Further, we consider the general case of directed inter-agent communication graphs. Specifically, it is shown that \emph{almost all} linear transformations of data distributed on the nodes of a digraph containing a Hamiltonian cycle may be computed using at most $N$ linear distributed iterations. Finally, by studying an associated matrix factorization based reformulation of the transform computation problem, we obtain, as a by-product, certain results and characterizations on sparsity-constrained matrix factorization that are of independent interest.
Submitted 3 April, 2021;
originally announced April 2021.
-
Safe Learning MPC with Limited Model Knowledge and Data
Authors:
Aaron Kandel,
Scott J. Moura
Abstract:
This paper presents an end-to-end framework for safe learning-based control (LbC) using nonlinear stochastic MPC and distributionally robust optimization (DRO). This work is motivated by several open challenges in LbC literature. In particular, many control-theoretic LbC methods require subject matter expertise in order to translate their own safety guarantees, often manifested as preexisting data of safe trajectories or structural model knowledge. In this paper, we focus on LbC where the controller is applied directly to a system of which it has no or extremely limited direct experience, towards safety during \textit{tabula-rasa} or ``\textit{blank slate}'' model-based learning and control as a challenging case for validation. This explores the boundary of the status-quo in control theory relating to requirements for subject matter expertise. We show that, under basic and limited assumptions on the underlying problem, we can translate probabilistic guarantees on feasibility to nonlinear systems using results in stochastic MPC and DRO literature whose relevance we formally extend in a mathematical analysis. We also present a coupled and intuitive formulation for persistence of excitation (PoE), and illustrate the connection between PoE and applicability of the proposed method. Our case studies of vehicle obstacle avoidance and safe extreme fast charging of lithium-ion batteries reveal powerful empirical results supporting the underlying DRO theory. Our method is widely applicable within the LbC domain to, for example, airborne wind energy systems, vehicle obstacle avoidance, and energy storage systems management. It is also applicable to quantifying uncertainty beyond the LbC case.
Submitted 21 August, 2023; v1 submitted 1 April, 2020;
originally announced April 2020.
-
Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura,
Aaron Jaech
Abstract:
The article discusses distributed gradient-descent algorithms for computing local and global minima in nonconvex optimization. For local optimization, we focus on distributed stochastic gradient descent (D-SGD)--a simple network-based variant of classical SGD. We discuss local minima convergence guarantees and explore the simple but critical role of the stable-manifold theorem in analyzing saddle-point avoidance. For global optimization, we discuss annealing-based methods in which slowly decaying noise is added to D-SGD. Conditions are discussed under which convergence to global minima is guaranteed. Numerical examples illustrate the key concepts in the paper.
Submitted 16 September, 2020; v1 submitted 23 March, 2020;
originally announced March 2020.
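The annealing idea described in this abstract (a consensus term, a local gradient step, and slowly decaying noise) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the helper names `ring_laplacian` and `annealed_dsgd`, the ring topology, scalar agent states, and the step-size and noise schedules.

```python
import numpy as np

def ring_laplacian(n):
    """Graph Laplacian of an undirected ring with n >= 3 nodes."""
    A = np.zeros((n, n))
    for i in range(n):
        A[i, (i + 1) % n] = 1.0
        A[i, (i - 1) % n] = 1.0
    return np.diag(A.sum(axis=1)) - A

def annealed_dsgd(grads, x0, L, steps=200, beta=0.2, noise_scale=0.0, seed=0):
    """Hypothetical consensus + gradient iteration with decaying noise.

    grads[i] is agent i's local gradient; x[i] is agent i's scalar state.
    The schedules below are illustrative choices, not the paper's.
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    for t in range(steps):
        alpha = 1.0 / (t + 1)                                    # decaying gradient step
        gamma = noise_scale / np.sqrt((t + 2) * np.log(t + 2))   # decaying noise level
        g = np.array([grads[i](x[i]) for i in range(len(x))])
        # (L @ x)[i] sums x[i] - x[j] over neighbors j: the consensus term.
        x = x - beta * (L @ x) - alpha * g + gamma * rng.standard_normal(x.shape)
    return x

# Usage: five agents with local objectives 0.5 * (x - c_i)^2, noise switched off.
centers = np.array([-2.0, 0.0, 1.0, 3.0, 8.0])
grads = [lambda x, c=c: x - c for c in centers]
x_final = annealed_dsgd(grads, np.zeros(5), ring_laplacian(5))
```

With the noise off and these convex quadratics, the network average settles at the mean of the `c_i` while the agents approach agreement only gradually as the gradient step decays. Setting `noise_scale > 0` adds the annealing perturbation; this sketch makes no claim about reproducing the convergence guarantees discussed in the article.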
-
Primal-dual methods for large-scale and distributed convex optimization and data analytics
Authors:
Dusan Jakovetic,
Dragana Bajovic,
Joao Xavier,
Jose M. F. Moura
Abstract:
The augmented Lagrangian method (ALM) is a classical optimization tool that solves a given "difficult" (constrained) problem via finding solutions of a sequence of "easier" (often unconstrained) sub-problems with respect to the original (primal) variable, wherein constraint satisfaction is controlled via the so-called dual variables. ALM is highly flexible with respect to how primal sub-problems can be solved, giving rise to a plethora of different primal-dual methods. The powerful ALM mechanism has recently proved to be very successful in various large scale and distributed applications. In addition, several significant advances have appeared, primarily on precise complexity results with respect to computational and communication costs in the presence of inexact updates and design and analysis of novel optimal methods for distributed consensus optimization. We provide a tutorial-style introduction to ALM and its variants for solving convex optimization problems in large scale and distributed settings. We describe control-theoretic tools for the algorithms' analysis and design, survey recent results, and provide novel insights in the context of two emerging applications: federated learning and distributed energy trading.
Submitted 14 April, 2020; v1 submitted 18 December, 2019;
originally announced December 2019.
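The primal/dual alternation described in this abstract can be made concrete on a toy problem. The following is a minimal sketch, assuming the equality-constrained quadratic program: minimize 0.5 * ||x - a||^2 subject to A x = b. The function name `alm_quadratic`, the penalty `rho`, and the iteration count are illustrative choices and are not taken from the paper.

```python
import numpy as np

def alm_quadratic(a, A, b, rho=1.0, iters=50):
    """Augmented Lagrangian iterations for min 0.5*||x - a||^2 s.t. A x = b.

    Each pass solves the unconstrained primal sub-problem exactly, then
    updates the dual variable with the constraint residual.
    """
    n = a.size
    lam = np.zeros(A.shape[0])            # dual variable, one entry per constraint
    H = np.eye(n) + rho * A.T @ A         # Hessian of the augmented Lagrangian in x
    for _ in range(iters):
        # Primal step: set the x-gradient of the augmented Lagrangian to zero.
        x = np.linalg.solve(H, a - A.T @ lam + rho * A.T @ b)
        # Dual step: ascent on the multiplier using the residual A x - b.
        lam = lam + rho * (A @ x - b)
    return x, lam

# Usage: project a = (3, 1, 2) onto the plane x1 + x2 + x3 = 3.
a = np.array([3.0, 1.0, 2.0])
A = np.array([[1.0, 1.0, 1.0]])
b = np.array([3.0])
x_star, lam_star = alm_quadratic(a, A, b)
```

For this instance the exact solution is x = (2, 0, 1) with multiplier 1, and the dual error shrinks by a factor of 1 / (1 + 3 * rho) per iteration. Here the primal sub-problem has a closed form; in the large-scale settings the abstract refers to, that step would typically be solved inexactly.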
-
Resilient Distributed Recovery of Large Fields
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies the resilient distributed recovery of large fields under measurement attacks, by a team of agents, where each measures a small subset of the components of a large spatially distributed field. An adversary corrupts some of the measurements. The agents collaborate to process their measurements, and each is interested in recovering only a fraction of the field. We present a field recovery consensus+innovations type distributed algorithm that is resilient to measurement attacks, where an agent maintains and updates a local state based on its neighbors' states and its own measurement. Under sufficient conditions on the attacker and the connectivity of the communication network, each agent's state, even those with compromised measurements, converges to the true value of the field components that it is interested in recovering. Finally, we illustrate the performance of our algorithm through numerical examples.
Submitted 19 October, 2019;
originally announced October 2019.
-
Hopfield Neural Network Flow: A Geometric Viewpoint
Authors:
Abhishek Halder,
Kenneth F. Caluya,
Bertrand Travacca,
Scott J. Moura
Abstract:
We provide gradient flow interpretations for the continuous-time continuous-state Hopfield neural network (HNN). The ordinary and stochastic differential equations associated with the HNN were introduced in the literature as analog optimizers, and were reported to exhibit good performance in numerical experiments. In this work, we point out that the deterministic HNN can be transcribed into Amari's natural gradient descent, and thereby uncover the explicit relation between the underlying Riemannian metric and the activation functions. By exploiting an equivalence between the natural gradient descent and the mirror descent, we show how the choice of activation function governs the geometry of the HNN dynamics.
For the stochastic HNN, we show that the so-called "diffusion machine", while not a gradient flow itself, induces a gradient flow when lifted in the space of probability measures. We characterize this infinite dimensional flow as the gradient descent of certain free energy with respect to a Wasserstein metric that depends on the geodesic distance on the ground manifold. Furthermore, we demonstrate how this gradient flow interpretation can be used for fast computation via recently developed proximal algorithms.
Submitted 13 November, 2019; v1 submitted 4 August, 2019;
originally announced August 2019.
-
Distributed Global Optimization by Annealing
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura
Abstract:
The paper considers a distributed algorithm for global minimization of a nonconvex function. The algorithm is a first-order consensus + innovations type algorithm that incorporates decaying additive Gaussian noise for annealing, converging to the set of global minima under certain technical assumptions. The paper presents simple methods for verifying that the required technical assumptions hold and illustrates them with a distributed target-localization application.
Submitted 20 July, 2019;
originally announced July 2019.
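As a rough illustration of the consensus + innovations update with decaying Gaussian noise described in this abstract, the following sketch runs a scalar version on three agents. Everything specific here is an assumption made for illustration and is not taken from the paper: the weight sequences `alpha`, `beta`, and `gamma`, the fixed path graph in `neighbors`, and the double-well local objectives produced by the hypothetical helper `make_grad`.

```python
import math
import random


def make_grad(tilt):
    """Gradient of the hypothetical local objective f(x) = (x^2 - 1)^2 + tilt * x."""
    return lambda x: 4.0 * x * (x * x - 1.0) + tilt


def annealed_consensus_innovations(grads, neighbors, x0, steps, seed=0):
    """Scalar consensus + innovations iteration with decaying Gaussian noise.

    grads[i] is agent i's local gradient and neighbors[i] lists its neighbors.
    The weight sequences are illustrative choices, not the paper's.
    """
    rng = random.Random(seed)
    x = list(x0)
    for t in range(1, steps + 1):
        alpha = 0.05 / t             # innovation (local gradient) weight, assumed
        beta = 0.3 / math.sqrt(t)    # consensus weight, assumed
        gamma = 0.1 / math.sqrt(t)   # std. dev. of the annealing noise, assumed
        updated = []
        for i, xi in enumerate(x):
            # Consensus term: disagreement with the current neighbor states.
            disagreement = sum(xi - x[j] for j in neighbors[i])
            # Annealing term: noise drawn independently per agent and step.
            noise = rng.gauss(0.0, 1.0)
            updated.append(
                xi - beta * disagreement - alpha * grads[i](xi) + gamma * noise
            )
        x = updated
    return x


# Three agents on a path graph 0 - 1 - 2 (an assumed topology).
grads = [make_grad(0.2), make_grad(0.0), make_grad(-0.1)]
neighbors = [[1], [0, 2], [1]]
estimates = annealed_consensus_innovations(
    grads, neighbors, [0.5, -0.5, 0.2], steps=200, seed=1
)
```

The sketch only shows the structure of the update (consensus term, first-order innovation term, decaying noise); it makes no claim about satisfying the technical assumptions under which the paper proves convergence.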
-
Partial coherent state transforms, $G \times T$-invariant Kähler structures and geometric quantization of cotangent bundles of compact Lie groups
Authors:
José M. Mourão,
João P. Nunes,
Miguel B. Pereira
Abstract:
In this paper, we study the analytic continuation to complex time of the Hamiltonian flow of certain $G\times T$-invariant functions on the cotangent bundle of a compact connected Lie group $G$ with maximal torus $T$. Namely, we will take the Hamiltonian flows of one $G\times G$-invariant function, $h$, and one $G\times T$-invariant function, $f$. Acting with these complex time Hamiltonian flows on $G\times G$-invariant Kähler structures gives new $G\times T$-invariant, but not $G\times G$-invariant, Kähler structures on $T^*G$. We study the Hilbert spaces ${\mathcal H}_{τ,σ}$ corresponding to the quantization of $T^*G$ with respect to these non-invariant Kähler structures. On the other hand, by taking the vertical Schrödinger polarization as a starting point, the above $G\times T$-invariant Hamiltonian flows also generate families of mixed polarizations $\mathcal{P}_{0,σ}, σ\in {\mathbb C}, {\rm Im}(σ) >0$. Each of these mixed polarizations is globally given by a direct sum of an integrable real distribution and of a complex distribution that defines a Kähler structure on the leaves of a foliation of $T^*G$. The geometric quantization of $T^*G$ with respect to these mixed polarizations gives rise to unitary partial coherent state transforms, corresponding to KSH maps as defined in [KMN1,KMN2].
Submitted 9 September, 2019; v1 submitted 11 July, 2019;
originally announced July 2019.
-
Resilient Distributed Field Estimation
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
We study resilient distributed field estimation under measurement attacks. A network of agents or devices measures a large, spatially distributed physical field parameter. An adversary arbitrarily manipulates the measurements of some of the agents. Each agent's goal is to process its measurements and information received from its neighbors to estimate only a few specific components of the field. We present $\mathbf{SAFE}$, the Saturating Adaptive Field Estimator, a consensus+innovations distributed field estimator that is resilient to measurement attacks. Under sufficient conditions on the compromised measurement streams, the physical coupling between the field and the agents' measurements, and the connectivity of the cyber communication network, $\mathbf{SAFE}$ guarantees that each agent's estimate converges almost surely to the true value of the components of the parameter in which the agent is interested. Finally, we illustrate the performance of $\mathbf{SAFE}$ through numerical examples.
Submitted 26 March, 2020; v1 submitted 18 April, 2019;
originally announced April 2019.
-
Annealing for Distributed Global Optimization
Authors:
Brian Swenson,
Soummya Kar,
H. Vincent Poor,
José M. F. Moura
Abstract:
The paper proves convergence to global optima for a class of distributed algorithms for nonconvex optimization in network-based multi-agent settings. Agents are permitted to communicate over a time-varying undirected graph. Each agent is assumed to possess a local objective function (assumed to be smooth, but possibly nonconvex). The paper considers algorithms for optimizing the sum function. A distributed algorithm of the consensus+innovations type is proposed which relies on first-order information at the agent level. Under appropriate conditions on network connectivity and the cost objective, convergence to the set of global optima is achieved by an annealing-type approach, with decaying Gaussian noise independently added into each agent's update step. It is shown that the proposed algorithm converges in probability to the set of global minima of the sum function.
Submitted 18 March, 2019;
originally announced March 2019.
-
Segal-Bargmann transforms from hyperbolic Hamiltonians
Authors:
William D. Kirwin,
José Mourão,
João P. Nunes,
Thomas Thiemann
Abstract:
We consider the imaginary time flow of a quadratic hyperbolic Hamiltonian on the symplectic plane, apply it to the Schrödinger polarization and study the corresponding evolution of polarized sections. The flow is periodic in imaginary time and the evolution of polarized sections has interesting features. On the time intervals for which the polarization is real or Kähler, the half--form corrected time evolution of polarized sections is given by unitary operators which turn out to be equivalent to the classical Segal-Bargmann transforms (which are usually associated to the quadratic elliptic Hamiltonian $H=\frac12 p^2$ and to the heat operator). At the right endpoint of these intervals, the evolution of polarized sections is given by the Fourier transform from the Schrödinger to the momentum representation. In the complementary intervals of imaginary time, the polarizations are anti--Kähler and the Hilbert space of polarized sections collapses to ${\mathcal H}= \{0\}$.
Hyperbolic quadratic Hamiltonians thus give rise to a new factorization of the Segal-Bargmann transform, which is very different from the usual one, where one first applies a bounded contraction operator (the heat kernel operator), mapping $L^2$--states to real analytic functions with unique analytic continuation, and then one applies analytic continuation. In the factorization induced by a hyperbolic complexifier, both factors are unbounded operators but their composition is, in the Kähler or real sectors, unitary.
In another paper [KMNT], we explore the application of the above family of unitary transforms to the definition of new holomorphic fractional Fourier transforms.
Submitted 23 February, 2019;
originally announced February 2019.
-
Resilient Distributed Parameter Estimation with Heterogeneous Data
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies resilient distributed estimation under measurement attacks. A set of agents each makes successive local, linear, noisy measurements of an unknown vector field collected in a vector parameter. The local measurement models are heterogeneous across agents and may be locally unobservable for the unknown parameter. An adversary compromises some of the measurement streams and changes their values arbitrarily. The agents' goal is to cooperate over a peer-to-peer communication network to process their (possibly compromised) local measurements and estimate the value of the unknown vector parameter. We present SAGE, the Saturating Adaptive Gain Estimator, a distributed, recursive, consensus+innovations estimator that is resilient to measurement attacks. We demonstrate that, as long as the number of compromised measurement streams is below a particular bound, SAGE guarantees that all of the agents' local estimates converge almost surely to the value of the parameter. The resilience of the estimator -- i.e., the number of compromised measurement streams it can tolerate -- does not depend on the topology of the inter-agent communication network. Finally, we illustrate the performance of SAGE through numerical examples.
Submitted 30 May, 2019; v1 submitted 20 December, 2018;
originally announced December 2018.
-
Joint Fleet Sizing and Charging System Planning for Autonomous Electric Vehicles
Authors:
Hongcai Zhang,
Colin J. R. Sheppard,
Timothy E. Lipman,
Scott J. Moura
Abstract:
This paper studies the joint fleet sizing and charging system planning problem for a company operating a fleet of autonomous electric vehicles (AEVs) for passenger and goods transportation. Most of the relevant published papers focus on intracity scenarios and adopt heuristic approaches, e.g., agent based simulation, which do not guarantee optimality. In contrast, we propose a mixed integer linear programming model for intercity scenarios. This model incorporates comprehensive considerations of 1) limited AEV driving range; 2) optimal AEV routing and relocating operations; 3) time-varying origin-destination transport demands; and 4) differentiated operation cost structure of passenger and goods transportation. The proposed model can be computationally expensive when the scale of the transportation network is large. We then exploit the structure of this program to expedite its solution. Numerical experiments are conducted to validate the proposed method. Our experimental results show that AEVs in passenger and goods transportation have remarkable planning and operation differences. We also demonstrate that intelligent routing and relocating operations, charging system and vehicle parameters, e.g., charging power, battery capacity, driving speed, etc., can significantly affect the economic efficiency and the planning results of an AEV fleet.
Submitted 1 November, 2018;
originally announced November 2018.
-
Resilient Distributed Estimation: Sensor Attacks
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies multi-agent distributed estimation under sensor attacks. Individual agents make sensor measurements of an unknown parameter belonging to a compact set, and, at every time step, a fraction of the agents' sensor measurements may fall under attack and take arbitrary values. We present the Saturated Innovation Update ($\mathcal{SIU}$) algorithm for distributed estimation resilient to sensor attacks. Under the iterative $\mathcal{SIU}$ algorithm, if fewer than one half of the agent sensors fall under attack, then all of the agents' estimates converge at a polynomial rate (with respect to the number of iterations) to the true parameter. The resilience of $\mathcal{SIU}$ to sensor attacks does not depend on the topology of the inter-agent communication network, as long as it remains connected. We demonstrate the performance of $\mathcal{SIU}$ with numerical examples.
Submitted 24 June, 2018; v1 submitted 18 September, 2017;
originally announced September 2017.
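The idea of a saturated innovation can be sketched as a norm clip: an agent's innovation (the gap between its measurement and its current estimate) is rescaled so that its magnitude never exceeds a threshold, which limits how far a single attacked measurement can move the estimate. The function name `saturate`, the Euclidean norm, and the fixed `threshold` argument are assumptions made for illustration; the abstract does not give the exact rule or the threshold schedule used by $\mathcal{SIU}$.

```python
import math


def saturate(innovation, threshold):
    """Rescale a vector so its Euclidean norm never exceeds `threshold`.

    Vectors already inside the threshold are returned unchanged (as a copy).
    """
    norm = math.sqrt(sum(v * v for v in innovation))
    if norm <= threshold:
        return list(innovation)
    scale = threshold / norm
    return [scale * v for v in innovation]


# A large (possibly attacked) innovation is clipped to the threshold ...
clipped = saturate([3.0, 4.0], 1.0)
# ... while a small innovation passes through untouched.
untouched = saturate([0.3, 0.4], 1.0)
```

Here `clipped` keeps the direction of `[3.0, 4.0]` but has norm 1, so an arbitrarily large attacked measurement contributes no more than the threshold to one update step.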
-
Data-driven Chance-constrained Regulation Capacity Offering for Distributed Energy Resources
Authors:
Hongcai Zhang,
Zechun Hu,
Eric Munsing,
Scott J. Moura,
Yonghua Song
Abstract:
This paper studies the behavior of a strategic aggregator offering regulation capacity on behalf of a group of distributed energy resources (DERs, e.g. plug-in electric vehicles) in a power market. Our objective is to maximize the aggregator's revenue while controlling the risk of penalties due to poor service delivery. To achieve this goal, we propose data-driven risk-averse strategies to effectively handle uncertainties in: 1) the DER parameters (e.g., load demands and flexibilities) and 2) sub-hourly regulation signals (to the accuracy of every few seconds). We design both the day-ahead and the hour-ahead strategies. In the day-ahead model, we develop a two-stage stochastic program to roughly model the above uncertainties, which achieves computational efficiency by leveraging novel aggregate models of both DER parameters and sub-hourly regulation signals. In the hour-ahead model, we formulate a data-driven distributionally robust chance-constrained program to explicitly model the aforementioned uncertainties. This program can effectively control the quality of regulation service based on the aggregator's risk aversion. Furthermore, it learns the distributions of the uncertain parameters from empirical data so that it outperforms existing techniques (e.g., robust optimization or traditional chance-constrained programming) in both modelling accuracy and cost of robustness. Finally, we derive a conic safe approximation for it which can be efficiently solved by commercial solvers. Numerical experiments are conducted to validate the proposed method.
Submitted 16 August, 2017;
originally announced August 2017.
-
Preorder Construct on Simple Undirected Graphs
Authors:
Augusto Almeida Santos,
José M. F. Moura,
João Xavier
Abstract:
We construct a novel preorder on the set of nodes of a simple undirected graph. We prove that the preorder (induced by the topology of the graph) is preserved, e.g., by the logistic dynamical system (both in discrete and continuous time). Moreover, the underlying equivalence relation of the preorder corresponds to the coarsest equitable partition (CEP). This will further imply that the logistic dynamical system on a graph preserves its coarsest equitable partition. The results provide a nontrivial invariant set for the logistic and similar dynamical systems, as we show. We note that our construct provides a functional characterization for the CEP as an alternative to the pure set theoretical iterated degree sequences characterization. The construct and results presented might have independent interest for analysis on graphs or qualitative analysis of dynamical systems over networks.
Submitted 9 March, 2017;
originally announced March 2017.
-
Thermodynamic Limit of Interacting Particle Systems over Time-varying Sparse Random Networks
Authors:
Augusto Almeida Santos,
Soummya Kar,
José M. F. Moura,
João Xavier
Abstract:
We establish a functional weak law of large numbers for observable macroscopic state variables of interacting particle systems (e.g., voter and contact processes) over fast time-varying sparse random networks of interactions. We show that, as the number of agents $N$ grows large, the proportion of agents $\left(\overline{Y}_{k}^{N}(t)\right)$ at a certain state $k$ converges in distribution -- or, more precisely, weakly with respect to the uniform topology on the space of \emph{càdlàg} sample paths -- to the solution of an ordinary differential equation over any compact interval $\left[0,T\right]$. Although the limiting process is Markov, the prelimit processes, i.e., the normalized macrostate vector processes $\left(\mathbf{\overline{Y}}^{N}(t)\right)=\left(\overline{Y}_{1}^{N}(t),\ldots,\overline{Y}_{K}^{N}(t)\right)$, are non-Markov as they are tied to the \emph{high-dimensional} microscopic state of the system, which precludes the direct application of standard arguments for establishing weak convergence. The techniques developed in the paper for establishing weak convergence might be of independent interest.
Submitted 26 February, 2017;
originally announced February 2017.
-
Joint Planning of PEV Fast-Charging Network and Distributed PV Generation Using the Accelerated Generalized Benders Decomposition
Authors:
Hongcai Zhang,
Scott J. Moura,
Zechun Hu,
Wei Qi,
Yonghua Song
Abstract:
Integration of plug-in electric vehicles (PEVs) with distributed renewable resources will decrease PEVs' well-to-wheels greenhouse gas emissions, promote renewable power adoption and defer power system investments. This paper proposes a multidisciplinary approach to jointly planning PEV fast-charging stations and distributed photovoltaic (PV) power plants on coupled transportation and power networks. First, we develop models of 1) PEV fast-charging stations; 2) highway transportation networks under PEV driving range constraints; 3) PV power plants with reactive power control. Then, we formulate a two-stage stochastic mixed integer second order cone program (MISOCP) to determine the sites and sizes of 1) PEV fast-charging stations; 2) PV power plants. To address the uncertainty of future scenarios, a significant number of future typical load, traffic flow and PV generation curves are adopted. This makes the problem large scale. We design a Generalized Benders Decomposition Algorithm to efficiently solve it. To the authors' knowledge, this work is the first that jointly plans both PEV fast-charging stations and PV plants with consideration for PEV driving range limits and reactive PV power control. We conduct numerical experiments to illustrate the effectiveness of the proposed method, and validate the benefits of the joint planning and adopting advanced PV reactive power control.
Submitted 14 March, 2017; v1 submitted 23 February, 2017;
originally announced February 2017.
-
Picard group and quantization of toric orbifolds
Authors:
Thomas Baier,
José M. Mourão,
João P. Nunes
Abstract:
In the classical theory of toric manifolds polytopes appear in two guises -- as Newton polytopes of line bundles on the complex, and as moment polytopes on the symplectic side, the link between the two being established by the prequantizability condition on the cohomology class of the symplectic form.
Here we give a combinatorial description of the orbifold Picard group for complete toric orbifolds, with the aim of detailing the relation between complex and symplectic aspects in the orbifold setting. In particular this allows us to illustrate the breakdown of identification of (orbifold) line bundles by their Chern class (or moment polytope up to translations in $\mathfrak{t}^\ast$), and non-constancy of $h^0$ on representatives of the same Chern class. As an application, we discuss symplectic reduction with respect to restrictions of the action to sub-tori, and the associated Bohr--Sommerfeld conditions in mixed polarizations.
Submitted 2 July, 2018; v1 submitted 8 February, 2017;
originally announced February 2017.
-
Spectral Statistics of Lattice Graph Structured, Non-uniform Percolations
Authors:
Stephen Kruzick,
José M. F. Moura
Abstract:
Design of filters for graph signal processing benefits from knowledge of the spectral decomposition of matrices that encode graphs, such as the adjacency matrix and the Laplacian matrix, used to define the shift operator. For shift matrices with real eigenvalues, which arise for symmetric graphs, the empirical spectral distribution captures the eigenvalue locations. Under realistic circumstances, stochastic influences often affect the network structure and, consequently, the shift matrix empirical spectral distribution. Nevertheless, deterministic functions may often be found to approximate the asymptotic behavior of empirical spectral distributions of random matrices. This paper uses stochastic canonical equation methods developed by Girko to derive such deterministic equivalent distributions for the empirical spectral distributions of random graphs formed by structured, non-uniform percolation of a D-dimensional lattice supergraph. Included simulations demonstrate the results for sample parameters.
Submitted 6 January, 2017;
originally announced January 2017.
-
A new approximation method for geodesics on the space of Kähler metrics using complexified symplectomorphisms and Gröbner Lie series
Authors:
José Mourão,
João P. Nunes,
Tomás Reis
Abstract:
It has been shown that the Cauchy problem for geodesics in the space of Kähler metrics with a fixed cohomology class on a compact complex manifold $M$ can be effectively reduced to the problem of finding the flow of a related Hamiltonian vector field $X_H$, followed by analytic continuation of the time to complex time.
This opens the possibility of expressing the geodesic $ω_t$ in terms of Gröbner Lie series of the form $\exp(\sqrt{-1} \, tX_H)(f)$, for local holomorphic functions $f$. The main goal of this paper is to use truncated Lie series as a new way of constructing approximate solutions to the geodesic equation. For the case of an elliptic curve and $H$ a certain Morse function squared, we approximate the relevant Lie series by their first twelve terms, calculated with the help of Mathematica. This leads to approximate geodesics which hit the boundary of the space of Kähler metrics in finite geodesic time. For quantum mechanical applications, one is also interested in the non-Kähler polarizations that one obtains by crossing the boundary of the space of Kähler structures. Properties of the approximate geodesics and their extensions are also studied using Mathematica.
Submitted 9 September, 2019; v1 submitted 6 January, 2017;
originally announced January 2017.
-
Resilient Distributed Estimation Through Adversary Detection
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies resilient multi-agent distributed estimation of an unknown vector parameter when a subset of the agents is adversarial. We present and analyze a Flag Raising Distributed Estimator ($\mathcal{FRDE}$) that allows the agents under attack to perform accurate parameter estimation and detect the adversarial agents. The $\mathcal{FRDE}$ algorithm is a consensus+innovations estimator in which agents combine estimates of neighboring agents (consensus) with local sensing information (innovations). We establish that, under $\mathcal{FRDE}$, either the uncompromised agents' estimates are almost surely consistent or the uncompromised agents detect compromised agents if and only if the network of uncompromised agents is connected and globally observable. Numerical examples illustrate the performance of $\mathcal{FRDE}$.
Submitted 12 January, 2018; v1 submitted 3 January, 2017;
originally announced January 2017.
-
Clifford Coherent State Transforms on Spheres
Authors:
Pei Dang,
José Mourão,
João P. Nunes,
Tao Qian
Abstract:
We introduce a one-parameter family of transforms, $U^t_{(m)}$, $t>0$, from the Hilbert space of Clifford algebra valued square integrable functions on the $m$--dimensional sphere, $L^2(S^{m},dσ_{m})\otimes \mathbb{C}_{m+1}$, to the Hilbert spaces, ${\mathcal M}L^2(\mathbb{R}^{m+1} \setminus \{0\},dμ_t)$, of monogenic functions on $\mathbb{R}^{m+1}\setminus \{0\}$ which are square integrable with respect to appropriate measures, $dμ_t$. We prove that these transforms are unitary isomorphisms of the Hilbert spaces and are extensions of the Segal-Bargmann coherent state transform, $U_{(1)} : L^2(S^{1},dσ_{1}) \longrightarrow {\mathcal H}L^2({\mathbb{C} \setminus \{0\}},dμ)$, to higher dimensional spheres in the context of Clifford analysis. In Clifford analysis it is natural to replace the analytic continuation from $S^m$ to $S^m_{\mathbb{C}}$ as in [Ha1, St, HM] by the Cauchy--Kowalewski extension from $S^m$ to $\mathbb{R}^{m+1}\setminus \{0\}$. One then obtains a unitary isomorphism from an $L^2$--Hilbert space to a Hilbert space of solutions of the Dirac equation, that is to a Hilbert space of monogenic functions.
Submitted 5 December, 2016;
originally announced December 2016.
-
Spectral Statistics of Lattice Graph Percolation Models
Authors:
Stephen Kruzick,
Jose M. F. Moura
Abstract:
In graph signal processing, the graph adjacency matrix or the graph Laplacian commonly define the shift operator. The spectral decomposition of the shift operator plays an important role in that the eigenvalues represent frequencies and the eigenvectors provide a spectral basis. This is useful, for example, in the design of filters. However, the graph or network may be uncertain due to stochastic influences in construction and maintenance, and, under such conditions, the eigenvalues of the shift matrix become random variables. This paper examines the spectral distribution of the eigenvalues of random networks formed by including each link of a D-dimensional lattice supergraph independently with identical probability, a percolation model. Using the stochastic canonical equation methods developed by Girko for symmetric matrices with independent upper triangular entries, a deterministic distribution is found that asymptotically approximates the empirical spectral distribution of the scaled adjacency matrix for a model with arbitrary parameters. The main results characterize the form of the solution to an important system of equations that leads to this deterministic distribution function and significantly reduce the number of equations that must be solved to find the solution for a given set of model parameters. Simulations comparing the expected empirical spectral distributions and the computed deterministic distributions are provided for sample parameters.
Submitted 26 September, 2016;
originally announced November 2016.
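The percolation model in this abstract (each link of a D-dimensional lattice supergraph kept independently with the same probability) can be sampled in a few lines. This is a minimal sketch under assumptions not stated in the abstract: the supergraph is taken to be a nearest-neighbor grid without wraparound, and the names `percolate_lattice`, `side`, `dim`, and `p` are hypothetical. The paper's lattice supergraph may include other link patterns.

```python
import itertools
import random


def percolate_lattice(side, dim, p, seed=0):
    """Sample a percolated nearest-neighbor lattice.

    Nodes are the points of {0, ..., side-1}^dim. Each lattice link is kept
    independently with probability p. Returns the list of surviving edges.
    """
    rng = random.Random(seed)
    edges = []
    for node in itertools.product(range(side), repeat=dim):
        for axis in range(dim):
            # Visit each link once, from its lower endpoint along this axis.
            if node[axis] + 1 < side:
                neighbor = node[:axis] + (node[axis] + 1,) + node[axis + 1:]
                if rng.random() < p:
                    edges.append((node, neighbor))
    return edges


full_grid = percolate_lattice(side=4, dim=2, p=1.0)    # every link survives
empty_grid = percolate_lattice(side=4, dim=2, p=0.0)   # no link survives
sample = percolate_lattice(side=4, dim=2, p=0.5, seed=3)
```

From such an edge list one could assemble the symmetric adjacency matrix and compute its eigenvalues numerically to obtain an empirical spectral distribution for comparison with a deterministic approximation.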
-
Optimal Attack Strategies Subject to Detection Constraints Against Cyber-Physical Systems
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies an attacker against a cyber-physical system (CPS) whose goal is to move the state of a CPS to a target state while ensuring that his or her probability of being detected does not exceed a given bound. The attacker's probability of being detected is related to the nonnegative bias induced by his or her attack on the CPS' detection statistic. We formulate a linear quadratic cost function that captures the attacker's control goal and establish constraints on the induced bias that reflect the attacker's detection-avoidance objectives. When the attacker is constrained to be detected at the false-alarm rate of the detector, we show that the optimal attack strategy reduces to a linear feedback of the attacker's state estimate. In the case that the attacker's bias is upper bounded by a positive constant, we provide two algorithms -- an optimal algorithm and a sub-optimal, less computationally intensive algorithm -- to find suitable attack sequences. Finally, we illustrate our attack strategies in numerical examples based on a remotely-controlled helicopter under attack.
Submitted 30 March, 2017; v1 submitted 11 October, 2016;
originally announced October 2016.
-
Coherent State Transforms and the Weyl Equation in Clifford Analysis
Authors:
José Mourão,
João P. Nunes,
Tao Qian
Abstract:
We study a transform, inspired by coherent state transforms, from the Hilbert space of Clifford algebra valued square integrable functions $L^2({\mathbb R}^m,dx)\otimes {\mathbb C}_{m}$ to a Hilbert space of solutions of the Weyl equation on ${\mathbb R}^{m+1}= {\mathbb R} \times {\mathbb R}^m$, namely to the Hilbert space ${\mathcal M}L^2({\mathbb R}^{m+1},dμ)$ of ${\mathbb C}_m$-valued monogenic functions on ${\mathbb R}^{m+1}$ which are $L^2$ with respect to an appropriate measure $dμ$. We prove that this transform is a unitary isomorphism of Hilbert spaces and that it is therefore an analog of the Segal-Bargmann transform for Clifford analysis. As a corollary we obtain an orthonormal basis of monogenic functions on ${\mathbb R}^{m+1}$. We also study the case when ${\mathbb R}^m$ is replaced by the $m$-torus ${\mathbb T}^m.$ Quantum mechanically, this extension establishes the unitary equivalence of the Schrödinger representation on $M$, for $M={\mathbb R}^m$ and $M={\mathbb T}^m$, with a representation on the Hilbert space ${\mathcal M}L^2({\mathbb R} \times M,dμ)$ of solutions of the Weyl equation on the space-time ${\mathbb R}\times M$.
Submitted 21 July, 2016;
originally announced July 2016.
-
Cyber Physical Attacks with Control Objectives
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies attackers with control objectives against cyber-physical systems (CPS). The system is equipped with its own controller and attack detector, and the goal of the attacker is to move the system to a target state while altering the system's actuator input and sensor output to avoid detection. We formulate a cost function that reflects the attacker's goals, and, using dynamic programming, we show that the optimal attack strategy reduces to a linear feedback of the attacker's state estimate. By changing the parameters of the cost function, we show how an attacker can design optimal attacks to balance the control objective and the detection avoidance objective. Finally, we provide a numerical illustration based on a remotely-controlled helicopter under attack.
Submitted 20 July, 2016;
originally announced July 2016.
-
Consensus+Innovations Distributed Kalman Filter with Optimized Gains
Authors:
Subhro Das,
José M. F. Moura
Abstract:
In this paper, we address the distributed filtering and prediction of time-varying random fields represented by linear time-invariant (LTI) dynamical systems. The field is observed by a sparsely connected network of agents/sensors collaborating among themselves. We develop a Kalman filter type consensus+innovations distributed linear estimator of the dynamic field, termed the Consensus+Innovations Kalman Filter. We analyze the convergence properties of this distributed estimator. We prove that the mean-squared error of the estimator asymptotically converges if the degree of instability of the field dynamics is within a pre-specified threshold defined as the tracking capacity of the estimator. The tracking capacity is a function of the local observation models and the agent communication network. We design the optimal consensus and innovation gain matrices yielding distributed estimates with minimized mean-squared error. Through numerical evaluations, we show that the distributed estimator with optimal gains converges faster and with approximately 3 dB better mean-squared error performance than previous distributed estimators.
Submitted 13 October, 2016; v1 submitted 19 May, 2016;
originally announced May 2016.
-
Distributed Constrained Recursive Nonlinear Least-Squares Estimation: Algorithms and Asymptotics
Authors:
Anit Kumar Sahu,
Soummya Kar,
José M. F. Moura,
H. Vincent Poor
Abstract:
This paper focuses on the problem of recursive nonlinear least squares parameter estimation in multi-agent networks, in which the individual agents observe sequentially over time an independent and identically distributed (i.i.d.) time-series consisting of a nonlinear function of the true but unknown parameter corrupted by noise. A distributed recursive estimator of the consensus + innovations type, namely $\mathcal{CIWNLS}$, is proposed, in which the agents update their parameter estimates at each observation sampling epoch in a collaborative way by simultaneously processing the latest locally sensed information (innovations) and the parameter estimates from other agents (consensus) in the local neighborhood conforming to a pre-specified inter-agent communication topology. Under rather weak conditions on the connectivity of the inter-agent communication and a global observability criterion, it is shown that at every network agent, the proposed algorithm leads to consistent parameter estimates. Furthermore, under standard smoothness assumptions on the local observation functions, the distributed estimator is shown to yield order-optimal convergence rates, i.e., as far as the order of pathwise convergence is concerned, the local parameter estimates at each agent are as good as the optimal centralized nonlinear least squares estimator which would require access to all the observations across all the agents at all times. In order to benchmark the performance of the proposed distributed $\mathcal{CIWNLS}$ estimator with that of the centralized nonlinear least squares estimator, the asymptotic normality of the estimate sequence is established and the asymptotic covariance of the distributed estimator is evaluated. Finally, simulation results are presented which illustrate and verify the analytical findings.
Submitted 19 October, 2016; v1 submitted 31 January, 2016;
originally announced February 2016.
-
Dynamic Attack Detection in Cyber-Physical Systems with Side Initial State Information
Authors:
Yuan Chen,
Soummya Kar,
José M. F. Moura
Abstract:
This paper studies the impact of side initial state information on the detectability of data deception attacks against cyber-physical systems. We assume the attack detector has access to a linear function of the initial system state that cannot be altered by an attacker. First, we provide a necessary and sufficient condition for an attack to be undetectable by any dynamic attack detector under each specific side information pattern. Second, we characterize attacks that can be sustained for arbitrarily long periods without being detected. Third, we define the zero state inducing attack, the only type of attack that remains dynamically undetectable regardless of the side initial state information available to the attack detector. Finally, we design a dynamic attack detector that detects detectable attacks.
Submitted 16 June, 2016; v1 submitted 24 March, 2015;
originally announced March 2015.
-
Quantization in singular real polarizations: Kähler regularization, Maslov correction and pairings
Authors:
João N. Esteves,
José M. Mourão,
João P. Nunes
Abstract:
We study the Maslov correction to semiclassical states by using a Kähler regularized BKS pairing map from the energy representation to the Schrödinger representation. For general semiclassical states, the existence of this regularization is based on recently found families of Kähler polarizations degenerating to singular real polarizations and corresponding to special geodesic rays in the space of Kähler metrics. In the case of the one-dimensional harmonic oscillator, we show that the phases associated with caustic points of the projection of the Lagrangian curves to the configuration space are correctly reproduced.
Submitted 20 January, 2015; v1 submitted 31 December, 2014;
originally announced January 2015.
-
Complex symplectomorphisms and pseudo-Kähler islands in the quantization of toric manifolds
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
Let $P$ be a Delzant polytope. We show that the quantization of the corresponding toric manifold $X_{P}$ in toric Kähler polarizations and in the toric real polarization are related by analytic continuation of Hamiltonian flows evaluated at time $t = \sqrt{-1} s$. We relate the quantization of $X_{P}$ in two different toric Kähler polarizations by taking the time-$\sqrt{-1} s$ Hamiltonian "flow" of strongly convex functions on the moment polytope $P$. By taking $s$ to infinity, we obtain the quantization of $X_{P}$ in the (singular) real toric polarization.
Recall that $X_{P}$ has an open dense subset which is biholomorphic to $({\mathbb{C}}^{*})^{n}$. The quantization of $X_{P}$ in a toric Kähler polarization can also be described by applying the complexified Hamiltonian flow of the Abreu--Guillemin symplectic potential $g$, at time $t=\sqrt{-1}$, to an appropriate finite-dimensional subspace of quantum states in the quantization of $T^{*}{\mathbb{T}}^{n}$ in the vertical polarization. By taking other imaginary times, $t= k \sqrt{-1}, k\in {\mathbb{R}}$, we describe toric Kähler metrics with cone singularities along the toric divisors in $X_{P}$.
For convex Hamiltonian functions and sufficiently negative imaginary part of the complex time, we obtain degenerate Kähler structures which are negative definite in some regions of $X_{P}$. We show that the pointwise and $L^2$-norms of quantum states are asymptotically vanishing on negative-definite regions.
Submitted 11 November, 2014;
originally announced November 2014.
-
On complexified analytic Hamiltonian flows and geodesics on the space of Kähler metrics
Authors:
Jose M. Mourao,
Joao P. Nunes
Abstract:
In the case of a compact real analytic symplectic manifold M we describe an approach to the complexification of Hamiltonian flows [Se, Do1, Th1] and corresponding geodesics on the space of Kähler metrics. In this approach, motivated by recent work on quantization, the complexified Hamiltonian flows act, through the Gröbner theory of Lie series, on the sheaf of complex valued real analytic functions, changing the sheaves of holomorphic functions. This defines an action on the space of (equivalent) complex structures on M and also a direct action on M. This description is related to the approach of [BLU] where one has an action on a complexification M_C of M followed by projection to M. Our approach allows for the study of some Hamiltonian functions which are not real analytic. It also leads naturally to the consideration of continuous degenerations of diffeomorphisms and of Kähler structures of M. Hence, one can link continuously (geometric quantization) real, and more general non-Kähler, polarizations with Kähler polarizations. This corresponds to the extension of the geodesics to the boundary of the space of Kähler metrics. Three illustrative examples are considered. We find an explicit formula for the complex time evolution of the Kähler potential under the flow. For integral symplectic forms, this formula corresponds to the complexification of the prequantization of Hamiltonian symplectomorphisms. We verify that certain families of Kähler structures, which have been studied in geometric quantization, are geodesic families.
Submitted 6 January, 2015; v1 submitted 15 October, 2013;
originally announced October 2013.
-
Discrete Signal Processing on Graphs: Frequency Analysis
Authors:
Aliaksei Sandryhaila,
Jose M. F. Moura
Abstract:
Signals and datasets that arise in physical and engineering applications, as well as social, genetics, biomolecular, and many other domains, are becoming increasingly large and more complex. In contrast to traditional time and image signals, data in these domains are supported by arbitrary graphs. Signal processing on graphs extends concepts and techniques from traditional signal processing to data indexed by generic graphs. This paper studies the concepts of low and high frequencies on graphs, and low-, high-, and band-pass graph filters. In traditional signal processing, these concepts are easily defined because of a natural frequency ordering that has a physical interpretation. For signals residing on graphs, in general, there is no obvious frequency ordering. We propose a definition of total variation for graph signals that naturally leads to a frequency ordering on graphs and defines low-, high-, and band-pass graph signals and filters. We study the design of graph filters with specified frequency response, and illustrate our approach with applications to sensor malfunction detection and data classification.
Submitted 18 November, 2013; v1 submitted 1 July, 2013;
originally announced July 2013.
-
Eigendecomposition of Block Tridiagonal Matrices
Authors:
Aliaksei Sandryhaila,
Jose M. F. Moura
Abstract:
Block tridiagonal matrices arise in applied mathematics, physics, and signal processing. Many applications require knowledge of eigenvalues and eigenvectors of block tridiagonal matrices, which can be prohibitively expensive for large matrix sizes. In this paper, we address the problem of the eigendecomposition of block tridiagonal matrices by studying a connection between their eigenvalues and zeros of appropriate matrix polynomials. We use this connection with matrix polynomials to derive a closed-form expression for the eigenvectors of block tridiagonal matrices, which eliminates the need for their direct calculation and can lead to a faster calculation of eigenvalues. We also demonstrate with an example that our work can lead to fast algorithms for the eigenvector expansion for block tridiagonal matrices.
Submitted 2 June, 2013;
originally announced June 2013.
-
Asymptotically Efficient Distributed Estimation With Exponential Family Statistics
Authors:
Soummya Kar,
Jose Moura
Abstract:
The paper studies the problem of distributed parameter estimation in multi-agent networks with exponential family observation statistics. A certainty-equivalence type distributed estimator of the consensus + innovations form is proposed in which, at each observation sampling epoch, agents update their local parameter estimates by appropriately combining the data received from their neighbors and the locally sensed new information (innovation). Under global observability of the networked sensing model, i.e., the ability to distinguish between different instances of the parameter value based on the joint observation statistics, and mean connectivity of the inter-agent communication network, the proposed estimator is shown to yield consistent parameter estimates at each network agent. Further, it is shown that the distributed estimator is asymptotically efficient, in that the asymptotic covariances of the agent estimates coincide with that of the optimal centralized estimator, i.e., the inverse of the centralized Fisher information rate. From a technical viewpoint, the proposed distributed estimator leads to non-Markovian mixed timescale stochastic recursions and the analytical methods developed in the paper contribute to the general theory of distributed stochastic approximation.
Submitted 1 February, 2014; v1 submitted 21 January, 2013;
originally announced January 2013.
-
Coherent state transforms and the Mackey-Stone-Von Neumann theorem
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
Mackey showed that for a compact Lie group $K$, the pair $(K,C^{0}(K))$ has a unique non-trivial irreducible covariant pair of representations. We study the relevance of this result to the unitary equivalence of quantizations for an infinite-dimensional family of $K\times K$ invariant polarizations on $T^{\ast}K$. The Kähler polarizations in the family are generated by (complex) time-$τ$ Hamiltonian flows applied to the (Schrödinger) vertical real polarization. The unitary equivalence of the corresponding quantizations of $T^{\ast}K$ is then studied by considering covariant pairs of representations of $K$ defined by geometric prequantization and of representations of $C^0(K)$ defined via Heisenberg time-$(-τ)$ evolution followed by time-$(+τ)$ geometric-quantization-induced evolution. We show that in the semiclassical and large imaginary time limits, the unitary transform whose existence is guaranteed by Mackey's theorem can be approximated by composition of the time-$(+τ)$ geometric-quantization-induced evolution with the time-$(-τ)$ evolution associated with the momentum space [W. D. Kirwin and S. Wu, Momentum space for compact Lie groups and the Peter-Weyl theorem, to appear] quantization of the Hamiltonian function generating the flow. In the case of quadratic Hamiltonians, this asymptotic result is exact and unitary equivalence between quantizations is achieved by identifying the Heisenberg imaginary time evolution with heat operator evolution, in accordance with the coherent state transform of Hall.
Submitted 9 November, 2012;
originally announced November 2012.
-
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
Authors:
Soummya Kar,
José M. F. Moura,
H. Vincent Poor
Abstract:
The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. The paper investigates a distributed reinforcement learning setup with no prior information on the global state transition and local agent cost statistics. Specifically, with the agents' objective consisting of minimizing a network-averaged infinite horizon discounted cost, the paper proposes a distributed version of $Q$-learning, $\mathcal{QD}$-learning, in which the network agents collaborate by means of local processing and mutual information exchange over a sparse (possibly stochastic) communication network to achieve the network goal. Under the assumption that each agent is only aware of its local online cost data and the inter-agent communication network is weakly connected, the proposed distributed scheme is shown to yield asymptotically, almost surely (a.s.), the desired value function and the optimal stationary control policy at each network agent. The analytical techniques developed in the paper to address the mixed time-scale stochastic dynamics of the consensus + innovations form, which arise as a result of the proposed interactive distributed scheme, are of independent interest.
Submitted 24 October, 2012; v1 submitted 30 April, 2012;
originally announced May 2012.
-
Complex time evolution in geometric quantization and generalized coherent state transforms
Authors:
William D. Kirwin,
José M. Mourão,
João P. Nunes
Abstract:
For the cotangent bundle $T^{*}K$ of a compact Lie group $K$, we study the complex-time evolution of the vertical tangent bundle and the associated geometric quantization Hilbert space $L^{2}(K)$ under an infinite-dimensional family of Hamiltonian flows. For each such flow, we construct a generalized coherent state transform (CST), which is a unitary isomorphism between $L^{2}(K)$ and a certain weighted $L^{2}$-space of holomorphic functions. For a particular set of choices, we show that this isomorphism is naturally decomposed as a product of a Heisenberg-type evolution (for complex time $-τ$) within $L^{2}(K)$, followed by a polarization--changing geometric quantization evolution (for complex time $+τ$). In this case, our construction yields the usual generalized Segal--Bargmann transform of Hall. We show that the infinite-dimensional family of Hamiltonian flows can also be understood in terms of Thiemann's "complexifier" method (which generalizes the construction of adapted complex structures). We will also investigate some properties of the generalized CSTs, and discuss how their existence can be understood in terms of Mackey's generalization of the Stone-von Neumann theorem.
Submitted 21 March, 2012;
originally announced March 2012.