-
Neural Operators for Forward and Inverse Potential-Density Mappings in Classical Density Functional Theory
Authors:
Runtong Pan,
Xinyi Fang,
Kamyar Azizzadenesheli,
Miguel Liu-Schiaffini,
Mengyang Gu,
Jianzhong Wu
Abstract:
Neural operators are capable of capturing nonlinear mappings between infinite-dimensional functional spaces, offering a data-driven approach to modeling complex functional relationships in classical density functional theory (cDFT). In this work, we evaluate the performance of several neural operator architectures in learning the functional relationships between the one-body density profile $ρ(x)$, the one-body direct correlation function $c_1(x)$, and the external potential $V_{ext}(x)$ of inhomogeneous one-dimensional (1D) hard-rod fluids, using training data generated from analytical solutions of the underlying statistical-mechanical model. We compared their performance in terms of the Mean Squared Error (MSE) loss in establishing the functional relationships as well as in predicting the excess free energy across two test sets: (1) a group test set generated via random cross-validation (CV) to assess interpolation capability, and (2) a newly constructed dataset for leave-one-group CV to evaluate extrapolation performance. Our results show that FNO achieves the most accurate predictions of the excess free energy, with the squared ReLU activation function outperforming other activation choices. Among the DeepONet variants, the Residual Multiscale Convolutional Neural Network (RMSCNN) combined with a trainable Gaussian derivative kernel (GK-RMSCNN-DeepONet) demonstrates the best performance. Additionally, we applied the trained models to solve for the density profiles at various external potentials and compared the results with those obtained from the direct mapping $V_{ext} \mapsto ρ$ with neural operators, as well as with Gaussian Process Regression (GPR) combined with Active Learning by Error Control (ALEC), which has shown strong performance in previous studies.
Submitted 6 June, 2025;
originally announced June 2025.
-
On triple product $L$-functions and the fiber bundle method
Authors:
Jayce R. Getz,
Miao Pam Gu,
Chun-Hsien Hsu,
Spencer Leslie
Abstract:
We introduce multi-variable zeta integrals which unfold to Euler products representing the triple product $L$-function times a product of $L$-functions with known analytic properties. We then formulate a generalization of the Poisson summation conjecture and show how it implies the analytic properties of triple product $L$-functions. Finally, we propose a strategy, the fiber bundle method, to reduce this generalized conjecture to a simpler case of the Poisson summation conjecture along with certain local compatibility statements.
Submitted 1 May, 2025; v1 submitted 27 March, 2025;
originally announced March 2025.
-
Fast data inversion for high-dimensional dynamical systems from noisy measurements
Authors:
Yizi Lin,
Xubo Liu,
Paul Segall,
Mengyang Gu
Abstract:
In this work, we develop a scalable approach for a flexible latent factor model for high-dimensional dynamical systems. Each latent factor process has its own correlation and variance parameters, and the orthogonal factor loading matrix can be either fixed or estimated. We utilize an orthogonal factor loading matrix that avoids computing the inversion of the posterior covariance matrix at each time of the Kalman filter, and derive closed-form expressions in an expectation-maximization algorithm for parameter estimation, which substantially reduces the computational complexity without approximation. Our study is motivated by inversely estimating slow slip events from geodetic data, such as continuous GPS measurements. Extensive simulated studies illustrate higher accuracy and scalability of our approach compared to alternatives. By applying our method to geodetic measurements in the Cascadia region, our estimated slip better agrees with independently measured seismic data of tremor events. The substantial acceleration from our method enables the use of massive noisy data for geological hazard quantification and other applications.
Submitted 18 June, 2025; v1 submitted 2 January, 2025;
originally announced January 2025.
-
Mixing, Enhanced Dissipation and Phase Transition in the Kinetic Vicsek Model
Authors:
Mengyang Gu,
Siming He
Abstract:
In this paper, we study the kinetic Vicsek model, which serves as a starting point for describing the polarization phenomena observed in the experiments of fibroblasts moving on liquid crystalline substrates. The long-time behavior of the kinetic equation is analyzed, revealing that, within specific parameter regimes, the mixing and enhanced dissipation phenomena stabilize the dynamics and ensure effective information communication among agents. Consequently, the solution exhibits features similar to those of a spatially-homogeneous system. As a result, we confirm the phase transition observed in the agent-based Vicsek model at the kinetic level.
Submitted 17 April, 2025; v1 submitted 29 September, 2024;
originally announced September 2024.
-
Euler characteristics of the generalized Kloosterman sheaves for symplectic and orthogonal groups
Authors:
Yu Fu,
Miao Pam Gu
Abstract:
We study the monodromy of certain $\ell$-adic local systems attached to the generalized Kloosterman sheaves constructed by Yun and calculate their Euler characteristics under standard representations in the cases of symplectic and split/quasi-split orthogonal groups. This provides evidence for the conjectural description of their Swan conductors at $\infty$ which is predicted by Reeder-Yu on the Langlands parameters attached to the epipelagic representations.
Submitted 29 July, 2024;
originally announced July 2024.
-
Learning from landmarks, curves, surfaces, and shapes in Geomstats
Authors:
Luís F. Pereira,
Alice Le Brigant,
Adele Myers,
Emmanuel Hartman,
Amil Khan,
Malik Tuerkoen,
Trey Dold,
Mengyang Gu,
Pablo Suárez-Serrato,
Nina Miolane
Abstract:
We introduce the shape module of the Python package Geomstats to analyze shapes of objects represented as landmarks, curves and surfaces across fields of natural sciences and engineering. The shape module first implements widely used shape spaces, such as the Kendall shape space, as well as elastic spaces of discrete curves and surfaces. The shape module further implements the abstract mathematical structures of group actions, fiber bundles, quotient spaces and associated Riemannian metrics which allow users to build their own shape spaces. The Riemannian geometry tools enable users to compare, average, interpolate between shapes inside a given shape space. These essential operations can then be leveraged to perform statistics and machine learning on shape data. We present the object-oriented implementation of the shape module along with illustrative examples and show how it can be used to perform statistics and machine learning on shape spaces.
Submitted 14 June, 2024;
originally announced June 2024.
-
Paired 2-disjoint path covers of burnt pancake graphs with faulty elements
Authors:
Tomáš Dvořák,
Mei-Mei Gu
Abstract:
The burnt pancake graph $BP_n$ is the Cayley graph of the hyperoctahedral group using prefix reversals as generators. Let $\{u,v\}$ and $\{x,y\}$ be any two pairs of distinct vertices of $BP_n$ for $n\geq 4$. We show that there are $u-v$ and $x-y$ paths whose vertices partition the vertex set of $BP_n$ even if $BP_n$ has up to $n-4$ faulty elements. On the other hand, for every $n\ge3$ there is a set of $n-2$ faulty edges or faulty vertices for which such a fault-free disjoint path cover does not exist.
Submitted 28 October, 2023;
originally announced October 2023.
-
On the maximum second eigenvalue of outerplanar graphs
Authors:
George Brooks,
Maggie Gu,
Jack Hyatt,
William Linz,
Linyuan Lu
Abstract:
For a fixed positive integer $k$ and a graph $G$, let $λ_k(G)$ denote the $k$-th largest eigenvalue of the adjacency matrix of $G$. In 2017, Tait and Tobin proved that the maximum $λ_1(G)$ among all outerplanar graphs on $n$ vertices is achieved by the fan graph $K_1\vee P_{n-1}$. In this paper, we consider a similar problem of determining the maximum $λ_2$ among all connected outerplanar graphs on $n$ vertices. For $n$ even and sufficiently large, we prove that the maximum $λ_2$ is uniquely achieved by the graph $(K_1\vee P_{n/2-1})\!\!-\!\!(K_1\vee P_{n/2-1})$, which is obtained by connecting two disjoint copies of $(K_1\vee P_{n/2-1})$ through a new edge joining their smallest degree vertices. When $n$ is odd and sufficiently large, the extremal graphs are not unique. The extremal graphs are those graphs $G$ that contain a cut vertex $u$ such that $G\setminus \{u\}$ is isomorphic to $2(K_1\vee P_{n/2-1})$. We also determine the maximum $λ_2$ among all 2-connected outerplanar graphs and asymptotically determine the maximum of $λ_k(G)$ among all connected outerplanar graphs for any fixed $k$.
Submitted 15 November, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Conics meeting eight lines over perfect fields
Authors:
Cameron Darwin,
Aygul Galimova,
Miao Pam Gu,
Stephen McKean
Abstract:
Over the complex numbers, there are 92 plane conics meeting 8 general lines in projective 3-space. Using the Euler class and local degree from motivic homotopy theory, we give an enriched version of this result over any perfect field. This provides a weighted count of the number of plane conics meeting 8 general lines, where the weight of each conic is determined by the geometry of its intersections with the 8 given lines. As a corollary, real conics meeting 8 general lines come in two families of equal size.
Submitted 25 April, 2023; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Automorphic-twisted summation formulae for pairs of quadratic spaces
Authors:
Miao Pam Gu
Abstract:
We prove a summation formula for pairs of quadratic spaces following the conjectures of Braverman-Kazhdan, Lafforgue, Ngô and Sakellaridis. We also give an expression of the local factors where all the data are unramified.
Submitted 22 November, 2024; v1 submitted 11 February, 2021;
originally announced February 2021.
-
Randomized Projection for Rank-Revealing Matrix Factorizations and Low-Rank Approximations
Authors:
Jed A. Duersch,
Ming Gu
Abstract:
Rank-revealing matrix decompositions provide an essential tool in spectral analysis of matrices, including the Singular Value Decomposition (SVD) and related low-rank approximation techniques. QR with Column Pivoting (QRCP) is usually suitable for these purposes, but it can be much slower than the unpivoted QR algorithm. For large matrices, the difference in performance is due to increased communication between the processor and slow memory, which QRCP needs in order to choose pivots during decomposition. Our main algorithm, Randomized QR with Column Pivoting (RQRCP), uses randomized projection to make pivot decisions from a much smaller sample matrix, which we can construct to reside in a faster level of memory than the original matrix. This technique may be understood as trading vastly reduced communication for a controlled increase in uncertainty during the decision process. For rank-revealing purposes, the selection mechanism in RQRCP produces results that are the same quality as the standard algorithm, but with performance near that of unpivoted QR (often an order of magnitude faster for large matrices). We also propose two formulas that facilitate further performance improvements. The first efficiently updates sample matrices to avoid computing new randomized projections. The second avoids large trailing updates during the decomposition in truncated low-rank approximations. Our truncated version of RQRCP also provides a key initial step in our truncated SVD approximation, TUXV. These advances open up a new performance domain for large matrix factorizations that will support efficient problem-solving techniques for challenging applications in science, engineering, and data analysis.
Submitted 10 August, 2020;
originally announced August 2020.
-
Thermodynamic Machine Learning through Maximum Work Production
Authors:
A. B. Boyd,
J. P. Crutchfield,
M. Gu
Abstract:
Adaptive systems -- such as a biological organism gaining survival advantage, an autonomous robot executing a functional task, or a motor protein transporting intracellular nutrients -- must model the regularities and stochasticity in their environments to take full advantage of thermodynamic resources. Analogously, but in a purely computational realm, machine learning algorithms estimate models to capture predictable structure and identify irrelevant noise in training data. This happens through optimization of performance metrics, such as model likelihood. If physically implemented, is there a sense in which computational models estimated through machine learning are physically preferred? We introduce the thermodynamic principle that work production is the most relevant performance metric for an adaptive physical agent and compare the results to the maximum-likelihood principle that guides machine learning. Within the class of physical agents that most efficiently harvest energy from their environment, we demonstrate that an efficient agent's model explicitly determines its architecture and how much useful work it harvests from the environment. We then show that selecting the maximum-work agent for given environmental data corresponds to finding the maximum-likelihood model. This establishes an equivalence between nonequilibrium thermodynamics and dynamic learning. In this way, work maximization emerges as an organizing principle that underlies learning in adaptive thermodynamic systems.
Submitted 12 April, 2021; v1 submitted 27 June, 2020;
originally announced June 2020.
-
Paired 3-disjoint path covers of balanced hypercubes
Authors:
Mei-Rong Guo,
Rong-Xia Hao,
Mei-Mei Gu
Abstract:
The balanced hypercube $BH_{n}$, proposed by Wu and Huang, is a variation of the hypercube. The paired 1-disjoint path cover of $BH_{n}$ is the Hamiltonian laceability, which was obtained by Xu et al. in [Appl. Math. Comput. 189 (2007) 1393--1401]. The paired 2-disjoint path cover of $BH_{n}$ was obtained by Cheng et al. in [Appl. Math. and Comput. 242 (2014) 127-142]. In this paper, we obtain the paired 3-disjoint path cover of $BH_{n}$ with $n\geq 3$. This result improves the above known results about the paired $k$-disjoint path covers of $BH_{n}$ for $k=1,2$.
Submitted 15 December, 2019;
originally announced December 2019.
-
Neighbor connectivity of $k$-ary $n$-cubes
Authors:
Tomáš Dvořák,
Mei-Mei Gu
Abstract:
The neighbor connectivity of a graph $G$ is the least number of vertices such that removing their closed neighborhoods from $G$ results in a graph that is disconnected, complete or empty. If a graph is used to model the topology of an interconnection network, this means that the failure of a network node causes failures of all its neighbors. We completely determine the neighbor connectivity of $k$-ary $n$-cubes for all $n\ge1$ and $k\ge2$.
Submitted 27 October, 2019;
originally announced October 2019.
-
Strong Menger connectedness of augmented $k$-ary $n$-cubes
Authors:
Mei-Mei Gu,
Jou-Ming Chang,
Rong-Xia Hao
Abstract:
A connected graph $G$ is called strongly Menger (edge) connected if for any two distinct vertices $x,y$ of $G$, there are $\min \{{\rm deg}_G(x), {\rm deg}_G(y)\}$ vertex(edge)-disjoint paths between $x$ and $y$. In this paper, we consider strong Menger (edge) connectedness of the augmented $k$-ary $n$-cube $AQ_{n,k}$, which is a variant of $k$-ary $n$-cube $Q_n^k$. By exploring the topological properties of $AQ_{n,k}$, we show that $AQ_{n,3}$ for $n\geq 4$ (resp. $AQ_{n,k}$ for $n\geq 2$ and $k\geq 4$) is still strongly Menger connected even when there are $4n-9$ (resp. $4n-8$) faulty vertices and $AQ_{n,k}$ is still strongly Menger edge connected even when there are $4n-4$ faulty edges for $n\geq 2$ and $k\geq 3$. Moreover, under the restricted condition that each vertex has at least two fault-free edges, we show that $AQ_{n,k}$ is still strongly Menger edge connected even when there are $8n-10$ faulty edges for $n\geq 2$ and $k\geq 3$. These results are all optimal in the sense of the maximum number of tolerated vertex (resp. edge) faults.
Submitted 2 October, 2019;
originally announced October 2019.
-
The Component Connectivity of Alternating Group Graphs and Split-Stars
Authors:
Mei-Mei Gu,
Rong-Xia Hao,
Jou-Ming Chang
Abstract:
For an integer $\ell\geqslant 2$, the $\ell$-component connectivity of a graph $G$, denoted by $κ_{\ell}(G)$, is the minimum number of vertices whose removal from $G$ results in a disconnected graph with at least $\ell$ components or a graph with fewer than $\ell$ vertices. This is a natural generalization of the classical connectivity of graphs defined in terms of the minimum vertex-cut and is a good measure of robustness for the graph corresponding to a network. So far, the exact values of $\ell$-connectivity are known only for a few classes of networks and small $\ell$'s. It has been pointed out in [Component connectivity of the hypercubes, Int. J. Comput. Math. 89 (2012) 137--145] that determining $\ell$-connectivity is still unsolved for most interconnection networks, such as alternating group graphs and star graphs. In this paper, by exploring the combinatorial properties and fault-tolerance of the alternating group graphs $AG_n$ and a variation of the star graphs called split-stars $S_n^2$, we study their $\ell$-component connectivities. We obtain the following results: (i) $κ_3(AG_n)=4n-10$ and $κ_4(AG_n)=6n-16$ for $n\geqslant 4$, and $κ_5(AG_n)=8n-24$ for $n\geqslant 5$; (ii) $κ_3(S_n^2)=4n-8$, $κ_4(S_n^2)=6n-14$, and $κ_5(S_n^2)=8n-20$ for $n\geqslant 4$.
Submitted 3 December, 2018;
originally announced December 2018.
-
Superlinear Convergence of Randomized Block Lanczos Algorithm
Authors:
Qiaochu Yuan,
Ming Gu,
Bo Li
Abstract:
The low rank approximation of matrices is a crucial component in many data mining applications today. A competitive algorithm for this class of problems is the randomized block Lanczos algorithm - an amalgamation of the traditional block Lanczos algorithm with a randomized starting matrix. While empirically this algorithm performs quite well, there have been scant new theoretical results on its convergence behavior and approximation accuracy, and past results have been restricted to certain parameter settings. In this paper, we present a unified singular value convergence analysis for this algorithm, for all valid choices of the block size parameter. We present novel results on the rate of singular value convergence and show that under certain spectrum regimes, the convergence is superlinear. Additionally, we provide results from numerical experiments that validate our analysis.
Submitted 19 August, 2018;
originally announced August 2018.
-
A theoretical framework of the scaled Gaussian stochastic process in prediction and calibration
Authors:
Mengyang Gu,
Fangzheng Xie,
Long Wang
Abstract:
Model calibration or data inversion is one of the fundamental tasks in uncertainty quantification. In this work, we study the theoretical properties of the scaled Gaussian stochastic process (S-GaSP), to model the discrepancy between reality and imperfect mathematical models. We establish the explicit connection between Gaussian stochastic process (GaSP) and S-GaSP through the orthogonal series representation. The predictive mean estimator in the S-GaSP calibration model converges to the reality at the same rate as the GaSP with a suitable choice of the regularization and scaling parameters. We also show the calibrated mathematical model in the S-GaSP calibration converges to the one that minimizes the $L_2$ loss between the reality and mathematical model, whereas the GaSP model with other widely used covariance functions does not have this property. Numerical examples confirm the excellent finite sample performance of our approaches compared to a few recent approaches.
Submitted 2 August, 2020; v1 submitted 10 July, 2018;
originally announced July 2018.
-
Spectrum-Revealing Cholesky Factorization for Kernel Methods
Authors:
Jianwei Xiao,
Ming Gu
Abstract:
Kernel methods represent some of the most popular machine learning tools for data analysis. Since exact kernel methods can be prohibitively expensive for large problems, reliable low-rank matrix approximations and high-performance implementations have become indispensable for practical applications of kernel methods. In this work, we introduce spectrum-revealing Cholesky factorization, a reliable low-rank matrix factorization, for kernel matrix approximation. We also develop an efficient and effective randomized algorithm for computing this factorization. Our numerical experiments demonstrate that this algorithm is as effective as other Cholesky factorization based kernel methods on machine learning problems, but significantly faster.
Submitted 13 April, 2018;
originally announced April 2018.
-
Fast Parallel Randomized QR with Column Pivoting Algorithms for Reliable Low-rank Matrix Approximations
Authors:
Jianwei Xiao,
Ming Gu,
Julien Langou
Abstract:
Factorizing large matrices by QR with column pivoting (QRCP) is substantially more expensive than QR without pivoting, owing to communication costs required for pivoting decisions. In contrast, randomized QRCP (RQRCP) algorithms have proven themselves empirically to be highly competitive with high-performance implementations of QR in processing time, on uniprocessor and shared memory machines, and as reliable as QRCP in pivot quality.
We show that RQRCP algorithms can be as reliable as QRCP with failure probabilities exponentially decaying in oversampling size. We also analyze efficiency differences among different RQRCP algorithms. More importantly, we develop distributed memory implementations of RQRCP that are significantly better than QRCP implementations in ScaLAPACK.
As a further development, we introduce the concept of and develop algorithms for computing spectrum-revealing QR factorizations for low-rank matrix approximations, and demonstrate their effectiveness against leading low-rank approximation methods in both theoretical and numerical reliability and efficiency.
Submitted 13 April, 2018;
originally announced April 2018.
-
Low-Rank Matrix Approximations with Flip-Flop Spectrum-Revealing QR Factorization
Authors:
Yuehua Feng,
Jianwei Xiao,
Ming Gu
Abstract:
We present Flip-Flop Spectrum-Revealing QR (Flip-Flop SRQR) factorization, a significantly faster and more reliable variant of the QLP factorization of Stewart, for low-rank matrix approximations. Flip-Flop SRQR uses SRQR factorization to initialize a partial column pivoted QR factorization and then compute a partial LQ factorization. As observed by Stewart in his original QLP work, Flip-Flop SRQR tracks the exact singular values with "considerable fidelity". We develop singular value lower bounds and residual error upper bounds for Flip-Flop SRQR factorization. In situations where singular values of the input matrix decay relatively quickly, the low-rank approximation computed by SRQR is guaranteed to be as accurate as truncated SVD. We also perform a complexity analysis to show that for the same accuracy, Flip-Flop SRQR is faster than randomized subspace iteration for approximating the SVD, the standard method used in Matlab tensor toolbox. We also compare Flip-Flop SRQR with alternatives on two applications, tensor approximation and nuclear norm minimization, to demonstrate its efficiency and effectiveness.
Submitted 14 March, 2018; v1 submitted 5 March, 2018;
originally announced March 2018.
-
Randomized Complete Pivoting for Solving Symmetric Indefinite Linear Systems
Authors:
Yuehua Feng,
Jianwei Xiao,
Ming Gu
Abstract:
The Bunch-Kaufman algorithm and Aasen's algorithm are two of the most widely used methods for solving symmetric indefinite linear systems, yet they both are known to suffer from occasional numerical instability due to potentially exponential element growth or unbounded entries in the matrix factorization. In this work, we develop a randomized complete pivoting (RCP) algorithm for solving symmetric indefinite linear systems. RCP is comparable to the Bunch-Kaufman algorithm and Aasen's algorithm in computational efficiency, yet enjoys theoretical element growth and bounded entries in the factorization comparable to that of complete-pivoting, up to a theoretical failure probability that exponentially decays with an oversampling parameter. Our finite precision analysis shows that RCP is as numerically stable as Gaussian elimination with complete pivoting, and RCP has been observed to be numerically stable in our extensive numerical experiments.
Submitted 29 September, 2017;
originally announced October 2017.
-
Factorization tests and algorithms arising from counting modular forms and automorphic representations
Authors:
Miao Gu,
Greg Martin
Abstract:
A theorem of Gekeler compares the number of non-isomorphic automorphic representations associated with the space of cusp forms of weight $k$ on $Γ_0(N)$ to a simpler function of $k$ and $N$, showing that the two are equal whenever $N$ is squarefree. We prove the converse of this theorem (with one small exception), thus providing a characterization of squarefree integers. We also establish a similar characterization of prime numbers in terms of the number of Hecke newforms of weight $k$ on $Γ_0(N)$.
It follows that a hypothetical fast algorithm for computing the number of such automorphic representations for even a single weight $k$ would yield a fast test for whether $N$ is squarefree. We also show how to obtain bounds on the possible square divisors of a number $N$ that has been found to not be squarefree via this test, and we show how to probabilistically obtain the complete factorization of the squarefull part of $N$ from the number of such automorphic representations for two different weights. If in addition we have the number of such Hecke newforms for even a single weight $k$, then we show how to probabilistically factor $N$ entirely. All of these computations could be performed quickly in practice, given the number(s) of automorphic representations and modular forms as input.
Submitted 21 June, 2018; v1 submitted 7 September, 2017;
originally announced September 2017.
-
Robust Gaussian Stochastic Process Emulation
Authors:
Mengyang Gu,
Xiaojing Wang,
James O. Berger
Abstract:
We consider estimation of the parameters of a Gaussian Stochastic Process (GaSP), in the context of emulation (approximation) of computer models for which the outcomes are real-valued scalars. The main focus is on estimation of the GaSP parameters through various generalized maximum likelihood methods, mostly involving finding posterior modes; this is because full Bayesian analysis in computer model emulation is typically prohibitively expensive. The posterior modes that are studied arise from objective priors, such as the reference prior. These priors have been studied in the literature for the situation of an isotropic covariance function or under the assumption of separability in the design of inputs for model runs used in the GaSP construction. In this paper, we consider more general designs (e.g., a Latin Hypercube Design) with a class of commonly used anisotropic correlation functions, which can be written as a product of isotropic correlation functions, each having an unknown range parameter and a fixed roughness parameter. We discuss properties of the objective priors and marginal likelihoods for the parameters of the GaSP and establish the posterior propriety of the GaSP parameters, but our main focus is to demonstrate that certain parameterizations result in more robust estimation of the GaSP parameters than others, and that some parameterizations that are in common use should clearly be avoided. These results are applicable to many frequently used covariance functions, e.g., power exponential, Matérn, rational quadratic and spherical covariance. We also generalize the results to the GaSP model with a nugget parameter. Both theoretical and numerical evidence is presented concerning the performance of the studied procedures.
Submitted 15 August, 2017;
originally announced August 2017.
-
A robust and efficient implementation of LOBPCG
Authors:
Jed A. Duersch,
Meiyue Shao,
Chao Yang,
Ming Gu
Abstract:
Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) is widely used to compute eigenvalues of large sparse symmetric matrices. The algorithm can suffer from numerical instability if it is not implemented with care. This is especially problematic when the number of eigenpairs to be computed is relatively large. In this paper we propose an improved basis selection strategy based on earlier work by Hetmaniuk and Lehoucq as well as a robust convergence criterion which is backward stable to enhance the robustness. We also suggest several algorithmic optimizations that improve performance of practical LOBPCG implementations. Numerical examples confirm that our approach consistently and significantly outperforms previous competing approaches in both stability and speed.
Submitted 24 April, 2018; v1 submitted 24 April, 2017;
originally announced April 2017.
-
Fault diagnosability of data center networks
Authors:
Mei-Mei Gu,
Rong-Xia Hao,
Shuming Zhou
Abstract:
The data center network $D_{k,n}$, proposed in 2008, has many desirable features such as high network capacity. A generalization of diagnosability for a network $G$ is the $g$-good-neighbor diagnosability, denoted by $t_g(G)$. Let $κ^g(G)$ be the $R^g$-connectivity. Lin et al. in [IEEE Trans. on Reliability, 65 (3) (2016) 1248--1262] and Xu et al. in [Theor. Comput. Sci. 659 (2017) 53--63] independently posed the same open problem: the relationship between the $R^g$-connectivity $κ^g(G)$ and $t_g(G)$ of a general graph $G$ needs to be studied. In this paper, this open problem is solved for general regular graphs. We first establish the relationship between $κ^g(G)$ and $t_g(G)$, and obtain that $t_g(G)=κ^g(G)+g$ under some conditions. Second, we obtain the $g$-good-neighbor diagnosability of $D_{k,n}$, namely $t_g(D_{k,n})=(g+1)(k-1)+n+g$ for $1\leq g\leq n-1$ under both the PMC model and the MM model. Furthermore, we show that $D_{k,n}$ is tightly super $(n+k-1)$-connected for $n\geq 2$ and $k\geq 2$, and we also prove that the largest connected component of the survival graph contains almost all of the remaining vertices in $D_{k,n}$ when $2k+n-2$ vertices are removed.
Submitted 29 January, 2017;
originally announced February 2017.
-
Equal relation between the extra connectivity and pessimistic diagnosability for some regular graphs
Authors:
Mei-Mei Gu,
Rong-Xia Hao,
Jun-Ming Xu,
Yan-Quan Feng
Abstract:
Extra connectivity and pessimistic diagnosis are two crucial subjects for a multiprocessor system's ability to tolerate and diagnose faulty processors. The pessimistic diagnosis strategy is a classic strategy based on the PMC model that isolates all faulty vertices within a set containing at most one fault-free vertex. In this paper, we show that the pessimistic diagnosability $t_p(G)$ equals the extra connectivity $κ_{1}(G)$ of a regular graph $G$ under some conditions. Furthermore, the following new results are obtained: the pessimistic diagnosability $t_p(S_n^2)=4n-9$ for split-star networks $S_n^2$, $t_p(Γ_n)=2n-4$ for Cayley graphs generated by transposition trees $Γ_n$, $t_p(Γ_{n}(Δ))=4n-11$ for the Cayley graph generated by the $2$-tree $Γ_{n}(Δ)$, and $t_{p}(BP_n)=2n-2$ for the burnt pancake networks $BP_n$. As corollaries, the known results about the extra connectivity and the pessimistic diagnosability of many well-known networks, including the alternating group graphs, the alternating group networks, BC networks, and the $k$-ary $n$-cube networks, are obtained directly.
Submitted 29 January, 2017;
originally announced January 2017.
-
Fault-tolerance of balanced hypercubes with faulty vertices and faulty edges
Authors:
Mei-Mei Gu,
Rong-Xia Hao
Abstract:
Let $F_{v}$ (resp. $F_e$) be the set of faulty vertices (resp. faulty edges) in the $n$-dimensional balanced hypercube $BH_n$. Fault-tolerant Hamiltonian laceability in $BH_n$ with at most $2n-2$ faulty edges is obtained in [Inform. Sci. 300 (2015) 20--27]. The existence of edge-Hamiltonian cycles in $BH_n-F_e$ for $|F_e|\leq 2n-2$ is obtained in [Appl. Math. Comput. 244 (2014) 447--456]. Up to now, almost all results about fault-tolerance in $BH_n$ concern only faulty vertices or only faulty edges. In this paper, we consider fault-tolerant cycle embedding of $BH_n$ with both faulty vertices and faulty edges, and prove that there exists a fault-free cycle of length $2^{2n}-2|F_v|$ in $BH_n$ with $|F_v|+|F_e|\leq 2n-2$ and $|F_v|\leq n-1$ for $n\geq 2$. Since $BH_n$ is a bipartite graph with two partite sets of equal size, a cycle of length $2^{2n}-2|F_v|$ is the longest possible in the worst case.
Submitted 26 April, 2016;
originally announced April 2016.
-
An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation
Authors:
David G. Anderson,
Ming Gu
Abstract:
Low-rank matrix approximation is a fundamental tool in data analysis for processing large datasets, reducing noise, and finding important signals. In this work, we present a novel truncated LU factorization called Spectrum-Revealing LU (SRLU) for effective low-rank matrix approximation, and develop a fast algorithm to compute an SRLU factorization. We provide both matrix and singular value approximation error bounds for the SRLU approximation computed by our algorithm. Our analysis suggests that SRLU is competitive with the best low-rank matrix approximation methods, deterministic or randomized, in both computational complexity and approximation quality. Numeric experiments illustrate that SRLU preserves sparsity, highlights important data features and variables, can be efficiently updated, and calculates data approximations nearly as accurately as possible. To the best of our knowledge this is the first practical variant of the LU factorization for effective and efficient low-rank matrix approximation.
Submitted 18 August, 2017; v1 submitted 18 February, 2016;
originally announced February 2016.
-
Using quantum theory to reduce the complexity of input-output processes
Authors:
Jayne Thompson,
Andrew J. P. Garner,
Vlatko Vedral,
Mile Gu
Abstract:
All natural things process and transform information. They receive environmental information as input, and transform it into appropriate output responses. Much of science is dedicated to building models of such systems -- algorithmic abstractions of their input-output behavior that allow us to simulate how such systems can behave in the future, conditioned on what has transpired in the past. Here, we show that classical models cannot avoid inefficiency -- storing past information that is unnecessary for correct future simulation. We construct quantum models that mitigate this waste, whenever it is physically possible to do so. This suggests that the complexity of general input-output processes depends fundamentally on what sort of information theory we use to describe them.
Submitted 13 February, 2017; v1 submitted 20 January, 2016;
originally announced January 2016.
-
Gaussian Elimination with Randomized Complete Pivoting
Authors:
Christopher Melgaard,
Ming Gu
Abstract:
Gaussian elimination with partial pivoting (GEPP) has long been among the most widely used methods for computing the LU factorization of a given matrix. However, this method is also known to fail for matrices that induce large element growth during the factorization process. In this paper, we propose a new scheme, Gaussian elimination with randomized complete pivoting (GERCP), for the efficient and reliable LU factorization of a given matrix. GERCP satisfies GECP (Gaussian elimination with complete pivoting) style element growth bounds with high probability, yet costs only marginally more than GEPP. Our numerical experimental results strongly suggest that GERCP is as reliable as GECP and as efficient as GEPP for computing the LU factorization.
Submitted 26 November, 2015;
originally announced November 2015.
-
The classical-quantum divergence of complexity in modelling spin chains
Authors:
Whei Yeap Suen,
Jayne Thompson,
Andrew J. P. Garner,
Vlatko Vedral,
Mile Gu
Abstract:
The minimal memory required to model a given stochastic process - known as the statistical complexity - is a widely adopted quantifier of structure in complexity science. Here, we ask if quantum mechanics can fundamentally change the qualitative behaviour of this measure. We study this question in the context of the classical Ising spin chain. In this system, the statistical complexity is known to grow monotonically with temperature. We evaluate the spin chain's quantum mechanical statistical complexity by explicitly constructing its provably simplest quantum model, and demonstrate that this measure exhibits drastically different behaviour: it rises to a maximum at some finite temperature then tends back towards zero for higher temperatures. This demonstrates how complexity, as captured by the amount of memory required to model a process, can exhibit radically different behaviour when quantum processing is allowed.
Submitted 31 July, 2017; v1 submitted 18 November, 2015;
originally announced November 2015.
-
Randomized QR with Column Pivoting
Authors:
Jed A. Duersch,
Ming Gu
Abstract:
The dominant contribution to communication complexity in factorizing a matrix using QR with column pivoting is due to column-norm updates that are required to process pivot decisions. We use randomized sampling to approximate this process which dramatically reduces communication in column selection. We also introduce a sample update formula to reduce the cost of sampling trailing matrices. Using our column selection mechanism we observe results that are comparable in quality to those obtained from the QRCP algorithm, but with performance near unpivoted QR. We also demonstrate strong parallel scalability on shared memory multiple core systems using an implementation in Fortran with OpenMP.
This work immediately extends to produce low-rank truncated approximations of large matrices. We propose a truncated QR factorization with column pivoting that avoids trailing matrix updates which are used in current implementations of level-3 BLAS QR and QRCP. Provided the truncation rank is small, avoiding trailing matrix updates reduces approximation time by nearly half. By using these techniques and employing a variation on Stewart's QLP algorithm, we develop an approximate truncated SVD that runs nearly as fast as truncated QR.
Submitted 29 May, 2017; v1 submitted 22 September, 2015;
originally announced September 2015.
-
Subspace Iteration Randomization and Singular Value Problems
Authors:
Ming Gu
Abstract:
A classical problem in matrix computations is the efficient and reliable approximation of a given matrix by a matrix of lower rank. The truncated singular value decomposition (SVD) is known to provide the best such approximation for any given fixed rank. However, the SVD is also known to be very costly to compute. Among the different approaches in the literature for computing low-rank approximations, randomized algorithms have attracted researchers' recent attention due to their surprising reliability and computational efficiency in different application areas. Typically, such algorithms are shown to compute with very high probability low-rank approximations that are within a constant factor from optimal, and are known to perform even better in many practical situations. In this paper, we present a novel error analysis that considers randomized algorithms within the subspace iteration framework and show with very high probability that highly accurate low-rank approximations as well as singular values can indeed be computed quickly for matrices with rapidly decaying singular values. Such matrices appear frequently in diverse application areas such as data analysis, fast structured matrix computations and fast direct methods for large sparse linear systems of equations and are the driving motivation for randomized methods. Furthermore, we show that the low-rank approximations computed by these randomized algorithms are actually rank-revealing approximations, and the special case of a rank-1 approximation can also be used to correctly estimate matrix 2-norms with very high probability. Our numerical experiments are in full support of our conclusions.
Submitted 10 August, 2014;
originally announced August 2014.
-
3-extra connectivity of 3-ary n-cube networks
Authors:
Meimei Gu,
Rongxia Hao
Abstract:
Let G be a connected graph and S be a set of vertices. The h-extra connectivity of G is the cardinality of a minimum set S such that G-S is disconnected and each component of G-S has at least h+1 vertices. The h-extra connectivity is an important parameter for measuring the reliability and fault tolerance of large interconnection networks. The h-extra connectivity of the k-ary n-cube for h=1,2 was obtained by Hsieh et al. in [Theoretical Computer Science, 443 (2012) 63-69] for k>=4 and by Zhu et al. in [Theory of Computing Systems, arxiv.org/pdf/1105.0991v1 [cs.DM] 5 May 2011] for k=3. In this paper, we show that the h-extra connectivity of the 3-ary n-cube networks for h=3 is equal to 8n-12, where n>=3.
Submitted 19 September, 2013;
originally announced September 2013.
-
Exponential-Condition-Based Barrier Certificate Generation for Safety Verification of Hybrid Systems
Authors:
Hui Kong,
Fei He,
Xiaoyu Song,
William N. N. Hung,
Ming Gu
Abstract:
A barrier certificate is an inductive invariant function which can be used for the safety verification of a hybrid system. Safety verification based on barrier certificates has the benefit of avoiding explicit computation of the exact reachable set, which is usually intractable for nonlinear hybrid systems. In this paper, we propose a new barrier certificate condition, called Exponential Condition, for the safety verification of semi-algebraic hybrid systems. The most important benefit of Exponential Condition is that it is less conservative than the existing convex condition while still possessing the property of convexity. On the one hand, a less conservative barrier certificate forms a tighter over-approximation of the reachable set and hence is able to verify critical safety properties. On the other hand, the property of convexity guarantees its solvability by semidefinite programming. Some examples are presented to illustrate the effectiveness and practicality of our method.
Submitted 27 March, 2013;
originally announced March 2013.
-
An improved dqds algorithm
Authors:
Shengguo Li,
Ming Gu,
Beresford N. Parlett
Abstract:
In this paper we present an improved dqds algorithm for computing all the singular values of a bidiagonal matrix to high relative accuracy. There are two key contributions: a novel deflation strategy that improves the convergence for badly scaled matrices, and some modifications to certain shift strategies that accelerate the convergence for most bidiagonal matrices. These techniques together ensure linear worst case complexity of the improved algorithm (denoted by V5). Our extensive numerical experiments indicate that V5 is typically 1.2x--4x faster than DLASQ (the LAPACK-3.4.0 implementation of dqds) without any degradation in accuracy. On matrices for which DLASQ shows very slow convergence, V5 can be 3x--10x faster. At the end of this paper, a hybrid algorithm (HDLASQ) is developed by combining our improvements with the aggressive early deflation strategy (AggDef2 in [SIAM J. Matrix Anal. Appl., 33(2012), 22-51]). Numerical results show that HDLASQ is the fastest among these different versions.
Submitted 2 March, 2014; v1 submitted 24 September, 2012;
originally announced September 2012.
-
LU factorization with panel rank revealing pivoting and its communication avoiding version
Authors:
Amal Khabou,
James W. Demmel,
Laura Grigori,
Ming Gu
Abstract:
We present the LU decomposition with panel rank revealing pivoting (LU_PRRP), an LU factorization algorithm based on strong rank revealing QR panel factorization. LU_PRRP is more stable than Gaussian elimination with partial pivoting (GEPP). Our extensive numerical experiments show that the new factorization scheme is as numerically stable as GEPP in practice, but it is more resistant to pathological cases and easily solves the Wilkinson matrix and the Foster matrix. We also present CALU_PRRP, a communication avoiding version of LU_PRRP that minimizes communication. CALU_PRRP is based on tournament pivoting, with the selection of the pivots at each step of the tournament being performed via strong rank revealing QR factorization. CALU_PRRP is more stable than CALU, the communication avoiding version of GEPP. CALU_PRRP is also more stable in practice and is resistant to pathological cases on which GEPP and CALU fail.
Submitted 12 August, 2012;
originally announced August 2012.
-
A reduced complexity numerical method for optimal gate synthesis
Authors:
Srinivas Sridharan,
Mile Gu,
Matthew R. James,
William M. McEneaney
Abstract:
Although quantum computers have the potential to efficiently solve certain problems considered difficult by known classical approaches, the design of a quantum circuit remains computationally difficult. It is known that the optimal gate design problem is equivalent to the solution of an associated optimal control problem, the solution to which is also computationally intensive. Hence, in this article, we introduce the application of a class of numerical methods (termed the max-plus curse-of-dimensionality-free techniques) that determine the optimal control, thereby synthesizing the desired unitary gate. The application of this technique to quantum systems has a growth in complexity that depends on the cardinality of the control set approximation rather than the much larger growth with respect to spatial dimensions in approaches based on gridding of the space, used in previous literature. This technique is demonstrated by obtaining an approximate solution for the gate synthesis on $SU(4)$, a problem that is computationally intractable by grid-based approaches.
Submitted 27 November, 2010;
originally announced November 2010.
-
PARNES: A rapidly convergent algorithm for accurate recovery of sparse and approximately sparse signals
Authors:
Ming Gu,
Lek-Heng Lim,
Cinna Julie Wu
Abstract:
In this article, we propose an algorithm, NESTA-LASSO, for the LASSO problem, i.e., an underdetermined linear least-squares problem with a 1-norm constraint on the solution. We prove under the assumption of the restricted isometry property (RIP) and a sparsity condition on the solution, that NESTA-LASSO is guaranteed to be almost always locally linearly convergent. As in the case of the algorithm NESTA proposed by Becker, Bobin, and Candes, we rely on Nesterov's accelerated proximal gradient method, which takes O(e^{-1/2}) iterations to come within e > 0 of the optimal value. We introduce a modification to Nesterov's method that regularly updates the prox-center in a provably optimal manner, and the aforementioned linear convergence is in part due to this modification.
In the second part of this article, we attempt to solve the basis pursuit denoising (BPDN) problem (i.e., approximating the minimum 1-norm solution to an underdetermined least squares problem) by using NESTA-LASSO in conjunction with the Pareto root-finding method employed by van den Berg and Friedlander in their SPGL1 solver. The resulting algorithm is called PARNES. We provide numerical evidence to show that it is comparable to currently available solvers.
Submitted 30 March, 2012; v1 submitted 2 November, 2009;
originally announced November 2009.
-
Global Minimization of Rational Functions and the Nearest GCDs
Authors:
Jiawang Nie,
James Demmel,
Ming Gu
Abstract:
This paper has been withdrawn by the authors due to its publication
Submitted 18 September, 2007; v1 submitted 5 January, 2006;
originally announced January 2006.