-
A Hierarchical Constructive Heuristic for Large-Scale Survivable Traffic Grooming Problem under Double-Link Failures
Authors:
Silong Zhang,
Jixuan Feng,
Junyan Liu,
Yu Liu,
Zhou Xu,
Fan Zhang
Abstract:
This paper studies a survivable traffic grooming problem in large-scale optical transport networks under double-link failures (STG2). Each communication demand must be assigned a route for every possible scenario involving zero, one, or two failed fiber links. Protection against double-link failures is crucial for ensuring reliable telecommunications services while minimizing equipment costs, maki…
▽ More
This paper studies a survivable traffic grooming problem in large-scale optical transport networks under double-link failures (STG2). Each communication demand must be assigned a route for every possible scenario involving zero, one, or two failed fiber links. Protection against double-link failures is crucial for ensuring reliable telecommunications services while minimizing equipment costs, making it essential for telecommunications companies today. However, this significantly complicates the problem and is rarely addressed in existing studies. Furthermore, current research typically examines networks with fewer than 300 nodes, much smaller than some emerging networks containing thousands of nodes. To address these challenges, we propose a novel hierarchical constructive heuristic for STG2. This heuristic constructs and assigns routes to communication demands across different scenarios by following a hierarchical sequence. It incorporates several innovative optimization techniques and utilizes parallel computing to enhance efficiency. Extensive experiments have been conducted on large-scale STG2 instances provided by our industry partner, encompassing networks with 1,000 to 2,600 nodes. Results demonstrate that within a one-hour time limit and a 16 GB memory limit set by the industry partner, our heuristic improves the objective values of the best-known solutions by 18.5\% on average, highlighting its significant potential for practical applications.
△ Less
Submitted 15 June, 2025;
originally announced June 2025.
-
A Sparse Bayesian Learning Algorithm for Estimation of Interaction Kernels in Motsch-Tadmor Model
Authors:
Jinchao Feng,
Sui Tang
Abstract:
In this paper, we investigate the data-driven identification of asymmetric interaction kernels in the Motsch-Tadmor model based on observed trajectory data. The model under consideration is governed by a class of semilinear evolution equations, where the interaction kernel defines a normalized, state-dependent Laplacian operator that governs collective dynamics. To address the resulting nonlinear…
▽ More
In this paper, we investigate the data-driven identification of asymmetric interaction kernels in the Motsch-Tadmor model based on observed trajectory data. The model under consideration is governed by a class of semilinear evolution equations, where the interaction kernel defines a normalized, state-dependent Laplacian operator that governs collective dynamics. To address the resulting nonlinear inverse problem, we propose a variational framework that reformulates kernel identification using the implicit form of the governing equations, reducing it to a subspace identification problem. We establish an identifiability result that characterizes conditions under which the interaction kernel can be uniquely recovered up to scale. To solve the inverse problem robustly, we develop a sparse Bayesian learning algorithm that incorporates informative priors for regularization, quantifies uncertainty, and enables principled model selection. Extensive numerical experiments on representative interacting particle systems demonstrate the accuracy, robustness, and interpretability of the proposed framework across a range of noise levels and data regimes.
△ Less
Submitted 11 May, 2025;
originally announced May 2025.
-
Exponentially accurate spectral Monte Carlo method for linear PDEs and their error estimates
Authors:
Jiaying Feng,
Changtao Sheng,
Chenglong Xu
Abstract:
This paper introduces a spectral Monte Carlo iterative method (SMC) for solving linear Poisson and parabolic equations driven by $α$-stable Lévy process with $α\in (0,2)$, which was initially proposed and developed by Gobet and Maire in their pioneering works (Monte Carlo Methods Appl 10(3-4), 275--285, 2004, and SIAM J Numer Anal 43(3), 1256--1275, 2005) for the case $α=2$. The novel method effec…
▽ More
This paper introduces a spectral Monte Carlo iterative method (SMC) for solving linear Poisson and parabolic equations driven by $α$-stable Lévy process with $α\in (0,2)$, which was initially proposed and developed by Gobet and Maire in their pioneering works (Monte Carlo Methods Appl 10(3-4), 275--285, 2004, and SIAM J Numer Anal 43(3), 1256--1275, 2005) for the case $α=2$. The novel method effectively integrates multiple computational techniques, including the interpolation based on generalized Jacobi functions (GJFs), space-time spectral methods, control variates techniques, and a novel walk-on-sphere method (WOS). The exponential convergence of the error bounds is rigorously established through finite iterations for both Poisson and parabolic equations involving the integral fractional Laplacian operator. Remarkably, the proposed space-time spectral Monte Carlo method (ST-SMC) for the parabolic equation is unified for both $α\in(0,2)$ and $α=2$. Extensive numerical results are provided to demonstrate the spectral accuracy and efficiency of the proposed method, thereby validating the theoretical findings.
△ Less
Submitted 20 February, 2025;
originally announced February 2025.
-
A system of Schrödinger's problems and functional equations
Authors:
Toshio Mikami,
Jin Feng
Abstract:
We propose and study a system of Schrödinger's problems and functional equations in probability theory. More precisely, we consider a system of variational problems of relative entropies for probability measures on a Euclidean space with given two endpoint marginals, which can be defined inductively. We also consider an inductively defined system of functional equations, which are Euler's equation…
▽ More
We propose and study a system of Schrödinger's problems and functional equations in probability theory. More precisely, we consider a system of variational problems of relative entropies for probability measures on a Euclidean space with given two endpoint marginals, which can be defined inductively. We also consider an inductively defined system of functional equations, which are Euler's equations for our variational problems. These are generalizations of Schrödinger's problem and functional equation. % in probability theory. We prove the existence and uniqueness of solutions to our functional equations, % up to a multiplicative function, from which we show the existence and uniqueness of a minimizer of our variational problem. Our problem gives an approach for a stochastic optimal transport analog of the Knothe--Rosenblatt rearrangement
via a variational problem point of view.
△ Less
Submitted 15 June, 2025; v1 submitted 31 December, 2024;
originally announced January 2025.
-
Set stabilization of Boolean control networks based on bisimulations: A dimensionality reduction approach
Authors:
Tiantian Mu,
Jun-e Feng,
Biao Wang
Abstract:
This paper exploits bisimulation relations, generated by extracting the concept of morphisms between algebraic structures, to analyze set stabilization of Boolean control networks with lower complexity. First, for two kinds of bisimulation relations, called as weak bisimulation and strong bisimulation relations, a novel verification method is provided by constructing the bisimulation matrices. The…
▽ More
This paper exploits bisimulation relations, generated by extracting the concept of morphisms between algebraic structures, to analyze set stabilization of Boolean control networks with lower complexity. First, for two kinds of bisimulation relations, called as weak bisimulation and strong bisimulation relations, a novel verification method is provided by constructing the bisimulation matrices. Then the comparison for set stabilization of BCNs via two kinds of bisimulation methods is presented, which involves the dimensionality of quotient systems and dependency of the control laws on the original system. Moreover, the proposed method is also applied to the analysis of probabilistic Boolean control networks to establish the unified analysis framework of bisimulations. Finally, the validity of the obtained results is verified by the practical example.
△ Less
Submitted 23 December, 2024;
originally announced December 2024.
-
Topology-Aware 3D Gaussian Splatting: Leveraging Persistent Homology for Optimized Structural Integrity
Authors:
Tianqi Shen,
Shaohua Liu,
Jiaqi Feng,
Ziye Ma,
Ning An
Abstract:
Gaussian Splatting (GS) has emerged as a crucial technique for representing discrete volumetric radiance fields. It leverages unique parametrization to mitigate computational demands in scene optimization. This work introduces Topology-Aware 3D Gaussian Splatting (Topology-GS), which addresses two key limitations in current approaches: compromised pixel-level structural integrity due to incomplete…
▽ More
Gaussian Splatting (GS) has emerged as a crucial technique for representing discrete volumetric radiance fields. It leverages unique parametrization to mitigate computational demands in scene optimization. This work introduces Topology-Aware 3D Gaussian Splatting (Topology-GS), which addresses two key limitations in current approaches: compromised pixel-level structural integrity due to incomplete initial geometric coverage, and inadequate feature-level integrity from insufficient topological constraints during optimization. To overcome these limitations, Topology-GS incorporates a novel interpolation strategy, Local Persistent Voronoi Interpolation (LPVI), and a topology-focused regularization term based on persistent barcodes, named PersLoss. LPVI utilizes persistent homology to guide adaptive interpolation, enhancing point coverage in low-curvature areas while preserving topological structure. PersLoss aligns the visual perceptual similarity of rendered images with ground truth by constraining distances between their topological features. Comprehensive experiments on three novel-view synthesis benchmarks demonstrate that Topology-GS outperforms existing methods in terms of PSNR, SSIM, and LPIPS metrics, while maintaining efficient memory usage. This study pioneers the integration of topology with 3D-GS, laying the groundwork for future research in this area.
△ Less
Submitted 14 June, 2025; v1 submitted 21 December, 2024;
originally announced December 2024.
-
$DW$-DP operators and $DW$-limited operators on Banach lattices
Authors:
Jin Xi Chen,
Jingge Feng
Abstract:
This paper is devoted to the study of two classes of operators related to disjointly weakly compact sets, which we call $DW$-DP operators and $DW$-limited operators, respectively. They carry disjointly weakly compact subsets of a Banach lattice onto Dunford-Pettis sets and limited sets, respectively. We show that $DW$-DP (resp. $DW$-limited) operators are precisely the operators which are both wea…
▽ More
This paper is devoted to the study of two classes of operators related to disjointly weakly compact sets, which we call $DW$-DP operators and $DW$-limited operators, respectively. They carry disjointly weakly compact subsets of a Banach lattice onto Dunford-Pettis sets and limited sets, respectively. We show that $DW$-DP (resp. $DW$-limited) operators are precisely the operators which are both weak Dunford-Pettis and order Dunford-Pettis (resp. weak$^*$ Dunford-Pettis and order limited) operators. Furthermore, the approximation properties of positive $DW$-DP and positive $DW$-limited operators are given.
△ Less
Submitted 2 December, 2024;
originally announced December 2024.
-
Proximal methods for structured nonsmooth optimization over Riemannian submanifolds
Authors:
Qia Li,
Na Zhang,
Hanwei Yan,
Junyu Feng
Abstract:
In this paper, we consider a class of structured nonsmooth optimization problems over an embedded submanifold of a Euclidean space, where the first part of the objective is the sum of a difference-of-convex (DC) function and a smooth function, while the remaining part is the square of a weakly convex function over a smooth function. This model problem has many important applications in machine lea…
▽ More
In this paper, we consider a class of structured nonsmooth optimization problems over an embedded submanifold of a Euclidean space, where the first part of the objective is the sum of a difference-of-convex (DC) function and a smooth function, while the remaining part is the square of a weakly convex function over a smooth function. This model problem has many important applications in machine learning and scientific computing, for example, the sparse generalized eigenvalue problem. We propose a manifold proximal-gradient-subgradient algorithm (MPGSA) and show that under mild conditions any accumulation point of the solution sequence generated by it is a critical point of the underlying problem. By assuming the Kurdyka-Łojasiewicz property of an auxiliary function, we further establish the convergence of the full sequence generated by MPGSA under some suitable conditions. When the second component of the DC function involved is the maximum of finite continuously differentiable convex functions, we also propose an enhanced MPGSA with guaranteed subsequential convergence to a lifted B-stationary points of the optimization problem. Finally, some preliminary numerical experiments are conducted to illustrate the efficiency of the proposed algorithms.
△ Less
Submitted 21 February, 2025; v1 submitted 24 November, 2024;
originally announced November 2024.
-
$DW$-compact operators on Banach lattices
Authors:
Jin Xi Chen,
Jingge Feng
Abstract:
This paper is devoted to the study of $DW$-compact operators, that is, those operators which map disjointly weakly compact sets in a Banach lattice onto relatively compact sets. We show that $DW$-compact operators are precisely the operators which are both Dunford-Pettis and $AM$-compact. As an application, Banach lattices with the property that every disjointly weakly compact set is a limited (re…
▽ More
This paper is devoted to the study of $DW$-compact operators, that is, those operators which map disjointly weakly compact sets in a Banach lattice onto relatively compact sets. We show that $DW$-compact operators are precisely the operators which are both Dunford-Pettis and $AM$-compact. As an application, Banach lattices with the property that every disjointly weakly compact set is a limited (resp. Dunford-Pettis) set, are characterized by using $DW$-compact operators.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Efficient Combinatorial Optimization via Heat Diffusion
Authors:
Hengyuan Ma,
Wenlian Lu,
Jianfeng Feng
Abstract:
Combinatorial optimization problems are widespread but inherently challenging due to their discrete nature. The primary limitation of existing methods is that they can only access a small fraction of the solution space at each iteration, resulting in limited efficiency for searching the global optimal. To overcome this challenge, diverging from conventional efforts of expanding the solver's search…
▽ More
Combinatorial optimization problems are widespread but inherently challenging due to their discrete nature. The primary limitation of existing methods is that they can only access a small fraction of the solution space at each iteration, resulting in limited efficiency for searching the global optimal. To overcome this challenge, diverging from conventional efforts of expanding the solver's search scope, we focus on enabling information to actively propagate to the solver through heat diffusion. By transforming the target function while preserving its optima, heat diffusion facilitates information flow from distant regions to the solver, providing more efficient navigation. Utilizing heat diffusion, we propose a framework for solving general combinatorial optimization problems. The proposed methodology demonstrates superior performance across a range of the most challenging and widely encountered combinatorial optimizations. Echoing recent advancements in harnessing thermodynamics for generative artificial intelligence, our study further reveals its significant potential in advancing combinatorial optimization.
△ Less
Submitted 26 September, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Using binary string to prove the Collatz conjecture
Authors:
Jishe Feng
Abstract:
We introduce a full binary directed tree structure to represent the set of natural numbers, further categorizing them into three distinct subsets: pure odd numbers, pure even numbers, and mixed numbers. We adopt a binary string representation for natural numbers and elaborate on the composite methodology encompassing odd- and even-number functions. Our analysis focuses on examining the iteration s…
▽ More
We introduce a full binary directed tree structure to represent the set of natural numbers, further categorizing them into three distinct subsets: pure odd numbers, pure even numbers, and mixed numbers. We adopt a binary string representation for natural numbers and elaborate on the composite methodology encompassing odd- and even-number functions. Our analysis focuses on examining the iteration sequence (or composition) of the Collatz function and its reduced variant, which serves as an analog to the inverse function, to scrutinize the validity of the Collatz conjecture. To substantiate this conjecture, we incorporate binary strings into an algebraic formula that captures the essence of the Collatz sequence. By this means, we transform discrete powers of 2 into continuous counterparts, ultimately culminating in the smallest natural number, 1. Consequently, the sequence generated through infinite iterations of the Collatz function emerges as an eventually periodic sequence, thereby validating an enduring 87-year-old conjecture.
△ Less
Submitted 11 June, 2024; v1 submitted 20 September, 2023;
originally announced February 2024.
-
Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction
Authors:
Jie Feng,
Ke Wei,
Jinchi Chen
Abstract:
Natural policy gradient (NPG) and its variants are widely-used policy search methods in reinforcement learning. Inspired by prior work, a new NPG variant coined NPG-HM is developed in this paper, which utilizes the Hessian-aided momentum technique for variance reduction, while the sub-problem is solved via the stochastic gradient descent method. It is shown that NPG-HM can achieve the global last…
▽ More
Natural policy gradient (NPG) and its variants are widely-used policy search methods in reinforcement learning. Inspired by prior work, a new NPG variant coined NPG-HM is developed in this paper, which utilizes the Hessian-aided momentum technique for variance reduction, while the sub-problem is solved via the stochastic gradient descent method. It is shown that NPG-HM can achieve the global last iterate $ε$-optimality with a sample complexity of $\mathcal{O}(ε^{-2})$, which is the best known result for natural policy gradient type methods under the generic Fisher non-degenerate policy parameterizations. The convergence analysis is built upon a relaxed weak gradient dominance property tailored for NPG under the compatible function approximation framework, as well as a neat way to decompose the error when handling the sub-problem. Moreover, numerical experiments on Mujoco-based environments demonstrate the superior performance of NPG-HM over other state-of-the-art policy gradient methods.
△ Less
Submitted 21 January, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Scalable iterative data-adaptive RKHS regularization
Authors:
Haibo Li,
Jinchao Feng,
Fei Lu
Abstract:
We present iDARR, a scalable iterative Data-Adaptive RKHS Regularization method, for solving ill-posed linear inverse problems. The method searches for solutions in subspaces where the true solution can be identified, with the data-adaptive RKHS penalizing the spaces of small singular values. At the core of the method is a new generalized Golub-Kahan bidiagonalization procedure that recursively co…
▽ More
We present iDARR, a scalable iterative Data-Adaptive RKHS Regularization method, for solving ill-posed linear inverse problems. The method searches for solutions in subspaces where the true solution can be identified, with the data-adaptive RKHS penalizing the spaces of small singular values. At the core of the method is a new generalized Golub-Kahan bidiagonalization procedure that recursively constructs orthonormal bases for a sequence of RKHS-restricted Krylov subspaces. The method is scalable with a complexity of $O(kmn)$ for $m$-by-$n$ matrices with $k$ denoting the iteration numbers. Numerical tests on the Fredholm integral equation and 2D image deblurring show that it outperforms the widely used $L^2$ and $l^2$ norms, producing stable accurate solutions consistently converging when the noise level decays.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Limited bisimulations for nondeterministic fuzzy transition systems
Authors:
Sha Qiao,
Jun e Feng,
Ping Zhu
Abstract:
The limited version of bisimulation, called limited approximate bisimulation, has recently been introduced to fuzzy transition systems (NFTSs). This article extends limited approximate bisimulation to NFTSs, which are more general structures than FTSs, to introduce a notion of $k$-limited $α$-bisimulation by using an approach of relational lifting, where $k$ is a natural number and $α\in[0,1]$. To…
▽ More
The limited version of bisimulation, called limited approximate bisimulation, has recently been introduced to fuzzy transition systems (NFTSs). This article extends limited approximate bisimulation to NFTSs, which are more general structures than FTSs, to introduce a notion of $k$-limited $α$-bisimulation by using an approach of relational lifting, where $k$ is a natural number and $α\in[0,1]$. To give the algorithmic characterization, a fixed point characterization of $k$-limited $α$-bisimilarity is first provided. Then $k$-limited $α$-bisimulation vector with $i$-th element being a $(k-i+1)$-limited $α$-bisimulation is introduced to investigate conditions for two states to be $k$-limited $α$-bisimilar, where $1\leq i\leq k+1$. Using these results, an $O(2k^2|V|^6\cdot\left|\lra\right|^2)$ algorithm is designed for computing the degree of similarity between two states, where $|V|$ is the number of states of the NFTS and $\left|\lra\right|$ is the greatest number of transitions from states. Finally, the relationship between $k$-limited $α$-bisimilar and $α$-bisimulation under $\widetilde{S}$ is showed, and by which, a logical characterization of $k$-limited $α$-bisimilarity is provided.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Fluid limit of a model for distributed ledger with random delay
Authors:
Jiewei Feng,
Christopher King
Abstract:
Blockchain and other decentralized databases, known as distributed ledgers, are designed to store information online where all trusted network members can update the data with transparency. The dynamics of ledger's development can be mathematically represented by a directed acyclic graph (DAG). In this paper, we study a DAG model which considers batch arrivals and random delay of attachment. We an…
▽ More
Blockchain and other decentralized databases, known as distributed ledgers, are designed to store information online where all trusted network members can update the data with transparency. The dynamics of ledger's development can be mathematically represented by a directed acyclic graph (DAG). In this paper, we study a DAG model which considers batch arrivals and random delay of attachment. We analyze the asymptotic behavior of this model by letting the arrival rate goes to infinity and the inter arrival time goes to zero. We establish that the number of leaves in the DAG and various random variables characterizing the vertices in the DAG can be approximated by its fluid limit, represented as delayed partial differential equations. Furthermore, we establish the stable state of this fluid limit and validate our findings through simulations.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Data-Driven Model Selections of Second-Order Particle Dynamics via Integrating Gaussian Processes with Low-Dimensional Interacting Structures
Authors:
Jinchao Feng,
Charles Kulick,
Sui Tang
Abstract:
In this paper, we focus on the data-driven discovery of a general second-order particle-based model that contains many state-of-the-art models for modeling the aggregation and collective behavior of interacting agents of similar size and body type. This model takes the form of a high-dimensional system of ordinary differential equations parameterized by two interaction kernels that appraise the al…
▽ More
In this paper, we focus on the data-driven discovery of a general second-order particle-based model that contains many state-of-the-art models for modeling the aggregation and collective behavior of interacting agents of similar size and body type. This model takes the form of a high-dimensional system of ordinary differential equations parameterized by two interaction kernels that appraise the alignment of positions and velocities. We propose a Gaussian Process-based approach to this problem, where the unknown model parameters are marginalized by using two independent Gaussian Process (GP) priors on latent interaction kernels constrained to dynamics and observational data. This results in a nonparametric model for interacting dynamical systems that accounts for uncertainty quantification. We also develop acceleration techniques to improve scalability. Moreover, we perform a theoretical analysis to interpret the methodology and investigate the conditions under which the kernels can be recovered. We demonstrate the effectiveness of the proposed approach on various prototype systems, including the selection of the order of the systems and the types of interactions. In particular, we present applications to modeling two real-world fish motion datasets that display flocking and milling patterns up to 248 dimensions. Despite the use of small data sets, the GP-based approach learns an effective representation of the nonlinear dynamics in these spaces and outperforms competitor methods.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Learning Collective Behaviors from Observation
Authors:
Jinchao Feng,
Ming Zhong
Abstract:
We present a comprehensive examination of learning methodologies employed for the structural identification of dynamical systems. These techniques are designed to elucidate emergent phenomena within intricate systems of interacting agents. Our approach not only ensures theoretical convergence guarantees but also exhibits computational efficiency when handling high-dimensional observational data. T…
▽ More
We present a comprehensive examination of learning methodologies employed for the structural identification of dynamical systems. These techniques are designed to elucidate emergent phenomena within intricate systems of interacting agents. Our approach not only ensures theoretical convergence guarantees but also exhibits computational efficiency when handling high-dimensional observational data. The methods adeptly reconstruct both first- and second-order dynamical systems, accommodating observation and stochastic noise, intricate interaction rules, absent interaction features, and real-world observations in agent systems. The foundational aspect of our learning methodologies resides in the formulation of tailored loss functions using the variational inverse problem approach, inherently equipping our methods with dimension reduction capabilities.
△ Less
Submitted 4 April, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Design and Analysis of Robust Ballistic Landings on the Secondary of a Binary Asteroid
Authors:
Iosto Fodde,
Jinglang Feng,
Massimiliano Vasile,
Jesús Gil-Fernández
Abstract:
ESA's Hera mission aims to visit binary asteroid Didymos in late 2026, investigating its physical characteristics and the result of NASA's impact by the DART spacecraft in more detail. Two CubeSats on-board Hera plan to perform a ballistic landing on the secondary of the system, called Dimorphos. For these types of landings the translational state during descent is not controlled, reducing the spa…
▽ More
ESA's Hera mission aims to visit binary asteroid Didymos in late 2026, investigating its physical characteristics and the result of NASA's impact by the DART spacecraft in more detail. Two CubeSats on-board Hera plan to perform a ballistic landing on the secondary of the system, called Dimorphos. For these types of landings the translational state during descent is not controlled, reducing the spacecrafts complexity but also increasing its sensitivity to deployment maneuver errors and dynamical uncertainties. This paper introduces a novel methodology to analyse the effect of these uncertainties on the dynamics of the lander and design a trajectory that is robust against them. This methodology consists of propagating the uncertain state of the lander using the non-intrusive Chebyshev interpolation (NCI) technique, which approximates the uncertain dynamics using a polynomial expansion, and analysing the results using the pseudo-diffusion indicator, derived from the coefficients of the polynomial expansion, which quantifies the rate of growth of the set of possible states of the spacecraft over time. This indicator is used here to constrain the impact velocity and angle to values which allow for successful settling on the surface. This information is then used to optimize the landing trajectory by applying the NCI technique inside the transcription of the problem. The resulting trajectory increases the robustness of the trajectory compared to a conventional method, improving the landing success by 20 percent and significantly reducing the landing footprint.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Three dimensional quotient singularity and 4d $\mathcal{N}=1$ AdS/CFT correspondence
Authors:
Yuanyuan Fang,
Jing Feng,
Dan Xie
Abstract:
We systematically study the AdS/CFT correspondence induced by D3 branes probing three dimensional Gorenstein quotient singularity $\mathbb{C}^3/G$. The field theory is given by the McKay quiver, which has a vanishing NSVZ beta function assuming that all the chiral fields have the $U(1)_R$ charge $\frac{2}{3}$. Various physical quantities such as quiver Hilbert series, superconformal index, central…
▽ More
We systematically study the AdS/CFT correspondence induced by D3 branes probing three dimensional Gorenstein quotient singularity $\mathbb{C}^3/G$. The field theory is given by the McKay quiver, which has a vanishing NSVZ beta function assuming that all the chiral fields have the $U(1)_R$ charge $\frac{2}{3}$. Various physical quantities such as quiver Hilbert series, superconformal index, central charges, etc are computed, which match exactly with those computed using the singularity. We also study the relevant deformation of those theories and find the dual geometry, therefore generate many new interesting AdS/CFT pairs. The quiver gauge theory defined using finite subgroups of $SO(3)$ group has some interesting features, for example, its Seiberg duality behavior is quite interesting.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
The Fallacy in the Paradox of Achilles and the Tortoise
Authors:
James Q. Feng
Abstract:
Zeno's ancient paradox depicts a race between swift Achilles and a slow tortoise with a head start. Zeno argued that Achilles could never overtake the tortoise, as at each step Achilles arrived at the tortoise's former position, the tortoise had already moved ahead. Though Zeno's premise is valid, his conclusion that Achilles can "never" pass the tortoise relies on equating infinite steps with an…
▽ More
Zeno's ancient paradox depicts a race between swift Achilles and a slow tortoise with a head start. Zeno argued that Achilles could never overtake the tortoise, as at each step Achilles arrived at the tortoise's former position, the tortoise had already moved ahead. Though Zeno's premise is valid, his conclusion that Achilles can "never" pass the tortoise relies on equating infinite steps with an infinite amount of time. By modeling the sequence of events in terms of a converging geometric series, this paper shows that such an infinite number of events sum up to a finite distance traversed in finite time. The paradox stems from confusion between an infinite number of events, which can happen in a finite time interval, and an infinite amount of time. The fallacy is clarified by recognizing that the infinite number of events can be crammed into a finite time interval. At a given speed difference after a finite amount of time, Achilles will have completed the infinite series of gaps at the "catch-up time" and passed the tortoise. Hence this paradox of Achilles and the tortoise can be resolved by simply adding "before the catch-up time" to the concluding statement of "Achilles would never overtake the tortoise".
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Almost sure one-endedness of a random graph model of distributed ledgers
Authors:
Jiewei Feng,
Christopher King,
Ken R. Duffy
Abstract:
Blockchain and other decentralized databases, known as distributed ledgers, are designed to store information online where all trusted network members can update the data with transparency. The dynamics of ledger's development can be mathematically represented by a directed acyclic graph (DAG). One essential property of a properly functioning shared ledger is that all network members holding a cop…
▽ More
Blockchain and other decentralized databases, known as distributed ledgers, are designed to store information online where all trusted network members can update the data with transparency. The dynamics of ledger's development can be mathematically represented by a directed acyclic graph (DAG). One essential property of a properly functioning shared ledger is that all network members holding a copy of the ledger agree on a sequence of information added to the ledger, which is referred to as consensus and is known to be related to a structural property of DAG called one-endedness. In this paper, we consider a model of distributed ledger with sequential stochastic arrivals that mimic attachment rules from the IOTA cryptocurrency. We first prove that the number of leaves in the random DAG is bounded by a constant infinitely often through the identification of a suitable martingale, and then prove that a sequence of specific events happens infinitely often. Combining those results we establish that, as time goes to infinity, the IOTA DAG is almost surely one-ended.
△ Less
Submitted 18 September, 2023; v1 submitted 14 September, 2023;
originally announced September 2023.
-
A Randomized Block Krylov Method for Tensor Train Approximation
Authors:
Gaohang Yu,
Jinhong Feng,
Zhongming Chen,
Xiaohao Cai,
Liqun Qi
Abstract:
Tensor train decomposition is a powerful tool for dealing with high-dimensional, large-scale tensor data, which is not suffering from the curse of dimensionality. To accelerate the calculation of the auxiliary unfolding matrix, some randomized algorithms have been proposed; however, they are not suitable for noisy data. The randomized block Krylov method is capable of dealing with heavy-tailed noi…
▽ More
Tensor train decomposition is a powerful tool for dealing with high-dimensional, large-scale tensor data, which is not suffering from the curse of dimensionality. To accelerate the calculation of the auxiliary unfolding matrix, some randomized algorithms have been proposed; however, they are not suitable for noisy data. The randomized block Krylov method is capable of dealing with heavy-tailed noisy data in the low-rank approximation of matrices. In this paper, we present a randomized algorithm for low-rank tensor train approximation of large-scale tensors based on randomized block Krylov subspace iteration and provide theoretical guarantees. Numerical experiments on synthetic and real-world tensor data demonstrate the effectiveness of the proposed algorithm.
△ Less
Submitted 7 August, 2023; v1 submitted 2 August, 2023;
originally announced August 2023.
-
Probabilistic computation and uncertainty quantification with emerging covariance
Authors:
Hengyuan Ma,
Yang Qi,
Li Zhang,
Wenlian Lu,
Jianfeng Feng
Abstract:
Building robust, interpretable, and secure AI system requires quantifying and representing uncertainty under a probabilistic perspective to mimic human cognitive abilities. However, probabilistic computation presents significant challenges for most conventional artificial neural network, as they are essentially implemented in a deterministic manner. In this paper, we develop an efficient probabili…
▽ More
Building robust, interpretable, and secure AI system requires quantifying and representing uncertainty under a probabilistic perspective to mimic human cognitive abilities. However, probabilistic computation presents significant challenges for most conventional artificial neural network, as they are essentially implemented in a deterministic manner. In this paper, we develop an efficient probabilistic computation framework by truncating the probabilistic representation of neural activation up to its mean and covariance and construct a moment neural network that encapsulates the nonlinear coupling between the mean and covariance of the underlying stochastic network. We reveal that when only the mean but not the covariance is supervised during gradient-based learning, the unsupervised covariance spontaneously emerges from its nonlinear coupling with the mean and faithfully captures the uncertainty associated with model predictions. Our findings highlight the inherent simplicity of probabilistic computation by seamlessly incorporating uncertainty into model prediction, paving the way for integrating it into large-scale AI systems.
△ Less
Submitted 12 January, 2024; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Minimizing the number of matchings of fixed size in a $K_s$-saturated graph
Authors:
Jiejing Feng,
Doudou Hei,
Xinmin Hou
Abstract:
For a fixed graph $F$, a graph $G$ is said to be $F$-saturated if $G$ does not contain a subgraph isomorphic to $F$ but does contain $F$ after the addition of any new edge.
Let $M_k$ be a matching consisting of $k$ edges and $S_{n,k}$ be the join graph of a complete graph $K_k$ and an empty graph $\overline{K_{n-k}}$. In this paper, we prove that for $s \geq3$ and $k\geq 2$, $S_{n,s-2}$ contains…
▽ More
For a fixed graph $F$, a graph $G$ is said to be $F$-saturated if $G$ does not contain a subgraph isomorphic to $F$ but does contain $F$ after the addition of any new edge.
Let $M_k$ be a matching consisting of $k$ edges and $S_{n,k}$ be the join graph of a complete graph $K_k$ and an empty graph $\overline{K_{n-k}}$. In this paper, we prove that for $s \geq3$ and $k\geq 2$, $S_{n,s-2}$ contains the minimum number of $M_k$ among all $n$-vertex $K_s$-saturated graphs for sufficiently large $n$, and when $k \leq s-2$, it is the unique extremal graph. In addition, we also show that $S_{n,1}$ is the unique extremal graph when $k=2$ and $s=3$.
△ Less
Submitted 6 November, 2022;
originally announced November 2022.
-
Decentralized Natural Policy Gradient with Variance Reduction for Collaborative Multi-Agent Reinforcement Learning
Authors:
Jinchi Chen,
Jie Feng,
Weiguo Gao,
Ke Wei
Abstract:
This paper studies a policy optimization problem arising from collaborative multi-agent reinforcement learning in a decentralized setting where agents communicate with their neighbors over an undirected graph to maximize the sum of their cumulative rewards. A novel decentralized natural policy gradient method, dubbed Momentum-based Decentralized Natural Policy Gradient (MDNPG), is proposed, which…
▽ More
This paper studies a policy optimization problem arising from collaborative multi-agent reinforcement learning in a decentralized setting where agents communicate with their neighbors over an undirected graph to maximize the sum of their cumulative rewards. A novel decentralized natural policy gradient method, dubbed Momentum-based Decentralized Natural Policy Gradient (MDNPG), is proposed, which incorporates natural gradient, momentum-based variance reduction, and gradient tracking into the decentralized stochastic gradient ascent framework. The $\mathcal{O}(n^{-1}ε^{-3})$ sample complexity for MDNPG to converge to an $ε$-stationary point has been established under standard assumptions, where $n$ is the number of agents. It indicates that MDNPG can achieve the optimal convergence rate for decentralized policy gradient methods and possesses a linear speedup in contrast to centralized optimization methods. Moreover, superior empirical performance of MDNPG over other state-of-the-art algorithms has been demonstrated by extensive numerical experiments.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Reflectors to quantales
Authors:
Xia Zhang,
Jan Paseka,
Jianjun Feng,
Yudong Chen
Abstract:
In this paper, we show that marked quantales have a reflection into quantales. To obtain the reflection we construct free quantales over marked quantales using appropriate lower sets.
A marked quantale is a posemigroup in which certain admissible subsets are required to have joins, and multiplication distributes over these. Sometimes are the admissible subsets in question specified by means of a…
▽ More
In this paper, we show that marked quantales have a reflection into quantales. To obtain the reflection we construct free quantales over marked quantales using appropriate lower sets.
A marked quantale is a posemigroup in which certain admissible subsets are required to have joins, and multiplication distributes over these. Sometimes are the admissible subsets in question specified by means of a so-called selection function. A distinguishing feature of the study of marked quantales is that a small collection of axioms of an elementary nature allows one to do much that is traditional at the level of quantales. The axioms are sufficiently general to include as examples of marked quantales the classes of posemigroups, $σ$-quantales, prequantales and quantales. Furthermore, we discuss another reflection to quantales obtained by the injective hull of a posemigroup.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
A note on the edge choosability of $K_{5}$-minor free graphs
Authors:
Jieru Feng,
Jianliang Wu,
Fan Yang
Abstract:
For a planar graph $G$, Borodin stated that $G$ is $(Δ+1)$-edge-choosable if $Δ\geq9$ and later Bonamy showed that $G$ is $9$-edge-choosable if $Δ=8$. At the same time, Borodin et al. proved that $G$ is $Δ$-edge-choosable if $Δ\geq12$. In the paper, we extend these results to $K_5$-minor free graphs.
For a planar graph $G$, Borodin stated that $G$ is $(Δ+1)$-edge-choosable if $Δ\geq9$ and later Bonamy showed that $G$ is $9$-edge-choosable if $Δ=8$. At the same time, Borodin et al. proved that $G$ is $Δ$-edge-choosable if $Δ\geq12$. In the paper, we extend these results to $K_5$-minor free graphs.
△ Less
Submitted 23 August, 2022;
originally announced August 2022.
-
Learning Interaction Variables and Kernels from Observations of Agent-Based Systems
Authors:
Jinchao Feng,
Mauro Maggioni,
Patrick Martin,
Ming Zhong
Abstract:
Dynamical systems across many disciplines are modeled as interacting particles or agents, with interaction rules that depend on a very small number of variables (e.g. pairwise distances, pairwise differences of phases, etc...), functions of the state of pairs of agents. Yet, these interaction rules can generate self-organized dynamics, with complex emergent behaviors (clustering, flocking, swarmin…
▽ More
Dynamical systems across many disciplines are modeled as interacting particles or agents, with interaction rules that depend on a very small number of variables (e.g. pairwise distances, pairwise differences of phases, etc...), functions of the state of pairs of agents. Yet, these interaction rules can generate self-organized dynamics, with complex emergent behaviors (clustering, flocking, swarming, etc.). We propose a learning technique that, given observations of states and velocities along trajectories of the agents, yields both the variables upon which the interaction kernel depends and the interaction kernel itself, in a nonparametric fashion. This yields an effective dimension reduction which avoids the curse of dimensionality from the high-dimensional observation data (states and velocities of all the agents). We demonstrate the learning capability of our method to a variety of first-order interacting systems.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
On a framework of data assimilation for neuronal networks
Authors:
Wenyong Zhang,
Boyu Chen,
Jianfeng Feng,
Wenlian Lu
Abstract:
When handling real-world data modeled by a complex network dynamical system, the number of the parameters is always even much more than the size of the data. Therefore, in many cases, it is impossible to estimate these parameters and however, the exact value of each parameter is frequently less interesting than the distribution of the parameters may contain important information towards understand…
▽ More
When handling real-world data modeled by a complex network dynamical system, the number of the parameters is always even much more than the size of the data. Therefore, in many cases, it is impossible to estimate these parameters and however, the exact value of each parameter is frequently less interesting than the distribution of the parameters may contain important information towards understanding the system and data. In this paper, we propose this question arising by employing a data assimilation approach to estimate the distribution of the parameters in the leakage-integrate-fire (LIF) neuronal network model from the experimental data, for example, the blood-oxygen-level-dependent (BOLD) signal. Herein, we assume that the parameters of the neurons and synapses are inhomogeneous but independently identical distributed following certain distribution with unknown hyperparameters. Thus, we estimate these hyperparameters of the distributions of the parameters, instead of estimating the parameters themselves. We formulate this problem under the framework of data assimilation and hierarchical Bayesian method, and present an efficient method named Hierarchical Data Assimilation (HDA) to conduct the statistical inference on the neuronal network model with the BOLD signal data simulated by the hemodynamic model. We consider the LIF neuronal networks with four synapses and show that the proposed algorithm can estimate the BOLD signals and the hyperparameters with good preciseness. In addition, we discuss the influence on the performance of the algorithm configuration and the LIF network model setup.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
Increasing rate of weighted product of partial quotients in continued fractions
Authors:
Ayreena Bakhtawar,
Jing Feng
Abstract:
Let $[a_1(x),a_2(x),\cdots,a_n(x),\cdots]$ be the continued fraction expansion of $x\in[0,1)$. In this paper, we study the increasing rate of the weighted product $a^{t_0}_n(x)a^{t_1}_{n+1}(x)\cdots a^{t_m}_{n+m}(x)$ ,where $t_i\in \mathbb{R}_+\ (0\leq i \leq m)$ are weights. More precisely, let $\varphi:\mathbb{N}\to\mathbb{R}_+$ be a function with $\varphi(n)/n\to \infty$ as $n\to \infty$. For a…
▽ More
Let $[a_1(x),a_2(x),\cdots,a_n(x),\cdots]$ be the continued fraction expansion of $x\in[0,1)$. In this paper, we study the increasing rate of the weighted product $a^{t_0}_n(x)a^{t_1}_{n+1}(x)\cdots a^{t_m}_{n+m}(x)$ ,where $t_i\in \mathbb{R}_+\ (0\leq i \leq m)$ are weights. More precisely, let $\varphi:\mathbb{N}\to\mathbb{R}_+$ be a function with $\varphi(n)/n\to \infty$ as $n\to \infty$. For any $(t_0,\cdots,t_m)\in \mathbb{R}^{m+1}_+$ with $t_i\geq 0$ and at least one $t_i\neq0 \ (0\leq i\leq m)$, the Hausdorff dimension of the set $$\underline{E}(\{t_i\}_{i=0}^m,\varphi)=\left\{x\in[0,1):\liminf\limits_{n\to \infty}\dfrac{\log \left(a^{t_0}_n(x)a^{t_1}_{n+1}(x)\cdots a^{t_m}_{n+m}(x)\right)}{\varphi(n)}=1\right\}$$ is obtained. Under the condition that $(t_0,\cdots,t_m)\in \mathbb{R}^{m+1}_+$ with $0<t_0\leq t_1\leq \cdots \leq t_m$, we also obtain the Hausdorff dimension of the set \begin{equation*} \overline{E}(\{t_i\}_{i=0}^m,\varphi)=\left\{x\in[0,1):\limsup\limits_{n\to \infty}\dfrac{\log \left(a^{t_0}_n(x)a^{t_1}_{n+1}(x)\cdots a^{t_m}_{n+m}(x)\right)}{\varphi(n)}=1\right\}.\end{equation*}
△ Less
Submitted 29 May, 2022;
originally announced May 2022.
-
Analytic Investigation for Spatio-temporal Patterns Propagation in Spiking Neural Networks
Authors:
Ning Hua,
Xiangnan He,
Wenlian Lu,
Jianfeng Feng
Abstract:
Based upon the moment closure approach, a Gaussian random field is constructed to quantitatively and analytically characterize the dynamics of a random point field. The approach provides us with a theoretical tool to investigate synchronized spike propagation in a feedforward or recurrent spiking neural network. We show that the balance between the excitation and inhibition postsynaptic potentials…
▽ More
Based upon the moment closure approach, a Gaussian random field is constructed to quantitatively and analytically characterize the dynamics of a random point field. The approach provides us with a theoretical tool to investigate synchronized spike propagation in a feedforward or recurrent spiking neural network. We show that the balance between the excitation and inhibition postsynaptic potentials is required for the occurrence of synfire chains. In particular, with a balanced network, the critical packet size of invasion and annihilation is observed. We also derive a sufficient analytic condition for the synchronization propagation in an asynchronous environment, which further allows us to disclose the possibility of spatial synaptic structure to sustain a stable synfire chain. Our findings are in good agreement with simulations and help us understand the propagation of spatio-temporal patterns in a random point flied.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
A Remark on Evolution Equation of Stochastic Logical Dynamic Systems
Authors:
Changxi Li,
Jun-e Feng,
Daizhan Cheng,
Xiao Zhang
Abstract:
Modelling is an essential procedure in analyzing and controlling a given logical dynamic system (LDS). It has been proved that deterministic LDS can be modeled as a linear-like system using algebraic state space representation. However, due to the inherently non-linear, it is difficult to obtain the algebraic expression of a stochastic LDS. This paper provides a unified framework for transition an…
▽ More
Modelling is an essential procedure in analyzing and controlling a given logical dynamic system (LDS). It has been proved that deterministic LDS can be modeled as a linear-like system using algebraic state space representation. However, due to the inherently non-linear, it is difficult to obtain the algebraic expression of a stochastic LDS. This paper provides a unified framework for transition analysis of LDSs with deterministic and stochastic dynamics. First, modelling of LDS with deterministic dynamics is reviewed. Then modeling of LDS with stochastic dynamics is considered, and non-equivalence between subsystems and global system is proposed. Next, the reason for the non-equivalence is provided. Finally, consistency condition is presented for independent model and conditional independent model.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
Fast solver for J2-perturbed Lambert problem using deep neural network
Authors:
Bin Yang,
Shuang Li,
Jinglang Feng,
Massimiliano Vasile
Abstract:
This paper presents a novel and fast solver for the J2-perturbed Lambert problem. The solver consists of an intelligent initial guess generator combined with a differential correction procedure. The intelligent initial guess generator is a deep neural network that is trained to correct the initial velocity vector coming from the solution of the unperturbed Lambert problem. The differential correct…
▽ More
This paper presents a novel and fast solver for the J2-perturbed Lambert problem. The solver consists of an intelligent initial guess generator combined with a differential correction procedure. The intelligent initial guess generator is a deep neural network that is trained to correct the initial velocity vector coming from the solution of the unperturbed Lambert problem. The differential correction module takes the initial guess and uses a forward shooting procedure to further update the initial velocity and exactly meet the terminal conditions. Eight sample forms are analyzed and compared to find the optimum form to train the neural network on the J2-perturbed Lambert problem. The accuracy and performance of this novel approach will be demonstrated on a representative test case: the solution of a multi-revolution J2-perturbed Lambert problem in the Jupiter system. We will compare the performance of the proposed approach against a classical standard shooting method and a homotopy-based perturbed Lambert algorithm. It will be shown that, for a comparable level of accuracy, the proposed method is significantly faster than the other two.
△ Less
Submitted 9 January, 2022;
originally announced January 2022.
-
Stabilization of continuous-time Markov/semi-Markov jump linear systems via finite data-rate feedback
Authors:
Jingyi Wang,
Jianwen Feng,
Chen Xu,
Xiaoqun Wu,
Jinhu Lü
Abstract:
This paper investigates almost sure exponential stabilization of continuous-time Markov jump linear systems (MJLSs) under communication data-rate constraints by introducing sampling and quantization into the feedback control. Different from previous works, the sampling times and the jump times are independent of each other in this paper. The quantization is recursively adjusted on the sampling tim…
▽ More
This paper investigates almost sure exponential stabilization of continuous-time Markov jump linear systems (MJLSs) under communication data-rate constraints by introducing sampling and quantization into the feedback control. Different from previous works, the sampling times and the jump times are independent of each other in this paper. The quantization is recursively adjusted on the sampling time, and its updating strategy does not depend on the switching in a sampling interval. In other words, the explicit value of the switching signal in a sampling interval is not necessary. The numerically testable condition is developed to ensure almost sure exponential stabilization of MJLSs under the proposed communication and control protocols. We also drop the assumption of stabilizability of all individual modes required in previous works about the switched systems. Moreover, we extend the result to the case of continuous-time semi-Markov jump linear systems (semi-MJLSs) via the semi-Markov kernel approach. Finally, some numerical examples are presented to illustrate the effectiveness of the proposed communication and control protocols.
△ Less
Submitted 28 October, 2021;
originally announced October 2021.
-
Model Uncertainty and Correctability for Directed Graphical Models
Authors:
Panagiota Birmpa,
Jinchao Feng,
Markos A. Katsoulakis,
Luc Rey-Bellet
Abstract:
Probabilistic graphical models are a fundamental tool in probabilistic modeling, machine learning and artificial intelligence. They allow us to integrate in a natural way expert knowledge, physical modeling, heterogeneous and correlated data and quantities of interest. For exactly this reason, multiple sources of model uncertainty are inherent within the modular structure of the graphical model. I…
▽ More
Probabilistic graphical models are a fundamental tool in probabilistic modeling, machine learning and artificial intelligence. They allow us to integrate in a natural way expert knowledge, physical modeling, heterogeneous and correlated data and quantities of interest. For exactly this reason, multiple sources of model uncertainty are inherent within the modular structure of the graphical model. In this paper we develop information-theoretic, robust uncertainty quantification methods and non-parametric stress tests for directed graphical models to assess the effect and the propagation through the graph of multi-sourced model uncertainties to quantities of interest. These methods allow us to rank the different sources of uncertainty and correct the graphical model by targeting its most impactful components with respect to the quantities of interest. Thus, from a machine learning perspective, we provide a mathematically rigorous approach to correctability that guarantees a systematic selection for improvement of components of a graphical model while controlling potential new errors created in the process in other parts of the model. We demonstrate our methods in two physico-chemical examples, namely quantum scale-informed chemical kinetics and materials screening to improve the efficiency of fuel cells.
△ Less
Submitted 17 July, 2021;
originally announced July 2021.
-
Learning particle swarming models from data with Gaussian processes
Authors:
Jinchao Feng,
Charles Kulick,
Yunxiang Ren,
Sui Tang
Abstract:
Interacting particle or agent systems that display a rich variety of swarming behaviours are ubiquitous in science and engineering. A fundamental and challenging goal is to understand the link between individual interaction rules and swarming. In this paper, we study the data-driven discovery of a second-order particle swarming model that describes the evolution of $N$ particles in $\mathbb{R}^d$…
▽ More
Interacting particle or agent systems that display a rich variety of swarming behaviours are ubiquitous in science and engineering. A fundamental and challenging goal is to understand the link between individual interaction rules and swarming. In this paper, we study the data-driven discovery of a second-order particle swarming model that describes the evolution of $N$ particles in $\mathbb{R}^d$ under radial interactions. We propose a learning approach that models the latent radial interaction function as Gaussian processes, which can simultaneously fulfill two inference goals: one is the nonparametric inference of {the} interaction function with pointwise uncertainty quantification, and the other one is the inference of unknown scalar parameters in the non-collective friction forces of the system. We formulate the learning problem as a statistical inverse problem and provide a detailed analysis of recoverability conditions, establishing that a coercivity condition is sufficient for recoverability. Given data collected from $M$ i.i.d trajectories with independent Gaussian observational noise, we provide a finite-sample analysis, showing that our posterior mean estimator converges in a Reproducing kernel Hilbert space norm, at an optimal rate in $M$ equal to the one in the classical 1-dimensional Kernel Ridge regression. As a byproduct, we show we can obtain a parametric learning rate in $M$ for the posterior marginal variance using $L^{\infty}$ norm, and the rate could also involve $N$ and $L$ (the number of observation time instances for each trajectory), depending on the condition number of the inverse problem. Numerical results on systems that exhibit different swarming behaviors demonstrate efficient learning of our approach from scarce noisy trajectory data.
△ Less
Submitted 7 March, 2023; v1 submitted 4 June, 2021;
originally announced June 2021.
-
State Feedback Stabilization of Generic Logic Systems via Ledley Antecedence Solution
Authors:
Yingzhe Jia,
Daizhan Cheng,
Jun-e Feng
Abstract:
In this paper, the application of Ledley antecedence solutions in designing state feedback stabilizers of generic logic systems has been proposed. To make the method feasible, two modifications are made to the original Ledley antecedence solution theory: (i) the preassigned logical functions have been extended from being a set of equations to an admissible set; (ii) the domain of arguments has bee…
▽ More
In this paper, the application of Ledley antecedence solutions in designing state feedback stabilizers of generic logic systems has been proposed. To make the method feasible, two modifications are made to the original Ledley antecedence solution theory: (i) the preassigned logical functions have been extended from being a set of equations to an admissible set; (ii) the domain of arguments has been extended from the whole state space to a restricted subset. In the proposed method, state feedback controls are considered as a set of extended Ledley antecedence solutions for a designed iterative admissible sets over their corresponding restricted subsets. Based on this, an algorithm has been proposed to verify the solvability, and simultaneously to provide all possible state feedback stabilizers when the problem is solvable. All stabilizers are optimal, which stabilize the logic systems from any initial state to the destination state/state set in the shortest time. The method is firstly demonstrated on Boolean control networks to achieve point stabilization. Then, with some minor modifications, the proposed method is also proven to be applicable to set stabilization problems. Finally, it is shown that in $k$-valued and mix-valued logical systems, the proposed method remains effective.
△ Less
Submitted 25 November, 2020;
originally announced November 2020.
-
The explicit formula of Hankel determinant with Catalan elements
Authors:
Jishe Feng
Abstract:
Applying Johann Cigler's Hankel determinant formula in terms of the binomial coefficient determinants, which is simplified from Christian Krattenthale's, we get an explicit formula of Hankel determinants for general. As far as I know, those are new results.
Applying Johann Cigler's Hankel determinant formula in terms of the binomial coefficient determinants, which is simplified from Christian Krattenthale's, we get an explicit formula of Hankel determinants for general. As far as I know, those are new results.
△ Less
Submitted 16 October, 2020; v1 submitted 13 October, 2020;
originally announced October 2020.
-
Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning
Authors:
Pan Zhou,
Jiashi Feng,
Chao Ma,
Caiming Xiong,
Steven Hoi,
Weinan E
Abstract:
It is not clear yet why ADAM-alike adaptive gradient algorithms suffer from worse generalization performance than SGD despite their faster training speed. This work aims to provide understandings on this generalization gap by analyzing their local convergence behaviors. Specifically, we observe the heavy tails of gradient noise in these algorithms. This motivates us to analyze these algorithms thr…
▽ More
It is not clear yet why ADAM-alike adaptive gradient algorithms suffer from worse generalization performance than SGD despite their faster training speed. This work aims to provide understandings on this generalization gap by analyzing their local convergence behaviors. Specifically, we observe the heavy tails of gradient noise in these algorithms. This motivates us to analyze these algorithms through their Levy-driven stochastic differential equations (SDEs) because of the similar convergence behaviors of an algorithm and its SDE. Then we establish the escaping time of these SDEs from a local basin. The result shows that (1) the escaping time of both SGD and ADAM~depends on the Radon measure of the basin positively and the heaviness of gradient noise negatively; (2) for the same basin, SGD enjoys smaller escaping time than ADAM, mainly because (a) the geometry adaptation in ADAM~via adaptively scaling each gradient coordinate well diminishes the anisotropic structure in gradient noise and results in larger Radon measure of a basin; (b) the exponential gradient average in ADAM~smooths its gradient and leads to lighter gradient noise tails than SGD. So SGD is more locally unstable than ADAM~at sharp minima defined as the minima whose local basins have small Radon measure, and can better escape from them to flatter ones with larger Radon measure. As flat minima here which often refer to the minima at flat or asymmetric basins/valleys often generalize better than sharp ones , our result explains the better generalization performance of SGD over ADAM. Finally, experimental results confirm our heavy-tailed gradient noise assumption and theoretical affirmation.
△ Less
Submitted 28 November, 2021; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Scalable Deep Reinforcement Learning for Ride-Hailing
Authors:
Jiekun Feng,
Mark Gluzman,
J. G. Dai
Abstract:
Ride-hailing services, such as Didi Chuxing, Lyft, and Uber, arrange thousands of cars to meet ride requests throughout the day. We consider a Markov decision process (MDP) model of a ride-hailing service system, framing it as a reinforcement learning (RL) problem. The simultaneous control of many agents (cars) presents a challenge for the MDP optimization because the action space grows exponentia…
▽ More
Ride-hailing services, such as Didi Chuxing, Lyft, and Uber, arrange thousands of cars to meet ride requests throughout the day. We consider a Markov decision process (MDP) model of a ride-hailing service system, framing it as a reinforcement learning (RL) problem. The simultaneous control of many agents (cars) presents a challenge for the MDP optimization because the action space grows exponentially with the number of cars. We propose a special decomposition for the MDP actions by sequentially assigning tasks to the drivers. The new actions structure resolves the scalability problem and enables the use of deep RL algorithms for control policy optimization. We demonstrate the benefit of our proposed decomposition with a numerical experiment based on real data from Didi Chuxing.
△ Less
Submitted 27 September, 2020;
originally announced September 2020.
-
The edge colorings of $K_{5}$-minor free graphs
Authors:
Jieru Feng,
Yuping Gao,
Jianliang Wu
Abstract:
In 1965, Vizing proved that every planar graph $G$ with maximum degree $Δ\geq 8$ is edge $Δ$-colorable. It is also proved that every planar graph $G$ with maximum degree $Δ=7$ is edge $Δ$-colorable by Sanders and Zhao, independently by Zhang. In this paper, we extend the above results by showing that every $K_5$-minor free graph with maximum degree $Δ$ at least seven is edge $Δ$-colorable.
In 1965, Vizing proved that every planar graph $G$ with maximum degree $Δ\geq 8$ is edge $Δ$-colorable. It is also proved that every planar graph $G$ with maximum degree $Δ=7$ is edge $Δ$-colorable by Sanders and Zhao, independently by Zhang. In this paper, we extend the above results by showing that every $K_5$-minor free graph with maximum degree $Δ$ at least seven is edge $Δ$-colorable.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
Matrix Expression of Finite Boolean-type Algebras
Authors:
Daizhan Cheng,
Jun-e Feng,
Jianli Zhao,
Shihua Fu
Abstract:
Boolean-type algebra (BTA) is investigated. A BTA is decomposed into Boolean-type lattice (BTL) and a complementation algebra (CA). When the object set is finite, the matrix expressions of BTL and CA (and then BTA) are presented. The construction and certain properties of BTAs are investigated via their matrix expression, including the homomorphism and isomorphism, etc. Then the product/decomposit…
▽ More
Boolean-type algebra (BTA) is investigated. A BTA is decomposed into Boolean-type lattice (BTL) and a complementation algebra (CA). When the object set is finite, the matrix expressions of BTL and CA (and then BTA) are presented. The construction and certain properties of BTAs are investigated via their matrix expression, including the homomorphism and isomorphism, etc. Then the product/decomposition of BTLs are considered. A necessary and sufficient condition for decomposition of BTA is obtained. Finally, a universal generator is provided for arbitrary finite universal algebras.
△ Less
Submitted 16 September, 2019;
originally announced September 2019.
-
Two deformed Pascal's triangles and its new properties
Authors:
Jishe Feng,
Cunqin Shi,
Huani Zhao
Abstract:
In this paper, firstly, by a determinant of deformed Pascal's triangle, namely the normalized Hessenberg matrix determinant, to count Dyck paths, we give another combinatorial proof of the theorems which are of Catalan numbers determinant representations and the recurrence formula. Secondly, a determinant of normalized Toeplitz-Hessenberg matrix, whose entries are binomials, arising in power serie…
▽ More
In this paper, firstly, by a determinant of deformed Pascal's triangle, namely the normalized Hessenberg matrix determinant, to count Dyck paths, we give another combinatorial proof of the theorems which are of Catalan numbers determinant representations and the recurrence formula. Secondly, a determinant of normalized Toeplitz-Hessenberg matrix, whose entries are binomials, arising in power series, we derive new four properties of Pascal's triangle.
△ Less
Submitted 26 September, 2020; v1 submitted 2 September, 2019;
originally announced September 2019.
-
Congruences on Orthogonal Rook Monoids and Symplectic Rook Monoids
Authors:
Jianqiang Feng,
Zhenheng Li
Abstract:
We give a complete classification of all nonuniform congruences on orthogonal rook monoids and symplectic rook monoids. We find that there are four kinds of nonuniform congruences on the orthogonal rook monoids ${OR}_n$ for even $n\ne 4$, and we describe each kind of the congruences explicitly in terms of normal subgroups of maximal subgroups. We also find that if $n = 4$, there are six kinds of n…
▽ More
We give a complete classification of all nonuniform congruences on orthogonal rook monoids and symplectic rook monoids. We find that there are four kinds of nonuniform congruences on the orthogonal rook monoids ${OR}_n$ for even $n\ne 4$, and we describe each kind of the congruences explicitly in terms of normal subgroups of maximal subgroups. We also find that if $n = 4$, there are six kinds of nonuniform congruences on ${OR}_4$, and we describe these congruences using both $\mathcal{H}$-relations and certain normal subgroups of some maximal subgroups. In contrast, we find that there is only one kind of congruences on the symplectic rook monoids for all even $n\ge 2.$
△ Less
Submitted 6 June, 2019;
originally announced June 2019.
-
A Hamilton-Jacobi PDE associated with hydrodynamic fluctuations from a nonlinear diffusion equation
Authors:
Jin Feng,
Toshio Mikami,
Johannes Zimmer
Abstract:
We study a class of Hamilton-Jacobi partial differential equations in the space of probability measures. In the first part of this paper, we prove comparison principles (implying uniqueness) for this class. In the second part, we establish the existence of a solution and give a representation using a family of partial differential equations with control. A large part of our analysis exploits speci…
▽ More
We study a class of Hamilton-Jacobi partial differential equations in the space of probability measures. In the first part of this paper, we prove comparison principles (implying uniqueness) for this class. In the second part, we establish the existence of a solution and give a representation using a family of partial differential equations with control. A large part of our analysis exploits special structures of the Hamiltonian, which might look mysterious at first sight. However, we show that this Hamiltonian structure arises naturally as limit of Hamiltonians of microscopical models. Indeed, in the third part of this paper, we informally derive the Hamiltonian studied before, in a context of fluctuation theory on the hydrodynamic scale. The analysis is carried out for a specific model of stochastic interacting particles in gas kinetics, namely a version of the Carleman model. We use a two-scale averaging method on Hamiltonians defined in the space of probability measures to derive the limiting Hamiltonian.
△ Less
Submitted 3 May, 2021; v1 submitted 28 February, 2019;
originally announced March 2019.
-
Synchronization of Networked Harmonic Oscillators via Quantized Sampled Velocity Feedback
Authors:
Jingyi Wang,
Jianwen Feng,
Yijun Lou,
Guanrong Chen
Abstract:
In this technical note, we propose a practicable quantized sampled velocity data coupling protocol for synchronization of a set of harmonic oscillators. The coupling protocol is designed in a quantized way via interconnecting the velocities encoded by a uniform quantizer with a zooming parameter in either a fixed or an adjustable form over a directed communication network. We establish sufficient…
▽ More
In this technical note, we propose a practicable quantized sampled velocity data coupling protocol for synchronization of a set of harmonic oscillators. The coupling protocol is designed in a quantized way via interconnecting the velocities encoded by a uniform quantizer with a zooming parameter in either a fixed or an adjustable form over a directed communication network. We establish sufficient conditions for the networked harmonic oscillators to converge to a bounded neighborhood of the synchronized orbits with a fixed zooming parameter. We ensure the oscillators to achieve synchronization by designing the quantized coupling protocol with an adjustable zooming parameter. Finally, we show two numerical examples to illustrate the effectiveness of the proposed coupling protocol.
△ Less
Submitted 26 February, 2019;
originally announced February 2019.
-
Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds
Authors:
Pan Zhou,
Xiao-Tong Yuan,
Jiashi Feng
Abstract:
SPIDER (Stochastic Path Integrated Differential EstimatoR) is an efficient gradient estimation technique developed for non-convex stochastic optimization. Although having been shown to attain nearly optimal computational complexity bounds, the SPIDER-type methods are limited to linear metric spaces. In this paper, we introduce the Riemannian SPIDER (R-SPIDER) method as a novel nonlinear-metric ext…
▽ More
SPIDER (Stochastic Path Integrated Differential EstimatoR) is an efficient gradient estimation technique developed for non-convex stochastic optimization. Although having been shown to attain nearly optimal computational complexity bounds, the SPIDER-type methods are limited to linear metric spaces. In this paper, we introduce the Riemannian SPIDER (R-SPIDER) method as a novel nonlinear-metric extension of SPIDER for efficient non-convex optimization on Riemannian manifolds. We prove that for finite-sum problems with $n$ components, R-SPIDER converges to an $ε$-accuracy stationary point within $\mathcal{O}\big(\min\big(n+\frac{\sqrt{n}}{ε^2},\frac{1}{ε^3}\big)\big)$ stochastic gradient evaluations, which is sharper in magnitude than the prior Riemannian first-order methods. For online optimization, R-SPIDER is shown to converge with $\mathcal{O}\big(\frac{1}{ε^3}\big)$ complexity which is, to the best of our knowledge, the first non-asymptotic result for online Riemannian optimization. Especially, for gradient dominated functions, we further develop a variant of R-SPIDER and prove its linear convergence rate. Numerical results demonstrate the computational efficiency of the proposed methods.
△ Less
Submitted 23 November, 2018; v1 submitted 20 November, 2018;
originally announced November 2018.
-
The Hessenberg matrices and Catalan and its generalized numbers
Authors:
Jishe Feng
Abstract:
We present determinantal representations of the Catalan numbers, k-Fuss-Catalan numbers, and its generalized number. The entries of the normalized Hessenberg matrices are the binomial coefficients that related with the enumeration of lattice paths.
We present determinantal representations of the Catalan numbers, k-Fuss-Catalan numbers, and its generalized number. The entries of the normalized Hessenberg matrices are the binomial coefficients that related with the enumeration of lattice paths.
△ Less
Submitted 22 October, 2018;
originally announced October 2018.
-
Feedback pinning control of collective behaviors aroused by epidemic spread on complex networks
Authors:
Pan Yang,
Zhongpu Xu,
Jianwen Feng,
Xinchu Fu
Abstract:
This paper investigates epidemic control behavioral synchronization for a class of complex networks resulting from spread of epidemic diseases via pinning feedback control strategy. Based on the quenched mean field theory, epidemic control synchronization models with inhibition of contact behavior is constructed, combining with the epidemic transmission system and the complex dynamical network car…
▽ More
This paper investigates epidemic control behavioral synchronization for a class of complex networks resulting from spread of epidemic diseases via pinning feedback control strategy. Based on the quenched mean field theory, epidemic control synchronization models with inhibition of contact behavior is constructed, combining with the epidemic transmission system and the complex dynamical network carrying extra controllers. By the properties of convex functions and Gerschgorin theorem, the epidemic threshold of the model is obtained, and the global stability of disease-free equilibrium is analyzed. For individual's infected situation, when epidemic spreads, two types of feedback control strategies depended on the diseases' information are designed: the one only adds controllers to infected individuals, the other adds controllers both to infected and susceptible ones. And by using Lyapunov stability theory, under designed controllers, some criteria that guarantee epidemic control synchronization system achieving behavior synchronization are also derived. Several numerical simulations are performed to show the effectiveness of our theoretical results. As far as we know, this is the first work to address the controlling behavioral synchronization induced by epidemic spreading under the pinning feedback mechanism. It is hopeful that we may have more deeper insight into the essence between disease's spreading and collective behavior controlling in complex dynamical networks.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.
-
Understanding Generalization and Optimization Performance of Deep CNNs
Authors:
Pan Zhou,
Jiashi Feng
Abstract:
This work aims to provide understandings on the remarkable success of deep convolutional neural networks (CNNs) by theoretically analyzing their generalization performance and establishing optimization guarantees for gradient descent based training algorithms. Specifically, for a CNN model consisting of $l$ convolutional layers and one fully connected layer, we prove that its generalization error…
▽ More
This work aims to provide understandings on the remarkable success of deep convolutional neural networks (CNNs) by theoretically analyzing their generalization performance and establishing optimization guarantees for gradient descent based training algorithms. Specifically, for a CNN model consisting of $l$ convolutional layers and one fully connected layer, we prove that its generalization error is bounded by $\mathcal{O}(\sqrt{\dt\widetilde{\varrho}/n})$ where $θ$ denotes freedom degree of the network parameters and $\widetilde{\varrho}=\mathcal{O}(\log(\prod_{i=1}^{l}\rwi{i} (\ki{i}-\si{i}+1)/p)+\log(\rf))$ encapsulates architecture parameters including the kernel size $\ki{i}$, stride $\si{i}$, pooling size $p$ and parameter magnitude $\rwi{i}$. To our best knowledge, this is the first generalization bound that only depends on $\mathcal{O}(\log(\prod_{i=1}^{l+1}\rwi{i}))$, tighter than existing ones that all involve an exponential term like $\mathcal{O}(\prod_{i=1}^{l+1}\rwi{i})$. Besides, we prove that for an arbitrary gradient descent algorithm, the computed approximate stationary point by minimizing empirical risk is also an approximate stationary point to the population risk. This well explains why gradient descent training algorithms usually perform sufficiently well in practice. Furthermore, we prove the one-to-one correspondence and convergence guarantees for the non-degenerate stationary points between the empirical and population risks. It implies that the computed local minimum for the empirical risk is also close to a local minimum for the population risk, thus ensuring the good generalization performance of CNNs.
△ Less
Submitted 28 May, 2018;
originally announced May 2018.