-
Local entropy theory, combinatorics, and local theory of Banach spaces
Authors:
Hanfeng Li,
Kairan Liu
Abstract:
Each continuous action of a countably infinite discrete group $Γ$ on a compact metrizable space X induces a continuous action of $Γ$ on the space M(X) of Borel probability measures on X. We compare the local entropy theory for these two actions, and describe the relation between their IE-tuples. Several other types of tuples are also studied. Our main tool is a new combinatorial lemma. We also giv…
▽ More
Each continuous action of a countably infinite discrete group $Γ$ on a compact metrizable space X induces a continuous action of $Γ$ on the space M(X) of Borel probability measures on X. We compare the local entropy theory for these two actions, and describe the relation between their IE-tuples. Several other types of tuples are also studied. Our main tool is a new combinatorial lemma. We also give an application of the combinatorial lemma to the local theory of Banach spaces.
△ Less
Submitted 4 July, 2025;
originally announced July 2025.
-
Moments, Time-Inversion and Source Identification for the Heat Equation
Authors:
Kang Liu,
Enrique Zuazua
Abstract:
We address the initial source identification problem for the heat equation, a notably ill-posed inverse problem characterized by exponential instability. Departing from classical Tikhonov regularization, we propose a novel approach based on moment analysis of the heat flow, transforming the problem into a more stable inverse moment formulation. By evolving the measured terminal time moments backwa…
▽ More
We address the initial source identification problem for the heat equation, a notably ill-posed inverse problem characterized by exponential instability. Departing from classical Tikhonov regularization, we propose a novel approach based on moment analysis of the heat flow, transforming the problem into a more stable inverse moment formulation. By evolving the measured terminal time moments backward through their governing ODE system, we recover the moments of the initial distribution. We then reconstruct the source by solving a convex optimization problem that minimizes the total variation of a measure subject to these moment constraints. This formulation naturally promotes sparsity, yielding atomic solutions that are sums of Dirac measures. Compared to existing methods, our moment-based approach reduces exponential error growth to polynomial growth with respect to the terminal time. We provide explicit error estimates on the recovered initial distributions in terms of moment order, terminal time, and measurement errors. In addition, we develop efficient numerical discretization schemes and demonstrate significant stability improvements of our approach through comprehensive numerical experiments.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
Zero Divisor Manifolds
Authors:
Keqin Liu
Abstract:
We develop the basic properties of $R^{(2)}$-modules, introduce the concept of zero divisor manifolds, construct projective $R^{(2)}$-space which generalizes the real projective space, and initiate the study of the counterpart of symplectic spaces
We develop the basic properties of $R^{(2)}$-modules, introduce the concept of zero divisor manifolds, construct projective $R^{(2)}$-space which generalizes the real projective space, and initiate the study of the counterpart of symplectic spaces
△ Less
Submitted 12 May, 2025;
originally announced May 2025.
-
Learning based convex approximation for constrained parametric optimization
Authors:
Kang Liu,
Wei Peng,
Jianchen Hu
Abstract:
We propose an input convex neural network (ICNN)-based self-supervised learning framework to solve continuous constrained optimization problems. By integrating the augmented Lagrangian method (ALM) with the constraint correction mechanism, our framework ensures \emph{non-strict constraint feasibility}, \emph{better optimality gap}, and \emph{best convergence rate} with respect to the state-of-the-…
▽ More
We propose an input convex neural network (ICNN)-based self-supervised learning framework to solve continuous constrained optimization problems. By integrating the augmented Lagrangian method (ALM) with the constraint correction mechanism, our framework ensures \emph{non-strict constraint feasibility}, \emph{better optimality gap}, and \emph{best convergence rate} with respect to the state-of-the-art learning-based methods. We provide a rigorous convergence analysis, showing that the algorithm converges to a Karush-Kuhn-Tucker (KKT) point of the original problem even when the internal solver is a neural network, and the approximation error is bounded. We test our approach on a range of benchmark tasks including quadratic programming (QP), nonconvex programming, and large-scale AC optimal power flow problems. The results demonstrate that compared to existing solvers (e.g., \texttt{OSQP}, \texttt{IPOPT}) and the latest learning-based methods (e.g., DC3, PDL), our approach achieves a superior balance among accuracy, feasibility, and computational efficiency.
△ Less
Submitted 6 May, 2025;
originally announced May 2025.
-
Output regulation for a reaction-diffusion system with input delay and unknown frequency
Authors:
Shen Wang,
Zhong-Jie Han,
Kai Liu,
Zhi-Xue Zhao
Abstract:
This study solves the output regulation problem for a reaction-diffusion system confronting concurrent input delay and fully unidentified disturbances (encompassing both unknown frequencies and amplitudes) across all channels. The principal innovation emerges from a novel adaptive control architecture that synergizes the modal decomposition technique with a dual-observer mechanism, enabling real-t…
▽ More
This study solves the output regulation problem for a reaction-diffusion system confronting concurrent input delay and fully unidentified disturbances (encompassing both unknown frequencies and amplitudes) across all channels. The principal innovation emerges from a novel adaptive control architecture that synergizes the modal decomposition technique with a dual-observer mechanism, enabling real-time concurrent estimation of unmeasurable system states and disturbances through a state observer and an adaptive disturbance estimator. Unlike existing approaches limited to either delay compensation or partial disturbance rejection, our methodology overcomes the technical barrier of coordinating these two requirements through a rigorously constructed tracking-error-based controller, achieving exponential convergence of system output to reference signals. Numerical simulations are presented to validate the effectiveness of the proposed output feedback control strategy.
△ Less
Submitted 20 April, 2025;
originally announced April 2025.
-
A novel semi-analytical multiple invariants-preserving integrator for conservative PDEs
Authors:
Wei Shi,
Xun Lu,
Kai Liu,
Bin Wang
Abstract:
Many conservative partial differential equations such as the Korteweg-de Vries (KdV) equation, and the nonlinear Schrödinger equations, the Klein-Gordon equation have more than one invariant functionals. In this paper, we propose the definition of the discrete variational derivative, based on which, a novel semi-analytical multiple invariants-preserving integrator for the conservative partial diff…
▽ More
Many conservative partial differential equations such as the Korteweg-de Vries (KdV) equation, and the nonlinear Schrödinger equations, the Klein-Gordon equation have more than one invariant functionals. In this paper, we propose the definition of the discrete variational derivative, based on which, a novel semi-analytical multiple invariants-preserving integrator for the conservative partial differential equations is constructed by projection technique. The proposed integrators are shown to have the same order of accuracy as the underlying integrators. For applications, some concrete mass-momentum-energy-preserving integrators are derived for the KdV equation.
△ Less
Submitted 1 April, 2025;
originally announced April 2025.
-
Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium
Authors:
Kaizhao Liu,
Qi Long,
Zhekun Shi,
Weijie J. Su,
Jiancong Xiao
Abstract:
Aligning large language models (LLMs) with diverse human preferences is critical for ensuring fairness and informed outcomes when deploying these models for decision-making. In this paper, we seek to uncover fundamental statistical limits concerning aligning LLMs with human preferences, with a focus on the probabilistic representation of human preferences and the preservation of diverse preference…
▽ More
Aligning large language models (LLMs) with diverse human preferences is critical for ensuring fairness and informed outcomes when deploying these models for decision-making. In this paper, we seek to uncover fundamental statistical limits concerning aligning LLMs with human preferences, with a focus on the probabilistic representation of human preferences and the preservation of diverse preferences in aligned LLMs. We first show that human preferences can be represented by a reward model if and only if the preference among LLM-generated responses is free of any Condorcet cycle. Moreover, we prove that Condorcet cycles exist with probability converging to one exponentially fast under a probabilistic preference model, thereby demonstrating the impossibility of fully aligning human preferences using reward-based approaches such as reinforcement learning from human feedback. Next, we explore the conditions under which LLMs would employ mixed strategies -- meaning they do not collapse to a single response -- when aligned in the limit using a non-reward-based approach, such as Nash learning from human feedback (NLHF). We identify a necessary and sufficient condition for mixed strategies: the absence of a response that is preferred over all others by a majority. As a blessing, we prove that this condition holds with high probability under the probabilistic preference model, thereby highlighting the statistical possibility of preserving minority preferences without explicit regularization in aligning LLMs. Finally, we leverage insights from our statistical results to design a novel, computationally efficient algorithm for finding Nash equilibria in aligning LLMs with NLHF. Our experiments show that Llama-3.2-1B, aligned with our algorithm, achieves a win rate of 60.55\% against the base model.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Generalized toric codes on twisted tori for quantum error correction
Authors:
Zijian Liang,
Ke Liu,
Hao Song,
Yu-An Chen
Abstract:
The Kitaev toric code is widely considered one of the leading candidates for error correction in fault-tolerant quantum computation. However, direct methods to increase its logical dimensions, such as lattice surgery or introducing punctures, often incur prohibitive overheads. In this work, we introduce a ring-theoretic approach for efficiently analyzing topological CSS codes in two dimensions, en…
▽ More
The Kitaev toric code is widely considered one of the leading candidates for error correction in fault-tolerant quantum computation. However, direct methods to increase its logical dimensions, such as lattice surgery or introducing punctures, often incur prohibitive overheads. In this work, we introduce a ring-theoretic approach for efficiently analyzing topological CSS codes in two dimensions, enabling the exploration of generalized toric codes with larger logical dimensions on twisted tori. Using Gröbner bases, we simplify stabilizer syndromes to efficiently identify anyon excitations and their geometric periodicities, even under twisted periodic boundary conditions. Since the properties of the codes are determined by the anyons, this approach allows us to directly compute the logical dimensions without constructing large parity-check matrices. Our approach provides a unified method for finding new quantum error-correcting codes and exhibiting their underlying topological orders via the Laurent polynomial ring. This framework naturally applies to bivariate bicycle codes. For example, we construct optimal weight-6 generalized toric codes on twisted tori with parameters $[[ n, k, d ]]$ for $n \leq 400$, yielding novel codes such as $[[120,8,12]]$, $[[186,10,14]]$, $[[210,10,16]]$, $[[248, 10, 18]]$, $[[254, 14, 16]]$, $[[294, 10, 20]]$, $[[310, 10, \leq 22]]$, and $[[340, 16, 18]]$. Moreover, we present a new realization of the $[[360, 12, \leq 24]]$ quantum code using the $(3,3)$-bivariate bicycle code on a twisted torus defined by the basis vectors $(0,30)$ and $(6,6)$, improving stabilizer locality relative to the previous construction. These results highlight the power of the topological order perspective in advancing the design and theoretical understanding of quantum low-density parity-check (LDPC) codes.
△ Less
Submitted 18 June, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
-
Descents and flag major index on conjugacy classes of colored permutation groups without short cycles
Authors:
Kevin Liu,
Mei Yin
Abstract:
We consider the descent and flag major index statistics on the colored permutation groups, which are wreath products of the form $\mathfrak{S}_{n,r}=\mathbb{Z}_r\wr \mathfrak{S}_n$. We show that the $k$-th moments of these statistics on $\mathfrak{S}_{n,r}$ will coincide with the corresponding moments on all conjugacy classes without cycles of lengths $1,2,\ldots,2k$. Using this, we establish the…
▽ More
We consider the descent and flag major index statistics on the colored permutation groups, which are wreath products of the form $\mathfrak{S}_{n,r}=\mathbb{Z}_r\wr \mathfrak{S}_n$. We show that the $k$-th moments of these statistics on $\mathfrak{S}_{n,r}$ will coincide with the corresponding moments on all conjugacy classes without cycles of lengths $1,2,\ldots,2k$. Using this, we establish the asymptotic normality of the descent and flag major index statistics on conjugacy classes of $\mathfrak{S}_{n,r}$ with sufficiently long cycles. Our results generalize prior work of Fulman involving the descent and major index statistics on the symmetric group $\mathfrak{S}_n$. Our methods involve an intricate extension of Fulman's work on $\mathfrak{S}_n$ combined with the theory of the degree for a colored permutation statistic, as introduced by Campion Loth, Levet, Liu, Sundaram, and Yin.
△ Less
Submitted 4 March, 2025;
originally announced March 2025.
-
Moment Monotonicity of Weibull, Gamma and Log-normal Distributions
Authors:
Kang Liu
Abstract:
This paper investigates the moment monotonicity property of Weibull, Gamma, and Log-normal distributions. We provide the first complete mathematical proofs for the monotonicity of the function $E(X^n)^{\frac{1}{n}}$ specific to these distributions. Through the derivations, we identify a key property: in many cases, one of the two parameters defining each distribution can be effectively canceled ou…
▽ More
This paper investigates the moment monotonicity property of Weibull, Gamma, and Log-normal distributions. We provide the first complete mathematical proofs for the monotonicity of the function $E(X^n)^{\frac{1}{n}}$ specific to these distributions. Through the derivations, we identify a key property: in many cases, one of the two parameters defining each distribution can be effectively canceled out. This finding opens up opportunities for improved parameter estimation of these random variables. Our results contribute to a deeper understanding of the behavior of these widely used distributions and offer valuable insights for applications in fields such as reliability engineering, econometrics, and machine learning.
△ Less
Submitted 16 February, 2025;
originally announced February 2025.
-
Parametric Scaling Law of Tuning Bias in Conformal Prediction
Authors:
Hao Zeng,
Kangdao Liu,
Bingyi Jing,
Hongxin Wei
Abstract:
Conformal prediction is a popular framework of uncertainty quantification that constructs prediction sets with coverage guarantees. To uphold the exchangeability assumption, many conformal prediction methods necessitate an additional holdout set for parameter tuning. Yet, the impact of violating this principle on coverage remains underexplored, making it ambiguous in practical applications. In thi…
▽ More
Conformal prediction is a popular framework of uncertainty quantification that constructs prediction sets with coverage guarantees. To uphold the exchangeability assumption, many conformal prediction methods necessitate an additional holdout set for parameter tuning. Yet, the impact of violating this principle on coverage remains underexplored, making it ambiguous in practical applications. In this work, we empirically find that the tuning bias - the coverage gap introduced by leveraging the same dataset for tuning and calibration, is negligible for simple parameter tuning in many conformal prediction methods. In particular, we observe the scaling law of the tuning bias: this bias increases with parameter space complexity and decreases with calibration set size. Formally, we establish a theoretical framework to quantify the tuning bias and provide rigorous proof for the scaling law of the tuning bias by deriving its upper bound. In the end, we discuss how to reduce the tuning bias, guided by the theories we developed.
△ Less
Submitted 10 July, 2025; v1 submitted 5 February, 2025;
originally announced February 2025.
-
Reconstruction of caterpillar tanglegrams
Authors:
Ann Clifton,
Eva Czabarka,
Kevin Liu,
Sarah Loeb,
Utku Okur,
Laszlo Szekely,
Kristina Wicke
Abstract:
A tanglegram consists of two rooted binary trees with the same number of leaves and a perfect matching between the leaves of the trees. Given a size-$n$ tanglegram, i.e., a tanglegram for two trees with $n$ leaves, a multiset of induced size-$(n-1)$ tanglegrams is obtained by deleting a pair of matched leaves in every possible way. Here, we analyze whether a size-$n$ tanglegram is uniquely encoded…
▽ More
A tanglegram consists of two rooted binary trees with the same number of leaves and a perfect matching between the leaves of the trees. Given a size-$n$ tanglegram, i.e., a tanglegram for two trees with $n$ leaves, a multiset of induced size-$(n-1)$ tanglegrams is obtained by deleting a pair of matched leaves in every possible way. Here, we analyze whether a size-$n$ tanglegram is uniquely encoded by this multiset of size-$(n-1)$ tanglegrams. We answer this question affirmatively in the case that at least one of the two trees of the tanglegram is a caterpillar tree.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
Higher Weil-Petersson volumes of the moduli space of super Riemann surfaces
Authors:
Xuanyu Huang,
Kefeng Liu,
Hao Xu
Abstract:
Inspired by the theory of JT supergravity, Stanford-Witten derived a remarkable recursion formula of Weil-Petersson volumes of moduli space of super Riemann surfaces. It is the super version of the celebrated Mirzakhani's recursion formula. In this paper, we generalize Stanford-Witten's formula to include high degree kappa classes.
Inspired by the theory of JT supergravity, Stanford-Witten derived a remarkable recursion formula of Weil-Petersson volumes of moduli space of super Riemann surfaces. It is the super version of the celebrated Mirzakhani's recursion formula. In this paper, we generalize Stanford-Witten's formula to include high degree kappa classes.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
Efficient Algorithm Design of Dynamic Spectrum Access by Whittle Index
Authors:
Keqin Liu,
Yiying Zhang,
Zhi Ding
Abstract:
This study addresses the dynamic spectrum access problem in a wireless sub-network that shares channels with a parent network. We approach the sequential channel allocation problem using a restless multi-armed bandits (RMAB) framework. Our objective is to maximize the expected discounted return over an infinite horizon while minimizing interference to the parent network caused by shared channels w…
▽ More
This study addresses the dynamic spectrum access problem in a wireless sub-network that shares channels with a parent network. We approach the sequential channel allocation problem using a restless multi-armed bandits (RMAB) framework. Our objective is to maximize the expected discounted return over an infinite horizon while minimizing interference to the parent network caused by shared channels with the sub-network. Due to the unavailability of direct observations of the true channel state, we leverage the channel quality indicator (CQI) feedback provided by users. However, the RMAB problem is widely acknowledged as PSPACE-hard even for finite-state models. To overcome this challenge, we propose a closed-form channel index function using an iterative online approximation method to approximate the well-known Whittle index policy, which offers a low-complexity solution for ranking the available channels that has an infinite state space. Through extensive numerical simulation experiments, we demonstrate the superior performance and robustness of our proposed algorithm.
△ Less
Submitted 30 December, 2024;
originally announced January 2025.
-
Several new Witten rigidity theorems for elliptic genus
Authors:
Jianyun Guan,
Kefeng Liu,
Yong Wang
Abstract:
Using the Liu's method, we prove a new Witten rigidity theorem of elliptic genus of twisted Dirac operators in even dimensional spin manifolds under the circle action. Combined with the Han-Yu's method, we prove the Witten rigidity theorems of elliptic genus of twisted Toplitz operators of odd-dimensional spin manifolds under the circle action. Moreover, we have obtained several similar Witten rig…
▽ More
Using the Liu's method, we prove a new Witten rigidity theorem of elliptic genus of twisted Dirac operators in even dimensional spin manifolds under the circle action. Combined with the Han-Yu's method, we prove the Witten rigidity theorems of elliptic genus of twisted Toplitz operators of odd-dimensional spin manifolds under the circle action. Moreover, we have obtained several similar Witten rigidity theorems of elliptic genus.
△ Less
Submitted 20 December, 2024; v1 submitted 19 December, 2024;
originally announced December 2024.
-
Representation and Regression Problems in Neural Networks: Relaxation, Generalization, and Numerics
Authors:
Kang Liu,
Enrique Zuazua
Abstract:
In this work, we address three non-convex optimization problems associated with the training of shallow neural networks (NNs) for exact and approximate representation, as well as for regression tasks. Through a mean-field approach, we convexify these problems and, applying a representer theorem, prove the absence of relaxation gaps. We establish generalization bounds for the resulting NN solutions…
▽ More
In this work, we address three non-convex optimization problems associated with the training of shallow neural networks (NNs) for exact and approximate representation, as well as for regression tasks. Through a mean-field approach, we convexify these problems and, applying a representer theorem, prove the absence of relaxation gaps. We establish generalization bounds for the resulting NN solutions, assessing their predictive performance on test datasets and, analyzing the impact of key hyperparameters on these bounds, propose optimal choices.
On the computational side, we examine the discretization of the convexified problems and derive convergence rates. For low-dimensional datasets, these discretized problems are efficiently solvable using the simplex method. For high-dimensional datasets, we propose a sparsification algorithm that, combined with gradient descent for over-parameterized shallow NNs, yields effective solutions to the primal problems.
△ Less
Submitted 3 April, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
Penrose transformation on flag domains
Authors:
Kefeng Liu,
Yang Shen
Abstract:
Building on our recent work, we construct the Penrose transformations of the cohomology groups of homogeneous line bundles on flag domains $D = G_\R / T$, where $G_\R$ is of Hermitian type. We provide sufficient conditions for the injectivity of the Penrose transformation and identify conditions under which the Penrose transformation of the automorphic cohomology groups on compact quotients of fla…
▽ More
Building on our recent work, we construct the Penrose transformations of the cohomology groups of homogeneous line bundles on flag domains $D = G_\R / T$, where $G_\R$ is of Hermitian type. We provide sufficient conditions for the injectivity of the Penrose transformation and identify conditions under which the Penrose transformation of the automorphic cohomology groups on compact quotients of flag domains is an isomorphism. Finally, we prove that the higher automorphic cohomology groups of certain homogeneous line bundles are isomorphic to the groups of automorphic forms on the Hermitian symmetric domain, and we apply this result to the cup products of the automorphic cohomology groups.
△ Less
Submitted 20 December, 2024; v1 submitted 20 November, 2024;
originally announced November 2024.
-
Counterexamples to a Weitz-Style Reduction for Multispin Systems
Authors:
Kuikui Liu,
Nitya Mani,
Francisco Pernice
Abstract:
In a seminal paper, Weitz showed that for two-state spin systems, such as the Ising and hardcore models from statistical physics, correlation decay on trees implies correlation decay on arbitrary graphs. The key gadget in Weitz's reduction has been instrumental in recent advances in approximate counting and sampling, from analysis of local Markov chains like Glauber dynamics to the design of deter…
▽ More
In a seminal paper, Weitz showed that for two-state spin systems, such as the Ising and hardcore models from statistical physics, correlation decay on trees implies correlation decay on arbitrary graphs. The key gadget in Weitz's reduction has been instrumental in recent advances in approximate counting and sampling, from analysis of local Markov chains like Glauber dynamics to the design of deterministic algorithms for estimating the partition function. A longstanding open problem in the field has been to find such a reduction for more general multispin systems like the uniform distribution over proper colorings of a graph.
In this paper, we show that for a rich class of multispin systems, including the ferromagnetic Potts model, there are fundamental obstacles to extending Weitz's reduction to the multispin setting. A central component of our investigation is establishing nonconvexity of the image of the belief propagation functional, the standard tool for analyzing spin systems on trees. On the other hand, we provide evidence of convexity for the antiferromagnetic Potts model.
△ Less
Submitted 7 February, 2025; v1 submitted 10 November, 2024;
originally announced November 2024.
-
Deep Nonparametric Inference for Conditional Hazard Function
Authors:
Wen Su,
Kin-Yat Liu,
Guosheng Yin,
Jian Huang,
Xingqiu Zhao
Abstract:
We propose a novel deep learning approach to nonparametric statistical inference for the conditional hazard function of survival time with right-censored data. We use a deep neural network (DNN) to approximate the logarithm of a conditional hazard function given covariates and obtain a DNN likelihood-based estimator of the conditional hazard function. Such an estimation approach renders model flex…
▽ More
We propose a novel deep learning approach to nonparametric statistical inference for the conditional hazard function of survival time with right-censored data. We use a deep neural network (DNN) to approximate the logarithm of a conditional hazard function given covariates and obtain a DNN likelihood-based estimator of the conditional hazard function. Such an estimation approach renders model flexibility and hence relaxes structural and functional assumptions on conditional hazard or survival functions. We establish the nonasymptotic error bound and functional asymptotic normality of the proposed estimator. Subsequently, we develop new one-sample tests for goodness-of-fit evaluation and two-sample tests for treatment comparison. Both simulation studies and real application analysis show superior performances of the proposed estimators and tests in comparison with existing methods.
△ Less
Submitted 23 October, 2024;
originally announced October 2024.
-
Mathematical Analysis and Numerical Computation of String Vibration Equations with Elastic Supports for Bridge Cable Force Evaluation
Authors:
Minhui Tan,
Qing Xu,
Hairong Yuan,
Man Xu,
Ke Liu,
Aifang Qu,
Xiaoda Xu
Abstract:
This study focuses on a critical aspect of bridge engineering -- the evaluation of cable forces, paying particular attention to the cables that are internally constrained by elastic supports. Detecting these cable forces is important for the safety and stability of bridges. The practical problem introduces a novel mathematical challenge: how to effectively address string vibration equations with o…
▽ More
This study focuses on a critical aspect of bridge engineering -- the evaluation of cable forces, paying particular attention to the cables that are internally constrained by elastic supports. Detecting these cable forces is important for the safety and stability of bridges. The practical problem introduces a novel mathematical challenge: how to effectively address string vibration equations with one or multiple internal elastic supports,~which remains a theoretical issue not fully solved in engineering. To tackle this, it is necessary to firstly establish an appropriate mathematical model and accurately define initial-boundary value problems. We then formulate the well-posedness of the solution using both classical and weak solution approaches, supplementing the existing numerical results available in engineering. Meanwhile, we attempt to use PINNs (Physics-Informed Neural Networks) instead of traditional FEM (Finite Element Method) in engineering. Consequently, in contrast to the classical solution method, we demonstrate that for a string with finite elastic supports, the weak solution method not only improves mathematical modeling efficiency but also simplifies the process of explaining the well-posedness of the solution.
△ Less
Submitted 8 October, 2024;
originally announced October 2024.
-
The noncommutative residue and sub-Riemannian limits for the twisted BCV spaces
Authors:
Hongfeng Li,
Kefeng Liu,
Yong Wang
Abstract:
In this paper, we derive the sub-Riemannian version of the Kastler-Kalau-Walze type theorem and the Dabrowski-Sitarz-Zalecki type theorem for the twisted BCV spaces. We also compute the Connes conformal invariants for the twisted product, as well as the sub-Riemannian limits of the Connes conformal invariants for the twisted BCV spaces.
In this paper, we derive the sub-Riemannian version of the Kastler-Kalau-Walze type theorem and the Dabrowski-Sitarz-Zalecki type theorem for the twisted BCV spaces. We also compute the Connes conformal invariants for the twisted product, as well as the sub-Riemannian limits of the Connes conformal invariants for the twisted BCV spaces.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
On a determinant involving linear combinations of Legendre symbols
Authors:
Keqin Liu,
Zhi-Wei Sun,
Li-Yuan Wang
Abstract:
In this paper, we prove a conjecture of the second author by evaluating the determinant
$$\det\left[x+\left(\frac{i-j}p\right)+\left(\frac ip\right)y+\left(\frac jp\right)z+\left(\frac{ij}p\right)w\right]_{0\le i,j\le(p-3)/2}$$
for any odd prime $p$, where $(\frac{\cdot}p)$ denotes the Legendre symbol. In particular, the determinant is equal to $x$ when $p\equiv 3\pmod4$.
In this paper, we prove a conjecture of the second author by evaluating the determinant
$$\det\left[x+\left(\frac{i-j}p\right)+\left(\frac ip\right)y+\left(\frac jp\right)z+\left(\frac{ij}p\right)w\right]_{0\le i,j\le(p-3)/2}$$
for any odd prime $p$, where $(\frac{\cdot}p)$ denotes the Legendre symbol. In particular, the determinant is equal to $x$ when $p\equiv 3\pmod4$.
△ Less
Submitted 30 September, 2024; v1 submitted 13 August, 2024;
originally announced August 2024.
-
Error Bounds for Open Quantum Systems with Harmonic Bosonic Bath
Authors:
Kaizhao Liu,
Jianfeng Lu
Abstract:
We investigate the dependence of physical observable of open quantum systems with Bosonic bath on the bath correlation function. We provide an error estimate of the difference of physical observable induced by the variation of bath correlation function, based on diagrammatic and combinatorial arguments. This gives a mathematically rigorous justification of the result in [Mascherpa et al, Phys Rev…
▽ More
We investigate the dependence of physical observable of open quantum systems with Bosonic bath on the bath correlation function. We provide an error estimate of the difference of physical observable induced by the variation of bath correlation function, based on diagrammatic and combinatorial arguments. This gives a mathematically rigorous justification of the result in [Mascherpa et al, Phys Rev Lett 2017].
△ Less
Submitted 15 February, 2025; v1 submitted 7 August, 2024;
originally announced August 2024.
-
Universal Approximation of Dynamical Systems by Semi-Autonomous Neural ODEs and Applications
Authors:
Ziqian Li,
Kang Liu,
Lorenzo Liverani,
Enrique Zuazua
Abstract:
In this paper, we introduce semi-autonomous neural ordinary differential equations (SA-NODEs), a variation of the vanilla NODEs, employing fewer parameters. We investigate the universal approximation properties of SA-NODEs for dynamical systems from both a theoretical and a numerical perspective. Within the assumption of a finite-time horizon, under general hypotheses we establish an asymptotic ap…
▽ More
In this paper, we introduce semi-autonomous neural ordinary differential equations (SA-NODEs), a variation of the vanilla NODEs, employing fewer parameters. We investigate the universal approximation properties of SA-NODEs for dynamical systems from both a theoretical and a numerical perspective. Within the assumption of a finite-time horizon, under general hypotheses we establish an asymptotic approximation result, demonstrating that the error vanishes as the number of parameters goes to infinity. Under additional regularity assumptions, we further specify this convergence rate in relation to the number of parameters, utilizing quantitative approximation results in the Barron space. Based on the previous result, we prove an approximation rate for transport equations by their neural counterparts. Our numerical experiments validate the effectiveness of SA-NODEs in capturing the dynamics of various ODE systems and transport equations. Additionally, we compare SA-NODEs with vanilla NODEs, highlighting the superior performance and reduced complexity of our approach.
△ Less
Submitted 25 July, 2024; v1 submitted 24 July, 2024;
originally announced July 2024.
-
On the Limitation of Kernel Dependence Maximization for Feature Selection
Authors:
Keli Liu,
Feng Ruan
Abstract:
A simple and intuitive method for feature selection consists of choosing the feature subset that maximizes a nonparametric measure of dependence between the response and the features. A popular proposal from the literature uses the Hilbert-Schmidt Independence Criterion (HSIC) as the nonparametric dependence measure. The rationale behind this approach to feature selection is that important feature…
▽ More
A simple and intuitive method for feature selection consists of choosing the feature subset that maximizes a nonparametric measure of dependence between the response and the features. A popular proposal from the literature uses the Hilbert-Schmidt Independence Criterion (HSIC) as the nonparametric dependence measure. The rationale behind this approach to feature selection is that important features will exhibit a high dependence with the response and their inclusion in the set of selected features will increase the HSIC. Through counterexamples, we demonstrate that this rationale is flawed and that feature selection via HSIC maximization can miss critical features.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics
Authors:
Xi Cheng,
Gaofeng Su,
Siyuan Feng,
Ke Liu,
Chen Zhu,
Hui Lin,
Jilin Song,
Jianan Chen
Abstract:
Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducte…
▽ More
Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducted simulations in the Berkeley city area during the "Big Game" to validate our system and demonstrate the benefits of our innovative parking assignment approach.
△ Less
Submitted 14 May, 2024;
originally announced June 2024.
-
Multi-Patch Isogeometric Convolution Hierarchical Deep-learning Neural Network
Authors:
Lei Zhang,
Chanwook Park,
T. J. R. Hughes,
Wing Kam Liu
Abstract:
A seamless integration of neural networks with Isogeometric Analysis (IGA) was first introduced in [1] under the name of Hierarchical Deep-learning Neural Network (HiDeNN) and has systematically evolved into Isogeometric Convolution HiDeNN (in short, C-IGA) [2]. C-IGA achieves higher order approximations without increasing the degree of freedom. Due to the Kronecker delta property of C-IGA shape f…
▽ More
A seamless integration of neural networks with Isogeometric Analysis (IGA) was first introduced in [1] under the name of Hierarchical Deep-learning Neural Network (HiDeNN) and has systematically evolved into Isogeometric Convolution HiDeNN (in short, C-IGA) [2]. C-IGA achieves higher order approximations without increasing the degree of freedom. Due to the Kronecker delta property of C-IGA shape functions, one can refine the mesh in the physical domain like standard finite element method (FEM) while maintaining the exact geometrical mapping of IGA. In this article, C-IGA theory is generalized for multi-CAD-patch systems with a mathematical investigation of the compatibility conditions at patch interfaces and convergence of error estimates. Two compatibility conditions (nodal compatibility and G^0 (i.e., global C^0) compatibility) are presented and validated through numerical examples.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Locally Stationary Distributions: A Framework for Analyzing Slow-Mixing Markov Chains
Authors:
Kuikui Liu,
Sidhanth Mohanty,
Prasad Raghavendra,
Amit Rajaraman,
David X. Wu
Abstract:
Many natural Markov chains fail to mix to their stationary distribution in polynomially many steps. Often, this slow mixing is inevitable since it is computationally intractable to sample from their stationary measure.
Nevertheless, Markov chains can be shown to always converge quickly to measures that are locally stationary, i.e., measures that don't change over a small number of steps. These l…
▽ More
Many natural Markov chains fail to mix to their stationary distribution in polynomially many steps. Often, this slow mixing is inevitable since it is computationally intractable to sample from their stationary measure.
Nevertheless, Markov chains can be shown to always converge quickly to measures that are locally stationary, i.e., measures that don't change over a small number of steps. These locally stationary measures are analogous to local minima in continuous optimization, while stationary measures correspond to global minima.
While locally stationary measures can be statistically far from stationary measures, do they enjoy provable theoretical guarantees that have algorithmic implications? We study this question in this work and demonstrate three algorithmic applications of locally stationary measures:
1. We show that Glauber dynamics on the hardcore model can be used to find independent sets of size $Ω\left(\frac{\log d}{d} \cdot n\right)$ in triangle-free graphs of degree at most $d$.
2. Let $W$ be a symmetric real matrix with bounded spectral diameter and $v$ be a unit vector. Given the matrix $M = λvv^\top + W$ with a planted rank-one spike along vector $v$, for sufficiently large constant $λ$, Glauber dynamics on the Ising model defined by $M$ samples vectors $x \in \{\pm 1\}^n$ that have constant correlation with the vector $v$.
3. Let $M = A_{\mathbf{G}} - \frac{d}{n}\mathbf{1}\mathbf{1}^\top$ be a centered version of the adjacency matrix where the graph $\mathbf{G}$ is drawn from a sparse 2-community stochastic block model. We show that for sufficiently large constant signal-to-noise ratio, Glauber dynamics on the Ising model defined by $M$ samples vectors $x \in \{\pm 1\}^n$ that have constant correlation with the hidden community vector $\mathbfσ$.
△ Less
Submitted 6 July, 2025; v1 submitted 31 May, 2024;
originally announced May 2024.
-
Geometry of non-classical period domains
Authors:
Kefeng Liu,
Yang Shen
Abstract:
In this paper we prove a conjecture of Griffiths about vanishing of the zeroth cohomology groups of locally homogeneous vector bundles on compact quotients of non-classical period domains, and construct a new $G_\R$-invariant complex structure on any non-classical period domain $D=G_\R/V$ with $G_\R$ of Hermitian type. Various geometric and algebraic characterizations of non-classical period domai…
▽ More
In this paper we prove a conjecture of Griffiths about vanishing of the zeroth cohomology groups of locally homogeneous vector bundles on compact quotients of non-classical period domains, and construct a new $G_\R$-invariant complex structure on any non-classical period domain $D=G_\R/V$ with $G_\R$ of Hermitian type. Various geometric and algebraic characterizations of non-classical period domains and several geometric applications on their compact quotients are deduced as consequences of our results.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
Flight Path Optimization with Optimal Control Method
Authors:
Gaofeng Su,
Xi Cheng,
Siyuan Feng,
Ke Liu,
Jilin Song,
Jianan Chen,
Chen Zhu,
Hui Lin
Abstract:
This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to d…
▽ More
This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to define the dynamic model of the aircraft in accordance with the controllable inputs and wind disturbances. Then we will identify a precise objective in terms of optimization and implement an optimization program to solve it under the circumstances of simulated real flight situation. Finally, the optimization result is validated and discussed by different scenarios.
△ Less
Submitted 13 August, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Fast Mixing in Sparse Random Ising Models
Authors:
Kuikui Liu,
Sidhanth Mohanty,
Amit Rajaraman,
David X. Wu
Abstract:
Motivated by the community detection problem in Bayesian inference, as well as the recent explosion of interest in spin glasses from statistical physics, we study the classical Glauber dynamics for sampling from Ising models with sparse random interactions. It is now well-known that when the interaction matrix has spectral diameter less than $1$, Glauber dynamics mixes in $O(n\log n)$ steps. Unfor…
▽ More
Motivated by the community detection problem in Bayesian inference, as well as the recent explosion of interest in spin glasses from statistical physics, we study the classical Glauber dynamics for sampling from Ising models with sparse random interactions. It is now well-known that when the interaction matrix has spectral diameter less than $1$, Glauber dynamics mixes in $O(n\log n)$ steps. Unfortunately, such criteria fail dramatically for interactions supported on arguably the most well-studied sparse random graph: the Erdős--Rényi random graph $G(n,d/n)$, due to the presence of almost linearly many outlier eigenvalues of unbounded magnitude.
We prove that for the \emph{Viana--Bray spin glass}, where the interactions are supported on $G(n,d/n)$ and randomly assigned $\pmβ$, Glauber dynamics mixes in $n^{1+o(1)}$ time with high probability as long as $β\le O(1/\sqrt{d})$, independent of $n$. We further extend our results to random graphs drawn according to the $2$-community stochastic block model, as well as when the interactions are given by a "centered" version of the adjacency matrix. The latter setting is particularly relevant for the inference problem in community detection. Indeed, we use this to show that Glauber dynamics succeeds at recovering communities in the stochastic block model in a companion paper [LMR+24].
The primary technical ingredient in our proof is showing that with high probability, a sparse random graph can be decomposed into two parts -- a \emph{bulk} which behaves like a graph with bounded maximum degree and a well-behaved spectrum, and a \emph{near-forest} with favorable pseudorandom properties. We then use this decomposition to design a localization procedure that interpolates to simpler Ising models supported only on the near-forest, and then execute a pathwise analysis to establish a modified log-Sobolev inequality.
△ Less
Submitted 5 August, 2024; v1 submitted 10 May, 2024;
originally announced May 2024.
-
Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty
Authors:
Kaizhao Liu,
Jose Blanchet,
Lexing Ying,
Yiping Lu
Abstract:
Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called \textbf{Orthogonal Bootstrap} that reduces the number of required Monte Carlo replications. We decomposes the target being simulated into two parts: the \textit{non-orthogonal part} which has a closed-form result kno…
▽ More
Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called \textbf{Orthogonal Bootstrap} that reduces the number of required Monte Carlo replications. We decomposes the target being simulated into two parts: the \textit{non-orthogonal part} which has a closed-form result known as Infinitesimal Jackknife and the \textit{orthogonal part} which is easier to be simulated. We theoretically and numerically show that Orthogonal Bootstrap significantly reduces the computational cost of Bootstrap while improving empirical accuracy and maintaining the same width of the constructed interval.
△ Less
Submitted 30 April, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
A modified Polak-Ribiere-Polyak type conjugate gradient method with two stepsize strategies for vector optimization
Authors:
Yushan Bai,
Jiawei Chen,
Kaiping Liu
Abstract:
In this paper, in order to find critical points of vector-valued functions with respect to the partial order induced by a closed, convex, and pointed cone with nonempty interior, we propose a nonlinear modified Polak-Ribiere-Polyak type conjugate gradient method with a nonnegative conjugate parameter. We show that the search direction in our method satisfies the sufficient descent condition indepe…
▽ More
In this paper, in order to find critical points of vector-valued functions with respect to the partial order induced by a closed, convex, and pointed cone with nonempty interior, we propose a nonlinear modified Polak-Ribiere-Polyak type conjugate gradient method with a nonnegative conjugate parameter. We show that the search direction in our method satisfies the sufficient descent condition independent of any line search. Furthermore, under mild assumptions, we obtain the results of global convergence with the standard Wolfe line search conditions as well as the standard Armijo line search strategy without convexity assumption of the objective functions. Computational experiments are given to show the effectiveness of the proposed method.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Distributed Adaptive Gradient Algorithm with Gradient Tracking for Stochastic Non-Convex Optimization
Authors:
Dongyu Han,
Kun Liu,
Yeming Lin,
Yuanqing Xia
Abstract:
This paper considers a distributed stochastic non-convex optimization problem, where the nodes in a network cooperatively minimize a sum of $L$-smooth local cost functions with sparse gradients. By adaptively adjusting the stepsizes according to the historical (possibly sparse) gradients, a distributed adaptive gradient algorithm is proposed, in which a gradient tracking estimator is used to handl…
▽ More
This paper considers a distributed stochastic non-convex optimization problem, where the nodes in a network cooperatively minimize a sum of $L$-smooth local cost functions with sparse gradients. By adaptively adjusting the stepsizes according to the historical (possibly sparse) gradients, a distributed adaptive gradient algorithm is proposed, in which a gradient tracking estimator is used to handle the heterogeneity between different local cost functions. We establish an upper bound on the optimality gap, which indicates that our proposed algorithm can reach a first-order stationary solution dependent on the upper bound on the variance of the stochastic gradients. Finally, numerical examples are presented to illustrate the effectiveness of the algorithm.
△ Less
Submitted 29 March, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Fourier neural operator based fluid-structure interaction for predicting the vesicle dynamics
Authors:
Wang Xiao,
Ting Gao,
Kai Liu,
Jinqiao Duan,
Meng Zhao
Abstract:
Solving complex fluid-structure interaction (FSI) problems, characterized by nonlinear partial differential equations, is crucial in various scientific and engineering applications. Traditional computational fluid dynamics (CFD) solvers are insufficient to meet the growing requirements for large-scale and long-period simulations. Fortunately, the rapid advancement in neural networks, especially ne…
▽ More
Solving complex fluid-structure interaction (FSI) problems, characterized by nonlinear partial differential equations, is crucial in various scientific and engineering applications. Traditional computational fluid dynamics (CFD) solvers are insufficient to meet the growing requirements for large-scale and long-period simulations. Fortunately, the rapid advancement in neural networks, especially neural operator learning mappings between function spaces, has introduced novel approaches to tackle these challenges via data-driven modeling. In this paper, we propose a Fourier neural operator-based fluid-structure interaction solver (FNO-based FSI solver) for efficient simulation of FSI problems, where the solid solver based on the finite difference method is seamlessly integrated with the Fourier neural operator to predict incompressible flow using the immersed boundary method. We analyze the performance of the FNO-based FSI solver in the following three situations: training data with or without the steady state, training method with one-step label or multi-step labels, and prediction in interpolation or extrapolation. We find that the best performance for interpolation is achieved by training the operator with multi-step labels using steady-state data. Finally, we train the FNO-based FSI solver using this optimal training method and apply it to vesicle dynamics. The results show that the FNO-based FSI solver is capable of capturing the variations in the fluid and the vesicle.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
An eigenvalue problem for self-similar patterns in Hele-Shaw flows
Authors:
Wang Xiao,
Lingyu Feng,
Kai Liu,
Meng Zhao
Abstract:
Hele-Shaw problems are prototypes to study the interface dynamics. Linear theory suggests the existence of self-similar patterns in a Hele-Shaw flow. That is, with a specific injection flux the interface shape remains unchanged while its size increases. In this paper, we explore the existence of self-similar patterns in the nonlinear regime and develop a rigorous nonlinear theory characterizing th…
▽ More
Hele-Shaw problems are prototypes to study the interface dynamics. Linear theory suggests the existence of self-similar patterns in a Hele-Shaw flow. That is, with a specific injection flux the interface shape remains unchanged while its size increases. In this paper, we explore the existence of self-similar patterns in the nonlinear regime and develop a rigorous nonlinear theory characterizing their fundamental features. Using a boundary integral formulation, we pose the question of self-similarity as a generalized nonlinear eigenvalue problem, involving two nonlinear integral operators. The flux constant $C$ is the eigenvalue and the corresponding self-similar pattern $\mathbf{x}$ is the eigenvector. We develop a quasi-Newton method to solve the problem and show the existence of nonlinear shapes with $k$-fold dominated symmetries. The influence of initial guesses on the self-similar patterns is investigated. We are able to obtain a desired self-similar shape once the initial guess is properly chosen. Our results go beyond the predictions of linear theory and establish a bridge between the linear theory and simulations.
△ Less
Submitted 4 January, 2024;
originally announced January 2024.
-
The Local Landscape of Phase Retrieval Under Limited Samples
Authors:
Kaizhao Liu,
Zihao Wang,
Lei Wu
Abstract:
In this paper, we present a fine-grained analysis of the local landscape of phase retrieval under the regime of limited samples. Specifically, we aim to ascertain the minimal sample size required to guarantee a benign local landscape surrounding global minima in high dimensions. Let $n$ and $d$ denote the sample size and input dimension, respectively. We first explore the local convexity and estab…
▽ More
In this paper, we present a fine-grained analysis of the local landscape of phase retrieval under the regime of limited samples. Specifically, we aim to ascertain the minimal sample size required to guarantee a benign local landscape surrounding global minima in high dimensions. Let $n$ and $d$ denote the sample size and input dimension, respectively. We first explore the local convexity and establish that when $n=o(d\log d)$, for almost every fixed point in the local ball, the Hessian matrix has negative eigenvalues, provided $d$ is sufficiently large. % Consequently, the local landscape is highly non-convex. We next consider the one-point convexity and show that, as long as $n=ω(d)$, with high probability, the landscape is one-point strongly convex in the local annulus: $\{w\in\mathbb{R}^d: o_d(1)\leqslant \|w-w^*\|\leqslant c\}$, where $w^*$ is the ground truth and $c$ is an absolute constant. This implies that gradient descent, initialized from any point in this domain, can converge to an $o_d(1)$-loss solution exponentially fast. Furthermore, we show that when $n=o(d\log d)$, there is a radius of $\widetildeΘ\left(\sqrt{1/d}\right)$ such that one-point convexity breaks down in the corresponding smaller local ball. This indicates an impossibility of establishing a convergence to the exact $w^*$ for gradient descent under limited samples by relying solely on one-point convexity.
△ Less
Submitted 11 October, 2024; v1 submitted 26 November, 2023;
originally announced November 2023.
-
OptScaler: A Collaborative Framework for Robust Autoscaling in the Cloud
Authors:
Ding Zou,
Wei Lu,
Zhibo Zhu,
Xingyu Lu,
Jun Zhou,
Xiaojin Wang,
Kangyu Liu,
Haiqing Wang,
Kefan Wang,
Renen Sun
Abstract:
Autoscaling is a critical mechanism in cloud computing, enabling the autonomous adjustment of computing resources in response to dynamic workloads. This is particularly valuable for co-located, long-running applications with diverse workload patterns. The primary objective of autoscaling is to regulate resource utilization at a desired level, effectively balancing the need for resource optimizatio…
▽ More
Autoscaling is a critical mechanism in cloud computing, enabling the autonomous adjustment of computing resources in response to dynamic workloads. This is particularly valuable for co-located, long-running applications with diverse workload patterns. The primary objective of autoscaling is to regulate resource utilization at a desired level, effectively balancing the need for resource optimization with the fulfillment of Service Level Objectives (SLOs). Many existing proactive autoscaling frameworks may encounter prediction deviations arising from the frequent fluctuations of cloud workloads. Reactive frameworks, on the other hand, rely on realtime system feedback, but their hysteretic nature could lead to violations of stringent SLOs. Hybrid frameworks, while prevalent, often feature independently functioning proactive and reactive modules, potentially leading to incompatibility and undermining the overall decision-making efficacy. In addressing these challenges, we propose OptScaler, a collaborative autoscaling framework that integrates proactive and reactive modules through an optimization module. The proactive module delivers reliable future workload predictions to the optimization module, while the reactive module offers a self-tuning estimator for real-time updates. By embedding a Model Predictive Control (MPC) mechanism and chance constraints into the optimization module, we further enhance its robustness. Numerical results have demonstrated the superiority of our workload prediction model and the collaborative framework, leading to over a 36% reduction in SLO violations compared to prevalent reactive, proactive, or hybrid autoscalers. Notably, OptScaler has been successfully deployed at Alipay, providing autoscaling support for the world-leading payment platform.
△ Less
Submitted 5 February, 2025; v1 submitted 26 October, 2023;
originally announced November 2023.
-
Statistical Parameterized Physics-Based Machine Learning Digital Twin Models for Laser Powder Bed Fusion Process
Authors:
Yangfan Li,
Satyajit Mojumder,
Ye Lu,
Abdullah Al Amin,
Jiachen Guo,
Xiaoyu Xie,
Wei Chen,
Gregory J. Wagner,
Jian Cao,
Wing Kam Liu
Abstract:
A digital twin (DT) is a virtual representation of physical process, products and/or systems that requires a high-fidelity computational model for continuous update through the integration of sensor data and user input. In the context of laser powder bed fusion (LPBF) additive manufacturing, a digital twin of the manufacturing process can offer predictions for the produced parts, diagnostics for m…
▽ More
A digital twin (DT) is a virtual representation of physical process, products and/or systems that requires a high-fidelity computational model for continuous update through the integration of sensor data and user input. In the context of laser powder bed fusion (LPBF) additive manufacturing, a digital twin of the manufacturing process can offer predictions for the produced parts, diagnostics for manufacturing defects, as well as control capabilities. This paper introduces a parameterized physics-based digital twin (PPB-DT) for the statistical predictions of LPBF metal additive manufacturing process. We accomplish this by creating a high-fidelity computational model that accurately represents the melt pool phenomena and subsequently calibrating and validating it through controlled experiments. In PPB-DT, a mechanistic reduced-order method-driven stochastic calibration process is introduced, which enables the statistical predictions of the melt pool geometries and the identification of defects such as lack-of-fusion porosity and surface roughness, specifically for diagnostic applications. Leveraging data derived from this physics-based model and experiments, we have trained a machine learning-based digital twin (PPB-ML-DT) model for predicting, monitoring, and controlling melt pool geometries. These proposed digital twin models can be employed for predictions, control, optimization, and quality assurance within the LPBF process, ultimately expediting product development and certification in LPBF-based metal additive manufacturing.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Decks of rooted binary trees
Authors:
Ann Clifton,
Eva Czabarka,
Audace Dossou-Olory,
Kevin Liu,
Sarah Loeb,
Utku Okur,
Laszlo Szekely,
Kristina Wicke
Abstract:
We consider extremal problems related to decks and multidecks of rooted binary trees (a.k.a. rooted phylogenetic tree shapes). Here, the deck (resp. multideck) of a tree $T$ refers to the set (resp. multiset) of leaf induced binary subtrees of $T$. On the one hand, we consider the reconstruction of trees from their (multi)decks. We give lower and upper bounds on the minimum (multi)deck size requir…
▽ More
We consider extremal problems related to decks and multidecks of rooted binary trees (a.k.a. rooted phylogenetic tree shapes). Here, the deck (resp. multideck) of a tree $T$ refers to the set (resp. multiset) of leaf induced binary subtrees of $T$. On the one hand, we consider the reconstruction of trees from their (multi)decks. We give lower and upper bounds on the minimum (multi)deck size required to uniquely encode a rooted binary tree on $n$ leaves. On the other hand, we consider problems related to deck cardinalities. In particular, we characterize trees with minimum-size as well as maximum-size decks. Finally, we present some exhaustive computations for $k$-universal trees, i.e., rooted binary trees that contain all $k$-leaf rooted binary trees as induced subtrees.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
A mesh-independent method for second-order potential mean field games
Authors:
Kang Liu,
Laurent Pfeiffer
Abstract:
This article investigates the convergence of the Generalized Frank-Wolfe (GFW) algorithm for the resolution of potential and convex second-order mean field games. More specifically, the impact of the discretization of the mean-field-game system on the effectiveness of the GFW algorithm is analyzed. The article focuses on the theta-scheme introduced by the authors in a previous study. A sublinear a…
▽ More
This article investigates the convergence of the Generalized Frank-Wolfe (GFW) algorithm for the resolution of potential and convex second-order mean field games. More specifically, the impact of the discretization of the mean-field-game system on the effectiveness of the GFW algorithm is analyzed. The article focuses on the theta-scheme introduced by the authors in a previous study. A sublinear and a linear rate of convergence are obtained, for two different choices of stepsizes. These rates have the mesh-independence property: the underlying convergence constants are independent of the discretization parameters.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Mean field optimization problems: stability results and Lagrangian discretization
Authors:
Kang Liu,
Laurent Pfeiffer
Abstract:
We formulate and investigate a mean field optimization (MFO) problem over a set of probability distributions $μ$ with a prescribed marginal $m$. The cost function depends on an aggregate term, which is the expectation of $μ$ with respect to a contribution function. This problem is of particular interest in the context of Lagrangian potential mean field games (MFGs) and their discretization. We pro…
▽ More
We formulate and investigate a mean field optimization (MFO) problem over a set of probability distributions $μ$ with a prescribed marginal $m$. The cost function depends on an aggregate term, which is the expectation of $μ$ with respect to a contribution function. This problem is of particular interest in the context of Lagrangian potential mean field games (MFGs) and their discretization. We provide a first-order optimality condition and prove strong duality. We investigate stability properties of the MFO problem with respect to the prescribed marginal, from both primal and dual perspectives. In our stability analysis, we propose a method for recovering an approximate solution to an MFO problem with the help of an approximate solution to an MFO with a different marginal $m$, typically an empirical distribution. We combine this method with the stochastic Frank-Wolfe algorithm of a previous publication of ours to derive a complete resolution method.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Layered Models can "Automatically" Regularize and Discover Low-Dimensional Structures via Feature Learning
Authors:
Yunlu Chen,
Yang Li,
Keli Liu,
Feng Ruan
Abstract:
Layered models like neural networks appear to extract key features from data through empirical risk minimization, yet the theoretical understanding for this process remains unclear. Motivated by these observations, we study a two-layer nonparametric regression model where the input undergoes a linear transformation followed by a nonlinear mapping to predict the output, mirroring the structure of t…
▽ More
Layered models like neural networks appear to extract key features from data through empirical risk minimization, yet the theoretical understanding for this process remains unclear. Motivated by these observations, we study a two-layer nonparametric regression model where the input undergoes a linear transformation followed by a nonlinear mapping to predict the output, mirroring the structure of two-layer neural networks. In our model, both layers are optimized jointly through empirical risk minimization, with the nonlinear layer modeled by a reproducing kernel Hilbert space induced by a rotation and translation invariant kernel, regularized by a ridge penalty.
Our main result shows that the two-layer model can "automatically" induce regularization and facilitate feature learning. Specifically, the two-layer model promotes dimensionality reduction in the linear layer and identifies a parsimonious subspace of relevant features -- even without applying any norm penalty on the linear layer. Notably, this regularization effect arises directly from the model's layered structure, independent of optimization dynamics.
More precisely, assuming the covariates have nonzero explanatory power for the response only through a low dimensional subspace (central mean subspace), the linear layer consistently estimates both the subspace and its dimension. This demonstrates that layered models can inherently discover low-complexity solutions relevant for prediction, without relying on conventional regularization methods. Real-world data experiments further demonstrate the persistence of this phenomenon in practice.
△ Less
Submitted 30 January, 2025; v1 submitted 18 October, 2023;
originally announced October 2023.
-
On the distribution of $k$-free numbers on the view point of random walks
Authors:
Kui Liu,
Meijie Lu
Abstract:
In this paper, we investigate the distribution of $k$-free numbers in a class of $α$-random walks on the integer lattice $\mathbb{Z}$. In these walks, the walker starts from a non-negative integer $r$ and moves to the right by $a$ units with probability $α$, or by $b$ units with probability $1-α$. For $k\geq 3$, we obtain the asymptotic proportion of $k$-free numbers in a path of such $α$-random w…
▽ More
In this paper, we investigate the distribution of $k$-free numbers in a class of $α$-random walks on the integer lattice $\mathbb{Z}$. In these walks, the walker starts from a non-negative integer $r$ and moves to the right by $a$ units with probability $α$, or by $b$ units with probability $1-α$. For $k\geq 3$, we obtain the asymptotic proportion of $k$-free numbers in a path of such $α$-random walks in almost surely sense. This provides a generalization of a classical result on the distribution of $k$-free numbers in arithmetic progressions.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Cyclotomic expansions for double twist knots with an odd number of half-twists
Authors:
Qingtao Chen,
Kefeng Liu,
Shengmao Zhu
Abstract:
In this note, we compute the cyclotomic expansion formula for colored Jones polynomial of double twist knots with an odd number of half-twists $\mathcal{K}_{p,\frac{s}{2}}$ by using the Kauffman bracket skein theory. It answers a question proposed by Lovejoy and Osburn in 2019.
In this note, we compute the cyclotomic expansion formula for colored Jones polynomial of double twist knots with an odd number of half-twists $\mathcal{K}_{p,\frac{s}{2}}$ by using the Kauffman bracket skein theory. It answers a question proposed by Lovejoy and Osburn in 2019.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Universal rooted phylogenetic tree shapes and universal tanglegrams
Authors:
Ann Clifton,
Eva Czabarka,
Kevin Liu,
Sarah Loeb,
Utku Okur,
Laszlo Szekely,
Kristina Wicke
Abstract:
We provide an $Ω(n\log n) $ lower bound and an $O(n^2)$ upper bound for the smallest size of rooted binary trees (a.k.a. phylogenetic tree shapes), which are universal for rooted binary trees with $n$ leaves, i.e., contain all of them as induced binary subtrees. We explicitly compute the smallest universal trees for $n\leq 11$. We also provide an $Ω(n^2) $ lower bound and an $O(n^4)$ upper bound f…
▽ More
We provide an $Ω(n\log n) $ lower bound and an $O(n^2)$ upper bound for the smallest size of rooted binary trees (a.k.a. phylogenetic tree shapes), which are universal for rooted binary trees with $n$ leaves, i.e., contain all of them as induced binary subtrees. We explicitly compute the smallest universal trees for $n\leq 11$. We also provide an $Ω(n^2) $ lower bound and an $O(n^4)$ upper bound for the smallest size of tanglegrams, which are universal for size $n$ tanglegrams, i.e., which contain all of them as induced subtanglegrams. Some of our results generalize to rooted $d$-ary trees and to $d$-ary tanglegrams.
△ Less
Submitted 12 August, 2023;
originally announced August 2023.
-
Extended tensor decomposition model reduction methods: training, prediction, and design under uncertainty
Authors:
Ye Lu,
Satyajit Mojumder,
Jiachen Guo,
Yangfan Li,
Wing Kam Liu
Abstract:
This paper introduces an extended tensor decomposition (XTD) method for model reduction. The proposed method is based on a sparse non-separated enrichment to the conventional tensor decomposition, which is expected to improve the approximation accuracy and the reducibility (compressibility) in highly nonlinear and singular cases. The proposed XTD method can be a powerful tool for solving nonlinear…
▽ More
This paper introduces an extended tensor decomposition (XTD) method for model reduction. The proposed method is based on a sparse non-separated enrichment to the conventional tensor decomposition, which is expected to improve the approximation accuracy and the reducibility (compressibility) in highly nonlinear and singular cases. The proposed XTD method can be a powerful tool for solving nonlinear space-time parametric problems. The method has been successfully applied to parametric elastic-plastic problems and real time additive manufacturing residual stress predictions with uncertainty quantification. Furthermore, a combined XTD-SCA (self-consistent clustering analysis) strategy has been presented for multi-scale material modeling, which enables real time multi-scale multi-parametric simulations. The efficiency of the method is demonstrated with comparison to finite element analysis. The proposed method enables a novel framework for fast manufacturing and material design with uncertainties.
△ Less
Submitted 4 November, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Strictly Low Rank Constraint Optimization -- An Asymptotically $\mathcal{O}(\frac{1}{t^2})$ Method
Authors:
Mengyuan Zhang,
Kai Liu
Abstract:
We study a class of non-convex and non-smooth problems with \textit{rank} regularization to promote sparsity in optimal solution. We propose to apply the proximal gradient descent method to solve the problem and accelerate the process with a novel support set projection operation on the singular values of the intermediate update. We show that our algorithms are able to achieve a convergence rate o…
▽ More
We study a class of non-convex and non-smooth problems with \textit{rank} regularization to promote sparsity in optimal solution. We propose to apply the proximal gradient descent method to solve the problem and accelerate the process with a novel support set projection operation on the singular values of the intermediate update. We show that our algorithms are able to achieve a convergence rate of $O(\frac{1}{t^2})$, which is exactly same as Nesterov's optimal convergence rate for first-order methods on smooth and convex problems. Strict sparsity can be expected and the support set of singular values during each update is monotonically shrinking, which to our best knowledge, is novel in momentum-based algorithms.
△ Less
Submitted 4 July, 2023;
originally announced July 2023.
-
New Structures and their Applications to Variants of Zero Forcing and Propagation Time
Authors:
Leslie Hogben,
Mark Hunnell,
Kevin Liu,
Houston Schuerger,
Ben Small,
Yaqi Zhang
Abstract:
We introduce a generalization of the concept of a chronological list of forces, called a relaxed chronology. This concept is used to introduce a new way of formulating the standard zero forcing process, which we refer to as parallel increasing path covers, or PIPs. The combinatorial properties of PIPs are utilized to identify bounds comparing standard zero forcing propagation time to positive semi…
▽ More
We introduce a generalization of the concept of a chronological list of forces, called a relaxed chronology. This concept is used to introduce a new way of formulating the standard zero forcing process, which we refer to as parallel increasing path covers, or PIPs. The combinatorial properties of PIPs are utilized to identify bounds comparing standard zero forcing propagation time to positive semidefinite propagation time. A collection of paths within a set of PSD forcing trees, called a path bundle, is used to identify the PSD forcing analog of the reversal of a standard zero forcing process, as well as to draw a connection between PSD forcing and rigid-linkage forcing.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
On the stability of critical points of the Hardy-Littlewood-Sobolev inequality
Authors:
Kuan Liu,
Qian Zhang,
Wenming Zou
Abstract:
This paper is concerned with the quantitative stability of critical points of the Hardy-Littlewood-Sobolev inequality. Namely, we give quantitative estimates for the Choquard equation: $$-Δu=(I_μ\ast|u|^{2_μ^*}) u^{2_μ^*-1}\ \ \text{in}\ \ \R^N,$$ where $u>0,\ N\geq 3,\ μ\in(0,N)$, $I_μ$ is the Riesz potential and $2_μ^* \coloneqq \frac{2N-μ}{N-2}$ is the upper Hardy-Littlewood-Sobolev critical ex…
▽ More
This paper is concerned with the quantitative stability of critical points of the Hardy-Littlewood-Sobolev inequality. Namely, we give quantitative estimates for the Choquard equation: $$-Δu=(I_μ\ast|u|^{2_μ^*}) u^{2_μ^*-1}\ \ \text{in}\ \ \R^N,$$ where $u>0,\ N\geq 3,\ μ\in(0,N)$, $I_μ$ is the Riesz potential and $2_μ^* \coloneqq \frac{2N-μ}{N-2}$ is the upper Hardy-Littlewood-Sobolev critical exponent. The Struwe's decomposition (see M. Struwe: Math Z.,1984) showed that the equation $Δu + u^{\frac{N+2}{N-2 }}=0$ has phenomenon of ``stable up to bubbling'', that is, if $u\geq0$ and $\|Δu+u^{\frac{N+2}{N-2}}\|_{(\mathcal{D}^{1,2})^{-1}}$ approaches zero, then $d(u)$ goes to zero, where $d(u)$ denotes the $\mathcal{D}^{1,2}(\R^N)$-distance between $u$ and the set of all sums of Talenti bubbles. Ciraolo, F{}igalli and Maggi (Int. Math. Res. Not.,2017) obtained the f{}irst quantitative version of Struwe's decomposition with single bubble in all dimensions $N\geq 3$, i.e, $\displaystyle d(u)\leq C\|Δu+u^{\frac{N+2}{N-2}}\|_{L^{\frac{2N}{N+2}}}.$ For multiple bubbles, F{}igalli and Glaudo (Arch. Rational Mech. Anal., 2020) obtained quantitative estimates depending on the dimension, namely $$ d(u)\leq C\|Δu+u^{\frac{N+2}{N-2}}\|_{(\mathcal{D}^{1,2})^{-1}}, \hbox{ where } 3\leq N\leq 5,$$ which is invalid as $N\geq 6.$
\vskip0.1in
\quad In this paper, we prove the quantitative estimate of the Hardy-Littlewood-Sobolev inequality, we get $$d(u)\leq C\|Δu +(I_μ\ast|u|^{2_μ^*})|u|^{2_μ^*-2}u\|_{(\mathcal{D}^{1,2})^{-1}}, \hbox{ when } N=3 \hbox{ and } 5/2< μ<3.$$
△ Less
Submitted 13 July, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.