-
An inverse-free fixed-time stable dynamical system and its forward-Euler discretization for solving generalized absolute value equations
Authors:
Xuehua Li,
Linjie Chen,
Dongmei Yu,
Cairong Chen,
Deren Han
Abstract:
An inverse-free dynamical system is proposed to solve the generalized absolute value equation (GAVE) within a fixed time, where the time of convergence is finite and is uniformly bounded for all initial points. Moreover, an iterative method obtained by using the forward-Euler discretization of the proposed dynamic model are developed and sufficient conditions which guarantee that the discrete iter…
▽ More
An inverse-free dynamical system is proposed to solve the generalized absolute value equation (GAVE) within a fixed time, where the time of convergence is finite and is uniformly bounded for all initial points. Moreover, an iterative method obtained by using the forward-Euler discretization of the proposed dynamic model are developed and sufficient conditions which guarantee that the discrete iteration globally converge to an arbitrarily small neighborhood of the unique solution of GAVE within a finite number of iterative steps are given.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Group Distributionally Robust Optimization with Flexible Sample Queries
Authors:
Haomin Bai,
Dingzhi Yu,
Shuai Li,
Haipeng Luo,
Lijun Zhang
Abstract:
Group distributionally robust optimization (GDRO) aims to develop models that perform well across $m$ distributions simultaneously. Existing GDRO algorithms can only process a fixed number of samples per iteration, either 1 or $m$, and therefore can not support scenarios where the sample size varies dynamically. To address this limitation, we investigate GDRO with flexible sample queries and cast…
▽ More
Group distributionally robust optimization (GDRO) aims to develop models that perform well across $m$ distributions simultaneously. Existing GDRO algorithms can only process a fixed number of samples per iteration, either 1 or $m$, and therefore can not support scenarios where the sample size varies dynamically. To address this limitation, we investigate GDRO with flexible sample queries and cast it as a two-player game: one player solves an online convex optimization problem, while the other tackles a prediction with limited advice (PLA) problem. Within such a game, we propose a novel PLA algorithm, constructing appropriate loss estimators for cases where the sample size is either 1 or not, and updating the decision using follow-the-regularized-leader. Then, we establish the first high-probability regret bound for non-oblivious PLA. Building upon the above approach, we develop a GDRO algorithm that allows an arbitrary and varying sample size per round, achieving a high-probability optimization error bound of $O\left(\frac{1}{t}\sqrt{\sum_{j=1}^t \frac{m}{r_j}\log m}\right)$, where $r_t$ denotes the sample size at round $t$. This result demonstrates that the optimization error decreases as the number of samples increases and implies a consistent sample complexity of $O(m\log (m)/ε^2)$ for any fixed sample size $r\in[m]$, aligning with existing bounds for cases of $r=1$ or $m$. We validate our approach on synthetic binary and real-world multi-class datasets.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
Asymptotic Theory of Eigenvectors for Latent Embeddings with Generalized Laplacian Matrices
Authors:
Jianqing Fan,
Yingying Fan,
Jinchi Lv,
Fan Yang,
Diwen Yu
Abstract:
Laplacian matrices are commonly employed in many real applications, encoding the underlying latent structural information such as graphs and manifolds. The use of the normalization terms naturally gives rise to random matrices with dependency. It is well-known that dependency is a major bottleneck of new random matrix theory (RMT) developments. To this end, in this paper, we formally introduce a c…
▽ More
Laplacian matrices are commonly employed in many real applications, encoding the underlying latent structural information such as graphs and manifolds. The use of the normalization terms naturally gives rise to random matrices with dependency. It is well-known that dependency is a major bottleneck of new random matrix theory (RMT) developments. To this end, in this paper, we formally introduce a class of generalized (and regularized) Laplacian matrices, which contains the Laplacian matrix and the random adjacency matrix as a specific case, and suggest the new framework of the asymptotic theory of eigenvectors for latent embeddings with generalized Laplacian matrices (ATE-GL). Our new theory is empowered by the tool of generalized quadratic vector equation for dealing with RMT under dependency, and delicate high-order asymptotic expansions of the empirical spiked eigenvectors and eigenvalues based on local laws. The asymptotic normalities established for both spiked eigenvectors and eigenvalues will enable us to conduct precise inference and uncertainty quantification for applications involving the generalized Laplacian matrices with flexibility. We discuss some applications of the suggested ATE-GL framework and showcase its validity through some numerical examples.
△ Less
Submitted 1 March, 2025;
originally announced March 2025.
-
Multifractal analysis of maximal product of consecutive partial quotients in continued fractions
Authors:
Kunkun Song,
Dingding Yu,
Yueli Yu
Abstract:
Let $[a_1(x), a_2(x), \ldots, a_n(x), \ldots]$ be the continued fraction expansion of an irrational number $x\in (0,1)$. We study the growth rate of the maximal product of consecutive partial quotients among the first $n$ terms, defined by $L_n(x)=\max_{1\leq i\leq n}\{a_i(x)a_{i+1}(x)\}$, from the viewpoint of multifractal analysis. More precisely, we determine the Hausdorff dimension of the leve…
▽ More
Let $[a_1(x), a_2(x), \ldots, a_n(x), \ldots]$ be the continued fraction expansion of an irrational number $x\in (0,1)$. We study the growth rate of the maximal product of consecutive partial quotients among the first $n$ terms, defined by $L_n(x)=\max_{1\leq i\leq n}\{a_i(x)a_{i+1}(x)\}$, from the viewpoint of multifractal analysis. More precisely, we determine the Hausdorff dimension of the level set \[L(\varphi):=\left\{x\in (0,1):\lim_{n\to \infty}\frac{L_n(x)}{\varphi(n)}=1\right\},\] where $\varphi:\mathbb{R^+}\to\mathbb{R^+}$ is an increasing function such that $\log \varphi$ is a regularly increasing function with index $ρ$. We show that there exists a jump of the Hausdorff dimension of $L(\varphi)$ when $ρ=1/2$. We also construct uncountably many discontinuous functions $ψ$ that cause the Hausdorff dimension of $L(ψ)$ to transition continuously from 1 to 1/2, filling the gap when $ρ=1/2$.
△ Less
Submitted 12 June, 2025; v1 submitted 4 February, 2025;
originally announced February 2025.
-
Mirror Descent Under Generalized Smoothness
Authors:
Dingzhi Yu,
Wei Jiang,
Yuanyu Wan,
Lijun Zhang
Abstract:
Smoothness is crucial for attaining fast rates in first-order optimization. However, many optimization problems in modern machine learning involve non-smooth objectives. Recent studies relax the smoothness assumption by allowing the Lipschitz constant of the gradient to grow with respect to the gradient norm, which accommodates a broad range of objectives in practice. Despite this progress, existi…
▽ More
Smoothness is crucial for attaining fast rates in first-order optimization. However, many optimization problems in modern machine learning involve non-smooth objectives. Recent studies relax the smoothness assumption by allowing the Lipschitz constant of the gradient to grow with respect to the gradient norm, which accommodates a broad range of objectives in practice. Despite this progress, existing generalizations of smoothness are restricted to Euclidean geometry with $\ell_2$-norm and only have theoretical guarantees for optimization in the Euclidean space. In this paper, we address this limitation by introducing a new $\ell*$-smoothness concept that measures the norm of Hessians in terms of a general norm and its dual, and establish convergence for mirror-descent-type algorithms, matching the rates under the classic smoothness. Notably, we propose a generalized self-bounding property that facilitates bounding the gradients via controlling suboptimality gaps, serving as a principal component for convergence analysis. Beyond deterministic optimization, we establish an anytime convergence for stochastic mirror descent based on a new bounded noise condition that encompasses the widely adopted bounded or affine noise assumptions.
△ Less
Submitted 15 May, 2025; v1 submitted 2 February, 2025;
originally announced February 2025.
-
Derived from expanding endomorphism on $\mathbb{T}^2$
Authors:
Daohua Yu
Abstract:
Assume that $f$ is a $C^r(r\geq 3)$ specially partially hyperbolic endomorphism on the 2-torus which is homotopic to an expanding linear endomorphism $A$ with irrational eigenvalues. We prove that $f$ and $A$ are topologically conjugate, if and only if $f$ is area-expanding. If $f$ is area-expanding and the center bundle is $C^1$, then the topological conjugacy between $f$ and $A$ is…
▽ More
Assume that $f$ is a $C^r(r\geq 3)$ specially partially hyperbolic endomorphism on the 2-torus which is homotopic to an expanding linear endomorphism $A$ with irrational eigenvalues. We prove that $f$ and $A$ are topologically conjugate, if and only if $f$ is area-expanding. If $f$ is area-expanding and the center bundle is $C^1$, then the topological conjugacy between $f$ and $A$ is $C^{\max\{r-3,1\}+α}$. In particular, if $r=ω$, the conjugacy is $C^ω$.
△ Less
Submitted 17 February, 2025; v1 submitted 14 November, 2024;
originally announced November 2024.
-
Uniformly distributed periodic orbits of endomorphisms on $n$-tori
Authors:
Daohua Yu,
Shaobo Gan
Abstract:
We prove that any ergodic endomorphism on torus admits a sequence of periodic orbits uniformly distributed in the metric sense. As a corollary, an endomorphism on torus is ergodic if and only if the Haar measure can be approximated by periodic measures.
We prove that any ergodic endomorphism on torus admits a sequence of periodic orbits uniformly distributed in the metric sense. As a corollary, an endomorphism on torus is ergodic if and only if the Haar measure can be approximated by periodic measures.
△ Less
Submitted 15 November, 2024; v1 submitted 28 July, 2024;
originally announced July 2024.
-
Timelike asymptotics for global solutions to a scalar quasilinear wave equation satisfying the weak null condition
Authors:
Dongxiao Yu
Abstract:
We study the timelike asymptotics for global solutions to a scalar quasilinear wave equation satisfying the weak null condition. Given a global solution $u$ to the scalar wave equation with sufficiently small $C_c^\infty$ initial data, we derive an asymptotic formula for this global solution inside the light cone (i.e. for $|x|<t$). It involves the scattering data obtained in the author's asymptot…
▽ More
We study the timelike asymptotics for global solutions to a scalar quasilinear wave equation satisfying the weak null condition. Given a global solution $u$ to the scalar wave equation with sufficiently small $C_c^\infty$ initial data, we derive an asymptotic formula for this global solution inside the light cone (i.e. for $|x|<t$). It involves the scattering data obtained in the author's asymptotic completeness result in arXiv:2105.11573. Using this asymptotic formula, we prove that $u$ must vanish under some decaying assumptions on $u$ or its scattering data, provided that the wave equation violates the null condition.
△ Less
Submitted 17 March, 2025; v1 submitted 28 July, 2024;
originally announced July 2024.
-
Deterministic and Stochastic Frank-Wolfe Recursion on Probability Spaces
Authors:
Di Yu,
Shane G. Henderson,
Raghu Pasupathy
Abstract:
Motivated by applications in emergency response and experimental design, we consider smooth stochastic optimization problems over probability measures supported on compact subsets of the Euclidean space. With the influence function as the variational object, we construct a deterministic Frank-Wolfe (dFW) recursion for probability spaces, made especially possible by a lemma that identifies a ``clos…
▽ More
Motivated by applications in emergency response and experimental design, we consider smooth stochastic optimization problems over probability measures supported on compact subsets of the Euclidean space. With the influence function as the variational object, we construct a deterministic Frank-Wolfe (dFW) recursion for probability spaces, made especially possible by a lemma that identifies a ``closed-form'' solution to the infinite-dimensional Frank-Wolfe sub-problem. Each iterate in dFW is expressed as a convex combination of the incumbent iterate and a Dirac measure concentrating on the minimum of the influence function at the incumbent iterate. To address common application contexts that have access only to Monte Carlo observations of the objective and influence function, we construct a stochastic Frank-Wolfe (sFW) variation that generates a random sequence of probability measures constructed using minima of increasingly accurate estimates of the influence function. We demonstrate that sFW's optimality gap sequence exhibits $O(k^{-1})$ iteration complexity almost surely and in expectation for smooth convex objectives, and $O(k^{-1/2})$ (in Frank-Wolfe gap) for smooth non-convex objectives. Furthermore, we show that an easy-to-implement fixed-step, fixed-sample version of (sFW) exhibits exponential convergence to $\varepsilon$-optimality. We end with a central limit theorem on the observed objective values at the sequence of generated random measures. To further intuition, we include several illustrative examples with exact influence function calculations.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
John's blow up examples and scattering solutions for semi-linear wave equations
Authors:
Louie Bernhardt,
Volker Schlue,
Dongxiao Yu
Abstract:
In light of recent work of the third author, we revisit a classic example given by Fritz John of a semi-linear wave equation which exhibits finite in time blow up for all compactly supported data. We present the construction of future global solutions from asymptotic data given in arXiv:2204.12870(2022) for this specific example, and clarify the relation of this result of Yu to John's theorem. Fur…
▽ More
In light of recent work of the third author, we revisit a classic example given by Fritz John of a semi-linear wave equation which exhibits finite in time blow up for all compactly supported data. We present the construction of future global solutions from asymptotic data given in arXiv:2204.12870(2022) for this specific example, and clarify the relation of this result of Yu to John's theorem. Furthermore we present a novel blow up result for finite energy solutions satisfying a sign condition due to the first author, and invoke this result to show that the constructed backwards in time solutions blow up in the past.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Common substring with shifts in b-ary expansions
Authors:
Xin Liao,
Dingding Yu
Abstract:
Denote by $S_n(x,y)$ the length of the longest common substring of $x$ and $y$ with shifts in their first $n$ digits of $b$-ary expansions. We show that the sets of pairs $(x,y)$, for which the growth rate of $S_n(x,y)$ is $α\log n$ with $0\le α\le \infty$, have full Hausdorff dimension.
Denote by $S_n(x,y)$ the length of the longest common substring of $x$ and $y$ with shifts in their first $n$ digits of $b$-ary expansions. We show that the sets of pairs $(x,y)$, for which the growth rate of $S_n(x,y)$ is $α\log n$ with $0\le α\le \infty$, have full Hausdorff dimension.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Metrical theory of power-2-decaying Gauss-like expansion
Authors:
Zhihui Li,
Xin Liao,
Dingding Yu
Abstract:
Each $x\in (0,1]$ can be uniquely expanded as a power-2-decaying Gauss-like expansion, in the form of \begin{equation*} x=\sum_{i=1}^{\infty}2^{-(d_1(x)+d_2(x)+\cdots+d_i(x))},\qquad d_i(x)\in \mathbb{N}. \end{equation*} Let $φ:\mathbb{N}\to \mathbb{R}^{+}$ be an arbitrary positive function. We are interested in the size of the set…
▽ More
Each $x\in (0,1]$ can be uniquely expanded as a power-2-decaying Gauss-like expansion, in the form of \begin{equation*} x=\sum_{i=1}^{\infty}2^{-(d_1(x)+d_2(x)+\cdots+d_i(x))},\qquad d_i(x)\in \mathbb{N}. \end{equation*} Let $φ:\mathbb{N}\to \mathbb{R}^{+}$ be an arbitrary positive function. We are interested in the size of the set $$F(φ)=\{x\in (0,1]:d_n(x)\ge φ(n)~~\text{for infinity many}~n\}.$$ We prove a Borel-Bernstein theorem on the zero-one law of the Lebesgue measure of $F(φ)$. When the Lebesgue measure of $F(φ)$ is zero, we calculate its Hausdorff dimension. Furthermore, we analyse the growth rate of the maximal digit among the first $n$ digits from probability and multifractal perspectives.
△ Less
Submitted 29 May, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Model Uncertainty and Selection of Risk Models for Left-Truncated and Right-Censored Loss Data
Authors:
Qian Zhao,
Sahadeb Upretee,
Daoping Yu
Abstract:
Insurance loss data are usually in the form of left-truncation and right-censoring due to deductibles and policy limits respectively. This paper investigates the model uncertainty and selection procedure when various parametric models are constructed to accommodate such left-truncated and right-censored data. The joint asymptotic properties of the estimators have been established using the Delta m…
▽ More
Insurance loss data are usually in the form of left-truncation and right-censoring due to deductibles and policy limits respectively. This paper investigates the model uncertainty and selection procedure when various parametric models are constructed to accommodate such left-truncated and right-censored data. The joint asymptotic properties of the estimators have been established using the Delta method along with Maximum Likelihood Estimation when the model is specified. We conduct the simulation studies using Fisk, Lognormal, Lomax, Paralogistic, and Weibull distributions with various proportions of loss data below deductibles and above policy limits. A variety of graphic tools, hypothesis tests, and penalized likelihood criteria are employed to validate the models, and their performances on the model selection are evaluated through the probability of each parent distribution being correctly selected. The effectiveness of each tool on model selection is also illustrated using {well-studied} data that represent Wisconsin property losses in the United States from 2007 to 2010.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Symmetry-Based Quantum Circuit Mapping
Authors:
Di Yu,
Kun Fang
Abstract:
Quantum circuit mapping is a crucial process in the quantum circuit compilation pipeline, facilitating the transformation of a logical quantum circuit into a list of instructions directly executable on a target quantum system. Recent research has introduced a post-compilation step known as remapping, which seeks to reconfigure the initial circuit mapping to mitigate quantum circuit errors arising…
▽ More
Quantum circuit mapping is a crucial process in the quantum circuit compilation pipeline, facilitating the transformation of a logical quantum circuit into a list of instructions directly executable on a target quantum system. Recent research has introduced a post-compilation step known as remapping, which seeks to reconfigure the initial circuit mapping to mitigate quantum circuit errors arising from system variability. As quantum processors continue to scale in size, the efficiency of quantum circuit mapping and the overall compilation process has become of paramount importance. In this work, we introduce a quantum circuit remapping algorithm that leverages the intrinsic symmetries in quantum processors, making it well-suited for large-scale quantum systems. This algorithm identifies all topologically equivalent circuit mappings by constraining the search space using symmetries and accelerates the scoring of each mapping using vector computation. Notably, this symmetry-based circuit remapping algorithm exhibits linear scaling with the number of qubits in the target quantum hardware and is proven to be optimal in terms of its time complexity. Moreover, we conduct a comparative analysis against existing methods in the literature, demonstrating the superior performance of our symmetry-based method on state-of-the-art quantum hardware architectures and highlighting the practical utility of our algorithm, particularly for quantum processors with millions of qubits.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
The neural network models with delays for solving absolute value equations
Authors:
Dongmei Yu,
Gehao Zhang,
Cairong Chen,
Deren Han
Abstract:
An inverse-free neural network model with mixed delays is proposed for solving the absolute value equation (AVE) $Ax -|x| - b =0$, which includes an inverse-free neural network model with discrete delay as a special case. By using the Lyapunov-Krasovskii theory and the linear matrix inequality (LMI) method, the developed neural network models are proved to be exponentially convergent to the soluti…
▽ More
An inverse-free neural network model with mixed delays is proposed for solving the absolute value equation (AVE) $Ax -|x| - b =0$, which includes an inverse-free neural network model with discrete delay as a special case. By using the Lyapunov-Krasovskii theory and the linear matrix inequality (LMI) method, the developed neural network models are proved to be exponentially convergent to the solution of the AVE. Compared with the existing neural network models for solving the AVE, the proposed models feature the ability of solving a class of AVE with $\|A^{-1}\|>1$. Numerical simulations are given to show the effectiveness of the two delayed neural network models.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Authors:
Greg Yang,
Dingli Yu,
Chen Zhu,
Soufiane Hayou
Abstract:
By classifying infinite-width neural networks and identifying the *optimal* limit, Tensor Programs IV and V demonstrated a universal way, called $μ$P, for *widthwise hyperparameter transfer*, i.e., predicting optimal hyperparameters of wide neural networks from narrow ones. Here we investigate the analogous classification for *depthwise parametrizations* of deep residual networks (resnets). We cla…
▽ More
By classifying infinite-width neural networks and identifying the *optimal* limit, Tensor Programs IV and V demonstrated a universal way, called $μ$P, for *widthwise hyperparameter transfer*, i.e., predicting optimal hyperparameters of wide neural networks from narrow ones. Here we investigate the analogous classification for *depthwise parametrizations* of deep residual networks (resnets). We classify depthwise parametrizations of block multiplier and learning rate by their infinite-width-then-depth limits. In resnets where each block has only one layer, we identify a unique optimal parametrization, called Depth-$μ$P that extends $μ$P and show empirically it admits depthwise hyperparameter transfer. We identify *feature diversity* as a crucial factor in deep networks, and Depth-$μ$P can be characterized as maximizing both feature learning and feature diversity. Exploiting this, we find that absolute value, among all homogeneous nonlinearities, maximizes feature diversity and indeed empirically leads to significantly better performance. However, if each block is deeper (such as modern transformers), then we find fundamental limitations in all possible infinite-depth limits of such parametrizations, which we illustrate both theoretically and empirically on simple networks as well as Megatron transformer trained on Common Crawl.
△ Less
Submitted 12 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Asymptotic stability of the sine-Gordon kinks under perturbations in weighted Sobolev norms
Authors:
Herbert Koch,
Dongxiao Yu
Abstract:
We study the asymptotic stability of the sine-Gordon kinks under small perturbations in weighted Sobolev norms. Our main tool is the Bäcklund transform which reduces the study of the asymptotic stability of the kinks to the study of the asymptotic decay of solutions near zero. Our results consist of two parts. First, we prove an asymptotic stability result similar to the local results in arXiv:200…
▽ More
We study the asymptotic stability of the sine-Gordon kinks under small perturbations in weighted Sobolev norms. Our main tool is the Bäcklund transform which reduces the study of the asymptotic stability of the kinks to the study of the asymptotic decay of solutions near zero. Our results consist of two parts. First, we prove an asymptotic stability result similar to the local results in arXiv:2003.09358 and arXiv:2009.04260. Our assumptions are the same as those in the local result in arXiv:2009.04260. In its proof, we apply a result obtained by the inverse scattering method on the local decay of the solutions with sufficiently small and localized initial data. Moreover, we derive an asymptotic formula for the perturbations, i.e. the difference between solutions and kinks. This result is similar to that in arXiv:2106.09605 and the full asymptotic stability result in arXiv:2009.04260. In its proof, we apply a result obtained by the method of testing by wave packets on the pointwise decay of the solutions with small and localized data.
△ Less
Submitted 25 September, 2024; v1 submitted 15 August, 2023;
originally announced August 2023.
-
Rigidity of center Lyapunov exponents for Anosov diffeomorphisms on 3-torus
Authors:
Daohua Yu,
Ruihao Gu
Abstract:
Let f and g be two Anosov diffeomorphisms on T3 with three-subbundles partially hyperbolic splittings where the weak stable subbundles are considered as center subbundles. Assume that f is conjugate to g and the conjugacy preserves the strong stable foliation, then their center Lyapunov exponents of corresponding periodic points coincide. This is the converse of the main result of Gogolev and Guys…
▽ More
Let f and g be two Anosov diffeomorphisms on T3 with three-subbundles partially hyperbolic splittings where the weak stable subbundles are considered as center subbundles. Assume that f is conjugate to g and the conjugacy preserves the strong stable foliation, then their center Lyapunov exponents of corresponding periodic points coincide. This is the converse of the main result of Gogolev and Guysinsky in [9]. Moreover, we get the same result for partially hyperbolic diffeomorphisms derived from Anosov on T3.
△ Less
Submitted 15 June, 2023; v1 submitted 23 October, 2022;
originally announced October 2022.
-
A fixed-time inverse-free dynamical system for solving the system of absolute value equations
Authors:
Xuehua Li,
Dongmei Yu,
Yinong Yang,
Deren Han,
Cairong Chen
Abstract:
In this paper, an inverse-free dynamical system with fixed-time convergence is presented to solve the system of absolute value equations (AVEs). Under a mild condition, it is proved that the solution of the proposed dynamical system converges to the solution of the AVEs. Moreover, in contrast to the existing inverse-free dynamical system \cite{chen2021}, a conservative settling-time of the propose…
▽ More
In this paper, an inverse-free dynamical system with fixed-time convergence is presented to solve the system of absolute value equations (AVEs). Under a mild condition, it is proved that the solution of the proposed dynamical system converges to the solution of the AVEs. Moreover, in contrast to the existing inverse-free dynamical system \cite{chen2021}, a conservative settling-time of the proposed method is given. Numerical simulations illustrate the effectiveness of the new method.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
A dynamical system based on projection operator for solving absolute value equations associated with second-order cone
Authors:
Cairong Chen,
Dongmei Yu,
Deren Han,
Changfeng Ma
Abstract:
A new equivalent reformulation of the absolute value equations associated with second-order cone (SOCAVEs) is emphasised, from which a dynamical system based on projection operator for solving SOCAVEs is constructed. Under proper assumptions, the equilibrium points of the dynamical system exist and could be (globally) asymptotically stable. Some numerical simulations are given to show the effectiv…
▽ More
A new equivalent reformulation of the absolute value equations associated with second-order cone (SOCAVEs) is emphasised, from which a dynamical system based on projection operator for solving SOCAVEs is constructed. Under proper assumptions, the equilibrium points of the dynamical system exist and could be (globally) asymptotically stable. Some numerical simulations are given to show the effectiveness of the proposed method.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
On finite termination of the generalized Newton method for solving absolute value equations
Authors:
Jia Tang,
Wenli Zheng,
Cairong Chen,
Dongmei Yu,
Deren Han
Abstract:
Motivated by the framework constructed by Brugnano and Casulli $[$SIAM J. Sci. Comput. 30: 463--472, 2008$]$, we analyze the finite termination property of the generalized Netwon method (GNM) for solving the absolute value equation (AVE). More precisely, for some special matrices, GNM is terminated in at most $2n + 2$ iterations. A new result for the unique solvability and unsolvability of the AVE…
▽ More
Motivated by the framework constructed by Brugnano and Casulli $[$SIAM J. Sci. Comput. 30: 463--472, 2008$]$, we analyze the finite termination property of the generalized Netwon method (GNM) for solving the absolute value equation (AVE). More precisely, for some special matrices, GNM is terminated in at most $2n + 2$ iterations. A new result for the unique solvability and unsolvability of the AVE is obtained. Numerical experiments are given to demonstrate the theoretical analysis.
△ Less
Submitted 10 July, 2022;
originally announced July 2022.
-
Nontrivial global solutions to some quasilinear wave equations in three space dimensions
Authors:
Dongxiao Yu
Abstract:
In this paper, we seek to construct nontrivial global solutions to some quasilinear wave equations in three space dimensions. We first present a conditional result on the construction of nontrivial global solutions to a general system of quasilinear wave equations. Assuming that a global solution to the geometric reduced system exists and satisfies several well-chosen pointwise estimates, we find…
▽ More
In this paper, we seek to construct nontrivial global solutions to some quasilinear wave equations in three space dimensions. We first present a conditional result on the construction of nontrivial global solutions to a general system of quasilinear wave equations. Assuming that a global solution to the geometric reduced system exists and satisfies several well-chosen pointwise estimates, we find a matching exact global solution to the original wave equations. Such a conditional result is then applied to two types of equations which are of great interest. One is John's counterexamples $\Box u=u_t^2$ or $\Box u=u_t u_{tt}$, and the other is the 3D compressible Euler equations with no vorticity. We explicitly construct global solutions to the corresponding geometric reduced systems and show that these global solutions satisfy the required pointwise bounds. As a result, there exists a large family of nontrivial global solutions to these two types of equations.
△ Less
Submitted 6 October, 2024; v1 submitted 27 April, 2022;
originally announced April 2022.
-
A non-monotone smoothing Newton algorithm for solving the system of generalized absolute value equations
Authors:
Cairong Chen,
Dongmei Yu,
Deren Han,
Changfeng Ma
Abstract:
The system of generalized absolute value equations (GAVE) has attracted more and more attention in the optimization community. In this paper, by introducing a smoothing function, we develop a smoothing Newton algorithm with non-monotone line search to solve the GAVE. We show that the non-monotone algorithm is globally and locally quadratically convergent under a weaker assumption than those given…
▽ More
The system of generalized absolute value equations (GAVE) has attracted more and more attention in the optimization community. In this paper, by introducing a smoothing function, we develop a smoothing Newton algorithm with non-monotone line search to solve the GAVE. We show that the non-monotone algorithm is globally and locally quadratically convergent under a weaker assumption than those given in most existing algorithms for solving the GAVE. Numerical results are given to demonstrate the viability and efficiency of the approach.
△ Less
Submitted 26 November, 2021;
originally announced November 2021.
-
A uniqueness theorem for 3D semilinear wave equations satisfying the null condition
Authors:
Dongxiao Yu
Abstract:
In this paper, we prove a uniqueness theorem for a system of semilinear wave equations satisfying the null condition in $\mathbb{R}^{1+3}$. Suppose that two global solutions with $C_c^\infty$ initial data have equal initial data outside a ball and equal radiation fields outside a light cone. We show that these two solutions are equal either outside a hyperboloid or everywhere in the spacetime, dep…
▽ More
In this paper, we prove a uniqueness theorem for a system of semilinear wave equations satisfying the null condition in $\mathbb{R}^{1+3}$. Suppose that two global solutions with $C_c^\infty$ initial data have equal initial data outside a ball and equal radiation fields outside a light cone. We show that these two solutions are equal either outside a hyperboloid or everywhere in the spacetime, depending on the sizes of the ball and the light cone.
△ Less
Submitted 28 September, 2022; v1 submitted 30 September, 2021;
originally announced September 2021.
-
Asymptotic completeness for a scalar quasilinear wave equation satisfying the weak null condition
Authors:
Dongxiao Yu
Abstract:
In this paper, we prove the first asymptotic completeness result for a scalar quasilinear wave equation satisfying the weak null condition. The main tool we use in the study of this equation is the geometric reduced system introduced in arXiv:2002.05355. Starting from a global solution $u$ to the quasilinear wave equation, we rigorously show that well chosen asymptotic variables solve the same red…
▽ More
In this paper, we prove the first asymptotic completeness result for a scalar quasilinear wave equation satisfying the weak null condition. The main tool we use in the study of this equation is the geometric reduced system introduced in arXiv:2002.05355. Starting from a global solution $u$ to the quasilinear wave equation, we rigorously show that well chosen asymptotic variables solve the same reduced system with small error terms. This allows us to recover the scattering data for our system, as well as to construct a matching exact solution to the reduced system.
△ Less
Submitted 28 September, 2021; v1 submitted 24 May, 2021;
originally announced May 2021.
-
An inexact framework of the Newton-based matrix splitting iterative method for the generalized absolute value equation
Authors:
Dongmei Yu,
Cairong Chen,
Deren Han
Abstract:
An inexact framework of the Newton-based matrix splitting (INMS) iterative method is developed to solve the generalized absolute value equation, whose exact version was proposed by Zhou, Wu and Li [H.-Y. Zhou, S.-L. Wu and C.-X. Li, \textit{J. Comput. Appl. Math.}, 394 (2021), 113578]. Global linear convergence of the INMS iterative method is investigated in detail. Some numerical results are give…
▽ More
An inexact framework of the Newton-based matrix splitting (INMS) iterative method is developed to solve the generalized absolute value equation, whose exact version was proposed by Zhou, Wu and Li [H.-Y. Zhou, S.-L. Wu and C.-X. Li, \textit{J. Comput. Appl. Math.}, 394 (2021), 113578]. Global linear convergence of the INMS iterative method is investigated in detail. Some numerical results are given to show the superiority of the INMS iterative method.
△ Less
Submitted 14 February, 2022; v1 submitted 18 March, 2021;
originally announced March 2021.
-
An inexact Douglas-Rachford splitting method for solving absolute value equations
Authors:
Cairong Chen,
Dongmei Yu,
Deren Han
Abstract:
The last two decades witnessed the increasing of the interests on the absolute value equations (AVE) of finding $x\in\mathbb{R}^n$ such that $Ax-|x|-b=0$, where $A\in \mathbb{R}^{n\times n}$ and $b\in \mathbb{R}^n$. In this paper, we pay our attention on designing efficient algorithms. To this end, we reformulate AVE to a generalized linear complementarity problem (GLCP), which, among the equivale…
▽ More
The last two decades witnessed the increasing of the interests on the absolute value equations (AVE) of finding $x\in\mathbb{R}^n$ such that $Ax-|x|-b=0$, where $A\in \mathbb{R}^{n\times n}$ and $b\in \mathbb{R}^n$. In this paper, we pay our attention on designing efficient algorithms. To this end, we reformulate AVE to a generalized linear complementarity problem (GLCP), which, among the equivalent forms, is the most economical one in the sense that it does not increase the dimension of the variables. For solving the GLCP, we propose an inexact Douglas-Rachford splitting method which can adopt a relative error tolerance. As a consequence, in the inner iteration processes, we can employ the LSQR method ([C.C. Paige and M.A. Saunders, ACM Trans. Mathe. Softw. (TOMS), 8 (1982), pp. 43--71]) to find a qualified approximate solution for each subproblem, which makes the cost per iteration very low. We prove the convergence of the algorithm and establish its global linear rate of convergence. Comparing results with the popular algorithms such as the exact generalized Newton method [O.L. Mangasarian, Optim. Lett., 1 (2007), pp. 3--8], the inexact semi-smooth Newton method [J.Y.B. Cruz, O.P. Ferreira and L.F. Prudente, Comput. Optim. Appl., 65 (2016), pp. 93--108] and the exact SOR-like method [Y.-F. Ke and C.-F. Ma, Appl. Math. Comput., 311 (2017), pp. 195--202] are reported, which indicate that the proposed algorithm is very promising. Moreover, our method also extends the range of numerically solvable of the AVE; that is, it can deal with not only the case that $\|A^{-1}\|<1$, the commonly used in those existing literature, but also the case where $\|A^{-1}\|=1$.
△ Less
Submitted 16 March, 2021;
originally announced March 2021.
-
Analysis and Optimization for Large-Scale LoRa Networks: Throughput Fairness and Scalability
Authors:
Jiangbin Lyu,
Dan Yu,
Liqun Fu
Abstract:
LoRa networks are pivotally enabling Long Range connectivity to low-cost and power-constrained user equipments (UEs) in a wide area, whereas a critical issue is to effectively allocate wireless resources to support potentially massive UEs while resolving the prominent near-far fairness issue, which is challenging due to the lack of tractable analytical model and the practical requirement for low-c…
▽ More
LoRa networks are pivotally enabling Long Range connectivity to low-cost and power-constrained user equipments (UEs) in a wide area, whereas a critical issue is to effectively allocate wireless resources to support potentially massive UEs while resolving the prominent near-far fairness issue, which is challenging due to the lack of tractable analytical model and the practical requirement for low-complexity and low-overhead design. Leveraging on stochastic geometry, especially the Poisson rain model, we derive (semi-) closed-form formulas for the aggregate interference distribution, packet success probability and hence system throughput in both single-cell and multi-cell setups with frequency reuse, by accounting for channel fading, random UE distribution, partial packet overlapping, and/or multi-gateway packet reception. The analytical formulas require only average channel statistics and spatial UE distribution, which enable tractable network performance evaluation and incubate our proposed Iterative Balancing (IB) method that quickly yields high-level policies of joint spreading factor (SF) allocation, power control, and duty cycle adjustment for gauging the average max-min UE throughput or supported UE density with rate requirements. Numerical results validate the analytical formulas and the effectiveness of our proposed optimization scheme, which greatly alleviates the near-far fairness issue and reduces the spatial power consumption, while significantly improving the cell-edge throughput as well as the spatial (sum) throughput for the majority of UEs, by adapting to the UE/gateway densities.
△ Less
Submitted 5 November, 2021; v1 submitted 17 August, 2020;
originally announced August 2020.
-
Modified wave operators for a scalar quasilinear wave equation satisfying the weak null condition
Authors:
Dongxiao Yu
Abstract:
We prove the existence of the modified wave operators for a scalar quasilinear wave equation satisfying the weak null condition. This is accomplished in three steps. First, we derive a new reduced asymptotic system for the quasilinear wave equation by modifying Hörmander's method. Next, we construct an approximate solution, by solving our new reduced system given some scattering data at infinite t…
▽ More
We prove the existence of the modified wave operators for a scalar quasilinear wave equation satisfying the weak null condition. This is accomplished in three steps. First, we derive a new reduced asymptotic system for the quasilinear wave equation by modifying Hörmander's method. Next, we construct an approximate solution, by solving our new reduced system given some scattering data at infinite time. Finally, we prove that the quasilinear wave equation has a global solution which agrees with the approximate solution at infinite time.
△ Less
Submitted 2 November, 2020; v1 submitted 13 February, 2020;
originally announced February 2020.
-
Optimal parameter for the SOR-like iteration method for solving the system of absolute value equations
Authors:
Cairong Chen,
Dongmei Yu,
Deren Han
Abstract:
The SOR-like iteration method for solving the absolute value equations~(AVE) of finding a vector $x$ such that $Ax - |x| - b = 0$ with $ν= \|A^{-1}\|_2 < 1$ is investigated. The convergence conditions of the SOR-like iteration method proposed by Ke and Ma ([{\em Appl. Math. Comput.}, 311:195--202, 2017]) are revisited and a new proof is given, which exhibits some insights in determining the conver…
▽ More
The SOR-like iteration method for solving the absolute value equations~(AVE) of finding a vector $x$ such that $Ax - |x| - b = 0$ with $ν= \|A^{-1}\|_2 < 1$ is investigated. The convergence conditions of the SOR-like iteration method proposed by Ke and Ma ([{\em Appl. Math. Comput.}, 311:195--202, 2017]) are revisited and a new proof is given, which exhibits some insights in determining the convergent region and the optimal iteration parameter. Along this line, the optimal parameter which minimizes $\|T_ν(ω)\|_2$ with $$T_ν(ω) = \left(\begin{array}{cc} |1-ω| & ω^2ν\\ |1-ω| & |1-ω| +ω^2ν\end{array}\right)$$ and the approximate optimal parameter which minimizes $η_ν(ω) =\max\{|1-ω|,νω^2\}$ are explored. The optimal and approximate optimal parameters are iteration-independent and the bigger value of $ν$ is, the smaller convergent region of the iteration parameter $ω$ is. Numerical results are presented to demonstrate that the SOR-like iteration method with the optimal parameter is superior to that with the approximate optimal parameter proposed by Guo, Wu and Li ([{\em Appl. Math. Lett.}, 97:107--113, 2019]). In some situation, the SOR-like itration method with the optimal parameter performs better, in terms of CPU time, than the generalized Newton method (Mangasarian, [{\em Optim. Lett.}, 3:101--108, 2009]) for solving the AVE.
△ Less
Submitted 12 March, 2021; v1 submitted 16 January, 2020;
originally announced January 2020.
-
Nonparametric principal subspace regression
Authors:
Mark Koudstaal,
Dengdeng Yu,
Dehan Kong,
Fang Yao
Abstract:
In scientific applications, multivariate observations often come in tandem with temporal or spatial covariates, with which the underlying signals vary smoothly. The standard approaches such as principal component analysis and factor analysis neglect the smoothness of the data, while multivariate linear or nonparametric regression fail to leverage the correlation information among multivariate resp…
▽ More
In scientific applications, multivariate observations often come in tandem with temporal or spatial covariates, with which the underlying signals vary smoothly. The standard approaches such as principal component analysis and factor analysis neglect the smoothness of the data, while multivariate linear or nonparametric regression fail to leverage the correlation information among multivariate response variables. We propose a novel approach named nonparametric principal subspace regression to overcome these issues. By decoupling the model discrepancy, a simple and general two-step framework is introduced, which leaves much flexibility in choice of model fitting. We establish theoretical property of the general framework, and offer implementation procedures that fulfill requirements and enjoy the theoretical guarantee. We demonstrate the favorable finite-sample performance of the proposed method through simulations and a real data application from an electroencephalogram study.
△ Less
Submitted 12 October, 2019; v1 submitted 7 October, 2019;
originally announced October 2019.
-
On the Leaders' Graphical Characterization for Controllability of Path Related Graphs
Authors:
Li Dai,
Dianlong Yu,
Zheng Xie
Abstract:
The problem of leaders location plays an important role in the controllability of undirected graphs.The concept of minimal perfect critical vertex set is introduced by drawing support from the eigenvector of Laplace matrix. Using the notion of minimal perfect critical vertex set, the problem of finding the minimum number of controllable leader vertices is transformed into the problem of finding al…
▽ More
The problem of leaders location plays an important role in the controllability of undirected graphs.The concept of minimal perfect critical vertex set is introduced by drawing support from the eigenvector of Laplace matrix. Using the notion of minimal perfect critical vertex set, the problem of finding the minimum number of controllable leader vertices is transformed into the problem of finding all minimal perfect critical vertex sets. Some necessary and sufficient conditions for special minimal perfect critical vertex sets are provided, such as minimal perfect critical 2 vertex set, and minimal perfect critical vertex set of path or path related graphs. And further, the leaders location problem for path graphs is solved completely by the algorithm provided in this paper. An interesting result that there never exist a minimal perfect critical 3 vertex set is proved, too.
△ Less
Submitted 7 June, 2019;
originally announced June 2019.
-
Investigating the difference in mechanical stability of retained austenite in bainitic and martensitic high-carbon bearing steels using in situ neutron diffraction and crystal plasticity modeling
Authors:
Rohit Voothaluru,
Vikram Bedekar,
Dunji Yu,
Qingge Xie,
Ke An,
R Scott Hyde
Abstract:
In situ neutron diffraction of the uniaxial tension test was used to study the effect of the surrounding matrix microstructure on the mechanical stability of retained austenite in high-carbon bearing steels. Comparing the samples with bainitic microstructures to those with martensitic ones it was found that the retained austenite in a bainitic matrix starts transforming into martensite at a lower…
▽ More
In situ neutron diffraction of the uniaxial tension test was used to study the effect of the surrounding matrix microstructure on the mechanical stability of retained austenite in high-carbon bearing steels. Comparing the samples with bainitic microstructures to those with martensitic ones it was found that the retained austenite in a bainitic matrix starts transforming into martensite at a lower strain compared to that within a martensitic matrix. On the other hand, the rate of transformation of the austenite was found to be higher within a martensitic microstructure. Crystal plasticity modeling was used to analyze the transformation phenomenon in these two microstructures and determine the effect of surrounding microstructure on elastic, plastic and transformation components of the strain. The results showed that the predominant difference in the deformation accumulated was from the transformation strain and the critical transformation driving force within the two microstructures. The retained austenite was more stable for identical loading conditions in case of martensitic matrix compared to the bainitic one. It was also observed that the initial volume fraction of retained austenite within the bainitic matrix would alter the onset of transformation to martensite but not the rate of transformation.
△ Less
Submitted 17 July, 2018;
originally announced July 2018.
-
A Decoupled Data Based Approach to Stochastic Optimal Control Problems
Authors:
Dan Yu,
Mohammandhussen Rafieisakhaei,
Suman Chakravorty
Abstract:
This paper studies the stochastic optimal control problem for systems with unknown dynamics. A novel decoupled data based control (D2C) approach is proposed, which solves the problem in a decoupled "open loop-closed loop" fashion that is shown to be near-optimal. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system u…
▽ More
This paper studies the stochastic optimal control problem for systems with unknown dynamics. A novel decoupled data based control (D2C) approach is proposed, which solves the problem in a decoupled "open loop-closed loop" fashion that is shown to be near-optimal. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system using a standard nonlinear programming (NLP) solver. Then a Linear Quadratic Regulator (LQR) controller is designed for the nominal trajectory-dependent linearized system which is learned using input-output experimental data. Computational examples are used to illustrate the performance of the proposed approach with three benchmark problems.
△ Less
Submitted 10 September, 2018; v1 submitted 1 July, 2018;
originally announced July 2018.
-
Path model for an extremal weight module over the quantized hyperbolic Kac-Moody algebra of rank 2
Authors:
Daisuke Sagaki,
Dongxiao Yu
Abstract:
Let $\mathfrak{g}$ be a hyperbolic Kac-Moody algebra of rank 2, and set $λ=Λ_{1} - Λ_{2}$, where $Λ_{1}$, $Λ_{2}$ are the fundamental weights. Denote by $V(λ)$ the extremal weight module of extremal weight $λ$ with $v_λ$ the extremal weight vector, and by $\mathcal{B}(λ)$ the crystal basis of $V(λ)$ with $u_λ$ the element corresponding to $v_λ$. We prove that (i) $\mathcal{B}(λ)$ is connected, (ii…
▽ More
Let $\mathfrak{g}$ be a hyperbolic Kac-Moody algebra of rank 2, and set $λ=Λ_{1} - Λ_{2}$, where $Λ_{1}$, $Λ_{2}$ are the fundamental weights. Denote by $V(λ)$ the extremal weight module of extremal weight $λ$ with $v_λ$ the extremal weight vector, and by $\mathcal{B}(λ)$ the crystal basis of $V(λ)$ with $u_λ$ the element corresponding to $v_λ$. We prove that (i) $\mathcal{B}(λ)$ is connected, (ii) the subset $\mathcal{B}(λ)_μ$ of elements of weight $μ$ in $\mathcal{B}(λ)$ is a finite set for every integral weight $μ$, and $\mathcal{B}(λ)_λ = \{u_λ\}$, (iii) every extremal element in $\mathcal{B}(λ)$ is contained in the Weyl group orbit of $u_λ$, (iv) $V(λ)$ is irreducible. Finally, we prove that the crystal basis $\mathcal{B}(λ)$ is isomorphic, as a crystal, to the crystal $\mathbb{B}(λ)$ of Lakshmibai-Seshadri paths of shape $λ$.
△ Less
Submitted 10 August, 2018; v1 submitted 4 December, 2017;
originally announced December 2017.
-
A Separation-based Approach to Data-based Control for Large-Scale Partially Observed Systems
Authors:
Dan Yu,
Mohammadhussein Rafieisakhaei,
Suman Chakravorty
Abstract:
This paper studies the partially observed stochastic optimal control problem for systems with state dynamics governed by partial differential equations (PDEs) that leads to an extremely large problem. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Next, a Linear Quadratic Gaussian (LQG) controller is designed…
▽ More
This paper studies the partially observed stochastic optimal control problem for systems with state dynamics governed by partial differential equations (PDEs) that leads to an extremely large problem. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Next, a Linear Quadratic Gaussian (LQG) controller is designed for the nominal trajectory-dependent linearized system which is identified using input-output experimental data consisting of the impulse responses of the optimized nominal system. A computational nonlinear heat example is used to illustrate the performance of the proposed approach.
△ Less
Submitted 2 November, 2017;
originally announced November 2017.
-
A partisan districting protocol with provably nonpartisan outcomes
Authors:
Wesley Pegden,
Ariel D. Procaccia,
Dingli Yu
Abstract:
We design and analyze a protocol for dividing a state into districts, where parties take turns proposing a division, and freezing a district from the other party's proposed division. We show that our protocol has predictable and provable guarantees for both the number of districts in which each party has a majority of supporters, and the extent to which either party has the power to pack a specifi…
▽ More
We design and analyze a protocol for dividing a state into districts, where parties take turns proposing a division, and freezing a district from the other party's proposed division. We show that our protocol has predictable and provable guarantees for both the number of districts in which each party has a majority of supporters, and the extent to which either party has the power to pack a specific population into a single district.
△ Less
Submitted 24 October, 2017;
originally announced October 2017.
-
An Alternative Approach to Functional Linear Partial Quantile Regression
Authors:
Dengdeng Yu,
Matthew Pietrosanu,
Ivan Mizera,
Bei Jiang,
Linglong Kong,
Wei Tu
Abstract:
Functional data such as curves and surfaces have become more and more common with modern technological advancements. The use of functional predictors remains challenging due to its inherent infinite-dimensionality. The common practice is to project functional data into a finite dimensional space. The popular partial least square (PLS) method has been well studied for the functional linear model [1…
▽ More
Functional data such as curves and surfaces have become more and more common with modern technological advancements. The use of functional predictors remains challenging due to its inherent infinite-dimensionality. The common practice is to project functional data into a finite dimensional space. The popular partial least square (PLS) method has been well studied for the functional linear model [1]. As an alternative, quantile regression provides a robust and more comprehensive picture of the conditional distribution of a response when it is non-normal, heavy-tailed, or contaminated by outliers. While partial quantile regression (PQR) was proposed in [2], no theoretical guarantees were provided due to the iterative nature of the algorithm and the non-smoothness of quantile loss function. To address these issues, we propose an alternative PQR (APQR) formulation with guaranteed convergence. This novel formulation motivates new theories and allows us to establish asymptotic properties. Numerical studies on a benchmark dataset show the superiority of our new approach. We also apply our novel method to a functional magnetic resonance imaging (fMRI) data to predict attention deficit hyperactivity disorder (ADHD) and a diffusion tensor imaging (DTI) dataset to predict Alzheimer's disease (AD).
△ Less
Submitted 30 January, 2023; v1 submitted 7 September, 2017;
originally announced September 2017.
-
Convergence Analysis of Optimization Algorithms
Authors:
HyoungSeok Kim,
JiHoon Kang,
WooMyoung Park,
SukHyun Ko,
YoonHo Cho,
DaeSung Yu,
YoungSook Song,
JungWon Choi
Abstract:
The regret bound of an optimization algorithms is one of the basic criteria for evaluating the performance of the given algorithm. By inspecting the differences between the regret bounds of traditional algorithms and adaptive one, we provide a guide for choosing an optimizer with respect to the given data set and the loss function. For analysis, we assume that the loss function is convex and its g…
▽ More
The regret bound of an optimization algorithms is one of the basic criteria for evaluating the performance of the given algorithm. By inspecting the differences between the regret bounds of traditional algorithms and adaptive one, we provide a guide for choosing an optimizer with respect to the given data set and the loss function. For analysis, we assume that the loss function is convex and its gradient is Lipschitz continuous.
△ Less
Submitted 6 July, 2017;
originally announced July 2017.
-
Geometry-Oblivious FMM for Compressing Dense SPD Matrices
Authors:
Chenhan D. Yu,
James Levitt,
Severin Reiz,
George Biros
Abstract:
We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, "compression," of an arbitrary dense symmetric positive definite (SPD) matrix. For many applications, GOFMM enables an approximate matrix-vector multiplication in $N \log N$ or even $N$ time, where $N$ is the matrix size. Compression requires $N \log N$ storage and work. In general, our sc…
▽ More
We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, "compression," of an arbitrary dense symmetric positive definite (SPD) matrix. For many applications, GOFMM enables an approximate matrix-vector multiplication in $N \log N$ or even $N$ time, where $N$ is the matrix size. Compression requires $N \log N$ storage and work. In general, our scheme belongs to the family of hierarchical matrix approximation methods. In particular, it generalizes the fast multipole method (FMM) to a purely algebraic setting by only requiring the ability to sample matrix entries. Neither geometric information (i.e., point coordinates) nor knowledge of how the matrix entries have been generated is required, thus the term "geometry-oblivious." Also, we introduce a shared-memory parallel scheme for hierarchical matrix computations that reduces synchronization barriers. We present results on the Intel Knights Landing and Haswell architectures, and on the NVIDIA Pascal architecture for a variety of matrices.
△ Less
Submitted 1 July, 2017;
originally announced July 2017.
-
Sparse Wavelet Estimation in Quantile Regression with Multiple Functional Predictors
Authors:
Dengdeng Yu,
Li Zhang,
Ivan Mizera,
Bei Jiang,
Linglong Kong
Abstract:
In this manuscript, we study quantile regression in partial functional linear model where response is scalar and predictors include both scalars and multiple functions. Wavelet basis are adopted to better approximate functional slopes while effectively detect local features. The sparse group lasso penalty is imposed to select important functional predictors while capture shared information among t…
▽ More
In this manuscript, we study quantile regression in partial functional linear model where response is scalar and predictors include both scalars and multiple functions. Wavelet basis are adopted to better approximate functional slopes while effectively detect local features. The sparse group lasso penalty is imposed to select important functional predictors while capture shared information among them. The estimation problem can be reformulated into a standard second-order cone program and then solved by an interior point method. We also give a novel algorithm by using alternating direction method of multipliers (ADMM) which was recently employed by many researchers in solving penalized quantile regression problems. The asymptotic properties such as the convergence rate and prediction error bound have been established. Simulations and a real data from ADHD-200 fMRI data are investigated to show the superiority of our proposed method.
△ Less
Submitted 2 December, 2017; v1 submitted 7 June, 2017;
originally announced June 2017.
-
Lakshmibai-Seshadri paths for hyperbolic Kac-Moody algebras of rank $2$
Authors:
Dongxiao Yu
Abstract:
Let $\mathfrak{g}$ be a hyperbolic Kac-Moody algebra of rank $2$, and set $λ: = Λ_1 - Λ_2$, where $Λ_1, Λ_2$ are the fundamental weights for $\mathfrak{g}$; note that $λ$ is neither dominant nor antidominant. Let $\mathbb{B}(λ)$ be the crystal of all Lakshmibai-Seshadri paths of shape $λ$. We prove that (the crystal graph of) $\mathbb{B}(λ)$ is connected. Furthermore, we give an explicit descripti…
▽ More
Let $\mathfrak{g}$ be a hyperbolic Kac-Moody algebra of rank $2$, and set $λ: = Λ_1 - Λ_2$, where $Λ_1, Λ_2$ are the fundamental weights for $\mathfrak{g}$; note that $λ$ is neither dominant nor antidominant. Let $\mathbb{B}(λ)$ be the crystal of all Lakshmibai-Seshadri paths of shape $λ$. We prove that (the crystal graph of) $\mathbb{B}(λ)$ is connected. Furthermore, we give an explicit description of Lakshmibai-Seshadri paths of shape $λ$.
△ Less
Submitted 6 August, 2017; v1 submitted 8 February, 2017;
originally announced February 2017.
-
An $N \log N$ Parallel Fast Direct Solver for Kernel Matrices
Authors:
Chenhan D. Yu,
William B. March,
George Biros
Abstract:
Kernel matrices appear in machine learning and non-parametric statistics. Given $N$ points in $d$ dimensions and a kernel function that requires $\mathcal{O}(d)$ work to evaluate, we present an $\mathcal{O}(dN\log N)$-work algorithm for the approximate factorization of a regularized kernel matrix, a common computational bottleneck in the training phase of a learning task. With this factorization,…
▽ More
Kernel matrices appear in machine learning and non-parametric statistics. Given $N$ points in $d$ dimensions and a kernel function that requires $\mathcal{O}(d)$ work to evaluate, we present an $\mathcal{O}(dN\log N)$-work algorithm for the approximate factorization of a regularized kernel matrix, a common computational bottleneck in the training phase of a learning task. With this factorization, solving a linear system with a kernel matrix can be done with $\mathcal{O}(N\log N)$ work. Our algorithm only requires kernel evaluations and does not require that the kernel matrix admits an efficient global low rank approximation. Instead our factorization only assumes low-rank properties for the off-diagonal blocks under an appropriate row and column ordering. We also present a hybrid method that, when the factorization is prohibitively expensive, combines a partial factorization with iterative methods. As a highlight, we are able to approximately factorize a dense $11M\times11M$ kernel matrix in 2 minutes on 3,072 x86 "Haswell" cores and a $4.5M\times4.5M$ matrix in 1 minute using 4,352 "Knights Landing" cores.
△ Less
Submitted 9 January, 2017;
originally announced January 2017.
-
Inv-ASKIT: A Parallel Fast Diret Solver for Kernel Matrices
Authors:
Chenhan D. Yu,
William B. March,
Bo Xiao,
George Biros
Abstract:
We present a parallel algorithm for computing the approximate factorization of an $N$-by-$N$ kernel matrix. Once this factorization has been constructed (with $N \log^2 N $ work), we can solve linear systems with this matrix with $N \log N $ work. Kernel matrices represent pairwise interactions of points in metric spaces. They appear in machine learning, approximation theory, and computational phy…
▽ More
We present a parallel algorithm for computing the approximate factorization of an $N$-by-$N$ kernel matrix. Once this factorization has been constructed (with $N \log^2 N $ work), we can solve linear systems with this matrix with $N \log N $ work. Kernel matrices represent pairwise interactions of points in metric spaces. They appear in machine learning, approximation theory, and computational physics. Kernel matrices are typically dense (matrix multiplication scales quadratically with $N$) and ill-conditioned (solves can require 100s of Krylov iterations). Thus, fast algorithms for matrix multiplication and factorization are critical for scalability.
Recently we introduced ASKIT, a new method for approximating a kernel matrix that resembles N-body methods. Here we introduce INV-ASKIT, a factorization scheme based on ASKIT. We describe the new method, derive complexity estimates, and conduct an empirical study of its accuracy and scalability. We report results on real-world datasets including "COVTYPE" ($0.5$M points in 54 dimensions), "SUSY" ($4.5$M points in 8 dimensions) and "MNIST" (2M points in 784 dimensions) using shared and distributed memory parallelism. In our largest run we approximately factorize a dense matrix of size 32M $\times$ 32M (generated from points in 64 dimensions) on 4,096 Sandy-Bridge cores. To our knowledge these results improve the state of the art by several orders of magnitude.
△ Less
Submitted 3 February, 2016;
originally announced February 2016.
-
A Computationally Optimal Randomized Proper Orthogonal Decomposition Technique
Authors:
Dan Yu,
Suman Chakravorty
Abstract:
In this paper, we consider the model reduction problem of large-scale systems, such as systems obtained through the discretization of partial differential equations. We propose a computationally optimal randomized proper orthogonal decomposition (RPOD*) technique to obtain the reduced order model by perturbing the primal and adjoint system using Gaussian white noise. We show that the computations…
▽ More
In this paper, we consider the model reduction problem of large-scale systems, such as systems obtained through the discretization of partial differential equations. We propose a computationally optimal randomized proper orthogonal decomposition (RPOD*) technique to obtain the reduced order model by perturbing the primal and adjoint system using Gaussian white noise. We show that the computations required by the RPOD* algorithm is orders of magnitude cheaper when compared to the balanced proper orthogonal decomposition (BPOD) algorithm and BPOD output projection algorithm while the performance of the RPOD* algorithm is much better than BPOD output projection algorithm. It is optimal in the sense that a minimal number of snapshots is needed. We also relate the RPOD* algorithm to random projection algorithms. The method is tested on two advection-diffusion equations.
△ Less
Submitted 3 May, 2016; v1 submitted 18 September, 2015;
originally announced September 2015.
-
An autoregressive (AR) model based stochastic unknown input realization and filtering technique
Authors:
Dan Yu,
Suman Chakravorty
Abstract:
This paper studies the state estimation problem of linear discrete-time systems with stochastic unknown inputs. The unknown input is a wide-sense stationary process while no other prior informaton needs to be known. We propose an autoregressive (AR) model based unknown input realization technique which allows us to recover the input statistics from the output data by solving an appropriate least s…
▽ More
This paper studies the state estimation problem of linear discrete-time systems with stochastic unknown inputs. The unknown input is a wide-sense stationary process while no other prior informaton needs to be known. We propose an autoregressive (AR) model based unknown input realization technique which allows us to recover the input statistics from the output data by solving an appropriate least squares problem, then fit an AR model to the recovered input statistics and construct an innovations model of the unknown inputs using the eigensystem realization algorithm (ERA). An augmented state system is constructed and the standard Kalman filter is applied for state estimation. A reduced order model (ROM) filter is also introduced to reduce the computational cost of the Kalman filter. Two numerical examples are given to illustrate the procedure.
△ Less
Submitted 4 April, 2016; v1 submitted 23 July, 2014;
originally announced July 2014.
-
Generalized Convex Functions and Some Inequalities on Fractal Sets
Authors:
Huixia Mo,
Xin Sui,
Dongyan Yu
Abstract:
In the paper, we introduce the generalized convex function on fractal sets of real line numbers and study the properties of the generalized convex function. Based on these properties, we establish the generalized Jensen inequality and generalized Hermite-Hadamard inequality. Furthermore,some applications are given.
In the paper, we introduce the generalized convex function on fractal sets of real line numbers and study the properties of the generalized convex function. Based on these properties, we establish the generalized Jensen inequality and generalized Hermite-Hadamard inequality. Furthermore,some applications are given.
△ Less
Submitted 27 June, 2014; v1 submitted 15 April, 2014;
originally announced April 2014.
-
A Randomized Proper Orthogonal Decomposition Technique
Authors:
Dan Yu,
Suman Chakravorty
Abstract:
In this paper, we consider the problem of model reduction of large scale systems, such as those obtained through the discretization of PDEs. We propose a randomized proper orthogonal decomposition (RPOD) technique to obtain the reduced order models by randomly choosing a subset of the inputs/outputs of the system to construct a suitable small sized Hankel matrix from the full Hankel matrix. It is…
▽ More
In this paper, we consider the problem of model reduction of large scale systems, such as those obtained through the discretization of PDEs. We propose a randomized proper orthogonal decomposition (RPOD) technique to obtain the reduced order models by randomly choosing a subset of the inputs/outputs of the system to construct a suitable small sized Hankel matrix from the full Hankel matrix. It is shown that the RPOD technique is computationally orders of magnitude cheaper when compared to techniques such as the Eigensystem Realization algorithm (ERA)/Balanced POD (BPOD) while obtaining the same information in terms of the number and accuracy of the dominant modes. The method is tested on several different advection-diffusion equations.
△ Less
Submitted 13 December, 2013;
originally announced December 2013.
-
On local property of absolute summability of factored Fourier series
Authors:
Hüseyin Bor,
Dansheng Yu,
Ping Zhou
Abstract:
We establish two general theorems on the local properties of the absolute summability of factored Fourier series by applying a recently defined absolute summability, $\left\vert A,α_{n}\right\vert _{k}$ summability, and the class $\mathcal{S}\left( α_{n},φ_{n}\right) $, which generalize some well known results and can be applied to improve many classical absolute summability methods.
We establish two general theorems on the local properties of the absolute summability of factored Fourier series by applying a recently defined absolute summability, $\left\vert A,α_{n}\right\vert _{k}$ summability, and the class $\mathcal{S}\left( α_{n},φ_{n}\right) $, which generalize some well known results and can be applied to improve many classical absolute summability methods.
△ Less
Submitted 29 January, 2013;
originally announced January 2013.
-
On $L^{1}$-Convergence of Fourier Series Under $MVBV$ Condition
Authors:
Dan Sheng Yu,
Ping Zhou,
Song Ping Zhou
Abstract:
Let $f\in L_{2π}$ be a real-valued even function with its Fourier series $ \frac{a_{0}}{2}+\sum_{n=1}^{\infty}a_{n}\cos nx,$ and let $S_{n}(f,x), n\geq 1,$ be the $n$-th partial sum of the Fourier series. It is well-known that if the nonnegative sequence $\{a_{n}\}$ is decreasing and $\lim\limits_{n\to \infty}a_{n}=0$, then…
▽ More
Let $f\in L_{2π}$ be a real-valued even function with its Fourier series $ \frac{a_{0}}{2}+\sum_{n=1}^{\infty}a_{n}\cos nx,$ and let $S_{n}(f,x), n\geq 1,$ be the $n$-th partial sum of the Fourier series. It is well-known that if the nonnegative sequence $\{a_{n}\}$ is decreasing and $\lim\limits_{n\to \infty}a_{n}=0$, then $$ \lim\limits_{n\to \infty}\Vert f-S_{n}(f)\Vert_{L}=0 {if and only if} \lim\limits_{n\to \infty}a_{n}\log n=0. $$ We weaken the monotone condition in this classical result to the so-called mean value bounded variation ($MVBV$) condition. The generalization of the above classical result in real-valued function space is presented as a special case of the main result in this paper which gives the $L^{1}$% -convergence of a function $f\in L_{2π}$ in complex space. We also give results on $L^{1}$-approximation of a function $f\in L_{2π}$ under the $% MVBV$ condition.
△ Less
Submitted 14 April, 2007;
originally announced April 2007.