Skip to main content

Showing 1–30 of 30 results for author: Xiao, N

Searching in archive math. Search in all archives.
.
  1. arXiv:2505.22040  [pdf, ps, other

    math.OC

    A Hybrid Subgradient Method for Nonsmooth Nonconvex Bilevel Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we focus on the nonconvex-nonconvex bilevel optimization problem (BLO), where both upper-level and lower-level objectives are nonconvex, with the upper-level problem potentially being nonsmooth. We develop a two-timescale momentum-accelerated subgradient method (TMG) that employs two-timescale stepsizes, and establish its local convergence when initialized within a sufficiently smal… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 27 pages

  2. arXiv:2505.02495  [pdf, ps, other

    math.OC

    An Exact Penalty Approach for Equality Constrained Optimization over a Convex Set

    Authors: Nachuan Xiao, Tianyun Tang, Shiwei Wang, Kim-Chuan Toh

    Abstract: In this paper, we consider the nonlinear constrained optimization problem (NCP) with constraint set $\{x \in \mathcal{X}: c(x) = 0\}$, where $\mathcal{X}$ is a closed convex subset of $\mathbb{R}^n$. We propose an exact penalty approach, named constraint dissolving approach, that transforms (NCP) into its corresponding constraint dissolving problem (CDP). The transformed problem (CDP) admits… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 34 pages

  3. arXiv:2412.20008  [pdf, other

    math.OC

    Stochastic optimization over expectation-formulated generalized Stiefel manifold

    Authors: Linshuo Jiang, Nachuan Xiao, Xin Liu

    Abstract: In this paper, we consider a class of stochastic optimization problems over the expectation-formulated generalized Stiefel manifold (SOEGS), where the objective function $f$ is continuously differentiable. We propose a novel constraint dissolving penalty function with a customized penalty term (CDFDP), which maintains the same order of differentiability as $f$. Our theoretical analysis establishes… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

    Comments: 34 pages

  4. arXiv:2409.04998  [pdf, other

    math.OC cs.DC stat.ML

    A Double Tracking Method for Optimization with Decentralized Generalized Orthogonality Constraints

    Authors: Lei Wang, Nachuan Xiao, Xin Liu

    Abstract: In this paper, we consider the decentralized optimization problems with generalized orthogonality constraints, where both the objective function and the constraint exhibit a distributed structure. Such optimization problems, albeit ubiquitous in practical applications, remain unsolvable by existing algorithms in the presence of distributed constraints. To address this issue, we convert the origina… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  5. arXiv:2408.17213  [pdf, ps, other

    math.OC

    A Minimization Approach for Minimax Optimization with Coupled Constraints

    Authors: Xiaoyin Hu, Kim-Chuan Toh, Shiwei Wang, Nachuan Xiao

    Abstract: In this paper, we focus on the nonconvex-strongly-concave minimax optimization problem (MCC), where the inner maximization subproblem contains constraints that couple the primal variable of the outer minimization problem. We prove that by introducing the dual variable of the inner maximization subproblem, (MCC) has the same first-order minimax points as a nonconvex-strongly-concave minimax optimiz… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 25 pages

  6. arXiv:2406.18287  [pdf, other

    math.OC

    Learning-rate-free Momentum SGD with Reshuffling Converges in Nonsmooth Nonconvex Optimization

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we propose a generalized framework for developing learning-rate-free momentum stochastic gradient descent (SGD) methods in the minimization of nonsmooth nonconvex functions, especially in training nonsmooth neural networks. Our framework adaptively generates learning rates based on the historical data of stochastic subgradients and iterates. Under mild conditions, we prove that our… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 26 pages

  7. arXiv:2405.06939  [pdf, other

    math.ST

    Tests for principal eigenvalues and eigenvectors

    Authors: Jianqing Fan, Yingying Li, Ningning Xia, Xinghua Zheng

    Abstract: We establish central limit theorems for principal eigenvalues and eigenvectors under a large factor model setting, and develop two-sample tests of both principal eigenvalues and principal eigenvectors. One important application is to detect structural breaks in large factor models. Compared with existing methods for detecting structural breaks, our tests provide unique insights into the source of… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  8. arXiv:2404.09438  [pdf, other

    math.OC cs.LG stat.ML

    Developing Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Kuangyu Ding, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we consider the minimization of a nonsmooth nonconvex objective function $f(x)$ over a closed convex subset $\mathcal{X}$ of $\mathbb{R}^n$, with additional nonsmooth nonconvex constraints $c(x) = 0$. We develop a unified framework for developing Lagrangian-based methods, which takes a single-step update to the primal variables by some subgradient methods in each iteration. These su… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures

  9. arXiv:2403.11565  [pdf, other

    math.OC cs.LG

    Convergence of Decentralized Stochastic Subgradient-based Methods for Nonsmooth Nonconvex functions

    Authors: Siyuan Zhang, Nachuan Xiao, Xin Liu

    Abstract: In this paper, we focus on the decentralized stochastic subgradient-based methods in minimizing nonsmooth nonconvex functions without Clarke regularity, especially in the decentralized training of nonsmooth neural networks. We propose a general framework that unifies various decentralized subgradient-based methods, such as decentralized stochastic subgradient descent (DSGD), DSGD with gradient-tra… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 35 pages

  10. arXiv:2401.03565  [pdf, other

    math.OC

    An Inexact Preconditioned Zeroth-order Proximal Method for Composite Optimization

    Authors: Shanglin Liu, Lei Wang, Nachuan Xiao, Xin Liu

    Abstract: In this paper, we consider the composite optimization problem, where the objective function integrates a continuously differentiable loss function with a nonsmooth regularization term. Moreover, only the function values for the differentiable part of the objective function are available. To efficiently solve this composite optimization problem, we propose a preconditioned zeroth-order proximal gra… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  11. arXiv:2310.08858  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Adam-family Methods with Decoupled Weight Decay in Deep Learning

    Authors: Kuangyu Ding, Nachuan Xiao, Kim-Chuan Toh

    Abstract: In this paper, we investigate the convergence properties of a wide class of Adam-family methods for minimizing quadratically regularized nonsmooth nonconvex optimization problems, especially in the context of training nonsmooth neural networks with weight decay. Motivated by the AdamW method, we propose a novel framework for Adam-family methods with decoupled weight decay. Within our framework, th… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 26 pages

  12. arXiv:2307.10053  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Stochastic Subgradient Methods with Guaranteed Global Stability in Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we focus on providing convergence guarantees for stochastic subgradient methods in minimizing nonsmooth nonconvex functions. We first investigate the global stability of a general framework for stochastic subgradient methods, where the corresponding differential inclusion admits a coercive Lyapunov function. We prove that, for any sequence of sufficiently small stepsizes and approxi… ▽ More

    Submitted 12 October, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 37 pages

  13. arXiv:2305.03938  [pdf, other

    math.OC cs.LG stat.ML

    Adam-family Methods for Nonsmooth Optimization with Convergence Guarantees

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we present a comprehensive study on the convergence properties of Adam-family methods for nonsmooth optimization, especially in the training of nonsmooth neural networks. We introduce a novel two-timescale framework that adopts a two-timescale updating scheme, and prove its convergence properties under mild assumptions. Our proposed framework encompasses various popular Adam-family… ▽ More

    Submitted 19 February, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 53 pages

  14. arXiv:2304.10092  [pdf, ps, other

    math.OC

    A Riemannian Dimension-reduced Second Order Method with Application in Sensor Network Localization

    Authors: Tianyun Tang, Kim-Chuan Toh, Nachuan Xiao, Yinyu Ye

    Abstract: In this paper, we propose a cubic-regularized Riemannian optimization method (RDRSOM), which partially exploits the second order information and achieves the iteration complexity of $\mathcal{O}(1/ε^{3/2})$. In order to reduce the per-iteration computational cost, we further propose a practical version of (RDRSOM), which is an extension of the well known Barzilai-Borwein method and achieves the it… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 19 pages

  15. arXiv:2304.01467  [pdf, ps, other

    math.OC

    A Partial Exact Penalty Function Approach for Constrained Optimization

    Authors: Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we focus on a class of constrained nonlinear optimization problems (NLP), where some of its equality constraints define a closed embedded submanifold $\mathcal{M}$ in $\mathbb{R}^n$. Although NLP can be solved directly by various existing approaches for constrained optimization in Euclidean space, these approaches usually fail to recognize the manifold structure of $\mathcal{M}$. To… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 27 pages

  16. arXiv:2212.02698  [pdf, other

    math.OC cs.MS

    CDOpt: A Python Package for a Class of Riemannian Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: Optimization over the embedded submanifold defined by constraints $c(x) = 0$ has attracted much interest over the past few decades due to its wide applications in various areas. Plenty of related optimization packages have been developed based on Riemannian optimization approaches, which rely on some basic geometrical materials of Riemannian manifolds, including retractions, vector transports, etc… ▽ More

    Submitted 12 October, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 48 pages

  17. arXiv:2208.00732  [pdf, ps, other

    math.OC

    An Improved Unconstrained Approach for Bilevel Optimization

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we focus on the nonconvex-strongly-convex bilevel optimization problem (BLO). In this BLO, the objective function of the upper-level problem is nonconvex and possibly nonsmooth, and the lower-level problem is smooth and strongly convex with respect to the underlying variable $y$. We show that the feasible region of BLO is a Riemannian manifold. Then we transform BLO to its correspon… ▽ More

    Submitted 23 December, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 27 pages, revised version

    MSC Class: 15A18; 65F15; 65K05; 90C06

  18. arXiv:2205.10500  [pdf, other

    math.OC

    A Constraint Dissolving Approach for Nonsmooth Optimization over the Stiefel Manifold

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: This paper focus on the minimization of a possibly nonsmooth objective function over the Stiefel manifold. The existing approaches either lack efficiency or can only tackle prox-friendly objective functions. We propose a constraint dissolving function named NCDF and show that it has the same first-order stationary points and local minimizers as the original problem in a neighborhood of the Stiefel… ▽ More

    Submitted 20 January, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: Revised version, 26 pages

  19. arXiv:2203.10319  [pdf, ps, other

    math.OC

    Dissolving Constraints for Riemannian Optimization

    Authors: Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we consider optimization problems over closed embedded submanifolds of $\mathbb{R}^n$, which are defined by the constraints $c(x) = 0$. We propose a class of constraint dissolving approaches for these Riemannian optimization problems. In these proposed approaches, solving a Riemannian optimization problem is transferred into the unconstrained minimization of a constraint dissolving… ▽ More

    Submitted 14 October, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

    Comments: 38 pages

  20. arXiv:2110.08986  [pdf, other

    math.OC

    Solving Optimization Problems over the Stiefel Manifold by Smooth Exact Penalty Function

    Authors: Nachuan Xiao, Xin Liu

    Abstract: In this paper, we present a novel penalty model called ExPen for optimization over the Stiefel manifold. Different from existing penalty functions for orthogonality constraints, ExPen adopts a smooth penalty function without using any first-order derivative of the objective function. We show that all the first-order stationary points of ExPen with a sufficiently large penalty parameter are either… ▽ More

    Submitted 18 December, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: revised version, 28 pages

  21. arXiv:2103.03514  [pdf, ps, other

    math.OC

    A Penalty-free Infeasible Approach for a Class of Nonsmooth Optimization Problems over the Stiefel Manifold

    Authors: Nachuan Xiao, Xin Liu, Ya-xiang Yuan

    Abstract: Transforming into an exact penalty function model with convex compact constraints yields efficient infeasible approaches for optimization problems with orthogonality constraints. For smooth and $\ell_{2,1}$-norm regularized cases, these infeasible approaches adopt simple and orthonormalization-free updating scheme and show their high efficiency in the test examples. However, to avoid orthonormaliz… ▽ More

    Submitted 28 March, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

  22. arXiv:1911.01543  [pdf, other

    math.NA physics.comp-ph physics.med-ph

    Physics driven reduced order model for real time blood flow simulations

    Authors: Sethuraman Sankaran, David Lesage, Rhea Tombropoulos, Nan Xiao, Hyun Jin Kim, David Spain, Michiel Schaap, Charles A. Taylor

    Abstract: Predictive modeling of blood flow and pressure have numerous applications ranging from non-invasive assessment of functional significance of disease to planning invasive procedures. While several such predictive modeling techniques have been proposed, their use in the clinic has been limited due in part to the significant time required to perform virtual interventions and compute the resultant cha… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

  23. arXiv:1908.08670  [pdf, other

    math.ST stat.AP

    On the estimation of high-dimensional integrated covariance matrix based on high-frequency data with multiple transactions

    Authors: Moming Wang, Ningning Xia, You Zhou

    Abstract: Due to the mechanism of recording, the presence of multiple transactions at each recording time becomes a common feature for high-frequency data in financial market. Using random matrix theory, this paper considers the estimation of integrated covariance (ICV) matrices of high-dimensional diffusion processes based on multiple high-frequency observations. We start by studying the estimator, the tim… ▽ More

    Submitted 5 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  24. arXiv:1611.06753  [pdf, ps, other

    math.ST stat.AP

    Shrinkage estimation of covariance matrix for portfolio choice with high frequency data

    Authors: Cheng Liu, Ningning Xia, Jun Yu

    Abstract: This paper examines the usefulness of high frequency data in estimating the covariance matrix for portfolio choice when the portfolio size is large. A computationally convenient nonlinear shrinkage estimator for the integrated covariance (ICV) matrix of financial assets is developed in two steps. The eigenvectors of the ICV are first constructed from a designed time variation adjusted realized cov… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  25. arXiv:1611.06744  [pdf, other

    math.ST

    Convergence rate of eigenvector empirical spectral distribution of large Wigner matrices

    Authors: Ningning Xia, Zhidong Bai

    Abstract: In this paper, we adopt the eigenvector empirical spectral distribution (VESD) to investigate the limiting behavior of eigenvectors of a large dimensional Wigner matrix W_n. In particular, we derive the optimal bound for the rate of convergence of the expected VESD of W_n to the semicircle law, which is of order O(n^{-1/2}) under the assumption of having finite 10th moment. We further show that th… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

  26. arXiv:1604.03638  [pdf, other

    math.ST

    On the inference about the spectral distribution of high-dimensional covariance matrix based on high-frequency noisy observations

    Authors: Ningning Xia, Xinghua Zheng

    Abstract: In practice, observations are often contaminated by noise, making the resulting sample covariance matrix a signal-plus-noise sample covariance matrix. Aiming to make inferences about the spectral distribution of the population covariance matrix under such a situation, we establish an asymptotic relationship that describes how the limiting spectral distribution of (signal) sample covariance matrice… ▽ More

    Submitted 1 March, 2017; v1 submitted 12 April, 2016; originally announced April 2016.

    Comments: arXiv admin note: text overlap with arXiv:1409.2121

  27. Mean Square Capacity of Power Constrained Fading Channels with Causal Encoders and Decoders

    Authors: Liang Xu, Lihua Xie, Nan Xiao

    Abstract: This paper is concerned with the mean square stabilization problem of discrete-time LTI systems over a power constrained fading channel. Different from existing research works, the channel considered in this paper suffers from both fading and additive noises. We allow any form of causal channel encoders/decoders, unlike linear encoders/decoders commonly studied in the literature. Sufficient condit… ▽ More

    Submitted 15 September, 2015; originally announced September 2015.

    Comments: Accepted by the 54th IEEE Conference on Decision and Control

  28. arXiv:1409.2121  [pdf, other

    math.ST

    On the inference about the spectra of high-dimensional covariance matrix based on noisy observations-with applications to integrated covolatility matrix inference in the presence of microstructure noise

    Authors: Ningning Xia, Xinghua Zheng

    Abstract: In practice, observations are often contaminated by noise, making the resulting sample covariance matrix to be an information-plus-noise-type covariance matrix. Aiming to make inferences about the spectra of the underlying true covariance matrix under such a situation, we establish an asymptotic relationship that describes how the limiting spectral distribution of (true) sample covariance matrices… ▽ More

    Submitted 22 August, 2015; v1 submitted 7 September, 2014; originally announced September 2014.

  29. arXiv:1408.3430  [pdf

    math.NA

    Interval-based parameter identification for structural static problems

    Authors: Naijia Xiao, Francesco Fedele, Rafi Muhanna

    Abstract: We present an interval-based approach for parameter identification in structural static inverse problems. The proposed inverse formulation exploits the Interval Finite Element Method (IFEM) combined with adjoint-based optimization. The inversion consists of a two-step algorithm: first, an estimate of the parameters is obtained by means of a deterministic iterative solver. Then, the algorithm switc… ▽ More

    Submitted 4 September, 2014; v1 submitted 14 August, 2014; originally announced August 2014.

  30. Convergence rates of eigenvector empirical spectral distribution of large dimensional sample covariance matrix

    Authors: Ningning Xia, Yingli Qin, Zhidong Bai

    Abstract: The eigenvector Empirical Spectral Distribution (VESD) is adopted to investigate the limiting behavior of eigenvectors and eigenvalues of covariance matrices. In this paper, we shall show that the Kolmogorov distance between the expected VESD of sample covariance matrix and the Marčenko-Pastur distribution function is of order $O(N^{-1/2})$. Given that data dimension $n$ to sample size $N$ ratio i… ▽ More

    Submitted 22 November, 2013; v1 submitted 20 November, 2013; originally announced November 2013.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1154 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1154

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 5, 2572-2607