Skip to main content

Showing 1–50 of 76 results for author: Ye, H

Searching in archive math. Search in all archives.
.
  1. arXiv:2506.15360  [pdf, ps, other

    math.NA

    Stochastic Diagonal Estimation Based on Matrix Quadratic Form Oracles

    Authors: Haishan Ye, Xiangyu Chang

    Abstract: We study the problem of estimating the diagonal of an implicitly given matrix $\Ab$. For such a matrix we have access to an oracle that allows us to evaluate the matrix quadratic form $ \ub^\top \Ab \ub$. Based on this query oracle, we propose a stochastic diagonal estimation method with random variable $\ub$ drawn from the standard Gaussian distribution. We provide the element-wise and norm-wise… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  2. arXiv:2505.11526  [pdf, ps, other

    math.OC cs.AI

    Code Retrieval for MILP Instance Generation

    Authors: Tianxing Yang, Huigen Ye, Hua Xu

    Abstract: Mixed-Integer Linear Programming (MILP) is widely used in fields such as scheduling, logistics, and planning. Enhancing the performance of MILP solvers, particularly learning-based solvers, requires substantial amounts of high-quality data. However, existing methods for MILP instance generation typically necessitate training a separate model for each problem class and are computationally intensive… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  3. arXiv:2504.11940  [pdf, ps, other

    math.RA

    $f$-vectors and $F$-invariant in generalized cluster algebras

    Authors: Huihui Ye, Changjian Fu

    Abstract: We establish certain fundamental properties of $f$-vectors and $F$-matrices for generalized cluster algebras, including the initial and final seed mutation formulas, the compatibility property and the symmetry property. Along the way, we also generalize the construction of $F$-invariant for generalized cluster algebras without assuming positivity and prove certain basic properties.

    Submitted 31 May, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: 17 pages

  4. arXiv:2502.05860  [pdf, other

    math.AP

    Global Dynamics of Nonlocal Diffusion Systems on Time-Varying Domains

    Authors: Xiandong Lin, Hailong Ye, Xiao-Qiang Zhao

    Abstract: We propose a class of nonlocal diffusion systems on time-varying domains, and fully characterize their asymptotic dynamics in the asymptotically fixed, time-periodic and unbounded cases. The kernel is not necessarily symmetric or compactly supported, provoking anisotropic diffusion or convective effects. Due to the nonlocal diffusion on time-varying domains in our systems, some significant challen… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: 36 pages, 4 figures

    MSC Class: 35B40; 35K57; 37C65; 92D25

  5. arXiv:2501.07201  [pdf, other

    cs.LG math.NA

    An Enhanced Zeroth-Order Stochastic Frank-Wolfe Framework for Constrained Finite-Sum Optimization

    Authors: Haishan Ye, Yinghui Huang, Hao Di, Xiangyu Chang

    Abstract: We propose an enhanced zeroth-order stochastic Frank-Wolfe framework to address constrained finite-sum optimization problems, a structure prevalent in large-scale machine-learning applications. Our method introduces a novel double variance reduction framework that effectively reduces the gradient approximation variance induced by zeroth-order oracles and the stochastic sampling variance from finit… ▽ More

    Submitted 22 January, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

    Comments: 35 pages, 4 figures, 3 tables

  6. arXiv:2412.20237  [pdf, other

    math.OC

    Distributionally Robust Fault Detection Trade-off Design with Prior Fault Information

    Authors: Yulin Feng, Hailang Jin, Steven X. Ding, Hao Ye, Chao Shang

    Abstract: The robustness of fault detection algorithms against uncertainty is crucial in the real-world industrial environment. Recently, a new probabilistic design scheme called distributionally robust fault detection (DRFD) has emerged and received immense interest. Despite its robustness against unknown distributions in practice, current DRFD focuses on the overall detectability of all possible faults ra… ▽ More

    Submitted 11 April, 2025; v1 submitted 28 December, 2024; originally announced December 2024.

  7. arXiv:2410.03720  [pdf, other

    math.OC cs.LG

    NeuralQP: A General Hypergraph-based Optimization Framework for Large-scale QCQPs

    Authors: Zhixiao Xiong, Fangyu Zong, Huigen Ye, Hua Xu

    Abstract: Machine Learning (ML) optimization frameworks have gained attention for their ability to accelerate the optimization of large-scale Quadratically Constrained Quadratic Programs (QCQPs) by learning shared problem structures. However, existing ML frameworks often rely heavily on strong problem assumptions and large-scale solvers. This paper introduces NeuralQP, a general hypergraph-based framework f… ▽ More

    Submitted 28 September, 2024; originally announced October 2024.

  8. arXiv:2405.17761  [pdf, other

    cs.LG math.OC

    Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient

    Authors: Hao Di, Haishan Ye, Yueling Zhang, Xiangyu Chang, Guang Dai, Ivor W. Tsang

    Abstract: Variance reduction techniques are designed to decrease the sampling variance, thereby accelerating convergence rates of first-order (FO) and zeroth-order (ZO) optimization methods. However, in composite optimization problems, ZO methods encounter an additional variance called the coordinate-wise variance, which stems from the random gradient estimation. To reduce this variance, prior works require… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.17343  [pdf, ps, other

    math.DS math.AG math.NT

    Bounded geometry for PCF-special subvarieties

    Authors: Laura DeMarco, Niki Myrto Mavraki, Hexi Ye

    Abstract: For each integer $d\geq 2$, let $M_d$ denote the moduli space of maps $f: \mathbb{P}^1\to \mathbb{P}^1$ of degree $d$. We study the geometric configurations of subsets of postcritically finite (or PCF) maps in $M_d$. A complex-algebraic subvariety $Y \subset M_d$ is said to be PCF-special if it contains a Zariski-dense set of PCF maps. Here we prove that there are only finitely many positive-dimen… ▽ More

    Submitted 4 November, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: minor revisions

  10. arXiv:2405.16126  [pdf, other

    math.OC cs.LG

    Near-Optimal Distributed Minimax Optimization under the Second-Order Similarity

    Authors: Qihao Zhou, Haishan Ye, Luo Luo

    Abstract: This paper considers the distributed convex-concave minimax optimization under the second-order similarity. We propose stochastic variance-reduced optimistic gradient sliding (SVOGS) method, which takes the advantage of the finite-sum structure in the objective by involving the mini-batch client sampling and variance reduction. We prove SVOGS can achieve the $\varepsilon$-duality gap within commun… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  11. arXiv:2403.16734  [pdf, other

    math.OC

    Anderson Acceleration Without Restart: A Novel Method with $n$-Step Super Quadratic Convergence Rate

    Authors: Haishan Ye, Dachao Lin, Xiangyu Chang, Zhihua Zhang

    Abstract: In this paper, we propose a novel Anderson's acceleration method to solve nonlinear equations, which does \emph{not} require a restart strategy to achieve numerical stability. We propose the greedy and random versions of our algorithm. Specifically, the greedy version selects the direction to maximize a certain measure of progress for approximating the current Jacobian matrix. In contrast, the ran… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  12. On $F$-Polynomials for Generalized Quantum Cluster Algebras and Gupta's Formula

    Authors: Changjian Fu, Liangang Peng, Huihui Ye

    Abstract: We show the polynomial property of $F$-polynomials for generalized quantum cluster algebras and obtain the associated separation formulas under a mild condition. Along the way, we obtain Gupta's formulas of $F$-polynomials for generalized quantum cluster algebras. These formulas specialize to Gupta's formulas for quantum cluster algebras and cluster algebras respectively. Finally, a generalization… ▽ More

    Submitted 3 September, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Journal ref: SIGMA 20 (2024), 080, 26 pages

  13. arXiv:2401.15549  [pdf, ps, other

    math.CO

    The Restricted Edge-Connectivity of Strong Product Graphs

    Authors: Hazhe Ye, Yingzhi Tian

    Abstract: The restricted edge-connectivity of a connected graph $G$, denoted by $λ^{\prime}(G)$, if it exists, is the minimum cardinality of a set of edges whose deletion makes $G$ disconnected and each component with at least 2 vertices. It was proved that if $G$ is not a star and $|V(G)|\geq4$, then $λ^{\prime}(G)$ exists and $λ^{\prime}(G)\leqξ(G)$, where $ξ(G)$ is the minimum edge-degree of $G$. Thus a… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  14. arXiv:2401.10060  [pdf, ps, other

    math.PR math.ST

    Poisson approximation for stochastic processes summed over amenable groups

    Authors: Haoyu Ye, Peter Orbanz, Morgane Austern

    Abstract: We generalize the Poisson limit theorem to binary functions of random objects whose law is invariant under the action of an amenable group. Examples include stationary random fields, exchangeable sequences, and exchangeable graphs. A celebrated result of E. Lindenstrauss shows that normalized sums over certain increasing subsets of such groups approximate expectations. Our results clarify that the… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  15. arXiv:2312.15845  [pdf, ps, other

    math.OC

    Optimal Decentralized Composite Optimization for Convex Functions

    Authors: Haishan Ye, Xiangyu Chang

    Abstract: In this paper, we focus on the decentralized composite optimization for convex functions. Because of advantages such as robust to the network and no communication bottle-neck in the central server, the decentralized optimization has attracted much research attention in signal processing, control, and optimization communities. Many optimal algorithms have been proposed for the objective function is… ▽ More

    Submitted 12 July, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  16. arXiv:2309.00829  [pdf, other

    math.CO

    Characterizing the forbidden pairs for graphs to be super-edge-connected

    Authors: Hazhe Ye, Yingzhi Tian

    Abstract: Let $\mathcal{H}$ be a set of given connected graphs. A graph $G$ is said to be $\mathcal{H}$-free if $G$ contains no $H$ as an induced subgraph for any $H\in \mathcal{H}$. The graph $G$ is super-edge-connected if each minimum edge-cut isolates a vertex in $G$. In this paper, except for some special graphs, we characterize all forbidden subgraph sets $\mathcal{H}$ such that every $\mathcal{H}$-fre… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  17. arXiv:2308.10547  [pdf, other

    math.OC cs.LG eess.SY

    Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold

    Authors: Jun Chen, Haishan Ye, Mengmeng Wang, Tianxin Huang, Guang Dai, Ivor W. Tsang, Yong Liu

    Abstract: The conjugate gradient method is a crucial first-order optimization method that generally converges faster than the steepest descent method, and its computational cost is much lower than that of second-order methods. However, while various types of conjugate gradient methods have been studied in Euclidean spaces and on Riemannian manifolds, there is little study for those in distributed scenarios.… ▽ More

    Submitted 12 March, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Journal ref: International Conference on Learning Representations, 2024

  18. arXiv:2308.00469  [pdf, ps, other

    cs.LG cs.NE math.OC

    Mirror Natural Evolution Strategies

    Authors: Haishan Ye

    Abstract: The zeroth-order optimization has been widely used in machine learning applications. However, the theoretical study of the zeroth-order optimization focus on the algorithms which approximate (first-order) gradients using (zeroth-order) function value difference at a random direction. The theory of algorithms which approximate the gradient and Hessian information by zeroth-order queries is much les… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.11490

  19. arXiv:2305.14781  [pdf, other

    math.OC eess.SY

    Accelerated Nonconvex ADMM with Self-Adaptive Penalty for Rank-Constrained Model Identification

    Authors: Qingyuan Liu, Zhengchao Huang, Hao Ye, Dexian Huang, Chao Shang

    Abstract: The alternating direction method of multipliers (ADMM) has been widely adopted in low-rank approximation and low-order model identification tasks; however, the performance of nonconvex ADMM is highly reliant on the choice of penalty parameter. To accelerate ADMM for solving rank-constrained identification problems, this paper proposes a new self-adaptive strategy for automatic penalty update. Guid… ▽ More

    Submitted 8 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 7 pages, 5 figures. Accepted by 62nd IEEE Conference on Decision and Control (CDC 2023)

  20. arXiv:2304.07504  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Distributed Optimization under Average Second-order Similarity: Algorithms and Analysis

    Authors: Dachao Lin, Yuze Han, Haishan Ye, Zhihua Zhang

    Abstract: We study finite-sum distributed optimization problems involving a master node and $n-1$ local nodes under the popular $δ$-similarity and $μ$-strong convexity conditions. We propose two new algorithms, SVRS and AccSVRS, motivated by previous works. The non-accelerated SVRS method combines the techniques of gradient sliding and variance reduction and achieves a better communication complexity of… ▽ More

    Submitted 30 October, 2023; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: Camera-ready version for NeurIPS 2023

  21. arXiv:2212.05273  [pdf, ps, other

    math.OC

    Snap-Shot Decentralized Stochastic Gradient Tracking Methods

    Authors: Haishan Ye, Xiangyu Chang

    Abstract: In decentralized optimization, $m$ agents form a network and only communicate with their neighbors, which gives advantages in data ownership, privacy, and scalability. At the same time, decentralized stochastic gradient descent (\texttt{SGD}) methods, as popular decentralized algorithms for training large-scale machine learning models, have shown their superiority over centralized counterparts. Di… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

  22. arXiv:2212.02387  [pdf, ps, other

    cs.LG math.OC

    An Efficient Stochastic Algorithm for Decentralized Nonconvex-Strongly-Concave Minimax Optimization

    Authors: Lesi Chen, Haishan Ye, Luo Luo

    Abstract: This paper studies the stochastic nonconvex-strongly-concave minimax optimization over a multi-agent network. We propose an efficient algorithm, called Decentralized Recursive gradient descEnt Ascent Method (DREAM), which achieves the best-known theoretical guarantee for finding the $ε$-stationary points. Concretely, it requires $\mathcal{O}(\min (κ^3ε^{-3},κ^2 \sqrt{N} ε^{-2} ))$ stochastic first… ▽ More

    Submitted 14 May, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

  23. arXiv:2211.11564  [pdf, other

    math.OC cs.AI

    Adaptive Constraint Partition based Optimization Framework for Large-scale Integer Linear Programming(Student Abstract)

    Authors: Huigen Ye, Hongyan Wang, Hua Xu, Chengming Wang, Yu Jiang

    Abstract: Integer programming problems (IPs) are challenging to be solved efficiently due to the NP-hardness, especially for large-scale IPs. To solve this type of IPs, Large neighborhood search (LNS) uses an initial feasible solution and iteratively improves it by searching a large neighborhood around the current solution. However, LNS easily steps into local optima and ignores the correlation between vari… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: To be published in AAAI2023 Student Abstract

  24. arXiv:2211.04874  [pdf, other

    math.ST stat.ML

    A Unified Analysis of Multi-task Functional Linear Regression Models with Manifold Constraint and Composite Quadratic Penalty

    Authors: Shiyuan He, Hanxuan Ye, Kejun He

    Abstract: This work studies the multi-task functional linear regression models where both the covariates and the unknown regression coefficients (called slope functions) are curves. For slope function estimation, we employ penalized splines to balance bias, variance, and computational complexity. The power of multi-task learning is brought in by imposing additional structures over the slope functions. We pr… ▽ More

    Submitted 31 July, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  25. arXiv:2210.13931  [pdf, other

    math.OC cs.LG

    On the Complexity of Decentralized Smooth Nonconvex Finite-Sum Optimization

    Authors: Luo Luo, Yunyan Bai, Lesi Chen, Yuxing Liu, Haishan Ye

    Abstract: We study the decentralized optimization problem $\min_{{\bf x}\in{\mathbb R}^d} f({\bf x})\triangleq \frac{1}{m}\sum_{i=1}^m f_i({\bf x})$, where the local function on the $i$-th agent has the form of $f_i({\bf x})\triangleq \frac{1}{n}\sum_{j=1}^n f_{i,j}({\bf x})$ and every individual $f_{i,j}$ is smooth but possibly nonconvex. We propose a stochastic algorithm called DEcentralized probAbilistic… ▽ More

    Submitted 11 January, 2025; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: A major revision which significantly improves the results by considering the global smoothness parameters and involving the content of PL condition in ICML paper

  26. arXiv:2204.01161  [pdf, other

    math.ST

    A Modern Theory for High-dimensional Cox Regression Models

    Authors: Xianyang Zhang, Huijuan Zhou, Hanxuan Ye

    Abstract: The proportional hazards model has been extensively used in many fields such as biomedicine to estimate and perform statistical significance testing on the effects of covariates influencing the survival time of patients. The classical theory of maximum partial-likelihood estimation (MPLE) is used by most software packages to produce inference, e.g., the coxph function in R and the PHREG procedure… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

    Comments: 30 pages, 7 figures

  27. arXiv:2202.00509  [pdf, ps, other

    math.OC cs.LG

    Decentralized Stochastic Variance Reduced Extragradient Method

    Authors: Luo Luo, Haishan Ye

    Abstract: This paper studies decentralized convex-concave minimax optimization problems of the form $\min_x\max_y f(x,y) \triangleq\frac{1}{m}\sum_{i=1}^m f_i(x,y)$, where $m$ is the number of agents and each local function can be written as $f_i(x,y)=\frac{1}{n}\sum_{j=1}^n f_{i,j}(x,y)$. We propose a novel decentralized optimization algorithm, called multi-consensus stochastic variance reduced extragradie… ▽ More

    Submitted 13 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

  28. arXiv:2110.14109  [pdf, other

    cs.LG math.OC

    Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums

    Authors: Rui Pan, Haishan Ye, Tong Zhang

    Abstract: Learning rate schedulers have been widely adopted in training deep neural networks. Despite their practical importance, there is a discrepancy between its practice and its theoretical analysis. For instance, it is not known what schedules of SGD achieve best convergence, even for simple problems such as optimizing quadratic objectives. In this paper, we propose Eigencurve, the first family of lear… ▽ More

    Submitted 14 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at ICLR 2022

  29. arXiv:2110.08572  [pdf, other

    math.NA math.OC

    Greedy and Random Broyden's Methods with Explicit Superlinear Convergence Rates in Nonlinear Equations

    Authors: Haishan Ye, Dachao Lin, Zhihua Zhang

    Abstract: In this paper, we propose the greedy and random Broyden's method for solving nonlinear equations. Specifically, the greedy method greedily selects the direction to maximize a certain measure of progress for approximating the current Jacobian matrix, while the random method randomly chooses a direction. We establish explicit (local) superlinear convergence rates of both methods if the initial point… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2109.01974

  30. arXiv:2109.01974  [pdf, other

    math.OC

    Explicit Superlinear Convergence Rates of Broyden's Methods in Nonlinear Equations

    Authors: Dachao Lin, Haishan Ye, Zhihua Zhang

    Abstract: In this paper, we study the explicit superlinear convergence rates of quasi-Newton methods. We particularly focus on the classical Broyden's method for solving nonlinear equations. We establish its explicit (local) superlinear convergence rate when the initial point is close enough to a solution and the initial Jacobian approximation is also close enough to the exact Jacobian related to the soluti… ▽ More

    Submitted 10 September, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: 31 pages

  31. arXiv:2107.09238  [pdf, other

    math.OC

    From Generalized Gauss Bounds to Distributionally Robust Fault Detection with Unimodality Information

    Authors: Chao Shang, Hao Ye, Dexian Huang, Steven X. Ding

    Abstract: Probabilistic methods have attracted much interest in fault detection design, but its need for complete distributional knowledge is seldomly fulfilled. This has spurred endeavors in distributionally robust fault detection (DRFD) design, which secures robustness against inexact distributions by using moment-based ambiguity sets as a prime modelling tool. However, with the worst-case distribution be… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

  32. arXiv:2105.07162  [pdf, other

    math.OC

    Explicit Superlinear Convergence Rates of The SR1 Algorithm

    Authors: Haishan Ye, Dachao Lin, Zhihua Zhang, Xiangyu Chang

    Abstract: We study the convergence rate of the famous Symmetric Rank-1 (SR1) algorithm which has wide applications in different scenarios. Although it has been extensively investigated, SR1 still lacks a non-asymptotic superlinear rate compared with other quasi-Newton methods such as DFP and BFGS. In this paper we address this problem. Inspired by the recent work on explicit convergence analysis of quasi-Ne… ▽ More

    Submitted 3 June, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

  33. arXiv:2104.08764  [pdf, other

    math.OC

    Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods

    Authors: Dachao Lin, Haishan Ye, Zhihua Zhang

    Abstract: Optimization is important in machine learning problems, and quasi-Newton methods have a reputation as the most efficient numerical schemes for smooth unconstrained optimization. In this paper, we consider the explicit superlinear convergence rates of quasi-Newton methods and address two open problems mentioned by Rodomanov and Nesterov. First, we extend Rodomanov and Nesterov's results to random q… ▽ More

    Submitted 10 September, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Fix some typos, 40 pages, 3 figures, final version

  34. Bounded weak and strong time periodic solutions to a three-dimensional chemotaxis-Stokes model with porous medium diffusion

    Authors: Hailong Ye, Chunhua Jin

    Abstract: In this paper, we study the time periodic problem to a three-dimensional chemotaxis-Stokes model with porous medium diffusion $Δn^m$ and inhomogeneous mixed boundary conditions. By using a double-level approximation method and some iterative techniques, we obtain the existence and time-space uniform boundedness of weak time periodic solutions for any $m>1$. Moreover, we improve the regularity for… ▽ More

    Submitted 12 April, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    MSC Class: 92C17; 35B10; 35M10

  35. arXiv:2102.04937  [pdf, ps, other

    math.PR stat.AP

    Stationary Distribution Convergence of the Offered Waiting Processes in Heavy Traffic under General Patience Time Scaling

    Authors: Chihoon Lee, Amy R. Ward, Heng-Qing Ye

    Abstract: We study a sequence of single server queues with customer abandonment (GI/GI/1+GI) under heavy traffic. The patience time distributions vary with the sequence, which allows for a wider scope of applications. It is known ([20, 18]) that the sequence of scaled offered waiting time processes converges weakly to a reflecting diffusion process with non-linear drift, as the traffic intensity approaches… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

  36. arXiv:2102.03990  [pdf, other

    cs.LG math.OC

    DeEPCA: Decentralized Exact PCA with Linear Convergence Rate

    Authors: Haishan Ye, Tong Zhang

    Abstract: Due to the rapid growth of smart agents such as weakly connected computational nodes and sensors, developing decentralized algorithms that can perform computations on local agents becomes a major research direction. This paper considers the problem of decentralized Principal components analysis (PCA), which is a statistical method widely used for data analysis. We introduce a technique called subs… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  37. arXiv:2012.15010  [pdf, ps, other

    math.OC cs.AI

    PMGT-VR: A decentralized proximal-gradient algorithmic framework with variance reduction

    Authors: Haishan Ye, Wei Xiong, Tong Zhang

    Abstract: This paper considers the decentralized composite optimization problem. We propose a novel decentralized variance-reduction proximal-gradient algorithmic framework, called PMGT-VR, which is based on a combination of several techniques including multi-consensus, gradient tracking, and variance reduction. The proposed framework relies on an imitation of centralized algorithms and we demonstrate that… ▽ More

    Submitted 5 June, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 16 pages, 4 figures

  38. arXiv:2007.05670  [pdf, other

    stat.ML cs.LG math.ST

    An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization

    Authors: Yimin Huang, Yujun Li, Hanrong Ye, Zhenguo Li, Zhihua Zhang

    Abstract: The evaluation of hyperparameters, neural architectures, or data augmentation policies becomes a critical model selection problem in advanced deep learning with a large hyperparameter search space. In this paper, we propose an efficient and robust bandit-based algorithm called Sub-Sampling (SS) in the scenario of hyperparameter search evaluation. It evaluates the potential of hyperparameters by th… ▽ More

    Submitted 16 December, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

  39. arXiv:2005.00797  [pdf, ps, other

    cs.LG math.OC stat.ML

    Multi-consensus Decentralized Accelerated Gradient Descent

    Authors: Haishan Ye, Luo Luo, Ziang Zhou, Tong Zhang

    Abstract: This paper considers the decentralized convex optimization problem, which has a wide range of applications in large-scale machine learning, sensor networks, and control theory. We propose novel algorithms that achieve optimal computation complexity and near optimal communication complexity. Our theoretical results give affirmative answers to the open problem on whether there exists an algorithm th… ▽ More

    Submitted 10 October, 2023; v1 submitted 2 May, 2020; originally announced May 2020.

  40. arXiv:2001.03724  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems

    Authors: Luo Luo, Haishan Ye, Zhichao Huang, Tong Zhang

    Abstract: We consider nonconvex-concave minimax optimization problems of the form $\min_{\bf x}\max_{\bf y\in{\mathcal Y}} f({\bf x},{\bf y})$, where $f$ is strongly-concave in $\bf y$ but possibly nonconvex in $\bf x$ and ${\mathcal Y}$ is a convex and compact set. We focus on the stochastic setting, where we can only access an unbiased stochastic gradient estimate of $f$ at each iteration. This formulatio… ▽ More

    Submitted 23 October, 2020; v1 submitted 11 January, 2020; originally announced January 2020.

  41. arXiv:1911.02458  [pdf, other

    math.DS math.NT

    Common preperiodic points for quadratic polynomials

    Authors: Laura DeMarco, Holly Krieger, Hexi Ye

    Abstract: Let $f_c(z) = z^2+c$ for $c \in \mathbb{C}$. We show there exists a uniform bound on the number of points in $\mathbb{P}^1(\mathbb{C})$ that can be preperiodic for both $f_{c_1}$ and $f_{c_2}$ with $c_1\not= c_2$ in $\mathbb{C}$. The proof combines arithmetic ingredients with complex-analytic; we estimate an adelic energy pairing when the parameters lie in $\bar{\mathbb{Q}}$, building on the quant… ▽ More

    Submitted 28 November, 2021; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: Many minor corrections made for v2, particularly in section 6, and quantitative constants corrected

  42. arXiv:1910.11490  [pdf, other

    math.OC cs.LG

    Mirror Natural Evolution Strategies

    Authors: Haishan Ye, Tong Zhang

    Abstract: Evolution Strategies such as CMA-ES (covariance matrix adaptation evolution strategy) and NES (natural evolution strategy) have been widely used in machine learning applications, where an objective function is optimized without using its derivatives. However, the convergence behaviors of these algorithms have not been carefully studied. In particular, there is no rigorous analysis for the converge… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

  43. Benders' decomposition of the unit commitment problem with semidefinite relaxation of AC power flow constraints

    Authors: M. Paredes, L. S. A. Martins, S. Soares, Hongxing Ye

    Abstract: In this paper we present a formulation of the unit commitment problem with AC power flow constraints. It is solved by a Benders decomposition in which the unit commitment master problem is formulated as a mixed-integer problem with linearization of the power generation constraints for improved convergence. Semidefinite programming relaxation of the rectangular AC optimal power flow is used in the… ▽ More

    Submitted 23 November, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: Accepted for publication in the Electric Power Systems Research journal on November 11, 2020

  44. arXiv:1901.09945  [pdf, ps, other

    math.NT math.DS

    Uniform Manin-Mumford for a family of genus 2 curves

    Authors: Laura DeMarco, Holly Krieger, Hexi Ye

    Abstract: We introduce a general strategy for proving quantitative and uniform bounds on the number of common points of height zero for a pair of inequivalent height functions on $\mathbb{P}^1(\overline{\mathbb{Q}}).$ We apply this strategy to prove a conjecture of Bogomolov, Fu, and Tschinkel asserting uniform bounds on the number of common torsion points of elliptic curves in the case of two Legendre curv… ▽ More

    Submitted 2 December, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

    Comments: v2 incorporates minor changes suggested by referees. Final version, to appear in Annals of Math

  45. A convergence result on the second boundary value problem for parabolic equations

    Authors: R. L. Huang, Y. H. Ye

    Abstract: We establish a Schn$\ddot{\text{u}}$rer's convergence result and then apply it to obtain the existence of solutions on the second boundary value problem for a family of special Lagrangian equations

    Submitted 1 June, 2018; originally announced June 2018.

    Journal ref: Pacific J. Math. 310 (2021) 159-179

  46. arXiv:1705.04873  [pdf, ps, other

    math.NT math.AG math.DS

    The Dynamical Manin-Mumford Conjecture and the Dynamical Bogomolov Conjecture for endomorphisms of (P^1)^n

    Authors: Dragos Ghioca, Khoa D. Nguyen, Hexi Ye

    Abstract: We prove Zhang's Dynamical Manin-Mumford Conjecture and Dynamical Bogomolov Conjecture for dominant endomorphisms of (P^1)^n. We use the equidistribution theorem for points of small height with respect to an algebraic dynamical system, combined with an analysis of the symmetries of the Julia set for a rational function.

    Submitted 13 May, 2017; originally announced May 2017.

    Journal ref: Compositio Math. 154 (2018) 1441-1472

  47. arXiv:1705.01331  [pdf, ps, other

    math.AP

    The sharp existence of constrained minimizers for the $L^2$-critical Schrödinger-Poisson system and Schrödinger equations

    Authors: Hongyu Ye

    Abstract: In this paper, we study the existence of minimizers for a class of constrained minimization problems derived from the Schrödinger-Poisson equations: $$-Δu+V(x)u+(|x|^{-1}*u^2)u-|u|^\frac{4}{3}u=λu,~~x\in\R^3$$ on the $L^2$-spheres $\widetilde{S}(c)=\{u\in H^1(\R^3)|~\int_{\R^3}V(x)u^2dx<+\infty,~|u|_2^2=c>0\}$. If $V(x)\equiv0$, then by a different method from Jeanjean and Luo [Z. Angrew. Math. Ph… ▽ More

    Submitted 3 May, 2017; originally announced May 2017.

  48. arXiv:1703.05365  [pdf, ps, other

    math.NT math.AG math.DS

    Bounded height in families of dynamical systems

    Authors: Laura DeMarco, Dragos Ghioca, Holly Krieger, Khoa D. Nguyen, Thomas J. Tucker, Hexi Ye

    Abstract: Let a and b be algebraic numbers such that exactly one of a and b is an algebraic integer, and let f_t(z):=z^2+t be a family of polynomials parametrized by t. We prove that the set of all algebraic numbers t for which there exist positive integers m and n such that f_t^m(a)=f_t^n(b) has bounded Weil height. This is a special case of a more general result supporting a new bounded height conjecture… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

  49. arXiv:1702.08124  [pdf, ps, other

    math.NA

    Approximate Newton Methods

    Authors: Haishan Ye, Luo Luo, Zhihua Zhang

    Abstract: Many machine learning models involve solving optimization problems. Thus, it is important to deal with a large-scale optimization problem in big data applications. Recently, subsampled Newton methods have emerged to attract much attention due to their efficiency at each iteration, rectified a weakness in the ordinary Newton method of suffering a high cost in each iteration while commanding a high… ▽ More

    Submitted 21 March, 2020; v1 submitted 26 February, 2017; originally announced February 2017.

  50. arXiv:1612.00226  [pdf, other

    math.OC

    Robust Coordinated Transmission and Generation Expansion Planning Considering Ramping Requirements and Construction Periods

    Authors: Jia Li, Zuyi Li, Feng Liu, Hongxing Ye, Xuemin Zhang, Shengwei Mei, Naichao Chang

    Abstract: Two critical issues have arisen in transmission expansion planning with the rapid growth of wind power generation. First, severe power ramping events in daily operation due to the high variability of wind power generation pose great challenges to multi-year planning decision making. Second, the long construction periods of transmission lines may not be able to keep pace with the fast growing uncer… ▽ More

    Submitted 1 December, 2016; originally announced December 2016.