Search | arXiv e-print repository

arXiv:2508.20897 [pdf, ps, other]

Enhancing Quadratic Programming Solvers via Quadratic Nonconvex Reformulation

Authors: Cheng Lu, Yu Fei, Gaojian Kang, Guangai Qu, Zhibin Deng, Qingwei Jin, Shu-Cherng Fang

Abstract: In this paper, we consider solving nonconvex quadratic programming problems using modern solvers such as Gurobi and SCIP. It is well-known that the classical techniques of quadratic convex reformulation can improve the computational efficiency of global solvers for mixed-integer quadratic optimization problems. In contrast, the use of quadratic nonconvex reformulation (QNR) has not been previously… ▽ More In this paper, we consider solving nonconvex quadratic programming problems using modern solvers such as Gurobi and SCIP. It is well-known that the classical techniques of quadratic convex reformulation can improve the computational efficiency of global solvers for mixed-integer quadratic optimization problems. In contrast, the use of quadratic nonconvex reformulation (QNR) has not been previously explored. This paper introduces a QNR framework--an unconventional yet highly effective approach for improving the performance of state-of-the-art quadratic programming solvers such as Gurobi and SCIP. Our computational experiments on diverse nonconvex quadratic programming problem instances demonstrate that QNR can substantially accelerate both Gurobi and SCIP. Notably, with QNR, Gurobi achieves state-of-the-art performance on several benchmark and randomly generated instances. △ Less

Submitted 28 August, 2025; originally announced August 2025.

arXiv:2505.02685 [pdf, ps, other]

On the Spectral Expansion of Monotone Subsets of the Hypercube

Authors: Yumou Fei, Renato Ferreira Pinto Jr

Abstract: We study the spectral gap of subgraphs of the hypercube induced by monotone subsets of vertices. For a monotone subset $A\subseteq\{0,1\}^{n}$ of density $μ(A)$, the previous best lower bound on the spectral gap, due to Cohen, was $γ\gtrsim μ(A)/n^{2}$, improving upon the earlier bound $γ\gtrsim μ(A)^{2}/n^{2}$ established by Ding and Mossel. In this paper, we prove the optimal lower bound… ▽ More We study the spectral gap of subgraphs of the hypercube induced by monotone subsets of vertices. For a monotone subset $A\subseteq\{0,1\}^{n}$ of density $μ(A)$, the previous best lower bound on the spectral gap, due to Cohen, was $γ\gtrsim μ(A)/n^{2}$, improving upon the earlier bound $γ\gtrsim μ(A)^{2}/n^{2}$ established by Ding and Mossel. In this paper, we prove the optimal lower bound $γ\gtrsim μ(A)/n$. As a corollary, we improve the mixing time upper bound of the random walk on constant-density monotone sets from $O(n^{3})$, as shown by Ding and Mossel, to $O(n^{2})$. Along the way, we develop two new inequalities that may be of independent interest: (1)~a directed $L^{2}$-Poincaré inequality on the hypercube, and (2)~an ``approximate'' FKG inequality for monotone sets. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2311.05340 [pdf, ps, other]

Characterizing positroid quotients of uniform matroids

Authors: Zhixing Chen, Yumou Fei, Jiyang Gao, Yuxuan Sun, Yuchong Zhang

Abstract: We study two-step flag positroids $(P_1, P_2)$, where $P_1$ is a quotient of $P_{2}$. We provide a complete characterization of all two-step flag positroids that contain a uniform matroid, extending and completing a partial result by Benedetti, Chávez, and Jiménez. To contrast general positroids with the special case of lattice path matroids, we show that the containment relations of Grassmann nec… ▽ More We study two-step flag positroids $(P_1, P_2)$, where $P_1$ is a quotient of $P_{2}$. We provide a complete characterization of all two-step flag positroids that contain a uniform matroid, extending and completing a partial result by Benedetti, Chávez, and Jiménez. To contrast general positroids with the special case of lattice path matroids, we show that the containment relations of Grassmann necklaces and conecklaces fully characterize flag lattice path matroids, but are insufficient for general flag positroids. Additionally, we prove that the decorated permutations of any elementary quotient pair are related by a cyclic shift, resolving a conjecture of Benedetti, Chávez and Jiménez. △ Less

Submitted 4 April, 2025; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: 19 pages

MSC Class: 05B35

arXiv:2310.10441 [pdf, ps, other]

Efficiently matching random inhomogeneous graphs via degree profiles

Authors: Jian Ding, Yumou Fei, Yuanzheng Wang

Abstract: In this paper, we study the problem of recovering the latent vertex correspondence between two correlated random graphs with vastly inhomogeneous and unknown edge probabilities between different pairs of vertices. Inspired by and extending the matching algorithm via degree profiles by Ding, Ma, Wu and Xu (2021), we obtain an efficient matching algorithm as long as the minimal average degree is at… ▽ More In this paper, we study the problem of recovering the latent vertex correspondence between two correlated random graphs with vastly inhomogeneous and unknown edge probabilities between different pairs of vertices. Inspired by and extending the matching algorithm via degree profiles by Ding, Ma, Wu and Xu (2021), we obtain an efficient matching algorithm as long as the minimal average degree is at least $Ω(\log^{2} n)$ and the minimal correlation is at least $1 - O(\log^{-2} n)$. △ Less

Submitted 16 August, 2025; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: real data experiments added in the second version

arXiv:2203.03110 [pdf, ps, other]

Cascaded Gaps: Towards Gap-Dependent Regret for Risk-Sensitive Reinforcement Learning

Authors: Yingjie Fei, Ruitu Xu

Abstract: In this paper, we study gap-dependent regret guarantees for risk-sensitive reinforcement learning based on the entropic risk measure. We propose a novel definition of sub-optimality gaps, which we call cascaded gaps, and we discuss their key components that adapt to the underlying structures of the problem. Based on the cascaded gaps, we derive non-asymptotic and logarithmic regret bounds for two… ▽ More In this paper, we study gap-dependent regret guarantees for risk-sensitive reinforcement learning based on the entropic risk measure. We propose a novel definition of sub-optimality gaps, which we call cascaded gaps, and we discuss their key components that adapt to the underlying structures of the problem. Based on the cascaded gaps, we derive non-asymptotic and logarithmic regret bounds for two model-free algorithms under episodic Markov decision processes. We show that, in appropriate settings, these bounds feature exponential improvement over existing ones that are independent of gaps. We also prove gap-dependent lower bounds, which certify the near optimality of the upper bounds. △ Less

Submitted 6 March, 2022; originally announced March 2022.

arXiv:2111.03947 [pdf, other]

Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

Authors: Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang

Abstract: We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure. Although existing works have established non-asymptotic regret guarantees for this problem, they leave open an exponential gap between the upper and lower bounds. We identify the deficiencies in existing algorithms and their analysis that result in such a gap. To remedy these deficiencies, we investigate a simp… ▽ More We study risk-sensitive reinforcement learning (RL) based on the entropic risk measure. Although existing works have established non-asymptotic regret guarantees for this problem, they leave open an exponential gap between the upper and lower bounds. We identify the deficiencies in existing algorithms and their analysis that result in such a gap. To remedy these deficiencies, we investigate a simple transformation of the risk-sensitive Bellman equations, which we call the exponential Bellman equation. The exponential Bellman equation inspires us to develop a novel analysis of Bellman backup procedures in risk-sensitive RL algorithms, and further motivates the design of a novel exploration mechanism. We show that these analytic and algorithmic innovations together lead to improved regret upper bounds over existing ones. △ Less

Submitted 6 November, 2021; originally announced November 2021.

arXiv:2007.00148 [pdf, ps, other]

Dynamic Regret of Policy Optimization in Non-stationary Environments

Authors: Yingjie Fei, Zhuoran Yang, Zhaoran Wang, Qiaomin Xie

Abstract: We consider reinforcement learning (RL) in episodic MDPs with adversarial full-information reward feedback and unknown fixed transition kernels. We propose two model-free policy optimization algorithms, POWER and POWER++, and establish guarantees for their dynamic regret. Compared with the classical notion of static regret, dynamic regret is a stronger notion as it explicitly accounts for the non-… ▽ More We consider reinforcement learning (RL) in episodic MDPs with adversarial full-information reward feedback and unknown fixed transition kernels. We propose two model-free policy optimization algorithms, POWER and POWER++, and establish guarantees for their dynamic regret. Compared with the classical notion of static regret, dynamic regret is a stronger notion as it explicitly accounts for the non-stationarity of environments. The dynamic regret attained by the proposed algorithms interpolates between different regimes of non-stationarity, and moreover satisfies a notion of adaptive (near-)optimality, in the sense that it matches the (near-)optimal static regret under slow-changing environments. The dynamic regret bound features two components, one arising from exploration, which deals with the uncertainty of transition kernels, and the other arising from adaptation, which deals with non-stationary environments. Specifically, we show that POWER++ improves over POWER on the second component of the dynamic regret by actively adapting to non-stationarity through prediction. To the best of our knowledge, our work is the first dynamic regret analysis of model-free RL algorithms in non-stationary environments. △ Less

Submitted 30 June, 2020; originally announced July 2020.

arXiv:2006.13827 [pdf, other]

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

Authors: Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie

Abstract: We study risk-sensitive reinforcement learning in episodic Markov decision processes with unknown transition kernels, where the goal is to optimize the total reward under the risk measure of exponential utility. We propose two provably efficient model-free algorithms, Risk-Sensitive Value Iteration (RSVI) and Risk-Sensitive Q-learning (RSQ). These algorithms implement a form of risk-sensitive opti… ▽ More We study risk-sensitive reinforcement learning in episodic Markov decision processes with unknown transition kernels, where the goal is to optimize the total reward under the risk measure of exponential utility. We propose two provably efficient model-free algorithms, Risk-Sensitive Value Iteration (RSVI) and Risk-Sensitive Q-learning (RSQ). These algorithms implement a form of risk-sensitive optimism in the face of uncertainty, which adapts to both risk-seeking and risk-averse modes of exploration. We prove that RSVI attains an $\tilde{O}\big(λ(|β| H^2) \cdot \sqrt{H^{3} S^{2}AT} \big)$ regret, while RSQ attains an $\tilde{O}\big(λ(|β| H^2) \cdot \sqrt{H^{4} SAT} \big)$ regret, where $λ(u) = (e^{3u}-1)/u$ for $u>0$. In the above, $β$ is the risk parameter of the exponential utility function, $S$ the number of states, $A$ the number of actions, $T$ the total number of timesteps, and $H$ the episode length. On the flip side, we establish a regret lower bound showing that the exponential dependence on $|β|$ and $H$ is unavoidable for any algorithm with an $\tilde{O}(\sqrt{T})$ regret (even when the risk objective is on the same scale as the original reward), thus certifying the near-optimality of the proposed algorithms. Our results demonstrate that incorporating risk awareness into reinforcement learning necessitates an exponential cost in $|β|$ and $H$, which quantifies the fundamental tradeoff between risk sensitivity (related to aleatoric uncertainty) and sample efficiency (related to epistemic uncertainty). To the best of our knowledge, this is the first regret analysis of risk-sensitive reinforcement learning with the exponential utility. △ Less

Submitted 22 June, 2020; originally announced June 2020.

arXiv:2006.01719 [pdf, other]

Spectral Frank-Wolfe Algorithm: Strict Complementarity and Linear Convergence

Authors: Lijun Ding, Yingjie Fei, Qiantong Xu, Chengrun Yang

Abstract: We develop a novel variant of the classical Frank-Wolfe algorithm, which we call spectral Frank-Wolfe, for convex optimization over a spectrahedron. The spectral Frank-Wolfe algorithm has a novel ingredient: it computes a few eigenvectors of the gradient and solves a small-scale SDP in each iteration. Such procedure overcomes slow convergence of the classical Frank-Wolfe algorithm due to ignoring… ▽ More We develop a novel variant of the classical Frank-Wolfe algorithm, which we call spectral Frank-Wolfe, for convex optimization over a spectrahedron. The spectral Frank-Wolfe algorithm has a novel ingredient: it computes a few eigenvectors of the gradient and solves a small-scale SDP in each iteration. Such procedure overcomes slow convergence of the classical Frank-Wolfe algorithm due to ignoring eigenvalue coalescence. We demonstrate that strict complementarity of the optimization problem is key to proving linear convergence of various algorithms, such as the spectral Frank-Wolfe algorithm as well as the projected gradient method and its accelerated version. △ Less

Submitted 17 August, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

Comments: The main text has nine pages. Reading the first three sections should give a good understanding of the paper and should take around 15 minutes. Published in proceedings of the 37th International Conference on Machine Learning, online, PMLR 119, 2020

arXiv:1904.09635 [pdf, other]

Achieving the Bayes Error Rate in Synchronization and Block Models by SDP, Robustly

Authors: Yingjie Fei, Yudong Chen

Abstract: We study the statistical performance of semidefinite programming (SDP) relaxations for clustering under random graph models. Under the $\mathbb{Z}_{2}$ Synchronization model, Censored Block Model and Stochastic Block Model, we show that SDP achieves an error rate of the form \[ \exp\Big[-\big(1-o(1)\big)\bar{n} I^* \Big]. \] Here $\bar{n}$ is an appropriate multiple of the number of nodes and… ▽ More We study the statistical performance of semidefinite programming (SDP) relaxations for clustering under random graph models. Under the $\mathbb{Z}_{2}$ Synchronization model, Censored Block Model and Stochastic Block Model, we show that SDP achieves an error rate of the form \[ \exp\Big[-\big(1-o(1)\big)\bar{n} I^* \Big]. \] Here $\bar{n}$ is an appropriate multiple of the number of nodes and $I^*$ is an information-theoretic measure of the signal-to-noise ratio. We provide matching lower bounds on the Bayes error for each model and therefore demonstrate that the SDP approach is Bayes optimal. As a corollary, our results imply that SDP achieves the optimal exact recovery threshold under each model. Furthermore, we show that SDP is robust: the above bound remains valid under semirandom versions of the models in which the observed graph is modified by a monotone adversary. Our proof is based on a novel primal-dual analysis of SDP under a unified framework for all three models, and the analysis shows that SDP tightly approximates a joint majority voting procedure. △ Less

Submitted 21 April, 2019; originally announced April 2019.

Comments: Partial preliminary results to appear in the Conference on Learning Theory (COLT) 2019

arXiv:1803.06510 [pdf, other]

Hidden Integrality and Semi-random Robustness of SDP Relaxation for Sub-Gaussian Mixture Model

Authors: Yingjie Fei, Yudong Chen

Abstract: We consider the problem of estimating the discrete clustering structures under the Sub-Gaussian Mixture Model. Our main results establish a hidden integrality property of a semidefinite programming (SDP) relaxation for this problem: while the optimal solution to the SDP is not integer-valued in general, its estimation error can be upper bounded by that of an idealized integer program. The error of… ▽ More We consider the problem of estimating the discrete clustering structures under the Sub-Gaussian Mixture Model. Our main results establish a hidden integrality property of a semidefinite programming (SDP) relaxation for this problem: while the optimal solution to the SDP is not integer-valued in general, its estimation error can be upper bounded by that of an idealized integer program. The error of the integer program, and hence that of the SDP, are further shown to decay exponentially in the signal-to-noise ratio. In addition, we show that the SDP relaxation is robust under the semi-random setting in which an adversary can modify the data generated from the mixture model. In particular, we generalize the hidden integrality property to the semi-random model and thereby show that SDP achieves the optimal error bound in this setting. These results together highlight the "global-to-local" mechanism that drives the performance of the SDP relaxation. To the best of our knowledge, our result is the first exponentially decaying error bound for convex relaxations of mixture models. A corollary of our results shows that in certain regimes the SDP solutions are in fact integral and exact. More generally, our results establish sufficient conditions for the SDP to correctly recover the cluster memberships of $(1-δ)$ fraction of the points for any $δ\in(0,1)$. As a special case, we show that under the $d$-dimensional Stochastic Ball Model, SDP achieves non-trivial (sometimes exact) recovery when the center separation is as small as $\sqrt{1/d}$, which improves upon previous exact recovery results that require constant separation. △ Less

Submitted 4 October, 2021; v1 submitted 17 March, 2018; originally announced March 2018.

Comments: To appear in Mathematics of Operations Research; added results on semi-random robustness

arXiv:1705.08391 [pdf, ps, other]

Exponential error rates of SDP for block models: Beyond Grothendieck's inequality

Authors: Yingjie Fei, Yudong Chen

Abstract: In this paper we consider the cluster estimation problem under the Stochastic Block Model. We show that the semidefinite programming (SDP) formulation for this problem achieves an error rate that decays exponentially in the signal-to-noise ratio. The error bound implies weak recovery in the sparse graph regime with bounded expected degrees, as well as exact recovery in the dense regime. An immedia… ▽ More In this paper we consider the cluster estimation problem under the Stochastic Block Model. We show that the semidefinite programming (SDP) formulation for this problem achieves an error rate that decays exponentially in the signal-to-noise ratio. The error bound implies weak recovery in the sparse graph regime with bounded expected degrees, as well as exact recovery in the dense regime. An immediate corollary of our results yields error bounds under the Censored Block Model. Moreover, these error bounds are robust, continuing to hold under heterogeneous edge probabilities and a form of the so-called monotone attack. Significantly, this error rate is achieved by the SDP solution itself without any further pre- or post-processing, and improves upon existing polynomially-decaying error bounds proved using the Grothendieck\textquoteright s inequality. Our analysis has two key ingredients: (i) showing that the graph has a well-behaved spectrum, even in the sparse regime, after discounting an exponentially small number of edges, and (ii) an order-statistics argument that governs the final error rate. Both arguments highlight the implicit regularization effect of the SDP formulation. △ Less

Submitted 23 May, 2017; originally announced May 2017.

Showing 1–12 of 12 results for author: Fei, Y