-
On the Degree Automatability of Sum-of-Squares Proofs
Authors:
Alex Bortolotti,
Monaldo Mastrolilli,
Luis Felipe Vargas
Abstract:
The Sum-of-Squares (SoS) hierarchy, also known as Lasserre hierarchy, has emerged as a promising tool in optimization. However, it remains unclear whether fixed-degree SoS proofs can be automated [O'Donnell (2017)]. Indeed, there are examples of polynomial systems with bounded coefficients that admit low-degree SoS proofs, but these proofs necessarily involve numbers with an exponential number of…
▽ More
The Sum-of-Squares (SoS) hierarchy, also known as Lasserre hierarchy, has emerged as a promising tool in optimization. However, it remains unclear whether fixed-degree SoS proofs can be automated [O'Donnell (2017)]. Indeed, there are examples of polynomial systems with bounded coefficients that admit low-degree SoS proofs, but these proofs necessarily involve numbers with an exponential number of bits, implying that low-degree SoS proofs cannot always be found efficiently.
A sufficient condition derived from the Nullstellensatz proof system [Raghavendra and Weitz (2017)] identifies cases where bit complexity issues can be circumvented. One of the main problems left open by Raghavendra and Weitz is proving any result for refutations, as their condition applies only to polynomial systems with a large set of solutions.
In this work, we broaden the class of polynomial systems for which degree-$d$ SoS proofs can be automated. To achieve this, we develop a new criterion and we demonstrate how our criterion applies to polynomial systems beyond the scope of Raghavendra and Weitz's result. In particular, we establish a separation for instances arising from Constraint Satisfaction Problems (CSPs). Moreover, our result extends to refutations, establishing that polynomial-time refutation is possible for broad classes of polynomial time solvable constraint problems, highlighting a first advancement in this area.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Computational complexity of sum-of-squares bounds for copositive programs
Authors:
Marilena Palomba,
Lucas Slot,
Luis Felipe Vargas,
Monaldo Mastrolilli
Abstract:
In recent years, copositive programming has received significant attention for its ability to model hard problems in both discrete and continuous optimization. Several relaxations of copositive programs based on semidefinite programming (SDP) have been proposed in the literature, meant to provide tractable bounds. However, while these SDP-based relaxations are amenable to the ellipsoid algorithm a…
▽ More
In recent years, copositive programming has received significant attention for its ability to model hard problems in both discrete and continuous optimization. Several relaxations of copositive programs based on semidefinite programming (SDP) have been proposed in the literature, meant to provide tractable bounds. However, while these SDP-based relaxations are amenable to the ellipsoid algorithm and interior point methods, it is not immediately obvious that they can be solved in polynomial time (even approximately). In this paper, we consider the sum-of-squares (SOS) hierarchies of relaxations for copositive programs introduced by Parrilo (2000), de Klerk & Pasechnik (2002) and Peña, Vera & Zuluaga (2006), which can be formulated as SDPs. We establish sufficient conditions that guarantee the polynomial-time computability (up to fixed precision) of these relaxations. These conditions are satisfied by copositive programs that represent standard quadratic programs and their reciprocals. As an application, we show that the SOS bounds for the (weighted) stability number of a graph can be computed efficiently. Additionally, we provide pathological examples of copositive programs (that do not satisfy the sufficient conditions) whose SOS relaxations admit only feasible solutions of doubly-exponential size.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
On the hardness of deciding the finite convergence of Lasserre hierarchies
Authors:
Luis Felipe Vargas
Abstract:
A polynomial optimization problem (POP) asks for minimizing a polynomial function given a finite set of polynomial constraints (equations and inequalities). This problem is well-known to be hard in general, as it encodes many hard combinatorial problems. The Lasserre hierarchy is a sequence of semidefinite relaxations for solving (POP). Under the standard archimedean condition, this hierarchy is g…
▽ More
A polynomial optimization problem (POP) asks for minimizing a polynomial function given a finite set of polynomial constraints (equations and inequalities). This problem is well-known to be hard in general, as it encodes many hard combinatorial problems. The Lasserre hierarchy is a sequence of semidefinite relaxations for solving (POP). Under the standard archimedean condition, this hierarchy is guaranteed to converge asymptotically to the optimal value of (POP) (Lasserre, 2001) and, moreover, finite convergence holds generically (Nie, 2012). In this paper, we aim to investigate whether there is an efficient algorithmic procedure to decide whether the Lasserre hierarchy of (POP) has finite convergence. We show that unless P=NP there cannot exist such an algorithmic procedure that runs in polynomial time. We show this already for the standard quadratic programs. Our approach relies on characterizing when finite convergence holds for the so-called Motzkin-Straus formulation (and some variations of it) for the stability number of a graph.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Sum-of-squares certificates for copositivity via test states
Authors:
Markus Schweighofer,
Luis Felipe Vargas
Abstract:
In 1995, Reznick showed an important variant of the obvious fact that any positive semidefinite (real) quadratic form is a sum of squares of linear forms: If a form (of arbitrary even degree) is positive definite then it becomes a sum of squares of forms after being multiplied by a sufficiently high power of the sum of its squared variables. If the form is just positive \emph{semi}definite instead…
▽ More
In 1995, Reznick showed an important variant of the obvious fact that any positive semidefinite (real) quadratic form is a sum of squares of linear forms: If a form (of arbitrary even degree) is positive definite then it becomes a sum of squares of forms after being multiplied by a sufficiently high power of the sum of its squared variables. If the form is just positive \emph{semi}definite instead of positive definite, this fails badly in general. In this work, we identify however two classes of positive semidefinite even quartic forms for which the statement continues to hold even though they have in general infinitely many projective real zeros. The first class consists of all even quartic positive semidefinite forms in five variables. This provides a natural certificate for a matrix of size five being copositive and answers positively a question asked by Laurent and the second author in 2022. The second class consists of certain quartic positive semidefinite forms that arise from graphs and their stability number. This shows finite convergence of a hierarchy of semidefinite approximations for the stability number of a graph proposed by de Klerk and Pasechnik in 2002. In both cases, the main tool for the proofs is the method of pure states on ideals developed by Burgdorf, Scheiderer and the first author in 2012. We hope to make this method more accessible by introducing the notion of a \emph{test state}.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
The limiting behavior of solutions to p-Laplacian problems with convection and exponential terms
Authors:
Anderson L. A. de Araujo,
Grey Ercole,
Julio C. Lanazca Vargas
Abstract:
We consider, for $a,l\geq1,$ $b,s,α>0,$ and $p>q\geq1,$ the homogeneous Dirichlet problem for the equation $-Δ_{p}u=λu^{q-1}+βu^{a-1}\left\vert \nabla u\right\vert ^{b}+mu^{l-1}e^{αu^{s}}$ in a smooth bounded domain $Ω\subset\mathbb{R}^{N}.$ We prove that under certain setting of the parameters $λ,$ $β$ and $m$ the problem admits at least one positive solution. Using this result we prove that if…
▽ More
We consider, for $a,l\geq1,$ $b,s,α>0,$ and $p>q\geq1,$ the homogeneous Dirichlet problem for the equation $-Δ_{p}u=λu^{q-1}+βu^{a-1}\left\vert \nabla u\right\vert ^{b}+mu^{l-1}e^{αu^{s}}$ in a smooth bounded domain $Ω\subset\mathbb{R}^{N}.$ We prove that under certain setting of the parameters $λ,$ $β$ and $m$ the problem admits at least one positive solution. Using this result we prove that if $λ,β>0$ are arbitrarily fixed and $m$ is sufficiently small, then the problem has a positive solution $u_{p},$ for all $p$ sufficiently large. In addition, we show that $u_{p}$ converges uniformly to the distance function to the boundary of $Ω,$ as $p\rightarrow\infty.$ This convergence result is new for nonlinearities involving a convection term.
△ Less
Submitted 2 May, 2023; v1 submitted 28 February, 2023;
originally announced March 2023.
-
Semidefinite approximations for bicliques and biindependent pairs
Authors:
Monique Laurent,
Sven Polak,
Luis Felipe Vargas
Abstract:
We investigate some graph parameters dealing with biindependent pairs $(A,B)$ in a bipartite graph $G=(V_1\cup V_2,E)$, i.e., pairs $(A,B)$ where $A\subseteq V_1$, $B\subseteq V_2$ and $A\cup B$ is independent. These parameters also allow to study bicliques in general graphs. When maximizing the cardinality $|A\cup B|$ one finds the stability number $α(G)$, well-known to be polynomial-time computa…
▽ More
We investigate some graph parameters dealing with biindependent pairs $(A,B)$ in a bipartite graph $G=(V_1\cup V_2,E)$, i.e., pairs $(A,B)$ where $A\subseteq V_1$, $B\subseteq V_2$ and $A\cup B$ is independent. These parameters also allow to study bicliques in general graphs. When maximizing the cardinality $|A\cup B|$ one finds the stability number $α(G)$, well-known to be polynomial-time computable. When maximizing the product $|A|\cdot |B|$ one finds the parameter $g(G)$, shown to be NP-hard by Peeters (2003), and when maximizing the ratio $|A|\cdot |B|/|A\cup B|$ one finds $h(G)$, introduced by Vallentin (2020) for bounding product-free sets in finite groups. We show that $h(G)$ is an NP-hard parameter and, as a crucial ingredient, that it is NP-complete to decide whether a bipartite graph $G$ has a balanced maximum independent set. These hardness results motivate introducing semidefinite programming bounds for $g(G)$, $h(G)$, and $α_{\text{bal}}(G)$ (the maximum cardinality of a balanced independent set). We show that these bounds can be seen as natural variations of the Lovász $\vartheta$-number, a well-known semidefinite bound on $α(G)$. In addition we formulate closed-form eigenvalue bounds and we show relationships among them as well as with earlier spectral parameters by Hoffman, Haemers (2001) and Vallentin (2020).
△ Less
Submitted 9 January, 2024; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Copositive matrices, sums of squares and the stability number of a graph
Authors:
Luis Felipe Vargas,
Monique Laurent
Abstract:
This chapter investigates the cone of copositive matrices, with a focus on the design and analysis of conic inner approximations for it. These approximations are based on various sufficient conditions for matrix copositivity, relying on positivity certificates in terms of sums of squares of polynomials. Their application to the discrete optimization problem asking for a maximum stable set in a gra…
▽ More
This chapter investigates the cone of copositive matrices, with a focus on the design and analysis of conic inner approximations for it. These approximations are based on various sufficient conditions for matrix copositivity, relying on positivity certificates in terms of sums of squares of polynomials. Their application to the discrete optimization problem asking for a maximum stable set in a graph is also discussed. A central theme in this chapter is understanding when the conic approximations suffice for describing the full copositive cone, and when the corresponding bounds for the stable set problem admit finite convergence.
△ Less
Submitted 20 March, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
On the Exactness of Sum-of-Squares Approximations for the Cone of $5\times 5$ Copositive Matrices
Authors:
Monique Laurent,
Luis Felipe Vargas
Abstract:
We investigate the hierarchy of conic inner approximations $\mathcal{K}^{(r)}_n$ ($r\in \mathbb{N}$) for the copositive cone $\text{COP}_n$, introduced by Parrilo (Structured Semidefinite Programs and Semialgebraic Geometry Methods in Robustness and Optimization, PhD Thesis, California Institute of Technology, 2001). It is known that $\text{COP}_4=\mathcal{K}^{(0)}_4$ and that, while the union of…
▽ More
We investigate the hierarchy of conic inner approximations $\mathcal{K}^{(r)}_n$ ($r\in \mathbb{N}$) for the copositive cone $\text{COP}_n$, introduced by Parrilo (Structured Semidefinite Programs and Semialgebraic Geometry Methods in Robustness and Optimization, PhD Thesis, California Institute of Technology, 2001). It is known that $\text{COP}_4=\mathcal{K}^{(0)}_4$ and that, while the union of the cones $\mathcal{K}^{(r)}_n$ covers the interior of $\text{COP}_n$, it does not cover the full cone $\text{COP}_n$ if $n\geq 6$. Here we investigate the remaining case $n=5$, where all extreme rays have been fully characterized by Hildebrand (The extreme rays of the 5 $\times$ 5 copositive cone. Linear Algebra and its Applications, 437(7):1538--1547, 2012). We show that the Horn matrix $H$ and its positive diagonal scalings play an exceptional role among the extreme rays of $\text{COP}_5$. We show that equality $\text{COP}_5=\bigcup_{r\geq 0} \mathcal{K}^{(r)}_5$ holds if and only if any positive diagonal scaling of $H$ belongs to $\mathcal{K}^{(r)}_5$ for some $r\in \mathbb{N}$. As a main ingredient for the proof, we introduce new Lasserre-type conic inner approximations for $\text{COP}_n$, based on sums of squares of polynomials. We show their links to the cones $\mathcal{K}^{(r)}_n$, and we use an optimization approach that permits to exploit finite convergence results on Lasserre hierarchy to show membership in the new cones.
△ Less
Submitted 11 May, 2022;
originally announced May 2022.
-
The Forward-Backward Envelope for Sampling with the Overdamped Langevin Algorithm
Authors:
Armin Eftekhari,
Luis Vargas,
Konstantinos Zygalakis
Abstract:
In this paper, we analyse a proximal method based on the idea of forward-backward splitting for sampling from distributions with densities that are not necessarily smooth. In particular, we study the non-asymptotic properties of the Euler-Maruyama discretization of the Langevin equation, where the forward-backward envelope is used to deal with the non-smooth part of the dynamics. An advantage of t…
▽ More
In this paper, we analyse a proximal method based on the idea of forward-backward splitting for sampling from distributions with densities that are not necessarily smooth. In particular, we study the non-asymptotic properties of the Euler-Maruyama discretization of the Langevin equation, where the forward-backward envelope is used to deal with the non-smooth part of the dynamics. An advantage of this envelope, when compared to widely-used Moreu-Yoshida one and the MYULA algorithm, is that it maintains the MAP estimator of the original non-smooth distribution. We also study a number of numerical experiments that corroborate that support our theoretical findings.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
Exactness of Parrilo's conic approximations for copositive matrices and associated low order bounds for the stability number of a graph
Authors:
Monique Laurent,
Luis Felipe Vargas
Abstract:
De Klerk and Pasechnik (2002) introduced the bounds $\vartheta^{(r)}(G)$ ($r\in \mathbb{N}$) for the stability number $α(G)$ of a graph $G$ and conjectured exactness at order $α(G)-1$: $\vartheta^{(α(G)-1)}(G)=α(G)$. These bounds rely on the conic approximations $\mathcal{K}_n^{(r)}$ by Parrilo (2000) for the copositive cone $\text{COP}_n$. A difficulty in the convergence analysis of…
▽ More
De Klerk and Pasechnik (2002) introduced the bounds $\vartheta^{(r)}(G)$ ($r\in \mathbb{N}$) for the stability number $α(G)$ of a graph $G$ and conjectured exactness at order $α(G)-1$: $\vartheta^{(α(G)-1)}(G)=α(G)$. These bounds rely on the conic approximations $\mathcal{K}_n^{(r)}$ by Parrilo (2000) for the copositive cone $\text{COP}_n$. A difficulty in the convergence analysis of $\vartheta^{(r)}$ is the bad behaviour of the cones $\mathcal{K}_n^{(r)}$ under adding a zero row/column: when applied to a matrix not in $\mathcal{K}^{(0)}_n$ this gives a matrix not in any ${\mathcal{K}}^{(r)}_{n+1}$, thereby showing strict inclusion $\bigcup_{r\ge 0}{\mathcal{K}}^{(r)}_n\subset \text{COP}_n$ for $n\ge 6$. We investigate the graphs with $\vartheta^{(r)}(G)=α(G)$ for $r=0,1$: we algorithmically reduce testing exactness of $\vartheta^{(0)}$ to acritical graphs, we characterize critical graphs with $\vartheta^{(0)}$ exact, and we exhibit graphs for which exactness of $\vartheta^{(1)}$ is not preserved under adding an isolated node. This disproves a conjecture by Gvozdenović and Laurent (2007) which, if true, would have implied the above conjecture by de Klerk and Pasechnik.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Minimum cross-entropy distributions on Wasserstein balls and their applications
Authors:
Luis Felipe Vargas,
Mauricio Velasco
Abstract:
Given a prior probability density $p$ on a compact set $K$ we characterize the probability distribution $q_δ^*$ on $K$ contained in a Wasserstein ball $B_δ(μ)$ centered in a given discrete measure $μ$ for which the relative-entropy $H(q,p)$ achieves its minimum. This characterization gives us an algorithm for computing such distributions efficiently
Given a prior probability density $p$ on a compact set $K$ we characterize the probability distribution $q_δ^*$ on $K$ contained in a Wasserstein ball $B_δ(μ)$ centered in a given discrete measure $μ$ for which the relative-entropy $H(q,p)$ achieves its minimum. This characterization gives us an algorithm for computing such distributions efficiently
△ Less
Submitted 6 June, 2021;
originally announced June 2021.
-
Finite convergence of sum-of-squares hierarchies for the stability number of a graph
Authors:
Monique Laurent,
Luis Felipe Vargas
Abstract:
We investigate a hierarchy of semidefinite bounds $\vartheta^{(r)}(G)$ for the stability number $α(G)$ of a graph $G$, based on its copositive programming formulation and introduced by de Klerk and Pasechnik [{\em SIAM J. Optim.} 12 (2002), pp.875--892], who conjectured convergence to $α(G)$ in $r=α(G)-1$ steps. Even the weaker conjecture claiming finite convergence is still open. We establish lin…
▽ More
We investigate a hierarchy of semidefinite bounds $\vartheta^{(r)}(G)$ for the stability number $α(G)$ of a graph $G$, based on its copositive programming formulation and introduced by de Klerk and Pasechnik [{\em SIAM J. Optim.} 12 (2002), pp.875--892], who conjectured convergence to $α(G)$ in $r=α(G)-1$ steps. Even the weaker conjecture claiming finite convergence is still open. We establish links between this hierarchy and sum-of-squares hierarchies based on the Motzkin-Straus formulation of $α(G)$, which we use to show finite convergence when $G$ is acritical, i.e., when $α(G\setminus e)=α(G)$ for all edges $e$ of $G$. This relies, in particular, on understanding the structure of the minimizers of the Motzkin-Straus formulation and showing that their number is finite precisely when $G$ is acritical. Moreover we show that these results hold in the general setting of the weighted stable set problem for graphs equipped with positive node weights. In addition, as a byproduct we show that deciding whether a standard quadratic program has finitely many minimizers does not admit a polynomial-time algorithm unless P=NP.
△ Less
Submitted 22 January, 2024; v1 submitted 2 March, 2021;
originally announced March 2021.
-
Accelerating proximal Markov chain Monte Carlo by using an explicit stabilised method
Authors:
Luis Vargas,
Marcelo Pereyra,
Konstantinos C. Zygalakis
Abstract:
We present a highly efficient proximal Markov chain Monte Carlo methodology to perform Bayesian computation in imaging problems. Similarly to previous proximal Monte Carlo approaches, the proposed method is derived from an approximation of the Langevin diffusion. However, instead of the conventional Euler-Maruyama approximation that underpins existing proximal Monte Carlo methods, here we use a st…
▽ More
We present a highly efficient proximal Markov chain Monte Carlo methodology to perform Bayesian computation in imaging problems. Similarly to previous proximal Monte Carlo approaches, the proposed method is derived from an approximation of the Langevin diffusion. However, instead of the conventional Euler-Maruyama approximation that underpins existing proximal Monte Carlo methods, here we use a state-of-the-art orthogonal Runge-Kutta-Chebyshev stochastic approximation that combines several gradient evaluations to significantly accelerate its convergence speed, similarly to accelerated gradient optimisation methods. The proposed methodology is demonstrated via a range of numerical experiments, including non-blind image deconvolution, hyperspectral unmixing, and tomographic reconstruction, with total-variation and $\ell_1$-type priors. Comparisons with Euler-type proximal Monte Carlo methods confirm that the Markov chains generated with our method exhibit significantly faster convergence speeds, achieve larger effective sample sizes, and produce lower mean square estimation errors at equal computational budget.
△ Less
Submitted 19 March, 2020; v1 submitted 23 August, 2019;
originally announced August 2019.