-
Adaptive Open-Loop Step-Sizes for Accelerated Convergence Rates of the Frank-Wolfe Algorithm
Authors:
Elias Wirth,
Javier Peña,
Sebastian Pokutta
Abstract:
Recent work has shown that in certain settings, the Frank-Wolfe algorithm (FW) with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for a fixed parameter $\ell \in \mathbb{N},\, \ell \geq 2$, attains a convergence rate faster than the traditional $O(t^{-1})$ rate. In particular, when a strong growth property holds, the convergence rate attainable with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ is $O(t^{-\ell})$. In this setting there is no single value of the parameter $\ell$ that prevails as superior. This paper shows that FW with log-adaptive open-loop step-sizes $η_t = \frac{2+\log(t+1)}{t+2+\log(t+1)}$ attains a convergence rate that is at least as fast as that attainable with fixed-parameter open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for any value of $\ell \in \mathbb{N},\,\ell\geq 2$. To establish our main convergence results, we extend our previous affine-invariant accelerated convergence results for FW to more general open-loop step-sizes of the form $η_t = g(t)/(t+g(t))$, where $g:\mathbb{N}\to\mathbb{R}_{\geq 0}$ is any non-decreasing function such that the sequence of step-sizes $(η_t)$ is non-increasing. This covers in particular the fixed-parameter case by choosing $g(t) = \ell$ and the log-adaptive case by choosing $g(t) = 2+ \log(t+1)$. To facilitate adoption of log-adaptive open-loop step-sizes, we have incorporated this rule into the {\tt FrankWolfe.jl} software package.
Submitted 14 May, 2025;
originally announced May 2025.
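The two step-size rules named in the abstract above can be compared in a few lines. The following is only a minimal sketch under stated assumptions: the toy problem (minimizing $\frac{1}{2}\|x-b\|^2$ over the probability simplex), the function names, and the iteration count are illustrative choices, not taken from the paper, and this is not the FrankWolfe.jl implementation.

```python
import math


def fixed_step(ell):
    """Fixed-parameter open-loop rule eta_t = ell / (t + ell), ell >= 2."""
    return lambda t: ell / (t + ell)


def log_adaptive_step(t):
    """Log-adaptive rule eta_t = g(t) / (t + g(t)) with g(t) = 2 + log(t + 1)."""
    g = 2.0 + math.log(t + 1)
    return g / (t + g)


def objective(x, b):
    """Toy objective (an assumption for illustration): 0.5 * ||x - b||^2."""
    return 0.5 * sum((xi - bi) ** 2 for xi, bi in zip(x, b))


def frank_wolfe_simplex(b, step, iters=200):
    """Vanilla Frank-Wolfe on the probability simplex with an open-loop step rule."""
    n = len(b)
    x = [1.0] + [0.0] * (n - 1)  # start at a vertex of the simplex
    for t in range(iters):
        grad = [xi - bi for xi, bi in zip(x, b)]
        # Linear minimization oracle: the simplex vertex e_i with smallest gradient entry.
        i = min(range(n), key=grad.__getitem__)
        eta = step(t)
        # Convex combination x <- (1 - eta) * x + eta * e_i.
        x = [(1.0 - eta) * xj for xj in x]
        x[i] += eta
    return x


if __name__ == "__main__":
    b = [0.2, 0.3, 0.5]
    for name, rule in [("ell = 2", fixed_step(2)), ("log-adaptive", log_adaptive_step)]:
        x = frank_wolfe_simplex(b, rule)
        print(name, objective(x, b))
```

Both rules are open-loop: the step depends only on the iteration counter $t$, so no line search or smoothness constant is needed.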
-
Strong odd coloring in minor-closed classes
Authors:
Miriam Goetze,
Fabian Klute,
Kolja Knauer,
Irene Parada,
Juan Pablo Peña,
Torsten Ueckerdt
Abstract:
We show that the strong odd chromatic number on any proper minor-closed graph class is bounded by a constant. We almost determine the smallest such constant for outerplanar graphs.
Submitted 5 May, 2025;
originally announced May 2025.
-
Locally connected graphs: metric properties
Authors:
Martín Matamala,
Juan Pablo Peña,
José Zamora
Abstract:
In this work we show that any connected locally connected graph defines a metric space having at least as many lines as vertices, with only three exceptions: the complete multipartite graphs $K_{1,2,2}$, $K_{2,2,2}$ and $K_{2,2,2,2}$. This proves that this class fulfills a conjecture, proposed by Chen and Chvátal, saying that any metric space on $n$ points has at least $n$ lines or a line containing all the points.
Submitted 27 February, 2025;
originally announced February 2025.
-
An evaluation algorithm for q-Bézier triangular patches formed by convex combinations
Authors:
Jorge Delgado,
Héctor Orera,
Juan Manuel Peña
Abstract:
An extension to triangular domains of the univariate q-Bernstein basis functions is introduced and analyzed. Some recurrence relations and properties such as partition of unity and degree elevation are proved for them. It is also proved that they form a basis for the space of polynomials of total degree less than or equal to n on a triangle. In addition, a de Casteljau type evaluation algorithm is presented whose steps are all linear convex combinations.
Submitted 22 January, 2025;
originally announced January 2025.
-
Accurate Bidiagonal Decomposition and Computations with Generalized Pascal Matrices
Authors:
Jorge Delgado,
Héctor Orera,
Juan Manuel Peña
Abstract:
This paper provides an accurate method to obtain the bidiagonal factorization of many generalized Pascal matrices, which in turn can be used to compute with high relative accuracy the eigenvalues, singular values and inverses of these matrices. Numerical examples are included.
Submitted 21 January, 2025;
originally announced January 2025.
-
Optimal properties of tensor product of B-bases
Authors:
Jorge Delgado,
Héctor Orera,
Juan Manuel Peña
Abstract:
We prove that, for the infinity norm, collocation matrices of the tensor product of normalized B-bases are optimally conditioned among the tensor products of all normalized totally positive bases of the corresponding space of functions. Bounds for the minimal eigenvalue and singular value and illustrative numerical examples are also included.
Submitted 20 January, 2025;
originally announced January 2025.
-
Accurate algorithms for Bessel matrices
Authors:
Jorge Delgado,
Héctor Orera,
Juan Manuel Peña
Abstract:
In this paper, we prove that any collocation matrix of Bessel polynomials at positive points is strictly totally positive, that is, all its minors are positive. Moreover, an accurate method to construct the bidiagonal factorization of these matrices is obtained and used to compute with high relative accuracy the eigenvalues, singular values and inverses. Similar results for the collocation matrices for the reverse Bessel polynomials are also obtained. Numerical examples illustrating the theoretical results are included.
Submitted 17 January, 2025;
originally announced January 2025.
-
Infinity norm bounds for the inverse of Nekrasov matrices using scaling matrices
Authors:
Héctor Orera,
Juan Manuel Peña
Abstract:
For many applications, it is convenient to have good upper bounds for the norm of the inverse of a given matrix. In this paper, we obtain such bounds when A is a Nekrasov matrix, by means of a scaling matrix transforming A into a strictly diagonally dominant matrix. Numerical examples and comparisons with other bounds are included. The scaling matrices are also used to derive new error bounds for linear complementarity problems when the involved matrix is a Nekrasov matrix. These error bounds can considerably improve previous bounds.
Submitted 16 January, 2025;
originally announced January 2025.
-
Uniform $h$-dichotomies: noncritical uniformity and expansivity
Authors:
Heli Elorreaga,
Juan Peña,
Gonzalo Robledo
Abstract:
The property of exponential dichotomy can be seen as a generalization of the hyperbolicity condition for non autonomous linear finite dimensional systems of ordinary differential equations. In 1978 W.A. Coppel proved that the exponential dichotomy on the half line is equivalent to the property of noncritical uniformity provided that a condition of bounded growth is verified. In 2006 K.J. Palmer extended this result by proving that -- also assuming the bounded growth property -- the exponential dichotomy on the half line, noncritical uniformity and the exponential expansiveness are equivalent. The main contribution of this article is to generalize these results for the property of uniform $h$-dichotomy. This has been carried out due to a recent idea: under suitable conditions any $h$-dichotomy can be associated to a totally ordered topological group, which becomes the additive group $(\mathbb{R},+)$ in case of the exponential dichotomy. The properties of this new group make possible such generalization.
Submitted 8 November, 2024;
originally announced November 2024.
-
Lines on digraphs of low diameter
Authors:
Gabriela Araujo-Pardo,
Martín Matamala,
Juan P. Peña,
José Zamora
Abstract:
A set of n non-collinear points in the Euclidean plane defines at least n different lines. Chen and Chvátal conjectured in 2008 that the same result is true in metric spaces for an adequate definition of line. More recently, it was conjectured in 2018 by Aboulker et al. that any large enough bridgeless graph on n vertices defines a metric space that has at least n lines. We study the natural extension of Aboulker et al.'s conjecture into the context of quasi-metric spaces defined by digraphs of low diameter. We prove that it is valid for quasi-metric spaces defined by bipartite digraphs of diameter at most three, oriented graphs of diameter two, and digraphs of diameter three and directed girth four.
Submitted 28 October, 2024;
originally announced October 2024.
-
A study on token digraphs
Authors:
Cristina G. Fernandes,
Carla N. Lintzmayer,
Juan P. Peña,
Giovanne Santos,
Ana Trujillo-Negrete,
Jose Zamora
Abstract:
For a digraph $D$ of order $n$ and an integer $1 \leq k \leq n-1$, the $k$-token digraph of $D$ is the graph whose vertices are all $k$-subsets of vertices of $D$ and, given two such $k$-subsets $A$ and $B$, $(A,B)$ is an arc in the $k$-token digraph whenever $\{a\} = A \setminus B$, $\{b\} = B \setminus A$, and there is an arc $(a,b)$ in $D$. Token digraphs are a generalization of token graphs. In this paper, we study some properties of token digraphs, including strong and unilateral connectivity, kernels, girth, circumference and Eulerianity. We also extend some known results on the clique and chromatic numbers of $k$-token graphs, addressing the bidirected clique number and dichromatic number of $k$-token digraphs. Additionally, we prove that determining whether $2$-token digraphs have a kernel is NP-complete.
Submitted 26 October, 2024;
originally announced October 2024.
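The definition of the $k$-token digraph given in the abstract above translates directly into a construction. The sketch below is an illustration only; the function name and the representation of $D$ as a vertex list plus a list of arc pairs are assumptions made here, not something taken from the paper.

```python
from itertools import combinations


def token_digraph(vertices, arcs, k):
    """Build the k-token digraph of D = (vertices, arcs).

    Nodes are the k-subsets of the vertices. (A, B) is an arc whenever
    A \\ B = {a}, B \\ A = {b} and (a, b) is an arc of D.
    """
    arc_set = set(arcs)
    nodes = [frozenset(c) for c in combinations(vertices, k)]
    token_arcs = set()
    for A in nodes:
        for a in A:
            for b in vertices:
                # Move one token from a to an unoccupied vertex b along an arc of D.
                if b not in A and (a, b) in arc_set:
                    B = (A - {a}) | {b}
                    token_arcs.add((A, B))
    return nodes, token_arcs


if __name__ == "__main__":
    # Directed 3-cycle 0 -> 1 -> 2 -> 0.
    nodes, t_arcs = token_digraph([0, 1, 2], [(0, 1), (1, 2), (2, 0)], k=2)
    for A, B in sorted((sorted(A), sorted(B)) for A, B in t_arcs):
        print(A, "->", B)
```

For $k = 1$ the construction returns a copy of $D$ itself on singleton sets, which is a quick sanity check.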
-
Fast convergence of Frank-Wolfe algorithms on polytopes
Authors:
Elias Wirth,
Javier Pena,
Sebastian Pokutta
Abstract:
We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended curvature. These properties depend solely on the polytope and objective function but not on any affine-dependent object like norms. For each one of the above algorithms, we derive rates of convergence ranging from sublinear to linear depending on the degree of the error bound.
Submitted 15 February, 2025; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Quasimetric spaces with few lines
Authors:
Guillermo Gamboa Quintero,
Martín Matamala,
Juan Pablo Peña
Abstract:
Chen and Chvátal conjectured in 2008 that in any finite metric space either there is a line containing all the points - a universal line -, or the number of lines is at least the number of points. This is a generalization of a classical result due to Erdős that says that a set of $n$ non-collinear points in the Euclidean plane defines at least $n$ different lines.
A line of a metric space with metric $ρ$ is defined in terms of a notion called the betweenness of the space which is the set of all triples $(x,z,y)$ such that $ρ(x,y)=ρ(x,z)+ρ(z,y)$.
In this work we prove that for each $n\geq 4$ there are $p_3(n)$ non isomorphic betweennesses arising from \emph{quasimetric} spaces with $n$ points, without universal lines and with exactly 3 lines, where $p_3(n)$ is the number of partitions of an integer $n$ into three parts. We also prove that for $n\geq 5$, there are $2p_3(n-1)$ non isomorphic betweennesses arising from quasimetric spaces on $n$ points, without universal lines and with exactly 4 lines. Here two betweennesses are isomorphic if they are isomorphic as relational structures.
None of the betweennesses mentioned above is metric which implies that Chen and Chvátal's conjecture is valid for metric spaces with at most five points.
Submitted 29 May, 2024;
originally announced May 2024.
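Two objects from the abstract above are easy to compute for small cases: the betweenness of a space and the count $p_3(n)$. The following sketch makes two assumptions that the abstract does not state: betweenness triples are taken over pairwise distinct points, and "three parts" means three positive parts. The function names are hypothetical, and exact equality is only reliable for integer-valued distances.

```python
from itertools import permutations


def betweenness(points, rho):
    """Triples (x, z, y) of distinct points with rho(x, y) == rho(x, z) + rho(z, y).

    rho need not be symmetric, so this also applies to quasimetric spaces.
    """
    return {
        (x, z, y)
        for x, z, y in permutations(points, 3)
        if rho(x, y) == rho(x, z) + rho(z, y)
    }


def p3(n):
    """Number of partitions of n into three positive parts a <= b <= c."""
    return sum(
        1
        for a in range(1, n + 1)
        for b in range(a, n + 1)
        if n - a - b >= b
    )


if __name__ == "__main__":
    # Three points on a line with the usual distance.
    print(betweenness([0, 1, 2], lambda x, y: abs(x - y)))
    print([p3(n) for n in range(4, 9)])
```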
-
Combined matrices of almost strictly sign regular matrices
Authors:
Pedro Alonso,
Juan Manuel Peña,
María Luisa Serrano
Abstract:
The combined matrix is a very useful concept for many applications. Almost strictly sign regular (ASSR) matrices form an important structured class of matrices with two possible zero patterns, which are either type-I staircase or type-II staircase. We prove that, under an irreducibility condition, the pattern of zero and nonzero entries of an ASSR matrix is preserved by the corresponding combined matrix. Without the irreducibility condition, it is proved that type-I and type-II staircases are still preserved. Illustrative numerical examples are included.
Submitted 2 February, 2024;
originally announced February 2024.
-
Almost strictly sign regular rectangular matrices
Authors:
P. Alonso,
J. M. Peña,
M. L. Serrano
Abstract:
Almost strictly sign regular matrices are sign regular matrices with a special zero pattern and whose nontrivial minors are nonzero. In this paper we provide several properties of almost strictly sign regular rectangular matrices and analyze their QR factorization.
Submitted 2 February, 2024;
originally announced February 2024.
-
Duality of Hoffman constants
Authors:
Javier F. Pena,
Juan C. Vera,
Luis F. Zuluaga
Abstract:
Suppose $A\in \mathbb{R}^{m\times n}$ and consider the following canonical systems of inequalities defined by $A$: $$ \begin{array}{l} Ax=b\\ x \ge 0 \end{array} \qquad \text{ and }\qquad A^T y - c \le 0. $$ We establish some novel duality relationships between the Hoffman constants for the above constraint systems of linear inequalities provided some suitable Slater condition holds. The crux of our approach is a Hoffman duality inequality for polyhedral systems of constraints. The latter in turn yields an interesting duality identity between the Hoffman constants of the following box-constrained systems of inequalities: $$ \begin{array}{l} Ax=b\\ \ell \le x \le u \end{array}\qquad \text{ and }\qquad \ell \le A^T y - c \le u $$ for $\ell, u\in \mathbb{R}^n$ with $\ell < u.$
Submitted 15 December, 2023;
originally announced December 2023.
-
Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes
Authors:
Elias Wirth,
Javier Pena,
Sebastian Pokutta
Abstract:
Recent papers have shown that the Frank-Wolfe algorithm (FW) with open-loop step-sizes exhibits rates of convergence faster than the iconic $\mathcal{O}(t^{-1})$ rate. In particular, when the minimizer of a strongly convex function over a polytope lies in the relative interior of a feasible region face, the FW with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for $\ell \in \mathbb{N}_{\geq 2}$ has accelerated convergence $\mathcal{O}(t^{-2})$ in contrast to the rate $Ω(t^{-1-ε})$ attainable with more complex line-search or short-step step-sizes. Given the relevance of this scenario in data science problems, research has grown to explore the settings enabling acceleration in open-loop FW. However, despite FW's well-known affine invariance, existing acceleration results for open-loop FW are affine-dependent. This paper remedies this gap in the literature by merging two recent research trajectories: affine invariance (Wirth et al., 2023b) and open-loop step-sizes (Pena, 2021). In particular, we extend all known non-affine-invariant convergence rates for FW with open-loop step-sizes to affine-invariant results.
Submitted 20 January, 2025; v1 submitted 6 October, 2023;
originally announced October 2023.
-
On the Probability of Immunity
Authors:
Jose M. Peña
Abstract:
This work is devoted to the study of the probability of immunity, i.e. the effect occurs whether exposed or not. We derive necessary and sufficient conditions for non-immunity and $ε$-bounded immunity, i.e. the probability of immunity is zero and $ε$-bounded, respectively. The former allows us to estimate the probability of benefit (i.e., the effect occurs if and only if exposed) from a randomized controlled trial, and the latter allows us to produce bounds of the probability of benefit that are tighter than the existing ones. We also introduce the concept of indirect immunity (i.e., through a mediator) and repeat our previous analysis for it. Finally, we propose a method for sensitivity analysis of the probability of immunity under unmeasured confounding.
Submitted 11 October, 2023; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Boundedness for proper conflict-free and odd colorings
Authors:
Andrea Jiménez,
Kolja Knauer,
Carla Negri Lintzmayer,
Martín Matamala,
Juan Pablo Peña,
Daniel A. Quiroz,
Maycon Sambinelli,
Yoshiko Wakabayashi,
Weiqiang Yu,
José Zamora
Abstract:
The proper conflict-free chromatic number, $χ_{pcf}(G)$, of a graph $G$ is the least $k$ such that $G$ has a proper $k$-coloring in which for each non-isolated vertex there is a color appearing exactly once among its neighbors. The proper odd chromatic number, $χ_{o}(G)$, of $G$ is the least $k$ such that $G$ has a proper coloring in which for every non-isolated vertex there is a color appearing an odd number of times among its neighbors. We say that a graph class $\mathcal{G}$ is $χ_{pcf}$-bounded ($χ_{o}$-bounded) if there is a function $f$ such that $χ_{pcf}(G) \leq f(χ(G))$ ($χ_{o}(G) \leq f(χ(G))$) for every $G \in \mathcal{G}$. Caro et al. (2022) asked for classes that are linearly $χ_{pcf}$-bounded ($χ_{pcf}$-bounded), and as a starting point, they showed that every claw-free graph $G$ satisfies $χ_{pcf}(G) \le 2Δ(G)+1$, which implies $χ_{pcf}(G) \le 4χ(G)+1$.
In this paper, we improve the bound for claw-free graphs to a nearly tight bound by showing that such a graph $G$ satisfies $χ_{pcf}(G) \le Δ(G)+6$, and even $χ_{pcf}(G) \le Δ(G)+4$ if it is a quasi-line graph. These results also give evidence for a conjecture by Caro et al. Moreover, we show that convex-round graphs and permutation graphs are linearly $χ_{pcf}$-bounded. For these last two results, we prove a lemma that reduces the problem of deciding if a hereditary class is linearly $χ_{pcf}$-bounded to deciding if the bipartite graphs in the class are $χ_{pcf}$-bounded by an absolute constant. This lemma complements a theorem of Liu (2022) and motivates us to study boundedness in bipartite graphs. In particular, we show that biconvex bipartite graphs are $χ_{pcf}$-bounded while convex bipartite graphs are not even $χ_o$-bounded, and exhibit a class of bipartite circle graphs that is linearly $χ_o$-bounded but not $χ_{pcf}$-bounded.
Submitted 9 February, 2024; v1 submitted 31 July, 2023;
originally announced August 2023.
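The two coloring notions defined in the abstract above can be checked mechanically for a given coloring. The sketch below is illustrative only: the adjacency-dictionary representation and the function names are assumptions, and it verifies a given coloring rather than computing $χ_{pcf}(G)$ or $χ_{o}(G)$.

```python
from collections import Counter


def is_proper(adj, color):
    """Adjacent vertices receive different colors."""
    return all(color[u] != color[v] for u in adj for v in adj[u])


def is_proper_conflict_free(adj, color):
    """Proper, and every non-isolated vertex sees some color exactly once among its neighbors."""
    if not is_proper(adj, color):
        return False
    for v, nbrs in adj.items():
        if nbrs and 1 not in Counter(color[u] for u in nbrs).values():
            return False
    return True


def is_proper_odd(adj, color):
    """Proper, and every non-isolated vertex sees some color an odd number of times."""
    if not is_proper(adj, color):
        return False
    for v, nbrs in adj.items():
        counts = Counter(color[u] for u in nbrs)
        if nbrs and not any(c % 2 == 1 for c in counts.values()):
            return False
    return True


if __name__ == "__main__":
    # Star K_{1,3}: center 0 with leaves 1, 2, 3 all colored alike.
    star = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}
    coloring = {0: 0, 1: 1, 2: 1, 3: 1}
    print(is_proper_odd(star, coloring), is_proper_conflict_free(star, coloring))
```

The star example separates the two notions: the center sees color 1 three times, which is odd but not exactly once.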
-
An easily computable upper bound on the Hoffman constant for homogeneous inequality systems
Authors:
Javier Peña
Abstract:
Let $A\in \mathbb{R}^{m\times n}\setminus \{0\}$ and $P:=\{x:Ax\le 0\}$. This paper provides a procedure to compute an upper bound on the following homogeneous Hoffman constant: \[ H_0(A) := \sup_{u\in \mathbb{R}^n \setminus P} \frac{\text{dist}(u,P)}{\text{dist}(Au, \mathbb{R}^m_-)}. \] In sharp contrast to the intractability of computing more general Hoffman constants, the procedure described in this paper is entirely tractable and easily implementable.
Submitted 14 July, 2023; v1 submitted 4 February, 2023;
originally announced February 2023.
-
A validation study of normoglycemia and dysglycemia indices as a diabetes risk model
Authors:
Paola Vargas,
Miguel Angel Moreles,
Joaquin Peña,
Adriana Monroy
Abstract:
In this work, we test the performance of peak glucose concentration ($A$) and average of glucose removal rates ($α$) as normoglycemia and dysglycemia indices on a population monitored at the Mexico General Hospital between the years 2017 and 2019. A total of 1911 volunteer patients at the Mexico General Hospital are considered: 1282 female patients aged 17 to 80 years, and 629 male patients aged 18 to 79 years. For each volunteer, OGTT data is gathered and indices are estimated in Ackerman's model. A binary separation of normoglycemic and dysglycemic patients using a Support Vector Machine with a linear kernel is carried out. Classification indices are successful for 83\%. Population clusters on diabetic conditions and progression from normoglycemic to T2DM may be concluded. The classification indices $A$ and $α$ may be regarded as patient's indices and used to detect diabetes risk. Also, criteria for the applicability of glucose-insulin regulation models are introduced. The performance of Ackerman's model is shown.
Submitted 29 November, 2022;
originally announced November 2022.
-
Bidiagonal Decompositions of Vandermonde-Type Matrices of Arbitrary Rank
Authors:
Jorge Delgado,
Plamen Koev,
Ana Marco,
Jose-Javier Martinez,
Juan Manuel Pena,
Per-Olof Persson,
Steven Spasov
Abstract:
We present a method to derive new explicit expressions for bidiagonal decompositions of Vandermonde and related matrices such as the (q-, h-) Bernstein-Vandermonde ones, among others. These results generalize the existing expressions for nonsingular matrices to matrices of arbitrary rank. For totally nonnegative matrices of the above classes, the new decompositions can be computed efficiently and to high relative accuracy componentwise in floating point arithmetic. In turn, matrix computations (e.g., eigenvalue computation) can also be performed efficiently and to high relative accuracy.
Submitted 18 October, 2022;
originally announced October 2022.
-
Affine invariant convergence rates of the conditional gradient method
Authors:
Javier Pena
Abstract:
We show that the conditional gradient method for the convex composite problem \[\min_x\{f(x) + Ψ(x)\}\] generates primal and dual iterates with a duality gap converging to zero provided a suitable {\em growth property} holds and the algorithm makes a judicious choice of stepsizes. The rate of convergence of the duality gap to zero ranges from sublinear to linear depending on the degree of the growth property. The growth property and convergence results depend on the pair $(f,Ψ)$ in an affine invariant and norm-independent fashion.
Submitted 26 May, 2023; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Linear convergence of the Douglas-Rachford algorithm via a generic error bound condition
Authors:
Javier Peña,
Juan C. Vera,
Luis F. Zuluaga
Abstract:
We provide new insight into the convergence properties of the Douglas-Rachford algorithm for the problem $\min_x \{f(x)+g(x)\}$, where $f$ and $g$ are convex functions. Our approach relies on and highlights the natural primal-dual symmetry between the above problem and its Fenchel dual $\min_{u} \{ f^*(u) + g_*(u)\}$ where $g_*(u):=g^*(-u)$. Our main development is to show the linear convergence of the algorithm when a natural error bound condition on the Douglas-Rachford operator holds. We leverage our error bound condition approach to show and estimate the algorithm's linear rate of convergence for three special classes of problems. The first one is when $f$ or$g$ and $f^*$ or $g_*$ are strongly convex relative to the primal and dual optimal sets respectively. The second one is when~$f$ and~$g$ are piecewise linear-quadratic functions. The third one is when~$f$ and~$g$ are the indicator functions of closed convex cones. In all three cases the rate of convergence is determined by a suitable measure of well-posedness of the problem. In the conic case, if the two closed convex cones are a linear subspace $L$ and $\mathbb{R}^n_+$, we establish the following stronger {\em finite termination} result: the Douglas-Rachford algorithm identifies the {\em maximum support sets} for $L\cap \mathbb{R}^n_+$ and $L^{\perp}\cap\mathbb{R}^n_+$ in finitely many steps. Our developments have straightforward extensions to the more general linearly constrained problem $\min_{x,y} \{f(x) + g( y):Ax + By = b\}$ thereby highlighting a direct and straightforward relationship between the Douglas-Rachford algorithm and the alternating direction method of multipliers (ADMM).
Submitted 11 November, 2021;
originally announced November 2021.
-
Towards Conditional Path Analysis
Authors:
Jose M. Peña
Abstract:
We extend path analysis by giving sufficient conditions for computing the partial covariance of two random variables from their covariance. This is specifically done by correcting the covariance with the product of some partial variance ratios. As a result, the partial covariance retains the covariance's salient feature of factorizing over the edges in the paths between the two variables of interest.
Submitted 21 December, 2020;
originally announced December 2020.
-
Projection and rescaling algorithm for finding maximum support solutions to polyhedral conic systems
Authors:
Javier Pena,
Negar Soheili
Abstract:
We propose a simple projection and rescaling algorithm that finds maximum support solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ is a linear subspace of $\mathbb{R}^n$ and $L^\perp$ is its orthogonal complement. The algorithm complements a basic procedure that involves only projections onto $L$ and $L^\perp$ with a periodic rescaling step. The number of rescaling steps and thus overall computational work performed by the algorithm are bounded above in terms of a condition measure of the above pair of problems.
Our algorithm is a natural but significant extension of a previous projection and rescaling algorithm that finds a solution to the problem \[ \text{find} \; x\in L\cap\mathbb{R}^n_{++} \] when this problem is feasible. As a byproduct of our new developments, we obtain a sharper analysis of the projection and rescaling algorithm in the latter special case.
Submitted 3 December, 2021; v1 submitted 19 March, 2020;
originally announced March 2020.
-
Computing odd periods of alternating systems of affine circle maps
Authors:
J. S. Cánovas Peña,
A. Linero Bas,
G. Soler López
Abstract:
Let $f,g$ be affine circle maps and let $[f,g]$ be the alternating system generated by $f$ and $g$. We present an algorithm to compute the periodic structure of $[f,g]$.
Submitted 27 September, 2019;
originally announced September 2019.
-
A GPU implementation of the Discontinuous Galerkin method for simulation of diffusion in brain tissue
Authors:
Daniel Cervantes,
Miguel Angel Moreles,
Joaquin Peña,
Alonso Ramirez-Manzanares
Abstract:
In this work we develop a methodology to approximate the covariance matrix associated with the simulation of water diffusion inside brain tissue. The computation is based on an implementation of the Discontinuous Galerkin method for the diffusion equation, in accord with the physical phenomenon. The implementation is in parallel using GPUs in the CUDA language. Numerical results are presented for 2D problems.
Submitted 14 July, 2019;
originally announced July 2019.
-
Equivalence and invariance of the chi and Hoffman constants of a matrix
Authors:
Javier F. Pena,
Juan C. Vera,
Luis F. Zuluaga
Abstract:
We show that the following two condition measures of a full column rank matrix $A \in \mathbb{R}^{m\times n}$ are identical: the chi constant and a signed Hoffman constant. This identity is naturally suggested by the evident invariance of the chi constant under sign changes of the rows of $A$. We also show that similar equivalence and invariance properties extend to variants of the chi and Hoffman constants that depend only on the linear subspace $A(\mathbb{R}^n):=\{Ax: x\in\mathbb{R}^n\} \subseteq \mathbb{R}^m$. Finally, we show similar identities between the chi constants and signed versions of Renegar's and Grassmannian condition measures.
Submitted 18 May, 2020; v1 submitted 15 May, 2019;
originally announced May 2019.
-
New characterizations of Hoffman constants for systems of linear constraints
Authors:
Javier Pena,
Juan Vera,
Luis Zuluaga
Abstract:
We give a characterization of the Hoffman constant of a system of linear constraints in $\mathbb{R}^n$ {\em relative} to a {\em reference polyhedron} $R\subseteq\mathbb{R}^n$. The reference polyhedron $R$ represents constraints that are easy to satisfy such as box constraints. In the special case $R = \mathbb{R}^n$, we obtain a novel characterization of the classical Hoffman constant.
More precisely, suppose $R\subseteq \mathbb{R}^n$ is a reference polyhedron, $A\in \mathbb{R}^{m\times n},$ and $A(R):=\{Ax: x\in R\}$. We characterize the sharpest constant $H(A|R)$ such that for all $b \in A(R) + \mathbb{R}^m_+$ and $u\in R$ \[ \operatorname{dist}(u, P_{A}(b)\cap R) \le H(A|R) \cdot \|(Au-b)_+\|, \] where $P_A(b) = \{x\in \mathbb{R}^n:Ax\le b\}$. Our characterization is stated in terms of the largest of a canonical collection of easily computable Hoffman constants. Our characterization in turn suggests new algorithmic procedures to compute Hoffman constants.
Submitted 23 January, 2020; v1 submitted 7 May, 2019;
originally announced May 2019.
-
Generalized conditional subgradient and generalized mirror descent: duality, convergence, and symmetry
Authors:
Javier Pena
Abstract:
We provide new insight into a {\em generalized conditional subgradient} algorithm and a {\em generalized mirror descent} algorithm for the convex minimization problem \[ \min_x \; \{f(Ax) + h(x)\}.\] As Bach showed in [{\em SIAM J. Optim.}, 25 (2015), pp. 115--129], applying either of these two algorithms to this problem is equivalent to applying the other one to its Fenchel dual. We leverage this duality relationship to develop new upper bounds and convergence results for the gap between the primal and dual iterates generated by these two algorithms. We also propose a new {\em primal-dual hybrid} algorithm that combines features of the conditional subgradient and mirror descent algorithms to solve the primal and dual problems in a symmetric fashion. Our algorithms and main results rely only on the availability of computable oracles for $\partial f$ and $\partial h^*$, and for $A$ and $A^*$.
Submitted 3 June, 2019; v1 submitted 1 March, 2019;
originally announced March 2019.
-
Extremal and optimal properties of B-bases Collocation Matrices
Authors:
Jorge Delgado,
J. M. Peña
Abstract:
Totally positive matrices are related to the shape preserving representations of a space of functions. The normalized B-basis of the space has optimal shape preserving properties. B-splines and rational Bernstein bases are examples of normalized B-bases. Some results on the optimal conditioning and on extremal properties of the minimal eigenvalue and singular value of the collocation matrices of normalized B-bases are proved. Numerical examples confirm the theoretical results and answer related questions.
Submitted 28 January, 2019;
originally announced January 2019.
-
The condition number of a function relative to a set
Authors:
David H. Gutman,
Javier F. Pena
Abstract:
The condition number of a differentiable convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is the square of the aspect ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bounds the linear rate of convergence of the gradient descent algorithm for unconstrained convex minimization.
We propose a condition number of a differentiable convex function relative to a reference convex set and distance function pair. This relative condition number is defined as the ratio of a relative smoothness constant to a relative strong convexity constant. We show that the relative condition number extends the main properties of the traditional condition number both in terms of its geometric insight and in terms of its role in characterizing the linear convergence of first-order methods for constrained convex minimization.
When the reference set $X$ is a convex cone or a polyhedron and the function $f$ is of the form $f = g\circ A$, we provide characterizations of and bounds on the condition number of $f$ relative to $X$ in terms of the usual condition number of $g$ and a suitable condition number of the pair $(A,X)$.
Submitted 18 April, 2020; v1 submitted 24 January, 2019;
originally announced January 2019.
-
Perturbed Fenchel duality and first-order methods
Authors:
David H. Gutman,
Javier F. Peña
Abstract:
We show that the iterates generated by a generic first-order meta-algorithm satisfy a canonical perturbed Fenchel duality inequality. The latter in turn readily yields a unified derivation of the best known convergence rates for various popular first-order algorithms including the conditional gradient method as well as the main kinds of Bregman proximal methods: subgradient, gradient, fast gradient, and universal gradient methods.
Submitted 3 December, 2021; v1 submitted 25 December, 2018;
originally announced December 2018.
-
SVD update methods for large matrices and applications
Authors:
Juan Manuel Peña,
Tomas Sauer
Abstract:
We consider the problem of updating the SVD when augmenting a "tall thin" matrix, i.e., a rectangular matrix $A \in \mathbb{R}^{m \times n}$ with $m \gg n$. Supposing that an SVD of $A$ is already known, and given a matrix $B \in \mathbb{R}^{m \times n'}$, we derive an efficient method to compute and efficiently store the SVD of the augmented matrix $[ A B ] \in \mathbb{R}^{m \times (n+n')}$. This is an important tool for two types of applications: in the context of principal component analysis, the dominant left singular vectors provided by this decomposition form an orthonormal basis for the best linear subspace of a given dimension, while from the right singular vectors one can extract an orthonormal basis of the kernel of the matrix. We also describe two concrete applications of these concepts which motivated the development of our method and to which it is very well adapted.
Submitted 10 September, 2018;
originally announced September 2018.
-
A data-independent distance to infeasibility for linear conic systems
Authors:
Javier Pena,
Vera Roshchina
Abstract:
We offer a unified treatment of distinct measures of well-posedness for homogeneous conic systems. To that end, we introduce a distance to infeasibility based entirely on geometric considerations of the elements defining the conic system. Our approach sheds new light on and connects several well-known condition measures for conic systems, including {\em Renegar's} distance to infeasibility, the {\em Grassmannian} condition measure, a measure of the {\em most interior} solution, and other geometric measures of {\em symmetry} and of {\em depth} of the conic system.
Submitted 23 January, 2020; v1 submitted 23 May, 2018;
originally announced May 2018.
-
An algorithm to compute the Hoffman constant of a system of linear constraints
Authors:
Javier Pena,
Juan Vera,
Luis Zuluaga
Abstract:
We propose a combinatorial algorithm to compute the Hoffman constant of a system of linear equations and inequalities. The algorithm is based on a characterization of the Hoffman constant as the largest of a finite canonical collection of easy-to-compute Hoffman constants. Our algorithm and characterization extend to the more general context where some of the constraints are easy to satisfy as in the case of box constraints. We highlight some natural connections between our characterizations of the Hoffman constant and Renegar's distance to ill-posedness for systems of linear constraints.
Submitted 23 April, 2018;
originally announced April 2018.
-
Computational performance of a projection and rescaling algorithm
Authors:
Javier Pena,
Negar Soheili
Abstract:
This paper documents a computational implementation of a {\em projection and rescaling algorithm} for finding most interior solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ denotes a linear subspace in $\mathbb{R}^n$ and $L^\perp$ denotes its orthogonal complement. The projection and rescaling algorithm is a recently developed method that combines a {\em basic procedure} involving only low-cost operations with a periodic {\em rescaling step.} We give a full description of a MATLAB implementation of this algorithm and present multiple sets of numerical experiments on synthetic problem instances with varied levels of conditioning. Our computational experiments provide promising evidence of the effectiveness of the projection and rescaling algorithm.
Our MATLAB code is publicly available. Furthermore, the simplicity of the algorithm makes a computational implementation in other environments completely straightforward.
Submitted 3 June, 2019; v1 submitted 19 March, 2018;
originally announced March 2018.
-
Biot's parameters estimation in ultrasound propagation through cancellous bone
Authors:
Miguel Angel Moreles,
Jose Angel Neria,
Joaquin Peña
Abstract:
Of interest is the characterization of a cancellous bone immersed in an acoustic fluid. The bone is placed between an ultrasonic point source and a receiver. Cancellous bone is regarded as a porous medium saturated with fluid according to Biot's theory. This model is coupled with the fluid in an open pore configuration and solved by means of the Finite Volume Method. Characterization is posed as a Bayesian parameter estimation problem in Biot's model given pressure data collected at the receiver. As a first step we present numerical results in 2D for signal recovery. It is shown that as point estimators, the Conditional Mean outperforms the classical PDE-constrained minimization solution.
Submitted 1 February, 2018;
originally announced February 2018.
-
The condition of a function relative to a polytope
Authors:
David H. Gutman,
Javier F. Pena
Abstract:
The condition number of a smooth convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is precisely the square of the diameter-to-width ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bounds the linear rate of convergence of the gradient descent algorithm for unconstrained minimization.
We propose a condition number of a smooth convex function relative to a reference polytope. This relative condition number is defined as the ratio of a relative smoothness constant to a relative strong convexity constant of the function, where both constants are relative to the reference polytope. The relative condition number extends the main properties of the traditional condition number. In particular, we show that the condition number of a quadratic convex function relative to a polytope is precisely the square of the diameter-to-facial-distance ratio of a scaled polytope for a canonical scaling induced by the function. Furthermore, we illustrate how the relative condition number of a function bounds the linear rate of convergence of first-order methods for minimization of the function over the polytope.
Submitted 1 February, 2018;
originally announced February 2018.
-
Convergence rates of proximal gradient methods via the convex conjugate
Authors:
David H. Gutman,
Javier F. Pena
Abstract:
We give a novel proof of the $O(1/k)$ and $O(1/k^2)$ convergence rates of the proximal gradient and accelerated proximal gradient methods for composite convex minimization. The crux of the new proof is an upper bound constructed via the convex conjugate of the objective function.
Submitted 8 January, 2018; v1 submitted 8 January, 2018;
originally announced January 2018.
-
Identification of Strong Edges in AMP Chain Graphs
Authors:
Jose M. Peña
Abstract:
The essential graph is a distinguished member of a Markov equivalence class of AMP chain graphs. However, the directed edges in the essential graph are not necessarily strong or invariant, i.e. they may not be shared by every member of the equivalence class. Likewise for the undirected edges. In this paper, we develop a procedure for identifying which edges in an essential graph are strong. We also show how this makes it possible to bound some causal effects when the true chain graph is unknown.
Submitted 25 June, 2018; v1 submitted 23 November, 2017;
originally announced November 2017.
-
Optimal interval length for the collocation of the Newton basis
Authors:
J. M. Carnicer,
Y. Khiar,
J. M. Peña
Abstract:
It is known that the Lagrange interpolation problem at equidistant nodes is ill-conditioned. We explore the influence of the interval length in the computation of divided differences of the Newton interpolation formula. Condition numbers are computed for lower triangular matrices associated to the Newton interpolation formula at equidistant nodes. We consider the collocation matrices $L$ and $P_L$ of the monic Newton basis and a normalized Newton basis, so that $P_L$ is the lower triangular Pascal matrix. In contrast to $L$, $P_L$ does not depend on the interval length, and we show that the Skeel condition number of the $(n+1)\times (n+1)$ lower triangular Pascal matrix is $3^n$. The $\infty$-norm condition number of the collocation matrix $L$ of the monic Newton basis is computed in terms of the interval length. The minimum asymptotic growth rate is achieved for intervals of length 3.
Submitted 20 September, 2017;
originally announced September 2017.
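The $3^n$ value quoted in the abstract above can be checked numerically for small $n$. The following is a minimal sketch, not the authors' code: it assumes the usual definition of the Skeel condition number as the $\infty$-norm of $|A^{-1}|\,|A|$ and uses the well-known closed form $(-1)^{i-j}\binom{i}{j}$ for the inverse of the lower triangular Pascal matrix. All function names are illustrative.

```python
from math import comb


def pascal_lower(n):
    """(n+1) x (n+1) lower triangular Pascal matrix with entries C(i, j)."""
    return [[comb(i, j) for j in range(n + 1)] for i in range(n + 1)]


def pascal_lower_inverse(n):
    """Closed-form inverse (assumed here): entries (-1)^(i-j) * C(i, j)."""
    return [
        [(-1 if (i - j) % 2 else 1) * comb(i, j) for j in range(n + 1)]
        for i in range(n + 1)
    ]


def matmul(a, b):
    """Exact integer product of two square matrices given as nested lists."""
    size = len(a)
    return [
        [sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
        for i in range(size)
    ]


def skeel_condition_number(a, a_inv):
    """Infinity norm of |A^{-1}| |A|, i.e. the largest row sum of that product."""
    abs_a = [[abs(x) for x in row] for row in a]
    abs_inv = [[abs(x) for x in row] for row in a_inv]
    return max(sum(row) for row in matmul(abs_inv, abs_a))


if __name__ == "__main__":
    for n in range(6):
        p, p_inv = pascal_lower(n), pascal_lower_inverse(n)
        print(n, skeel_condition_number(p, p_inv), 3 ** n)
```

Because all arithmetic is on Python integers, the comparison with $3^n$ is exact rather than up to rounding error.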
-
Positive polynomials on unbounded domains
Authors:
Javier Pena,
Juan C. Vera,
Luis F. Zuluaga
Abstract:
Certificates of non-negativity such as Putinar's Positivstellensatz have been used to obtain powerful numerical techniques to solve polynomial optimization (PO) problems. Putinar's certificate uses sum-of-squares (sos) polynomials to certify the non-negativity of a given polynomial over a domain defined by polynomial inequalities. This certificate assumes the Archimedean property of the associated quadratic module, which in particular implies compactness of the domain. In this paper we characterize the existence of a certificate of non-negativity for polynomials over a possibly unbounded domain, without the use of the associated quadratic module. Next, we show that the certificate can be used to obtain convergent linear matrix inequality (LMI) hierarchies for PO problems with unbounded feasible sets. Furthermore, by using copositive polynomials to certify non-negativity, instead of sos polynomials, the certificate allows the use of a very rich class of convergent LMI hierarchies to approximate the solution of general PO problems. Throughout the article we illustrate our results with various examples certifying the non-negativity of polynomials over possibly unbounded sets defined by polynomial equalities or inequalities.
Submitted 11 September, 2017;
originally announced September 2017.
-
Convergence of first-order methods via the convex conjugate
Authors:
Javier Pena
Abstract:
This paper gives a unified and succinct approach to the $O(1/\sqrt{k}), O(1/k),$ and $O(1/k^2)$ convergence rates of the subgradient, gradient, and accelerated gradient methods for unconstrained convex minimization. In the three cases the proof of convergence follows from a generic bound defined by the convex conjugate of the objective function.
Submitted 27 July, 2017;
originally announced July 2017.
-
Permutation tests in the two-sample problem for functional data
Authors:
Alejandra Cabaña,
Ana Maria Estrada,
Jairo I. Peña,
Adolfo J. Quiroz
Abstract:
Three different permutation test schemes are discussed and compared in the context of the two-sample problem for functional data. One of the procedures was essentially introduced by Lopez-Pintado and Romo (2009), using notions of functional data depth to adapt the ideas originally proposed by Liu and Singh (1993) for multivariate data. Of the new methods introduced here, one is also based on functional data depths, but uses a different way (inspired by Meta-Analysis) to assess the significance of the depth differences. The second new method presented here adapts, to the functional data setting, the k-nearest-neighbors statistic of Schilling (1986). The three methods are compared with one another and against the test of Horvath and Kokoszka (2012) on simulated examples and real data. The comparison considers the performance of the statistics in terms of statistical power and in terms of computational cost.
Submitted 21 October, 2016;
originally announced October 2016.
-
On the Grassmann condition number
Authors:
Javier Pena,
Vera Roshchina
Abstract:
We give new insight into the Grassmann condition of the conic feasibility problem \[ x \in L \cap K \setminus\{0\}. \] Here $K\subseteq V$ is a regular convex cone and $L\subseteq V$ is a linear subspace of the finite dimensional Euclidean vector space $V$. The Grassmann condition of this problem is the reciprocal of the distance from $L$ to the set of ill-posed instances in the Grassmann manifold where $L$ lives.
We consider a very general distance in the Grassmann manifold defined by two possibly different norms in $V$. We establish the equivalence between the Grassmann distance to ill-posedness of the above problem and a natural measure of the least violated trial solution to its alternative feasibility problem. We also show a tight relationship between the Grassmann and Renegar's condition measures, and between the Grassmann measure and a symmetry measure of the above feasibility problem.
Our approach can be readily specialized to a canonical norm in $V$ induced by $K$, a prime example being the one-norm for the non-negative orthant. For this special case we show that the Grassmann distance to ill-posedness is equivalent to a measure of the most interior solution to the above conic feasibility problem.
Submitted 26 April, 2016; v1 submitted 15 April, 2016;
originally announced April 2016.
-
Solving Conic Systems via Projection and Rescaling
Authors:
Javier Pena,
Negar Soheili
Abstract:
We propose a simple projection and rescaling algorithm to solve the feasibility problem \[ \text{ find } x \in L \cap Ω, \] where $L$ and $Ω$ are respectively a linear subspace and the interior of a symmetric cone in a finite-dimensional vector space $V$.
This projection and rescaling algorithm is inspired by previous work on rescaled versions of the perceptron algorithm and by Chubanov's projection-based method for linear feasibility problems. As in these predecessors, each main iteration of our algorithm contains two steps: a {\em basic procedure} and a {\em rescaling} step. When $L \cap Ω\ne \emptyset$, the projection and rescaling algorithm finds a point $x \in L \cap Ω$ in at most $O(\log(1/δ(L \cap Ω)))$ iterations, where $δ(L \cap Ω) \in (0,1]$ is a measure of the most interior point in $L \cap Ω$. The ideal value $δ(L\cap Ω) = 1$ is attained when $L \cap Ω$ contains the center of the symmetric cone $Ω$.
We describe several possible implementations for the basic procedure including a perceptron scheme and a smooth perceptron scheme. The perceptron scheme requires $O(r^4)$ perceptron updates and the smooth perceptron scheme requires $O(r^2)$ smooth perceptron updates, where $r$ stands for the Jordan algebra rank of $V$.
Submitted 15 December, 2016; v1 submitted 18 December, 2015;
originally announced December 2015.
-
Polytope conditioning and linear convergence of the Frank-Wolfe algorithm
Authors:
Javier Pena,
Daniel Rodriguez
Abstract:
It is known that the gradient descent algorithm converges linearly when applied to a strongly convex function with Lipschitz gradient. In this case the algorithm's rate of convergence is determined by the condition number of the function. In a similar vein, it has been shown that a variant of the Frank-Wolfe algorithm with away steps converges linearly when applied to a strongly convex function with Lipschitz gradient over a polytope. In a nice extension of the unconstrained case, the algorithm's rate of convergence is determined by the product of the condition number of the function and a certain condition number of the polytope.
We shed new light on the latter type of polytope conditioning. In particular, we show that previous and seemingly different approaches to define a suitable condition measure for the polytope are essentially equivalent to each other. Perhaps more interestingly, they can all be unified via a parameter of the polytope that formalizes a key premise linked to the algorithm's linear convergence. We also give new insight into the linear convergence property. For a convex quadratic objective, we show that the rate of convergence is determined by a condition number of a suitably scaled polytope.
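As a hedged illustration of the unconstrained statement in this abstract (a standard textbook bound, not a result taken from the paper): assume $f$ is $μ$-strongly convex with $L$-Lipschitz gradient, and that gradient descent uses the fixed step-size $1/L$. The symbols $μ$, $L$, $κ$, $x_k$ and $f^*$ are notation introduced here for the example only. Then
\[ f(x_{k+1}) - f^* \le \left(1 - \frac{1}{κ}\right)\left(f(x_k) - f^*\right), \qquad κ = \frac{L}{μ}, \]
so the suboptimality shrinks by a constant factor per iteration, and that factor depends only on the condition number $κ$ of the function. In the away-step setting described above, the analogous role is played by the product of $κ$ and a condition number of the polytope.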
Submitted 24 December, 2016; v1 submitted 18 December, 2015;
originally announced December 2015.
-
On the von Neumann and Frank-Wolfe Algorithms with Away Steps
Authors:
Javier Pena,
Daniel Rodriguez,
Negar Soheili
Abstract:
The von Neumann algorithm is a simple coordinate-descent algorithm to determine whether the origin belongs to a polytope generated by a finite set of points. When the origin is in the interior of the polytope, the algorithm generates a sequence of points in the polytope that converges linearly to zero. The algorithm's rate of convergence depends on the radius of the largest ball around the origin contained in the polytope.
We show that under the weaker condition that the origin is in the polytope, possibly on its boundary, a variant of the von Neumann algorithm that includes away steps generates a sequence of points in the polytope that converges linearly to zero. The new algorithm's rate of convergence depends on a certain geometric parameter of the polytope that extends the above radius but is always positive. Our linear convergence result and geometric insights also extend to a variant of the Frank-Wolfe algorithm with away steps for minimizing a strongly convex function over a polytope.
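The classic iteration that this abstract starts from can be sketched as follows. This is a minimal illustration of the textbook von Neumann algorithm only; it does not include the away steps studied in the paper. The function name `von_neumann`, the tolerance `tol`, the iteration cap `max_iter`, and the status strings are all assumptions made for this example.

```python
def von_neumann(points, tol=1e-6, max_iter=10_000):
    """Classic von Neumann iteration (no away steps), pure-Python sketch.

    points: list of vectors (lists or tuples of floats) generating the polytope.
    Returns (status, weights, y) with y = sum_i weights[i] * points[i].
    """
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    n = len(points)
    weights = [0.0] * n
    weights[0] = 1.0          # start at the first generator
    y = list(points[0])
    for _ in range(max_iter):
        if dot(y, y) <= tol * tol:
            return "origin_in_polytope", weights, y
        # Generator with the smallest inner product against the current point.
        j = min(range(n), key=lambda i: dot(points[i], y))
        if dot(points[j], y) > 0:
            # Every generator has positive inner product with y,
            # so y defines a hyperplane separating the polytope from the origin.
            return "origin_not_in_polytope", weights, y
        # Exact line search: point of the segment [y, points[j]] closest to 0.
        # Since dot(points[j], y) <= 0 and y != 0, lam lies in [0, 1].
        d = [yi - ai for yi, ai in zip(y, points[j])]
        lam = dot(y, d) / dot(d, d)
        weights = [(1 - lam) * w for w in weights]
        weights[j] += lam
        y = [(1 - lam) * yi + lam * ai for yi, ai in zip(y, points[j])]
    return "max_iter_reached", weights, y


square = [(1.0, 1.0), (1.0, -1.0), (-1.0, 1.0), (-1.0, -1.0)]
status, weights, y = von_neumann(square)
```

For the square above the origin is an interior point, and the returned `weights` describe a convex combination of the generators whose value `y` is (numerically) zero. The iterate stays in the polytope because each update is a convex combination of the current point and one generator.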
Submitted 25 November, 2015; v1 submitted 14 July, 2015;
originally announced July 2015.