Search | arXiv e-print repository

Adaptive Open-Loop Step-Sizes for Accelerated Convergence Rates of the Frank-Wolfe Algorithm

Authors: Elias Wirth, Javier Peña, Sebastian Pokutta

Abstract: Recent work has shown that in certain settings, the Frank-Wolfe algorithm (FW) with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for a fixed parameter $\ell \in \mathbb{N},\, \ell \geq 2$, attains a convergence rate faster than the traditional $O(t^{-1})$ rate. In particular, when a strong growth property holds, the convergence rate attainable with open-loop step-sizes… ▽ More Recent work has shown that in certain settings, the Frank-Wolfe algorithm (FW) with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for a fixed parameter $\ell \in \mathbb{N},\, \ell \geq 2$, attains a convergence rate faster than the traditional $O(t^{-1})$ rate. In particular, when a strong growth property holds, the convergence rate attainable with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ is $O(t^{-\ell})$. In this setting there is no single value of the parameter $\ell$ that prevails as superior. This paper shows that FW with log-adaptive open-loop step-sizes $η_t = \frac{2+\log(t+1)}{t+2+\log(t+1)}$ attains a convergence rate that is at least as fast as that attainable with fixed-parameter open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for any value of $\ell \in \mathbb{N},\,\ell\geq 2$. To establish our main convergence results, we extend our previous affine-invariant accelerated convergence results for FW to more general open-loop step-sizes of the form $η_t = g(t)/(t+g(t))$, where $g:\mathbb{N}\to\mathbb{R}_{\geq 0}$ is any non-decreasing function such that the sequence of step-sizes $(η_t)$ is non-increasing. This covers in particular the fixed-parameter case by choosing $g(t) = \ell$ and the log-adaptive case by choosing $g(t) = 2+ \log(t+1)$. To facilitate adoption of log-adaptive open-loop step-sizes, we have incorporated this rule into the {\tt FrankWolfe.jl} software package. △ Less

Submitted 14 May, 2025; originally announced May 2025.

arXiv:2505.02736 [pdf, ps, other]

Strong odd coloring in minor-closed classes

Authors: Miriam Goetze, Fabian Klute, Kolja Knauer, Irene Parada, Juan Pablo Peña, Torsten Ueckerdt

Abstract: We show that the strong odd chromatic number on any proper minor-closed graph class is bounded by a constant. We almost determine the smallest such constant for outerplanar graphs. We show that the strong odd chromatic number on any proper minor-closed graph class is bounded by a constant. We almost determine the smallest such constant for outerplanar graphs. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2502.20628 [pdf, ps, other]

Locally connected graphs: metric properties

Authors: Martín Matamala, Juan Pablo Peña, José Zamora

Abstract: In this work we show that any connected locally connected graph defines a metric space having at least as many lines as vertices with only three exception: the complete multipartite graphs $K_{1,2,2}$, $K_{2,2,2}$ and $K_{2,2,2,2}$. This proves that this class fulfills a conjecture, proposed by Chen and Chvátal, saying that any metric space on n points has at least n lines or a line containing all… ▽ More In this work we show that any connected locally connected graph defines a metric space having at least as many lines as vertices with only three exception: the complete multipartite graphs $K_{1,2,2}$, $K_{2,2,2}$ and $K_{2,2,2,2}$. This proves that this class fulfills a conjecture, proposed by Chen and Chvátal, saying that any metric space on n points has at least n lines or a line containing all the points. △ Less

Submitted 27 February, 2025; originally announced February 2025.

MSC Class: 52C10; 51F99

arXiv:2501.13016 [pdf, other]

doi 10.1016/j.cam.2023.115184

An evaluation algorithm for q-Bézier triangular patches formed by convex combinations

Authors: Jorge Delgado, Héctor Orera, Juan Manuel Peña

Abstract: An extension to triangular domains of the univariate q-Bernstein basis functions is introduced and analyzed. Some recurrence relations and properties such as partition of unity and degree elevation are proved for them. It is also proved that they form a basis for the space of polynomials of total degree less than or equal to n on a triangle. In addition, it is presented a de Casteljau type evaluat… ▽ More An extension to triangular domains of the univariate q-Bernstein basis functions is introduced and analyzed. Some recurrence relations and properties such as partition of unity and degree elevation are proved for them. It is also proved that they form a basis for the space of polynomials of total degree less than or equal to n on a triangle. In addition, it is presented a de Casteljau type evaluation algorithm whose steps are all linear convex combinations. △ Less

Submitted 22 January, 2025; originally announced January 2025.

Journal ref: J. Comput. Appl. Math. 428 (2023), Paper No. 115184, 11 pp

arXiv:2501.11987 [pdf, ps, other]

doi 10.1016/j.cam.2021.113443

Accurate Bidiagonal Decomposition and Computations with Generalized Pascal Matrices

Authors: Jorge Delgado, Héctor Orera, Juan Manuel Peña

Abstract: This paper provides an accurate method to obtain the bidiagonal factorization of many generalized Pascal matrices, which in turn can be used to compute with high relative accuracy the eigenvalues, singular values and inverses of these matrices. Numerical examples are included. This paper provides an accurate method to obtain the bidiagonal factorization of many generalized Pascal matrices, which in turn can be used to compute with high relative accuracy the eigenvalues, singular values and inverses of these matrices. Numerical examples are included. △ Less

Submitted 21 January, 2025; originally announced January 2025.

MSC Class: 65F05; 65F15; 65G50; 15A23; 05A05; 11B65

Journal ref: Comput. Appl. Math. 391 (2021), Paper No. 113443, 10 pp

arXiv:2501.11365 [pdf, ps, other]

doi 10.1016/j.aml.2021.107473

Optimal properties of tensor product of B-bases

Authors: Jorge Delgado, Héctor Orera, Juan Manuel Peña

Abstract: It is proved the optimal conditioning for the infinity norm of collocation matrices of the tensor product of normalized B-bases among the tensor product of all normalized totally positive bases of the corresponding space of functions. Bounds for the minimal eigenvalue and singular value and illustrative numerical examples are also included. It is proved the optimal conditioning for the infinity norm of collocation matrices of the tensor product of normalized B-bases among the tensor product of all normalized totally positive bases of the corresponding space of functions. Bounds for the minimal eigenvalue and singular value and illustrative numerical examples are also included. △ Less

Submitted 20 January, 2025; originally announced January 2025.

MSC Class: 65D17; 65F35; 15A123; 15A18

Journal ref: Appl. Math. Lett. 121 (2021), Paper No. 107473, 5 pp

arXiv:2501.10076 [pdf, ps, other]

doi 10.1007/s10915-019-00975-6

Accurate algorithms for Bessel matrices

Authors: Jorge Delgado, Héctor Orera, Juan Manuel Peña

Abstract: In this paper, we prove that any collocation matrix of Bessel polynomials at positive points is strictly totally positive, that is, all its minors are positive. Moreover, an accurate method to construct the bidiagonal factorization of these matrices is obtained and used to compute with high relative accuracy the eigenvalues, singular values and inverses. Similar results for the collocation matrice… ▽ More In this paper, we prove that any collocation matrix of Bessel polynomials at positive points is strictly totally positive, that is, all its minors are positive. Moreover, an accurate method to construct the bidiagonal factorization of these matrices is obtained and used to compute with high relative accuracy the eigenvalues, singular values and inverses. Similar results for the collocation matrices for the reverse Bessel polynomials are also obtained. Numerical examples illustrating the theoretical results are included. △ Less

Submitted 17 January, 2025; originally announced January 2025.

MSC Class: 65F05; 65F15; 65G50; 33C10; 33C45; 15A23

Journal ref: J. Sci. Comput. 80 (2019), no. 2, 1264-1278

arXiv:2501.09704 [pdf, ps, other]

doi 10.1016/j.amc.2019.04.027

Infinity norm bounds for the inverse of Nekrasov matrices using scaling matrices

Authors: Héctor Orera, Juan Manuel Peña

Abstract: For many applications, it is convenient to have good upper bounds for the norm of the inverse of a given matrix. In this paper, we obtain such bounds when A is a Nekrasov matrix, by means of a scaling matrix transforming A into a strictly diagonally dominant matrix. Numerical examples and comparisons with other bounds are included. The scaling matrices are also used to derive new error bounds for… ▽ More For many applications, it is convenient to have good upper bounds for the norm of the inverse of a given matrix. In this paper, we obtain such bounds when A is a Nekrasov matrix, by means of a scaling matrix transforming A into a strictly diagonally dominant matrix. Numerical examples and comparisons with other bounds are included. The scaling matrices are also used to derive new error bounds for the linear complementarity problems when the involved matrix is a Nekrasov matrix. These error bounds can improve considerably other previous bounds. △ Less

Submitted 16 January, 2025; originally announced January 2025.

MSC Class: 65F35; 15A60; 65F05; 90C33

Journal ref: Appl. Math. Comput. 358 (2019), 119-127

arXiv:2411.05765 [pdf, ps, other]

Uniform $h$-dichotomies: noncritical uniformity and expansivity

Authors: Heli Elorreaga, Juan Peña, Gonzalo Robledo

Abstract: The property of exponential dichotomy can be seen as a generalization of the hyperbolicity condition for non autonomous linear finite dimensional systems of ordinary differential equations. In 1978 W.A. Coppel proved that the exponential dichotomy on the half line is equivalent to the property of noncritical uniformity provided that a condition of bounded growth is verified. In 2006 K.J. Palmer ex… ▽ More The property of exponential dichotomy can be seen as a generalization of the hyperbolicity condition for non autonomous linear finite dimensional systems of ordinary differential equations. In 1978 W.A. Coppel proved that the exponential dichotomy on the half line is equivalent to the property of noncritical uniformity provided that a condition of bounded growth is verified. In 2006 K.J. Palmer extended this result by proving that -- also assuming the bounded growth property -- the exponential dichotomy on the half line, noncritical uniformity and the exponential expansiveness are equivalent. The main contribution of this article is to generalize these results for the property of uniform $h$-dichotomy. This has been carried out due to a recent idea: under suitable conditions any $h$-dichotomy can be associated to a totally ordered topological group, which becomes the additive group $(\mathbb{R},+)$ in case of the exponential dichotomy. The properties of this new group make possible such generalization. △ Less

Submitted 8 November, 2024; originally announced November 2024.

MSC Class: 34D09; 34A30; 34C11

arXiv:2410.21433 [pdf, ps, other]

Lines on digraphs of low diameter

Authors: Gabriela Araujo-Pardo, Martín Matamala, Juan P. Peña, José Zamora

Abstract: A set of n non-collinear points in the Euclidean plane defines at least n different lines. Chen and Chvtal in 2008 conjectured that the same results is true in metric spaces for an adequate definition of line. More recently, it was conjectured in 2018 by Aboulker et al. that any large enough bridgeless graph on n vertices defines a metric space that has at least n lines. We study the natural exten… ▽ More A set of n non-collinear points in the Euclidean plane defines at least n different lines. Chen and Chvtal in 2008 conjectured that the same results is true in metric spaces for an adequate definition of line. More recently, it was conjectured in 2018 by Aboulker et al. that any large enough bridgeless graph on n vertices defines a metric space that has at least n lines. We study the natural extension of Aboulker et al.'s conjecture into the context of quasi-metric spaces defined by digraphs of low diameter. We prove that it is valid for quasi-metric spaces defined by bipartite digraphs of diameter at most three, oriented graphs of diameter two and, digraphs of diameter three and directed girth four. △ Less

Submitted 28 October, 2024; originally announced October 2024.

MSC Class: 05c20; 05c12; 52c99

arXiv:2410.20189 [pdf, other]

A study on token digraphs

Authors: Cristina G. Fernandes, Carla N. Lintzmayer, Juan P. Peña, Giovanne Santos, Ana Trujillo-Negrete, Jose Zamora

Abstract: For a digraph $D$ of order $n$ and an integer $1 \leq k \leq n-1$, the $k$-token digraph of $D$ is the graph whose vertices are all $k$-subsets of vertices of $D$ and, given two such $k$-subsets $A$ and $B$, $(A,B)$ is an arc in the $k$-token digraph whenever $\{a\} = A \setminus B$, $\{b\} = B \setminus A$, and there is an arc $(a,b)$ in $D$. Token digraphs are a generalization of token graphs. I… ▽ More For a digraph $D$ of order $n$ and an integer $1 \leq k \leq n-1$, the $k$-token digraph of $D$ is the graph whose vertices are all $k$-subsets of vertices of $D$ and, given two such $k$-subsets $A$ and $B$, $(A,B)$ is an arc in the $k$-token digraph whenever $\{a\} = A \setminus B$, $\{b\} = B \setminus A$, and there is an arc $(a,b)$ in $D$. Token digraphs are a generalization of token graphs. In this paper, we study some properties of token digraphs, including strong and unilateral connectivity, kernels, girth, circumference and Eulerianity. We also extend some known results on the clique and chromatic numbers of $k$-token graphs, addressing the bidirected clique number and dichromatic number of $k$-token digraphs. Additionally, we prove that determining whether $2$-token digraphs have a kernel is NP-complete. △ Less

Submitted 26 October, 2024; originally announced October 2024.

Comments: 24 pages, 9 figures

arXiv:2406.18789 [pdf, other]

Fast convergence of Frank-Wolfe algorithms on polytopes

Authors: Elias Wirth, Javier Pena, Sebastian Pokutta

Abstract: We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended cu… ▽ More We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended curvature. These properties depend solely on the polytope and objective function but not on any affine-dependent object like norms. For each one of the above algorithms, we derive rates of convergence ranging from sublinear to linear depending on the degree of the error bound. △ Less

Submitted 20 May, 2025; v1 submitted 26 June, 2024; originally announced June 2024.

Comments: 29 pages, 6 figures

MSC Class: 90C25; 90C52

arXiv:2405.19208 [pdf, other]

Quasimetric spaces with few lines

Authors: Guillermo Gamboa Quintero, Martín Matamala, Juan Pablo Peña

Abstract: Chen and Chvátal conjectured in 2008 that in any finite metric space either there is a line containing all the points - a universal line -, or the number of lines is at least the number of points. This is a generalization of a classical result due to Erdős that says that a set of $n$ non-collinear points in the Euclidean plane defines at least $n$ different lines. A line of a metric space with m… ▽ More Chen and Chvátal conjectured in 2008 that in any finite metric space either there is a line containing all the points - a universal line -, or the number of lines is at least the number of points. This is a generalization of a classical result due to Erdős that says that a set of $n$ non-collinear points in the Euclidean plane defines at least $n$ different lines. A line of a metric space with metric $ρ$ is defined in terms of a notion called the betweenness of the space which is the set of all triples $(x,z,y)$ such that $ρ(x,y)=ρ(x,z)+ρ(z,y)$. In this work we prove that for each $n\geq 4$ there are $p_3(n)$ non isomorphic betweennesses arising from \emph{quasimetric} spaces with $n$ points, without universal lines and with exactly 3 lines, where $p_3(n)$ is the number of partitions of an integer $n$ into three parts. We also prove that for $n\geq 5$, there are $2p_3(n-1)$ non isomorphic betweennesses arising from quasimetric spaces on $n$ points, without universal lines and with exactly 4 lines. Here two betweennesses are isomorphic if they are isomorphic as relational structures. None of the betweennesses mentioned above is metric which implies that Chen and Chvátal's conjecture is valid for metric spaces with at most five points. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 19 pages, 4 figures

arXiv:2402.10933 [pdf, ps, other]

doi 10.1016/j.cam.2018.09.029

Combined matrices of almost strictly sign regular matrices

Authors: Pedro Alonso, Juan Manuel Peña, María Luisa Serrano

Abstract: The combined matrix is a very useful concept for many applications. Almost strictly sign regular (ASSR) matrices form an important structured class of matrices with two possible zero patterns, which are either type-I staircase or type-II staircase. We prove that, under an irreducibility condition, the pattern of zero and nonzero entries of an ASSR matrix is preserved by the corresponding combined… ▽ More The combined matrix is a very useful concept for many applications. Almost strictly sign regular (ASSR) matrices form an important structured class of matrices with two possible zero patterns, which are either type-I staircase or type-II staircase. We prove that, under an irreducibility condition, the pattern of zero and nonzero entries of an ASSR matrix is preserved by the corresponding combined matrix. Without the irreducibility condition, it is proved that type-I and type-II staircases are still preserved. Illustrative numerical examples are included. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: 8 pages

MSC Class: 65F05; 65F15; 65F35

arXiv:2402.10225 [pdf, ps, other]

doi 10.1016/j.cam.2020.113121

Almost strictly sign regular rectangular matrices

Authors: P. Alonso, J. M. Peña, M. L. Serrano

Abstract: Almost strictly sign regular matrices are sign regular matrices with a special zero pattern and whose nontrivial minors are nonzero. In this paper we provide several properties of almost strictly sign regular rectangular matrices and analyze their QR factorization. Almost strictly sign regular matrices are sign regular matrices with a special zero pattern and whose nontrivial minors are nonzero. In this paper we provide several properties of almost strictly sign regular rectangular matrices and analyze their QR factorization. △ Less

Submitted 2 February, 2024; originally announced February 2024.

Comments: 10 pages

MSC Class: 65F05; 65F15; 65F35

Journal ref: Journal of Computational and Applied Mathematics 404 (2022) 113121

arXiv:2312.09858 [pdf, ps, other]

Duality of Hoffman constants

Authors: Javier F. Pena, Juan C. Vera, Luis F. Zuluaga

Abstract: Suppose $A\in \mathbb{R}^{m\times n}$, and $R\subseteq \mathbb{R}^n$ and $S\subseteq \mathbb{R}^m$ are {\em reference} polyhedral cones with dual cones $R^*\subseteq \mathbb{R}^n, \; S^*\subseteq \mathbb{R}^m$. We show that a suitable Slater condition implies a {\em duality inequality} between the Hoffman constants of the feasibility problems… ▽ More Suppose $A\in \mathbb{R}^{m\times n}$, and $R\subseteq \mathbb{R}^n$ and $S\subseteq \mathbb{R}^m$ are {\em reference} polyhedral cones with dual cones $R^*\subseteq \mathbb{R}^n, \; S^*\subseteq \mathbb{R}^m$. We show that a suitable Slater condition implies a {\em duality inequality} between the Hoffman constants of the feasibility problems $$ \begin{array}{r} Ax-b \in S\\ x \in R \end{array} \qquad\text{ and }\qquad \begin{array}{r} c-A\transp y \in R^*\\ y \in S^*. \end{array} $$ As an interesting application, we show a striking identity between the Hoffman constants of {\em box-constrained} feasibility problems with a similar primal-dual format, but where one of the reference sets is a box and the other is a linear subspace. We also establish a surprising identity between Hoffman constants of box-constrained feasibility problems and the chi condition measures for weighted least-squares problems. △ Less

Submitted 16 May, 2025; v1 submitted 15 December, 2023; originally announced December 2023.

Comments: 21 pages

MSC Class: 90C05; 90C25; 90C57

arXiv:2310.04096 [pdf, other]

doi 10.1007/s10107-024-02180-2

Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

Authors: Elias Wirth, Javier Pena, Sebastian Pokutta

Abstract: Recent papers have shown that the Frank-Wolfe algorithm (FW) with open-loop step-sizes exhibits rates of convergence faster than the iconic $\mathcal{O}(t^{-1})$ rate. In particular, when the minimizer of a strongly convex function over a polytope lies in the relative interior of a feasible region face, the FW with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for… ▽ More Recent papers have shown that the Frank-Wolfe algorithm (FW) with open-loop step-sizes exhibits rates of convergence faster than the iconic $\mathcal{O}(t^{-1})$ rate. In particular, when the minimizer of a strongly convex function over a polytope lies in the relative interior of a feasible region face, the FW with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for $\ell \in \mathbb{N}_{\geq 2}$ has accelerated convergence $\mathcal{O}(t^{-2})$ in contrast to the rate $Ω(t^{-1-ε})$ attainable with more complex line-search or short-step step-sizes. Given the relevance of this scenario in data science problems, research has grown to explore the settings enabling acceleration in open-loop FW. However, despite FW's well-known affine invariance, existing acceleration results for open-loop FW are affine-dependent. This paper remedies this gap in the literature by merging two recent research trajectories: affine invariance (Wirth et al., 2023b) and open-loop step-sizes (Pena, 2021). In particular, we extend all known non-affine-invariant convergence rates for FW with open-loop step-sizes to affine-invariant results. △ Less

Submitted 20 January, 2025; v1 submitted 6 October, 2023; originally announced October 2023.

arXiv:2309.11942 [pdf, other]

On the Probability of Immunity

Authors: Jose M. Peña

Abstract: This work is devoted to the study of the probability of immunity, i.e. the effect occurs whether exposed or not. We derive necessary and sufficient conditions for non-immunity and $ε$-bounded immunity, i.e. the probability of immunity is zero and $ε$-bounded, respectively. The former allows us to estimate the probability of benefit (i.e., the effect occurs if and only if exposed) from a randomized… ▽ More This work is devoted to the study of the probability of immunity, i.e. the effect occurs whether exposed or not. We derive necessary and sufficient conditions for non-immunity and $ε$-bounded immunity, i.e. the probability of immunity is zero and $ε$-bounded, respectively. The former allows us to estimate the probability of benefit (i.e., the effect occurs if and only if exposed) from a randomized controlled trial, and the latter allows us to produce bounds of the probability of benefit that are tighter than the existing ones. We also introduce the concept of indirect immunity (i.e., through a mediator) and repeat our previous analysis for it. Finally, we propose a method for sensitivity analysis of the probability of immunity under unmeasured confounding. △ Less

Submitted 11 October, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2308.00170 [pdf, other]

Boundedness for proper conflict-free and odd colorings

Authors: Andrea Jiménez, Kolja Knauer, Carla Negri Lintzmayer, Martín Matamala, Juan Pablo Peña, Daniel A. Quiroz, Maycon Sambinelli, Yoshiko Wakabayashi, Weiqiang Yu, José Zamora

Abstract: The proper conflict-free chromatic number, $χ_{pcf}(G)$, of a graph $G$ is the least $k$ such that $G$ has a proper $k$-coloring in which for each non-isolated vertex there is a color appearing exactly once among its neighbors. The proper odd chromatic number, $χ_{o}(G)$, of $G$ is the least $k$ such that $G$ has a proper coloring in which for every non-isolated vertex there is a color appearing a… ▽ More The proper conflict-free chromatic number, $χ_{pcf}(G)$, of a graph $G$ is the least $k$ such that $G$ has a proper $k$-coloring in which for each non-isolated vertex there is a color appearing exactly once among its neighbors. The proper odd chromatic number, $χ_{o}(G)$, of $G$ is the least $k$ such that $G$ has a proper coloring in which for every non-isolated vertex there is a color appearing an odd number of times among its neighbors. We say that a graph class $\mathcal{G}$ is $χ_{pcf}$-bounded ($χ_{o}$-bounded) if there is a function $f$ such that $χ_{pcf}(G) \leq f(χ(G))$ ($χ_{o}(G) \leq f(χ(G))$) for every $G \in \mathcal{G}$. Caro et al. (2022) asked for classes that are linearly $χ_{pcf}$-bounded ($χ_{pcf}$-bounded), and as a starting point, they showed that every claw-free graph $G$ satisfies $χ_{pcf}(G) \le 2Δ(G)+1$, which implies $χ_{pcf}(G) \le 4χ(G)+1$. In this paper, we improve the bound for claw-free graphs to a nearly tight bound by showing that such a graph $G$ satisfies $χ_{pcf}(G) \le Δ(G)+6$, and even $χ_{pcf}(G) \le Δ(G)+4$ if it is a quasi-line graph. These results also give evidence for a conjecture by Caro et al. Moreover, we show that convex-round graphs and permutation graphs are linearly $χ_{pcf}$-bounded. For these last two results, we prove a lemma that reduces the problem of deciding if a hereditary class is linearly $χ_{pcf}$-bounded to deciding if the bipartite graphs in the class are $χ_{pcf}$-bounded by an absolute constant. This lemma complements a theorem of Liu (2022) and motivates us to study boundedness in bipartite graphs. In particular, we show that biconvex bipartite graphs are $χ_{pcf}$-bounded while convex bipartite graphs are not even $χ_o$-bounded, and exhibit a class of bipartite circle graphs that is linearly $χ_o$-bounded but not $χ_{pcf}$-bounded. △ Less

Submitted 9 February, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

Comments: 24 pages, 1 figure. Slight changes in introduction. References added

MSC Class: 05C15; 05C62

arXiv:2302.02193 [pdf, ps, other]

An easily computable upper bound on the Hoffman constant for homogeneous inequality systems

Authors: Javier Peña

Abstract: Let $A\in \mathbb{R}^{m\times n}\setminus \{0\}$ and $P:=\{x:Ax\le 0\}$. This paper provides a procedure to compute an upper bound on the following homogeneous Hoffman constant: \[ H_0(A) := \sup_{u\in \mathbb{R}^n \setminus P} \frac{\text{dist}(u,P)}{\text{dist}(Au, \mathbb{R}^m_-)}. \] In sharp contrast to the intractability of computing more general Hoffman constants, the procedure described in… ▽ More Let $A\in \mathbb{R}^{m\times n}\setminus \{0\}$ and $P:=\{x:Ax\le 0\}$. This paper provides a procedure to compute an upper bound on the following homogeneous Hoffman constant: \[ H_0(A) := \sup_{u\in \mathbb{R}^n \setminus P} \frac{\text{dist}(u,P)}{\text{dist}(Au, \mathbb{R}^m_-)}. \] In sharp contrast to the intractability of computing more general Hoffman constants, the procedure described in this paper is entirely tractable and easily implementable. △ Less

Submitted 14 July, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

Comments: 13 pages

arXiv:2211.16348 [pdf, other]

A validation study of normoglycemia and dysglycemia indices as a diabetes risk model

Authors: Paola Vargas, Miguel Angel Moreles, Joaquin Peña, Adriana Monroy

Abstract: In this work, we test the performance of Peak glucose concentration ($A$) and average of glucose removal rates ($α$), as normoglycemia and dysglycemia indices on a population monitored at the Mexico General Hospital between the years 2017 - 2019. A total of 1911 volunteer patients at the Mexico General Hospital are considered. 1282 female patients age ranging from 17 to 80 years old, and 629 male… ▽ More In this work, we test the performance of Peak glucose concentration ($A$) and average of glucose removal rates ($α$), as normoglycemia and dysglycemia indices on a population monitored at the Mexico General Hospital between the years 2017 - 2019. A total of 1911 volunteer patients at the Mexico General Hospital are considered. 1282 female patients age ranging from 17 to 80 years old, and 629 male patients age ranging from 18 to 79 years old. For each volunteer, OGTT data is gathered and indices are estimated in Ackerman's model. A binary separation of normoglycemic and disglycemic patients using a Support Vector Machine with a linear kernel is carried out. Classification indices are successful for 83\%. Population clusters on diabetic conditions and progression from Normoglycemic to T2DM may be concluded. The classification indices, $A$ and $α$ may be regarded as patient's indices and used to detect diabetes risk. Also, criteria for the applicability of glucose-insulin regulation models are introduced. The performance of Ackerman's model is shown. △ Less

Submitted 29 November, 2022; originally announced November 2022.

MSC Class: 92B15; 62P10; 92B05

arXiv:2210.10148 [pdf, other]

Bidiagonal Decompositions of Vandermonde-Type Matrices of Arbitrary Rank

Authors: Jorge Delgado, Plamen Koev, Ana Marco, Jose-Javier Martinez, Juan Manuel Pena, Per-Olof Persson, Steven Spasov

Abstract: We present a method to derive new explicit expressions for bidiagonal decompositions of Vandermonde and related matrices such as the (q-, h-) Bernstein-Vandermonde ones, among others. These results generalize the existing expressions for nonsingular matrices to matrices of arbitrary rank. For totally nonnegative matrices of the above classes, the new decompositions can be computed efficiently and… ▽ More We present a method to derive new explicit expressions for bidiagonal decompositions of Vandermonde and related matrices such as the (q-, h-) Bernstein-Vandermonde ones, among others. These results generalize the existing expressions for nonsingular matrices to matrices of arbitrary rank. For totally nonnegative matrices of the above classes, the new decompositions can be computed efficiently and to high relative accuracy componentwise in floating point arithmetic. In turn, matrix computations (e.g., eigenvalue computation) can also be performed efficiently and to high relative accuracy. △ Less

Submitted 18 October, 2022; originally announced October 2022.

MSC Class: 65F15; 15A23; 15B48; 15B35

arXiv:2112.06727 [pdf, ps, other]

Affine invariant convergence rates of the conditional gradient method

Authors: Javier Pena

Abstract: We show that the conditional gradient method for the convex composite problem \[\min_x\{f(x) + Ψ(x)\}\] generates primal and dual iterates with a duality gap converging to zero provided a suitable {\em growth property} holds and the algorithm makes a judicious choice of stepsizes. The rate of convergence of the duality gap to zero ranges from sublinear to linear depending on the degree of the grow… ▽ More We show that the conditional gradient method for the convex composite problem \[\min_x\{f(x) + Ψ(x)\}\] generates primal and dual iterates with a duality gap converging to zero provided a suitable {\em growth property} holds and the algorithm makes a judicious choice of stepsizes. The rate of convergence of the duality gap to zero ranges from sublinear to linear depending on the degree of the growth property. The growth property and convergence results depend on the pair $(f,Ψ)$ in an affine invariant and norm-independent fashion. △ Less

Submitted 26 May, 2023; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: 25 pages. To Appear in SIAM Journal on Optimization

MSC Class: 90C25; 90C52; 90C46

arXiv:2111.06071 [pdf, ps, other]

Linear convergence of the Douglas-Rachford algorithm via a generic error bound condition

Authors: Javier Peña, Juan C. Vera, Luis F. Zuluaga

Abstract: We provide new insight into the convergence properties of the Douglas-Rachford algorithm for the problem $\min_x \{f(x)+g(x)\}$, where $f$ and $g$ are convex functions. Our approach relies on and highlights the natural primal-dual symmetry between the above problem and its Fenchel dual $\min_{u} \{ f^*(u) + g_*(u)\}$ where $g_*(u):=g^*(-u)$. Our main development is to show the linear convergence o… ▽ More We provide new insight into the convergence properties of the Douglas-Rachford algorithm for the problem $\min_x \{f(x)+g(x)\}$, where $f$ and $g$ are convex functions. Our approach relies on and highlights the natural primal-dual symmetry between the above problem and its Fenchel dual $\min_{u} \{ f^*(u) + g_*(u)\}$ where $g_*(u):=g^*(-u)$. Our main development is to show the linear convergence of the algorithm when a natural error bound condition on the Douglas-Rachford operator holds. We leverage our error bound condition approach to show and estimate the algorithm's linear rate of convergence for three special classes of problems. The first one is when $f$ or$g$ and $f^*$ or $g_*$ are strongly convex relative to the primal and dual optimal sets respectively. The second one is when~$f$ and~$g$ are piecewise linear-quadratic functions. The third one is when~$f$ and~$g$ are the indicator functions of closed convex cones. In all three cases the rate of convergence is determined by a suitable measure of well-posedness of the problem. In the conic case, if the two closed convex cones are a linear subspace $L$ and $\mathbb{R}^n_+$, we establish the following stronger {\em finite termination} result: the Douglas-Rachford algorithm identifies the {\em maximum support sets} for $L\cap \mathbb{R}^n_+$ and $L^{\perp}\cap\mathbb{R}^n_+$ in finitely many steps. Our developments have straightforward extensions to the more general linearly constrained problem $\min_{x,y} \{f(x) + g( y):Ax + By = b\}$ thereby highlighting a direct and straightforward relationship between the Douglas-Rachford algorithm and the alternating direction method of multipliers (ADMM). △ Less

Submitted 11 November, 2021; originally announced November 2021.

MSC Class: 90C25 ACM Class: G.0

arXiv:2012.11281 [pdf, ps, other]

Towards Conditional Path Analysis

Authors: Jose M. Peña

Abstract: We extend path analysis by giving sufficient conditions for computing the partial covariance of two random variables from their covariance. This is specifically done by correcting the covariance with the product of some partial variance ratios. As a result, the partial covariance retains the covariance's salient feature of factorizing over the edges in the paths between the two variables of intere… ▽ More We extend path analysis by giving sufficient conditions for computing the partial covariance of two random variables from their covariance. This is specifically done by correcting the covariance with the product of some partial variance ratios. As a result, the partial covariance retains the covariance's salient feature of factorizing over the edges in the paths between the two variables of interest. △ Less

Submitted 21 December, 2020; originally announced December 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:2002.05226

arXiv:2003.08911 [pdf, ps, other]

Projection and rescaling algorithm for finding maximum support solutions to polyhedral conic systems

Authors: Javier Pena, Negar Soheili

Abstract: We propose a simple projection and rescaling algorithm that finds maximum support solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ is a linear subspace of $\mathbb{R}^n$ and $L^\perp$ is its orthogonal complement. The algorithm complements a basic procedure… ▽ More We propose a simple projection and rescaling algorithm that finds maximum support solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ is a linear subspace of $\mathbb{R}^n$ and $L^\perp$ is its orthogonal complement. The algorithm complements a basic procedure that involves only projections onto $L$ and $L^\perp$ with a periodic rescaling step. The number of rescaling steps and thus overall computational work performed by the algorithm are bounded above in terms of a condition measure of the above pair of problems. Our algorithm is a natural but significant extension of a previous projection and rescaling algorithm that finds a solution to the problem \[ \text{find} \; x\in L\cap\mathbb{R}^n_{++} \] when this problem is feasible. As a byproduct of our new developments, we obtain a sharper analysis of the projection and rescaling algorithm in the latter special case. △ Less

Submitted 3 December, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

Comments: 18 pages

arXiv:1909.13614 [pdf, ps, other]

Computing odd periods of alternating systems of affine circle maps

Authors: J. S. Cánovas Peña, A. Linero Bas, G. Soler López

Abstract: Let $f,g$ be affine circle maps and let $[f,g]$ be the alternating system generated by $f$ and $g$. We present an algorithm to compute the periodic structure of $[f,g]$. Let $f,g$ be affine circle maps and let $[f,g]$ be the alternating system generated by $f$ and $g$. We present an algorithm to compute the periodic structure of $[f,g]$. △ Less

Submitted 27 September, 2019; originally announced September 2019.

MSC Class: 37E10; 37E05; 39A11

arXiv:1907.06191 [pdf, other]

A GPU implementation of the Discontinuous Galerkin method for simulation of diffusion in brain tissue

Authors: Daniel Cervantes, Miguel angel Moreles, Joaquin Peña, Alonso Ramirez-Manzanares

Abstract: In this work we develop a methodology to approximate the covariance matrix associated to the simulation of water diffusion inside the brain tissue. The computation is based on an implementation of the Discontinuous Galerkin method of the diffusion equation, in accord with the physical phenomenon. The implementation in in parallel using GPUs in the CUDA language. Numerical results are presented in… ▽ More In this work we develop a methodology to approximate the covariance matrix associated to the simulation of water diffusion inside the brain tissue. The computation is based on an implementation of the Discontinuous Galerkin method of the diffusion equation, in accord with the physical phenomenon. The implementation in in parallel using GPUs in the CUDA language. Numerical results are presented in 2D problems. △ Less

Submitted 14 July, 2019; originally announced July 2019.

arXiv:1905.06366 [pdf, ps, other]

Equivalence and invariance of the chi and Hoffman constants of a matrix

Authors: Javier F. Pena, Juan C. Vera, Luis F. Zuluaga

Abstract: We show that the following two condition measures of a full column rank matrix $A \in \mathbb{R}^{m\times n}$ are identical: the chi constant and a signed Hoffman constant. This identity is naturally suggested by the evident invariance of the chi constant under sign changes of the rows of $A$. We also show that similar equivalence and invariance properties extend to variants of the chi and Hoffman… ▽ More We show that the following two condition measures of a full column rank matrix $A \in \mathbb{R}^{m\times n}$ are identical: the chi constant and a signed Hoffman constant. This identity is naturally suggested by the evident invariance of the chi constant under sign changes of the rows of $A$. We also show that similar equivalence and invariance properties extend to variants of the chi and Hoffman constants that depend only on the linear subspace $A(\mathbb{R}^n):=\{Ax: x\in\mathbb{R}^n\} \subseteq \mathbb{R}^m$. Finally, we show similar identities between the chi constants and signed versions of Renegar's and Grassmannian condition measures. △ Less

Submitted 18 May, 2020; v1 submitted 15 May, 2019; originally announced May 2019.

Comments: 14 pages

MSC Class: 65K10; 65F22; 90C25; 90C57

arXiv:1905.02894 [pdf, ps, other]

New characterizations of Hoffman constants for systems of linear constraints

Authors: Javier Pena, Juan Vera, Luis Zuluaga

Abstract: We give a characterization of the Hoffman constant of a system of linear constraints in $\R^n$ {\em relative} to a {\em reference polyhedron} $R\subseteq\R^n$. The reference polyhedron $R$ represents constraints that are easy to satisfy such as box constraints. In the special case $R = \R^n$, we obtain a novel characterization of the classical Hoffman constant. More precisely, suppose… ▽ More We give a characterization of the Hoffman constant of a system of linear constraints in $\R^n$ {\em relative} to a {\em reference polyhedron} $R\subseteq\R^n$. The reference polyhedron $R$ represents constraints that are easy to satisfy such as box constraints. In the special case $R = \R^n$, we obtain a novel characterization of the classical Hoffman constant. More precisely, suppose $R\subseteq \mathbb{R}^n$ is a reference polyhedron, $A\in \R^{m\times n},$ and $A(R):=\{Ax: x\in R\}$. We characterize the sharpest constant $H(A|R)$ such that for all $b \in A(R) + \R^m_+$ and $u\in R$ \[ \dist(u, P_{A}(b)\cap R) \le H(A|R) \cdot \|(Au-b)_+\|, \] where $P_A(b) = \{x\in \R^n:Ax\le b\}$. Our characterization is stated in terms of the largest of a canonical collection of easily computable Hoffman constants. Our characterization in turn suggests new algorithmic procedures to compute Hoffman constants. △ Less

Submitted 23 January, 2020; v1 submitted 7 May, 2019; originally announced May 2019.

Comments: 30 pages. To Appear in Mathematical Programming. arXiv admin note: text overlap with arXiv:1804.08418

arXiv:1903.00459 [pdf, ps, other]

Generalized conditional subgradient and generalized mirror descent: duality, convergence, and symmetry

Authors: Javier Pena

Abstract: We provide new insight into a {\em generalized conditional subgradient} algorithm and a {\em generalized mirror descent} algorithm for the convex minimization problem \[ \min_x \; \{f(Ax) + h(x)\}.\] As Bach showed in [{\em SIAM J. Optim.}, 25 (2015), pp. 115--129], applying either of these two algorithms to this problem is equivalent to applying the other one to its Fenchel dual. We leverage this… ▽ More We provide new insight into a {\em generalized conditional subgradient} algorithm and a {\em generalized mirror descent} algorithm for the convex minimization problem \[ \min_x \; \{f(Ax) + h(x)\}.\] As Bach showed in [{\em SIAM J. Optim.}, 25 (2015), pp. 115--129], applying either of these two algorithms to this problem is equivalent to applying the other one to its Fenchel dual. We leverage this duality relationship to develop new upper bounds and convergence results for the gap between the primal and dual iterates generated by these two algorithms. We also propose a new {\em primal-dual hybrid} algorithm that combines features of the conditional subgradient and mirror descent algorithms to solve the primal and dual problems in a symmetric fashion. Our algorithms and main results rely only on the availability of computable oracles for $\partial f$ and $\partial h^*$, and for $A$ and $A^*$. △ Less

Submitted 3 June, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

Comments: 21 pages

MSC Class: 90C25; 90C46

arXiv:1901.09768 [pdf, ps, other]

Extremal and optimal properties of B-bases Collocation Matrices

Authors: Jorge Delgado, J. M. Peña

Abstract: Totally positive matrices are related with the shape preserving representations of a space of functions. The normalized B-basis of the space has optimal shape preserving properties. B-splines and rational Bernstein bases are examples of normalized B-bases. Some results on the optimal conditioning and on extremal properties of the minimal eigenvalue and singular value of the collocation matrices of… ▽ More Totally positive matrices are related with the shape preserving representations of a space of functions. The normalized B-basis of the space has optimal shape preserving properties. B-splines and rational Bernstein bases are examples of normalized B-bases. Some results on the optimal conditioning and on extremal properties of the minimal eigenvalue and singular value of the collocation matrices of normalized B-bases are proved. Numerical examples confirm the theoretical results and answer related questions. △ Less

Submitted 28 January, 2019; originally announced January 2019.

Comments: 11 pages

MSC Class: 65F35; 65F15; 15B48; 15A12; 15A18; 65D17

arXiv:1901.08359 [pdf, other]

The condition number of a function relative to a set

Authors: David H. Gutman, Javier F. Pena

Abstract: The condition number of a differentiable convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is the square of the aspect ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bounds the line… ▽ More The condition number of a differentiable convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is the square of the aspect ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bounds the linear rate of convergence of the gradient descent algorithm for unconstrained convex minimization. We propose a condition number of a differentiable convex function relative to a reference convex set and distance function pair. This relative condition number is defined as the ratio of a relative smoothness to a relative strong convexity constants. We show that the relative condition number extends the main properties of the traditional condition number both in terms of its geometric insight and in terms of its role in characterizing the linear convergence of first-order methods for constrained convex minimization. When the reference set $X$ is a convex cone or a polyhedron and the function $f$ is of the form $f = g\circ A$, we provide characterizations of and bounds on the condition number of $f$ relative to $X$ in terms of the usual condition number of $g$ and a suitable condition number of the pair $(A,X)$. △ Less

Submitted 18 April, 2020; v1 submitted 24 January, 2019; originally announced January 2019.

Comments: 40 pages, 4 figures. To Appear in Mathematical Programming

arXiv:1812.10198 [pdf, ps, other]

Perturbed Fenchel duality and first-order methods

Authors: David H. Gutman, Javier F. Peña

Abstract: We show that the iterates generated by a generic first-order meta-algorithm satisfy a canonical perturbed Fenchel duality inequality. The latter in turn readily yields a unified derivation of the best known convergence rates for various popular first-order algorithms including the conditional gradient method as well as the main kinds of Bregman proximal methods: subgradient, gradient, fast gradien… ▽ More We show that the iterates generated by a generic first-order meta-algorithm satisfy a canonical perturbed Fenchel duality inequality. The latter in turn readily yields a unified derivation of the best known convergence rates for various popular first-order algorithms including the conditional gradient method as well as the main kinds of Bregman proximal methods: subgradient, gradient, fast gradient, and universal gradient methods. △ Less

Submitted 3 December, 2021; v1 submitted 25 December, 2018; originally announced December 2018.

Comments: 26 pages

arXiv:1809.03285 [pdf, other]

SVD update methods for large matrices and applications

Authors: Juan Manuel Peña, Tomas Sauer

Abstract: We consider the problem of updating the SVD when augmenting a "tall thin" matrix, i.e., a rectangular matrix $A \in \RR^{m \times n}$ with $m \gg n$. Supposing that an SVD of $A$ is already known, and given a matrix $B \in \RR^{m \times n'}$, we derive an efficient method to compute and efficiently store the SVD of the augmented matrix $[ A B ] \in \RR^{m \times (n+n')}$. This is an important tool… ▽ More We consider the problem of updating the SVD when augmenting a "tall thin" matrix, i.e., a rectangular matrix $A \in \RR^{m \times n}$ with $m \gg n$. Supposing that an SVD of $A$ is already known, and given a matrix $B \in \RR^{m \times n'}$, we derive an efficient method to compute and efficiently store the SVD of the augmented matrix $[ A B ] \in \RR^{m \times (n+n')}$. This is an important tool for two types of applications: in the context of principal component analysis, the dominant left singular vectors provided by this decomposition form an orthonormal basis for the best linear subspace of a given dimension, while from the right singular vectors one can extract an orthonormal basis of the kernel of the matrix. We also describe two concrete applications of these concepts which motivated the development of our method and to which it is very well adapted. △ Less

Submitted 10 September, 2018; originally announced September 2018.

MSC Class: 65F30

arXiv:1805.09494 [pdf, other]

A data-independent distance to infeasibility for linear conic systems

Authors: Javier Pena, Vera Roshchina

Abstract: We offer a unified treatment of distinct measures of well-posedness for homogeneous conic systems. To that end, we introduce a distance to infeasibility based entirely on geometric considerations of the elements defining the conic system. Our approach sheds new light on and connects several well-known condition measures for conic systems, including {\em Renegar's} distance to infeasibility, the {\… ▽ More We offer a unified treatment of distinct measures of well-posedness for homogeneous conic systems. To that end, we introduce a distance to infeasibility based entirely on geometric considerations of the elements defining the conic system. Our approach sheds new light on and connects several well-known condition measures for conic systems, including {\em Renegar's} distance to infeasibility, the {\em Grassmannian} condition measure, a measure of the {\em most interior} solution, and other geometric measures of {\em symmetry} and of {\em depth} of the conic system. △ Less

Submitted 23 January, 2020; v1 submitted 23 May, 2018; originally announced May 2018.

Comments: 25 pages, 2 figures. arXiv admin note: text overlap with arXiv:1604.04637

MSC Class: 65K10; 65F22; 90C25

arXiv:1804.08418 [pdf, other]

An algorithm to compute the Hoffman constant of a system of linear constraints

Authors: Javier Pena, Juan Vera, Luis Zuluaga

Abstract: We propose a combinatorial algorithm to compute the Hoffman constant of a system of linear equations and inequalities. The algorithm is based on a characterization of the Hoffman constant as the largest of a finite canonical collection of easy-to-compute Hoffman constants. Our algorithm and characterization extend to the more general context where some of the constraints are easy to satisfy as in… ▽ More We propose a combinatorial algorithm to compute the Hoffman constant of a system of linear equations and inequalities. The algorithm is based on a characterization of the Hoffman constant as the largest of a finite canonical collection of easy-to-compute Hoffman constants. Our algorithm and characterization extend to the more general context where some of the constraints are easy to satisfy as in the case of box constraints. We highlight some natural connections between our characterizations of the Hoffman constant and Renegar's distance to ill-posedness for systems of linear constraints. △ Less

Submitted 23 April, 2018; originally announced April 2018.

Comments: 28 pages, 1 figure

arXiv:1803.07107 [pdf, other]

Computational performance of a projection and rescaling algorithm

Authors: Javier Pena, Negar Soheili

Abstract: This paper documents a computational implementation of a {\em projection and rescaling algorithm} for finding most interior solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ denotes a linear subspace in $\mathbb{R}^n$ and $L^\perp$ denotes its orthogonal com… ▽ More This paper documents a computational implementation of a {\em projection and rescaling algorithm} for finding most interior solutions to the pair of feasibility problems \[ \text{find} \; x\in L\cap\mathbb{R}^n_{+} \;\;\;\; \text{ and } \; \;\;\;\; \text{find} \; \hat x\in L^\perp\cap\mathbb{R}^n_{+}, \] where $L$ denotes a linear subspace in $\mathbb{R}^n$ and $L^\perp$ denotes its orthogonal complement. The projection and rescaling algorithm is a recently developed method that combines a {\em basic procedure} involving only low-cost operations with a periodic {\em rescaling step.} We give a full description of a MATLAB implementation of this algorithm and present multiple sets of numerical experiments on synthetic problem instances with varied levels of conditioning. Our computational experiments provide promising evidence of the effectiveness of the projection and rescaling algorithm. Our MATLAB code is publicly available. Furthermore, the simplicity of the algorithm makes a computational implementation in other environments completely straightforward. △ Less

Submitted 3 June, 2019; v1 submitted 19 March, 2018; originally announced March 2018.

Comments: 19 pages

arXiv:1802.00558 [pdf, other]

Biot's parameters estimation in ultrasound propagation through cancellous bone

Authors: Miguel Angel Moreles, Jose Angel Neria, Joaquin Peña

Abstract: Of interest is the characterization of a cancellous bone immersed in an acoustic fluid. The bone is placed between an ultrasonic point source and a receiver. Cancellous bone is regarded as a porous medium saturated with fluid according to Biot's theory. This model is coupled with the fluid in an open pore configuration and solved by means of the Finite Volume Method. Characterization is posed as a… ▽ More Of interest is the characterization of a cancellous bone immersed in an acoustic fluid. The bone is placed between an ultrasonic point source and a receiver. Cancellous bone is regarded as a porous medium saturated with fluid according to Biot's theory. This model is coupled with the fluid in an open pore configuration and solved by means of the Finite Volume Method. Characterization is posed as a Bayesian parameter estimation problem in Biot's model given pressure data collected at the receiver. As a first step we present numerical results in 2D for signal recovery. It is shown that as point estimators, the Conditional Mean outperforms the classical PDE-constrained minimization solution. △ Less

Submitted 1 February, 2018; originally announced February 2018.

MSC Class: 35L53; 65M08; 62F15

arXiv:1802.00271 [pdf, ps, other]

The condition of a function relative to a polytope

Authors: David H. Gutman, Javier F. Pena

Abstract: The condition number of a smooth convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is precisely the square of the diameter-to-width ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bo… ▽ More The condition number of a smooth convex function, namely the ratio of its smoothness to strong convexity constants, is closely tied to fundamental properties of the function. In particular, the condition number of a quadratic convex function is precisely the square of the diameter-to-width ratio of a canonical ellipsoid associated to the function. Furthermore, the condition number of a function bounds the linear rate of convergence of the gradient descent algorithm for unconstrained minimization. We propose a condition number of a smooth convex function relative to a reference polytope. This relative condition number is defined as the ratio of a relative smooth constant to a relative strong convexity constant of the function, where both constants are relative to the reference polytope. The relative condition number extends the main properties of the traditional condition number. In particular, we show that the condition number of a quadratic convex function relative to a polytope is precisely the square of the diameter-to-facial-distance ratio of a scaled polytope for a canonical scaling induced by the function. Furthermore, we illustrate how the relative condition number of a function bounds the linear rate of convergence of first-order methods for minimization of the function over the polytope. △ Less

Submitted 1 February, 2018; originally announced February 2018.

Comments: 18 pages

arXiv:1801.02509 [pdf, ps, other]

Convergence rates of proximal gradient methods via the convex conjugate

Authors: David H. Gutman, Javier F. Pena

Abstract: We give a novel proof of the $O(1/k)$ and $O(1/k^2)$ convergence rates of the proximal gradient and accelerated proximal gradient methods for composite convex minimization. The crux of the new proof is an upper bound constructed via the convex conjugate of the objective function. We give a novel proof of the $O(1/k)$ and $O(1/k^2)$ convergence rates of the proximal gradient and accelerated proximal gradient methods for composite convex minimization. The crux of the new proof is an upper bound constructed via the convex conjugate of the objective function. △ Less

Submitted 8 January, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

MSC Class: 90C25; 90C46; 90C52

arXiv:1711.09990 [pdf, ps, other]

Identification of Strong Edges in AMP Chain Graphs

Authors: Jose M. Peña

Abstract: The essential graph is a distinguished member of a Markov equivalence class of AMP chain graphs. However, the directed edges in the essential graph are not necessarily strong or invariant, i.e. they may not be shared by every member of the equivalence class. Likewise for the undirected edges. In this paper, we develop a procedure for identifying which edges in an essential graph are strong. We als… ▽ More The essential graph is a distinguished member of a Markov equivalence class of AMP chain graphs. However, the directed edges in the essential graph are not necessarily strong or invariant, i.e. they may not be shared by every member of the equivalence class. Likewise for the undirected edges. In this paper, we develop a procedure for identifying which edges in an essential graph are strong. We also show how this makes it possible to bound some causal effects when the true chain graph is unknown. △ Less

Submitted 25 June, 2018; v1 submitted 23 November, 2017; originally announced November 2017.

Comments: arXiv admin note: text overlap with arXiv:1303.0691

Journal ref: UAI 2018

arXiv:1709.06787 [pdf, ps, other]

Optimal interval length for the collocation of the Newton basis

Authors: J. M. Carnicer, Y. Khiar, J. M. Peña

Abstract: It is known that the Lagrange interpolation problem at equidistant nodes is ill-conditioned. We explore the influence of the interval length in the computation of divided differences of the Newton interpolation formula. Condition numbers are computed for lower triangular matrices associated to the Newton interpolation formula at equidistant nodes. We consider the collocation matrices $L$ and… ▽ More It is known that the Lagrange interpolation problem at equidistant nodes is ill-conditioned. We explore the influence of the interval length in the computation of divided differences of the Newton interpolation formula. Condition numbers are computed for lower triangular matrices associated to the Newton interpolation formula at equidistant nodes. We consider the collocation matrices $L$ and $P_L$ of the monic Newton basis and a normalized Newton basis, so that $P_L$ is the lower triangular Pascal matrix. In contrast to $L$, $P_L$ does not depend on the interval length, and we show that the Skeel condition number of the $(n+1)\times (n+1)$ lower triangular Pascal matrix is $3^n$. The $\infty$-norm condition number of the collocation matrix $L$ of the monic Newton basis is computed in terms of the interval length. The minimum asymptotic growth rate is achieved for intervals of length 3. △ Less

Submitted 20 September, 2017; originally announced September 2017.

MSC Class: 41A05; 65F35; 15A12

arXiv:1709.03435 [pdf, ps, other]

Positive polynomials on unbounded domains

Authors: Javer Pena, Juan C. Vera, Luis F. Zuluaga

Abstract: Certificates of non-negativity such as Putinar's Positivstellensatz have been used to obtain powerful numerical techniques to solve polynomial optimization (PO) problems. Putinar's certificate uses sum-of-squares (sos) polynomials to certify the non-negativity of a given polynomial over a domain defined by polynomial inequalities. This certificate assumes the Archimedean property of the associated… ▽ More Certificates of non-negativity such as Putinar's Positivstellensatz have been used to obtain powerful numerical techniques to solve polynomial optimization (PO) problems. Putinar's certificate uses sum-of-squares (sos) polynomials to certify the non-negativity of a given polynomial over a domain defined by polynomial inequalities. This certificate assumes the Archimedean property of the associated quadratic module, which in particular implies compactness of the domain. In this paper we characterize the existence of a certificate of non-negativity for polynomials over a possibly unbounded domain, without the use of the associated quadratic module. Next, we show that the certificate can be used to convergent linear matrix inequality (LMI) hierarchies for PO problems with unbounded feasible sets. Furthermore, by using copositive polynomials to certify non-negativity, instead of sos polynomials, the certificate allows the use of a very rich class of convergent LMI hierarchies to approximate the solution of general PO problems. Throughout the article we illustrate our results with various examples certifying the non-negativity of polynomials over possibly unbounded sets defined by polynomial equalities or inequalities. △ Less

Submitted 11 September, 2017; originally announced September 2017.

arXiv:1707.09084 [pdf, ps, other]

Convergence of first-order methods via the convex conjugate

Authors: Javier Pena

Abstract: This paper gives a unified and succinct approach to the $O(1/\sqrt{k}), O(1/k),$ and $O(1/k^2)$ convergence rates of the subgradient, gradient, and accelerated gradient methods for unconstrained convex minimization. In the three cases the proof of convergence follows from a generic bound defined by the convex conjugate of the objective function. This paper gives a unified and succinct approach to the $O(1/\sqrt{k}), O(1/k),$ and $O(1/k^2)$ convergence rates of the subgradient, gradient, and accelerated gradient methods for unconstrained convex minimization. In the three cases the proof of convergence follows from a generic bound defined by the convex conjugate of the objective function. △ Less

Submitted 27 July, 2017; originally announced July 2017.

arXiv:1610.06960 [pdf, other]

Permutation tests in the two-sample problem for functional data

Authors: Alejandra Cabaña, Ana Maria Estrada, Jairo I. Peña, Adolfo J. Quiroz

Abstract: Three different permutation test schemes are discussed and compared in the context of the two-sample problem for functional data. One of the procedures was essentially introduced by Lopez-Pintado and Romo (2009), using notions of functional data depth to adapt the ideas originally proposed by Liu and Singh (1993) for multivariate data. Of the new methods introduced here, one is also based on funct… ▽ More Three different permutation test schemes are discussed and compared in the context of the two-sample problem for functional data. One of the procedures was essentially introduced by Lopez-Pintado and Romo (2009), using notions of functional data depth to adapt the ideas originally proposed by Liu and Singh (1993) for multivariate data. Of the new methods introduced here, one is also based on functional data depths, but uses a different way (inspired by Meta-Analysis) to assess the significance of the depth differences. The second new method presented here adapts, to the functional data setting, the k-nearest-neighbors statistic of Schilling (1986). The three methods are compared among them and against the test of Horvath and Kokoszka (2012) in simulated examples and real data. The comparison considers the performance of the statistics in terms of statistical power and in terms of computational cost. △ Less

Submitted 21 October, 2016; originally announced October 2016.

MSC Class: 62G10 (Primary) 62M99 (Secondary)

arXiv:1604.04637 [pdf, ps, other]

On the Grassmann condition number

Authors: Javier Pena, Vera Roshchina

Abstract: We give new insight into the Grassmann condition of the conic feasibility problem \[ x \in L \cap K \setminus\{0\}. \] Here $K\subseteq V$ is a regular convex cone and $L\subseteq V$ is a linear subspace of the finite dimensional Euclidean vector space $V$. The Grassmann condition of this problem is the reciprocal of the distance from $L$ to the set of ill-posed instances in the Grassmann manifold… ▽ More We give new insight into the Grassmann condition of the conic feasibility problem \[ x \in L \cap K \setminus\{0\}. \] Here $K\subseteq V$ is a regular convex cone and $L\subseteq V$ is a linear subspace of the finite dimensional Euclidean vector space $V$. The Grassmann condition of this problem is the reciprocal of the distance from $L$ to the set of ill-posed instances in the Grassmann manifold where $L$ lives. We consider a very general distance in the Grassmann manifold defined by two possibly different norms in $V$. We establish the equivalence between the Grassmann distance to ill-posedness of the above problem and a natural measure of the least violated trial solution to its alternative feasibility problem. We also show a tight relationship between the Grassmann and Renegar's condition measures, and between the Grassman measure and a symmetry measure of the above feasibility problem. Our approach can be readily specialized to a canonical norm in $V$ induced by $K$, a prime example being the one-norm for the non-negative orthant. For this special case we show that the Grassmann distance ill-posedness of is equivalent to a measure of the most interior solution to the above conic feasibility problem. △ Less

Submitted 26 April, 2016; v1 submitted 15 April, 2016; originally announced April 2016.

arXiv:1512.06154 [pdf, ps, other]

Solving Conic Systems via Projection and Rescaling

Authors: Javier Pena, Negar Soheili

Abstract: We propose a simple projection and rescaling algorithm to solve the feasibility problem \[ \text{ find } x \in L \cap Ω, \] where $L$ and $Ω$ are respectively a linear subspace and the interior of a symmetric cone in a finite-dimensional vector space $V$. This projection and rescaling algorithm is inspired by previous work on rescaled versions of the perceptron algorithm and by Chubanov's projec… ▽ More We propose a simple projection and rescaling algorithm to solve the feasibility problem \[ \text{ find } x \in L \cap Ω, \] where $L$ and $Ω$ are respectively a linear subspace and the interior of a symmetric cone in a finite-dimensional vector space $V$. This projection and rescaling algorithm is inspired by previous work on rescaled versions of the perceptron algorithm and by Chubanov's projection-based method for linear feasibility problems. As in these predecessors, each main iteration of our algorithm contains two steps: a {\em basic procedure} and a {\em rescaling} step. When $L \cap Ω\ne \emptyset$, the projection and rescaling algorithm finds a point $x \in L \cap Ω$ in at most $O(\log(1/δ(L \cap Ω)))$ iterations, where $δ(L \cap Ω) \in (0,1]$ is a measure of the most interior point in $L \cap Ω$. The ideal value $δ(L\cap Ω) = 1$ is attained when $L \cap Ω$ contains the center of the symmetric cone $Ω$. We describe several possible implementations for the basic procedure including a perceptron scheme and a smooth perceptron scheme. The perceptron scheme requires $O(r^4)$ perceptron updates and the smooth perceptron scheme requires $O(r^2)$ smooth perceptron updates, where $r$ stands for the Jordan algebra rank of $V$. △ Less

Submitted 15 December, 2016; v1 submitted 18 December, 2015; originally announced December 2015.

arXiv:1512.06142 [pdf, other]

Polytope conditioning and linear convergence of the Frank-Wolfe algorithm

Authors: Javier Pena, Daniel Rodriguez

Abstract: It is known that the gradient descent algorithm converges linearly when applied to a strongly convex function with Lipschitz gradient. In this case the algorithm's rate of convergence is determined by the condition number of the function. In a similar vein, it has been shown that a variant of the Frank-Wolfe algorithm with away steps converges linearly when applied to a strongly convex function wi… ▽ More It is known that the gradient descent algorithm converges linearly when applied to a strongly convex function with Lipschitz gradient. In this case the algorithm's rate of convergence is determined by the condition number of the function. In a similar vein, it has been shown that a variant of the Frank-Wolfe algorithm with away steps converges linearly when applied to a strongly convex function with Lipschitz gradient over a polytope. In a nice extension of the unconstrained case, the algorithm's rate of convergence is determined by the product of the condition number of the function and a certain condition number of the polytope. We shed new light into the latter type of polytope conditioning. In particular, we show that previous and seemingly different approaches to define a suitable condition measure for the polytope are essentially equivalent to each other. Perhaps more interesting, they can all be unified via a parameter of the polytope that formalizes a key premise linked to the algorithm's linear convergence. We also give new insight into the linear convergence property. For a convex quadratic objective, we show that the rate of convergence is determined by a condition number of a suitably scaled polytope. △ Less

Submitted 24 December, 2016; v1 submitted 18 December, 2015; originally announced December 2015.

arXiv:1507.04073 [pdf, ps, other]

On the von Neumann and Frank-Wolfe Algorithms with Away Steps

Authors: Javier Pena, Daniel Rodriguez, Negar Soheili

Abstract: The von Neumann algorithm is a simple coordinate-descent algorithm to determine whether the origin belongs to a polytope generated by a finite set of points. When the origin is in the of the polytope, the algorithm generates a sequence of points in the polytope that converges linearly to zero. The algorithm's rate of convergence depends on the radius of the largest ball around the origin contained… ▽ More The von Neumann algorithm is a simple coordinate-descent algorithm to determine whether the origin belongs to a polytope generated by a finite set of points. When the origin is in the of the polytope, the algorithm generates a sequence of points in the polytope that converges linearly to zero. The algorithm's rate of convergence depends on the radius of the largest ball around the origin contained in the polytope. We show that under the weaker condition that the origin is in the polytope, possibly on its boundary, a variant of the von Neumann algorithm that includes generates a sequence of points in the polytope that converges linearly to zero. The new algorithm's rate of convergence depends on a certain geometric parameter of the polytope that extends the above radius but is always positive. Our linear convergence result and geometric insights also extend to a variant of the Frank-Wolfe algorithm with away steps for minimizing a strongly convex function over a polytope. △ Less

Submitted 25 November, 2015; v1 submitted 14 July, 2015; originally announced July 2015.

Showing 1–50 of 89 results for author: Peña, J