Skip to main content

Showing 1–44 of 44 results for author: Toint, P L

.
  1. arXiv:2505.06374  [pdf, ps, other

    math.OC

    Fast Stochastic Second-Order Adagrad for Nonconvex Bound-Constrained Optimization

    Authors: S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

    Abstract: ADAGB2, a generalization of the Adagrad algorithm for stochastic optimization is introduced, which is also applicable to bound-constrained problems and capable of using second-order information when available. It is shown that, given $δ\in(0,1)$ and $ε\in(0,1]$, the ADAGB2 algorithm needs at most $\calO(ε^{-2})$ iterations to ensure an $ε$-approximate first-order critical point of the bound-constr… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    MSC Class: 49M37; 65K05; 68Q17; 68W40; 90C30 ACM Class: F.2.1; G.1.6; I.1.2

  2. arXiv:2505.04807  [pdf, other

    math.OC

    A Fast Newton Method Under Local Lipschitz Smoothness

    Authors: Serge Gratton, Sadok Jerad, Philippe L. Toint

    Abstract: A new, fast second-order method is proposed that achieves the optimal $\mathcal{O}\left(|\log(ε)|ε^{-3/2}\right)$ complexity to obtain first-order $ε$-stationary points. Crucially, this is deduced without assuming the standard global Lipschitz Hessian continuity condition, but only using an appropriate local smoothness requirement. The algorithm exploits Hessian information to compute… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  3. arXiv:2502.08308  [pdf, ps, other

    math.OC

    prunAdag: an adaptive pruning-aware gradient method

    Authors: Margherita Porcelli, Giovanni Seraghiti, Philippe L. Toint

    Abstract: A pruning-aware adaptive gradient method is proposed which classifies the variables in two sets before updating them using different strategies. This technique extends the ``relevant/irrelevant" approach of Ding (2019) and Zimmer et al. (2022) and allows a posteriori sparsification of the solution of model parameter fitting problems. The new method is proved to be convergent with a global rate of… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    MSC Class: 49M05; 49M15; 65K10; 68Q25; 90C26 ACM Class: F.2.1; I.2.6; G.1.6

  4. arXiv:2409.16047  [pdf, other

    math.OC cs.CC

    Examples of slow convergence for adaptive regularization optimization methods are not isolated

    Authors: Philippe L. Toint

    Abstract: The adaptive regularization algorithm for unconstrained nonconvex optimization was shown in Nesterov and Polyak (2006) and Cartis, Gould and Toint (2011) to require, under standard assumptions, at most $\mathcal{O}(ε^{3/(3-q)})$ evaluations of the objective function and its derivatives of degrees one and two to produce an $ε$-approximate critical point of order $q\in\{1,2\}$. This bound was shown… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: 11 pages, 1 figure

    MSC Class: 49M37; 65K05; 68Q17; 68W40; 90C30 ACM Class: F.2.1; G.1.6; I.1.2

  5. arXiv:2408.09124  [pdf, other

    math.OC cs.CC

    Refining asymptotic complexity bounds for nonconvex optimization methods, including why steepest descent is $o(ε^{-2})$ rather than $\mathcal{O}(ε^{-2})$

    Authors: Serge Gratton, Chee-Khian Sim, Philippe L. Toint

    Abstract: We revisit the standard ``telescoping sum'' argument ubiquitous in the final steps of analyzing evaluation complexity of algorithms for smooth nonconvex optimization, and obtain a refined formulation of the resulting bound as a function of the requested accuracy $ε$. While bounds obtained using the standard argument typically are of the form $\mathcal{O}(ε^{-α})$ for some positive $α$, the refined… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 10 ages, 1 figure

    MSC Class: 49M37; 65K05; 68Q17; 68W40; 90C30 ACM Class: F.2.1; G.1.6; I.1.2

  6. arXiv:2407.08018  [pdf, other

    math.OC

    A Stochastic Objective-Function-Free Adaptive Regularization Method with Optimal Complexity

    Authors: Serge Gratton, Sadok Jerad, Philippe L. Toint

    Abstract: A fully stochastic second-order adaptive-regularization method for unconstrained nonconvex optimization is presented which never computes the objective-function value, but yet achieves the optimal $\mathcal{O}(ε^{-3/2})$ complexity bound for finding first-order critical points. The method is noise-tolerant and the inexactness conditions required for convergence depend on the history of past steps.… ▽ More

    Submitted 21 January, 2025; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 32 pages, 9 figures

    MSC Class: 49M37; 65K05; 68Q17; 68W40; 90C30 ACM Class: F.2.1; G.1.6; I.1.2

  7. arXiv:2407.07812  [pdf, ps, other

    math.OC cs.PF

    S2MPJ and CUTEst optimization problems for Matlab, Python and Julia

    Authors: Serge Gratton, Philippe L. Toint

    Abstract: A new decoder for the SIF test problems of the CUTEst collection is described, which produces problem files allowing the computation of values and derivatives of the objective function and constraints of most \cutest\ problems directly within ``native'' Matlab, Python or Julia, without any additional installation or interfacing with MEX files or Fortran programs. When used with Matlab, the new pro… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    MSC Class: 49N99; 65K05; 65Y20; 68N99; 90C30 ACM Class: G.1.6

  8. arXiv:2406.15793  [pdf, ps, other

    math.OC

    Complexity of Adagrad and other first-order methods for nonconvex optimization problems with bounds constraints

    Authors: Serge Gratton, Sadok Jerad, Philippe L. Toint

    Abstract: A parametric class of trust-region algorithms for constrained nonconvex optimization is analyzed, where the objective function is never computed. By defining appropriate first-order stationarity criteria, we are able to extend the Adagrad method to the newly considered problem and retrieve the standard complexity rate of the projected gradient method that uses both the gradient and objective funct… ▽ More

    Submitted 1 November, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    MSC Class: 90C60; 90C30; 90C15; 90C26; 49N30 ACM Class: F.2.1; G.1.6

  9. arXiv:2310.16580  [pdf, ps, other

    math.OC

    An optimally fast objective-function-free minimization algorithm using random subspaces

    Authors: S. Bellavia, S. Gratton, B. Morini, Ph. L. Toint

    Abstract: An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this random approximation technique does not affect the method's convergence nor its evaluation complexity for the search of an $ε$-approximate first-order critical poin… ▽ More

    Submitted 30 January, 2025; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 23 pages

    MSC Class: 60G99; 65K05; 68M20; 68Q17; 90C26 ACM Class: G.6.1; F.2.1

  10. arXiv:2308.00720  [pdf, ps, other

    cs.LG

    Divergence of the ADAM algorithm with fixed-stepsize: a (very) simple example

    Authors: Ph. L. Toint

    Abstract: A very simple unidimensional function with Lipschitz continuous gradient is constructed such that the ADAM algorithm with constant stepsize, started from the origin, diverges when applied to minimize this function in the absence of noise on the gradient. Divergence occurs irrespective of the choice of the method parameters.

    Submitted 1 August, 2023; originally announced August 2023.

    MSC Class: 65K10; 90C26; 90C30 ACM Class: G.6.1; I.2.6

  11. arXiv:2305.14477  [pdf, other

    cs.LG math.OC

    A Block-Coordinate Approach of Multi-level Optimization with an Application to Physics-Informed Neural Networks

    Authors: Serge Gratton, Valentin Mercier, Elisa Riccietti, Philippe L. Toint

    Abstract: Multi-level methods are widely used for the solution of large-scale problems, because of their computational advantages and exploitation of the complementarity between the involved sub-problems. After a re-interpretation of multi-level methods from a block-coordinate point of view, we propose a multi-level algorithm for the solution of nonlinear optimization problems and analyze its evaluation com… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  12. arXiv:2302.10065  [pdf, ps, other

    math.OC

    Yet another fast variant of Newton's method for nonconvex optimization

    Authors: Serge Gratton, Sadok Jerad, Philippe L. Toint

    Abstract: A class of second-order algorithms is proposed for minimizing smooth nonconvex functions that alternates between regularized Newton and negative curvature steps in an iteration-dependent subspace. In most cases, the Hessian matrix is regularized with the square root of the current gradient and an additional term taking moderate negative curvature into account, a negative curvature step being taken… ▽ More

    Submitted 20 August, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 32 pages, 2 Figure, 4 Tables

    MSC Class: 49M37; 65K05; 90C26; 90C30 ACM Class: G.1.6; F.2.1

  13. arXiv:2302.07049  [pdf, other

    math.OC cs.AI cs.CC

    Multilevel Objective-Function-Free Optimization with an Application to Neural Networks Training

    Authors: S. Gratton, A. Kopanicakova, Ph. L. Toint

    Abstract: A class of multi-level algorithms for unconstrained nonlinear optimization is presented which does not require the evaluation of the objective function. The class contains the momentum-less AdaGrad method as a particular (single-level) instance. The choice of avoiding the evaluation of the objective function is intended to make the algorithms of the class less sensitive to noise, while the multi-l… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 29 pages, 4 figures

    MSC Class: 49K20; 65M55; 65Y20; 68Q25; 68T05; 90C26; 90C30 ACM Class: F.2.1; G.1.8; I.2.5

  14. arXiv:2203.09947  [pdf, ps, other

    math.OC

    Convergence properties of an Objective-Function-Free Optimization regularization algorithm, including an $\mathcal{O}(ε^{-3/2})$ complexity bound

    Authors: S. Gratton, S. Jerad, Ph. L. Toint

    Abstract: An adaptive regularization algorithm for unconstrained nonconvex optimization is presented in which the objective function is never evaluated, but only derivatives are used. This algorithm belongs to the class of adaptive regularization methods, for which optimal worst-case complexity results are known for the standard framework where the objective function is evaluated. It is shown in this paper… ▽ More

    Submitted 4 May, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    MSC Class: 90C60; 90C30; 90C15; 90C26; 49N30 ACM Class: F.2.1; G.1.6

  15. arXiv:2203.03351  [pdf, ps, other

    math.OC

    OFFO minimization algorithms for second-order optimality and their complexity

    Authors: S. Gratton, Ph. L. Toint

    Abstract: An Adagrad-inspired class of algorithms for smooth unconstrained optimization is presented in which the objective function is never evaluated and yet the gradient norms decrease at least as fast as $\calO(1/\sqrt{k+1})$ while second-order optimality measures converge to zero at least as fast as $\calO(1/(k+1)^{1/3})$. This latter rate of convergence is shown to be essentially sharp and is identica… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    MSC Class: 90C60; 90C30; 90C26; 90C15; 49N30 ACM Class: F.2.1; G.1.6

    Journal ref: Computational Optimization and Applications, 84, pages 573 - 607, 2023

  16. arXiv:2203.01757  [pdf, other

    math.OC

    Complexity and performance for two classes of noise-tolerant first-order algorithms

    Authors: S. Gratton, S. Jerad, Ph. L. Toint

    Abstract: Two classes of algorithms for optimization in the presence of noise are presented, that do not require the evaluation of the objective function. The first generalizes the well-known Adagrad method. Its complexity is then analyzed as a function of its parameters. A second class of algorithms is then derived whose complexity is at least as good as that of the first class. Initial numerical experimen… ▽ More

    Submitted 29 January, 2025; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 3 figures. arXiv admin note: substantial text overlap with arXiv:2203.01647

    MSC Class: 90C60; 90C30; 90C15; 90C26; 49N30 ACM Class: F.2.1; G.1.6

  17. arXiv:2203.01647  [pdf, ps, other

    math.OC cs.CC

    Complexity of a Class of First-Order Objective-Function-Free Optimization Algorithms

    Authors: S. Gratton, S. Jerad, Ph. L. Toint

    Abstract: A parametric class of trust-region algorithms for unconstrained nonconvex optimization is considered where the value of the objective function is never computed. The class contains a deterministic version of the first-order Adagrad method typically used for minimization of noisy function, but also allows the use of (possibly approximate) second-order information when available. The rate of converg… ▽ More

    Submitted 6 June, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    MSC Class: 90C60; 90C30; 90C15; 90C26; 49N30 ACM Class: F.2.1; G.1.6

  18. arXiv:2112.06176  [pdf, ps, other

    math.OC

    Trust-region algorithms: probabilistic complexity and intrinsic noise with applications to subsampling techniques

    Authors: S. Bellavia, G. Gurioli, B. Morini, Ph. L. Toint

    Abstract: A trust-region algorithm is presented for finding approximate minimizers of smooth unconstrained functions whose values and derivatives are subject to random noise. It is shown that, under suitable probabilistic assumptions, the new method finds (in expectation) an $ε$-approximate minimizer of arbitrary order $ q \geq 1$ in at most $\mathcal{O}(ε^{-(q+1)})$ inexact evaluations of the function and… ▽ More

    Submitted 30 December, 2021; v1 submitted 12 December, 2021; originally announced December 2021.

    MSC Class: 65K05; 65C50; 90C26 ACM Class: F.2.1; G.1.6

  19. arXiv:2112.05636  [pdf, ps, other

    math.OC

    OPM, a collection of Optimization Problems in Matlab

    Authors: Serge Gratton, Philippe L. Toint

    Abstract: OPM is a small collection of CUTEst unconstrained and bound-constrained nonlinear optimization problems, which can be used in Matlab for testing optimization algorithms directly (i.e. without installing additional software).

    Submitted 16 January, 2025; v1 submitted 10 December, 2021; originally announced December 2021.

    MSC Class: 90C25; 90C26; 90C30 ACM Class: G.1.6; G.4

  20. arXiv:2111.14098  [pdf, ps, other

    math.OC

    An adaptive regularization algorithm for unconstrained optimization with inexact function and derivatives values

    Authors: N. I. M. Gould, Ph. L. Toint

    Abstract: An adaptive regularization algorithm for unconstrained nonconvex optimization is proposed that is capable of handling inexact objective-function and derivative values, and also of providing approximate minimizer of arbitrary order. In comparison with a similar algorithm proposed in Cartis, Gould, Toint (2021), its distinguishing feature is that it is based on controlling the relative error between… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    MSC Class: 49M37; 90C26; 90C30; 90C56 ACM Class: F.2.2; G.1.6

  21. arXiv:2105.07765  [pdf, other

    math.OC

    Adaptive Regularization Minimization Algorithms with Non-Smooth Norms and Euclidean Curvature

    Authors: Serge Gratton, Philippe L. Toint

    Abstract: A regularization algorithm (AR1pGN) for unconstrained nonlinear minimization is considered, which uses a model consisting of a Taylor expansion of arbitrary degree and regularization term involving a possibly non-smooth norm. It is shown that the non-smoothness of the norm does not affect the $O(ε_1^{-(p+1)/p})$ upper bound on evaluation complexity for finding first-order $ε_1$-approximate minimiz… ▽ More

    Submitted 27 May, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: A correction will be available soon

    MSC Class: 90C60; 90C26; 49J52; 49M37 ACM Class: G.1.6; F.2.1

  22. arXiv:2104.02564  [pdf, ps, other

    math.OC cs.CC math.FA math.NA

    Hölder Gradient Descent and Adaptive Regularization Methods in Banach Spaces for First-Order Points

    Authors: Serge Gratton, Sadok Jerad, Philippe L. Toint

    Abstract: This paper considers optimization of smooth nonconvex functionals in smooth infinite dimensional spaces. A Hölder gradient descent algorithm is first proposed for finding approximate first-order points of regularized polynomial functionals. This method is then applied to analyze the evaluation complexity of an adaptive regularization method which searches for approximate first-order points of func… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    MSC Class: 49K27; 49M37; 49M05; 49M20; 90C48; 90C26; 90C30 ACM Class: F.2.1; G.1.6

  23. arXiv:2104.02519  [pdf, ps, other

    math.OC cs.CC math.NA

    The Impact of Noise on Evaluation Complexity: The Deterministic Trust-Region Case

    Authors: Stefania Bellavia, Gianmarco Gurioli, Benedetta Morini, Philippe L. Toint

    Abstract: Intrinsic noise in objective function and derivatives evaluations may cause premature termination of optimization algorithms. Evaluation complexity bounds taking this situation into account are presented in the framework of a deterministic trust-region method. The results show that the presence of intrinsic noise may dominate these bounds, in contrast with what is known for methods in which the in… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    MSC Class: 90C26; 90C30; 90C56; 90C59; 49M37; 49M05 ACM Class: F.2.1; G.1.6

  24. arXiv:2104.00592  [pdf, ps, other

    math.NA

    Quadratic and Cubic Regularisation Methods with Inexact function and Random Derivatives for Finite-Sum Minimisation

    Authors: Stefania Bellavia, Gianmarco Gurioli, Benedetta Morini, Philippe L. Toint

    Abstract: This paper focuses on regularisation methods using models up to the third order to search for up to second-order critical points of a finite-sum minimisation problem. The variant presented belongs to the framework of [3]: it employs random models with accuracy guaranteed with a sufficiently large prefixed probability and deterministic inexact function evaluations within a prescribed level of accur… ▽ More

    Submitted 2 April, 2021; v1 submitted 30 March, 2021; originally announced April 2021.

    Comments: 9 pages

  25. arXiv:2011.00854  [pdf, ps, other

    math.OC

    Strong Evaluation Complexity of An Inexact Trust-Region Algorithm for Arbitrary-Order Unconstrained Nonconvex Optimization

    Authors: C. Cartis, N. I. M. Gould, Ph. L. Toint

    Abstract: A trust-region algorithm using inexact function and derivatives values is introduced for solving unconstrained smooth optimization problems. This algorithm uses high-order Taylor models and allows the search of strong approximate minimizers of arbitrary order. The evaluation complexity of finding a $q$-th approximate minimizer using this algorithm is then shown, under standard conditions, to be… ▽ More

    Submitted 12 October, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    MSC Class: 65Y20; 90C30; 90C60 ACM Class: F.2.1; G.1.6

  26. arXiv:2005.04639  [pdf, ps, other

    math.OC

    Adaptive Regularization for Nonconvex Optimization Using Inexact Function Values and Randomly Perturbed Derivatives

    Authors: S. Bellavia, G. Gurioli, B. Morini, Ph. L. Toint

    Abstract: A regularization algorithm allowing random noise in derivatives and inexact function values is proposed for computing approximate local critical points of any order for smooth unconstrained optimization problems. For an objective function with Lipschitz continuous $p$-th derivative and given an arbitrary optimality order $q \leq p$, it is shown that this algorithm will, in expectation, compute suc… ▽ More

    Submitted 6 April, 2021; v1 submitted 10 May, 2020; originally announced May 2020.

    Comments: 22 pages

    MSC Class: 49K10; 49M37; 65K05; 68W40; 90C15 ACM Class: G.1.6; F.2.1

  27. arXiv:2001.10802  [pdf, ps, other

    math.OC

    Strong Evaluation Complexity Bounds for Arbitrary-Order Optimization of Nonconvex Nonsmooth Composite Functions

    Authors: Coralia Cartis, Nick Gould, Philippe L. Toint

    Abstract: We introduce the concept of strong high-order approximate minimizers for nonconvex optimization problems. These apply in both standard smooth and composite non-smooth settings, and additionally allow convex or inexpensive constraints. An adaptive regularization algorithm is then proposed to find such approximate minimizers. Under suitable Lipschitz continuity assumptions, whenever the feasible set… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: 32 pages, 1 figure

    MSC Class: 90C60; 90C46; 90C30; 90C26; 65K10; 49M37 ACM Class: F.2.1; G.1.6

  28. arXiv:2001.04801  [pdf, ps, other

    math.OC

    Exploiting problem structure in derivative free optimization

    Authors: Margherita Porcelli, Philippe L. Toint

    Abstract: A structured version of derivative-free random pattern search optimization algorithms is introduced which is able to exploit coordinate partially separable structure (typically associated with sparsity) often present in unconstrained and bound-constrained optimization problems. This technique improves performance by orders of magnitude and makes it possible to solve large problems that otherwise a… ▽ More

    Submitted 12 January, 2021; v1 submitted 14 January, 2020; originally announced January 2020.

    MSC Class: 65K05; 90C56; 90C90

  29. arXiv:1909.04991  [pdf, other

    math.OC math-ph

    An algorithm for optimization with disjoint linear constraints and its application for predicting rain

    Authors: Tijana Janjic, Yvonne Ruckstuhl, Philippe L. Toint

    Abstract: A specialized algorithm for quadratic optimization (QO, or, formerly, QP) with disjoint linear constraints is presented. In the considered class of problems, a subset of variables are subject to linear equality constraints, while variables in a different subset are constrained to remain in a convex set. The proposed algorithm exploits the structure by combining steps in the nullspace of the equali… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: 13 pages, 2 figures

    MSC Class: 65K05; 90C20; 86A10 ACM Class: G.1.6; J.1

  30. arXiv:1902.10767  [pdf, ps, other

    math.OC

    High-Order Evaluation Complexity for Convexly-Constrained Optimization with Non-Lipschitzian Group Sparsity Terms

    Authors: X. Chen, Ph. L. Toint

    Abstract: This paper studies high-order evaluation complexity for partially separable convexly-constrained optimization involving non-Lipschitzian group sparsity terms in a nonconvex objective function. We propose a partially separable adaptive regularization algorithm using a $p$-th order Taylor model and show that the algorithm can produce an (epsilon,delta)-approximate q-th-order stationary point in at m… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 27 pages

    MSC Class: 90C30; 90C46; 65K05

  31. arXiv:1902.10406  [pdf, ps, other

    math.OC

    Minimization of nonsmooth nonconvex functions using inexact evaluations and its worst-case complexity

    Authors: S. Gratton, E. Simon, Ph. L. Toint

    Abstract: An adaptive regularization algorithm using inexact function and derivatives evaluations is proposed for the solution of composite nonsmooth nonconvex optimization. It is shown that this algorithm needs at most $O(|\log(ε)|\,ε^{-2})$ evaluations of the problem's functions and their derivatives for finding an $ε$-approximate first-order stationary point. This complexity bound therefore generalizes t… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 19 pages

    MSC Class: 49K10; 49M37; 65K05; 68T05; 68W40 ACM Class: F.1.3; F.2.1; G.1.6; I.2.6

  32. arXiv:1902.03056  [pdf, ps, other

    math.ST math.NA math.OC

    Bernstein Concentration Inequalities for Tensors via Einstein Products

    Authors: Z. Luo, L. Qi, Ph. L. Toint

    Abstract: A generalization of the Bernstein matrix concentration inequality to random tensors of general order is proposed. This generalization is based on the use of Einstein products between tensors, from which a strong link can be established between matrices and tensors, in turn allowing exploitation of existing results for the former.

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: 12 pages

    MSC Class: 15A52; 15A72; 49J55; 60H25 ACM Class: F.2.1; G.1.3; G.3

    Journal ref: Frontiers of Mathematics in China, vol. 5(2), pp. 367-384, 2020

  33. arXiv:1812.03467  [pdf, ps, other

    math.NA cs.LG cs.MS math.OC

    A note on solving nonlinear optimization problems in variable precision

    Authors: S. Gratton, Ph. L. Toint

    Abstract: This short note considers an efficient variant of the trust-region algorithm with dynamic accuracy proposed Carter (1993) and Conn, Gould and Toint (2000) as a tool for very high-performance computing, an area where it is critical to allow multi-precision computations for keeping the energy dissipation under control. Numerical experiments are presented indicating that the use of the considered met… ▽ More

    Submitted 12 April, 2019; v1 submitted 9 December, 2018; originally announced December 2018.

    Comments: 11 pages, 2 figures

    MSC Class: 90C26; 90C30; 65K05 ACM Class: G.1.6; F.2.1; B.2.3; B.2.4; I.2.5

  34. arXiv:1811.07057  [pdf, ps, other

    math.OC

    Universal regularization methods - varying the power, the smoothness and the accuracy

    Authors: Coralia Cartis, Nicholas I. M. Gould, Philippe L. Toint

    Abstract: Adaptive cubic regularization methods have emerged as a credible alternative to linesearch and trust-region for smooth nonconvex optimization, with optimal complexity amongst second-order methods. Here we consider a general/new class of adaptive regularization methods, that use first- or higher-order local Taylor models of the objective regularized by a(ny) power of the step size and applied to co… ▽ More

    Submitted 16 November, 2018; originally announced November 2018.

    Report number: Technical report, Oxford University, Numerical Analysis Group, 2017

  35. arXiv:1811.03831  [pdf, ps, other

    math.OC cs.AI cs.CC

    Adaptive Regularization Algorithms with Inexact Evaluations for Nonconvex Optimization

    Authors: S. Bellavia, G. Gurioli, B. Morini, Ph. L. Toint

    Abstract: A regularization algorithm using inexact function values and inexact derivatives is proposed and its evaluation complexity analyzed. This algorithm is applicable to unconstrained problems and to problems with inexpensive constraints (that is constraints whose evaluation and enforcement has negligible cost) under the assumption that the derivative of highest degree is $β$-Hölder continuous. It feat… ▽ More

    Submitted 19 April, 2019; v1 submitted 9 November, 2018; originally announced November 2018.

    Comments: 32 pages

    MSC Class: 49K10; 49M37; 65K05; 68T05; 68W40 ACM Class: F.1.3; F.2.1; G.1.6; I.2.6

  36. arXiv:1811.01220  [pdf, ps, other

    math.OC cs.AI cs.CC math.NA

    Sharp worst-case evaluation complexity bounds for arbitrary-order nonconvex optimization with inexpensive constraints

    Authors: Coralia Cartis, Nick I. M. Gould, Philippe L. Toint

    Abstract: We provide sharp worst-case evaluation complexity bounds for nonconvex minimization problems with general inexpensive constraints, i.e.\ problems where the cost of evaluating/enforcing of the (possibly nonconvex or even disconnected) constraints, if any, is negligible compared to that of evaluating the objective function. These bounds unify, extend or improve all known upper and lower complexity b… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Comments: 30 pages

    MSC Class: 49K10; 49M37; 65K05; 65Y20; 68T05; 68W40 ACM Class: F.1.3, F.2.1, G.1.6, I.2.6

    Journal ref: SIAM Journal on Optimization,, vol. 30(1), pp. 513-541, 2020

  37. arXiv:1807.07476  [pdf, other

    math.NA math.OC

    Minimizing convex quadratic with variable precision conjugate gradients

    Authors: S. Gratton, E. Simon, D. Titley-Peloquin, Ph. L. Toint

    Abstract: We investigate the method of conjugate gradients, exploiting inaccurate matrix-vector products, for the solution of convex quadratic optimization problems. Theoretical performance bounds are derived, and the necessary quantities occurring in the theoretical bounds estimated, leading to a practical algorithm. Numerical experiments suggest that this approach has significant potential, including in t… ▽ More

    Submitted 21 September, 2020; v1 submitted 17 July, 2018; originally announced July 2018.

    MSC Class: 90C20; 65F10; 65G99; ACM Class: F.2.1; G.1.3; B.2.3; B.2.4

  38. arXiv:1711.09407  [pdf, ps, other

    math.OC cs.PF

    A note on using performance and data profilesfor training algorithms

    Authors: Margherita Porcelli, Philippe L. Toint

    Abstract: It is shown how to use the performance and data profile benchmarking tools to improve algorithms' performance. An illustration for the BFO derivative-free optimizer suggests that the obtained gains are potentially significant.

    Submitted 26 November, 2017; originally announced November 2017.

    Comments: 8 pages, 4 tables, 4 figures

    MSC Class: 65K05; 90C56; 90C90 ACM Class: G.4; D.2.2; D.2.8; G.1.6

    Journal ref: Transactions of the AMS on Mathematical Software, vol. 45(2), 2019

  39. arXiv:1709.09031  [pdf, other

    math.NA math.OC

    A note on preconditioning weighted linear least squares, with consequences for weakly-constrained variational data assimilation

    Authors: Serge Gratton, Selime Gürol, Ehouarn Simon, Philippe L. Toint

    Abstract: The effect of preconditioning linear weighted least-squares using an approximation of the model matrix is analyzed, showing the interplay of the eigenstructures of both the model and weighting matrices. A small example is given illustrating the resulting potential inefficiency of such preconditioners. Consequences of these results in the context of the weakly-constrained 4D-Var data assimilation p… ▽ More

    Submitted 26 September, 2017; originally announced September 2017.

    Comments: 10 pages, 2 figures

    MSC Class: 86A5; 86A10; 90C06; 90C30; 15A12 ACM Class: G.1.3; G.1.6

    Journal ref: Quarterly Journal of the Royal Meteorological Society, vol. 144(172), pp. 934--940, 2018

  40. arXiv:1709.07180  [pdf, ps, other

    math.OC

    Worst-case evaluation complexity and optimality of second-order methods for nonconvex smooth optimization

    Authors: Coralia Cartis, Nick I. M. Gould, Philippe L. Toint

    Abstract: We establish or refute the optimality of inexact second-order methods for unconstrained nonconvex optimization from the point of view of worst-case evaluation complexity, improving and generalizing the results of Cartis, Gould and Toint (2010,2011). To this aim, we consider a new general class of inexact second-order algorithms for unconstrained optimization that includes regularization and trust-… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

    Report number: naXys, University of Namur, 2017 MSC Class: 90C60

    Journal ref: Mathematical Programming, vol. 163(1), pp. 359-368, 2017

  41. arXiv:1709.06383  [pdf, ps, other

    math.NA math.OC

    On the use of the saddle formulation in weakly-constrained 4D-VAR data assimilation

    Authors: S. Gratton, S. Gürol, E. Simon, Ph. L. Toint

    Abstract: This paper discusses the practical use of the saddle variational formulation for the weakly-constrained 4D-VAR method in data assimilation. It is shown that the method, in its original form, may produce erratic results or diverge because of the inherent lack of monotonicity of the produced objective function values. Convergent, variationaly coherent variants of the algorithm are then proposed whos… ▽ More

    Submitted 19 September, 2017; originally announced September 2017.

    Journal ref: Quarterly Journal of the Royal Meteorological Society, 144(717), pp. 2792-2602, 2018

  42. arXiv:1708.04044  [pdf, ps, other

    math.OC cs.CC

    Improved second-order evaluation complexity for unconstrained nonlinear optimization using high-order regularized models

    Authors: Coralia Cartis, Nicholas I. M. Gould, Philippe L. Toint

    Abstract: The unconstrained minimization of a sufficiently smooth objective function $f(x)$ is considered, for which derivatives up to order $p$, $p\geq 2$, are assumed to be available. An adaptive regularization algorithm is proposed that uses Taylor models of the objective of order $p$ and that is guaranteed to find a first- and second-order critical point in at most… ▽ More

    Submitted 14 August, 2017; originally announced August 2017.

    Journal ref: Optimization methods and Software, vol. 35(2), pp. 243-256, 2020

  43. arXiv:1705.07285  [pdf, ps, other

    math.OC cs.CC math.NA

    Optimality of orders one to three and beyond: characterization and evaluation complexity in constrained nonconvex optimization

    Authors: C. Cartis, N. I. M. Gould, Ph. L. Toint

    Abstract: Necessary conditions for high-order optimality in smooth nonlinear constrained optimization are explored and their inherent intricacy discussed. A two-phase minimization algorithm is proposed which can achieve approximate first-, second- and third-order criticality and its evaluation complexity is analyzed as a function of the choice (among existing methods) of an inner algorithm for solving subpr… ▽ More

    Submitted 7 January, 2018; v1 submitted 20 May, 2017; originally announced May 2017.

    Comments: 32 pages, 3 figures

    MSC Class: 90C26; 90C46; 90C30 ACM Class: F.2.1; G.1.6

    Journal ref: Journal of Complexity, vol. 53, pp. 68-94, 2019

  44. arXiv:1705.04895  [pdf, other

    math.OC cs.CC

    Evaluation complexity bounds for smooth constrained nonlinear optimisation using scaled KKT conditions, high-order models and the criticality measure $χ$

    Authors: Coralia Cartis, Nick Gould, Philippe L Toint

    Abstract: Evaluation complexity for convexly constrained optimization is considered and it is shown first that the complexity bound of $O(ε^{-3/2})$ proved by Cartis, Gould and Toint (IMAJNA 32(4) 2012, pp.1662-1695) for computing an $ε$-approximate first-order critical point can be obtained under significantly weaker assumptions. Moreover, the result is generalized to the case where high-order derivatives… ▽ More

    Submitted 13 May, 2017; originally announced May 2017.