Showing 1–2 of 2 results for author: Welzel, K

Search v0.5.6 released 2020-02-24

arXiv:2501.00404 [pdf, other]

math.OC

Efficient Implementation of Third-order Tensor Methods with Adaptive Regularization for Unconstrained Optimization

Authors: Coralia Cartis, Raphael Hauser, Yang Liu, Karl Welzel, Wenqi Zhu

Abstract: High-order tensor methods that employ local Taylor models of degree $p$ within adaptive regularization frameworks (AR$p$) have recently received significant attention, due to their optimal global and local rates of convergence for both convex and nonconvex optimization problems. However, their numerical performance for general unconstrained optimization problems remains insufficiently explored, which we address by showcasing the numerical performance of standard second- and third-order variants ($p=2,3$) and proposing novel techniques for key algorithmic aspects when $p\geq3$ to improve numerical efficiency. To improve the adaptive choice of the regularization parameter, we extend the interpolation-based updating strategy introduced in (Gould, Porcelli, and Toint, 2012) for $p=2$ to $p\geq3$. We identify fundamental differences between the local minima of regularized subproblems for $p=2$ and $p\geq3$ and their effect on performance. Then, for $p\geq3$, we introduce a novel pre-rejection technique that rejects poor subproblem minimizers (referred to as `transient') before any function evaluation, reducing cost and selecting useful (`persistent') ones. Numerical studies confirm efficiency improvements in our modified AR$3$ algorithm. We also assess the effect of different subproblem termination conditions and the choice of the initial regularization parameter on overall performance. Finally, we benchmark our best-performing AR$3$ variants, along with those in (Birgin et al., 2020), against second-order ones (AR$2$). Encouraging results on standard test problems confirm that AR$3$ variants can outperform AR$2$ in terms of objective evaluations, derivative evaluations, and subproblem solves. We provide an efficient, extensive, and modular MATLAB software package including various AR$2$ and AR$3$ variants, allowing ease of use and experimentation for interested users.

Submitted 28 February, 2025; v1 submitted 31 December, 2024; originally announced January 2025.

arXiv:2301.11678 [pdf, ps, other]

math.OC

doi 10.1137/23M1549687

Approximating Higher-Order Derivative Tensors Using Secant Updates

Authors: Karl Welzel, Raphael A. Hauser

Abstract: Quasi-Newton methods employ an update rule that gradually improves the Hessian approximation using the already available gradient evaluations. We propose higher-order secant updates which generalize this idea to higher-order derivatives, approximating for example third derivatives (which are tensors) from given Hessian evaluations. Our generalization is based on the observation that quasi-Newton updates are least-change updates satisfying the secant equation, with different methods using different norms to measure the size of the change. We present a full characterization for least-change updates in weighted Frobenius norms (satisfying an analogue of the secant equation) for derivatives of arbitrary order. Moreover, we establish convergence of the approximations to the true derivative under standard assumptions and explore the quality of the generated approximations in numerical experiments.

Submitted 15 August, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

MSC Class: 90C53; 65D25

Journal ref: SIAM Journal on Optimization Vol. 34 Iss. 1 (2024) pp. 893-917
