-
An adaptive importance sampling algorithm for risk-averse optimization
Authors:
Sandra Pieraccini,
Tommaso Vanzan
Abstract:
Adaptive sampling algorithms are modern and efficient methods that dynamically adjust the sample size throughout the optimization process. However, they may encounter difficulties in risk-averse settings, particularly due to the challenge of accurately sampling from the tails of the underlying distribution of random inputs. This often leads to a much faster growth of the sample size compared to ri…
▽ More
Adaptive sampling algorithms are modern and efficient methods that dynamically adjust the sample size throughout the optimization process. However, they may encounter difficulties in risk-averse settings, particularly due to the challenge of accurately sampling from the tails of the underlying distribution of random inputs. This often leads to a much faster growth of the sample size compared to risk-neutral problems. In this work, we propose a novel adaptive sampling algorithm that adapts both the sample size and the sampling distribution at each iteration. The biasing distributions are constructed on the fly, leveraging a reduced-order model of the objective function to be minimized, and are designed to oversample a so-called risk region. As a result, a reduction of the variance of the gradients is achieved, which permits to use fewer samples per iteration compared to a standard algorithm, while still preserving the asymptotic convergence rate. Our focus is on the minimization of the Conditional Value-at-Risk (CVaR), and we establish the convergence of the proposed computational framework. Numerical experiments confirm the substantial computational savings achieved by our approach.
△ Less
Submitted 14 February, 2025;
originally announced February 2025.
-
Variable reduction as a nonlinear preconditioning approach for optimization problems
Authors:
Gabriele Ciaremalla,
Tommaso Vanzan
Abstract:
When considering an unconstrained minimization problem, a standard approach is to solve the optimality system with a Newton method possibly preconditioned by, e.g., nonlinear elimination. In this contribution, we argue that nonlinear elimination could be used to reduce the number of optimization variables by artificially constraining them to satisfy a subset of the optimality conditions. Consequen…
▽ More
When considering an unconstrained minimization problem, a standard approach is to solve the optimality system with a Newton method possibly preconditioned by, e.g., nonlinear elimination. In this contribution, we argue that nonlinear elimination could be used to reduce the number of optimization variables by artificially constraining them to satisfy a subset of the optimality conditions. Consequently, a reduced objective function is derived which can now be minimized with any optimization algorithm. By choosing suitable variables to eliminate, the conditioning of the reduced optimization problem is largely improved. We here focus in particular on a right preconditioned gradient descent and show theoretical and numerical results supporting the validity of the presented approach.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Nonlinear Schwarz methods to compute geodesics on manifolds
Authors:
Marco Sutti,
Tommaso Vanzan
Abstract:
We consider the leapfrog algorithm by Noakes for computing geodesics on Riemannian manifolds. The main idea behind this algorithm is to subdivide the original endpoint geodesic problem into several local problems, for which the endpoint geodesic problem can be solved more easily by any local method (e.g., the single shooting method). The algorithm then iteratively updates a piecewise geodesic to o…
▽ More
We consider the leapfrog algorithm by Noakes for computing geodesics on Riemannian manifolds. The main idea behind this algorithm is to subdivide the original endpoint geodesic problem into several local problems, for which the endpoint geodesic problem can be solved more easily by any local method (e.g., the single shooting method). The algorithm then iteratively updates a piecewise geodesic to obtain a global geodesic between the original endpoints. From a domain decomposition perspective, we show that the leapfrog algorithm can be viewed as a classical Schwarz alternating method. Thanks to this analogy, we use techniques from nonlinear preconditioning to improve the convergence properties of the method. Preliminary numerical experiments suggest that this is a promising approach.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Multilevel quadrature formulae for the optimal control of random PDEs
Authors:
Fabio Nobile,
Tommaso Vanzan
Abstract:
This manuscript presents a framework for using multilevel quadrature formulae to compute the solution of optimal control problems constrained by random partial differential equations. Our approach consists in solving a sequence of optimal control problems discretized with different levels of accuracy of the physical and probability discretizations. The final approximation of the control is then ob…
▽ More
This manuscript presents a framework for using multilevel quadrature formulae to compute the solution of optimal control problems constrained by random partial differential equations. Our approach consists in solving a sequence of optimal control problems discretized with different levels of accuracy of the physical and probability discretizations. The final approximation of the control is then obtained in a postprocessing step, by suitably combining the adjoint variables computed on the different levels. We present a general convergence and complexity analysis for an unconstrained linear quadratic problem under abstract assumptions on the spatial discretization and on the quadrature formulae. We detail our framework for the specific case of a MultiLevel Monte Carlo (MLMC) quadrature formula, and numerical experiments confirm the better computational complexity of our MLMC approach compared to a standard Monte Carlo sample average approximation, even beyond the theoretical assumptions.
△ Less
Submitted 16 May, 2025; v1 submitted 9 July, 2024;
originally announced July 2024.
-
Optimized Schwarz methods for the time-dependent Stokes-Darcy coupling
Authors:
Marco Discacciati,
Tommaso Vanzan
Abstract:
This paper derives optimal coefficients for optimized Schwarz iterations for the time-dependent Stokes-Darcy problem using an innovative strategy to solve a nonstandard min-max problem. The coefficients take into account both physical and discretization parameters that characterize the coupled problem, and they guarantee the robustness of the associated domain decomposition method. Numerical resul…
▽ More
This paper derives optimal coefficients for optimized Schwarz iterations for the time-dependent Stokes-Darcy problem using an innovative strategy to solve a nonstandard min-max problem. The coefficients take into account both physical and discretization parameters that characterize the coupled problem, and they guarantee the robustness of the associated domain decomposition method. Numerical results validate the proposed approach in several test cases with physically relevant parameters.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Robust optimization of control parameters for WEC arrays using stochastic methods
Authors:
Marco Gambarini,
Gabriele Ciaramella,
Edie Miglio,
Tommaso Vanzan
Abstract:
This work presents a new computational optimization framework for the robust control of parks of Wave Energy Converters (WEC) in irregular waves. The power of WEC parks is maximized with respect to the individual control damping and stiffness coefficients of each device. The results are robust with respect to the incident wave direction, which is treated as a random variable. Hydrodynamic properti…
▽ More
This work presents a new computational optimization framework for the robust control of parks of Wave Energy Converters (WEC) in irregular waves. The power of WEC parks is maximized with respect to the individual control damping and stiffness coefficients of each device. The results are robust with respect to the incident wave direction, which is treated as a random variable. Hydrodynamic properties are computed using the linear potential model, and the dynamics of the system is computed in the frequency domain. A slamming constraint is enforced to ensure that the results are physically realistic. We show that the stochastic optimization problem is well posed. Two optimization approaches for dealing with stochasticity are then considered: stochastic approximation and sample average approximation. The outcomes of the above mentioned methods in terms of accuracy and computational time are presented. The results of the optimization for complex and realistic array configurations of possible engineering interest are then discussed. Results of extensive numerical experiments demonstrate the efficiency of the proposed computational framework.
△ Less
Submitted 29 August, 2023; v1 submitted 6 May, 2023;
originally announced May 2023.
-
A multigrid solver for PDE-constrained optimization with uncertain inputs
Authors:
Gabriele Ciaramella,
Fabio Nobile,
Tommaso Vanzan
Abstract:
In this manuscript, we present a collective multigrid algorithm to solve efficiently the large saddle-point systems of equations that typically arise in PDE-constrained optimization under uncertainty, and develop a novel convergence analysis of collective smoothers and collective two-level methods. The multigrid algorithm is based on a collective smoother that at each iteration sweeps over the nod…
▽ More
In this manuscript, we present a collective multigrid algorithm to solve efficiently the large saddle-point systems of equations that typically arise in PDE-constrained optimization under uncertainty, and develop a novel convergence analysis of collective smoothers and collective two-level methods. The multigrid algorithm is based on a collective smoother that at each iteration sweeps over the nodes of the computational mesh, and solves a reduced saddle-point system whose size is proportional to the number $N$ of samples used to discretized the probability space. We show that this reduced system can be solved with optimal $O(N)$ complexity.
The multigrid method is tested both as a stationary method and as a preconditioner for GMRES on three problems: a linear-quadratic problem, possibly with a local or a boundary control, for which the multigrid method is used to solve directly the linear optimality system; a nonsmooth problem with box constraints and $L^1$-norm penalization on the control, in which the multigrid scheme is used as an inner solver within a semismooth Newton iteration; a risk-averse problem with the smoothed CVaR risk measure where the multigrid method is called within a preconditioned Newton iteration. In all cases, the multigrid algorithm exhibits excellent performances and robustness with respect to the parameters of interest.
△ Less
Submitted 17 May, 2024; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Weak scalability of domain decomposition methods for discrete fracture networks
Authors:
Stefano Berrone,
Tommaso Vanzan
Abstract:
Discrete Fracture Networks (DFNs) are complex three-dimensional structures characterized by the intersections of planar polygonal fractures, and are used to model flows in fractured media. Despite being suitable for Domain Decomposition (DD) techniques, there are relatively few works on the application of DD methods to DFNs. In this manuscript, we present a theoretical study of Optimized Schwarz M…
▽ More
Discrete Fracture Networks (DFNs) are complex three-dimensional structures characterized by the intersections of planar polygonal fractures, and are used to model flows in fractured media. Despite being suitable for Domain Decomposition (DD) techniques, there are relatively few works on the application of DD methods to DFNs. In this manuscript, we present a theoretical study of Optimized Schwarz Methods (OSMs) applied to DFNs. Interestingly, we prove that the OSMs can be weakly scalable (that is, they converge to a given tolerance in a number of iterations independent of the number of fractures) under suitable assumptions on the domain decomposition. This contribution fits in the renewed interest on the weak scalability of DD methods after recent works showed weak scalability of DD methods for specific geometric configurations, even without coarse spaces. Despite simplifying assumptions which may be violated in practice, our analysis provides heuristics to minimize the computational efforts in realistic settings. Finally, we emphasize that the methodology proposed can be straightforwardly generalized to study other classical DD methods applied to DFNs.
△ Less
Submitted 22 November, 2022;
originally announced November 2022.
-
A combination technique for optimal control problems constrained by random PDEs
Authors:
Fabio Nobile,
Tommaso Vanzan
Abstract:
We present a combination technique based on mixed differences of both spatial approximations and quadrature formulae for the stochastic variables to solve efficiently a class of Optimal Control Problems (OCPs) constrained by random partial differential equations. The method requires to solve the OCP for several low-fidelity spatial grids and quadrature formulae for the objective functional. All th…
▽ More
We present a combination technique based on mixed differences of both spatial approximations and quadrature formulae for the stochastic variables to solve efficiently a class of Optimal Control Problems (OCPs) constrained by random partial differential equations. The method requires to solve the OCP for several low-fidelity spatial grids and quadrature formulae for the objective functional. All the computed solutions are then linearly combined to get a final approximation which, under suitable regularity assumptions, preserves the same accuracy of fine tensor product approximations, while drastically reducing the computational cost. The combination technique involves only tensor product quadrature formulae, thus the discretized OCPs preserve the (possible) convexity of the continuous OCP. Hence, the combination technique avoids the inconveniences of Multilevel Monte Carlo and/or sparse grids approaches, but remains suitable for high dimensional problems. The manuscript presents an a-priori procedure to choose the most important mixed differences and an asymptotic complexity analysis, which states that the asymptotic complexity is exclusively determined by the spatial solver. Numerical experiments validate the results.
△ Less
Submitted 28 March, 2024; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Preconditioners for robust optimal control problems under uncertainty
Authors:
Fabio Nobile,
Tommaso Vanzan
Abstract:
The discretization of robust quadratic optimal control problems under uncertainty using the finite element method and the stochastic collocation method leads to large saddle-point systems, which are fully coupled across the random realizations. Despite its relevance for numerous engineering problems, the solution of such systems is notoriusly challenging. In this manuscript, we study efficient pre…
▽ More
The discretization of robust quadratic optimal control problems under uncertainty using the finite element method and the stochastic collocation method leads to large saddle-point systems, which are fully coupled across the random realizations. Despite its relevance for numerous engineering problems, the solution of such systems is notoriusly challenging. In this manuscript, we study efficient preconditioners for all-at-once approaches using both an algebraic and an operator preconditioning framework. We show in particular that for values of the regularization parameter not too small, the saddle-point system can be efficiently solved by preconditioning in parallel all the state and adjoint equations. For small values of the regularization parameter, robustness can be recovered by the additional solution of a small linear system, which however couples all realizations. A mean approximation and a Chebyshev semi-iterative method are investigated to solve this reduced system. Our analysis considers a random elliptic partial differential equation whose diffusion coefficient $κ(x,ω)$ is modeled as an almost surely continuous and positive random field, though not necessarily uniformly bounded and coercive. We further provide estimates on the dependence of the preconditioned system on the variance of the random field. Such estimates involve either the first or second moment of the random variables $1/\min_{x\in \overline{D}} κ(x,ω)$ and $\max_{x\in \overline{D}}κ(x,ω)$, where $D$ is the spatial domain. The theoretical results are confirmed by numerical experiments, and implementation details are further addressed.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Linear and nonlinear substructured Restricted Additive Schwarz iterations and preconditioning
Authors:
Faycal Chaouqui,
Martin J. Gander,
Pratik M. Kumbhar,
Tommaso Vanzan
Abstract:
Substructured domain decomposition (DD) methods have been extensively studied, and they are usually associated with nonoverlapping decompositions. We introduce here a substructured version of Restricted Additive Schwarz (RAS) which we call SRAS, and we discuss its advantages compared to the standard volume formulation of the Schwarz method when they are used both as iterative solvers and precondit…
▽ More
Substructured domain decomposition (DD) methods have been extensively studied, and they are usually associated with nonoverlapping decompositions. We introduce here a substructured version of Restricted Additive Schwarz (RAS) which we call SRAS, and we discuss its advantages compared to the standard volume formulation of the Schwarz method when they are used both as iterative solvers and preconditioners for a Krylov method. To extend SRAS to nonlinear problems, we introduce SRASPEN (Substructured Restricted Additive Schwarz Preconditioned Exact Newton), where SRAS is used as a preconditioner for Newton's method. We study carefully the impact of substructuring on the convergence and performance of these methods as well as their implementations. We finally introduce two-level versions of nonlinear SRAS and SRASPEN. Numerical experiments confirm the advantages of formulating a Schwarz method at the substructured level.
△ Less
Submitted 31 March, 2021;
originally announced March 2021.
-
On the nonlinear Dirichlet-Neumann method and preconditioner for Newton's method
Authors:
Faycal Chaouqui,
Martin J. Gander,
Pratik M. Kumbhar,
Tommaso Vanzan
Abstract:
The Dirichlet-Neumann (DN) method has been extensively studied for linear partial differential equations, while little attention has been devoted to the nonlinear case. In this paper, we analyze the DN method both as a nonlinear iterative method and as a preconditioner for Newton's method. We discuss the nilpotent property and prove that under special conditions, there exists a relaxation paramete…
▽ More
The Dirichlet-Neumann (DN) method has been extensively studied for linear partial differential equations, while little attention has been devoted to the nonlinear case. In this paper, we analyze the DN method both as a nonlinear iterative method and as a preconditioner for Newton's method. We discuss the nilpotent property and prove that under special conditions, there exists a relaxation parameter such that the DN method converges quadratically. We further prove that the convergence of Newton's method preconditioned by the DN method is independent of the relaxation parameter. Our numerical experiments further illustrate the mesh independent convergence of the DN method and compare it with other standard nonlinear preconditioners.
△ Less
Submitted 12 April, 2022; v1 submitted 22 March, 2021;
originally announced March 2021.
-
On the asymptotic optimality of spectral coarse spaces
Authors:
Gabriele Ciaramella,
Tommaso Vanzan
Abstract:
This paper is concerned with the asymptotic optimality of spectral coarse spaces for two-level iterative methods. Spectral coarse spaces, namely coarse spaces obtained as the span of the slowest modes of the used one-level smoother, are known to be very efficient and, in some cases, optimal. However, the results of this paper show that spectral coarse spaces do not necessarily minimize the asympto…
▽ More
This paper is concerned with the asymptotic optimality of spectral coarse spaces for two-level iterative methods. Spectral coarse spaces, namely coarse spaces obtained as the span of the slowest modes of the used one-level smoother, are known to be very efficient and, in some cases, optimal. However, the results of this paper show that spectral coarse spaces do not necessarily minimize the asymptotic contraction factor of a two-level iterative method. Moreover, numerical experiments show that there exist coarse spaces that are asymptotically more efficient and lead to preconditioned systems with improved conditioning properties.
△ Less
Submitted 17 March, 2021;
originally announced March 2021.
-
A numerical algorithm based on probing to find optimized transmission conditions
Authors:
Martin J. Gander,
Roland Masson,
Tommaso Vanzan
Abstract:
Optimized Schwarz Methods (OSMs) are based on optimized transmission conditions along the interfaces between the subdomains. Optimized transmission conditions are derived at the theoretical level, using techniques developed in the last decades. The hypothesis behind these analyses are quite strong, so that the applicability of OSMs is still limited. In this manuscript, we present a numerical algor…
▽ More
Optimized Schwarz Methods (OSMs) are based on optimized transmission conditions along the interfaces between the subdomains. Optimized transmission conditions are derived at the theoretical level, using techniques developed in the last decades. The hypothesis behind these analyses are quite strong, so that the applicability of OSMs is still limited. In this manuscript, we present a numerical algorithm to obtain optimized transmission conditions for any given problem at hand. This algorithm requires few subdomain solves to be performed in an offline phase. This additional cost is usually negligible due to the resulting faster convergence, even in a single-query context.
△ Less
Submitted 19 August, 2021; v1 submitted 17 March, 2021;
originally announced March 2021.
-
Spectral substructured two-level domain decomposition methods
Authors:
Gabriele Ciaramella,
Tommaso Vanzan
Abstract:
Two-level domain decomposition (DD) methods are very powerful techniques for the efficient numerical solution of partial differential equations (PDEs). A two-level domain decomposition method requires two main components: a one-level preconditioner (or its corresponding smoothing iterative method), which is based on domain decomposition techniques, and a coarse correction step, which relies on a c…
▽ More
Two-level domain decomposition (DD) methods are very powerful techniques for the efficient numerical solution of partial differential equations (PDEs). A two-level domain decomposition method requires two main components: a one-level preconditioner (or its corresponding smoothing iterative method), which is based on domain decomposition techniques, and a coarse correction step, which relies on a coarse space. The coarse space must properly represent the error components that the chosen one-level method is not capable to deal with. In the literature most of the works introduced efficient coarse spaces obtained as the span of functions defined on the entire space domain of the considered PDE. Therefore, the corresponding two-level preconditioners and iterative methods are defined in volume.
In this paper, a new class of substructured two-level methods is introduced,for which both domain decomposition smoothers and coarse correction steps are defined on the interfaces (or skeletons). This approach has several advantages. On the one hand, the required computational effort is cheaper than the one required by classical volumetric two-level methods. On the other hand, it allows one to use some of the well-known efficient coarse spaces proposed in the literature. While analyzing in detail the new substructured methods, we present a new convergence analysis for two-level iterative methods, which covers the proposed substructured framework. Further, we study the asymptotic optimality of coarse spaces both theoretically and numerically using deep neural networks. Numerical experiments demonstrate the effectiveness of the proposed new numerical framework.
△ Less
Submitted 21 April, 2021; v1 submitted 15 August, 2019;
originally announced August 2019.