-
Deterministic Kalman filters for uncertain dynamical systems
Authors:
Karl Kunisch,
Jesper Schröder
Abstract:
The Kalman(-Bucy) filter is the natural choice for the state reconstruction of disturbed, linear dynamical systems based on flawed and incomplete measurements. Taking a deterministic viewpoint this work investigates possible extensions of the concept to systems with uncertain dynamics and noise covariances. In a theoretical analysis error bounds in terms of the variance of the uncertainties are de…
▽ More
The Kalman(-Bucy) filter is the natural choice for the state reconstruction of disturbed, linear dynamical systems based on flawed and incomplete measurements. Taking a deterministic viewpoint this work investigates possible extensions of the concept to systems with uncertain dynamics and noise covariances. In a theoretical analysis error bounds in terms of the variance of the uncertainties are derived. The article concludes with a numerical implementation of two example systems allowing for a comparison of the estimators.
△ Less
Submitted 31 May, 2025;
originally announced June 2025.
-
Pareto-optimal treatment of uncertainties in model-based process design and operation
Authors:
Jan Schwientek,
Katrin Teichert,
Jan Schröder,
Johannes Höller,
Norbert Asprion,
Pascal Schäfer,
Martin Wlotzka,
Michael Bortz
Abstract:
Model-based process design and operation involves here-and-now and wait-and-see decisions. Here-and-now decisions include design variables like the size of heat exchangers or the height of distillation columns, whereas wait-and-see decisions are directed towards operational variables like reflux and split ratios. In this contribution, we describe how to deal with these different types of decisions…
▽ More
Model-based process design and operation involves here-and-now and wait-and-see decisions. Here-and-now decisions include design variables like the size of heat exchangers or the height of distillation columns, whereas wait-and-see decisions are directed towards operational variables like reflux and split ratios. In this contribution, we describe how to deal with these different types of decisions in a multicriteria framework, offering an adjustability for the wait-and-see variables while at the same time respecting optimality guarantees on process KPIs.
△ Less
Submitted 25 March, 2025;
originally announced March 2025.
-
Large-scale Thermo-Mechanical Simulation of Laser Beam Welding Using High-Performance Computing: A Qualitative Reproduction of Experimental Results
Authors:
Tommaso Bevilacqua,
Andrey Gumenyuk,
Niloufar Habibi,
Philipp Hartwig,
Axel Klawonn,
Martin Lanser,
Michael Rethmeier,
Lisa Scheunemann,
Jörg Schröder
Abstract:
Laser beam welding is a non-contact joining technique that has gained significant importance in the course of the increasing degree of automation in industrial manufacturing. This process has established itself as a suitable joining tool for metallic materials due to its non-contact processing, short cycle times, and small heat-affected zones. One potential problem, however, is the formation of so…
▽ More
Laser beam welding is a non-contact joining technique that has gained significant importance in the course of the increasing degree of automation in industrial manufacturing. This process has established itself as a suitable joining tool for metallic materials due to its non-contact processing, short cycle times, and small heat-affected zones. One potential problem, however, is the formation of solidification cracks, which particularly affects alloys with a pronounced melting range. Since solidification cracking is influenced by both temperature and strain rate, precise measurement technologies are of crucial importance. For this purpose, as an experimental setup, a Controlled Tensile Weldability (CTW) test combined with a local deformation measurement technique is used.
The aim of the present work is the development of computational methods and software tools to numerically simulate the CTW. The numerical results are compared with those obtained from the experimental CTW. In this study, an austenitic stainless steel sheet is selected. A thermo-elastoplastic material behavior with temperature-dependent material parameters is assumed. The time-dependent problem is first discretized in time and then the resulting nonlinear problem is linearized with Newton's method. For the discretization in space, finite elements are used. In order to obtain a sufficiently accurate solution, a large number of finite elements has to be used. In each Newton step, this yields a large linear system of equations that has to be solved. Therefore, a highly parallel scalable solver framework, based on the software library PETSc, was used to solve this computationally challenging problem on a high-performance computing architecture. Finally, the experimental results and the numerical simulations are compared, showing to be qualitatively in good agreement.
△ Less
Submitted 12 March, 2025;
originally announced March 2025.
-
Nodal AMG Coarsening and Interpolation for PDE Systems
Authors:
James Brannick,
Robert Falgout,
Karsten Kahl,
Jacob Schroder,
Taoli Shen
Abstract:
We present an approach to constructing a practical coarsening algorithm and interpolation operator for the algebraic multigrid (AMG) method, tailored towards systems of partial differential equations (PDEs) with large near-kernels, such as H(curl) and H(div). Our method builds on compatible relaxation (CR) and the ideal interpolation model within the generalized AMG (GAMG) framework but introduces…
▽ More
We present an approach to constructing a practical coarsening algorithm and interpolation operator for the algebraic multigrid (AMG) method, tailored towards systems of partial differential equations (PDEs) with large near-kernels, such as H(curl) and H(div). Our method builds on compatible relaxation (CR) and the ideal interpolation model within the generalized AMG (GAMG) framework but introduces several modifications to define an AMG method for PDE systems. We construct an interpolation operator through a coarsening process that first coarsens a nodal dual problem and then builds the coarse and fine variables using a matching algorithm. Our interpolation follows the ideal formulation; however, we enhance the sparsity of ideal interpolation by decoupling the fine and coarse variables completely. When the coarse variables align with the geometric refinement, our method reproduces re-discretization on unstructured meshes. Together with an automatic smoother construction scheme that identifies the local near kernels, our approach forms a complete two-grid method. Finally, we also show numerical results that demonstrate the effectiveness of this interpolation scheme by applying it to targeted problems and the Stokes system.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
Generalized Optimal AMG Convergence Theory for Stokes Equations Using Smooth Aggregation and Vanka Relaxation Strategies
Authors:
Ahsan Ali,
James J. Brannick,
Karsten Kahl,
Oliver A. Krzysik,
Jacob B. Schroder,
Ben S. Southworth,
Alexey Voronin
Abstract:
This paper discusses our recent generalized optimal algebraic multigrid (AMG) convergence theory applied to the steady-state Stokes equations discretized using Taylor-Hood elements ($\pmb{ \mathbb{P}}_2/\mathbb{P}_{1}$). The generalized theory is founded on matrix-induced orthogonality of the left and right eigenvectors of a generalized eigenvalue problem involving the system matrix and relaxation…
▽ More
This paper discusses our recent generalized optimal algebraic multigrid (AMG) convergence theory applied to the steady-state Stokes equations discretized using Taylor-Hood elements ($\pmb{ \mathbb{P}}_2/\mathbb{P}_{1}$). The generalized theory is founded on matrix-induced orthogonality of the left and right eigenvectors of a generalized eigenvalue problem involving the system matrix and relaxation operator. This framework establishes a rigorous lower bound on the spectral radius of the two-grid error-propagation operator, enabling precise predictions of the convergence rate for symmetric indefinite problems, such as those arising from saddle-point systems. We apply this theory to the recently developed monolithic smooth aggregation AMG (SA-AMG) solver for Stokes, constructed using evolution-based strength of connection, standard aggregation, and smoothed prolongation. The performance of these solvers is evaluated using additive and multiplicative Vanka relaxation strategies. Additive Vanka relaxation constructs patches algebraically on each level, resulting in a nonsymmetric relaxation operator due to the partition of unity being applied on one side of the block-diagonal matrix. Although symmetry can be restored by eliminating the partition of unity, this compromises convergence. Alternatively, multiplicative Vanka relaxation updates velocity and pressure sequentially within each patch, propagating updates multiplicatively across the domain and effectively addressing velocity-pressure coupling, ensuring a symmetric relaxation. We demonstrate that the generalized optimal AMG theory consistently provides accurate lower bounds on the convergence rate for SA-AMG applied to Stokes equations. These findings suggest potential avenues for further enhancement in AMG solver design for saddle-point systems.
△ Less
Submitted 11 January, 2025;
originally announced January 2025.
-
Local well-posedness of the minimum energy estimator for a defocusing cubic wave equation
Authors:
Jesper Schröder
Abstract:
This work is concerned with the minimum energy estimator for a nonlinear hyperbolic partial differential equation. The Mortensen observer - originally introduced for the energy-optimal reconstruction of the state of nonlinear finite-dimensional systems - is formulated for a disturbed cubic wave equation and the associated observer equation is derived. An in depth study of the associated optimal co…
▽ More
This work is concerned with the minimum energy estimator for a nonlinear hyperbolic partial differential equation. The Mortensen observer - originally introduced for the energy-optimal reconstruction of the state of nonlinear finite-dimensional systems - is formulated for a disturbed cubic wave equation and the associated observer equation is derived. An in depth study of the associated optimal control problem and sensitivity analysis of the corresponding value function reveals that the energy optimal state estimator is well-defined. Deploying a classical fixed point argument we proceed to show that the observer equation is locally well-posed.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Immunity to Increasing Condition Numbers of Linear Superiorization versus Linear Programming
Authors:
Jan Schröder,
Yair Censor,
Philipp Süss,
Karl-Heinz Küfer
Abstract:
Given a family of linear constraints and a linear objective function one can consider whether to apply a Linear Programming (LP) algorithm or use a Linear Superiorization (LinSup) algorithm on this data. In the LP methodology one aims at finding a point that fulfills the constraints and has the minimal value of the objective function over these constraints. The Linear Superiorization approach cons…
▽ More
Given a family of linear constraints and a linear objective function one can consider whether to apply a Linear Programming (LP) algorithm or use a Linear Superiorization (LinSup) algorithm on this data. In the LP methodology one aims at finding a point that fulfills the constraints and has the minimal value of the objective function over these constraints. The Linear Superiorization approach considers the same data as linear programming problems but instead of attempting to solve those with linear optimization methods it employs perturbation resilient feasibility-seeking algorithms and steers them toward feasible points with reduced (not necessarily minimal) objective function values. Previous studies compared LP and LinSup in terms of their respective outputs and the resources they use. In this paper we investigate these two approaches in terms of their sensitivity to condition numbers of the system of linear constraints. Condition numbers are a measure for the impact of deviations in the input data on the output of a problem and, in particular, they describe the factor of error propagation when given wrong or erroneous data. Therefore, the ability of LP and LinSup to cope with increased condition numbers, thus with ill-posed problems, is an important matter to consider which was not studied until now. We investigate experimentally the advantages and disadvantages of both LP and LinSup on examplary problems of linear programming with multiple condition numbers and different problem dimensions.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
Parallel-in-time solution of hyperbolic PDE systems via characteristic-variable block preconditioning
Authors:
H. De Sterck,
R. D. Falgout,
O. A. Krzysik,
J. B. Schroder
Abstract:
We consider the parallel-in-time solution of hyperbolic partial differential equation (PDE) systems in one spatial dimension, both linear and nonlinear. In the nonlinear setting, the discretized equations are solved with a preconditioned residual iteration based on a global linearization. The linear(ized) equation systems are approximately solved parallel-in-time using a block preconditioner appli…
▽ More
We consider the parallel-in-time solution of hyperbolic partial differential equation (PDE) systems in one spatial dimension, both linear and nonlinear. In the nonlinear setting, the discretized equations are solved with a preconditioned residual iteration based on a global linearization. The linear(ized) equation systems are approximately solved parallel-in-time using a block preconditioner applied in the characteristic variables of the underlying linear(ized) hyperbolic PDE. This change of variables is motivated by the observation that inter-variable coupling for characteristic variables is weak relative to intra-variable coupling, at least locally where spatio-temporal variations in the eigenvectors of the associated flux Jacobian are sufficiently small. For an $\ell$-dimensional system of PDEs, applying the preconditioner consists of solving a sequence of $\ell$ scalar linear(ized)-advection-like problems, each being associated with a different characteristic wave-speed in the underlying linear(ized) PDE. We approximately solve these linear advection problems using multigrid reduction-in-time (MGRIT); however, any other suitable parallel-in-time method could be used. Numerical examples are shown for the (linear) acoustics equations in heterogeneous media, and for the (nonlinear) shallow water equations and Euler equations of gas dynamics with shocks and rarefactions.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Computing $\vec{\mathcal{S}}$-DAGs and Parity Games
Authors:
Meike Hatzel,
Johannes Schröder
Abstract:
Treewidth on undirected graphs is known to have many algorithmic applications. When considering directed width-measures there are much less results on their deployment for algorithmic results. In 2022 the first author, Rabinovich and Wiederrecht introduced a new directed width measure, $\vec{\mathcal{S}}$-DAG-width, using directed separations and obtained a structural duality for it. In 2012 Berwa…
▽ More
Treewidth on undirected graphs is known to have many algorithmic applications. When considering directed width-measures there are much less results on their deployment for algorithmic results. In 2022 the first author, Rabinovich and Wiederrecht introduced a new directed width measure, $\vec{\mathcal{S}}$-DAG-width, using directed separations and obtained a structural duality for it. In 2012 Berwanger~et~al.~solved Parity Games in polynomial time on digraphs of bounded DAG-width. With generalising this result to digraphs of bounded $\vec{\mathcal{S}}$-DAG-width and also providing an algorithm to compute the $\vec{\mathcal{S}}$-DAG-width of a given digraphs we give first algorithmical results for this new parameter.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
A computational approach to identify the material parameters of the relaxed micromorphic model
Authors:
Mohammad Sarhil,
Lisa Scheunemann,
Peter Lewintan,
Jörg Schröder,
Patrizio Neff
Abstract:
We determine the material parameters in the relaxed micromorphic generalized continuum model for a given periodic microstructure in this work. This is achieved through a least squares fitting of the total energy of the relaxed micromorphic homogeneous continuum to the total energy of the fully-resolved heterogeneous microstructure, governed by classical linear elasticity. The relaxed micromorphic…
▽ More
We determine the material parameters in the relaxed micromorphic generalized continuum model for a given periodic microstructure in this work. This is achieved through a least squares fitting of the total energy of the relaxed micromorphic homogeneous continuum to the total energy of the fully-resolved heterogeneous microstructure, governed by classical linear elasticity. The relaxed micromorphic model is a generalized continuum that utilizes the $\Curl$ of a micro-distortion field instead of its full gradient as in the classical micromorphic theory, leading to several advantages and differences. The most crucial advantage is that it operates between two well-defined scales. These scales are determined by linear elasticity with microscopic and macroscopic elasticity tensors, which respectively bound the stiffness of the relaxed micromorphic continuum from above and below. While the macroscopic elasticity tensor is established a priori through standard periodic first-order homogenization, the microscopic elasticity tensor remains to be determined. Additionally, the characteristic length parameter, associated with curvature measurement, controls the transition between the micro- and macro-scales. Both the microscopic elasticity tensor and the characteristic length parameter are here determined using a computational approach based on the least squares fitting of energies. This process involves the consideration of an adequate number of quadratic deformation modes and different specimen sizes. We conduct a comparative analysis between the least square fitting results of the relaxed micromorphic model, the fitting of a skew-symmetric micro-distortion field (Cosserat-micropolar model), and the fitting of the classical micromorphic model with two different formulations for the curvature...
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Generalized Optimal AMG Convergence Theory for Nonsymmetric and Indefinite Problems
Authors:
Ahsan Ali,
James Brannick,
Karsten Kahl,
Oliver A. Krzysik,
Jacob B. Schroder,
Ben S. Southworth
Abstract:
Algebraic multigrid (AMG) is known to be an effective solver for many sparse symmetric positive definite (SPD) linear systems. For SPD systems, the convergence theory of AMG is well-understood in terms of the $A$-norm, but in a nonsymmetric setting, such an energy norm is non-existent. For this reason, convergence of AMG for nonsymmetric systems of equations remains an open area of research. A par…
▽ More
Algebraic multigrid (AMG) is known to be an effective solver for many sparse symmetric positive definite (SPD) linear systems. For SPD systems, the convergence theory of AMG is well-understood in terms of the $A$-norm, but in a nonsymmetric setting, such an energy norm is non-existent. For this reason, convergence of AMG for nonsymmetric systems of equations remains an open area of research. A particular aspect missing from theory of nonsymmetric and indefinite AMG is the incorporation of general relaxation schemes. In the SPD setting, the classical form of optimal AMG interpolation provides a useful insight in determining the best possible two-grid convergence rate of a method based on an arbitrary symmetrized relaxation scheme. In this work, we discuss a generalization of the optimal AMG convergence theory targeting nonsymmetric problems, using a certain matrix-induced orthogonality of the left and right eigenvectors of a generalized eigenvalue problem relating the system matrix and relaxation operator. We show that using this generalization of the optimal convergence theory, one can obtain a measure of the spectral radius of the two grid error transfer operator that is mathematically equivalent to the derivation in the SPD setting for optimal interpolation, which instead uses norms. In addition, this generalization of the optimal AMG convergence theory can be further extended for symmetric indefinite problems, such as those arising from saddle point systems so that one can obtain a precise convergence rate of the resulting two-grid method based on optimal interpolation. We provide supporting numerical examples of the convergence theory for nonsymmetric advection-diffusion problems, two-dimensional Dirac equation motivated by $γ_5$-symmetry, and the mixed Darcy flow problem corresponding to a saddle point system.
△ Less
Submitted 11 January, 2025; v1 submitted 20 January, 2024;
originally announced January 2024.
-
Parallel-in-time solution of scalar nonlinear conservation laws
Authors:
H. De Sterck,
R. D. Falgout,
O. A. Krzysik,
J. B. Schroder
Abstract:
We consider the parallel-in-time solution of scalar nonlinear conservation laws in one spatial dimension. The equations are discretized in space with a conservative finite-volume method using weighted essentially non-oscillatory (WENO) reconstructions, and in time with high-order explicit Runge-Kutta methods. The solution of the global, discretized space-time problem is sought via a nonlinear iter…
▽ More
We consider the parallel-in-time solution of scalar nonlinear conservation laws in one spatial dimension. The equations are discretized in space with a conservative finite-volume method using weighted essentially non-oscillatory (WENO) reconstructions, and in time with high-order explicit Runge-Kutta methods. The solution of the global, discretized space-time problem is sought via a nonlinear iteration that uses a novel linearization strategy in cases of non-differentiable equations. Under certain choices of discretization and algorithmic parameters, the nonlinear iteration coincides with Newton's method, although, more generally, it is a preconditioned residual correction scheme. At each nonlinear iteration, the linearized problem takes the form of a certain discretization of a linear conservation law over the space-time domain in question. An approximate parallel-in-time solution of the linearized problem is computed with a single multigrid reduction-in-time (MGRIT) iteration. The MGRIT iteration employs a novel coarse-grid operator that is a modified conservative semi-Lagrangian discretization and generalizes those we have developed previously for non-conservative scalar linear hyperbolic problems. Numerical tests are performed for the inviscid Burgers and Buckley--Leverett equations. For many test problems, the solver converges in just a handful of iterations with convergence rate independent of mesh resolution, including problems with (interacting) shocks and rarefactions.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Numerical realization of the Mortensen observer via a Hessian-augmented polynomial approximation of the value function
Authors:
Tobias Breiten,
Karl Kunisch,
Jesper Schröder
Abstract:
Two related numerical schemes for the realization of the Mortensen observer or minimum energy estimator for the state reconstruction of non-linear dynamical systems subject to deterministic disturbances are proposed and compared. Both approaches rely on a polynomial approximation of the value function associated with the energy of the disturbances of the system. Such an approximation is obtained v…
▽ More
Two related numerical schemes for the realization of the Mortensen observer or minimum energy estimator for the state reconstruction of non-linear dynamical systems subject to deterministic disturbances are proposed and compared. Both approaches rely on a polynomial approximation of the value function associated with the energy of the disturbances of the system. Such an approximation is obtained via interpolation considering not only the values but also first and second order derivatives of the value function in a set of sampling points. The scheme is applied to four examples and the results are compared with the well known extended Kalman filter.
△ Less
Submitted 8 November, 2023; v1 submitted 31 October, 2023;
originally announced October 2023.
-
Space-Time Block Preconditioning for Incompressible Resistive Magnetohydrodynamics
Authors:
Federico Danieli,
Ben S. Southworth,
Jacob B. Schroder
Abstract:
This work develops a novel all-at-once space-time preconditioning approach for resistive magnetohydrodynamics (MHD), with a focus on model problems targeting fusion reactor design. We consider parallel-in-time due to the long time domains required to capture the physics of interest, as well as the complexity of the underlying system and thereby computational cost of long-time integration. To ameli…
▽ More
This work develops a novel all-at-once space-time preconditioning approach for resistive magnetohydrodynamics (MHD), with a focus on model problems targeting fusion reactor design. We consider parallel-in-time due to the long time domains required to capture the physics of interest, as well as the complexity of the underlying system and thereby computational cost of long-time integration. To ameliorate this cost by using many processors, we thus develop a novel approach to solving the whole space-time system that is parallelizable in both space and time. We develop a space-time block preconditioning for resistive MHD, following the space-time block preconditioning concept first introduced by Danieli et al. in 2022 for incompressible flow, where an effective preconditioner for classic sequential time-stepping is extended to the space-time setting. The starting point for our derivation is the continuous Schur complement preconditioner by Cyr et al. in 2021, which we proceed to generalise in order to produce, to our knowledge, the first space-time block preconditioning approach for the challenging equations governing incompressible resistive MHD. The numerical results are promising for the model problems of island coalescence and tearing mode, with the overhead computational cost associated with space-time preconditioning versus sequential time-stepping being modest and primarily in the range of 2x-5x, which is low for parallel-in-time schemes in general. Additionally, the scaling results for inner (linear) and outer (nonlinear) iterations are flat in the case of fixed time-step size and only grow very slowly in the case of time-step refinement.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Constrained Local Approximate Ideal Restriction for Advection-Diffusion Problems
Authors:
Ahsan Ali,
James Brannick,
Karsten Kahl,
Oliver A. Krzysik,
Jacob B. Schroder,
Ben S. Southworth
Abstract:
This paper focuses on developing a reduction-based algebraic multigrid method that is suitable for solving general (non)symmetric linear systems and is naturally robust from pure advection to pure diffusion. Initial motivation comes from a new reduction-based algebraic multigrid (AMG) approach, $\ell$AIR (local approximate ideal restriction), that was developed for solving advection-dominated prob…
▽ More
This paper focuses on developing a reduction-based algebraic multigrid method that is suitable for solving general (non)symmetric linear systems and is naturally robust from pure advection to pure diffusion. Initial motivation comes from a new reduction-based algebraic multigrid (AMG) approach, $\ell$AIR (local approximate ideal restriction), that was developed for solving advection-dominated problems. Though this new solver is very effective in the advection dominated regime, its performance degrades in cases where diffusion becomes dominant. This is consistent with the fact that in general, reduction-based AMG methods tend to suffer from growth in complexity and/or convergence rates as the problem size is increased, especially for diffusion dominated problems in two or three dimensions. Motivated by the success of $\ell$AIR in the advective regime, our aim in this paper is to generalize the AIR framework with the goal of improving the performance of the solver in diffusion dominated regimes. To do so, we propose a novel way to combine mode constraints as used commonly in energy minimization AMG methods with the local approximation of ideal operators used in $\ell$AIR. The resulting constrained $\ell$AIR (C$\ell$AIR) algorithm is able to achieve fast scalable convergence on advective and diffusive problems. In addition, it is able to achieve standard low complexity hierarchies in the diffusive regime through aggressive coarsening, something that has been previously difficult for reduction-based methods.
△ Less
Submitted 14 May, 2024; v1 submitted 1 July, 2023;
originally announced July 2023.
-
Local well-posedness of the Mortensen observer
Authors:
Tobias Breiten,
Jesper Schröder
Abstract:
The analytical background of nonlinear observers based on minimal energy estimation is discussed. It is shown that locally the derivation of the observer equation based on a trajectory with pointwise minimal energy can be done rigorously. The result is obtained by a local sensitivity analysis of the value function based on Pontryagin's maximum principle and the Hamilton-Jacobi-Bellman equation. Th…
▽ More
The analytical background of nonlinear observers based on minimal energy estimation is discussed. It is shown that locally the derivation of the observer equation based on a trajectory with pointwise minimal energy can be done rigorously. The result is obtained by a local sensitivity analysis of the value function based on Pontryagin's maximum principle and the Hamilton-Jacobi-Bellman equation. The consideration of a differential Riccati equation reveals that locally the second derivative of the value function is a positive definite matrix. The local convexity ensures existence of a trajectory minimizing the energy, which is then shown to satisfy the observer equation.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
A surrogate model for data-driven magnetic stray field calculations
Authors:
Rainer Niekamp,
Johanna Niemann,
Maximilian Reichel,
Hongbin Zhang,
Jörg Schröder
Abstract:
In this contribution we propose a data-driven surrogate model for the prediction of magnetic stray fields in two-dimensional random micro-heterogeneous materials. Since data driven models require thousands of training data sets, FEM simulations appear to be too time consuming. Hence, a stochastic model based on Brownian motion, which utilizes an efficient evaluation of stochastic transition matric…
▽ More
In this contribution we propose a data-driven surrogate model for the prediction of magnetic stray fields in two-dimensional random micro-heterogeneous materials. Since data driven models require thousands of training data sets, FEM simulations appear to be too time consuming. Hence, a stochastic model based on Brownian motion, which utilizes an efficient evaluation of stochastic transition matrices, is applied for the training data generation. For the encoding of the microstructure and the optimization of the surrogate model, two architectures are compared, i.e. the so-called UResNet model and the Fourier Convolutional neural network (FCNN). Here we analyze two FCNNs, one based on the discrete cosine transformation and one based on the complex-valued discrete Fourier transformation. Finally, we compare the magnetic stray fields for independent microstructures (not used in the training set) with results from the FE$^2$ method, a numerical homogenization scheme, to demonstrate the efficiency of the proposed surrogate model.
△ Less
Submitted 8 April, 2023;
originally announced April 2023.
-
Modeling martensitic transformation in shape memory alloys using multi-phase-field elasticity models based on partial rank-one energy relaxation on pairwise interfaces
Authors:
Mohammad Sarhil,
Oleg Shchyglo,
Dominik Brands,
Jörg Schröder,
Ingo Steinbach
Abstract:
To model the mechanically-driven phase transformations, e.g. martensitic transformation, using the phase-field theory, suitable models are needed for describing the mechanical fields of the individual non-vanishing phase-fields in the interface regions in order to obtain the mechanical driving forces of phase-field motion. Quantitative modeling requires satisfying the interfacial static equilibriu…
▽ More
To model the mechanically-driven phase transformations, e.g. martensitic transformation, using the phase-field theory, suitable models are needed for describing the mechanical fields of the individual non-vanishing phase-fields in the interface regions in order to obtain the mechanical driving forces of phase-field motion. Quantitative modeling requires satisfying the interfacial static equilibrium and kinematic compatibility conditions which have already been achieved in the literature for dual-phase-field materials by using the rank-one relaxation (or convexification) of the energy density. A direct generalization to the multi-phase-field case is not applicable without breaking these conditions partially. To the best of our knowledge, no existing multi-phase-field elasticity model has been able to satisfy the jump conditions between all the locally-active phase-fields on their pairwise normals in triple and higher-order junctions. In this work, we introduce a novel multi-phase-field elasticity model based on the partial rank-one relaxation of the elastic energy density defined on the pairwise interfaces...... (see PDF for the rest of the abstract)
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
Size-effects of metamaterial beams subjected to pure bending: on boundary conditions and parameter identification in the relaxed micromorphic model
Authors:
Mohammad Sarhil,
Lisa Scheunemann,
Jörg Schröder,
Patrizio Neff
Abstract:
In this paper we model the size-effects of metamaterial beams under bending with the aid of the relaxed micromorphic continuum. We analyze first the size-dependent bending stiffness of heterogeneous fully discretized metamaterial beams subjected to pure bending loads. Two equivalent loading schemes are introduced which lead to a constant moment along the beam length with no shear force. The relaxe…
▽ More
In this paper we model the size-effects of metamaterial beams under bending with the aid of the relaxed micromorphic continuum. We analyze first the size-dependent bending stiffness of heterogeneous fully discretized metamaterial beams subjected to pure bending loads. Two equivalent loading schemes are introduced which lead to a constant moment along the beam length with no shear force. The relaxed micromorphic model is employed then to retrieve the size-effects. We present a procedure for the determination of the material parameters of the relaxed micromorphic model based on the fact that the model operates between two well-defined scales. These scales are given by linear elasticity with micro and macro elasticity tensors which bound the relaxed micromorphic continuum from above and below, respectively. The micro elasticity tensor is specified as the maximum possible stiffness that is exhibited by the assumed metamaterial while the macro elasticity tensor is given by standard periodic first-order homogenization. For the identification of the micro elasticity tensor, two different approaches are shown which rely on affine and non-affine Dirichlet boundary conditions of candidate unit cell variants with the possible stiffest response. The consistent coupling condition is shown to allow the model to act on the whole intended range between macro and micro elasticity tensors for both loading cases. We fit the relaxed micromorphic model against the fully resolved metamaterial solution by controlling the curvature magnitude after linking it with the specimen's size. The obtained parameters of the relaxed micromorphic model are tested for two additional loading scenarios.
△ Less
Submitted 28 March, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Four-periodic infinite staircases for four-dimensional polydisks
Authors:
Caden Farley,
Tara Holm,
Nicki Magill,
Jemma Schroder,
Morgan Weiler,
Zichen Wang,
Elizaveta Zabelina
Abstract:
The ellipsoid embedding function of a symplectic four-manifold measures the amount by which its symplectic form must be scaled in order for it to admit an embedding of an ellipsoid of varying eccentricity. This function generalizes the Gromov width and ball packing numbers. In the one continuous family of symplectic four-manifolds that has been analyzed, one-point blowups of the complex projective…
▽ More
The ellipsoid embedding function of a symplectic four-manifold measures the amount by which its symplectic form must be scaled in order for it to admit an embedding of an ellipsoid of varying eccentricity. This function generalizes the Gromov width and ball packing numbers. In the one continuous family of symplectic four-manifolds that has been analyzed, one-point blowups of the complex projective plane, there is an open dense set of symplectic forms whose ellipsoid embedding functions are completely described by finitely many obstructions, while there is simultaneously a Cantor set of symplectic forms for which an infinite number of obstructions are needed. In the latter case, we say that the embedding function has an infinite staircase. In this paper we identify a new infinite staircase when the target is a four-dimensional polydisk, extending a countable family identified by Usher in 2019. Our work computes the function on infinitely many intervals and thereby indicates a method of proof for a conjecture of Usher.
△ Less
Submitted 28 August, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Efficient multigrid reduction-in-time for method-of-lines discretizations of linear advection
Authors:
H. De Sterck,
R. D. Falgout,
O. A. Krzysik,
J. B. Schroder
Abstract:
Parallel-in-time methods for partial differential equations (PDEs) have been the subject of intense development over recent decades, particularly for diffusion-dominated problems. It has been widely reported in the literature, however, that many of these methods perform quite poorly for advection-dominated problems. Here we analyze the particular iterative parallel-in-time algorithm of multigrid r…
▽ More
Parallel-in-time methods for partial differential equations (PDEs) have been the subject of intense development over recent decades, particularly for diffusion-dominated problems. It has been widely reported in the literature, however, that many of these methods perform quite poorly for advection-dominated problems. Here we analyze the particular iterative parallel-in-time algorithm of multigrid reduction-in-time (MGRIT) for discretizations of constant-wave-speed linear advection problems. We focus on common method-of-lines discretizations that employ upwind finite differences in space and Runge-Kutta methods in time. Using a convergence framework we developed in previous work, we prove for a subclass of these discretizations that, if using the standard approach of rediscretizing the fine-grid problem on the coarse grid, robust MGRIT convergence with respect to CFL number and coarsening factor is not possible. This poor convergence and non-robustness is caused, at least in part, by an inadequate coarse-grid correction for smooth Fourier modes known as characteristic components.We propose an alternative coarse-grid that provides a better correction of these modes. This coarse-grid operator is related to previous work and uses a semi-Lagrangian discretization combined with an implicitly treated truncation error correction. Theory and numerical experiments show the coarse-grid operator yields fast MGRIT convergence for many of the method-of-lines discretizations considered, including for both implicit and explicit discretizations of high order. Parallel results demonstrate substantial speed-up over sequential time-stepping.
△ Less
Submitted 20 March, 2023; v1 submitted 14 September, 2022;
originally announced September 2022.
-
Multigrid Reduction in Time for Chaotic Dynamical Systems
Authors:
David A. Vargas,
Robert D. Falgout,
Stefanie Günther,
Jacob B. Schroder
Abstract:
As CPU clock speeds have stagnated and high performance computers continue to have ever higher core counts, increased parallelism is needed to take advantage of these new architectures. Traditional serial time-marching schemes can be a significant bottleneck, as many types of simulations require large numbers of time-steps which must be computed sequentially. Parallel in Time schemes, such as the…
▽ More
As CPU clock speeds have stagnated and high performance computers continue to have ever higher core counts, increased parallelism is needed to take advantage of these new architectures. Traditional serial time-marching schemes can be a significant bottleneck, as many types of simulations require large numbers of time-steps which must be computed sequentially. Parallel in Time schemes, such as the Multigrid Reduction in Time (MGRIT) method, remedy this by parallelizing across time-steps, and have shown promising results for parabolic problems. However, chaotic problems have proved more difficult, since chaotic initial value problems (IVPs) are inherently ill-conditioned. MGRIT relies on a hierarchy of successively coarser time-grids to iteratively correct the solution on the finest time-grid, but due to the nature of chaotic systems, small inaccuracies on the coarser levels can be greatly magnified and lead to poor coarse-grid corrections. Here we introduce a modified MGRIT algorithm based on an existing quadratically converging nonlinear extension to the multigrid Full Approximation Scheme (FAS), as well as a novel time-coarsening scheme. Together, these approaches better capture long-term chaotic behavior on coarse-grids and greatly improve convergence of MGRIT for chaotic IVPs. Further, we introduce a novel low memory variant of the algorithm for solving chaotic PDEs with MGRIT which not only solves the IVP, but also provides estimates for the unstable Lyapunov vectors of the system. We provide supporting numerical results for the Lorenz system and demonstrate parallel speedup for the chaotic Kuramoto- Sivashinsky partial differential equation over a significantly longer time-domain than in previous works.
△ Less
Submitted 26 August, 2022;
originally announced August 2022.
-
Parallel Energy-Minimization Prolongation for Algebraic Multigrid
Authors:
Carlo Janna,
Andrea Franceschini,
Jacob B. Schroder,
Luke Olson
Abstract:
Algebraic multigrid (AMG) is one of the most widely used solution techniques for linear systems of equations arising from discretized partial differential equations. The popularity of AMG stems from its potential to solve linear systems in almost linear time, that is with an O(n) complexity, where n is the problem size. This capability is crucial at the present, where the increasing availability o…
▽ More
Algebraic multigrid (AMG) is one of the most widely used solution techniques for linear systems of equations arising from discretized partial differential equations. The popularity of AMG stems from its potential to solve linear systems in almost linear time, that is with an O(n) complexity, where n is the problem size. This capability is crucial at the present, where the increasing availability of massive HPC platforms pushes for the solution of very large problems. The key for a rapidly converging AMG method is a good interplay between the smoother and the coarse-grid correction, which in turn requires the use of an effective prolongation. From a theoretical viewpoint, the prolongation must accurately represent near kernel components and, at the same time, be bounded in the energy norm. For challenging problems, however, ensuring both these requirements is not easy and is exactly the goal of this work. We propose a constrained minimization procedure aimed at reducing prolongation energy while preserving the near kernel components in the span of interpolation. The proposed algorithm is based on previous energy minimization approaches utilizing a preconditioned restricted conjugate gradients method, but has new features and a specific focus on parallel performance and implementation. It is shown that the resulting solver, when used for large real-world problems from various application fields, exhibits excellent convergence rates and scalability and outperforms at least some more traditional AMG approaches.
△ Less
Submitted 5 August, 2022;
originally announced August 2022.
-
Toward Parallel in Time for Chaotic Dynamical Systems
Authors:
David A. Vargas,
Robert D. Falgout,
Stefanie Günther,
Jacob B. Schroder
Abstract:
As CPU clock speeds have stagnated, and high performance computers continue to have ever higher core counts, increased parallelism is needed to take advantage of these new architectures. Traditional serial time-marching schemes are a significant bottleneck, as many types of simulations require large numbers of time-steps which must be computed sequentially. Parallel in Time schemes, such as the Mu…
▽ More
As CPU clock speeds have stagnated, and high performance computers continue to have ever higher core counts, increased parallelism is needed to take advantage of these new architectures. Traditional serial time-marching schemes are a significant bottleneck, as many types of simulations require large numbers of time-steps which must be computed sequentially. Parallel in Time schemes, such as the Multigrid Reduction in Time (MGRIT) method, remedy this by parallelizing across time-steps, and have shown promising results for parabolic problems. However, chaotic problems have proved more difficult, since chaotic initial value problems are inherently ill-conditioned. MGRIT relies on a hierarchy of successively coarser time-grids to iteratively correct the solution on the finest time-grid, but due to the nature of chaotic systems, subtle inaccuracies on the coarser levels can lead to poor coarse-grid corrections. Here we propose a modification to nonlinear FAS multigrid, as well as a novel time-coarsening scheme, which together better capture long term behavior on coarse grids and greatly improve convergence of MGRIT for chaotic initial value problems. We provide supporting numerical results for the Lorenz system model problem.
△ Less
Submitted 25 January, 2022;
originally announced January 2022.
-
Lagrange and $H(\operatorname{curl},{\cal B})$ based Finite Element formulations for the relaxed micromorphic model
Authors:
Jörg Schröder,
Mohammad Sarhil,
Lisa Scheunemann,
Patrizio Neff
Abstract:
Modeling the unusual mechanical properties of metamaterials is a challenging topic for the mechanics community and enriched continuum theories are promising computational tools for such materials. The so-called relaxed micromorphic model has shown many advantages in this field. In this contribution, we present the significant aspects related to the relaxed micromorphic model realization with the f…
▽ More
Modeling the unusual mechanical properties of metamaterials is a challenging topic for the mechanics community and enriched continuum theories are promising computational tools for such materials. The so-called relaxed micromorphic model has shown many advantages in this field. In this contribution, we present the significant aspects related to the relaxed micromorphic model realization with the finite element method. The variational problem is derived and different FEM-formulations for the two-dimensional case are presented. These are a nodal standard formulation $H^1({\cal B}) \times H^1({\cal B})$ and a nodal-edge formulation $H^1({\cal B}) \times H(\operatorname{curl}, {\cal B})$, where the latter employs the Nédélec space. However, the implementation of higher-order Nédélec elements is not trivial and requires some technicalities which are demonstrated. We discuss the convergence behavior of Lagrange-type and tangential-conforming finite element discretizations. Moreover, we analyze the characteristic length effect on the different components of the model and reveal how the size-effect property is captured via this characteristic length.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
On the 3-colorability of triangle-free and fork-free graphs
Authors:
Joshua Schroeder,
Zhiyu Wang,
Xingxing Yu
Abstract:
A graph $G$ is said to satisfy the Vizing bound if $χ(G)\leq ω(G)+1$, where $χ(G)$ and $ω(G)$ denote the chromatic number and clique number of $G$, respectively. It was conjectured by Randerath in 1998 that if $G$ is a triangle-free and fork-free graph, where the fork (also known as trident) is obtained from $K_{1,4}$ by subdividing two edges, then $G$ satisfies the Vizing bound. In this paper, we…
▽ More
A graph $G$ is said to satisfy the Vizing bound if $χ(G)\leq ω(G)+1$, where $χ(G)$ and $ω(G)$ denote the chromatic number and clique number of $G$, respectively. It was conjectured by Randerath in 1998 that if $G$ is a triangle-free and fork-free graph, where the fork (also known as trident) is obtained from $K_{1,4}$ by subdividing two edges, then $G$ satisfies the Vizing bound. In this paper, we confirm this conjecture.
△ Less
Submitted 1 April, 2025; v1 submitted 19 November, 2021;
originally announced November 2021.
-
Weighted Relaxation for Multigrid Reduction in Time
Authors:
Masumi Sugiyama,
Jacob B. Schroder,
Ben S. Southworth,
Stephanie Friedhoff
Abstract:
Based on current trends in computer architectures, faster compute speeds must come from increased parallelism rather than increased clock speeds, which are currently stagnate. This situation has created the well-known bottleneck for sequential time-integration, where each individual time-value (i.e., time-step) is computed sequentially. One approach to alleviate this and achieve parallelism in tim…
▽ More
Based on current trends in computer architectures, faster compute speeds must come from increased parallelism rather than increased clock speeds, which are currently stagnate. This situation has created the well-known bottleneck for sequential time-integration, where each individual time-value (i.e., time-step) is computed sequentially. One approach to alleviate this and achieve parallelism in time is with multigrid. In this work, we consider multigrid-reduction-in-time (MGRIT), a multilevel method applied to the time dimension that computes multiple time-steps in parallel. Like all multigrid methods, MGRIT relies on the complementary relationship between relaxation on a fine-grid and a correction from the coarse grid to solve the problem. All current MGRIT implementations are based on unweighted-Jacobi relaxation; here we introduce the concept of weighted relaxation to MGRIT. We derive new convergence bounds for weighted relaxation, and use this analysis to guide the selection of relaxation weights. Numerical results then demonstrate that non-unitary relaxation weights consistently yield faster convergence rates and lower iteration counts for MGRIT when compared with unweighted relaxation. In most cases, weighted relaxation yields a 10%-20% saving in iterations. For A-stable integration schemes, results also illustrate that under-relaxation can restore convergence in some cases where unweighted relaxation is not convergent.
△ Less
Submitted 5 July, 2021;
originally announced July 2021.
-
Polynomial $χ$-binding functions for $t$-broom-free graphs
Authors:
Xiaonan Liu,
Joshua Schroeder,
Zhiyu Wang,
Xingxing Yu
Abstract:
For any positive integer $t$, a \emph{$t$-broom} is a graph obtained from $K_{1,t+1}$ by subdividing an edge once. In this paper, we show that, for graphs $G$ without induced $t$-brooms, we have $χ(G) = o(ω(G)^{t+1})$, where $χ(G)$ and $ω(G)$ are the chromatic number and clique number of $G$, respectively. When $t=2$, this answers a question of Schiermeyer and Randerath. Moreover, for $t=2$, we st…
▽ More
For any positive integer $t$, a \emph{$t$-broom} is a graph obtained from $K_{1,t+1}$ by subdividing an edge once. In this paper, we show that, for graphs $G$ without induced $t$-brooms, we have $χ(G) = o(ω(G)^{t+1})$, where $χ(G)$ and $ω(G)$ are the chromatic number and clique number of $G$, respectively. When $t=2$, this answers a question of Schiermeyer and Randerath. Moreover, for $t=2$, we strengthen the bound on $χ(G)$ to $7ω(G)^2$, confirming a conjecture of Sivaraman. For $t\geq 3$ and \{$t$-broom, $K_{t,t}$\}-free graphs, we improve the bound to $o(ω^{t})$.
△ Less
Submitted 29 November, 2022; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Multilevel Initialization for Layer-Parallel Deep Neural Network Training
Authors:
Eric C. Cyr,
Stefanie Günther,
Jacob B. Schroder
Abstract:
This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal control, in which neural networks are represented as discretizations of time-dependent ordinary differential equations. A key goal is to develop a method able…
▽ More
This paper investigates multilevel initialization strategies for training very deep neural networks with a layer-parallel multigrid solver. The scheme is based on the continuous interpretation of the training problem as a problem of optimal control, in which neural networks are represented as discretizations of time-dependent ordinary differential equations. A key goal is to develop a method able to intelligently initialize the network parameters for the very deep networks enabled by scalable layer-parallel training. To do this, we apply a refinement strategy across the time domain, that is equivalent to refining in the layer dimension. The resulting refinements create deep networks, with good initializations for the network parameters coming from the coarser trained networks. We investigate the effectiveness of such multilevel "nested iteration" strategies for network training, showing supporting numerical evidence of reduced run time for equivalent accuracy. In addition, we study whether the initialization strategies provide a regularizing effect on the overall training process and reduce sensitivity to hyperparameters and randomness in initial network parameters.
△ Less
Submitted 18 December, 2019;
originally announced December 2019.
-
The Role of Energy Minimization in Algebraic Multigrid Interpolation
Authors:
James Brannick,
Scott P. MacLachlan,
Jacob B. Schroder,
Ben S. Southworth
Abstract:
Algebraic multigrid (AMG) methods are powerful solvers with linear or near-linear computational complexity for certain classes of linear systems, Ax=b. Broadening the scope of problems that AMG can effectively solve requires the development of improved interpolation operators. Such development is often based on AMG convergence theory. However, convergence theory in AMG tends to have a disconnect w…
▽ More
Algebraic multigrid (AMG) methods are powerful solvers with linear or near-linear computational complexity for certain classes of linear systems, Ax=b. Broadening the scope of problems that AMG can effectively solve requires the development of improved interpolation operators. Such development is often based on AMG convergence theory. However, convergence theory in AMG tends to have a disconnect with AMG in practice due to the practical constraints of (i) maintaining matrix sparsity in transfer and coarse-grid operators, and (ii) retaining linear complexity in the setup and solve phase. This paper presents a review of fundamental results in AMG convergence theory, followed by a discussion on how these results can be used to motivate interpolation operators in practice. A general weighted energy minimization functional is then proposed to form interpolation operators, and a novel `diagonal' preconditioner for Sylvester- or Lyapunov-type equations developed simultaneously. Although results based on the weighted energy minimization typically underperform compared to a fully constrained energy minimization, numerical results provide new insight into the role of energy minimization and constraint vectors in AMG interpolation.
△ Less
Submitted 13 February, 2019;
originally announced February 2019.
-
Multilevel convergence analysis of multigrid-reduction-in-time
Authors:
Andreas Hessenthaler,
Ben S. Southworth,
David Nordsletten,
Oliver Röhrle,
Robert D. Falgout,
Jacob B. Schroder
Abstract:
This paper presents a multilevel convergence framework for multigrid-reduction-in-time (MGRIT) as a generalization of previous two-grid estimates. The framework provides a priori upper bounds on the convergence of MGRIT V- and F-cycles, with different relaxation schemes, by deriving the respective residual and error propagation operators. The residual and error operators are functions of the time…
▽ More
This paper presents a multilevel convergence framework for multigrid-reduction-in-time (MGRIT) as a generalization of previous two-grid estimates. The framework provides a priori upper bounds on the convergence of MGRIT V- and F-cycles, with different relaxation schemes, by deriving the respective residual and error propagation operators. The residual and error operators are functions of the time stepping operator, analyzed directly and bounded in norm, both numerically and analytically. We present various upper bounds of different computational cost and varying sharpness. These upper bounds are complemented by proposing analytic formulae for the approximate convergence factor of V-cycle algorithms that take the number of fine grid time points, the temporal coarsening factors, and the eigenvalues of the time stepping operator as parameters.
The paper concludes with supporting numerical investigations of parabolic (anisotropic diffusion) and hyperbolic (wave equation) model problems. We assess the sharpness of the bounds and the quality of the approximate convergence factors. Observations from these numerical investigations demonstrate the value of the proposed multilevel convergence framework for estimating MGRIT convergence a priori and for the design of a convergent algorithm. We further highlight that observations in the literature are captured by the theory, including that two-level Parareal and multilevel MGRIT with F-relaxation do not yield scalable algorithms and the benefit of a stronger relaxation scheme. An important observation is that with increasing numbers of levels MGRIT convergence deteriorates for the hyperbolic model problem, while constant convergence factors can be achieved for the diffusion equation. The theory also indicates that L-stable Runge-Kutta schemes are more amendable to multilevel parallel-in-time integration with MGRIT than A-stable Runge-Kutta schemes.
△ Less
Submitted 4 June, 2019; v1 submitted 30 December, 2018;
originally announced December 2018.
-
Efficient Calculation of the Joint Distribution of Order Statistics
Authors:
Jonathan von Schroeder,
Thorsten Dickhaus
Abstract:
We consider the problem of computing the joint distribution of order statistics of stochastically independent random variables in one- and two-group models. While recursive formulas for evaluating the joint cumulative distribution function of such order statistics exist in the literature for a longer time, their numerical implementation remains a challenging task. We tackle this task by presenting…
▽ More
We consider the problem of computing the joint distribution of order statistics of stochastically independent random variables in one- and two-group models. While recursive formulas for evaluating the joint cumulative distribution function of such order statistics exist in the literature for a longer time, their numerical implementation remains a challenging task. We tackle this task by presenting novel generalizations of known recursions which we utilize to obtain exact results (calculated in rational arithmetic) as well as faithfully rounded results. Finally, some applications in stepwise multiple hypothesis testing are discussed.
△ Less
Submitted 21 December, 2018;
originally announced December 2018.
-
Layer-Parallel Training of Deep Residual Neural Networks
Authors:
S. Günther,
L. Ruthotto,
J. B. Schroder,
E. C. Cyr,
N. R. Gauger
Abstract:
Residual neural networks (ResNets) are a promising class of deep neural networks that have shown excellent performance for a number of learning tasks, e.g., image classification and recognition. Mathematically, ResNet architectures can be interpreted as forward Euler discretizations of a nonlinear initial value problem whose time-dependent control variables represent the weights of the neural netw…
▽ More
Residual neural networks (ResNets) are a promising class of deep neural networks that have shown excellent performance for a number of learning tasks, e.g., image classification and recognition. Mathematically, ResNet architectures can be interpreted as forward Euler discretizations of a nonlinear initial value problem whose time-dependent control variables represent the weights of the neural network. Hence, training a ResNet can be cast as an optimal control problem of the associated dynamical system. For similar time-dependent optimal control problems arising in engineering applications, parallel-in-time methods have shown notable improvements in scalability. This paper demonstrates the use of those techniques for efficient and effective training of ResNets. The proposed algorithms replace the classical (sequential) forward and backward propagation through the network layers by a parallel nonlinear multigrid iteration applied to the layer domain. This adds a new dimension of parallelism across layers that is attractive when training very deep networks. From this basic idea, we derive multiple layer-parallel methods. The most efficient version employs a simultaneous optimization approach where updates to the network parameters are based on inexact gradient information in order to speed up the training process. Using numerical examples from supervised classification, we demonstrate that the new approach achieves similar training performance to traditional methods, but enables layer-parallelism and thus provides speedup over layer-serial methods through greater concurrency.
△ Less
Submitted 25 July, 2019; v1 submitted 11 December, 2018;
originally announced December 2018.
-
A Non-Intrusive Parallel-in-Time Approach for Simultaneous Optimization with Unsteady PDEs
Authors:
Stefanie Günther,
Nicolas R. Gauger,
Jacob B. Schroder
Abstract:
This paper presents a non-intrusive framework for integrating existing unsteady partial differential equation (PDE) solvers into a parallel-in-time simultaneous optimization algorithm. The time-parallelization is provided by the non-intrusive software library XBraid, which applies an iterative multigrid reduction technique to the time domain of existing time-marching schemes for solving unsteady P…
▽ More
This paper presents a non-intrusive framework for integrating existing unsteady partial differential equation (PDE) solvers into a parallel-in-time simultaneous optimization algorithm. The time-parallelization is provided by the non-intrusive software library XBraid, which applies an iterative multigrid reduction technique to the time domain of existing time-marching schemes for solving unsteady PDEs. Its general user-interface has been extended for computing adjoint sensitivities such that gradients of output quantities with respect to design changes can be computed parallel-in-time alongside with the primal PDE solution. In this paper, the primal and adjoint XBraid iterations are embedded into a simultaneous optimization framework, namely the One-shot method. In this method, design updates towards optimality are employed after each state and adjoint update such that optimality and feasibility of the design and the PDE solution are reached simultaneously. The time-parallel optimization method is validated on an advection-dominated flow control problem which shows significant speedup over a classical time-serial optimization algorithm.
△ Less
Submitted 28 February, 2018; v1 submitted 19 January, 2018;
originally announced January 2018.
-
Parallelizing Over Artificial Neural Network Training Runs with Multigrid
Authors:
Jacob B. Schroder
Abstract:
Artificial neural networks are a popular and effective machine learning technique. Great progress has been made parallelizing the expensive training phase of an individual network, leading to highly specialized pieces of hardware, many based on GPU-type architectures, and more concurrent algorithms such as synthetic gradients. However, the training phase continues to be a bottleneck, where the tra…
▽ More
Artificial neural networks are a popular and effective machine learning technique. Great progress has been made parallelizing the expensive training phase of an individual network, leading to highly specialized pieces of hardware, many based on GPU-type architectures, and more concurrent algorithms such as synthetic gradients. However, the training phase continues to be a bottleneck, where the training data must be processed serially over thousands of individual training runs. This work considers a multigrid reduction in time (MGRIT) algorithm that is able to parallelize over the thousands of training runs and converge to the exact same solution as traditional training would provide. MGRIT was originally developed to provide parallelism for time evolution problems that serially step through a finite number of time-steps. This work recasts the training of a neural network similarly, treating neural network training as an evolution equation that evolves the network weights from one step to the next. Thus, this work concerns distributed computing approaches for neural networks, but is distinct from other approaches which seek to parallelize only over individual training runs. The work concludes with supporting numerical results for two model problems.
△ Less
Submitted 1 October, 2017; v1 submitted 7 August, 2017;
originally announced August 2017.
-
A lower bound on the number of rough numbers
Authors:
J. Z. Schroeder
Abstract:
Conceptually, a rough number is a positive integer with no small prime factors. Formally, for real numbers $x$ and $y$, let $Φ(x,y)$ denote the number of positive integers at most $x$ with no prime factors less than $y$. In this paper we establish the lower bound $Φ(n,p)\geq \lfloor 2n/p \rfloor +1$ when $p\geq 11$ is prime and $n\geq 2p$.
Conceptually, a rough number is a positive integer with no small prime factors. Formally, for real numbers $x$ and $y$, let $Φ(x,y)$ denote the number of positive integers at most $x$ with no prime factors less than $y$. In this paper we establish the lower bound $Φ(n,p)\geq \lfloor 2n/p \rfloor +1$ when $p\geq 11$ is prime and $n\geq 2p$.
△ Less
Submitted 16 May, 2017; v1 submitted 13 May, 2017;
originally announced May 2017.
-
A Non-Intrusive Parallel-in-Time Adjoint Solver with the XBraid Library
Authors:
Stefanie Günther,
Nicolas R. Gauger,
Jacob B. Schroder
Abstract:
In this paper, an adjoint solver for the multigrid in time software library XBraid is presented. XBraid provides a non-intrusive approach for simulating unsteady dynamics on multiple processors while parallelizing not only in space but also in the time domain. It applies an iterative multigrid reduction in time algorithm to existing spatially parallel classical time propagators and computes the un…
▽ More
In this paper, an adjoint solver for the multigrid in time software library XBraid is presented. XBraid provides a non-intrusive approach for simulating unsteady dynamics on multiple processors while parallelizing not only in space but also in the time domain. It applies an iterative multigrid reduction in time algorithm to existing spatially parallel classical time propagators and computes the unsteady solution parallel in time. Techniques from Automatic Differentiation are used to develop a consistent discrete adjoint solver which provides sensitivity information of output quantities with respect to design parameter changes. The adjoint code runs backwards through the primal XBraid actions and accumulates gradient information parallel in time. It is highly non-intrusive as existing adjoint time propagators can easily be integrated through the adjoint interface. The adjoint code is validated on advection-dominated flow with periodic upstream boundary condition. It provides similar strong scaling results as the primal XBraid solver and offers great potential for speeding up the overall computational costs for sensitivity analysis using multiple processors.
△ Less
Submitted 19 January, 2018; v1 submitted 1 May, 2017;
originally announced May 2017.
-
A Root-Node Based Algebraic Multigrid Method
Authors:
Thomas A. Manteuffel,
Luke N. Olson,
Jacob B. Schroder,
Ben S. Southworth
Abstract:
This paper provides a unified and detailed presentation of root-node style algebraic multigrid (AMG). Algebraic multigrid is a popular and effective iterative method for solving large, sparse linear systems that arise from discretizing partial differential equations. However, while AMG is designed for symmetric positive definite matrices (SPD), certain SPD problems, such as anisotropic diffusion,…
▽ More
This paper provides a unified and detailed presentation of root-node style algebraic multigrid (AMG). Algebraic multigrid is a popular and effective iterative method for solving large, sparse linear systems that arise from discretizing partial differential equations. However, while AMG is designed for symmetric positive definite matrices (SPD), certain SPD problems, such as anisotropic diffusion, are still not adequately addressed by existing methods. Non-SPD problems pose an even greater challenge, and in practice AMG is often not considered as a solver for such problems.
The focus of this paper is on so-called root-node AMG, which can be viewed as a combination of classical and aggregation-based multigrid. An algorithm for root-node is outlined and a filtering strategy is developed, which is able to control the cost of using root-node AMG, particularly on difficult problems. New theoretical motivation is provided for root-node and energy-minimization as applied to symmetric as well non-symmetric systems. Numerical results are then presented demonstrating the robust ability of root-node to solve non-symmetric problems, systems-based problems, and difficult SPD problems, including strongly anisotropic diffusion, convection-diffusion, and upwind steady-state transport, in a scalable manner. New, detailed estimates of the computational cost of the setup and solve phase are given for each example, providing additional support for root-node AMG over alternative methods.
△ Less
Submitted 28 January, 2018; v1 submitted 10 October, 2016;
originally announced October 2016.
-
Minimal geodesics and integrable behavior in geodesic flows
Authors:
Jan Philipp Schröder
Abstract:
In this survey article we gather classical as well as recent results on minimal geodesics of Riemannian or Finsler metrics, giving special attention to the two-dimensional case. Moreover, we present open problems together with some first ideas as to the solutions.
In this survey article we gather classical as well as recent results on minimal geodesics of Riemannian or Finsler metrics, giving special attention to the two-dimensional case. Moreover, we present open problems together with some first ideas as to the solutions.
△ Less
Submitted 25 January, 2016;
originally announced January 2016.
-
Reducing Parallel Communication in Algebraic Multigrid through Sparsification
Authors:
Amanda Bienz,
Robert D. Falgout William Gropp,
Luke N. Olson,
Jacob B. Schroder
Abstract:
Algebraic multigrid (AMG) is an $\mathcal{O}(n)$ solution process for many large sparse linear systems. A hierarchy of progressively coarser grids is constructed that utilize complementary relaxation and interpolation operators. High-energy error is reduced by relaxation, while low-energy error is mapped to coarse-grids and reduced there. However, large parallel communication costs often limit par…
▽ More
Algebraic multigrid (AMG) is an $\mathcal{O}(n)$ solution process for many large sparse linear systems. A hierarchy of progressively coarser grids is constructed that utilize complementary relaxation and interpolation operators. High-energy error is reduced by relaxation, while low-energy error is mapped to coarse-grids and reduced there. However, large parallel communication costs often limit parallel scalability. As the multigrid hierarchy is formed, each coarse matrix is formed through a triple matrix product. The resulting coarse-grids often have significantly more nonzeros per row than the original fine-grid operator, thereby generating high parallel communication costs on coarse-levels. In this paper, we introduce a method that systematically removes entries in coarse-grid matrices after the hierarchy is formed, leading to an improved communication costs. We sparsify by removing weakly connected or unimportant entries in the matrix, leading to improved solve time. The main trade-off is that if the heuristic identifying unimportant entries is used too aggressively, then AMG convergence can suffer. To counteract this, the original hierarchy is retained, allowing entries to be reintroduced into the solver hierarchy if convergence is too slow. This enables a balance between communication cost and convergence, as necessary. In this paper we present new algorithms for reducing communication and present a number of computational experiments in support.
△ Less
Submitted 14 December, 2015;
originally announced December 2015.
-
The stable norm on the 2-torus at irrational directions
Authors:
Stefan Klempnauer,
Jan Philipp Schröder
Abstract:
We study the structure of the stable norm of Finsler metrics on the 2-torus with a focus to points of irrational slope. By our results, the stable norm detects KAM-tori and hyperbolicity in the geodesic flow. Moreover, we study the stable norm in some natural examples.
We study the structure of the stable norm of Finsler metrics on the 2-torus with a focus to points of irrational slope. By our results, the stable norm detects KAM-tori and hyperbolicity in the geodesic flow. Moreover, we study the stable norm in some natural examples.
△ Less
Submitted 8 December, 2015;
originally announced December 2015.
-
Minimal rays on surfaces of genus greater than one -- Part II
Authors:
Jan Philipp Schröder
Abstract:
We consider any Finsler metric on a closed, orientable surface of genus greater than one. H. M. Morse proved that we can associate an asymptotic direction to minimal rays in the universal cover (in the Poincaré disc: a point on the unit circle). We prove here that, if two minimal rays have a common asymptotic direction, which is not a fixed point of the group of deck transformations, then the two…
▽ More
We consider any Finsler metric on a closed, orientable surface of genus greater than one. H. M. Morse proved that we can associate an asymptotic direction to minimal rays in the universal cover (in the Poincaré disc: a point on the unit circle). We prove here that, if two minimal rays have a common asymptotic direction, which is not a fixed point of the group of deck transformations, then the two rays can intersect at most in a common initial point. This has strong consequences for the structure of the set of minimal geodesics, as well as for the set of Busemann functions associated to the Finsler metric.
△ Less
Submitted 5 September, 2014;
originally announced September 2014.
-
Ergodic components and topological entropy in geodesic flows of surfaces
Authors:
Jan Philipp Schröder
Abstract:
We consider the geodesic flow of reversible Finsler metrics on the 2-sphere and the 2-torus, whose geodesic flow has vanishing topological entropy. Following a construction of A. Katok, we discuss examples of Finsler metrics on both surfaces, which have large ergodic components for the geodesic flow in the unit tangent bundle. On the other hand, using results of J. Franks and M. Handel, we prove t…
▽ More
We consider the geodesic flow of reversible Finsler metrics on the 2-sphere and the 2-torus, whose geodesic flow has vanishing topological entropy. Following a construction of A. Katok, we discuss examples of Finsler metrics on both surfaces, which have large ergodic components for the geodesic flow in the unit tangent bundle. On the other hand, using results of J. Franks and M. Handel, we prove that ergodicity and dense orbits cannot occur in the full unit tangent bundle of the 2-sphere, if the Finsler metric has positive flag curvatures and at least two closed geodesics. In the case of the 2-torus, we show that ergodicity is restricted to strict subsets of tubes between flow-invariant tori in the unit tangent bundle of the 2-torus.
△ Less
Submitted 23 July, 2014;
originally announced July 2014.
-
Uniqueness of shortest closed geodesics for generic Finsler metrics
Authors:
Jan Philipp Schröder
Abstract:
In every conformal class of Finsler (or Riemannian) metrics on a closed manifold there exists a residual subset of Finsler metrics, such that, with respect to the residual Finsler metrics, in any non-trivial homotopy class of free loops there is precisely one shortest geodesic loop.
In every conformal class of Finsler (or Riemannian) metrics on a closed manifold there exists a residual subset of Finsler metrics, such that, with respect to the residual Finsler metrics, in any non-trivial homotopy class of free loops there is precisely one shortest geodesic loop.
△ Less
Submitted 12 May, 2014;
originally announced May 2014.
-
Minimal rays on surfaces of genus greater than one
Authors:
Jan Philipp Schröder
Abstract:
For Finsler metrics (no reversibility assumed) on closed orientable surfaces of genus greater than one, we study the dynamics of minimal rays and minimal geodesics in the universal cover. We prove in particular, that for almost all asymptotic directions the minimal rays with these directions laminate the universal cover and that the Busemann functions with these directions are unique up to adding…
▽ More
For Finsler metrics (no reversibility assumed) on closed orientable surfaces of genus greater than one, we study the dynamics of minimal rays and minimal geodesics in the universal cover. We prove in particular, that for almost all asymptotic directions the minimal rays with these directions laminate the universal cover and that the Busemann functions with these directions are unique up to adding constants. Moreover, using a kind of weak KAM theory, we show that for almost all types of minimal geodesics in the sense of Morse, there is precisely one minimal geodesic of this type.
△ Less
Submitted 2 April, 2014;
originally announced April 2014.
-
Tonelli Lagrangian systems on the 2-torus and topological entropy
Authors:
Jan Philipp Schröder
Abstract:
We study Tonelli Lagrangian systems on the 2-torus in energy levels above Mañé's strict critical value and analyize the structure of global minimizers in the spirit of Morse, Hedlund and Bangert. In the case where the topological entropy of the Euler-Lagrange flow on the fixed energy level vanishes, we show that there are invariant tori for all rotation vectors indicating integrable-like behavior…
▽ More
We study Tonelli Lagrangian systems on the 2-torus in energy levels above Mañé's strict critical value and analyize the structure of global minimizers in the spirit of Morse, Hedlund and Bangert. In the case where the topological entropy of the Euler-Lagrange flow on the fixed energy level vanishes, we show that there are invariant tori for all rotation vectors indicating integrable-like behavior on a large scale. On the other hand, using a construction of Katok, we give examples of reversible Finsler geodesic flows with vanishing topological entropy, but having ergodic components of positive measure in the unit tangent bundle.
△ Less
Submitted 29 August, 2013;
originally announced August 2013.
-
Topological entropy of minimal geodesics and volume growth on surfaces
Authors:
Gerhard Knieper,
Carlos Ogouyandjou,
Jan Philipp Schröder
Abstract:
Let (M,g) be a compact Riemannian manifold of hyperbolic type, i.e M is a manifold admitting another metric of strictly negative curvature. In this paper we study the geodesic flow restricted to the set of geodesics which are minimal on the universal covering. In particular for surfaces we show that the topological entropy of the minimal geodesics coincides with the volume entropy of (M, g) genera…
▽ More
Let (M,g) be a compact Riemannian manifold of hyperbolic type, i.e M is a manifold admitting another metric of strictly negative curvature. In this paper we study the geodesic flow restricted to the set of geodesics which are minimal on the universal covering. In particular for surfaces we show that the topological entropy of the minimal geodesics coincides with the volume entropy of (M, g) generalizing work of Freire and Mane.
△ Less
Submitted 9 August, 2013;
originally announced August 2013.