Search | arXiv e-print repository

Adjoint Monte Carlo Method

Abstract: This survey explores the development of adjoint Monte Carlo methods for solving optimization problems governed by kinetic equations, a common challenge in areas such as plasma control and device design. These optimization problems are particularly demanding due to the high dimensionality of the phase space and the randomness in evaluating the objective functional, a consequence of using a forward… ▽ More This survey explores the development of adjoint Monte Carlo methods for solving optimization problems governed by kinetic equations, a common challenge in areas such as plasma control and device design. These optimization problems are particularly demanding due to the high dimensionality of the phase space and the randomness in evaluating the objective functional, a consequence of using a forward Monte Carlo solver. To overcome these difficulties, a range of ``adjoint Monte Carlo methods'' have been devised. These methods skillfully combine Monte Carlo gradient estimators with PDE-constrained optimization, introducing innovative solutions tailored for kinetic applications. In this review, we begin by examining three primary strategies for Monte Carlo gradient estimation: the score function approach, the reparameterization trick, and the coupling method. We also delve into the adjoint-state method, an essential element in PDE-constrained optimization. Focusing on applications in the radiative transfer equation and the nonlinear Boltzmann equation, we provide a comprehensive guide on how to integrate Monte Carlo gradient techniques within both the optimize-then-discretize and the discretize-then-optimize frameworks from PDE-constrained optimization. This approach leads to the formulation of effective adjoint Monte Carlo methods, enabling efficient gradient estimation in complex, high-dimensional optimization problems. △ Less

Submitted 22 May, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

Comments: 39 pages, 7 figures

arXiv:2308.02904 [pdf, other]

doi 10.1007/s10915-024-02614-1

Gradient-based Monte Carlo methods for relaxation approximations of hyperbolic conservation laws

Authors: Giulia Bertaglia, Lorenzo Pareschi, Russel E. Caflisch

Abstract: Particle methods based on evolving the spatial derivatives of the solution were originally introduced to simulate reaction-diffusion processes, inspired by vortex methods for the Navier--Stokes equations. Such methods, referred to as gradient random walk methods, were extensively studied in the '90s and have several interesting features, such as being grid free, automatically adapting to the solut… ▽ More Particle methods based on evolving the spatial derivatives of the solution were originally introduced to simulate reaction-diffusion processes, inspired by vortex methods for the Navier--Stokes equations. Such methods, referred to as gradient random walk methods, were extensively studied in the '90s and have several interesting features, such as being grid free, automatically adapting to the solution by concentrating elements where the gradient is large and significantly reducing the variance of the standard random walk approach. In this work, we revive these ideas by showing how to generalize the approach to a larger class of partial differential equations, including hyperbolic systems of conservation laws. To achieve this goal, we first extend the classical Monte Carlo method to relaxation approximation of systems of conservation laws, and subsequently consider a novel particle dynamics based on the spatial derivatives of the solution. The methodology, combined with asymptotic-preserving splitting discretization, yields a way to construct a new class of gradient-based Monte Carlo methods for hyperbolic systems of conservation laws. Several results in one spatial dimension for scalar equations and systems of conservation laws show that the new methods are very promising and yield remarkable improvements compared to standard Monte Carlo approaches, either in terms of variance reduction as well as in describing the shock structure. △ Less

Submitted 5 August, 2023; originally announced August 2023.

MSC Class: 65C05; 65C35; 35L65; 35L04; 82B40

Journal ref: J. Sci. Comput., 100 (2024) 60

arXiv:2207.11579 [pdf, other]

doi 10.1016/j.jcp.2023.112247

Adjoint DSMC for Nonlinear Spatially-Homogeneous Boltzmann Equation With a General Collision Model

Authors: Yunan Yang, Denis Silantyev, Russel Caflisch

Abstract: We derive an adjoint method for the Direct Simulation Monte Carlo (DSMC) method for the spatially homogeneous Boltzmann equation with a general collision law. This generalizes our previous results in [Caflisch, R., Silantyev, D. and Yang, Y., 2021. Journal of Computational Physics, 439, p.110404], which was restricted to the case of Maxwell molecules, for which the collision rate is constant. The… ▽ More We derive an adjoint method for the Direct Simulation Monte Carlo (DSMC) method for the spatially homogeneous Boltzmann equation with a general collision law. This generalizes our previous results in [Caflisch, R., Silantyev, D. and Yang, Y., 2021. Journal of Computational Physics, 439, p.110404], which was restricted to the case of Maxwell molecules, for which the collision rate is constant. The main difficulty in generalizing the previous results is that a rejection sampling step is required in the DSMC algorithm in order to handle the variable collision rate. We find a new term corresponding to the so-called score function in the adjoint equation and a new adjoint Jacobian matrix capturing the dependence of the collision parameter on the velocities. The new formula works for a much more general class of collision models. △ Less

Submitted 30 July, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

Comments: 28 pages, 5 figures

MSC Class: 76P05; 82C80; 65C05; 65K10; 82B40; 65M32

arXiv:2009.01363 [pdf, other]

doi 10.1016/j.jcp.2021.110404

Adjoint DSMC for nonlinear Boltzmann equation constrained optimization

Authors: Russel Caflisch, Denis Silantyev, Yunan Yang

Abstract: Applications for kinetic equations such as optimal design and inverse problems often involve finding unknown parameters through gradient-based optimization algorithms. Based on the adjoint-state method, we derive two different frameworks for approximating the gradient of an objective functional constrained by the nonlinear Boltzmann equation. While the forward problem can be solved by the DSMC met… ▽ More Applications for kinetic equations such as optimal design and inverse problems often involve finding unknown parameters through gradient-based optimization algorithms. Based on the adjoint-state method, we derive two different frameworks for approximating the gradient of an objective functional constrained by the nonlinear Boltzmann equation. While the forward problem can be solved by the DSMC method, it is difficult to efficiently solve the high-dimensional continuous adjoint equation obtained by the "optimize-then-discretize" approach. This challenge motivates us to propose an adjoint DSMC method following the "discretize-then-optimize" approach for Boltzmann-constrained optimization. We also analyze the properties of the two frameworks and their connections. Several numerical examples are presented to demonstrate their accuracy and efficiency. △ Less

Submitted 10 May, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

Comments: 32 pages. 8 figures

MSC Class: 76P05; 82C80; 65C05; 65K10; 82B40; 65M32

Journal ref: Journal of Computational Physics, 2021, 110404, ISSN 0021-9991. (https://www.sciencedirect.com/science/article/pii/S0021999121002990)

arXiv:1512.02107 [pdf, other]

Complex singularities and PDEs

Authors: R. E. Caflisch, F. Gargano, M. Sammartino, V. Sciacca

Abstract: In this paper we give a review on the computational methods used to characterize the complex singularities developed by some relevant PDEs. We begin by reviewing the singularity tracking method based on the analysis of the Fourier spectrum. We then introduce other methods generally used to detect the hidden singularities. In particular we show some applications of the Padé approximation, of the Ki… ▽ More In this paper we give a review on the computational methods used to characterize the complex singularities developed by some relevant PDEs. We begin by reviewing the singularity tracking method based on the analysis of the Fourier spectrum. We then introduce other methods generally used to detect the hidden singularities. In particular we show some applications of the Padé approximation, of the Kida method, and of Borel-Polya method. We apply these techniques to the study of the singularity formation of some nonlinear dispersive and dissipative one dimensional PDE of the 2D Prandtl equation, of the 2D KP equation, and to Navier-Stokes equation for high Reynolds number incompressible flows in the case of interaction with rigid boundaries. △ Less

Submitted 25 November, 2015; originally announced December 2015.

Journal ref: Riv.Mat.Univ.Parma, Volume 6 - Number 1 - 2015

arXiv:1403.6883 [pdf, other]

Compressed Wannier modes found from an $L_1$ regularized energy functional

Authors: Farzin Barekat, Ke Yin, Russel E. Caflisch, Stanley J. Osher, Rongjie Lai, Vidvuds Ozolins

Abstract: We propose a method for calculating Wannier functions of periodic solids directly from a modified variational principle for the energy, subject to the requirement that the Wannier functions are orthogonal to all their translations ("shift-orthogonality"). Localization is achieved by adding an $L_1$ regularization term to the energy functional. This approach results in "compressed" Wannier modes wi… ▽ More We propose a method for calculating Wannier functions of periodic solids directly from a modified variational principle for the energy, subject to the requirement that the Wannier functions are orthogonal to all their translations ("shift-orthogonality"). Localization is achieved by adding an $L_1$ regularization term to the energy functional. This approach results in "compressed" Wannier modes with compact support, where one parameter $μ$ controls the trade-off between the accuracy of the total energy and the size of the support of the Wannier modes. Efficient algorithms for shift-orthogonalization and solution of the variational minimization problem are demonstrated. △ Less

Submitted 26 March, 2014; originally announced March 2014.

Comments: 5 pages, 3 figures

arXiv:1402.5158 [pdf, ps, other]

Projection to the Set of Shift Orthogonal Functions

Authors: Farzin Barekat, Rongjie Lai, Ke Yin, Stanley Osher, Russel Caflisch, Vidvuds Ozolins

Abstract: This paper presents a fast algorithm for projecting a given function to the set of shift orthogonal functions (i.e. set containing functions with unit $L^2$ norm that are orthogonal to their prescribed shifts). The algorithm can be parallelized easily and its computational complexity is bounded by $O(M\log(M))$, where $M$ is the number of coefficients used for storing the input. To derive the algo… ▽ More This paper presents a fast algorithm for projecting a given function to the set of shift orthogonal functions (i.e. set containing functions with unit $L^2$ norm that are orthogonal to their prescribed shifts). The algorithm can be parallelized easily and its computational complexity is bounded by $O(M\log(M))$, where $M$ is the number of coefficients used for storing the input. To derive the algorithm, a particular class of basis called Shift Orthogonal Basis Functions are introduced and some theory regarding them is developed. △ Less

Submitted 20 February, 2014; originally announced February 2014.

Comments: 32 pages, 2 figures

arXiv:1311.5850 [pdf, ps, other]

PDEs with Compressed Solutions

Authors: Russel E. Caflisch, Stanley J. Osher, Hayden Schaeffer, Giang Tran

Abstract: Sparsity plays a central role in recent developments in signal processing, linear algebra, statistics, optimization, and other fields. In these developments, sparsity is promoted through the addition of an $L^1$ norm (or related quantity) as a constraint or penalty in a variational principle. We apply this approach to partial differential equations that come from a variational quantity, either by… ▽ More Sparsity plays a central role in recent developments in signal processing, linear algebra, statistics, optimization, and other fields. In these developments, sparsity is promoted through the addition of an $L^1$ norm (or related quantity) as a constraint or penalty in a variational principle. We apply this approach to partial differential equations that come from a variational quantity, either by minimization (to obtain an elliptic PDE) or by gradient flow (to obtain a parabolic PDE). Also, we show that some PDEs can be rewritten in an $L^1$ form, such as the divisible sandpile problem and signum-Gordon. Addition of an $L^1$ term in the variational principle leads to a modified PDE where a subgradient term appears. It is known that modified PDEs of this form will often have solutions with compact support, which corresponds to the discrete solution being sparse. We show that this is advantageous numerically through the use of efficient algorithms for solving $L^1$ based problems. △ Less

Submitted 1 August, 2014; v1 submitted 22 November, 2013; originally announced November 2013.

Comments: 21 pages, 15 figures

arXiv:1310.4555 [pdf, ps, other]

doi 10.4208/cicp.300313.130114a

Simulation with Fluctuating and Singular Rates

Authors: Farzin Barekat, Russel Caflisch

Abstract: In this paper we present a method to generate independent samples for a general random variable, either continuous or discrete. The algorithm is an extension of the acceptance-rejection method, and it is particularly useful for kinetic simulation in which the rates are fluctuating in time and have singular limits, as occurs for example in simulation of recombination interactions in a plasma. Altho… ▽ More In this paper we present a method to generate independent samples for a general random variable, either continuous or discrete. The algorithm is an extension of the acceptance-rejection method, and it is particularly useful for kinetic simulation in which the rates are fluctuating in time and have singular limits, as occurs for example in simulation of recombination interactions in a plasma. Although it depends on some additional requirements, the new method is easy to implement and rejects less samples than the acceptance-rejection method. △ Less

Submitted 16 October, 2013; originally announced October 2013.

Comments: 22 pages, 6 figures

arXiv:1310.3591 [pdf, other]

doi 10.1016/j.jcp.2014.05.030

Multilevel Monte Carlo simulation of Coulomb collisions

Authors: M. S. Rosin, L. F. Ricketson, A. M. Dimits, R. E. Caflisch, B. I. Cohen

Abstract: We present a new, for plasma physics, highly efficient multilevel Monte Carlo numerical method for simulating Coulomb collisions. The method separates and optimally minimizes the finite-timestep and finite-sampling errors inherent in the Langevin representation of the Landau-Fokker-Planck equation. It does so by combining multiple solutions to the underlying equations with varying numbers of times… ▽ More We present a new, for plasma physics, highly efficient multilevel Monte Carlo numerical method for simulating Coulomb collisions. The method separates and optimally minimizes the finite-timestep and finite-sampling errors inherent in the Langevin representation of the Landau-Fokker-Planck equation. It does so by combining multiple solutions to the underlying equations with varying numbers of timesteps. For a desired level of accuracy epsilon, the computational cost of the method is order(epsilon^{-2}) or order(epsilon^{-2} (\ln epsilon)^2), depending on the underlying discretization, Milstein or Euler-Maruyama respectively. This is to be contrasted with a cost of order(epsilon^{-3}) for direct simulation Monte Carlo or binary collision methods. We successfully demonstrate the method with a classic beam diffusion test case in 2D, making use of the Levy area approximation for the correlated Milstein cross terms, and generating a computational saving of a factor of 100 for epsilon = 10^{-5}. We discuss the importance of the method for problems in which collisions constitute the computational rate limiting step, and its limitations. △ Less

Submitted 14 October, 2013; originally announced October 2013.

Comments: 32 pages

arXiv:1308.1758 [pdf, ps, other]

doi 10.1073/pnas.1318679110

Compressed Modes for Variational Problems in Mathematics and Physics

Authors: Vidvuds Ozoliņš, Rongjie Lai, Russel Caflisch, Stanley Osher

Abstract: This paper describes a general formalism for obtaining localized solutions to a class of problems in mathematical physics, which can be recast as variational optimization problems. This class includes the important cases of Schrödinger's equation in quantum mechanics and electromagnetic equations for light propagation in photonic crystals. These ideas can also be applied to develop a spatially loc… ▽ More This paper describes a general formalism for obtaining localized solutions to a class of problems in mathematical physics, which can be recast as variational optimization problems. This class includes the important cases of Schrödinger's equation in quantum mechanics and electromagnetic equations for light propagation in photonic crystals. These ideas can also be applied to develop a spatially localized basis that spans the eigenspace of a differential operator, for instance, the Laplace operator, generalizing the concept of plane waves to an orthogonal real-space basis with multi-resolution capabilities. △ Less

Submitted 28 August, 2013; v1 submitted 8 August, 2013; originally announced August 2013.

Comments: 18 pages

arXiv:1301.5678 [pdf, ps, other]

doi 10.1016/j.jcp.2014.04.059

An entropy based thermalization scheme for hybrid simulations of Coulomb collisions

Authors: L. F. Ricketson, M. S. Rosin, R. E. Caflisch, Andris M. Dimits

Abstract: We formulate and test a hybrid fluid-Monte Carlo scheme for the treatment of elastic collisions in gases and plasmas. While our primary focus and demonstrations of applicability are for moderately collisional plasmas, as described by the Landau-Fokker-Planck equation, the method is expected to be applicable also to collision processes described by the Boltzmann equation. This scheme is similar to… ▽ More We formulate and test a hybrid fluid-Monte Carlo scheme for the treatment of elastic collisions in gases and plasmas. While our primary focus and demonstrations of applicability are for moderately collisional plasmas, as described by the Landau-Fokker-Planck equation, the method is expected to be applicable also to collision processes described by the Boltzmann equation. This scheme is similar to the previously discussed velocity-based scheme [R. Caflisch, et. al, Multiscale Modeling & Simulation 7, 865, (2008)] and the scattering-angle-based scheme [A.M. Dimits, et. al, Bull. APS 55, no. 15 (2010, Abstract: XP9.00006)], but with a firmer theoretical basis and without the inherent limitation to the Landau-Fokker-Planck case. It gives a significant performance improvement (e.g., error for a given computational effort) over the velocity-based scheme. These features are achieved by assigning passive scalars to each simulated particle and tracking their evolution through collisions. The method permits a detailed error analysis that is confirmed by numerical results. The tests performed are for the evolution from anisotropic Maxwellian and a bump-on-tail distribution. △ Less

Submitted 23 January, 2013; originally announced January 2013.

arXiv:1212.4132 [pdf, ps, other]

doi 10.1073/pnas.1302752110

Sparse Dynamics for Partial Differential Equations

Authors: Hayden Schaeffer, Stanley Osher, Russel Caflisch, Cory Hauck

Abstract: We investigate the approximate dynamics of several differential equations when the solutions are restricted to a sparse subset of a given basis. The restriction is enforced at every time step by simply applying soft thresholding to the coefficients of the basis approximation. By reducing or compressing the information needed to represent the solution at every step, only the essential dynamics are… ▽ More We investigate the approximate dynamics of several differential equations when the solutions are restricted to a sparse subset of a given basis. The restriction is enforced at every time step by simply applying soft thresholding to the coefficients of the basis approximation. By reducing or compressing the information needed to represent the solution at every step, only the essential dynamics are represented. In many cases, there are natural bases derived from the differential equations which promote sparsity. We find that our method successfully reduces the dynamics of convection equations, diffusion equations, weak shocks, and vorticity equations with high frequency source terms. △ Less

Submitted 17 December, 2012; originally announced December 2012.

arXiv:1211.4396 [pdf, ps, other]

European Option Pricing with Transaction Costs and Stochastic Volatility: an Asymptotic Analysis

Authors: R. E. Caflisch, G. Gambino, M. Sammartino, C. Sgarra

Abstract: In this paper the valuation problem of a European call option in presence of both stochastic volatility and transaction costs is considered. In the limit of small transaction costs and fast mean reversion, an asymptotic expression for the option price is obtained. While the dominant term in the expansion it is shown to be the classical Black and Scholes solution, the correction terms appear at… ▽ More In this paper the valuation problem of a European call option in presence of both stochastic volatility and transaction costs is considered. In the limit of small transaction costs and fast mean reversion, an asymptotic expression for the option price is obtained. While the dominant term in the expansion it is shown to be the classical Black and Scholes solution, the correction terms appear at $O(\varepsilon^{1/2})$ and $O(\varepsilon)$. The optimal hedging strategy is then explicitly obtained for the Scott's model. △ Less

Submitted 19 November, 2012; originally announced November 2012.

arXiv:1110.2840 [pdf, ps, other]

Beyond the Child-Langmuir Limit

Authors: R. E. Caflisch, M. S. Rosin

Abstract: This paper describes a new solution formulation for fully nonlinear and unsteady planar flow of an electron beam in a diode. Using characteristic variables - i.e., variables that follow particle paths - the solution is expressed through an exact analytic, but implicit, formula for any choice of incoming velocity $v_0$, electric field $E_0$ and current $J_0$. For steady solutions, this approach cla… ▽ More This paper describes a new solution formulation for fully nonlinear and unsteady planar flow of an electron beam in a diode. Using characteristic variables - i.e., variables that follow particle paths - the solution is expressed through an exact analytic, but implicit, formula for any choice of incoming velocity $v_0$, electric field $E_0$ and current $J_0$. For steady solutions, this approach clarifies the origin of the maximal current $J_max$, derived by Child and Langmuir for $v_0=0$ and by Jaffe for $v_0>0$. The implicit formulation is used to find (1) unsteady solutions having constant incoming flux $J_0>J_max$, which leads formation of a virtual cathode, and (2) time-periodic solutions whose average flux exceeds the adiabatic average of $J_max$. △ Less

Submitted 13 October, 2011; originally announced October 2011.

Comments: 4 pages, 4 figures, submitted for publication

arXiv:1010.0108 [pdf, ps, other]

Direct simulation Monte Carlo schemes for Coulomb interactions in plasmas

Authors: Giacomo Dimarco, Russell Caflisch, Lorenzo Pareschi

Abstract: We consider the development of Monte Carlo schemes for molecules with Coulomb interactions. We generalize the classic algorithms of Bird and Nanbu-Babovsky for rarefied gas dynamics to the Coulomb case thanks to the approximation introduced by Bobylev and Nanbu (Theory of collision algorithms for gases and plasmas based on the Boltzmann equation and the Landau-Fokker-Planck equation, Physical Revi… ▽ More We consider the development of Monte Carlo schemes for molecules with Coulomb interactions. We generalize the classic algorithms of Bird and Nanbu-Babovsky for rarefied gas dynamics to the Coulomb case thanks to the approximation introduced by Bobylev and Nanbu (Theory of collision algorithms for gases and plasmas based on the Boltzmann equation and the Landau-Fokker-Planck equation, Physical Review E, Vol. 61, 2000). Thus, instead of considering the original Boltzmann collision operator, the schemes are constructed through the use of an approximated Boltzmann operator. With the above choice larger time steps are possible in simulations; moreover the expensive acceptance-rejection procedure for collisions is avoided and every particle collides. Error analysis and comparisons with the original Bobylev-Nanbu (BN) scheme are performed. The numerical results show agreement with the theoretical convergence rate of the approximated Boltzmann operator and the better performance of Bird-type schemes with respect to the original scheme. △ Less

Submitted 4 October, 2010; v1 submitted 1 October, 2010; originally announced October 2010.

MSC Class: 65M99; 65L06; 82D05

Journal ref: Communications in Applied and Industrial Mathematics 2010

arXiv:0709.2726 [pdf, other]

Anisotropic step stiffness from a kinetic model of epitaxial growth

Authors: Dionisios Margetis, Russel E. Caflisch

Abstract: Starting from a detailed model for the kinetics of a step edge or island boundary, we derive a Gibbs-Thomson type formula and the associated step stiffness as a function of the step edge orientation angle, $theta$. Basic ingredients of the model are: (i) the diffusion of point defects (``adatoms'') on terraces and along step edges; (ii) the convection of kinks along step edges; and (iii) constit… ▽ More Starting from a detailed model for the kinetics of a step edge or island boundary, we derive a Gibbs-Thomson type formula and the associated step stiffness as a function of the step edge orientation angle, $theta$. Basic ingredients of the model are: (i) the diffusion of point defects (``adatoms'') on terraces and along step edges; (ii) the convection of kinks along step edges; and (iii) constitutive laws that relate adatom fluxes, sources for kinks, and the kink velocity with densities via a mean-field approach. This model has a kinetic (nonequilibrium) steady-state solution that corresponds to epitaxial growth through step flow. The step stiffness, $\tbe(θ)$, is determined via perturbations of the kinetic steady state for small edge Peclet number, P, which is the ratio of the deposition to the diffusive flux along a step edge. In particular, $\tbe$ is found to satisfy $\tbe =O(θ^{-1})$ for $O(P^{1/3}) <θ\ll 1$, which is in agreement with independent, equilibrium-based calculations. △ Less

Submitted 17 September, 2007; originally announced September 2007.

Comments: 32 pages, 1 figure; submitted to Multiscale Modeling and Simulation

arXiv:cond-mat/0111258 [pdf, ps, other]

doi 10.1103/PhysRevE.64.061602

Level Set Approach to Reversible Epitaxial Growth

Authors: M. Petersen, C. Ratsch, R. E. Caflisch, A. Zangwill

Abstract: We generalize the level set approach to model epitaxial growth to include thermal detachment of atoms from island edges. This means that islands do not always grow and island dissociation can occur. We make no assumptions about a critical nucleus. Excellent quantitative agreement is obtained with kinetic Monte Carlo simulations for island densities and island size distributions in the submonolay… ▽ More We generalize the level set approach to model epitaxial growth to include thermal detachment of atoms from island edges. This means that islands do not always grow and island dissociation can occur. We make no assumptions about a critical nucleus. Excellent quantitative agreement is obtained with kinetic Monte Carlo simulations for island densities and island size distributions in the submonolayer regime. △ Less

Submitted 14 November, 2001; originally announced November 2001.

Comments: 7 pages, 9 figures

Showing 1–18 of 18 results for author: Caflisch, R