
Showing 1–12 of 12 results for author: von Wurstemberger, P

  1. arXiv:2412.01371

    cs.LG cs.AI

    An overview of diffusion models for generative artificial intelligence

    Authors: Davide Gallon, Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: This article provides a mathematically rigorous introduction to denoising diffusion probabilistic models (DDPMs), sometimes also referred to as diffusion probabilistic models or diffusion models, for generative artificial intelligence. We provide a detailed basic mathematical framework for DDPMs and explain the main ideas behind training and generation procedures. In this overview article we also…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 56 pages, 5 figures

  2. arXiv:2408.13222

    math.NA stat.ML

    An Overview on Machine Learning Methods for Partial Differential Equations: from Physics Informed Neural Networks to Deep Operator Learning

    Authors: Lukas Gonon, Arnulf Jentzen, Benno Kuckuck, Siyu Liang, Adrian Riekert, Philippe von Wurstemberger

    Abstract: The approximation of solutions of partial differential equations (PDEs) with numerical algorithms is a central topic in applied mathematics. For many decades, various types of methods for this purpose have been developed and extensively studied. One class of methods which has received a lot of attention in recent years are machine learning-based methods, which typically involve the training of art…

    Submitted 23 August, 2024; originally announced August 2024.

  3. arXiv:2310.20360

    cs.LG cs.AI math.NA math.PR stat.ML

    Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory

    Authors: Arnulf Jentzen, Benno Kuckuck, Philippe von Wurstemberger

    Abstract: This book aims to provide an introduction to the topic of deep learning algorithms. We review essential components of deep learning algorithms in full mathematical detail including different artificial neural network (ANN) architectures (such as fully-connected feedforward ANNs, convolutional ANNs, recurrent ANNs, residual ANNs, and ANNs with batch normalization) and different optimization algorit…

    Submitted 25 February, 2025; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: 712 pages, 36 figures, 45 source codes, 87 exercises. In v2, the material on optimization algorithms/methods has been significantly expanded

    MSC Class: 68T07

  4. arXiv:2302.03286

    math.NA stat.ML

    Algorithmically Designed Artificial Neural Networks (ADANNs): Higher order deep operator learning for parametric partial differential equations

    Authors: Arnulf Jentzen, Adrian Riekert, Philippe von Wurstemberger

    Abstract: In this article we propose a new deep learning approach to approximate operators related to parametric partial differential equations (PDEs). In particular, we introduce a new strategy to design specific artificial neural network (ANN) architectures in conjunction with specific ANN initialization schemes which are tailor-made for the particular approximation problem under consideration. In the pro…

    Submitted 29 May, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 39 pages, 16 Figures

  5. arXiv:2202.02717

    math.NA math.AP math.PR

    Learning the random variables in Monte Carlo simulations with stochastic gradient descent: Machine learning for parametric PDEs and financial derivative pricing

    Authors: Sebastian Becker, Arnulf Jentzen, Marvin S. Müller, Philippe von Wurstemberger

    Abstract: In financial engineering, prices of financial products are computed approximately many times each trading day with (slightly) different parameters in each calculation. In many financial models such prices can be approximated by means of Monte Carlo (MC) simulations. To obtain a good approximation the MC sample size usually needs to be considerably large resulting in a long computing time to obtain…

    Submitted 8 June, 2023; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: 71 pages, 4 Figures, 14 Tables; to appear in Math. Finance

    MSC Class: 35K15; 65C05; 65M75; 68T99; 91G20

  6. arXiv:2012.04326

    math.NA

    High-dimensional approximation spaces of artificial neural networks and applications to partial differential equations

    Authors: Pierfrancesco Beneventano, Patrick Cheridito, Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: In this paper we develop a new machinery to study the capacity of artificial neural networks (ANNs) to approximate high-dimensional functions without suffering from the curse of dimensionality. Specifically, we introduce a concept which we refer to as approximation spaces of artificial neural networks and we present several tools to handle those spaces. Roughly speaking, approximation spaces consi…

    Submitted 28 January, 2025; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 31 pages

  7. Numerical simulations for full history recursive multilevel Picard approximations for systems of high-dimensional partial differential equations

    Authors: Sebastian Becker, Ramon Braunwarth, Martin Hutzenthaler, Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: One of the most challenging issues in applied mathematics is to develop and analyze algorithms which are able to approximately compute solutions of high-dimensional nonlinear partial differential equations (PDEs). In particular, it is very hard to develop approximation algorithms which do not suffer under the curse of dimensionality in the sense that the number of computational operations needed b…

    Submitted 25 May, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: 21 pages

    MSC Class: 65M75; ACM Class: G.1.8

    Journal ref: Commun. Comput. Phys. 28 (2020), no. 5, 2109-2138

  8. Overcoming the curse of dimensionality in the approximative pricing of financial derivatives with default risks

    Authors: Martin Hutzenthaler, Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: Parabolic partial differential equations (PDEs) are widely used in the mathematical modeling of natural phenomena and man made complex systems. In particular, parabolic PDEs are a fundamental tool to determine fair prices of financial derivatives in the financial industry. The PDEs appearing in financial engineering applications are often nonlinear and high dimensional since the dimension typicall…

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: 71 pages. arXiv admin note: text overlap with arXiv:1807.01212

    Journal ref: Electron. J. Probab. 25 (2020), 101

  9. arXiv:1809.02362

    math.NA cs.LG math.PR q-fin.MF

    A proof that artificial neural networks overcome the curse of dimensionality in the numerical approximation of Black-Scholes partial differential equations

    Authors: Philipp Grohs, Fabian Hornung, Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: Artificial neural networks (ANNs) have very successfully been used in numerical simulations for a series of computational problems ranging from image classification/image recognition, speech recognition, time series analysis, game intelligence, and computational advertising to numerical approximations of partial differential equations (PDEs). Such numerical simulations suggest that ANNs have the c…

    Submitted 25 January, 2023; v1 submitted 7 September, 2018; originally announced September 2018.

    Comments: To appear in Mem. Amer. Math. Soc.; 126 pages

    Journal ref: Mem. Amer. Math. Soc. 284 (2023), no. 1410, v+93 pp.

  10. arXiv:1807.01212

    math.PR math.AP math.NA

    Overcoming the curse of dimensionality in the numerical approximation of semilinear parabolic partial differential equations

    Authors: Martin Hutzenthaler, Arnulf Jentzen, Thomas Kruse, Tuan Anh Nguyen, Philippe von Wurstemberger

    Abstract: For a long time it is well-known that high-dimensional linear parabolic partial differential equations (PDEs) can be approximated by Monte Carlo methods with a computational effort which grows polynomially both in the dimension and in the reciprocal of the prescribed accuracy. In other words, linear PDEs do not suffer from the curse of dimensionality. For general semilinear PDEs with Lipschitz coe…

    Submitted 24 June, 2020; v1 submitted 3 July, 2018; originally announced July 2018.

    MSC Class: 65M75

    Journal ref: Proceedings of the Royal Society A 476, no. 2244 (2020): 20190630

  11. arXiv:1803.08600

    math.NA math.PR stat.ML

    Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates

    Authors: Arnulf Jentzen, Philippe von Wurstemberger

    Abstract: The stochastic gradient descent (SGD) optimization algorithm plays a central role in a series of machine learning applications. The scientific literature provides a vast amount of upper error bounds for the SGD method. Much less attention has been paid to proving lower error bounds for the SGD method. It is the key contribution of this paper to make a step in this direction. More precisely, in this…

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: 42 pages

    Journal ref: J. Complexity 57 (2020), 101438

  12. arXiv:1801.09324

    math.NA math.PR

    Strong error analysis for stochastic gradient descent optimization algorithms

    Authors: Arnulf Jentzen, Benno Kuckuck, Ariel Neufeld, Philippe von Wurstemberger

    Abstract: Stochastic gradient descent (SGD) optimization algorithms are key ingredients in a series of machine learning applications. In this article we perform a rigorous strong error analysis for SGD optimization algorithms. In particular, we prove for every arbitrarily small $\varepsilon \in (0,\infty)$ and every arbitrarily large $p\in (0,\infty)$ that the considered SGD optimization algorithm converges…

    Submitted 28 January, 2018; originally announced January 2018.

    Journal ref: IMA J. Numer. Anal. (2020), drz055