-
A class of locally state-dependent models for forward curves
Authors:
Nils Detering,
Silvia Lavagnini
Abstract:
We present a dynamic model for forward curves within the Heath-Jarrow-Morton framework under the Musiela parametrization. The forward curves take values in a function space H, and their dynamics follows a stochastic partial differential equation with state-dependent coefficients. In particular, the coefficients are defined through point-wise operating maps on H, resulting in a locally state-depend…
▽ More
We present a dynamic model for forward curves within the Heath-Jarrow-Morton framework under the Musiela parametrization. The forward curves take values in a function space H, and their dynamics follows a stochastic partial differential equation with state-dependent coefficients. In particular, the coefficients are defined through point-wise operating maps on H, resulting in a locally state-dependent structure. We first explore conditions under which these point-wise operators are well defined on H. Next, we determine conditions to ensure that the resulting coefficient functions satisfy local growth and Lipschitz properties, so to guarantee the existence and uniqueness of mild solutions. The proposed model captures the behavior of the entire forward curve through a single equation, yet retains remarkable simplicity. Notably, we demonstrate that certain one-dimensional projections of the model are Markovian and satisfy a one-dimensional stochastic differential equation. This connects our Hilbert-space approach to well established models for forward contracts with fixed delivery times, for which existing formulas and numerical techniques can be applied. This link allows us to examine also conditions for maintaining positivity of the solutions. As concrete examples, we analyze Hilbert-space valued variants of an exponential model and of a constant elasticity of variance model.
△ Less
Submitted 13 March, 2025; v1 submitted 13 February, 2025;
originally announced February 2025.
-
In-Context Operator Learning for Linear Propagator Models
Authors:
Tingwei Meng,
Moritz Voß,
Nils Detering,
Giulio Farolfi,
Stanley Osher,
Georg Menz
Abstract:
We study operator learning in the context of linear propagator models for optimal order execution problems with transient price impact à la Bouchaud et al. (2004) and Gatheral (2010). Transient price impact persists and decays over time according to some propagator kernel. Specifically, we propose to use In-Context Operator Networks (ICON), a novel transformer-based neural network architecture int…
▽ More
We study operator learning in the context of linear propagator models for optimal order execution problems with transient price impact à la Bouchaud et al. (2004) and Gatheral (2010). Transient price impact persists and decays over time according to some propagator kernel. Specifically, we propose to use In-Context Operator Networks (ICON), a novel transformer-based neural network architecture introduced by Yang et al. (2023), which facilitates data-driven learning of operators by merging offline pre-training with an online few-shot prompting inference. First, we train ICON to learn the operator from various propagator models that maps the trading rate to the induced transient price impact. The inference step is then based on in-context prediction, where ICON is presented only with a few examples. We illustrate that ICON is capable of accurately inferring the underlying price impact model from the data prompts, even with propagator kernels not seen in the training data. In a second step, we employ the pre-trained ICON model provided with context as a surrogate operator in solving an optimal order execution problem via a neural network control policy, and demonstrate that the exact optimal execution strategies from Abi Jaber and Neuman (2022) for the models generating the context are correctly retrieved. Our introduced methodology is very general, offering a new approach to solving optimal stochastic control problems with unknown state dynamics, inferred data-efficiently from a limited number of examples by leveraging the few-shot and transfer learning capabilities of transformer networks.
△ Less
Submitted 25 January, 2025;
originally announced January 2025.
-
Structure-informed operator learning for parabolic Partial Differential Equations
Authors:
Fred Espen Benth,
Nils Detering,
Luca Galimberti
Abstract:
In this paper, we present a framework for learning the solution map of a backward parabolic Cauchy problem. The solution depends continuously but nonlinearly on the final data, source, and force terms, all residing in Banach spaces of functions. We utilize Fréchet space neural networks (Benth et al. (2023)) to address this operator learning problem. Our approach provides an alternative to Deep Ope…
▽ More
In this paper, we present a framework for learning the solution map of a backward parabolic Cauchy problem. The solution depends continuously but nonlinearly on the final data, source, and force terms, all residing in Banach spaces of functions. We utilize Fréchet space neural networks (Benth et al. (2023)) to address this operator learning problem. Our approach provides an alternative to Deep Operator Networks (DeepONets), using basis functions to span the relevant function spaces rather than relying on finite-dimensional approximations through censoring. With this method, structural information encoded in the basis coefficients is leveraged in the learning process. This results in a neural network designed to learn the mapping between infinite-dimensional function spaces. Our numerical proof-of-concept demonstrates the effectiveness of our method, highlighting some advantages over DeepONets.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Reinforcement Learning for Intra-and-Inter-Bank Borrowing and Lending Mean Field Control Game
Authors:
Andrea Angiuli,
Nils Detering,
Jean-Pierre Fouque,
Mathieu Laurière,
Jimin Lin
Abstract:
We propose a mean field control game model for the intra-and-inter-bank borrowing and lending problem. This framework allows to study the competitive game arising between groups of collaborative banks. The solution is provided in terms of an asymptotic Nash equilibrium between the groups in the infinite horizon. A three-timescale reinforcement learning algorithm is applied to learn the optimal bor…
▽ More
We propose a mean field control game model for the intra-and-inter-bank borrowing and lending problem. This framework allows to study the competitive game arising between groups of collaborative banks. The solution is provided in terms of an asymptotic Nash equilibrium between the groups in the infinite horizon. A three-timescale reinforcement learning algorithm is applied to learn the optimal borrowing and lending strategy in a data driven way when the model is unknown. An empirical numerical analysis shows the importance of the three-timescale, the impact of the exploration strategy when the model is unknown, and the convergence of the algorithm.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
Percolation in Random Graphs of Unbounded Rank
Authors:
Nils Detering,
Jimin Lin
Abstract:
Bootstrap percolation in (random) graphs is a contagion dynamics among a set of vertices with certain threshold levels. The process is started by a set of initially infected vertices, and an initially uninfected vertex with threshold $k$ gets infected as soon as the number of its infected neighbors exceeds $k$. This process has been studied extensively in so called \textit{rank one} models. These…
▽ More
Bootstrap percolation in (random) graphs is a contagion dynamics among a set of vertices with certain threshold levels. The process is started by a set of initially infected vertices, and an initially uninfected vertex with threshold $k$ gets infected as soon as the number of its infected neighbors exceeds $k$. This process has been studied extensively in so called \textit{rank one} models. These models can generate random graphs with heavy-tailed degree sequences but they are not capable of clustering. In this paper, we treat a class of random graphs of unbounded rank which allow for extensive clustering. Our main result determines the final fraction of infected vertices as the fixed point of a non-linear operator defined on a suitable function space. We propose an algorithm that facilitates neural networks to calculate this fixed point efficiently. We further derive criteria based on the Fréchet derivative of the operator that allows one to determine whether small infections spread through the entire graph or rather stay local.
△ Less
Submitted 2 November, 2022; v1 submitted 29 May, 2022;
originally announced May 2022.
-
Reinforcement Learning Algorithm for Mixed Mean Field Control Games
Authors:
Andrea Angiuli,
Nils Detering,
Jean-Pierre Fouque,
Mathieu Lauriere,
Jimin Lin
Abstract:
We present a new combined \textit{mean field control game} (MFCG) problem which can be interpreted as a competitive game between collaborating groups and its solution as a Nash equilibrium between groups. Players coordinate their strategies within each group. An example is a modification of the classical trader's problem. Groups of traders maximize their wealth. They face cost for their transactio…
▽ More
We present a new combined \textit{mean field control game} (MFCG) problem which can be interpreted as a competitive game between collaborating groups and its solution as a Nash equilibrium between groups. Players coordinate their strategies within each group. An example is a modification of the classical trader's problem. Groups of traders maximize their wealth. They face cost for their transactions, for their own terminal positions, and for the average holding within their group. The asset price is impacted by the trades of all agents. We propose a three-timescale reinforcement learning algorithm to approximate the solution of such MFCG problems. We test the algorithm on benchmark linear-quadratic specifications for which we provide analytic solutions.
△ Less
Submitted 15 February, 2023; v1 submitted 4 May, 2022;
originally announced May 2022.
-
Pricing options on flow forwards by neural networks in Hilbert space
Authors:
Fred Espen Benth,
Nils Detering,
Luca Galimberti
Abstract:
We propose a new methodology for pricing options on flow forwards by applying infinite-dimensional neural networks. We recast the pricing problem as an optimization problem in a Hilbert space of real-valued function on the positive real line, which is the state space for the term structure dynamics. This optimization problem is solved by facilitating a novel feedforward neural network architecture…
▽ More
We propose a new methodology for pricing options on flow forwards by applying infinite-dimensional neural networks. We recast the pricing problem as an optimization problem in a Hilbert space of real-valued function on the positive real line, which is the state space for the term structure dynamics. This optimization problem is solved by facilitating a novel feedforward neural network architecture designed for approximating continuous functions on the state space. The proposed neural net is built upon the basis of the Hilbert space. We provide an extensive case study that shows excellent numerical efficiency, with superior performance over that of a classical neural net trained on sampling the term structure curves.
△ Less
Submitted 17 February, 2022;
originally announced February 2022.
-
Optimal Support for Distressed Subsidiaries -- a Systemic Risk Perspective
Authors:
Maxim Bichuch,
Nils Detering
Abstract:
We consider a network of bank holdings, where every holding has two subsidiaries of different types. A subsidiary can trade with another holding's subsidiary of the same type. Holdings support their subsidiaries up to a certain level when they would otherwise fail to honor their financial obligations. We investigate the spread of contagion in this banking network when the number of bank holdings i…
▽ More
We consider a network of bank holdings, where every holding has two subsidiaries of different types. A subsidiary can trade with another holding's subsidiary of the same type. Holdings support their subsidiaries up to a certain level when they would otherwise fail to honor their financial obligations. We investigate the spread of contagion in this banking network when the number of bank holdings is large, and find the final number of defaulted subsidiaries under different rules for the holding support. We also consider resilience of this multilayered network to small shocks. Our work sheds light onto the role that holding structures can play in the amplification of financial stress. We find that depending on the capitalization of the network, a holding structure can be beneficial as compared to smaller separated entities. In other instances, it can be harmful and actually increase contagion. We illustrate our results in a numerical case study and also determine the optimal level of holding support from a regulator perspective.
△ Less
Submitted 7 March, 2024; v1 submitted 30 January, 2022;
originally announced January 2022.
-
Neural Networks in Fréchet spaces
Authors:
Fred Espen Benth,
Nils Detering,
Luca Galimberti
Abstract:
We define a neural network in infinite dimensional spaces for which we can show the universal approximation property. Indeed, we derive approximation results for continuous functions from a Fréchet space $\X$ into a Banach space $\Y$. The approximation results are generalising the well known universal approximation theorem for continuous functions from $\mathbb{R}^n$ to $\mathbb{R}$, where approxi…
▽ More
We define a neural network in infinite dimensional spaces for which we can show the universal approximation property. Indeed, we derive approximation results for continuous functions from a Fréchet space $\X$ into a Banach space $\Y$. The approximation results are generalising the well known universal approximation theorem for continuous functions from $\mathbb{R}^n$ to $\mathbb{R}$, where approximation is done with (multilayer) neural networks [15, 25, 18, 29]. Our infinite dimensional networks are constructed using activation functions being nonlinear operators and affine transforms. Several examples are given of such activation functions. We show furthermore that our neural networks on infinite dimensional spaces can be projected down to finite dimensional subspaces with any desirable accuracy, thus obtaining approximating networks that are easy to implement and allow for fast computation and fitting. The resulting neural network architecture is therefore applicable for prediction tasks based on functional data.
△ Less
Submitted 16 May, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Abstract polynomial processes
Authors:
Fred Espen Benth,
Nils Detering,
Paul Kruhner
Abstract:
We suggest a novel approach to polynomial processes solely based on a polynomial action operator. With this approach, we can analyse such processes on general state spaces, going far beyond Banach spaces. Moreover, we can be very flexible in the definition of what "polynomial" means. We show that "polynomial process" universally means "affine drift". Simple assumptions on the polynomial action ope…
▽ More
We suggest a novel approach to polynomial processes solely based on a polynomial action operator. With this approach, we can analyse such processes on general state spaces, going far beyond Banach spaces. Moreover, we can be very flexible in the definition of what "polynomial" means. We show that "polynomial process" universally means "affine drift". Simple assumptions on the polynomial action operators lead to stronger characterisations on the polynomial class of processes.
In our framework we do not need to specify polynomials explicitly but can work with a general sequence of graded vector spaces of functions on the state space. Elements of these graded vector spaces form the monomials by introducing a sequence of vector space complements. The basic tool of our analysis is the polynomial action operator, which is a semigroup of operators mapping conditional expected values of monomials acting on the polynomial process to monomials of the same or lower grade. Unlike the classical Euclidean case, the polynomial action operator may not form a finite-dimensional subspace after a finite iteration, a property we call locally finite. We study abstract polynomial processes under both algebraic and topological assumptions on the polynomial actions, and establish an affine drift structure. Moreover, we characterize the covariance structure under similar but slightly stronger conditions. A crucial part in our analysis is the use of the (algebraic or topological) dual of the monomials of grade one, which serves as a linearization of the state space of the polynomial process. Our general framework covers polynomial processes with values in Banach spaces recently studied by Cuchiero and Svaluto-Ferro.
△ Less
Submitted 6 October, 2020;
originally announced October 2020.
-
Stochastic Volterra integral equations and a class of first order stochastic partial differential equations
Authors:
Fred Espen Benth,
Nils Detering,
Paul Kruehner
Abstract:
We investigate stochastic Volterra equations and their limiting laws. The stochastic Volterra equations we consider are driven by a Hilbert space valued \Levy noise and integration kernels may have non-linear dependence on the current state of the process. Our method is based on an embedding into a Hilbert space of functions which allows to represent the solution of the Volterra equation as the bo…
▽ More
We investigate stochastic Volterra equations and their limiting laws. The stochastic Volterra equations we consider are driven by a Hilbert space valued \Levy noise and integration kernels may have non-linear dependence on the current state of the process. Our method is based on an embedding into a Hilbert space of functions which allows to represent the solution of the Volterra equation as the boundary value of a solution to a stochastic partial differential equation. We first gather abstract results and give more detailed conditions in more specific function spaces.
△ Less
Submitted 20 July, 2020; v1 submitted 12 March, 2019;
originally announced March 2019.
-
Independent increment processes: A multilinearity preserving property
Authors:
Fred Espen Benth,
Nils Detering,
Paul Kruhner
Abstract:
We observe a multilinearity preserving property of conditional expectation for infinite dimensional independent increment processes defined on some abstract Banach space $B$. It is similar in nature to the polynomial preserving property analysed greatly for finite dimensional stochastic processes and thus offers an infinite dimensional generalisation. However, while polynomials are defined using t…
▽ More
We observe a multilinearity preserving property of conditional expectation for infinite dimensional independent increment processes defined on some abstract Banach space $B$. It is similar in nature to the polynomial preserving property analysed greatly for finite dimensional stochastic processes and thus offers an infinite dimensional generalisation. However, while polynomials are defined using the multiplication operator and as such require a Banach algebra structure, the multilinearity preserving property we prove here holds even for processes defined on a Banach space which is not necessary a Banach algebra. In the special case of $B$ being a commutative Banach algebra, we show that independent increment processes are polynomial processes in a sense that coincides with a canonical extension of polynomial processes from the finite dimensional case. The assumption of commutativity is shown to be crucial and in a non-commutative Banach algebra the multilinearity concept arises naturally. Some of our results hold beyond independent increment processes and thus shed light on infinite dimensional polynomial processes in general.
△ Less
Submitted 20 July, 2020; v1 submitted 5 September, 2018;
originally announced September 2018.
-
Directed Chain Stochastic Differential Equations
Authors:
Nils Detering,
Jean-Pierre Fouque,
Tomoyuki Ichiba
Abstract:
We propose a particle system of diffusion processes coupled through a chain-like network structure described by an infinite-dimensional, nonlinear stochastic differential equation of McKean-Vlasov type. It has both (i) a local chain interaction and (ii) a mean-field interaction. It can be approximated by a limit of finite particle systems, as the number of particles goes to infinity. Due to the lo…
▽ More
We propose a particle system of diffusion processes coupled through a chain-like network structure described by an infinite-dimensional, nonlinear stochastic differential equation of McKean-Vlasov type. It has both (i) a local chain interaction and (ii) a mean-field interaction. It can be approximated by a limit of finite particle systems, as the number of particles goes to infinity. Due to the local chain interaction, propagation of chaos does not necessarily hold. Furthermore, we exhibit a dichotomy of presence or absence of mean-field interaction, and we discuss the problem of detecting its presence from the observation of a single component process.
△ Less
Submitted 17 July, 2019; v1 submitted 4 May, 2018;
originally announced May 2018.
-
Financial Contagion in a Generalized Stochastic Block Model
Authors:
Nils Detering,
Thilo Meyer-Brandis,
Konstantinos Panagiotou,
Daniel Ritter
Abstract:
One of the most defining features of the global financial network is its inherent complex and intertwined structure. From the perspective of systemic risk it is important to understand the influence of this network structure on default contagion. Using sparse random graphs to model the financial network, asymptotic methods turned out powerful to analytically describe the contagion process and to m…
▽ More
One of the most defining features of the global financial network is its inherent complex and intertwined structure. From the perspective of systemic risk it is important to understand the influence of this network structure on default contagion. Using sparse random graphs to model the financial network, asymptotic methods turned out powerful to analytically describe the contagion process and to make statements about resilience. So far, however, they have been limited to so-called {\em rank one} models in which informally the only network parameter is the degree sequence (see (Amini et. al. 2016) and (Detering et. al. 2019) for example) and the contagion process can be described by a one dimensional fix-point equation. These networks fail to account for a pronounced block structure such as core/periphery or a network composed of different connected blocks for different countries. We present a much more general model here, where we distinguish vertices (institutions) of different types and let edge probabilities and exposures depend on the types of both, the receiving and the sending vertex plus additional parameters. Our main result allows to compute explicitly the systemic damage caused by some initial local shock event, and we derive a complete characterisation of resilient respectively non-resilient financial systems. This is the first instance that default contagion is rigorously studied in a model outside the class of rank one models and several technical challenges arise. Moreover, in contrast to previous work, in which networks could be classified as resilient or non resilient, independent of the distribution of the shock, information about the shock becomes important in our model and a more refined resilience condition arises. Among other applications of our theory we derive resilience conditions for the global network based on subnetwork conditions only.
△ Less
Submitted 9 December, 2019; v1 submitted 21 March, 2018;
originally announced March 2018.
-
Bootstrap percolation in directed and inhomogeneous random graphs
Authors:
Nils Detering,
Thilo Meyer-Brandis,
Konstantinos Panagiotou
Abstract:
Bootstrap percolation is a process that is used to model the spread of an infection on a given graph. In the model considered here each vertex is equipped with an individual threshold. As soon as the number of infected neighbors exceeds that threshold, the vertex gets infected as well and remains so forever. We perform a thorough analysis of bootstrap percolation on a novel model of directed and i…
▽ More
Bootstrap percolation is a process that is used to model the spread of an infection on a given graph. In the model considered here each vertex is equipped with an individual threshold. As soon as the number of infected neighbors exceeds that threshold, the vertex gets infected as well and remains so forever. We perform a thorough analysis of bootstrap percolation on a novel model of directed and inhomogeneous random graphs, where the distribution of the edges is specified by assigning two distinct weights to each vertex, describing the tendency of it to receive edges from or to send edges to other vertices. Under the assumption that the limiting degree distribution of the graph is integrable we determine the typical fraction of infected vertices. Our model allows us to study a variety of settings, in particular the prominent case in which the degree distribution has an unbounded variance. Among other results, we quantify the notion of "systemic risk", that is, to what extent local adverse shocks can propagate to large parts of the graph through a cascade, and discover novel features that make graphs prone/resilient to initially small infections.
△ Less
Submitted 27 April, 2017; v1 submitted 25 November, 2015;
originally announced November 2015.