-
In-Context Operator Learning for Linear Propagator Models
Authors:
Tingwei Meng,
Moritz Voß,
Nils Detering,
Giulio Farolfi,
Stanley Osher,
Georg Menz
Abstract:
We study operator learning in the context of linear propagator models for optimal order execution problems with transient price impact à la Bouchaud et al. (2004) and Gatheral (2010). Transient price impact persists and decays over time according to some propagator kernel. Specifically, we propose to use In-Context Operator Networks (ICON), a novel transformer-based neural network architecture int…
▽ More
We study operator learning in the context of linear propagator models for optimal order execution problems with transient price impact à la Bouchaud et al. (2004) and Gatheral (2010). Transient price impact persists and decays over time according to some propagator kernel. Specifically, we propose to use In-Context Operator Networks (ICON), a novel transformer-based neural network architecture introduced by Yang et al. (2023), which facilitates data-driven learning of operators by merging offline pre-training with an online few-shot prompting inference. First, we train ICON to learn the operator from various propagator models that maps the trading rate to the induced transient price impact. The inference step is then based on in-context prediction, where ICON is presented only with a few examples. We illustrate that ICON is capable of accurately inferring the underlying price impact model from the data prompts, even with propagator kernels not seen in the training data. In a second step, we employ the pre-trained ICON model provided with context as a surrogate operator in solving an optimal order execution problem via a neural network control policy, and demonstrate that the exact optimal execution strategies from Abi Jaber and Neuman (2022) for the models generating the context are correctly retrieved. Our introduced methodology is very general, offering a new approach to solving optimal stochastic control problems with unknown state dynamics, inferred data-efficiently from a limited number of examples by leveraging the few-shot and transfer learning capabilities of transformer networks.
△ Less
Submitted 25 January, 2025;
originally announced January 2025.
-
Broadening the perspective for sustainable AI: Comprehensive sustainability criteria and indicators for AI systems
Authors:
Friederike Rohde,
Josephin Wagner,
Andreas Meyer,
Philipp Reinhard,
Marcus Voss,
Ulrich Petschow,
Anne Mollen
Abstract:
The increased use of AI systems is associated with multi-faceted societal, environmental, and economic consequences. These include non-transparent decision-making processes, discrimination, increasing inequalities, rising energy consumption and greenhouse gas emissions in AI model development and application, and an increasing concentration of economic power. By considering the multi-dimensionalit…
▽ More
The increased use of AI systems is associated with multi-faceted societal, environmental, and economic consequences. These include non-transparent decision-making processes, discrimination, increasing inequalities, rising energy consumption and greenhouse gas emissions in AI model development and application, and an increasing concentration of economic power. By considering the multi-dimensionality of sustainability, this paper takes steps towards substantiating the call for an overarching perspective on "sustainable AI". It presents the SCAIS Framework (Sustainability Criteria and Indicators for Artificial Intelligence Systems) which contains a set 19 sustainability criteria for sustainable AI and 67 indicators that is based on the results of a critical review and expert workshops. This interdisciplinary approach contributes a unique holistic perspective to facilitate and structure the discourse on sustainable AI. Further, it provides a concrete framework that lays the foundation for developing standards and tools to support the conscious development and application of AI systems.
△ Less
Submitted 22 November, 2023; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing Flows
Authors:
Marcel Arpogaus,
Marcus Voss,
Beate Sick,
Mark Nigge-Uricher,
Oliver Dürr
Abstract:
The transition to a fully renewable energy grid requires better forecasting of demand at the low-voltage level to increase efficiency and ensure reliable control. However, high fluctuations and increasing electrification cause huge forecast variability, not reflected in traditional point estimates. Probabilistic load forecasts take future uncertainties into account and thus allow more informed dec…
▽ More
The transition to a fully renewable energy grid requires better forecasting of demand at the low-voltage level to increase efficiency and ensure reliable control. However, high fluctuations and increasing electrification cause huge forecast variability, not reflected in traditional point estimates. Probabilistic load forecasts take future uncertainties into account and thus allow more informed decision-making for the planning and operation of low-carbon energy systems. We propose an approach for flexible conditional density forecasting of short-term load based on Bernstein polynomial normalizing flows, where a neural network controls the parameters of the flow. In an empirical study with 363 smart meter customers, our density predictions compare favorably against Gaussian and Gaussian mixture densities. Also, they outperform a non-parametric approach based on the pinball loss for 24h-ahead load forecasting for two different neural network architectures.
△ Less
Submitted 15 June, 2023; v1 submitted 29 April, 2022;
originally announced April 2022.
-
On Parametric Optimal Execution and Machine Learning Surrogates
Authors:
Tao Chen,
Mike Ludkovski,
Moritz Voß
Abstract:
We investigate optimal order execution problems in discrete time with instantaneous price impact and stochastic resilience. First, in the setting of linear transient price impact we derive a closed-form recursion for the optimal strategy, extending the deterministic results from Obizhaeva and Wang (J Financial Markets, 2013). Second, we develop a numerical algorithm based on dynamic programming an…
▽ More
We investigate optimal order execution problems in discrete time with instantaneous price impact and stochastic resilience. First, in the setting of linear transient price impact we derive a closed-form recursion for the optimal strategy, extending the deterministic results from Obizhaeva and Wang (J Financial Markets, 2013). Second, we develop a numerical algorithm based on dynamic programming and deep learning for the case of nonlinear transient price impact as proposed by Bouchaud et al. (Quant. Finance, 2004). Specifically, we utilize an actor-critic framework that constructs two neural-network (NN) surrogates for the value function and the feedback control. The flexible scalability of NN functional approximators enables parametric learning, i.e., incorporating several model or market parameters as part of the input space. Precise calibration of price impact, resilience, etc., is known to be extremely challenging and hence it is critical to understand sensitivity of the execution policy to these parameters. Our NN learner organically scales across multiple input dimensions and is shown to accurately approximate optimal strategies across a wide range of parameter configurations. We provide a fully reproducible Jupyter Notebook with our NN implementation, which is of independent pedagogical interest, demonstrating the ease of use of NN surrogates in (parametric) stochastic control problems.
△ Less
Submitted 29 October, 2023; v1 submitted 18 April, 2022;
originally announced April 2022.
-
An Arduino based heartbeat detection device (ArdMob-ECG) for real-time ECG analysis
Authors:
Tim Julian Möller,
Martin Voss,
Laura Kaltwasser
Abstract:
This technical paper provides a tutorial to build a low-cost (10-100 USD) and easy to assemble ECG device (ArdMob-ECG) that can be easily used for a variety of different scientific studies. The advantage of this device is that it automatically stores the data and has a built-in detection algorithm for heartbeats. Compared to a clinical ECG, this device entails a serial interface that can send trig…
▽ More
This technical paper provides a tutorial to build a low-cost (10-100 USD) and easy to assemble ECG device (ArdMob-ECG) that can be easily used for a variety of different scientific studies. The advantage of this device is that it automatically stores the data and has a built-in detection algorithm for heartbeats. Compared to a clinical ECG, this device entails a serial interface that can send triggers via USB directly to a computer and software (e.g. Unity, Matlab) with minimal delay due to its architecture. Its software and hardware is open-source and publicly available. The performance of the device regarding sensitivity and specificity is comparable to a professional clinical ECG and is assessed in this paper. Due to the open-source software, a variety of different research questions and individual alterations can be adapted using this ECG. The code as well as the circuit is publicly available and accessible for everyone to promote a better health system in remote areas, Open Science, and to boost scientific progress and the development of new paradigms that ultimately foster innovation.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
Online Search with Maximum Clearance
Authors:
Spyros Angelopoulos,
Malachi Voss
Abstract:
We study the setting in which a mobile agent must locate a hidden target in a bounded or unbounded environment, with no information about the hider's position. In particular, we consider online search, in which the performance of the search strategy is evaluated by its worst case competitive ratio. We introduce a multi-criteria search problem in which the searcher has a budget on its allotted sear…
▽ More
We study the setting in which a mobile agent must locate a hidden target in a bounded or unbounded environment, with no information about the hider's position. In particular, we consider online search, in which the performance of the search strategy is evaluated by its worst case competitive ratio. We introduce a multi-criteria search problem in which the searcher has a budget on its allotted search time, and the objective is to design strategies that are competitively efficient, respect the budget, and maximize the total searched ground. We give analytically optimal strategies for the line and the star environments, and efficient heuristics for general networks.
△ Less
Submitted 28 November, 2020;
originally announced November 2020.