-
A General Purpose Spectral Foundational Model for Both Proximal and Remote Sensing Spectral Imaging
Authors:
William Michael Laprade,
Jesper Cairo Westergaard,
Svend Christensen,
Mads Nielsen,
Anders Bjorholm Dahl
Abstract:
Spectral imaging data acquired via multispectral and hyperspectral cameras can have hundreds of channels, where each channel records the reflectance at a specific wavelength and bandwidth. Time and resource constraints limit our ability to collect large spectral datasets, making it difficult to build and train predictive models from scratch. In the RGB domain, we can often alleviate some of the limitations of smaller datasets by using pretrained foundational models as a starting point. However, most existing foundation models are pretrained on large datasets of 3-channel RGB images, severely limiting their effectiveness when used with spectral imaging data. The few spectral foundation models that do exist usually have one of two limitations: (1) they are built and trained only on remote sensing data, limiting their application in proximal spectral imaging, (2) they utilize the more widely available multispectral imaging datasets with fewer than 15 channels, restricting their use with hundred-channel hyperspectral images. To alleviate these issues, we propose a large-scale foundational model and dataset built upon the masked autoencoder architecture that takes advantage of spectral channel encoding, spatial-spectral masking and ImageNet pretraining for an adaptable and robust model for downstream spectral imaging tasks.
Submitted 3 March, 2025;
originally announced March 2025.
-
Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions
Authors:
Sören Christensen,
Claudia Strauch,
Lukas Trottner
Abstract:
We introduce a new class of generative diffusion models that, unlike conventional denoising diffusion models, achieve a time-homogeneous structure for both the noising and denoising processes, allowing the number of steps to adaptively adjust based on the noise level. This is accomplished by conditioning the forward process using Doob's $h$-transform, which terminates the process at a suitable sampling distribution at a random time. The model is particularly well suited for generating data with lower intrinsic dimensions, as the termination criterion simplifies to a first-hitting rule. A key feature of the model is its adaptability to the target data, enabling a variety of downstream tasks using a pre-trained unconditional generative model. These tasks include natural conditioning through appropriate initialization of the denoising process and classification of noisy data.
Submitted 31 January, 2025;
originally announced January 2025.
-
NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models
Authors:
Zhuoran Qiao,
Feizhi Ding,
Thomas Dresselhaus,
Mia A. Rosenfeld,
Xiaotian Han,
Owen Howell,
Aniketh Iyengar,
Stephen Opalenski,
Anders S. Christensen,
Sai Krishna Sirumalla,
Frederick R. Manby,
Thomas F. Miller III,
Matthew Welborn
Abstract:
Structure determination is essential to a mechanistic understanding of diseases and the development of novel therapeutics. Machine-learning-based structure prediction methods have made significant advancements by computationally predicting protein and bioassembly structures from sequences and molecular topology alone. Despite substantial progress in the field, challenges remain to deliver structure prediction models to real-world drug discovery. Here, we present NeuralPLexer3 -- a physics-inspired flow-based generative model that achieves state-of-the-art prediction accuracy on key biomolecular interaction types and improves training and sampling efficiency compared to its predecessors and alternative methodologies. Examined through newly developed benchmarking strategies, NeuralPLexer3 excels in vital areas that are crucial to structure-based drug design, such as physical validity and ligand-induced conformational changes.
Submitted 18 December, 2024; v1 submitted 14 December, 2024;
originally announced December 2024.
-
General Markovian randomized equilibrium existence and construction in zero-sum Dynkin games for diffusions
Authors:
Sören Christensen,
Kristoffer Lindensjö
Abstract:
One of the most classical games for stochastic processes is the zero-sum Dynkin (stopping) game. We present a complete equilibrium solution to a general formulation of this game with an underlying one-dimensional diffusion. A key result is the construction of a characterizable global $ε$-Nash equilibrium in Markovian randomized stopping times for every $ε> 0$. This is achieved by leveraging the well-known equilibrium structure under a restrictive ordering condition on the payoff functions, leading to a novel approach based on an appropriate notion of randomization that allows for solving the general game without any ordering condition. Additionally, we provide conditions for the existence of pure and randomized Nash equilibria (with $ε=0$). Our results enable explicit identification of equilibrium stopping times and their corresponding values in many cases, illustrated by several examples.
Submitted 12 December, 2024;
originally announced December 2024.
-
Learning to steer with Brownian noise
Authors:
Stefan Ankirchner,
Sören Christensen,
Jan Kallsen,
Philip Le Borne,
Stefan Perko
Abstract:
This paper considers an ergodic version of the bounded velocity follower problem, assuming that the decision maker lacks knowledge of the underlying system parameters and must learn them while simultaneously controlling. We propose algorithms based on moving empirical averages and develop a framework for integrating statistical methods with stochastic control theory. Our primary result is a logarithmic expected regret rate. To achieve this, we conduct a rigorous analysis of the ergodic convergence rates of the underlying processes and the risks of the considered estimators.
Submitted 4 October, 2024;
originally announced October 2024.
-
On the existence of Markovian randomized equilibria in Dynkin games of war-of-attrition-type
Authors:
Sören Christensen,
Boy Schultz
Abstract:
In optimal stopping problems, a Markov structure guarantees Markovian optimal stopping times (first exit times). Surprisingly, there is no analogous result for Markovian stopping games once randomization is required. This paper addresses this gap by proving the existence of Markov-perfect equilibria in a specific type of stopping game - a general nonzero-sum Dynkin game of the war-of-attrition type with underlying linear diffusions. Our main mathematical contribution lies in the development of appropriate topologies for Markovian randomized stopping times. This allows us to establish the existence of equilibria within a tractable and interpretable class of stopping times, paving the way for further analysis of Markovian stopping games.
Submitted 1 August, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
On first passage time problems of Brownian motion -- The inverse method of images revisited
Authors:
Sören Christensen,
Oskar Hallmann,
Maike Klein
Abstract:
Let $W$ be a standard Brownian motion with $W_0 = 0$ and let $b\colon[0,\infty) \to \mathbb{R}$ be a continuous function with $b(0) > 0$. In this article, we look at the classical First Passage Time (FPT) problem, i.e., the question of determining the distribution of $τ:= \inf \{ t\in [0,\infty)\colon W_t \geq b(t) \}.$ More specifically, we revisit the method of images, which we feel has received less attention than it deserves. The main observation of this approach is that the FPT problem is fully solved if a measure $μ$ exists such that \begin{align*} \int_{(0,\infty)} \exp\left(-\frac{θ^2}{2t}+\frac{θb(t)}{t}\right)μ(dθ)=1, \qquad t\in(0,\infty). \end{align*} The goal of this article is to lay the foundation for answering the still open question of the existence and characterisation of such a measure $μ$ for a given curve $b$. We present a new duality approach that allows us to give sufficient conditions for the existence. Moreover, we introduce a very efficient algorithm for approximating the representing measure $μ$ and provide a rigorous theoretical foundation.
Submitted 25 April, 2024;
originally announced April 2024.
-
Data-driven optimal stopping: A pure exploration analysis
Authors:
Sören Christensen,
Niklas Dexheimer,
Claudia Strauch
Abstract:
The standard theory of optimal stopping is based on the idealised assumption that the underlying process is essentially known. In this paper, we drop this restriction and study data-driven optimal stopping for a general diffusion process, focusing on investigating the statistical performance of the proposed estimator of the optimal stopping barrier. More specifically, we derive non-asymptotic upper bounds on the simple regret, along with uniform and non-asymptotic PAC bounds. Minimax optimality is verified by completing the upper bound results with matching lower bounds on the simple regret. All results are shown both under general conditions on the payoff functions and under more refined assumptions that mimic the margin condition used in binary classification, leading to an improved rate of convergence. Additionally, we investigate how our results on the simple regret transfer to the cumulative regret for a specific exploration-exploitation strategy, both with respect to lower bounds and upper bounds.
Submitted 10 December, 2023;
originally announced December 2023.
-
Data-driven rules for multidimensional reflection problems
Authors:
Sören Christensen,
Asbjørn Holk Thomsen,
Lukas Trottner
Abstract:
Over the recent past data-driven algorithms for solving stochastic optimal control problems in face of model uncertainty have become an increasingly active area of research. However, for singular controls and underlying diffusion dynamics the analysis has so far been restricted to the scalar case. In this paper we fill this gap by studying a multivariate singular control problem for reversible diffusions with controls of reflection type. Our contributions are threefold. We first explicitly determine the long-run average costs as a domain-dependent functional, showing that the control problem can be equivalently characterized as a shape optimization problem. For given diffusion dynamics, assuming the optimal domain to be strongly star-shaped, we then propose a gradient descent algorithm based on polytope approximations to numerically determine a cost-minimizing domain. Finally, we investigate data-driven solutions when the diffusion dynamics are unknown to the controller. Using techniques from nonparametric statistics for stochastic processes, we construct an optimal domain estimator, whose static regret is bounded by the minimax optimal estimation rate of the unreflected process' invariant density. In the most challenging situation, when the dynamics must be learned simultaneously to controlling the process, we develop an episodic learning algorithm to overcome the emerging exploration-exploitation dilemma and show that given the static regret as a baseline, the loss in its sublinear regret per time unit is of natural order compared to the one-dimensional case.
Submitted 11 November, 2023;
originally announced November 2023.
-
On the time consistent solution to optimal stopping problems with expectation constraint
Authors:
Sören Christensen,
Maike Klein,
Boy Schultz
Abstract:
We study the (weak) equilibrium problem arising from the problem of optimally stopping a one-dimensional diffusion subject to an expectation constraint on the time until stopping. The weak equilibrium problem is realized with a set of randomized but purely state dependent stopping times as admissible strategies. We derive a verification theorem and necessary conditions for equilibria, which together basically characterize all equilibria. Furthermore, additional structural properties of equilibria are obtained to feed a possible guess-and-verify approach, which is then illustrated by an example.
Submitted 13 June, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
Is Learning in Biological Neural Networks based on Stochastic Gradient Descent? An analysis using stochastic processes
Authors:
Sören Christensen,
Jan Kallsen
Abstract:
In recent years, there has been an intense debate about how learning in biological neural networks (BNNs) differs from learning in artificial neural networks. It is often argued that the updating of connections in the brain relies only on local information, and therefore a stochastic gradient-descent type optimization method cannot be used. In this paper, we study a stochastic model for supervised learning in BNNs. We show that a (continuous) gradient step occurs approximately when each learning opportunity is processed by many local updates. This result suggests that stochastic gradient descent may indeed play a role in optimizing BNNs.
Submitted 10 April, 2024; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Oxide layer formation prevents deteriorating ion migration in thermoelectric Cu$_2$Se during operation in air
Authors:
Rasmus S. Christensen,
Peter S. Thorup,
Lasse R. Jørgensen,
Martin Roelsgaard,
Karl F. F. Fischer,
Ann-Christin Dippel,
Bo Brummerstedt Iversen
Abstract:
Cu$_2$Se is a mixed ionic-electronic conductor with outstanding thermoelectric performance originally envisioned for space missions. Applications were discontinued due to material instability, where elemental Cu grows at the electrode interfaces during operation in vacuum. Here, we show that when Cu$_2$Se is operating in air, formation of an oxide surface layer suppresses Cu$^+$ migration along the current direction. In operando X-ray scattering and electrical resistivity measurements quantify Cu$^+$ migration through refinement of atomic occupancies and phase composition analysis. Cu deposition can be prevented during operation in air, irrespective of a critical voltage, if the thermal gradient is applied along the current direction. Maximum entropy electron density analysis provides experimental evidence that Cu$^+$ migration pathways under thermal and electrical gradients differ substantially from equilibrium diffusion. The study establishes new promise for inexpensive sustainable Cu$_2$Se in thermoelectric applications, and it underscores the importance of atomistic insight into materials during thermoelectric operating conditions.
Submitted 7 August, 2023;
originally announced August 2023.
-
Markovian randomized equilibria for general Markovian Dynkin games in discrete time
Authors:
Sören Christensen,
Kristoffer Lindensjö,
Berenice Anne Neumann
Abstract:
We study a general formulation of the classical two-player Dynkin game in a Markovian discrete time setting. We show that an appropriate class of mixed, i.e., randomized, strategies in this context are \textit{Markovian randomized stopping times}, which correspond to stopping at any given state with a state-dependent probability. One main result is an explicit characterization of Wald-Bellman type for Nash equilibria based on this notion of randomization. In particular, this provides a novel characterization for randomized equilibria for the zero-sum game, which we use, e.g., to establish a new condition for the existence and construction of pure equilibria, to obtain necessary and sufficient conditions for the non-existence of pure strategy equilibria, and to construct an explicit example with a unique mixed, but no pure equilibrium. We also provide existence and characterization results for the symmetric specification of our game. Finally, we establish existence of a characterizable equilibrium in Markovian randomized stopping times for the general game formulation under the assumption that the state space is countable.
Submitted 25 July, 2023;
originally announced July 2023.
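The abstract above describes Markovian randomized stopping times as stopping at any given state with a state-dependent probability. A minimal sketch of sampling such a stopping time for a generic discrete-time Markov chain follows. The function name, the `step` and `stop_prob` callables, the `max_steps` cap, and the convention that a stopping decision is already taken at time 0 are all illustrative assumptions and are not taken from the paper.

```python
import random


def sample_randomized_stopping_time(x0, step, stop_prob, rng, max_steps=10_000):
    """Sample (stopping time, stopping state) for a state-dependent stopping rule.

    x0        -- initial state of the chain
    step      -- hypothetical transition sampler: step(x, rng) -> next state
    stop_prob -- hypothetical map from a state to a stopping probability in [0, 1]
    rng       -- random.Random instance used for all randomness
    Returns (None, last_state) if no stop occurs within max_steps transitions.
    """
    x = x0
    for n in range(max_steps + 1):
        # Independent coin flip whose bias depends only on the current state.
        if rng.random() < stop_prob(x):
            return n, x
        x = step(x, rng)
    return None, x


# Usage: simple symmetric random walk, stop with probability 1/2 in every state.
rng = random.Random(0)
walk = lambda x, r: x + r.choice((-1, 1))
n, x = sample_randomized_stopping_time(0, walk, lambda state: 0.5, rng)
```

With a constant stopping probability p the sampled time is geometric with mean (1 - p)/p, which gives a quick sanity check. A pure (non-randomized) first-entry time is the special case where `stop_prob` only takes the values 0 and 1.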
-
Two sided ergodic singular control and mean field game for diffusions
Authors:
Sören Christensen,
Ernesto Mordecki,
Facundo Oliú Eguren
Abstract:
In a probabilistic mean-field game driven by a linear diffusion an individual player aims to minimize an ergodic long-run cost by controlling the diffusion through a pair of -- increasing and decreasing -- càdlàg processes, while he is interacting with an aggregate of players through the expectation of a similar diffusion controlled by another pair of càdlàg processes. In order to find equilibrium points in this game, we first consider the control problem, in which the individual player has no interaction with the aggregate of players. In this case, we prove that the best policy is to reflect the diffusion process within two thresholds. Based on these results, we obtain criteria for the existence of equilibrium points in the mean-field game in the case when the controls of the aggregate of players are of reflection type, and give a pair of nonlinear equations to find these equilibrium points. In addition, we present an approximation result for Nash equilibria of ergodic games with finitely many players to the mean-field game equilibria considered above when the number of players tends to infinity. These results are illustrated by several examples where the existence and uniqueness of the equilibrium points depend on the coefficients of the underlying diffusion.
Submitted 11 June, 2024; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Playing the system: address manipulation and access to schools
Authors:
Andreas Bjerre-Nielsen,
Lykke Sterll Christensen,
Mikkel Høst Gandil,
Hans Henrik Sievertsen
Abstract:
Strategic incentives may lead to inefficient and unequal provision of public services. A prominent example is school admissions. Existing research shows that applicants "play the system" by submitting school rankings strategically. We investigate whether applicants also play the system by manipulating their eligibility at schools. We analyze this applicant deception in a theoretical model and provide testable predictions for commonly-used admission procedures. We confirm these model predictions empirically by analyzing the implementation of two reforms. First, we find that the introduction of a residence-based school-admission criterion in Denmark caused address changes to increase by more than 100% before the high-school application deadline. This increase occurred only in areas where the incentive to manipulate is high-powered. Second, to assess whether this behavior reflects actual address changes, we study a second reform that required applicants to provide additional proof of place of residence to approve an address change. The second reform significantly reduced address changes around the school application deadline, suggesting that the observed increase in address changes mainly reflects manipulation. The manipulation is driven by applicants from more affluent households and their behavior affects non-manipulating applicants. Counter-factual simulations show that among students not enrolling in their first listed school, more than 25% would have been offered a place in the absence of address manipulation and their peer GPA is 0.2SD lower due to the manipulative behavior of other applicants. Our findings show that popular school choice systems give applicants the incentive to play the system with real implications for non-strategic applicants.
Submitted 30 May, 2023;
originally announced May 2023.
-
CUQIpy: II. Computational uncertainty quantification for PDE-based inverse problems in Python
Authors:
Amal M A Alghamdi,
Nicolai A B Riis,
Babak M Afkham,
Felipe Uribe,
Silja L Christensen,
Per Christian Hansen,
Jakob S Jørgensen
Abstract:
Inverse problems, particularly those governed by Partial Differential Equations (PDEs), are prevalent in various scientific and engineering applications, and uncertainty quantification (UQ) of solutions to these problems is essential for informed decision-making. This second part of a two-paper series builds upon the foundation set by the first part, which introduced CUQIpy, a Python software package for computational UQ in inverse problems using a Bayesian framework. In this paper, we extend CUQIpy's capabilities to solve PDE-based Bayesian inverse problems through a general framework that allows the integration of PDEs in CUQIpy, whether expressed natively or using third-party libraries such as FEniCS. CUQIpy offers concise syntax that closely matches mathematical expressions, streamlining the modeling process and enhancing the user experience. The versatility and applicability of CUQIpy to PDE-based Bayesian inverse problems are demonstrated on examples covering parabolic, elliptic and hyperbolic PDEs. This includes problems involving the heat and Poisson equations and application case studies in electrical impedance tomography and photo-acoustic tomography, showcasing the software's efficiency, consistency, and intuitive interface. This comprehensive approach to UQ in PDE-based inverse problems provides accessibility for non-experts and advanced features for experts.
Submitted 21 March, 2024; v1 submitted 26 May, 2023;
originally announced May 2023.
-
CUQIpy: I. Computational uncertainty quantification for inverse problems in Python
Authors:
Nicolai A B Riis,
Amal M A Alghamdi,
Felipe Uribe,
Silja L Christensen,
Babak M Afkham,
Per Christian Hansen,
Jakob S Jørgensen
Abstract:
This paper introduces CUQIpy, a versatile open-source Python package for computational uncertainty quantification (UQ) in inverse problems, presented as Part I of a two-part series. CUQIpy employs a Bayesian framework, integrating prior knowledge with observed data to produce posterior probability distributions that characterize the uncertainty in computed solutions to inverse problems. The package offers a high-level modeling framework with concise syntax, allowing users to easily specify their inverse problems, prior information, and statistical assumptions. CUQIpy supports a range of efficient sampling strategies and is designed to handle large-scale problems. Notably, the automatic sampler selection feature analyzes the problem structure and chooses a suitable sampler without user intervention, streamlining the process. With a selection of probability distributions, test problems, computational methods, and visualization tools, CUQIpy serves as a powerful, flexible, and adaptable tool for UQ in a wide selection of inverse problems. Part II of the series focuses on the use of CUQIpy for UQ in inverse problems with partial differential equations (PDEs).
Submitted 21 March, 2024; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Inertial Migration in Micro-Centrifuge Devices
Authors:
Samuel Christensen,
Marcus Roper
Abstract:
Within microcentrifuge devices, a microfluidic vortex separates larger particles from a heterogeneous suspension using inertial migration, a phenomenon that causes particles to migrate across streamlines. The ability to selectively capture particles based on size differences of a few microns makes microcentrifuges useful diagnostic tools for trapping rare cells within blood samples. However, rational design of microcentrifuges has been held back from its full potential by a lack of quantitative modeling of particle capture mechanics. Here we use an asymptotic method, in which particles are accurately modeled as singularities in a linearized flow field, to rapidly calculate particle trajectories within microcentrifuges. Our predictions for trapping thresholds and trajectories agree well with published experimental data. Our results clarify how capture reflects a balance between advection of particles within a background flow and their inertial focusing and show why the close proximity of trapped and untrapped incoming streamlines makes it challenging to design microcentrifuges with sharp trapping thresholds.
Submitted 12 March, 2023;
originally announced March 2023.
-
Uniqueness of First Passage Time Distributions via Fredholm Integral Equations
Authors:
Sören Christensen,
Simon Fischer,
Oskar Hallmann
Abstract:
Let $W$ be a standard Brownian motion with $W_0 = 0$ and let $b: \mathbb{R}_+ \to \mathbb{R}$ be a continuous function with $b(0) > 0$. The first passage time (from below) is then defined as \begin{align*} τ:= \inf \{ t \geq 0 \vert W_t \geq b(t) \}. \end{align*} It is well-known that the distribution $F$ of $τ$ satisfies a set of Fredholm equations of the first kind, which is used, for example, as a starting point for numerical approaches. For this, it is fundamental that the Fredholm equations have a unique solution. In this article, we prove this in a general setting using analytical methods.
Submitted 9 March, 2023;
originally announced March 2023.
-
Fast universal control of a flux qubit via exponentially tunable wave-function overlap
Authors:
Svend Krøjer,
Anders Enevold Dahl,
Kasper Sangild Christensen,
Morten Kjaergaard,
Karsten Flensberg
Abstract:
Fast, high fidelity control and readout of protected superconducting qubits are fundamentally challenging due to their inherent insensitivity. We propose a flux qubit variation which enjoys a tunable level of protection against relaxation to resolve this outstanding issue. Our qubit design, the double-shunted flux qubit (DSFQ), realizes a generic double-well potential through its three junction ring geometry. One of the junctions is tunable, making it possible to control the barrier height and thus the level of protection. We analyze single- and two-qubit gate operations that rely on lowering the barrier. We show that this is a viable method that results in high fidelity gates as the non-computational states are not occupied during operations. Further, we show how the effective coupling to a readout resonator can be controlled by adjusting the externally applied flux while the DSFQ is protected from decaying into the readout resonator. Finally, we also study a double-loop gradiometric version of the DSFQ which is exponentially insensitive to variations in the global magnetic field, even when the loop areas are non-identical.
Submitted 28 November, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Scheme for parity-controlled multi-qubit gates with superconducting qubits
Authors:
Kasper Sangild Christensen,
Nikolaj Thomas Zinner,
Morten Kjaergaard
Abstract:
Multi-qubit parity measurements are at the core of many quantum error correction schemes. Extracting multi-qubit parity information typically involves using a sequence of multiple two-qubit gates. In this paper, we propose a superconducting circuit device with native support for multi-qubit parity-controlled gates (PCG). These are gates that perform rotations on a parity ancilla based on the multi-qubit parity operator of adjacent qubits, and can be directly used to perform multi-qubit parity measurements. The circuit consists of a set of concatenated Josephson ring modulators and effectively realizes a set of transmon-like qubits with strong longitudinal nearest-neighbor couplings. PCGs are implemented by applying microwave drives to the parity ancilla at specific frequencies. We investigate the scheme's performance with numerical simulation using realistic parameter choices and decoherence rates, and find that the device can perform four-qubit PCGs in 30 ns with process fidelity surpassing 99%. Furthermore, we study the effects of parameter disorder and spurious coupling between next-nearest neighboring qubits. Our results indicate that this approach to realizing PCGs constitutes an interesting candidate for near-term quantum error correction experiments.
Submitted 10 April, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Glass Hardness: Predicting Composition and Load Effects via Symbolic Reasoning-Informed Machine Learning
Authors:
Sajid Mannan,
Mohd Zaki,
Suresh Bishnoi,
Daniel R. Cassar,
Jeanini Jiusti,
Julio Cesar Ferreira Faria,
Johan F. S. Christensen,
Nitya Nand Gosvami,
Morten M. Smedskjaer,
Edgar Dutra Zanotto,
N. M. Anoop Krishnan
Abstract:
Glass hardness varies in a non-linear fashion with the chemical composition and applied load, a phenomenon known as the indentation size effect (ISE), which is challenging to predict quantitatively. Here, using a curated dataset of approximately 3000 inorganic glasses from the literature comprising the composition, indentation load, and hardness, we develop machine learning (ML) models to predict the composition and load dependence of Vickers hardness. Interestingly, when tested on new glass compositions unseen during training, the standard data-driven ML model failed to capture the ISE. To address this gap, we combined an empirical expression (Bernhardt law) to describe the ISE with ML to develop a framework that incorporates the symbolic law representing the domain reasoning in ML, namely Symbolic Reasoning-Informed ML Procedure (SRIMP). We show that the resulting SRIMP outperforms the data-driven ML model in predicting the ISE. Finally, we interpret the SRIMP model to understand the contribution of the glass network formers and modifiers toward composition and load-dependent (ISE) and load-independent hardness. The deconvolution of the hardness into load-dependent and load-independent terms paves the way toward a holistic understanding of composition and ISE in glasses, enabling the accelerated discovery of new glass compositions with targeted hardness.
Submitted 19 January, 2023;
originally announced January 2023.
-
Non-inferiority of Deep Learning Acute Ischemic Stroke Segmentation on Non-Contrast CT Compared to Expert Neuroradiologists
Authors:
Sophie Ostmeier,
Brian Axelrod,
Benjamin F. J. Verhaaren,
Soren Christensen,
Abdelkader Mahammedi,
Yongkai Liu,
Benjamin Pulli,
Li-Jia Li,
Greg Zaharchuk,
Jeremy J. Heit
Abstract:
We sought to determine whether a convolutional neural network (CNN) deep learning model can accurately segment acute ischemic changes on non-contrast CT compared to neuroradiologists. Non-contrast CT (NCCT) examinations from 232 acute ischemic stroke patients who were enrolled in the DEFUSE 3 trial were included in this study. Three experienced neuroradiologists independently segmented hypodensity that reflected the ischemic core on each scan. The neuroradiologist with the most experience (expert A) served as the ground truth for deep learning model training. Segmentations from two additional neuroradiologists (experts B and C) were used for testing. The 232 studies were randomly split into training and test sets. The training set was further randomly divided into 5 folds with training and validation sets. A 3-dimensional CNN architecture was trained and optimized to predict the segmentations of expert A from NCCT. The performance of the model was assessed using a set of volume, overlap, and distance metrics using non-inferiority thresholds of 20%, 3 ml, and 3 mm. The optimized model trained on expert A was compared to test experts B and C. We used a one-sided Wilcoxon signed-rank test to test for the non-inferiority of the model-expert agreement compared to the inter-expert agreement. The final model for the ischemic core segmentation task reached 0.46 ± 0.09 Surface Dice at a tolerance of 5 mm and 0.47 ± 0.13 Dice when trained on expert A. Compared to the two test neuroradiologists, the model-expert agreement was non-inferior to the inter-expert agreement, p < 0.05. The CNN accurately delineates the hypodense ischemic core on NCCT in acute ischemic stroke patients with an accuracy comparable to neuroradiologists.
Submitted 7 September, 2023; v1 submitted 24 November, 2022;
originally announced November 2022.
-
Design of an Intake and a Thruster for an Atmosphere-Breathing Electric Propulsion System
Authors:
F. Romano,
G. Herdrich,
Y. -A. Chan,
N. H. Crisp,
P. C. E. Roberts,
B. E. A. Holmes,
S. Edmondson,
S. Haigh,
A. Macario-Rojas,
V. T. A. Oiko,
L. A. Sinpetru,
K. Smith,
J. Becedas,
V. Sulliotti-Linner,
M. Bisgaard,
S. Christensen,
V. Hanessian,
T. Kauffman Jensen,
J. Nielsen,
S. Fasoulas,
C. Traub,
D. García-Almiñana,
S. Rodríguez-Donaire,
M. Sureda,
D. Kataria,
B. Belkouchi
, et al. (3 additional authors not shown)
Abstract:
Challenging space missions include those at very low altitudes, where the atmosphere is the source of aerodynamic drag on the spacecraft that ultimately defines the mission's lifetime unless a way to compensate for it is provided. This environment is named Very Low Earth Orbit (VLEO) and is defined for $h<450~km$. In addition to the satellite's aerodynamic design, an efficient propulsion system is required to extend the lifetime of such missions.
One solution is Atmosphere-Breathing Electric Propulsion (ABEP) that collects atmospheric particles to be used as propellant for an electric thruster. The system would minimize the requirement of limited propellant availability and can also be applied to any planetary body with atmosphere, enabling new missions at low altitude ranges for longer times. One of the objectives of the H2020 DISCOVERER project is the development of an intake and an electrode-less plasma thruster for an ABEP system.
The article describes the characteristics of intake design and the respective final designs, which provide collection efficiencies up to $94\%$. The radio frequency (RF) Helicon-based plasma thruster (IPT) developed at IRS is also presented. While its performance is still being evaluated, the thruster has been operated with single atmospheric species as propellant and has shown a very low input power requirement, $P\sim 60~W$, for operation at comparable mass flow rates.
Submitted 23 November, 2022; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Development and analysis of novel mission scenarios based on Atmosphere-Breathing Electric Propulsion (ABEP)
Authors:
S. Vaidya,
C. Traub,
F. Romano,
G. Herdrich,
Y. -A. Chan,
S. Fasoulas,
P. C. E. Roberts,
N. Crisp,
S. Edmondson,
S. Haigh,
B. A. Holmes,
A. Macario-Rojas,
V. T. Abrao Oiko,
K. Smith,
L. Sinpetru,
J. Becedas,
V. Sulliotti-Linner,
S. Christensen,
V. Hanessian,
T. K. Jensen,
J. Nielsen,
M. Bisgaard,
D. Garcia-Alminana,
S. Rodriguez-Donaire,
M. Sureda
, et al. (6 additional authors not shown)
Abstract:
Operating satellites in Very Low Earth Orbit (VLEO) benefits the already expanding New Space industry in applications including Earth Observation and beyond. However, long-term operations at such low altitudes require propulsion systems to compensate for the large aerodynamic drag forces. When using conventional propulsion systems, the amount of storable propellant limits the maximum mission lifetime. This limitation can be avoided by employing an Atmosphere-Breathing Electric Propulsion (ABEP) system, which collects the residual atmospheric particles and uses them as propellant for an electric thruster. Thus, the requirement of on-board propellant storage can ideally be nullified. At the Institute of Space Systems (IRS) of the University of Stuttgart, an intake and an RF Helicon-based Plasma Thruster (IPT) for an ABEP system are being developed within the Horizon 2020-funded DISCOVERER project. In order to assess possible future use cases, this paper proposes and analyzes several novel ABEP-based mission scenarios. Beginning with a technology demonstration mission in VLEO, more complex mission scenarios are derived and discussed in detail. These include, amongst others, orbit maintenance around Mars as well as refuelling and space tug missions. The results show that the ABEP system is not only able to compensate drag for orbit maintenance but also capable of performing orbital maneuvers and collecting propellant for applications such as Space Tug and Refuelling. This demonstrates a multitude of different future mission applications.
Submitted 21 November, 2022; v1 submitted 17 November, 2022;
originally announced November 2022.
-
USE-Evaluator: Performance Metrics for Medical Image Segmentation Models with Uncertain, Small or Empty Reference Annotations
Authors:
Sophie Ostmeier,
Brian Axelrod,
Jeroen Bertels,
Fabian Isensee,
Maarten G. Lansberg,
Soren Christensen,
Gregory W. Albers,
Li-Jia Li,
Jeremy J. Heit
Abstract:
Performance metrics for medical image segmentation models are used to measure the agreement between the reference annotation and the predicted segmentation. Usually, overlap metrics such as the Dice score are used to evaluate the performance of these models so that results are comparable. However, there is a mismatch between the distributions of cases and difficulty level of segmentation tasks in public data sets compared to clinical practice. Common metrics fail to measure the impact of this mismatch, especially for clinical data sets that include low signal pathologies, a difficult segmentation task, and uncertain, small, or empty reference annotations. This limitation may result in ineffective research by machine learning practitioners in designing and optimizing models. Dimensions of evaluating clinical value include consideration of the uncertainty of reference annotations, independence from reference annotation volume size, and evaluation of classification of empty reference annotations. We study how uncertain, small, and empty reference annotations influence the value of metrics for medical image segmentation on an in-house data set regardless of the model. We examine metric behavior on the predictions of a standard deep learning framework in order to identify metrics with clinical value. We compare to a public benchmark data set (BraTS 2019) with a high-signal pathology, certain and larger reference annotations, and no empty reference annotations. We show machine learning practitioners how uncertain, small, or empty reference annotations require a rethinking of the evaluation and optimization procedures. The evaluation code was released to encourage further analysis of this topic: https://github.com/SophieOstmeier/UncertainSmallEmpty.git
Submitted 7 September, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Local time pushed mixed stopping and smooth fit for time-inconsistent stopping problems
Authors:
Andi Bodnariu,
Sören Christensen,
Kristoffer Lindensjö
Abstract:
We consider the game-theoretic approach to time-inconsistent stopping of a one-dimensional diffusion where the time-inconsistency is due to the presence of a non-exponential (weighted) discount function. In particular, we study (weak) equilibria for this problem in a novel class of mixed (i.e., randomized) stopping times based on a local time construction of the stopping intensity. For a general formulation of the problem we provide a verification theorem giving sufficient conditions for mixed (and pure) equilibria in terms of a set of variational inequalities, including a smooth fit condition. We apply the theory to prove the existence of (mixed) equilibria in a recently studied real options problem in which no pure equilibria exist.
Submitted 30 June, 2022;
originally announced June 2022.
-
Uncertainties and Design of Active Aerodynamic Attitude Control in Very Low Earth Orbit
Authors:
Sabrina Livadiotti,
Nicholas H. Crisp,
Peter C. E. Roberts,
Vitor T. A. Oiko,
Simon Christensen,
Rosa Maria Dominguez,
Georg H. Herdrich
Abstract:
This paper discusses the design and the performance achievable with active aerodynamic attitude control in very low Earth orbit, i.e. below 450 km in altitude. A novel real-time algorithm is proposed for selecting the angles of deflection of aerodynamic actuators providing the closest match to the control signal computed by a selected control law. The algorithm is based on a panel method for the computation of the aerodynamic coefficients and relies on approximate environmental parameters estimation and worst-case scenario assumptions for the re-emission properties of space materials. Discussion of results is performed by assuming two representative pointing manoeuvres, for which momentum wheels and aerodynamic actuators are used synergistically. A quaternion feedback PID controller implemented in discrete time is assumed to determine the control signal at a sampling frequency of 1 Hz. The outcome of a Monte Carlo analysis, performed for a wide range of orbital conditions, shows that the target attitude is successfully achieved for the vast majority of the cases, thus proving the robustness of the approach in the presence of environmental uncertainties and realistic attitude hardware limitations.
Submitted 15 March, 2022;
originally announced March 2022.
-
Structural Gaussian Priors for Bayesian CT reconstruction of Subsea Pipes
Authors:
Silja L. Christensen,
Nicolai A. B. Riis,
Felipe Uribe,
Jakob S. Jørgensen
Abstract:
A non-destructive testing (NDT) application of X-ray computed tomography (CT) is inspection of subsea pipes in operation via 2D cross-sectional scans. Data acquisition is time-consuming and costly due to the challenging subsea environment. Reducing the number of projections in a scan can yield time and cost savings, but compromises the reconstruction quality, if conventional reconstruction methods are used. In this work we take a Bayesian approach to CT reconstruction and focus on designing an effective prior to make use of available structural information about the pipe geometry. We propose a new class of structural Gaussian priors to enforce expected material properties in different regions of the reconstructed image based on independent Gaussian priors in combination with global regularity through a Gaussian Markov Random Field (GMRF) prior. Numerical experiments with synthetic and real data show that the proposed structural Gaussian prior can reduce artifacts and enhance contrast in the reconstruction compared to using only a global GMRF prior or no prior at all. We show how the resulting posterior distribution can be efficiently sampled even for large-scale images, which is essential for practical NDT applications.
Submitted 21 September, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Flexible forward improvement iteration for infinite time horizon Markovian optimal stopping problems
Authors:
Sören Christensen,
Albrecht Irle,
Julian Peter Lemburg
Abstract:
In this paper, we propose an extension of the forward improvement iteration algorithm, originally introduced in Irle (2006) and recently reconsidered in Miclo and Villeneuve (2021). The main new ingredient is a flexible window parameter describing the look-ahead distance in the improvement step. We consider the framework of a Markovian optimal stopping problem in discrete time with random discounting and infinite time horizon. We prove convergence and show that the additional flexibility may significantly reduce the runtime.
Submitted 26 November, 2021;
originally announced November 2021.
-
Fast Asymptotic-Numerical Method For Coarse Mesh Particle Simulation In Channels Of Arbitrary Cross Section
Authors:
Samuel Christensen,
Raymond Chu,
Christopher R Anderson,
Marcus Roper
Abstract:
Particles traveling through inertial microfluidic devices migrate to focusing streamlines. We present a numerical method that calculates migration velocities of particles in inertial microfluidic channels of arbitrary cross section by representing particles by singularities. Refinements to asymptotic analysis are given that improve the regularity of the singularity approximation, making finite element approximations of flow and pressure fields more effective. Sample results demonstrate that the method is computationally efficient and able to capture bifurcations in particle focusing positions due to changes in channel shape and Reynolds number.
Submitted 26 October, 2021;
originally announced October 2021.
-
A new integral equation for Brownian stopping problems with finite time horizon
Authors:
Sören Christensen,
Simon Fischer
Abstract:
For classical finite time horizon stopping problems driven by a Brownian motion
\[V(t,x) = \sup_{t\leq\tau\leq 0} E_{(t,x)}[g(\tau,W_\tau)],\] we derive a new class of Fredholm type integral equations for the stopping set. For large problem classes of interest, we show by analytical arguments that the equation uniquely characterizes the stopping boundary of the problem. Regardless of the uniqueness, we use the representation to rigorously find the limit behavior of the stopping boundary close to the terminal time. Interestingly, it turns out that the leading-order coefficient is universal for wide classes of problems. We also discuss how the representation can be used for numerical purposes.
Submitted 9 March, 2023; v1 submitted 6 October, 2021;
originally announced October 2021.
-
OrbNet Denali: A machine learning potential for biological and organic chemistry with semi-empirical cost and DFT accuracy
Authors:
Anders S. Christensen,
Sai Krishna Sirumalla,
Zhuoran Qiao,
Michael B. O'Connor,
Daniel G. A. Smith,
Feizhi Ding,
Peter J. Bygrave,
Animashree Anandkumar,
Matthew Welborn,
Frederick R. Manby,
Thomas F. Miller III
Abstract:
We present OrbNet Denali, a machine learning model for electronic structure that is designed as a drop-in replacement for ground-state density functional theory (DFT) energy calculations. The model is a message-passing neural network that uses symmetry-adapted atomic orbital features from a low-cost quantum calculation to predict the energy of a molecule. OrbNet Denali is trained on a vast dataset of 2.3 million DFT calculations on molecules and geometries. This dataset covers the most common elements in bio- and organic chemistry (H, Li, B, C, N, O, F, Na, Mg, Si, P, S, Cl, K, Ca, Br, I) as well as charged molecules. OrbNet Denali is demonstrated on several well-established benchmark datasets, and we find that it provides accuracy that is on par with modern DFT methods while offering a speedup of up to three orders of magnitude. For the GMTKN55 benchmark set, OrbNet Denali achieves WTMAD-1 and WTMAD-2 scores of 7.19 and 9.84, on par with modern DFT functionals. For several GMTKN55 subsets, which contain chemical problems that are not present in the training set, OrbNet Denali produces a mean absolute error comparable to those of DFT methods. For the Hutchison conformers benchmark set, OrbNet Denali has a median correlation coefficient of R^2=0.90 compared to the reference DLPNO-CCSD(T) calculation, and R^2=0.97 compared to the method used to generate the training data (wB97X-D3/def2-TZVP), exceeding the performance of any other method with a similar cost. Similarly, the model reaches chemical accuracy for non-covalent interactions in the S66x10 dataset. For torsional profiles, OrbNet Denali reproduces the torsion profiles of wB97X-D3/def2-TZVP with an average MAE of 0.12 kcal/mol for the potential energy surfaces of the diverse fragments in the TorsionNet500 dataset.
Submitted 2 July, 2021; v1 submitted 1 July, 2021;
originally announced July 2021.
-
Intake Design for an Atmosphere-Breathing Electric Propulsion System (ABEP)
Authors:
F. Romano,
J. Espinosa-Orozco,
M. Pfeiffer,
G. Herdrich,
N. H. Crisp,
P. C. E. Roberts,
B. E. A. Holmes,
S. Edmondson,
S. Haigh,
S. Livadiotti,
A. Macario-Rojas,
V. T. A. Oiko,
L. A. Sinpetru,
K. Smith,
J. Becedas,
V. Sulliotti-Linner,
M. Bisgaard,
S. Christensen,
V. Hanessian,
T. Kauffman Jensen,
J. Nielsen,
Y. -A. Chan,
S. Fasoulas,
C. Traub,
D. García-Almiñana
, et al. (7 additional authors not shown)
Abstract:
Challenging space missions include those at very low altitudes, where the atmosphere is a source of aerodynamic drag on the spacecraft. To extend the lifetime of such missions, an efficient propulsion system is required. One solution is Atmosphere-Breathing Electric Propulsion (ABEP) that collects atmospheric particles to be used as propellant for an electric thruster. The system would minimize the requirement of limited propellant availability and can also be applied to any planetary body with atmosphere, enabling new missions at low altitude ranges for longer times. IRS is developing, within the H2020 DISCOVERER project, an intake and a thruster for an ABEP system. The article describes the design and simulation of the intake, optimized to feed the radio frequency (RF) Helicon-based plasma thruster developed at IRS. The article deals in particular with the design of intakes based on diffuse and specular reflecting materials, which are analysed by the PICLas DSMC-PIC tool. Orbital altitudes $h=150-250$ km and the respective species based on the NRLMSISE-00 model (O, $N_2$, $O_2$, He, Ar, H, N) are investigated for several concepts based on fully diffuse and specular scattering, including hybrid designs. The major focus has been on the intake efficiency defined as $\eta_c=\dot{N}_{out}/\dot{N}_{in}$, with $\dot{N}_{in}$ the incoming particle flux, and $\dot{N}_{out}$ the one collected by the intake. Two concepts providing the best expected performance for operation with the selected thruster are then selected and presented. The first one is based on fully diffuse accommodation, yielding $\eta_c<0.46$, and the second one is based on fully specular accommodation, yielding $\eta_c<0.94$. Finally, the influence of misalignment with the flow is also analysed, highlighting a strong dependence of $\eta_c$ in the diffuse-based intake while, ...
Submitted 1 July, 2021; v1 submitted 30 June, 2021;
originally announced June 2021.
-
Informing Geometric Deep Learning with Electronic Interactions to Accelerate Quantum Chemistry
Authors:
Zhuoran Qiao,
Anders S. Christensen,
Matthew Welborn,
Frederick R. Manby,
Anima Anandkumar,
Thomas F. Miller III
Abstract:
Predicting electronic energies, densities, and related chemical properties can facilitate the discovery of novel catalysts, medicines, and battery materials. By developing a physics-inspired equivariant neural network, we introduce a method to learn molecular representations based on the electronic interactions among atomic orbitals. Our method, OrbNet-Equi, leverages efficient tight-binding simulations and learned mappings to recover high fidelity quantum chemical properties. OrbNet-Equi models a wide spectrum of target properties with an accuracy consistently better than standard machine learning methods and a speed orders of magnitude greater than density functional theory. Despite only using training samples collected from readily available small-molecule libraries, OrbNet-Equi outperforms traditional methods on comprehensive downstream benchmarks that encompass diverse main-group chemical processes. Our method also describes interactions in challenging charge-transfer complexes and open-shell systems. We anticipate that the strategy presented here will help to expand opportunities for studies in chemistry and materials science, where the acquisition of experimental or reference training data is costly.
Submitted 1 April, 2022; v1 submitted 30 May, 2021;
originally announced May 2021.
-
Learning to reflect: A unifying approach for data-driven stochastic control strategies
Authors:
Sören Christensen,
Claudia Strauch,
Lukas Trottner
Abstract:
Stochastic optimal control problems have a long tradition in applied probability, with the questions addressed being of high relevance in a multitude of fields. Even though theoretical solutions are well understood in many scenarios, their practicability suffers from the assumption of known dynamics of the underlying stochastic process, raising the statistical challenge of developing purely data-driven strategies. For the mathematically separated classes of continuous diffusion processes and Lévy processes, we show that developing efficient strategies for related singular stochastic control problems can essentially be reduced to finding rate-optimal estimators with respect to the sup-norm risk of objects associated to the invariant distribution of ergodic processes which determine the theoretical solution of the control problem. From a statistical perspective, we exploit the exponential $β$-mixing property as the common factor of both scenarios to drive the convergence analysis, indicating that relying on general stability properties of Markov processes is a sufficiently powerful and flexible approach to treat complex applications requiring statistical methods. We show moreover that in the Lévy case $-$ even though per se jump processes are more difficult to handle both in statistics and control theory $-$ a fully data-driven strategy with regret of significantly better order than in the diffusion case can be constructed.
Submitted 23 April, 2021;
originally announced April 2021.
-
The superconducting circuit companion -- an introduction with worked examples
Authors:
S. E. Rasmussen,
K. S. Christensen,
S. P. Pedersen,
L. B. Kristensen,
T. Bækkegaard,
N. J. S. Loft,
N. T. Zinner
Abstract:
This tutorial aims at giving an introductory treatment of the circuit analysis of superconducting qubits, i.e., two-level systems in superconducting circuits. It also touches upon couplings between such qubits and how microwave driving and these couplings can be used for single- and two-qubit gates, as well as how to include noise when calculating the dynamics of the system. We also discuss higher-dimensional superconducting qudits. The tutorial is intended for new researchers with limited or no experience with the field but should be accessible to anyone with a bachelor's degree in physics. The tutorial introduces the basic methods used in quantum circuit analysis, starting from a circuit diagram and ending with a quantized Hamiltonian, that may be truncated to the lowest levels. We provide examples of all the basic techniques throughout the discussion, while in the last part of the tutorial we discuss several of the most commonly used circuits for quantum-information applications. This includes both worked examples of single qubits and examples of how to analyze the coupling methods that allow multiqubit operations. In several detailed appendices, we provide the interested reader with an introduction to more advanced techniques for handling larger circuit designs.
Submitted 10 November, 2023; v1 submitted 1 March, 2021;
originally announced March 2021.
-
Improved Segmentation and Detection Sensitivity of Diffusion-Weighted Brain Infarct Lesions with Synthetically Enhanced Deep Learning
Authors:
Christian Federau,
Soren Christensen,
Nino Scherrer,
Johanna Ospel,
Victor Schulze-Zachau,
Noemi Schmidt,
Hanns-Christian Breit,
Julian Maclaren,
Maarten Lansberg,
Sebastian Kozerke
Abstract:
Purpose: To compare the segmentation and detection performance of a deep learning model trained on a database of human-labelled clinical diffusion-weighted (DW) stroke lesions to a model trained on the same database enhanced with synthetic DW stroke lesions. Methods: In this institutional review board approved study, a stroke database of 962 cases (mean age 65+/-17 years, 255 males, 449 scans with DW positive stroke lesions) and a normal database of 2,027 patients (mean age 38+/-24 years, 1,088 females) were obtained. Brain volumes with synthetic DW stroke lesions were produced by warping the relative signal increase of real strokes to normal brain volumes. A generic 3D U-Net was trained on four different databases to generate four different models: (a) 375 neuroradiologist-labeled clinical DW positive stroke cases (CDB); (b) 2,000 synthetic cases (S2DB); (c) CDB + 2,000 synthetic cases (CS2DB); or (d) CDB + 40,000 synthetic cases (CS40DB). The models were tested on 20% (n=192) of the cases of the stroke database, which were excluded from the training set. Segmentation accuracy was characterized using Dice score and lesion volume of the stroke segmentation, and statistical significance was tested using a paired, two-tailed, Student's t-test. Detection sensitivity and specificity were compared to three neuroradiologists. Results: The performance of the 3D U-Net model trained on the CS40DB (mean Dice 0.72) was better than models trained on the CS2DB (0.70, P<0.001) or the CDB (0.65, P<0.001). The deep learning model was also more sensitive (91% [89%-93%]) than each of the three human readers (84% [81%-87%], 78% [75%-81%], and 79% [76%-82%]), but less specific (75% [72%-78%]) than the three human readers (96% [94%-97%], 92% [90%-94%], and 89% [86%-91%]). Conclusion: Deep learning training for segmentation and detection of DW stroke lesions was significantly improved by enhancing the training set with synthetic lesions.
Submitted 29 December, 2020;
originally announced December 2020.
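The abstract above reports segmentation accuracy as a Dice score. As a reminder of what that number measures, here is a minimal sketch of the standard Dice coefficient, 2|A∩B| / (|A| + |B|), on flattened binary masks. The function name `dice_score`, the flat-list input, and the convention of returning 1.0 for two empty masks are assumptions made for illustration; nothing here is taken from the paper's own evaluation code.

```python
from typing import Sequence


def dice_score(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Dice coefficient between two flattened binary masks.

    Hypothetical helper for illustration: any truthy entry counts as
    "lesion", any falsy entry as "background".
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")

    # Voxels labelled as lesion in both masks.
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    # Total lesion voxels in each mask, added together.
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)

    # Assumed convention: two empty masks agree perfectly.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total
```

For example, a prediction that marks two voxels, one of which overlaps a one-voxel reference lesion, scores 2·1/(2+1) = 2/3.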
-
In-Orbit Aerodynamic Coefficient Measurements using SOAR (Satellite for Orbital Aerodynamics Research)
Authors:
N. H. Crisp,
P. C. E. Roberts,
S. Livadiotti,
A. Macario Rojas,
V. T. A. Oiko,
S. Edmondson,
S. J. Haigh,
B. E. A. Holmes,
L. A. Sinpetru,
K. L. Smith,
J. Becedas,
R. M. Dominguez,
V. Sulliotti-Linner,
S. Christensen,
J. Nielsen,
M. Bisgaard,
Y-A. Chan,
S. Fasoulas,
G. H. Herdrich,
F. Romano,
C. Traub,
D. Garcia-Alminana,
S. Rodriguez-Donaire,
M. Sureda,
D. Kataria
, et al. (4 additional authors not shown)
Abstract:
The Satellite for Orbital Aerodynamics Research (SOAR) is a CubeSat mission, due to be launched in 2021, to investigate the interaction between different materials and the atmospheric flow regime in very low Earth orbits (VLEO). Improving knowledge of the gas-surface interactions at these altitudes and identification of novel materials that can minimise drag or improve aerodynamic control are important for the design of future spacecraft that can operate in lower altitude orbits. Such satellites may be smaller and cheaper to develop or can provide improved Earth observation data or communications link-budgets and latency. Using precise orbit and attitude determination information and the measured atmospheric flow characteristics the forces and torques experienced by the satellite in orbit can be studied and estimates of the aerodynamic coefficients calculated. This paper presents the scientific concept and design of the SOAR mission. The methodology for recovery of the aerodynamic coefficients from the measured orbit, attitude, and in-situ atmospheric data using a least-squares orbit determination and free-parameter fitting process is described and the experimental uncertainty of the resolved aerodynamic coefficients is estimated. The presented results indicate that the combination of the satellite design and experimental methodology are capable of clearly illustrating the variation of drag and lift coefficient for differing surface incidence angle. The lowest uncertainties for the drag coefficient measurement are found at approximately 300 km, whilst the measurement of lift coefficient improves for reducing orbital altitude to 200 km.
Submitted 17 December, 2020; v1 submitted 14 December, 2020;
originally announced December 2020.
-
Competition versus Cooperation: A class of solvable mean field impulse control problems
Authors:
Sören Christensen,
Berenice Anne Neumann,
Tobias Sohr
Abstract:
We discuss a class of explicitly solvable mean field type control problems/mean field games with a clear economic interpretation. More precisely, we consider long term average impulse control problems with underlying general one-dimensional diffusion processes motivated by optimal harvesting problems in natural resource management. We extend the classical stochastic Faustmann models by allowing the prices to depend on the state of the market using a mean field structure. In a competitive market model, we prove that, under natural conditions, there exists an equilibrium strategy of threshold-type and furthermore characterize the threshold explicitly. If the agents cooperate with each other, we are faced with the mean field type control problem. Using a Lagrange-type argument, we prove that the optimizer of this non-standard impulse control problem is of threshold-type as well and characterize the optimal threshold. Furthermore, we compare the solutions and illustrate the findings in an example.
Submitted 27 April, 2021; v1 submitted 13 October, 2020;
originally announced October 2020.
-
A Review of Gas-Surface Interaction Models for Orbital Aerodynamics Applications
Authors:
Sabrina Livadiotti,
Nicholas H. Crisp,
Peter C. E. Roberts,
Stephen D. Worrall,
Vitor T. A. Oiko,
Steve Edmondson,
Sarah J. Haigh,
Claire Huyton,
Katharine L. Smith,
Luciana A. Sinpetru,
Brandon E. A. Holmes,
Jonathan Becedas,
Rosa María Domínguez,
Valentín Cañas,
Simon Christensen,
Anders Mølgaard,
Jens Nielsen,
Morten Bisgaard,
Yung-An Chan,
Georg H. Herdrich,
Francesco Romano,
Stefanos Fasoulas,
Constantin Traub,
Daniel Garcia-Almiñana,
Silvia Rodriguez-Donaire
, et al. (7 additional authors not shown)
Abstract:
Renewed interest in Very Low Earth Orbits (VLEO) - i.e. altitudes below 450 km - has led to an increased demand for accurate environment characterisation and aerodynamic force prediction. While the former requires knowledge of the mechanisms that drive density variations in the thermosphere, the latter also depends on the interactions between the gas-particles in the residual atmosphere and the surfaces exposed to the flow. The determination of the aerodynamic coefficients is hindered by the numerous uncertainties that characterise the physical processes occurring at the exposed surfaces. Several models have been produced over the last 60 years with the intent of combining accuracy with relatively simple implementations. In this paper the most popular models have been selected and reviewed using as discriminating factors relevance with regards to orbital aerodynamics applications and theoretical agreement with gas-beam experimental data. More sophisticated models were neglected, since their increased accuracy is generally accompanied by a substantial increase in computation times which is likely to be unsuitable for most space engineering applications. For the sake of clarity, a distinction was introduced between physical and scattering kernel theory based gas-surface interaction models. The physical model category comprises the Hard Cube model, the Soft Cube model and the Washboard model, while the scattering kernel family consists of the Maxwell model, the Nocilla-Hurlbut-Sherman model and the Cercignani-Lampis-Lord model. Limits and assets of each model have been discussed with regards to the context of this paper. Wherever possible, comments have been provided to help the reader to identify possible future challenges for gas-surface interaction science with regards to orbital aerodynamic applications.
Submitted 22 November, 2020; v1 submitted 1 October, 2020;
originally announced October 2020.
-
An assessment of the structural resolution of various fingerprints commonly used in machine learning
Authors:
Behnam Parsaeifard,
Deb Sankar De,
Anders S. Christensen,
Felix A. Faber,
Emir Kocer,
Sandip De,
Joerg Behler,
Anatole von Lilienfeld,
Stefan Goedecker
Abstract:
Atomic environment fingerprints are widely used in computational materials science, from machine learning potentials to the quantification of similarities between atomic configurations. Many approaches to the construction of such fingerprints, also called structural descriptors, have been proposed. In this work, we compare the performance of fingerprints based on the Overlap Matrix (OM), the Smooth Overlap of Atomic Positions (SOAP), Behler-Parrinello atom-centered symmetry functions (ACSF), modified Behler-Parrinello symmetry functions (MBSF) used in the ANI-1ccx potential and the Faber-Christensen-Huang-Lilienfeld (FCHL) fingerprint under various aspects. We study their ability to resolve differences in local environments and in particular examine whether there are certain atomic movements that leave the fingerprints exactly or nearly invariant. For this purpose, we introduce a sensitivity matrix whose eigenvalues quantify the effect of atomic displacement modes on the fingerprint. Further, we check whether these displacements correlate with the variation of localized physical quantities such as forces. Finally, we extend our examination to the correlation between molecular fingerprints obtained from the atomic fingerprints and global quantities of entire molecules.
Submitted 7 August, 2020;
originally announced August 2020.
-
On the role of gradients for machine learning of molecular energies and forces
Authors:
Anders S. Christensen,
O. Anatole von Lilienfeld
Abstract:
The accuracy of any machine learning potential can only be as good as the data used in the fitting process. The most efficient model therefore selects the training data that will yield the highest accuracy compared to the cost of obtaining the training data. We investigate the convergence of prediction errors of quantum machine learning models for organic molecules trained on energy and force labels, two common data types in molecular simulations. When training and predicting on different geometries corresponding to the same single molecule, we find that the inclusion of atomic forces in the training data increases the accuracy of the predicted energies and forces 7-fold, compared to models trained on energy only. Surprisingly, for models trained on sets of organic molecules of varying size and composition in non-equilibrium conformations, inclusion of forces in the training does not improve the predicted energies of unseen molecules in new conformations. Predicted forces, however, also improve about 7-fold. For the systems studied, we find that force labels and energy labels contribute equally per label to the convergence of the prediction errors. Choosing whether to include derivatives such as atomic forces in the training set should thus depend not only on the computational cost of acquiring the force labels for training, but also on the application domain, the property of interest, and the desirable size of the machine learning model. Based on our observations we describe key considerations for the creation of datasets for potential energy surfaces of molecules which maximize the efficiency of the resulting machine learning models.
Submitted 19 July, 2020;
originally announced July 2020.
-
ML Models of Vibrating H$_2$CO: Comparing Reproducing Kernels, FCHL and PhysNet
Authors:
Silvan Käser,
Debasish Koner,
Anders S. Christensen,
O. Anatole von Lilienfeld,
Markus Meuwly
Abstract:
Machine Learning (ML) has become a promising tool for improving the quality of atomistic simulations. Using formaldehyde as a benchmark system for intramolecular interactions, a comparative assessment of ML models based on state-of-the-art variants of deep neural networks (NN), reproducing kernel Hilbert space (RKHS+F), and kernel ridge regression (KRR) is presented. Learning curves for energies and atomic forces indicate rapid convergence towards excellent predictions for B3LYP, MP2, and CCSD(T)-F12 reference results for modestly sized (in the hundreds) training sets. Typically, learning curve off-sets decay as one goes from NN (PhysNet) to RKHS+F to KRR (FCHL). Conversely, the predictive power for extrapolation of energies towards new geometries increases in the same order with RKHS+F and FCHL performing almost equally. For harmonic vibrational frequencies, the picture is less clear, with PhysNet and FCHL yielding respectively flat learning at $\sim$ 1 and $\sim$ 0.2 cm$^{-1}$ no matter which reference method, while RKHS+F models level off for B3LYP, and exhibit continued improvements for MP2 and CCSD(T)-F12. Finite-temperature molecular dynamics (MD) simulations with the same initial conditions yield indistinguishable infrared spectra with good performance compared with experiment except for the high-frequency modes involving hydrogen stretch motion which is a known limitation of MD for vibrational spectroscopy. For sufficiently large training set sizes all three models can detect insufficient convergence ("noise") of the reference electronic structure calculations in that the learning curves level off. Transfer learning (TL) from B3LYP to CCSD(T)-F12 with PhysNet indicates that additional improvements in data efficiency can be achieved.
Submitted 30 June, 2020;
originally announced June 2020.
-
Are American options European after all?
Authors:
Sören Christensen,
Jan Kallsen,
Matthias Lenga
Abstract:
We call a given American option representable if there exists a European claim which dominates the American payoff at any time and such that the values of the two options coincide in the continuation region of the American option. This concept has interesting implications from a probabilistic, analytic, financial, and numeric point of view. Relying on methods from Jourdain and Martini (2001, 2002), Christensen (2014) and convex duality, we make a first step towards verifying representability of American options.
Submitted 13 February, 2020;
originally announced February 2020.
-
General Optimal Stopping with Linear Costs
Authors:
Sören Christensen,
Tobias Sohr
Abstract:
This article treats both discrete time and continuous time stopping problems for general Markov processes on the real line with general linear costs. Using an auxiliary function of maximum representation type, conditions are given to guarantee the optimal stopping time to be of threshold type. The optimal threshold is then characterized as the root of that function. For random walks, our results condense to the fact that all combinations of concave increasing pay-off functions and convex cost functions lead to a one-sided solution. For Lévy processes, an explicit way to obtain the auxiliary function and the threshold is given by use of the ladder height processes. Lastly, the connection between the discrete and continuous problems and a possible approximation of the latter via the former are discussed.
Submitted 26 January, 2020;
originally announced January 2020.
-
Neural networks and kernel ridge regression for excited states dynamics of CH$_2$NH$_2^+$: From single-state to multi-state representations and multi-property machine learning models
Authors:
Julia Westermayr,
Felix A. Faber,
Anders S. Christensen,
O. Anatole von Lilienfeld,
Philipp Marquetand
Abstract:
Excited-state dynamics simulations are a powerful tool to investigate photo-induced reactions of molecules and materials and provide complementary information to experiments. Since the applicability of these simulation techniques is limited by the costs of the underlying electronic structure calculations, we develop and assess different machine learning models for this task. The machine learning models are trained on ab initio calculations for excited electronic states, using the methylenimmonium cation (CH$_2$NH$_2^+$) as a model system. For the prediction of excited-state properties, multiple outputs are desirable, which is straightforward with neural networks but less explored with kernel ridge regression. We overcome this challenge for kernel ridge regression in the case of energy predictions by encoding the electronic states explicitly in the inputs, in addition to the molecular representation. We adopt this strategy also for our neural networks for comparison. Such a state encoding enables not only kernel ridge regression with multiple outputs but leads also to more accurate machine learning models for state-specific properties. An important goal for excited-state machine learning models is their use in dynamics simulations, which needs not only state-specific information but also couplings, i.e., properties involving pairs of states. Accordingly, we investigate the performance of different models for such coupling elements. Furthermore, we explore how combining all properties in a single neural network affects the accuracy. As an ultimate test for our machine learning models, we carry out excited-state dynamics simulations based on the predicted energies, forces and couplings and, thus, show the scopes and possibilities of machine learning for the treatment of electronically excited states.
Submitted 18 December, 2019;
originally announced December 2019.
-
Note on the (non-)smoothness of discrete time value functions
Authors:
Simon Fischer,
Sören Christensen
Abstract:
We consider the discrete time stopping problem \[ V(t,x) = \sup_τ E_{(t,x)}[g(τ, X_τ)],\] where $X$ is a random walk. It is well known that the value function $V$ is in general not smooth on the boundary of the continuation set $\partial C$. We show that under some conditions $V$ is not smooth in the interior of $C$ either. More precisely, we show that $V$ is not differentiable in the $x$ component on a dense subset of $C$. As an example we consider the Chow-Robbins game. We also give evidence that $\partial C$ is not smooth and that $C$ is not convex, even if $g(t,\cdot)$ is convex for every $t$.
Submitted 13 November, 2019;
originally announced November 2019.
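To make the stopping problem $V(t,x) = \sup_τ E_{(t,x)}[g(τ, X_τ)]$ in the abstract above concrete, the following is a minimal sketch of backward induction for a Chow-Robbins-type game. It assumes a symmetric ±1 random walk $S_n$, payoff $g(n,s) = s/n$, and a finite truncation horizon with terminal value $\max(s/N, 0)$; the function name `chow_robbins_value` and the truncation rule are illustrative assumptions, not the construction used in the paper. Under these assumptions the result is a lower bound on the infinite-horizon value that does not decrease as the horizon grows.

```python
def chow_robbins_value(horizon: int) -> float:
    """Approximate V(0, 0) for the payoff S_n / n with S_n a +/-1 walk.

    Hypothetical sketch: the game is truncated at `horizon`, where the
    value is replaced by max(s / horizon, 0).
    """
    if horizon < 1:
        raise ValueError("horizon must be at least 1")

    # At time n the walk sits at s = -n + 2k for k = 0..n;
    # values[k] holds the (approximate) value at that state.
    n = horizon
    values = [max((-n + 2 * k) / n, 0.0) for k in range(n + 1)]

    # Dynamic programming: V(n, s) = max(stop now, average of next values).
    for n in range(horizon - 1, 0, -1):
        values = [
            max((-n + 2 * k) / n, 0.5 * (values[k] + values[k + 1]))
            for k in range(n + 1)
        ]

    # At time 0 nothing can be collected yet, so the player must continue.
    return 0.5 * (values[0] + values[1])
```

Calling `chow_robbins_value(200)` runs in a fraction of a second; the cost grows quadratically in the horizon because every reachable state at every time step is visited once.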
-
Quantum thermal transistor in superconducting circuits
Authors:
Marco Majland,
Kasper Sangild Christensen,
Nikolaj Thomas Zinner
Abstract:
Logical devices based on electrical currents are ubiquitous in modern society. However, digital logic does have some drawbacks such as a relatively high power consumption. It is therefore of great interest to seek alternative means to build logical circuits that can either work as stand-alone devices or in conjunction with more traditional electronic circuits. One direction that holds great promise is the use of heat currents for logical components. In the present paper, we discuss a recent abstract proposal for a quantum thermal transistor and provide a concrete design of such a device using superconducting circuits. Using a circuit quantum electrodynamics Jaynes-Cummings model, we propose a three-terminal device that allows heat transfer from source to drain, depending on the temperature of a bath coupled at the gate modulator, and show that it provides similar properties to a conventional semiconductor transistor.
Submitted 15 May, 2020; v1 submitted 4 November, 2019;
originally announced November 2019.
-
Operator quantum machine learning: Navigating the chemical space of response properties
Authors:
Anders S. Christensen,
O. Anatole von Lilienfeld
Abstract:
The identification and use of structure property relationships lies at the heart of the chemical sciences. Quantum mechanics forms the basis for the unbiased virtual exploration of chemical compound space (CCS), imposing substantial compute needs if chemical accuracy is to be reached. In order to accelerate predictions of quantum properties without compromising accuracy, our lab has been developin…
▽ More
The identification and use of structure property relationships lies at the heart of the chemical sciences. Quantum mechanics forms the basis for the unbiased virtual exploration of chemical compound space (CCS), imposing substantial compute needs if chemical accuracy is to be reached. In order to accelerate predictions of quantum properties without compromising accuracy, our lab has been developing quantum machine learning (QML) based models which can be applied throughout CCS. Here, we briefly explain, review, and discuss the recently introduced operator formalism which substantially improves the data efficiency for QML models of common response properties.
Submitted 31 October, 2019;
originally announced October 2019.