Search | arXiv e-print repository

Simulating dynamics of correlated matter with neural quantum states

Abstract: While experimental advancements continue to expand the capabilities to control and probe non-equilibrium quantum matter at an unprecedented level, the numerical simulation of the dynamics of correlated quantum systems remains a pivotal challenge - especially in intermediate spatial dimensions. Neural quantum states are emerging as a new computational tool to investigate the time evolution of many-… ▽ More While experimental advancements continue to expand the capabilities to control and probe non-equilibrium quantum matter at an unprecedented level, the numerical simulation of the dynamics of correlated quantum systems remains a pivotal challenge - especially in intermediate spatial dimensions. Neural quantum states are emerging as a new computational tool to investigate the time evolution of many-body quantum systems in previously inaccessible regimes. We review the recent progress in the field with a focus on the different time propagation methods, an overview of the reported applications, and a discussion of the major current challenges. △ Less

Submitted 3 June, 2025; originally announced June 2025.

arXiv:2505.23860 [pdf, ps, other]

Quantum computing and artificial intelligence: status and perspectives

Authors: Giovanni Acampora, Andris Ambainis, Natalia Ares, Leonardo Banchi, Pallavi Bhardwaj, Daniele Binosi, G. Andrew D. Briggs, Tommaso Calarco, Vedran Dunjko, Jens Eisert, Olivier Ezratty, Paul Erker, Federico Fedele, Elies Gil-Fuster, Martin Gärttner, Mats Granath, Markus Heyl, Iordanis Kerenidis, Matthias Klusch, Anton Frisk Kockum, Richard Kueng, Mario Krenn, Jörg Lässig, Antonio Macaluso, Sabrina Maniscalco , et al. (13 additional authors not shown)

Abstract: This white paper discusses and explores the various points of intersection between quantum computing and artificial intelligence (AI). It describes how quantum computing could support the development of innovative AI solutions. It also examines use cases of classical AI that can empower research and development in quantum technologies, with a focus on quantum computing and quantum sensing. The pur… ▽ More This white paper discusses and explores the various points of intersection between quantum computing and artificial intelligence (AI). It describes how quantum computing could support the development of innovative AI solutions. It also examines use cases of classical AI that can empower research and development in quantum technologies, with a focus on quantum computing and quantum sensing. The purpose of this white paper is to provide a long-term research agenda aimed at addressing foundational questions about how AI and quantum computing interact and benefit one another. It concludes with a set of recommendations and challenges, including how to orchestrate the proposed theoretical work, align quantum AI developments with quantum hardware roadmaps, estimate both classical and quantum resources - especially with the goal of mitigating and optimizing energy consumption - advance this emerging hybrid software engineering discipline, and enhance European industrial competitiveness while considering societal implications. △ Less

Submitted 29 May, 2025; originally announced May 2025.

Comments: 32 pages, 3 figures

arXiv:2505.07612 [pdf, other]

Time evolution of the quantum Ising model in two dimensions using Tree Tensor Networks

Authors: Wladislaw Krinitsin, Niklas Tausendpfund, Markus Heyl, Matteo Rizzi, Markus Schmitt

Abstract: The numerical simulation of two-dimensional quantum many-body systems away from equilibrium constitutes a major challenge for all known computational methods. We investigate the utility of Tree Tensor Network (TTN) states to solve the dynamics of the quantum Ising model in two dimensions. Within the perturbative regime of small transverse fields, TTNs faithfully reproduce analytically known, but n… ▽ More The numerical simulation of two-dimensional quantum many-body systems away from equilibrium constitutes a major challenge for all known computational methods. We investigate the utility of Tree Tensor Network (TTN) states to solve the dynamics of the quantum Ising model in two dimensions. Within the perturbative regime of small transverse fields, TTNs faithfully reproduce analytically known, but non-trivial and physically interesting results, for lattices up to $16 \times 16$ sites. Limitations of the method related to the rapid growth of entanglement entropy are explored within more general, paradigmatic quench settings. We provide and discuss comprehensive benchmarks regarding the benefit of \emph{GPU} acceleration and the impact of using local operator sums on the performance. △ Less

Submitted 12 May, 2025; originally announced May 2025.

Comments: 12 pages, 11 figures

arXiv:2505.03129 [pdf, other]

Finite-temperature properties of the prototypical perovskite CaTiO$_3$ from second-principles effective interatomic potential

Authors: Huazhang Zhang, Michael Marcus Schmitt, Louis Bastogne, Xu He, Philippe Ghosez

Abstract: We introduce a second-principles effective interatomic potential for the perovskite $\rm CaTiO_3$ (CTO), relying on a Taylor polynomial expansion of the Born-Oppenheimer energy surface around the cubic reference structure, in terms of atomic displacements and macroscopic strains. This model captures various phases of CTO, in particular successfully reproducing the structure, energy, and dynamical… ▽ More We introduce a second-principles effective interatomic potential for the perovskite $\rm CaTiO_3$ (CTO), relying on a Taylor polynomial expansion of the Born-Oppenheimer energy surface around the cubic reference structure, in terms of atomic displacements and macroscopic strains. This model captures various phases of CTO, in particular successfully reproducing the structure, energy, and dynamical properties of the nonpolar $Pbnm$ ground state as well as the ferroelectric $R3c$ phase. The finite-temperature simulations suggest that the sequence of structural phase transitions over heating of CTO is: $Pbnm \ (a^-a^-c^+) \rightarrow C2/m \ (a^-b^-c^0) \rightarrow I4/mcm \ (a^-c^0c^0) \rightarrow Pm\bar{3}m \ (a^0a^0a^0)$, during which the oxygen octahedral rotations around the three pseudocubic axes vanish progressively. The model also provides the opportunity of investigating the properties of the ferroelectric $R3c$ phase, which is a metastable phase free of lattice instability at zero Kelvin. Our model-based simulations confirm that the $R3c$ phase remains stable below a certain finite temperature. Additionally, we find that the minimum energy path connecting the $Pbnm$ and $R3c$ phases involves localized layer-by-layer flipping of octahedral rotations. A similar mechanism is also observed in the thermal destabilization process of the $R3c$ phase toward the $Pbnm$ ground state in our simulation. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2504.17593 [pdf, other]

The stellar corona-chromosphere connection. A comprehensive study of X-ray and Ca II IRT fluxes from eROSITA and Gaia

Authors: S. Freund, S. Czesla, B. Fuhrmeister, P. Predehl, J. Robrade, P. C. Schneider, J. H. M. M. Schmitt

Abstract: Stellar activity can be observed at different wavelengths in a variety of different activity indicators. We investigated the correlation between coronal and chromospheric emissions by combining X-ray data from stars detected in the eROSITA all-sky surveys (eRASS1 and eRASS:5) with Ca II infrared triplet (IRT) activity indices as published in the third Gaia data release (Gaia DR3). We specifically… ▽ More Stellar activity can be observed at different wavelengths in a variety of different activity indicators. We investigated the correlation between coronal and chromospheric emissions by combining X-ray data from stars detected in the eROSITA all-sky surveys (eRASS1 and eRASS:5) with Ca II infrared triplet (IRT) activity indices as published in the third Gaia data release (Gaia DR3). We specifically studied 24 300 and 43 200 stellar sources with reliable Ca II IRT measurement and X-ray detection in eRASS1 and eRASS:5, which is by far the largest stellar sample available so far. The largest detection fraction is obtained for highly active sources and stars of a late spectral type, while F-type and less active stars (as measured in the Ca II IRT) remain mostly undetected in X-rays. Also, the correlation is the strongest for late-type sources, while F-type stars show a rather weak correlation between the X-ray to bolometric flux ratio and the Ca II IRT activity index. The relation between the X-ray and Ca II IRT surface fluxes changes with the fractional X-ray flux without showing two separated branches as described in previous studies. For fast rotators, both activity indicators saturate at a similar Rossby number and the X-ray to bolometric flux ratio decreases faster than the IRT index for slower rotating stars. As a consequence, the ratio between X-ray and IRT fluxes is constant in the saturation regime and decreases for slow rotators. △ Less

Submitted 24 April, 2025; originally announced April 2025.

Comments: 9 pages, 10 figures, 3 tables, accepted for publication in A&A

arXiv:2504.08441 [pdf, other]

SARFormer -- An Acquisition Parameter Aware Vision Transformer for Synthetic Aperture Radar Data

Authors: Jonathan Prexl, Michael Recla, Michael Schmitt

Abstract: This manuscript introduces SARFormer, a modified Vision Transformer (ViT) architecture designed for processing one or multiple synthetic aperture radar (SAR) images. Given the complex image geometry of SAR data, we propose an acquisition parameter encoding module that significantly guides the learning process, especially in the case of multiple images, leading to improved performance on downstream… ▽ More This manuscript introduces SARFormer, a modified Vision Transformer (ViT) architecture designed for processing one or multiple synthetic aperture radar (SAR) images. Given the complex image geometry of SAR data, we propose an acquisition parameter encoding module that significantly guides the learning process, especially in the case of multiple images, leading to improved performance on downstream tasks. We further explore self-supervised pre-training, conduct experiments with limited labeled data, and benchmark our contribution and adaptations thoroughly in ablation experiments against a baseline, where the model is tested on tasks such as height reconstruction and segmentation. Our approach achieves up to 17% improvement in terms of RMSE over baseline models △ Less

Submitted 11 April, 2025; originally announced April 2025.

arXiv:2504.03181 [pdf, other]

MIMRS: A Survey on Masked Image Modeling in Remote Sensing

Authors: Shabnam Choudhury, Akhil Vasim, Michael Schmitt, Biplab Banerjee

Abstract: Masked Image Modeling (MIM) is a self-supervised learning technique that involves masking portions of an image, such as pixels, patches, or latent representations, and training models to predict the missing information using the visible context. This approach has emerged as a cornerstone in self-supervised learning, unlocking new possibilities in visual understanding by leveraging unannotated data… ▽ More Masked Image Modeling (MIM) is a self-supervised learning technique that involves masking portions of an image, such as pixels, patches, or latent representations, and training models to predict the missing information using the visible context. This approach has emerged as a cornerstone in self-supervised learning, unlocking new possibilities in visual understanding by leveraging unannotated data for pre-training. In remote sensing, MIM addresses challenges such as incomplete data caused by cloud cover, occlusions, and sensor limitations, enabling applications like cloud removal, multi-modal data fusion, and super-resolution. By synthesizing and critically analyzing recent advancements, this survey (MIMRS) is a pioneering effort to chart the landscape of mask image modeling in remote sensing. We highlight state-of-the-art methodologies, applications, and future research directions, providing a foundational review to guide innovation in this rapidly evolving field. △ Less

Submitted 7 April, 2025; v1 submitted 4 April, 2025; originally announced April 2025.

Comments: 6 pages

arXiv:2504.02338 [pdf, other]

Coronal and chromospheric activity of Teegarden's star

Authors: B. Fuhrmeister, J. H. M. M. Schmitt, A. Reienrs, S. Czesla, V. J. S. Béjar, J. Caballero, Th. Henning, J. C. Morales, A. Quirrenbach, I. Ribas, J. Robrade, P. C. Schneider, M. Zechmeister

Abstract: Teegarden's star is a late-type M-dwarf planet host, typically showing only rather low levels of activity. In this paper we present an extensive characterisation of this activity at photospheric, chromospheric, and coronal levels. We specifically investigated TESS observations of Teegarden's star, which showed two very large flares with an estimated flare fluence between 10$^{29}$ and 10$^{32}$\,e… ▽ More Teegarden's star is a late-type M-dwarf planet host, typically showing only rather low levels of activity. In this paper we present an extensive characterisation of this activity at photospheric, chromospheric, and coronal levels. We specifically investigated TESS observations of Teegarden's star, which showed two very large flares with an estimated flare fluence between 10$^{29}$ and 10$^{32}$\,erg comparable to the largest solar flares. We furthermore analysed nearly 300 CARMENES spectra and 11 ESPRESSO spectra covering all the usually used chromospheric lines in the optical from the \ion{Ca}{ii} H \& K lines at 3930\,Å\, to the \ion{He}{i} infrared triplet at 10830\,Å. These lines show different behaviour: The \ion{He}{i} infrared triplet is the only one absent in all spectra, some lines show up only during flares, and others are always present and highly variable. Specifically, the H$α$ line is more or less filled in during quiescence; however, the higher Balmer lines are still observed in emission. Many chromospheric lines show a correlation with H$α$ variability, which, in addition to stochastic behaviour, also shows systematic behaviour on different timescales including the rotation period. Moreover, we found several flares and also report hints of an erupting prominence, which may have led to a coronal mass ejection. Finally, we present X-ray observations of Teegarden's star (i.e. a discovery pointing obtained with the \emph{Chandra} observatory) and an extensive study with the \emph{XMM-Newton} observatory; when these two large flares were observed, one of them showed clear signatures of the Neupert effect, suggesting the production of hard X-rays in the system. △ Less

Submitted 3 April, 2025; originally announced April 2025.

Comments: 15 pages, 20 figures

Journal ref: 2024A&A...691A.208F

arXiv:2503.24011 [pdf, other]

Simulations in Statistical Workflows

Authors: Paul-Christian Bürkner, Marvin Schmitt, Stefan T. Radev

Abstract: Simulations play important and diverse roles in statistical workflows, for example, in model specification, checking, validation, and even directly in model inference. Over the past decades, the application areas and overall potential of simulations in statistical workflows have expanded significantly, driven by the development of new simulation-based algorithms and exponentially increasing comput… ▽ More Simulations play important and diverse roles in statistical workflows, for example, in model specification, checking, validation, and even directly in model inference. Over the past decades, the application areas and overall potential of simulations in statistical workflows have expanded significantly, driven by the development of new simulation-based algorithms and exponentially increasing computational resources. In this paper, we examine past and current trends in the field and offer perspectives on how simulations may shape the future of statistical practice. △ Less

Submitted 31 March, 2025; originally announced March 2025.

arXiv:2502.15932 [pdf, other]

CVE-LLM : Ontology-Assisted Automatic Vulnerability Evaluation Using Large Language Models

Authors: Rikhiya Ghosh, Hans-Martin von Stockhausen, Martin Schmitt, George Marica Vasile, Sanjeev Kumar Karn, Oladimeji Farri

Abstract: The National Vulnerability Database (NVD) publishes over a thousand new vulnerabilities monthly, with a projected 25 percent increase in 2024, highlighting the crucial need for rapid vulnerability identification to mitigate cybersecurity attacks and save costs and resources. In this work, we propose using large language models (LLMs) to learn vulnerability evaluation from historical assessments of… ▽ More The National Vulnerability Database (NVD) publishes over a thousand new vulnerabilities monthly, with a projected 25 percent increase in 2024, highlighting the crucial need for rapid vulnerability identification to mitigate cybersecurity attacks and save costs and resources. In this work, we propose using large language models (LLMs) to learn vulnerability evaluation from historical assessments of medical device vulnerabilities in a single manufacturer's portfolio. We highlight the effectiveness and challenges of using LLMs for automatic vulnerability evaluation and introduce a method to enrich historical data with cybersecurity ontologies, enabling the system to understand new vulnerabilities without retraining the LLM. Our LLM system integrates with the in-house application - Cybersecurity Management System (CSMS) - to help Siemens Healthineers (SHS) product cybersecurity experts efficiently assess the vulnerabilities in our products. Also, we present guidelines for efficient integration of LLMs into the cybersecurity tool. △ Less

Submitted 21 February, 2025; originally announced February 2025.

Comments: arXiv admin note: substantial text overlap with arXiv:2407.14640

arXiv:2502.03279 [pdf, other]

Posterior SBC: Simulation-Based Calibration Checking Conditional on Data

Authors: Teemu Säilynoja, Marvin Schmitt, Paul-Christian Bürkner, Aki Vehtari

Abstract: Simulation-based calibration checking (SBC) refers to the validation of an inference algorithm and model implementation through repeated inference on data simulated from a generative model. In the original and commonly used approach, the generative model uses parameters drawn from the prior, and thus the approach is testing whether the inference works for simulated data generated with parameter va… ▽ More Simulation-based calibration checking (SBC) refers to the validation of an inference algorithm and model implementation through repeated inference on data simulated from a generative model. In the original and commonly used approach, the generative model uses parameters drawn from the prior, and thus the approach is testing whether the inference works for simulated data generated with parameter values plausible under that prior. This approach is natural and desirable when we want to test whether the inference works for a wide range of datasets we might observe. However, after observing data, we are interested in answering whether the inference works conditional on that particular data. In this paper, we propose posterior SBC and demonstrate how it can be used to validate the inference conditionally on observed data. We illustrate the utility of posterior SBC in three case studies: (1) A simple multilevel model; (2) a model that is governed by differential equations; and (3) a joint integrative neuroscience model which is approximated via amortized Bayesian inference with neural networks. △ Less

Submitted 10 March, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

Comments: 25 pages

arXiv:2501.13483 [pdf, other]

Robust Amortized Bayesian Inference with Self-Consistency Losses on Unlabeled Data

Authors: Aayush Mishra, Daniel Habermann, Marvin Schmitt, Stefan T. Radev, Paul-Christian Bürkner

Abstract: Amortized Bayesian inference (ABI) with neural networks can solve probabilistic inverse problems orders of magnitude faster than classical methods. However, ABI is not yet sufficiently robust for widespread and safe application. When performing inference on observations outside the scope of the simulated training data, posterior approximations are likely to become highly biased, which cannot be co… ▽ More Amortized Bayesian inference (ABI) with neural networks can solve probabilistic inverse problems orders of magnitude faster than classical methods. However, ABI is not yet sufficiently robust for widespread and safe application. When performing inference on observations outside the scope of the simulated training data, posterior approximations are likely to become highly biased, which cannot be corrected by additional simulations due to the bad pre-asymptotic behavior of current neural posterior estimators. In this paper, we propose a semi-supervised approach that enables training not only on labeled simulated data generated from the model, but also on \textit{unlabeled} data originating from any source, including real data. To achieve this, we leverage Bayesian self-consistency properties that can be transformed into strictly proper losses that do not require knowledge of ground-truth parameters. We test our approach on several real-world case studies, including applications to high-dimensional time-series and image data. Our results show that semi-supervised learning with unlabeled data drastically improves the robustness of ABI in the out-of-simulation regime. Notably, inference remains accurate even when evaluated on observations far away from the labeled and unlabeled data seen during training. △ Less

Submitted 15 May, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

arXiv:2501.09025 [pdf, other]

doi 10.1109/TAI.2025.3527398

Cyber Shadows: Neutralizing Security Threats with AI and Targeted Policy Measures

Authors: Marc Schmitt, Pantelis Koutroumpis

Abstract: The digital age, driven by the AI revolution, brings significant opportunities but also conceals security threats, which we refer to as cyber shadows. These threats pose risks at individual, organizational, and societal levels. This paper examines the systemic impact of these cyber threats and proposes a comprehensive cybersecurity strategy that integrates AI-driven solutions, such as Intrusion De… ▽ More The digital age, driven by the AI revolution, brings significant opportunities but also conceals security threats, which we refer to as cyber shadows. These threats pose risks at individual, organizational, and societal levels. This paper examines the systemic impact of these cyber threats and proposes a comprehensive cybersecurity strategy that integrates AI-driven solutions, such as Intrusion Detection Systems (IDS), with targeted policy interventions. By combining technological and regulatory measures, we create a multilevel defense capable of addressing both direct threats and indirect negative externalities. We emphasize that the synergy between AI-driven solutions and policy interventions is essential for neutralizing cyber threats and mitigating their negative impact on the digital economy. Finally, we underscore the need for continuous adaptation of these strategies, especially in response to the rapid advancement of autonomous AI-driven attacks, to ensure the creation of secure and resilient digital ecosystems. △ Less

Submitted 28 January, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

Comments: IEEE Transactions on Artificial Intelligence

Journal ref: IEEE Transactions on Artificial Intelligence (2025)

arXiv:2412.13394 [pdf, other]

Distribution Shifts at Scale: Out-of-distribution Detection in Earth Observation

Authors: Burak Ekim, Girmaw Abebe Tadesse, Caleb Robinson, Gilles Hacheme, Michael Schmitt, Rahul Dodhia, Juan M. Lavista Ferres

Abstract: Training robust deep learning models is crucial in Earth Observation, where globally deployed models often face distribution shifts that degrade performance, especially in low-data regions. Out-of-distribution (OOD) detection addresses this by identifying inputs that deviate from in-distribution (ID) data. However, existing methods either assume access to OOD data or compromise primary task perfor… ▽ More Training robust deep learning models is crucial in Earth Observation, where globally deployed models often face distribution shifts that degrade performance, especially in low-data regions. Out-of-distribution (OOD) detection addresses this by identifying inputs that deviate from in-distribution (ID) data. However, existing methods either assume access to OOD data or compromise primary task performance, limiting real-world use. We introduce TARDIS, a post-hoc OOD detection method designed for scalable geospatial deployment. Our core innovation lies in generating surrogate distribution labels by leveraging ID data within the feature space. TARDIS takes a pre-trained model, ID data, and data from an unknown distribution (WILD), separates WILD into surrogate ID and OOD labels based on internal activations, and trains a binary classifier to detect distribution shifts. We validate on EuroSAT and xBD across 17 setups covering covariate and semantic shifts, showing near-upper-bound surrogate labeling performance in 13 cases and matching the performance of top post-hoc activation- and scoring-based methods. Finally, deploying TARDIS on Fields of the World reveals actionable insights into pre-trained model behavior at scale. The code is available at \href{https://github.com/microsoft/geospatial-ood-detection}{https://github.com/microsoft/geospatial-ood-detection} △ Less

Submitted 8 April, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

arXiv:2412.11830 [pdf, other]

Many-body dynamics with explicitly time-dependent neural quantum states

Authors: Anka Van de Walle, Markus Schmitt, Annabelle Bohrdt

Abstract: Simulating the dynamics of many-body quantum systems is a significant challenge, especially in higher dimensions where entanglement grows rapidly. Neural quantum states (NQS) offer a promising tool for representing quantum wavefunctions, but their application to time evolution faces scaling challenges. We introduce the time-dependent neural quantum state (t-NQS), a novel approach incorporating exp… ▽ More Simulating the dynamics of many-body quantum systems is a significant challenge, especially in higher dimensions where entanglement grows rapidly. Neural quantum states (NQS) offer a promising tool for representing quantum wavefunctions, but their application to time evolution faces scaling challenges. We introduce the time-dependent neural quantum state (t-NQS), a novel approach incorporating explicit time dependence into the neural network ansatz. This framework optimizes a single, time-independent set of parameters to solve the time-dependent Schrödinger equation across an entire time interval. We detail an autoregressive, attention-based transformer architecture and techniques for extending the model's applicability. To benchmark and demonstrate our method, we simulate quench dynamics in the 2D transverse field Ising model and the time-dependent preparation of the 2D antiferromagnetic state in a Heisenberg model, demonstrating state of the art performance, scalability, and extrapolation to unseen intervals. These results establish t-NQS as a powerful framework for exploring quantum dynamics in strongly correlated systems. △ Less

Submitted 16 December, 2024; originally announced December 2024.

Comments: 11 pages, 6 figures

arXiv:2412.10145 [pdf, other]

Roughening dynamics of interfaces in the two-dimensional quantum Ising model

Authors: Wladislaw Krinitsin, Niklas Tausendpfund, Matteo Rizzi, Markus Heyl, Markus Schmitt

Abstract: The properties of interfaces are key to understand the physics of matter. However, the study of quantum interface dynamics has remained an outstanding challenge. Here, we use large-scale Tree Tensor Network simulations to identify the dynamical signature of an interface roughening transition within the ferromagnetic phase of the 2D quantum Ising model. For initial domain wall profiles we find exte… ▽ More The properties of interfaces are key to understand the physics of matter. However, the study of quantum interface dynamics has remained an outstanding challenge. Here, we use large-scale Tree Tensor Network simulations to identify the dynamical signature of an interface roughening transition within the ferromagnetic phase of the 2D quantum Ising model. For initial domain wall profiles we find extended prethermal plateaus for smooth interfaces, whereas above the roughening transition the domain wall decays quickly. Our results can be readily explored experimentally in Rydberg atomic systems. △ Less

Submitted 21 May, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

Comments: 17 pages, 10 figures

arXiv:2412.06577 [pdf, other]

doi 10.1063/5.0236904

Effect of simple shear on knotted polymer coils and globules

Authors: Andrey Milchev, Maurice P. Schmitt, Peter Virnau

Abstract: We explore the effect of Couette flow on knotted linear polymer chains with extensive Molecular Dynamics (MD) simulations. Hydrodynamic interactions are accounted for by means of Multi-Particle Collision Dynamics (MPCD). The polymer chain, containing originally a simple trefoil knot at rest, is described by a coarse-grained bead-spring model in a coil or globular state. We demonstrate that under s… ▽ More We explore the effect of Couette flow on knotted linear polymer chains with extensive Molecular Dynamics (MD) simulations. Hydrodynamic interactions are accounted for by means of Multi-Particle Collision Dynamics (MPCD). The polymer chain, containing originally a simple trefoil knot at rest, is described by a coarse-grained bead-spring model in a coil or globular state. We demonstrate that under shear existing loosely localized knots in polymer coils typically tighten to several segments beyond a certain shear rate threshold. At large shear rates the polymer undergoes a tumbling-like motion during which knot sizes can fluctuate. In contrast, sheared knotted globules unwind into a convoluted pearl-necklace structure of sub-globules that folds back onto itself and in which knot types change over time. △ Less

Submitted 9 December, 2024; originally announced December 2024.

Comments: This article may be downloaded for personal use only. Any other use requires prior permission of the author and AIP Publishing. This article appeared in J. Chem. Phys. 161, 224905 (2024) and may be found at https://doi.org/10.1063/5.0236904

Journal ref: J. Chem. Phys. 161, 224905 (2024)

arXiv:2411.13357 [pdf, other]

doi 10.1063/5.0228826

Topological comparison of flexible and semiflexible chains in polymer melts with $θ$-chains

Authors: Maurice P. Schmitt, Sarah Wettermann, Kostas Ch. Daoulas, Hendrik Meyer, Peter Virnau

Abstract: A central paradigm of polymer physics states that chains in melts behave like random walks as intra- and interchain interactions effectively cancel each other out. Likewise, $θ$-chains, i.e., chains at the transition from a swollen coil to a globular phase, are also thought to behave like ideal chains, as attractive forces are counterbalanced by repulsive entropic contributions. While the simple m… ▽ More A central paradigm of polymer physics states that chains in melts behave like random walks as intra- and interchain interactions effectively cancel each other out. Likewise, $θ$-chains, i.e., chains at the transition from a swollen coil to a globular phase, are also thought to behave like ideal chains, as attractive forces are counterbalanced by repulsive entropic contributions. While the simple mapping to an equivalent Kuhn chain works rather well in most scenarios with corrections to scaling, random walks do not accurately capture the topology and knots particularly for flexible chains. In this paper, we demonstrate with Monte Carlo and molecular dynamics simulations that chains in polymer melts and $θ$-chains not only agree on a structural level for a range of stiffnesses, but also topologically. They exhibit similar knotting probabilities and knot sizes, both of which are not captured by ideal chain representations. This discrepancy comes from the suppression of small knots in real chains, which is strongest for very flexible chains because excluded volume effects are still active locally and become weaker with increasing semiflexibility. Our findings suggest that corrections to ideal behavior are indeed similar for the two scenarios of real chains and that structure and topology of a chain in a melt can be approximately reproduced by a corresponding $θ$-chain. △ Less

Submitted 20 November, 2024; originally announced November 2024.

Comments: This article may be downloaded for personal use only. Any other use requires prior permission of the author and AIP Publishing. This article appeared in J. Chem. Phys. 161, 144904 (2024) and may be found at https://pubs.aip.org/aip/jcp/article/161/14/144904/3316214/Topological-comparison-of-flexible-and

Journal ref: J. Chem. Phys. 161, 144904 (2024)

arXiv:2410.03616 [pdf, other]

A long-duration superflare on the K giant HD 251108

Authors: Hans Moritz Günther, Dheeraj Pasham, Alexander Binks, Stefan Czesla, Teruaki Enoto, Michael Fausnaugh, Franz-Josef Hambsch, Shun Inoue, Hiroyuki Maehara, Yuta Notsu, Jan Robrade, J. H. M. M. Schmitt, P. C. Schneider

Abstract: Many giant stars are magnetically active, which causes rotational variability, chromospheric emission lines, and X-ray emission. Large outbursts in these emission features can set limits on the magnetic field strength and thus constrain the mechanism of the underlying dynamo. HD~251108 is a Li-rich active K-type giant. We find a rotational period of 21.3~d with color changes and additional long-te… ▽ More Many giant stars are magnetically active, which causes rotational variability, chromospheric emission lines, and X-ray emission. Large outbursts in these emission features can set limits on the magnetic field strength and thus constrain the mechanism of the underlying dynamo. HD~251108 is a Li-rich active K-type giant. We find a rotational period of 21.3~d with color changes and additional long-term photometric variability. Both can be explained with very stable stellar spots. We followed the decay phase of a superflare for 28 days with NICER and from the ground. We track the flare decay in unprecedented detail in several coronal temperature components. With a peak flux around $10^{34}$~erg~s$^{-1}$ (0.5-4.0~keV) and an exponential decay time of 2.2~days in the early decay phase, this is one of the strongest flares ever observed; yet it follows trends established from samples of smaller flares, for example for the relations between H$α$ and X-ray flux, indicating that the physical process that powers the flare emission is consistent over a large range of flare energies. We estimate a flare loop length about 2-4 times the stellar radius. No evidence is seen for abundance changes during the flare. △ Less

Submitted 4 October, 2024; originally announced October 2024.

Comments: submitted to ApJ, one electronic figures and data will be available with the journal publication. The version on arXiv contains a static image of that figure

arXiv:2409.04332 [pdf, other]

Amortized Bayesian Workflow

Authors: Chengkun Li, Aki Vehtari, Paul-Christian Bürkner, Stefan T. Radev, Luigi Acerbi, Marvin Schmitt

Abstract: Bayesian inference often faces a trade-off between computational speed and sampling accuracy. We propose an adaptive workflow that integrates rapid amortized inference with gold-standard MCMC techniques to achieve a favorable combination of both speed and accuracy when performing inference on many observed datasets. Our approach uses principled diagnostics to guide the choice of inference method f… ▽ More Bayesian inference often faces a trade-off between computational speed and sampling accuracy. We propose an adaptive workflow that integrates rapid amortized inference with gold-standard MCMC techniques to achieve a favorable combination of both speed and accuracy when performing inference on many observed datasets. Our approach uses principled diagnostics to guide the choice of inference method for each dataset, moving along the Pareto front from fast amortized sampling via generative neural networks to slower but guaranteed-accurate MCMC when needed. By reusing computations across steps, our workflow synergizes amortized and MCMC-based inference. We demonstrate the effectiveness of this integrated approach on several synthetic and real-world problems with tens of thousands of datasets, showing efficiency gains while maintaining high posterior quality. △ Less

Submitted 27 May, 2025; v1 submitted 6 September, 2024; originally announced September 2024.

Comments: 26 pages, 11 figures

arXiv:2408.13230 [pdf, other]

Amortized Bayesian Multilevel Models

Authors: Daniel Habermann, Marvin Schmitt, Lars Kühmichel, Andreas Bulling, Stefan T. Radev, Paul-Christian Bürkner

Abstract: Multilevel models (MLMs) are a central building block of the Bayesian workflow. They enable joint, interpretable modeling of data across hierarchical levels and provide a fully probabilistic quantification of uncertainty. Despite their well-recognized advantages, MLMs pose significant computational challenges, often rendering their estimation and evaluation intractable within reasonable time const… ▽ More Multilevel models (MLMs) are a central building block of the Bayesian workflow. They enable joint, interpretable modeling of data across hierarchical levels and provide a fully probabilistic quantification of uncertainty. Despite their well-recognized advantages, MLMs pose significant computational challenges, often rendering their estimation and evaluation intractable within reasonable time constraints. Recent advances in simulation-based inference offer promising solutions for addressing complex probabilistic models using deep generative networks. However, the utility and reliability of deep learning methods for estimating Bayesian MLMs remains largely unexplored, especially when compared with gold-standard samplers. To this end, we explore a family of neural network architectures that leverage the probabilistic factorization of multilevel models to facilitate efficient neural network training and subsequent near-instant posterior inference on unseen datasets. We test our method on several real-world case studies and provide comprehensive comparisons to Stan's gold standard sampler, where possible. Finally, we provide an open-source implementation of our methods to stimulate further research in the nascent field of amortized Bayesian inference. △ Less

Submitted 9 April, 2025; v1 submitted 23 August, 2024; originally announced August 2024.

Comments: 24 pages, 13 figures

arXiv:2408.11000 [pdf, other]

SenPa-MAE: Sensor Parameter Aware Masked Autoencoder for Multi-Satellite Self-Supervised Pretraining

Authors: Jonathan Prexl, Michael Schmitt

Abstract: This paper introduces SenPa-MAE, a transformer architecture that encodes the sensor parameters of an observed multispectral signal into the image embeddings. SenPa-MAE can be pre-trained on imagery of different satellites with non-matching spectral or geometrical sensor characteristics. To incorporate sensor parameters, we propose a versatile sensor parameter encoding module as well as a data augm… ▽ More This paper introduces SenPa-MAE, a transformer architecture that encodes the sensor parameters of an observed multispectral signal into the image embeddings. SenPa-MAE can be pre-trained on imagery of different satellites with non-matching spectral or geometrical sensor characteristics. To incorporate sensor parameters, we propose a versatile sensor parameter encoding module as well as a data augmentation strategy for the diversification of the pre-training dataset. This enables the model to effectively differentiate between various sensors and gain an understanding of sensor parameters and the correlation to the observed signal. Given the rising number of Earth observation satellite missions and the diversity in their sensor specifications, our approach paves the way towards a sensor-independent Earth observation foundation model. This opens up possibilities such as cross-sensor training and sensor-independent inference. △ Less

Submitted 20 August, 2024; originally announced August 2024.

Comments: GCPR 2024

arXiv:2408.07625 [pdf, other]

Neural Quantum States and Peaked Molecular Wave Functions: Curse or Blessing?

Authors: Aleksei Malyshev, Markus Schmitt, A. I. Lvovsky

Abstract: The field of neural quantum states has recently experienced a tremendous progress, making them a competitive tool of computational quantum many-body physics. However, their largest achievements to date mostly concern interacting spin systems, while their utility for quantum chemistry remains yet to be demonstrated. Two main complications are the peaked structure of the molecular wave functions, wh… ▽ More The field of neural quantum states has recently experienced a tremendous progress, making them a competitive tool of computational quantum many-body physics. However, their largest achievements to date mostly concern interacting spin systems, while their utility for quantum chemistry remains yet to be demonstrated. Two main complications are the peaked structure of the molecular wave functions, which impedes sampling, and large number of terms in second quantised Hamiltonians, which hinders scaling to larger molecule sizes. In this paper we address these issues jointly and argue that the peaked structure might actually be key to drastically more efficient calculations. Specifically, we introduce a novel algorithm for autoregressive sampling without replacement and a procedure to calculate a computationally cheaper surrogate for the local energy. We complement them with a custom modification of the stochastic reconfiguration optimisation technique and a highly optimised GPU implementation. As a result, our calculations require substantially less resources and exhibit more than order of magnitude speedup compared to the previous works. On a single GPU we study molecules comprising up to 118 qubits and outperform the ``golden standard'' CCSD(T) benchmark in Hilbert spaces of $\sim 10^{15}$ Slater determinants, which is orders of magnitude larger than what was previously achieved. We believe that our work underscores the prospect of NQS for challenging quantum chemistry calculations and serves as a favourable ground for the future method development. △ Less

Submitted 14 August, 2024; originally announced August 2024.

Comments: Main text: 15 pages, 5 figures; Supplementary Material: 20 pages, 23 figures

arXiv:2408.03750 [pdf, other]

Chirality in the Kagome Metal CsV$_3$Sb$_5$

Authors: H. J. Elmers, O. Tkach, Y. Lytvynenko, P. Yogi, M. Schmitt, D. Biswas, J. Liu, S. V. Chernov, M. Hoesch, D. Kutnyakhov, N. Wind, L. Wenthaus, M. Scholz, K. Rossnagel, A. Gloskovskii, C. Schlueter, A. Winkelmann, A. -A. Haghighirad, T. -L. Lee, M. Sing, R. Claessen, M. Le Tacon, J. Demsar, G. Schonhense, O. Fedchenko

Abstract: Using x-ray photoelectron diffraction (XPD) and angle-resolved photoemission spectroscopy, we study photoemission intensity changes related to changes in the geometric and electronic structure in the kagome metal CsV$_3$Sb$_5$ upon transition to an unconventional charge density wave (CDW) state. The XPD patterns reveal the presence of a chiral atomic structure in the CDW phase. Furthermore, using… ▽ More Using x-ray photoelectron diffraction (XPD) and angle-resolved photoemission spectroscopy, we study photoemission intensity changes related to changes in the geometric and electronic structure in the kagome metal CsV$_3$Sb$_5$ upon transition to an unconventional charge density wave (CDW) state. The XPD patterns reveal the presence of a chiral atomic structure in the CDW phase. Furthermore, using circularly polarized x-rays, we have found a pronounced non-trivial circular dichroism in the angular distribution of the valence band photoemission in the CDW phase, indicating a chirality of the electronic structure. This observation is consistent with the proposed orbital loop current order. In view of a negligible spontaneous Kerr signal in recent magneto-optical studies, the results suggest an antiferromagnetic coupling of the orbital magnetic moments along the $c$-axis. While the inherent structural chirality may also induce circular dichroism, the observed asymmetry values seem to be too large in the case of the weak structural distortions caused by the CDW. △ Less

Submitted 7 August, 2024; originally announced August 2024.

arXiv:2407.14640 [pdf, other]

CVE-LLM : Automatic vulnerability evaluation in medical device industry using large language models

Authors: Rikhiya Ghosh, Oladimeji Farri, Hans-Martin von Stockhausen, Martin Schmitt, George Marica Vasile

Abstract: The healthcare industry is currently experiencing an unprecedented wave of cybersecurity attacks, impacting millions of individuals. With the discovery of thousands of vulnerabilities each month, there is a pressing need to drive the automation of vulnerability assessment processes for medical devices, facilitating rapid mitigation efforts. Generative AI systems have revolutionized various industr… ▽ More The healthcare industry is currently experiencing an unprecedented wave of cybersecurity attacks, impacting millions of individuals. With the discovery of thousands of vulnerabilities each month, there is a pressing need to drive the automation of vulnerability assessment processes for medical devices, facilitating rapid mitigation efforts. Generative AI systems have revolutionized various industries, offering unparalleled opportunities for automation and increased efficiency. This paper presents a solution leveraging Large Language Models (LLMs) to learn from historical evaluations of vulnerabilities for the automatic assessment of vulnerabilities in the medical devices industry. This approach is applied within the portfolio of a single manufacturer, taking into account device characteristics, including existing security posture and controls. The primary contributions of this paper are threefold. Firstly, it provides a detailed examination of the best practices for training a vulnerability Language Model (LM) in an industrial context. Secondly, it presents a comprehensive comparison and insightful analysis of the effectiveness of Language Models in vulnerability assessment. Finally, it proposes a new human-in-the-loop framework to expedite vulnerability evaluation processes. △ Less

Submitted 19 July, 2024; originally announced July 2024.

arXiv:2407.10247 [pdf]

Strategic Integration of Artificial Intelligence in the C-Suite: The Role of the Chief AI Officer

Authors: Marc Schmitt

Abstract: The integration of Artificial Intelligence (AI) into corporate strategy has become a pivotal focus for organizations aiming to maintain a competitive advantage in the digital age. As AI reshapes business operations and drives innovation, the need for specialized leadership to effectively manage these changes becomes increasingly apparent. In this paper, I explore the role of the Chief AI Officer (… ▽ More The integration of Artificial Intelligence (AI) into corporate strategy has become a pivotal focus for organizations aiming to maintain a competitive advantage in the digital age. As AI reshapes business operations and drives innovation, the need for specialized leadership to effectively manage these changes becomes increasingly apparent. In this paper, I explore the role of the Chief AI Officer (CAIO) within the C-suite, emphasizing the necessity of this position for successful AI strategy, integration, and governance. I analyze future scenarios based on current trends in three key areas: the AI Economy, AI Organization, and Competition in the Age of AI. These explorations lay the foundation for identifying the antecedents (environmental, structural, and strategic factors) that justify the inclusion of a CAIO in top management teams. This sets the stage for a comprehensive examination of the CAIO's role and the broader implications of AI leadership. This paper advances the discussion on AI leadership by providing a rationale for the strategic integration of AI at the executive level and examining the role of the Chief AI Officer within organizations. △ Less

Submitted 30 April, 2024; originally announced July 2024.

arXiv:2406.19302 [pdf, other]

Mapping Land Naturalness from Sentinel-2 using Deep Contextual and Geographical Priors

Authors: Burak Ekim, Michael Schmitt

Abstract: In recent decades, the causes and consequences of climate change have accelerated, affecting our planet on an unprecedented scale. This change is closely tied to the ways in which humans alter their surroundings. As our actions continue to impact natural areas, using satellite images to observe and measure these effects has become crucial for understanding and combating climate change. Aiming to m… ▽ More In recent decades, the causes and consequences of climate change have accelerated, affecting our planet on an unprecedented scale. This change is closely tied to the ways in which humans alter their surroundings. As our actions continue to impact natural areas, using satellite images to observe and measure these effects has become crucial for understanding and combating climate change. Aiming to map land naturalness on the continuum of modern human pressure, we have developed a multi-modal supervised deep learning framework that addresses the unique challenges of satellite data and the task at hand. We incorporate contextual and geographical priors, represented by corresponding coordinate information and broader contextual information, including and surrounding the immediate patch to be predicted. Our framework improves the model's predictive performance in mapping land naturalness from Sentinel-2 data, a type of multi-spectral optical satellite imagery. Recognizing that our protective measures are only as effective as our understanding of the ecosystem, quantifying naturalness serves as a crucial step toward enhancing our environmental stewardship. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 6 pages, 3 figures, ICLR 2024 Tackling Climate Change with Machine Learning Workshop

arXiv:2406.11937 [pdf, other]

doi 10.1088/1748-0221/19/11/P11025

Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

Authors: M. Aamir, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. Al Kadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola, R. B. Amir , et al. (550 additional authors not shown)

Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr… ▽ More A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadronic section. The shower reconstruction method is based on graph neural networks and it makes use of a dynamic reduction network architecture. It is shown that the algorithm is able to capture and mitigate the main effects that normally hinder the reconstruction of hadronic showers using classical reconstruction methods, by compensating for fluctuations in the multiplicity, energy, and spatial distributions of the shower's constituents. The performance of the algorithm is evaluated using test beam data collected in 2018 prototype of the CMS HGCAL accompanied by a section of the CALICE AHCAL prototype. The capability of the method to mitigate the impact of energy leakage from the calorimeter is also demonstrated. △ Less

Submitted 18 December, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Journal ref: JINST 19 (2024) P11025

arXiv:2406.03154 [pdf, other]

Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation

Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

Abstract: Recent advances in probabilistic deep learning enable efficient amortized Bayesian inference in settings where the likelihood function is only implicitly defined by a simulation program (simulation-based inference; SBI). But how faithful is such inference if the simulation represents reality somewhat inaccurately, that is, if the true system behavior at test time deviates from the one seen during… ▽ More Recent advances in probabilistic deep learning enable efficient amortized Bayesian inference in settings where the likelihood function is only implicitly defined by a simulation program (simulation-based inference; SBI). But how faithful is such inference if the simulation represents reality somewhat inaccurately, that is, if the true system behavior at test time deviates from the one seen during training? We conceptualize the types of such model misspecification arising in SBI and systematically investigate how the performance of neural posterior approximators gradually deteriorates as a consequence, making inference results less and less trustworthy. To notify users about this problem, we propose a new misspecification measure that can be trained in an unsupervised fashion (i.e., without training data from the true distribution) and reliably detects model misspecification at test time. Our experiments clearly demonstrate the utility of our new measure both on toy examples with an analytical ground-truth and on representative scientific tasks in cell biology, cognitive decision making, disease outbreak dynamics, and computer vision. We show how the proposed misspecification test warns users about suspicious outputs, raises an alarm when predictions are not trustworthy, and guides model designers in their search for better simulators. △ Less

Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

Comments: Extended version of the conference paper https://doi.org/10.1007/978-3-031-54605-1_35. arXiv admin note: text overlap with arXiv:2112.08866

arXiv:2406.00771 [pdf]

Hybrid Photoelectron Momentum Microscope at the Soft X-ray Beamline I09 of the Diamond Light Source

Authors: Matthias Schmitt, Deepnarayan Biswas, Olena Tkach, Olena Fedchenko, Jieyi Liu, Hans-Joachim Elmers, Michael Sing, Ralph Claessen, Tien-Lin Lee, Gerd Schönhense

Abstract: Soft X-ray momentum microscopy of crystalline solids is a highly efficient approach to map the photoelectron distribution in four-dimensional (E,k) parameter space over the entire Brillouin zone. The fixed sample geometry eliminates any modulation of the matrix element otherwise caused by changing the angle of incidence. We present a new endstation at the soft X-ray branch of beamline I09 at the D… ▽ More Soft X-ray momentum microscopy of crystalline solids is a highly efficient approach to map the photoelectron distribution in four-dimensional (E,k) parameter space over the entire Brillouin zone. The fixed sample geometry eliminates any modulation of the matrix element otherwise caused by changing the angle of incidence. We present a new endstation at the soft X-ray branch of beamline I09 at the Diamond Light Source, UK. The key component is a large single hemispherical spectrometer combined with a time-of-flight analyzer behind the exit slit. The photon energy ranges from hv = 105 eV to 2 keV, with circular polarization available for hv > 150 eV, allowing for circular dichroism measurements in angle-resolved photoemission (CD-ARPES). A focused and monochromatized He lamp is used for offline measurements. Under k-imaging conditions, energy and momentum resolution are 10.2 meV (FWHM) and 0.010 angstroms^-1 (base resolution 4.2 meV with smallest slits and a pass energy of 8 eV). The large angular filling of the entrance lens and hemisphere (225 mm path radius) allows k-field-of-view diameters > 6 angstroms^-1. Energy filtered X-PEEM mode using synchrotron radiation revealed a resolution of 300 nm. As examples we show 2D band mapping of bilayer graphene, 3D mapping of the Fermi surface of Cu, CD-ARPES for intercalated indenene layers and the sp valence bands of Cu and Au, and full-field photoelectron diffraction patterns of Ge. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: 20 pages, 9 figures

arXiv:2405.12878 [pdf, other]

Epitaxial RuO$_2$ and IrO$_2$ films by pulsed laser deposition on TiO$_2$(110)

Authors: Philipp Keßler, Tim Waldsauer, Vedran Jovic, Martin Kamp, Matthias Schmitt, Michael Sing, Ralph Claessen, Simon Moser

Abstract: We present a systematic growth study of epitaxial RuO$_2$(110) and IrO$_2$(110) on TiO$_2$(110) substrates by pulsed laser deposition. We describe the main challenges encountered in the growth process, such as a deteriorating material flux due to laser induced target metallization or the delicate balance of under- vs over-oxidation of the 'stubborn' Ru and Ir metals. We identify growth temperature… ▽ More We present a systematic growth study of epitaxial RuO$_2$(110) and IrO$_2$(110) on TiO$_2$(110) substrates by pulsed laser deposition. We describe the main challenges encountered in the growth process, such as a deteriorating material flux due to laser induced target metallization or the delicate balance of under- vs over-oxidation of the 'stubborn' Ru and Ir metals. We identify growth temperatures and oxygen partial pressures of 700 K, $1\times 10^{-3}$ mbar for RuO$_2$ and 770 K, $5\times 10^{-4}$ mbar for IrO$_2$ to optimally balance between metal oxidation and particle mobility during nucleation. In contrast to IrO$_2$, RuO$_2$ exhibits layer-by-layer growth up to 5 unit cells if grown at high deposition rates. At low deposition rates, the large lattice mismatch between film and substrate fosters initial 3D island growth and cluster formation. In analogy to reports for RuO$_2$ based on physical vapor deposition, we find these islands to eventually merge and growth to continue in a step flow mode, resulting in highly crystalline, flat, stoichiometric films of RuO$_2$(110) (up to 30 nm thickness) and IrO$_2$(110) (up to 13 nm thickness) with well defined line defects. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 15 pages, 12 figures

arXiv:2403.11141 [pdf, other]

The Simplex Projection: Lossless Visualization of 4D Compositional Data on a 2D Canvas

Authors: Marvin Schmitt, Yuga Hikida, Stefan T Radev, Filip Sadlo, Paul-Christian Bürkner

Abstract: The simplex projection expands the capabilities of simplex plots (also known as ternary plots) to achieve a lossless visualization of 4D compositional data on a 2D canvas. Previously, this was only possible for 3D compositional data. We demonstrate how our approach can be applied to individual data points, point clouds, and continuous probability density functions on simplices. While we showcase o… ▽ More The simplex projection expands the capabilities of simplex plots (also known as ternary plots) to achieve a lossless visualization of 4D compositional data on a 2D canvas. Previously, this was only possible for 3D compositional data. We demonstrate how our approach can be applied to individual data points, point clouds, and continuous probability density functions on simplices. While we showcase our visualization technique specifically for 4D compositional data, we offer rigorous proofs that support its extension to compositional data of any (finite) dimensionality. △ Less

Submitted 17 March, 2024; originally announced March 2024.

arXiv:2403.07397 [pdf]

Skyrmion flow in periodically modulated channels

Authors: Klaus Raab, Maurice Schmitt, Maarten A. Brems, Jan Rothörl, Fabian Kammerbauer, Sachin Krishnia, Mathias Kläui, Peter Virnau

Abstract: Magnetic skyrmions, topologically stabilized chiral magnetic textures with particle-like properties have so far primarily been studied statically. Here, we experimentally investigate the dynamics of skyrmion ensembles in metallic thin film conduits where they behave as quasi-particle fluids. By exploiting our access to the full trajectories of all fluid particles by means of time-resolved magneto-… ▽ More Magnetic skyrmions, topologically stabilized chiral magnetic textures with particle-like properties have so far primarily been studied statically. Here, we experimentally investigate the dynamics of skyrmion ensembles in metallic thin film conduits where they behave as quasi-particle fluids. By exploiting our access to the full trajectories of all fluid particles by means of time-resolved magneto-optical Kerr microscopy, we demonstrate that boundary conditions of skyrmion fluids can be tuned by modulation of the channel geometry. We observe as a function of channel width deviations from classical flow profiles even into the no- or partial-slip regime. Unlike conventional colloids, the skyrmion Hall effect can also introduce transversal flow-asymmetries and even local motion of single skyrmions against the driving force which we explore with particle-based simulations, demonstrating the unique properties of skyrmion liquid flow that uniquely deviates from previously known behavior of other quasi-particles. △ Less

Submitted 13 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

arXiv:2402.07933 [pdf]

Human-Centered AI Product Prototyping with No-Code AutoML: Conceptual Framework, Potentials and Limitations

Authors: Mario Truss, Marc Schmitt

Abstract: This paper addresses the complexities inherent in AI product prototyping, focusing on the challenges posed by the probabilistic nature of AI behavior and the limited accessibility of prototyping tools to non-experts. A Design Science Research (DSR) approach is presented which culminates in a conceptual framework aimed at improving the AI prototyping process. Through a comprehensive literature revi… ▽ More This paper addresses the complexities inherent in AI product prototyping, focusing on the challenges posed by the probabilistic nature of AI behavior and the limited accessibility of prototyping tools to non-experts. A Design Science Research (DSR) approach is presented which culminates in a conceptual framework aimed at improving the AI prototyping process. Through a comprehensive literature review, key challenges were identified and no-code AutoML was analyzed as a solution. The framework describes the seamless incorporation of non-expert input and evaluation during prototyping, leveraging the potential of no-code AutoML to enhance accessibility and interpretability. A hybrid approach of combining naturalistic (case study) and artificial evaluation methods (criteria-based analysis) validated the utility of our approach, highlighting its efficacy in supporting AI non-experts and streamlining decision-making and its limitations. Implications for academia and industry, emphasizing the strategic integration of no-code AutoML to enhance AI product development processes, mitigate risks, and foster innovation, are discussed. △ Less

Submitted 7 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

ACM Class: I.2.0; H.5.0; D.2.2; H.1.2; I.2.5; K.6.1

arXiv:2402.03806 [pdf]

Explainable Automated Machine Learning for Credit Decisions: Enhancing Human Artificial Intelligence Collaboration in Financial Engineering

Authors: Marc Schmitt

Abstract: This paper explores the integration of Explainable Automated Machine Learning (AutoML) in the realm of financial engineering, specifically focusing on its application in credit decision-making. The rapid evolution of Artificial Intelligence (AI) in finance has necessitated a balance between sophisticated algorithmic decision-making and the need for transparency in these systems. The focus is on ho… ▽ More This paper explores the integration of Explainable Automated Machine Learning (AutoML) in the realm of financial engineering, specifically focusing on its application in credit decision-making. The rapid evolution of Artificial Intelligence (AI) in finance has necessitated a balance between sophisticated algorithmic decision-making and the need for transparency in these systems. The focus is on how AutoML can streamline the development of robust machine learning models for credit scoring, while Explainable AI (XAI) methods, particularly SHapley Additive exPlanations (SHAP), provide insights into the models' decision-making processes. This study demonstrates how the combination of AutoML and XAI not only enhances the efficiency and accuracy of credit decisions but also fosters trust and collaboration between humans and AI systems. The findings underscore the potential of explainable AutoML in improving the transparency and accountability of AI-driven financial decisions, aligning with regulatory requirements and ethical considerations. △ Less

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2401.17302 [pdf, other]

doi 10.1051/0004-6361/202449351

The high-energy environment of the heavy sub-Earth GJ 367 b indicates likely complete evaporation of its atmosphere

Authors: K. Poppenhaeger, L. Ketzer, N. Ilic, E. Magaudda, J. Robrade, B. Stelzer, J. H. M. M. Schmitt, P. C. Schneider

Abstract: The planet GJ 367 b is a recently discovered high-density sub-Earth orbiting an M dwarf star. Its composition was modelled to be predominantly iron with a potential remainder of a hydrogen-helium envelope. Here we report an X-ray detection of this planet's host star for the first time, using data from the spectro-imaging X-ray telescope eROSITA onboard the Spectrum-Roentgen-Gamma (SRG) mission. We… ▽ More The planet GJ 367 b is a recently discovered high-density sub-Earth orbiting an M dwarf star. Its composition was modelled to be predominantly iron with a potential remainder of a hydrogen-helium envelope. Here we report an X-ray detection of this planet's host star for the first time, using data from the spectro-imaging X-ray telescope eROSITA onboard the Spectrum-Roentgen-Gamma (SRG) mission. We characterise the magnetic activity of the host star from the X-ray data and estimate its effects on a potential atmosphere of the planet. We find that despite the very low activity level of the host star the expected mass loss rates, both under core-powered and photoevaporative mass loss regimes, are so high that a potential primordial or outgassed atmosphere would evaporate very quickly. Since the activity level of the host star indicates that the system is several Gigayears old, it is very unlikely that the planet currently still hosts any atmosphere. △ Less

Submitted 1 August, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 7 pages, accepted for publication by A&A, part of the eROSITA DR1 paper splash

Journal ref: A&A 689, A188 (2024)

arXiv:2401.17282 [pdf, other]

The SRG/eROSITA all-sky survey -- Identifying the coronal content with HamStar

Authors: S. Freund, S. Czesla, P. Predehl, J. Robrade, M. Salvato, P. C. Schneider, H. Starck, J. Wolf, J. H. M. M. Schmitt

Abstract: The first eROSITA all-sky survey (eRASS1) performed on board the Spectrum-Roentgen-Gamma mission (SRG) provides more than 900,000 X-ray sources in the 0.2 - 2.3 keV band located in the western hemisphere. We present identifications of the eRASS1 sources obtained using our HamStar method, which was designed for the identification of coronal X-ray sources. HamStar is a Bayesian framework that estima… ▽ More The first eROSITA all-sky survey (eRASS1) performed on board the Spectrum-Roentgen-Gamma mission (SRG) provides more than 900,000 X-ray sources in the 0.2 - 2.3 keV band located in the western hemisphere. We present identifications of the eRASS1 sources obtained using our HamStar method, which was designed for the identification of coronal X-ray sources. HamStar is a Bayesian framework that estimates coronal probabilities for each eRASS1 source based on a cross-match with optical counterparts from Gaia DR3. It considers geometric properties, such as angular separation and positional uncertainty, as well the additional properties of fractional X-ray flux, color, and distance. We identify 138,800 coronal eRASS1 sources and estimate a completeness and reliability of about 91.5% for this sample, which we confirmed with Chandra detections. This is the largest available sample of coronal X-ray emitters and we find nearly five times as many coronal sources as in the ROSAT all-sky survey. The coronal eRASS1 sources are made up of all spectral types and the onset of convection and the saturation limit are clearly visible. As opposed to previous samples, rare source types are also well populated. About 10% of the coronal eRASS1 sources have a correlated secondary counterpart, which is a wide binary companion or belongs to the same stellar cluster. We also identify 6700 known unresolved binaries, and an excess of fast binary periods below 10 d. Furthermore, the binary sequence is clearly visible in a color-magnitude diagram. When combining the coronal eRASS1 sources with rotation modulations from Gaia DR3, we find 3700 X-ray sources with known rotation periods, which is the largest sample of this kind. We fitted the rotation-activity relation and convection turnover times for our flux-limited sample. We do not detect the low-amplitude fast rotators discovered in the Gaia DR3 sample in X-rays. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: 19 pages, 19 figures; Accepted for publication in A&A

arXiv:2401.17273 [pdf, other]

doi 10.1051/0004-6361/202449181

''Forbidden" stars in the eROSITA all-sky survey: X-ray emission from very late-type giants

Authors: J. H. M. M. Schmitt, M. Hünsch, P. C. Schneider, S. Freund, S. Czesla, J. Robrade, A. Schwope

Abstract: We present the results of the first X-ray all-sky survey (eRASS1) performed by the eROSITA instrument onboard the Spectrum-Roentgen-Gamma (SRG) mission on X-ray emitting red giants and supergiants. Focussing on stars positioned at high galactic latitudes above 20 deg, we construct a complete sample of such objects using the Gaia DR3 catalog and identify a sample 96 stars appearing as bona fide ent… ▽ More We present the results of the first X-ray all-sky survey (eRASS1) performed by the eROSITA instrument onboard the Spectrum-Roentgen-Gamma (SRG) mission on X-ray emitting red giants and supergiants. Focussing on stars positioned at high galactic latitudes above 20 deg, we construct a complete sample of such objects using the Gaia DR3 catalog and identify a sample 96 stars appearing as bona fide entries in the eRASS1 source catalog. Restricting again the sample to objects nearer than 1300~pc and eliminating all catalog entries which are due to optical contamination, we end up with a sample of 16 genuine red giant/supergiant X-ray sources, which represent -- with the exception of one source (CL~Hyi) -- new X-ray detections. We furthermore present a low SNR X-ray spectrum of the nearby low activity giant Arcturus obtained from a pointed observation with the XMM-Newton satellite and give a detailed account of our data analysis. We show that Arcturus-like X-ray emission cannot be the explanation for the X-ray emissions observed by eROSITA and provide a discussion of the possible nature of the detected X-ray sources. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Journal ref: A&A 688, A9 (2024)

arXiv:2401.08798 [pdf, other]

Engineering Symmetry Breaking Interfaces by Nanoscale Structural-Energetics in Orthorhombic Perovskite Thin Films

Authors: Duncan T. L. Alexander, Hugo Meley, Michael Marcus Schmitt, Bernat Mundet, Philippe Ghosez, Jean-Marc Triscone, Stefano Gariglio

Abstract: The atomic configuration of phases and their interfaces is fundamental to materials design and engineering. Here, we unveil a transition metal oxide interface, whose formation is driven by energetic influences - epitaxial tensile strain versus oxygen octahedra connectivity - that compete in determining the orientation of an orthorhombic perovskite film. We study this phenomenon in a system of LaVO… ▽ More The atomic configuration of phases and their interfaces is fundamental to materials design and engineering. Here, we unveil a transition metal oxide interface, whose formation is driven by energetic influences - epitaxial tensile strain versus oxygen octahedra connectivity - that compete in determining the orientation of an orthorhombic perovskite film. We study this phenomenon in a system of LaVO$_3$ grown on (101) DyScO$_3$, using atomic-resolution scanning transmission electron microscopy to measure intrinsic markers of orthorhombic symmetry. We identify that the film resolves this energetic conflict by switching its orientation by 90 degrees at an atomically-flat plane within its volume, not at the film/substrate interface. At either side of this "switching plane", characteristic orthorhombic distortions tend to zero to couple mismatched oxygen octahedra rotations. The resulting boundary is highly energetic, which makes it a priori unlikely; by using second-principles atomistic modeling, we show how its formation requires structural relaxation of an entire film grown beyond a critical thickness measuring tens of unit cells. The switching plane breaks the inversion symmetry of the Pnma orthorhombic structure, and sharply joins two regions, a thin intermediate layer and the film bulk, that are held under different mechanical strain states. By therefore contacting two distinct phases of one compound that would never otherwise coexist, this alternative type of interface opens new avenues for nanoscale engineering of functional systems, such as a chemically-uniform but magnetically inhomogeneous heterostructure. △ Less

Submitted 25 November, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

arXiv:2401.02590 [pdf, ps, other]

doi 10.1093/mnras/sty2768

Prominence activation, optical flare, and post-flare loops on the RS Canum Venaticorum star SZ Piscium

Authors: Dongtao Cao, Shenghong Gu, Jian Ge, Tinggui Wang, Jilin Zhou, Liang Chang, U. Wolter, M. Mittag, J. H. M. M. Schmitt, V. Perdelwitz

Abstract: We present the results of time-resolved high-resolution spectroscopic observations of the very active RS Canum Venaticorum (RS CVn) star SZ Piscium (SZ Psc), obtained during two consecutive observing nights on October 24 and 25, 2011. Several optical chromospheric activity indicators are analyzed using the spectral subtraction technique, which show the remarkably different behavior between two nig… ▽ More We present the results of time-resolved high-resolution spectroscopic observations of the very active RS Canum Venaticorum (RS CVn) star SZ Piscium (SZ Psc), obtained during two consecutive observing nights on October 24 and 25, 2011. Several optical chromospheric activity indicators are analyzed using the spectral subtraction technique, which show the remarkably different behavior between two nights. Gradually blue-shifted and strengthened excess absorption features presented in the series of the subtracted spectra (especially for the H$_α$, He I D$_{3}$ and H$_β$ lines), as a result of active stellar prominence that is rising its height along the line of our sight, was detected in the observations on October 24. This prominence activation event was probably associated with the subsequently occurred optical flare, and part of that flare decay phase was hunted in the observations on October 25. The flare was characterized by the prominent He I D$_{3}$ line emission, as well as stronger chromospheric emission in the H$_α$, H$_β$ and other active lines. The gradual decay of flare was accompanied by an obviously developmental absorption feature in the blue wing of the H$_α$ and other active lines, which could be explained as cool post-flare loops which projected against the bright flare background. Therefore, a series of possibly associated magnetic activity phenomena, including flare-related prominence activation, optical flare and post-flare loops, were detected during our observations. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 482, Issue 1, p.988-998, January 2019

arXiv:2401.02583 [pdf, ps, other]

doi 10.1093/mnras/stad1700

Prominence detection and chromosphere feature on the prototype RS CVn of active binary systems

Authors: Dongtao Cao, Shenghong Gu, U. Wolter, M. Mittag, J. H. M. M. Schmitt, Dongyang Gao, Shaoming Hu

Abstract: We present a study of high-resolution spectra of RS Canum Venaticorum (RS CVn), a prototype of active binary systems. Our data were obtained from 1998 to 2017 using different telescopes. We analyze the chromospheric activity indicators Ca II IRT, H$_α$, Na I D$_{1}$, D$_{2}$ doublet, He I D$_{3}$, and H$_β$ using a spectral subtraction technique. The chromospheric emission stems mainly from the K2… ▽ More We present a study of high-resolution spectra of RS Canum Venaticorum (RS CVn), a prototype of active binary systems. Our data were obtained from 1998 to 2017 using different telescopes. We analyze the chromospheric activity indicators Ca II IRT, H$_α$, Na I D$_{1}$, D$_{2}$ doublet, He I D$_{3}$, and H$_β$ using a spectral subtraction technique. The chromospheric emission stems mainly from the K2 IV primary star, while the F5 V secondary star only shows weak emission features in a few of our spectra. We find excess absorption features in the subtracted H$_α$ lines and other activity indicators from spectra taken near primary eclipse, which we ascribe to prominence-like material associated with the primary star. We estimate size limits of these tentative prominences based on the geometry of the binary system, and investigate the physical properties of the strongest prominence. An optical flare, characterized by He I D$_{3}$ line emission, together with stronger emission in other activity lines, was detected. The flare energy is roughly comparable to strong flares observed on other RS CVn-type stars. The chromospherically active longitudes of RS CVn most frequently appear near the two quadratures of the system and display changes between observing runs, which indicates an ongoing evolution of its active regions. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 523, Issue 3, pp.4146-4157, August 2023

arXiv:2401.01342 [pdf]

doi 10.1016/j.jii.2023.100520

Securing the Digital World: Protecting smart infrastructures and digital industries with Artificial Intelligence (AI)-enabled malware and intrusion detection

Authors: Marc Schmitt

Abstract: The last decades have been characterized by unprecedented technological advances, many of them powered by modern technologies such as Artificial Intelligence (AI) and Machine Learning (ML). The world has become more digitally connected than ever, but we face major challenges. One of the most significant is cybercrime, which has emerged as a global threat to governments, businesses, and civil socie… ▽ More The last decades have been characterized by unprecedented technological advances, many of them powered by modern technologies such as Artificial Intelligence (AI) and Machine Learning (ML). The world has become more digitally connected than ever, but we face major challenges. One of the most significant is cybercrime, which has emerged as a global threat to governments, businesses, and civil societies. The pervasiveness of digital technologies combined with a constantly shifting technological foundation has created a complex and powerful playground for cybercriminals, which triggered a surge in demand for intelligent threat detection systems based on machine and deep learning. This paper investigates AI-based cyber threat detection to protect our modern digital ecosystems. The primary focus is on evaluating ML-based classifiers and ensembles for anomaly-based malware detection and network intrusion detection and how to integrate those models in the context of network security, mobile security, and IoT security. The discussion highlights the challenges when deploying and integrating AI-enabled cybersecurity solutions into existing enterprise systems and IT infrastructures, including options to overcome those challenges. Finally, the paper provides future research directions to further increase the security and resilience of our modern digital industries, infrastructures, and ecosystems. △ Less

Submitted 15 October, 2023; originally announced January 2024.

Journal ref: Journal of Industrial Information Integration, Volume 36, 2023, 100520

arXiv:2312.06608 [pdf, other]

Information theory for data-driven model reduction in physics and biology

Authors: Matthew S. Schmitt, Maciej Koch-Janusz, Michel Fruchart, Daniel S. Seara, Michael Rust, Vincenzo Vitelli

Abstract: Model reduction is the construction of simple yet predictive descriptions of the dynamics of many-body systems in terms of a few relevant variables. A prerequisite to model reduction is the identification of these relevant variables, a task for which no general method exists. Here, we develop a systematic approach based on the information bottleneck to identify the relevant variables, defined as t… ▽ More Model reduction is the construction of simple yet predictive descriptions of the dynamics of many-body systems in terms of a few relevant variables. A prerequisite to model reduction is the identification of these relevant variables, a task for which no general method exists. Here, we develop a systematic approach based on the information bottleneck to identify the relevant variables, defined as those most predictive of the future. We elucidate analytically the relation between these relevant variables and the eigenfunctions of the transfer operator describing the dynamics. Further, we show that in the limit of high compression, the relevant variables are directly determined by the slowest-decaying eigenfunctions. Our information-based approach indicates when to optimally stop increasing the complexity of the reduced model. Furthermore, it provides a firm foundation to construct interpretable deep learning tools that perform model reduction. We illustrate how these tools work in practice by considering uncurated videos of atmospheric flows from which our algorithms automatically extract the dominant slow collective variables, as well as experimental videos of cyanobacteria colonies in which we discover an emergent synchronization order parameter. △ Less

Submitted 17 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 39 pages, 19 figures

arXiv:2312.05440 [pdf, other]

Consistency Models for Scalable and Fast Simulation-Based Inference

Authors: Marvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T Radev

Abstract: Simulation-based inference (SBI) is constantly in search of more expressive and efficient algorithms to accurately infer the parameters of complex simulation models. In line with this goal, we present consistency models for posterior estimation (CMPE), a new conditional sampler for SBI that inherits the advantages of recent unconstrained architectures and overcomes their sampling inefficiency at i… ▽ More Simulation-based inference (SBI) is constantly in search of more expressive and efficient algorithms to accurately infer the parameters of complex simulation models. In line with this goal, we present consistency models for posterior estimation (CMPE), a new conditional sampler for SBI that inherits the advantages of recent unconstrained architectures and overcomes their sampling inefficiency at inference time. CMPE essentially distills a continuous probability flow and enables rapid few-shot inference with an unconstrained architecture that can be flexibly tailored to the structure of the estimation problem. We provide hyperparameters and default architectures that support consistency training over a wide range of different dimensions, including low-dimensional ones which are important in SBI workflows but were previously difficult to tackle even with unconditional consistency models. Our empirical evaluation demonstrates that CMPE not only outperforms current state-of-the-art algorithms on hard low-dimensional benchmarks, but also achieves competitive performance with much faster sampling speed on two realistic estimation problems with high data and/or parameter dimensions. △ Less

Submitted 4 November, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Journal ref: Neural Information Processing Systems (NeurIPS 2024)

arXiv:2312.03634 [pdf, ps, other]

Singular cohomology of symplectic quotients by circle actions and Kirwan surjectivity

Authors: Benjamin Delarue, Pablo Ramacher, Maximilian Schmitt

Abstract: Let $M$ be a symplectic manifold carrying a Hamiltonian $S^1$-action with momentum map $J:M \rightarrow \mathbb{R}$ and consider the corresponding symplectic quotient $\mathcal{M}_0:=J^{-1}(0)/S^1$. We extend Sjamaar's complex of differential forms on $\mathcal{M}_0$, whose cohomology is isomorphic to the singular cohomology $H(\mathcal{M}_0;\mathbb{R})$ of $\mathcal{M}_0$ with real coefficients,… ▽ More Let $M$ be a symplectic manifold carrying a Hamiltonian $S^1$-action with momentum map $J:M \rightarrow \mathbb{R}$ and consider the corresponding symplectic quotient $\mathcal{M}_0:=J^{-1}(0)/S^1$. We extend Sjamaar's complex of differential forms on $\mathcal{M}_0$, whose cohomology is isomorphic to the singular cohomology $H(\mathcal{M}_0;\mathbb{R})$ of $\mathcal{M}_0$ with real coefficients, to a complex of differential forms on $\mathcal{M}_0$ associated with a partial desingularization $\widetilde{\mathcal{M}}_0$, which we call resolution differential forms. The cohomology of that complex turns out to be isomorphic to the de Rham cohomology $H(\widetilde{ \mathcal{M}}_0)$ of $\widetilde{\mathcal{M}}_0$. Based on this, we derive a long exact sequence involving both $H(\mathcal{M}_0;\mathbb{R})$ and $H(\widetilde{ \mathcal{M}}_0)$ and give conditions for its splitting. We then define a Kirwan map $\mathcal{K}:H_{S^1}(M) \rightarrow H(\widetilde{\mathcal{M}}_0)$ from the equivariant cohomology $H_{S^1}(M)$ of $M$ to $H(\widetilde{\mathcal{M}}_0)$ and show that its image contains the image of $H(\mathcal{M}_0;\mathbb{R})$ in $H(\widetilde{\mathcal{M}}_0)$ under the natural inclusion. Combining both results in the case that all fixed point components of $M$ have vanishing odd cohomology we obtain a surjection $\check κ:H^\textrm{ev}_{S^1}(M) \rightarrow H^\textrm{ev}(\mathcal{M}_0;\mathbb{R})$ in even degrees, while already simple examples show that a similar surjection in odd degrees does not exist in general. As an interesting class of examples we study abelian polygon spaces. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 46 Pages

arXiv:2311.10671 [pdf, other]

Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based Inference

Authors: Marvin Schmitt, Leona Odole, Stefan T. Radev, Paul-Christian Bürkner

Abstract: We present multimodal neural posterior estimation (MultiNPE), a method to integrate heterogeneous data from different sources in simulation-based inference with neural networks. Inspired by advances in deep fusion, it allows researchers to analyze data from different domains and infer the parameters of complex mathematical models with increased accuracy. We consider three fusion approaches for Mul… ▽ More We present multimodal neural posterior estimation (MultiNPE), a method to integrate heterogeneous data from different sources in simulation-based inference with neural networks. Inspired by advances in deep fusion, it allows researchers to analyze data from different domains and infer the parameters of complex mathematical models with increased accuracy. We consider three fusion approaches for MultiNPE (early, late, hybrid) and evaluate their performance in three challenging experiments. MultiNPE not only outperforms single-source baselines on a reference task, but also achieves superior inference on scientific models from cognitive neuroscience and cardiology. We systematically investigate the impact of partially missing data on the different fusion strategies. Across our experiments, late and hybrid fusion techniques emerge as the methods of choice for practical applications of multimodal simulation-based inference. △ Less

Submitted 4 November, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

arXiv:2311.03684 [pdf, other]

doi 10.1088/2632-2153/ad4f4d

Reinforcement learning pulses for transmon qubit entangling gates

Authors: Ho Nam Nguyen, Felix Motzoi, Mekena Metcalf, K. Birgitta Whaley, Marin Bukov, Markus Schmitt

Abstract: The utility of a quantum computer depends heavily on the ability to reliably perform accurate quantum logic operations. For finding optimal control solutions, it is of particular interest to explore model-free approaches, since their quality is not constrained by the limited accuracy of theoretical models for the quantum processor - in contrast to many established gate implementation strategies. I… ▽ More The utility of a quantum computer depends heavily on the ability to reliably perform accurate quantum logic operations. For finding optimal control solutions, it is of particular interest to explore model-free approaches, since their quality is not constrained by the limited accuracy of theoretical models for the quantum processor - in contrast to many established gate implementation strategies. In this work, we utilize a continuous-control reinforcement learning algorithm to design entangling two-qubit gates for superconducting qubits; specifically, our agent constructs cross-resonance and CNOT gates without any prior information about the physical system. Using a simulated environment of fixed-frequency, fixed-coupling transmon qubits, we demonstrate the capability to generate novel pulse sequences that outperform the standard cross-resonance gates in both fidelity and gate duration, while maintaining a comparable susceptibility to stochastic unitary noise. We further showcase an augmentation in training and input information that allows our agent to adapt its pulse design abilities to drifting hardware characteristics, importantly with little to no additional optimization. Our results exhibit clearly the advantages of unbiased adaptive-feedback learning-based optimization methods for transmon gate design. △ Less

Submitted 14 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

Comments: 18 + 8 pages, 13 + 6 figures

Journal ref: Machine Learning: Science and Technology 5, 025066 (2024)

arXiv:2310.19231 [pdf, other]

doi 10.1109/MGRS.2023.3293459

There Are No Data Like More Data- Datasets for Deep Learning in Earth Observation

Authors: Michael Schmitt, Seyed Ali Ahmadi, Yonghao Xu, Gulsen Taskin, Ujjwal Verma, Francescopaolo Sica, Ronny Hansch

Abstract: Carefully curated and annotated datasets are the foundation of machine learning, with particularly data-hungry deep neural networks forming the core of what is often called Artificial Intelligence (AI). Due to the massive success of deep learning applied to Earth Observation (EO) problems, the focus of the community has been largely on the development of ever-more sophisticated deep neural network… ▽ More Carefully curated and annotated datasets are the foundation of machine learning, with particularly data-hungry deep neural networks forming the core of what is often called Artificial Intelligence (AI). Due to the massive success of deep learning applied to Earth Observation (EO) problems, the focus of the community has been largely on the development of ever-more sophisticated deep neural network architectures and training strategies largely ignoring the overall importance of datasets. For that purpose, numerous task-specific datasets have been created that were largely ignored by previously published review articles on AI for Earth observation. With this article, we want to change the perspective and put machine learning datasets dedicated to Earth observation data and applications into the spotlight. Based on a review of the historical developments, currently available resources are described and a perspective for future developments is formed. We hope to contribute to an understanding that the nature of our data is what distinguishes the Earth observation community from many other communities that apply deep learning techniques to image data, and that a detailed understanding of EO data peculiarities is among the core competencies of our discipline. △ Less

Submitted 29 October, 2023; originally announced October 2023.

Journal ref: Published in IEEE Geoscience and Remote Sensing Magazine, vol. 11, no. 3, pp. 63-97, Sept. 2023

arXiv:2310.14715 [pdf, other]

doi 10.1051/0004-6361/202346524

The CARMENES search for exoplanets around M dwarfs. Telluric absorption corrected high S/N optical and near-infrared template spectra of 382 M dwarf stars

Authors: E. Nagel, S. Czesla, A. Kaminski, M. Zechmeister, L. Tal-Or, J. H. M. M. Schmitt, A. Reiners, A. Quirrenbach, A. García López, J. A. Caballero, I. Ribas, P. J. Amado, V. J. S. Béjar, M. Cortés-Contreras, S. Dreizler, A. P. Hatzes, Th. Henning, S. V. Jeffers, M. Kürster, M. Lafarga, M. López-Puertas, D. Montes, J. C. Morales, S. Pedraz, A. Schweitzer

Abstract: Light from celestial objects interacts with the molecules of the Earth's atmosphere, resulting in the production of telluric absorption lines in ground-based spectral data. Correcting for these lines, which strongly affect red and infrared wavelengths, is often needed in a wide variety of scientific applications. Here, we present the template division telluric modeling (TDTM) technique, a method f… ▽ More Light from celestial objects interacts with the molecules of the Earth's atmosphere, resulting in the production of telluric absorption lines in ground-based spectral data. Correcting for these lines, which strongly affect red and infrared wavelengths, is often needed in a wide variety of scientific applications. Here, we present the template division telluric modeling (TDTM) technique, a method for accurately removing telluric absorption lines in stars that exhibit numerous intrinsic features. Based on the Earth's barycentric motion throughout the year, our approach is suited for disentangling telluric and stellar spectral components. By fitting a synthetic transmission model, telluric-free spectra are derived. We demonstrate the performance of the TDTM technique in correcting telluric contamination using a high-resolution optical spectral time series of the feature-rich M3.0 dwarf star Wolf 294 that was obtained with the CARMENES spectrograph. We apply the TDTM approach to the CARMENES survey sample, which consists of 382 targets encompassing 22357 optical and 20314 near-infrared spectra, to correct for telluric absorption. The corrected spectra are coadded to construct template spectra for each of our targets. This library of telluric-free, high signal-to-noise ratio, high-resolution (R>80000) templates comprises the most comprehensive collection of spectral M-dwarf data available to date, both in terms of quantity and quality, and is available at the project website (http://carmenes.cab.inta-csic.es). △ Less

Submitted 23 October, 2023; originally announced October 2023.

Comments: 31 pages, 24 figures, 3 tables, accepted for publication by A&A

Journal ref: A&A 680, A73 (2023)

arXiv:2310.13715 [pdf]

doi 10.1007/s10462-024-10973-2

Digital Deception: Generative Artificial Intelligence in Social Engineering and Phishing

Authors: Marc Schmitt, Ivan Flechais

Abstract: The advancement of Artificial Intelligence (AI) and Machine Learning (ML) has profound implications for both the utility and security of our digital interactions. This paper investigates the transformative role of Generative AI in Social Engineering (SE) attacks. We conduct a systematic review of social engineering and AI capabilities and use a theory of social engineering to identify three pillar… ▽ More The advancement of Artificial Intelligence (AI) and Machine Learning (ML) has profound implications for both the utility and security of our digital interactions. This paper investigates the transformative role of Generative AI in Social Engineering (SE) attacks. We conduct a systematic review of social engineering and AI capabilities and use a theory of social engineering to identify three pillars where Generative AI amplifies the impact of SE attacks: Realistic Content Creation, Advanced Targeting and Personalization, and Automated Attack Infrastructure. We integrate these elements into a conceptual model designed to investigate the complex nature of AI-driven SE attacks - the Generative AI Social Engineering Framework. We further explore human implications and potential countermeasures to mitigate these risks. Our study aims to foster a deeper understanding of the risks, human implications, and countermeasures associated with this emerging paradigm, thereby contributing to a more secure and trustworthy human-computer interaction. △ Less

Submitted 15 October, 2023; originally announced October 2023.

Comments: Submitted to CHI 2024

Journal ref: Artificial Intelligence Review, 2024

Showing 1–50 of 500 results for author: Schmitt, M