-
An anisotropic Alt-Caffarelli problem of higher order
Authors:
Marius Müller
Abstract:
We study a higher order version of the Alt-Caffarelli problem in two dimensions, where the Dirichlet energy is replaced by an anisotropic bending energy. This extends a previous study of the isotropic case in [41]. It turns out that smooth anisotropies do not affect the optimal $C^{2,1}$-regularity of minimizers. The proof requires an anisotropic version of an estimate by Frehse for the fundamental solution of the bilaplacian. This generalization paves the way for further studies of various free boundary problems of higher order.
Submitted 27 May, 2025;
originally announced May 2025.
-
Atomic-scale ultrafast dynamics of local charge order in a THz-induced metastable state of 1T-TaS2
Authors:
Luis E. Parra López,
Alkisti Vaitsi,
Vivien Sleziona,
Fabian Schulz,
Martin Wolf,
Melanie Müller
Abstract:
Light-induced control of quantum materials enables manipulation of electronic and structural phases on ultrafast timescales. Probing their atomic-scale dynamics is essential to understand the role of defects and domain boundaries, but conventional time-resolved techniques lack the required spatial resolution. Here, we use terahertz (THz) scanning tunneling microscopy to investigate a THz-light-induced metastable state near a defect in 1T-TaS2, and follow its photoinduced dynamics in real space and time. THz excitation induces quasi-stationary changes in the insulating gap on angstrom scales, which we associate with interlayer stacking changes. Simultaneously, THz-lightwave-driven tunneling provides access to ultrafast dynamics of the metastable state, revealing 2.5 THz oscillations of the charge density wave amplitude mode and a 1.3 THz mode attributed to an interlayer shear vibration emerging near the defect. Our results demonstrate the dual role of tip-enhanced THz fields in driving metastability and ultrafast tunneling, opening new avenues for ultrafast atomic-scale control of quantum materials.
Submitted 26 May, 2025;
originally announced May 2025.
-
Context-Driven Dynamic Pruning for Large Speech Foundation Models
Authors:
Masao Someki,
Shikhar Bharadwaj,
Atharva Anand Joshi,
Chyi-Jiunn Lin,
Jinchuan Tian,
Jee-weon Jung,
Markus Müller,
Nathan Susanj,
Jing Liu,
Shinji Watanabe
Abstract:
Speech foundation models achieve strong generalization across languages and acoustic conditions, but require significant computational resources for inference. In the context of speech foundation models, pruning techniques have been studied that dynamically optimize model structures based on the target audio leveraging external context. In this work, we extend this line of research and propose context-driven dynamic pruning, a technique that optimizes the model computation depending on the context between different input frames and additional context during inference. We employ the Open Whisper-style Speech Model (OWSM) and incorporate speaker embeddings, acoustic event embeddings, and language information as additional context. By incorporating the speaker embedding, our method achieves a reduction of 56.7 GFLOPs while improving BLEU scores by a relative 25.7% compared to the fully fine-tuned OWSM model.
Submitted 24 May, 2025;
originally announced May 2025.
-
Mahalanobis++: Improving OOD Detection via Feature Normalization
Authors:
Maximilian Mueller,
Matthias Hein
Abstract:
Detecting out-of-distribution (OOD) examples is an important task for deploying reliable machine learning models in safety-critical applications. While post-hoc methods based on the Mahalanobis distance applied to pre-logit features are among the most effective for ImageNet-scale OOD detection, their performance varies significantly across models. We connect this inconsistency to strong variations in feature norms, indicating severe violations of the Gaussian assumption underlying the Mahalanobis distance estimation. We show that simple $\ell_2$-normalization of the features mitigates this problem effectively, aligning better with the premise of normally distributed data with shared covariance matrix. Extensive experiments on 44 models across diverse architectures and pretraining schemes show that $\ell_2$-normalization improves the conventional Mahalanobis distance-based approaches significantly and consistently, and outperforms other recently proposed OOD detection methods.
Submitted 23 May, 2025;
originally announced May 2025.
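The idea summarized in this abstract, $\ell_2$-normalizing features before fitting a Mahalanobis detector with class means and a shared covariance, can be illustrated roughly as follows. This is a minimal sketch under stated assumptions and not the authors' implementation: the function names `l2_normalize`, `fit_mahalanobis`, and `ood_score` are hypothetical, the features are assumed to be plain NumPy arrays, and a pseudo-inverse is used for the covariance purely for numerical convenience.

```python
import numpy as np


def l2_normalize(feats, eps=1e-12):
    """Scale each feature vector (row) to unit Euclidean norm."""
    norms = np.linalg.norm(feats, axis=1, keepdims=True)
    return feats / np.maximum(norms, eps)


def fit_mahalanobis(feats, labels):
    """Estimate per-class means and the inverse of a shared covariance."""
    classes = np.unique(labels)  # sorted unique class labels
    means = np.stack([feats[labels == c].mean(axis=0) for c in classes])
    # Center every sample by the mean of its own class.
    centered = feats - means[np.searchsorted(classes, labels)]
    cov = centered.T @ centered / len(feats)
    # Pseudo-inverse: an assumption made here for robustness to
    # near-singular covariances, not a detail taken from the abstract.
    precision = np.linalg.pinv(cov)
    return means, precision


def ood_score(feats, means, precision):
    """Squared Mahalanobis distance to the closest class mean.

    Larger values suggest the sample is further from the training data.
    """
    diff = feats[:, None, :] - means[None, :, :]  # shape (n, k, d)
    d2 = np.einsum("nkd,de,nke->nk", diff, precision, diff)
    return d2.min(axis=1)
```

In use, both the training features and the test features would be passed through `l2_normalize` before calling `fit_mahalanobis` and `ood_score`, so that differences in feature norm no longer influence the distance.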
-
Sufficient Conditions for Detectability of Approximately Discretized Nonlinear Systems
Authors:
Seth Siriya,
Julian D. Schiller,
Victor G. Lopez,
Matthias A. Müller
Abstract:
In many sampled-data applications, observers are designed based on approximately discretized models of continuous-time systems, where usually only the discretized system is analyzed in terms of its detectability. In this paper, we show that if the continuous-time system satisfies certain linear matrix inequality (LMI) conditions, and the sampling period of the discretization scheme is sufficiently small, then the whole family of discretized systems (parameterized by the sampling period) satisfies analogous discrete-time LMI conditions that imply detectability. Our results are applicable to general discretization schemes, as long as they produce approximate models whose linearizations are in some sense consistent with the linearizations of the continuous-time ones. We explicitly show that the Euler and second-order Runge-Kutta methods satisfy this condition. A batch-reactor system example is provided to highlight the usefulness of our results from a practical perspective.
Submitted 23 May, 2025;
originally announced May 2025.
-
Measurement-free quantum error correction optimized for biased noise
Authors:
Katharina Brechtelsbauer,
Friederike Butt,
David F. Locher,
Santiago Higuera Quintero,
Sebastian Weber,
Markus Müller,
Hans Peter Büchler
Abstract:
In this paper, we derive optimized measurement-free protocols for quantum error correction and the implementation of a universal gate set optimized for an error model that is noise biased. The noise bias is adapted for neutral atom platforms, where two- and multi-qubit gates are realized with Rydberg interactions and are thus expected to be the dominating source of noise. Careful design of the gates allows the noise model to be further reduced to Pauli-Z errors. In addition, the presented circuits are robust to arbitrary single-qubit gate errors, and we demonstrate that the break-even point can be significantly improved compared to fully fault-tolerant measurement-free schemes. The obtained logical qubits with their suppressed error rates on logical gate operations can then be used as building blocks in a first step of error correction in order to push the effective error rates below the threshold of a fully fault-tolerant and scalable quantum error correction scheme.
Submitted 21 May, 2025;
originally announced May 2025.
-
Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks
Authors:
Nick Kocher,
Christian Wassermann,
Leona Hennig,
Jonas Seng,
Holger Hoos,
Kristian Kersting,
Marius Lindauer,
Matthias Müller
Abstract:
Neural Architecture Search (NAS) accelerates progress in deep learning through systematic refinement of model architectures. The downside is increasingly large energy consumption during the search process. Surrogate-based benchmarking mitigates the cost of full training by querying a pre-trained surrogate to obtain an estimate for the quality of the model. Specifically, energy-aware benchmarking aims to make it possible for NAS to favourably trade off model energy consumption against accuracy. Towards this end, we propose three design principles for such energy-aware benchmarks: (i) reliable power measurements, (ii) a wide range of GPU usage, and (iii) holistic cost reporting. We analyse EA-HAS-Bench based on these principles and find that the choice of GPU measurement API has a large impact on the quality of results. Using the Nvidia System Management Interface (SMI) on top of its underlying library influences the sampling rate during the initial data collection, returning faulty low-power estimations. This results in poor correlation with accurate measurements obtained from an external power meter. With this study, we bring to attention several key considerations when performing energy-aware surrogate-based benchmarking and derive first guidelines that can help design novel benchmarks. We show a narrow usage range of the four GPUs attached to our device, ranging from 146 W to 305 W in a single-GPU setting, and narrowing down even further when using all four GPUs. To improve holistic energy reporting, we propose calibration experiments over assumptions made in popular tools, such as Code Carbon, thus achieving reductions in the maximum inaccuracy from 10.3 % to 8.9 % without and to 6.6 % with prior estimation of the expected load on the device.
Submitted 21 May, 2025;
originally announced May 2025.
-
Duawlfin: A Drone with Unified Actuation for Wheeled Locomotion and Flight Operation
Authors:
Jerry Tang,
Ruiqi Zhang,
Kaan Beyduz,
Yiwei Jiang,
Cody Wiebe,
Haoyu Zhang,
Osaruese Asoro,
Mark W. Mueller
Abstract:
This paper presents Duawlfin, a drone with unified actuation for wheeled locomotion and flight operation that achieves efficient, bidirectional ground mobility. Unlike existing hybrid designs, Duawlfin eliminates the need for additional actuators or propeller-driven ground propulsion by leveraging only its standard quadrotor motors and introducing a differential drivetrain with one-way bearings. This innovation simplifies the mechanical system, significantly reduces energy usage, and prevents the disturbance caused by propellers spinning near the ground, such as dust interference with sensors. Besides, the one-way bearings minimize the power transfer from motors to propellers in the ground mode, which enables the vehicle to operate safely near humans. We provide a detailed mechanical design, present control strategies for rapid and smooth mode transitions, and validate the concept through extensive experimental testing. Flight-mode tests confirm stable aerial performance comparable to conventional quadcopters, while ground-mode experiments demonstrate efficient slope climbing (up to 30°) and agile turning maneuvers approaching 1g lateral acceleration. The seamless transitions between aerial and ground modes further underscore the practicality and effectiveness of our approach for applications like urban logistics and indoor navigation. All the materials including 3-D model files, demonstration video and other assets are open-sourced at https://sites.google.com/view/Duawlfin.
Submitted 19 May, 2025;
originally announced May 2025.
-
Boxi: Design Decisions in the Context of Algorithmic Performance for Robotics
Authors:
Jonas Frey,
Turcan Tuna,
Lanke Frank Tarimo Fu,
Cedric Weibel,
Katharine Patterson,
Benjamin Krummenacher,
Matthias Müller,
Julian Nubert,
Maurice Fallon,
Cesar Cadena,
Marco Hutter
Abstract:
Achieving robust autonomy in mobile robots operating in complex and unstructured environments requires a multimodal sensor suite capable of capturing diverse and complementary information. However, designing such a sensor suite involves multiple critical design decisions, such as sensor selection, component placement, thermal and power limitations, compute requirements, networking, synchronization, and calibration. While the importance of these key aspects is widely recognized, they are often overlooked in academia or retained as proprietary knowledge within large corporations. To improve this situation, we present Boxi, a tightly integrated sensor payload that enables robust autonomy of robots in the wild. This paper discusses the impact of payload design decisions made to optimize algorithmic performance for downstream tasks, specifically focusing on state estimation and mapping. Boxi is equipped with a variety of sensors: two LiDARs, 10 RGB cameras including high-dynamic range, global shutter, and rolling shutter models, an RGB-D camera, 7 inertial measurement units (IMUs) of varying precision, and a dual antenna RTK GNSS system. Our analysis shows that time synchronization, calibration, and sensor modality have a crucial impact on the state estimation performance. We frame this analysis in the context of cost considerations and environment-specific challenges. We also present a mobile sensor suite `cookbook` to serve as a comprehensive guideline, highlighting generalizable key design considerations and lessons learned during the development of Boxi. Finally, we demonstrate the versatility of Boxi being used in a variety of applications in real-world scenarios, contributing to robust autonomy. More details and code: https://github.com/leggedrobotics/grand_tour_box
Submitted 25 April, 2025;
originally announced April 2025.
-
Ge$_{1-x}$Si$_{x}$ single crystals for Ge hole spin qubit integration
Authors:
Andreas Fuhrberg,
Pia M. Düring,
Olena Fedchenko,
Olena Tkach,
Yaryna Lytvynenko,
Kevin Gradwohl,
Sergii Chernov,
Andrei Gloskovskii,
Christoph Schlueter,
Gerd Schönhense,
Hans-Joachim Elmers,
Martina Müller
Abstract:
Spin qubits are fundamental building blocks of modern quantum computing devices. The path of Ge-based hole-spin qubits has several advantages over Si-based electron-spin systems, such as the absence of valley band degeneracy, the possibility of efficient field control due to large spin-orbit coupling, and smaller effective masses. Among the possible Ge qubit devices, Ge/GeSi planar heterostructures have proven to be favourable for upscaling and fabrication. The Si concentration of the straining GeSi buffer serves as an important tuning parameter for the electronic structure of Ge/GeSi qubits. A particularly low Si concentration of x = 0.15 of the Ge$_{0.85}$Si$_{0.15}$ crystal should enable minimal lattice strain for spin qubit heterostructures, which is difficult to stabilize as a random alloy. We present a synchrotron-based study to investigate the chemical composition, valence band electronic structure and local atomic structure of a Ge$_{0.85}$Si$_{0.15}$ single crystal using the advanced combination of hard X-ray photoelectron spectroscopy (HAXPES), hard X-ray momentum microscopy (HarMoMic) and X-ray photoelectron diffraction (XPD). We found that the Ge$_{0.85}$Si$_{0.15}$ crystal has an individual, uniform valence band structure, with no signs of phase separation. The shapes of the valence bands resemble those of pure Ge, as do the low effective masses. XPD experiments and Bloch wave calculations show the Si atoms located at Ge lattice sites within the crystal, forming a random alloy. This high chemical, electronic and structural quality of Ge$_{0.85}$Si$_{0.15}$ single-crystal substrates is of crucial importance for their implementation to enable long spin lifetimes in Ge-based hole-spin qubits. The results emphasise the power of combined X-ray spectromicroscopy techniques, which provide key insights into the qubit building blocks that form the basis of quantum technologies.
Submitted 22 April, 2025;
originally announced April 2025.
-
KOMPASS: the new cold neutron triple-axis-spectrometer specialized for polarization analysis
Authors:
D. Gorkov,
M. Müller,
G. Waldherr,
A. Grünwald,
J. Stein,
S. Giemsa,
A. C. Komarek,
P. Böni,
M. Braden
Abstract:
KOMPASS is a polarized triple-axis cold neutron spectrometer recently installed at the FRM II neutron source. The instrument is designed to operate exclusively with polarized neutrons and is specialized in longitudinal polarization analysis using Helmholtz coils and spherical zero-field neutron polarimetry using a Cryopad device. The advanced guide system polarizes and focuses flexibly in the scattering plane. A first, fixed parabolic focusing part contains a series of three polarizing supermirror V-cavities that produce a highly polarized beam. By exchanging straight and parabolic front-end guide sections, the resolution of the instrument can be optimized to meet experimental requirements. Large, double-focusing monochromator and analyzer units with pyrolytic graphite crystals enable efficient and adaptive energy selection, and a compact supermirror cavity analyzes the final neutron polarization. Alternatively, or in combination with this cavity, a Heusler polarization analyzer can be used. KOMPASS offers full- and half-polarized configurations with or without secondary energy analysis and provides a wide range of polarization options. Therefore, KOMPASS is well suited for various studies of static and dynamic magnetic correlations with energy transfers on the neutron energy loss side up to ~12 meV.
Submitted 23 April, 2025; v1 submitted 22 April, 2025;
originally announced April 2025.
-
Lattice surgery-based logical state teleportation via noisy links
Authors:
Áron Márton,
Luis Colmenarez,
Lukas Bödeker,
Markus Müller
Abstract:
For planar architectures, surface code-based quantum error correction is one of the most promising approaches to fault-tolerant quantum computation. This is partially due to the variety of fault-tolerant logical protocols that can be implemented in two dimensions using local operations. One such protocol is the lattice surgery-based logical state teleportation, which transfers a logical quantum state from an initial location on a quantum chip to a target location through a linking region of qubits. This protocol serves as a basis for higher-level routines, such as the entangling CNOT gate or magic state injection. In this work we investigate the correctability phase diagram of this protocol for distinct error rates inside the surface code patches and within the linking region. We adopt techniques from statistical physics to describe the numerically observed crossover regime between correctable and uncorrectable quantum error correction phases, where the correctability depends on the separation between the initial and target locations. We find that inside the crossover regime the correctability-threshold lines decay as a power law with increasing separation, which we explain accurately using a finite-size scaling analysis. Our results indicate that the logical state teleportation protocol can tolerate much higher noise rates in the linking region compared to the bulk of the surface code patches, provided the separation between the positions is relatively small.
Submitted 22 April, 2025;
originally announced April 2025.
-
Magnecko: Design and Control of a Quadrupedal Magnetic Climbing Robot
Authors:
Stefan Leuthard,
Timo Eugster,
Nicolas Faesch,
Riccardo Feingold,
Connor Flynn,
Michael Fritsche,
Nicolas Hürlimann,
Elena Morbach,
Fabian Tischhauser,
Matthias Müller,
Markus Montenegro,
Valerio Schelbert,
Jia-Ruei Chiu,
Philip Arm,
Marco Hutter
Abstract:
Climbing robots hold significant promise for applications such as industrial inspection and maintenance, particularly in hazardous or hard-to-reach environments. This paper describes the quadrupedal climbing robot Magnecko, developed with the major goal of providing a research platform for legged climbing locomotion. With its 12 actuated degrees of freedom arranged in an insect-style joint configuration, Magnecko's high manipulability and high range of motion allow it to handle challenging environments like overcoming concave 90-degree corners. A model predictive controller enables Magnecko to crawl on the ground, on horizontal overhangs, and on vertical walls. Thanks to the custom actuators and the electro-permanent magnets that are used for adhesion on ferrous surfaces, the system is powerful enough to carry additional payloads of at least 65 percent of its own weight in all orientations. The Magnecko platform serves as a foundation for climbing locomotion in complex three-dimensional environments.
Submitted 18 April, 2025;
originally announced April 2025.
-
Extended source fringe flats for the JWST MIRI Medium Resolution Spectrometer
Authors:
N. Crouzet,
M. Mueller,
B. Sargent,
F. Lahuis,
D. Kester,
G. Yang,
I. Argyriou,
D. Gasman,
P. J. Kavanagh,
A. Labiano,
K. Larson,
D. R. Law,
J. Álvarez-Márquez,
B. R. Brandl,
A. Glasse,
P. Patapis,
P. R. Roelfsema,
Ł. Tychoniec,
E. F. van Dishoeck,
G. S. Wright
Abstract:
The detectors of the JWST Mid-Infrared Instrument (MIRI) Medium Resolution Spectrometer (MRS) form low-finesse resonating cavities that cause periodic count rate modulations (fringes) with peak amplitudes of up to 15% for sources external to MIRI. To detect weak features on a strong continuum and reliably measure line fluxes and line-flux ratios, fringe correction is crucial. This paper describes the first of two steps implemented in the JWST Science Calibration Pipeline, which is the division by a static fringe flat that removes the bulk of the fringes for extended sources. Fringe flats were derived by fitting a numerical model to observations of spatially extended sources. The model includes fringes that originate from two resonating cavities in the detector substrate (a third fringe component that originates from the dichroic filters is not included). The model, numerical implementation, and resulting fringe flats are described, and the efficiency of the calibration was evaluated for sources of various spatial extents on the detector. Flight fringe flats are obtained from observations of the planetary nebula NGC 7027. The two fringe components are well recovered and fitted by the model. The derived parameters are used to build a fringe flat for each MRS spectral band, except for 1A and 1B due to the low signal-to-noise ratio of NGC 7027 in these bands. When applied to extended sources, fringe amplitudes are reduced to the sub-percent level on individual spaxels. For point sources, they are reduced to amplitudes of 1 to 5% considering individual spaxels and a single dither position, and decrease to the 1 to 2% level after two-dimensional residual fringe correction. The fringe flats derived from this work are the reference files currently in use by the JWST Science Calibration Pipeline. They provide an efficient calibration for extended sources, and are less efficient for point sources.
Submitted 15 April, 2025;
originally announced April 2025.
-
A nongraphical obstacle problem for elastic curves
Authors:
Marius Müller,
Kensuke Yoshizawa
Abstract:
We study an obstacle problem for the length-penalized elastic bending energy for open planar curves pinned at the boundary. We first consider the case without length penalization and investigate the role of global minimizers among graph curves in our minimization problem for planar curves. In addition, for large values of the length-penalization parameter $\lambda>0$, we expose an explicit threshold parameter above which minimizers touch the obstacle, regardless of its shape. On the contrary, for small values of $\lambda>0$ we show that the minimizers do not touch the obstacle, and they are given by an explicit elastica.
Submitted 8 April, 2025;
originally announced April 2025.
-
The Sunrise Ultraviolet Spectropolarimeter and Imager: Instrument description
Authors:
A. Feller,
A. Gandorfer,
B. Grauf,
J. Hölken,
F. A. Iglesias,
A. Korpi-Lagg,
T. L. Riethmüller,
J. Staub,
G. Fernandez-Rico,
J. S. Castellanos Durán,
S. K. Solanki,
H. N. Smitha,
K. Sant,
P. Barthol,
M. Bayon Laguna,
M. Bergmann,
J. Bischoff,
J. Bochmann,
S. Bruns,
W. Deutsch,
M. Eberhardt,
R. Enge,
S. Goodyear,
K. Heerlein,
J. Heinrichs, et al. (24 additional authors not shown)
Abstract:
The third science flight of the balloon-borne solar observatory Sunrise carries three entirely new post-focus science instruments with spectropolarimetric capabilities, concurrently covering an extended spectral range from the near ultraviolet to the near infrared. Sampling a larger height range, from the low photosphere to the chromosphere, with the sub-arcsecond resolution provided by the 1-m Sunrise telescope, is key in understanding critical small-scale phenomena which energetically couple different layers of the solar atmosphere. The Sunrise Ultraviolet Spectropolarimeter and Imager (SUSI) operates between 309 nm and 417 nm. A key feature of SUSI is its capability to record up to several hundred spectral lines simultaneously without the harmful effects of the Earth's atmosphere. The rich SUSI spectra can be exploited in terms of many-line inversions. Another important innovation of the instrument is the synchronized 2D context imaging, which allows the spectrograph scans to be numerically corrected for residual optical aberrations. In this work we describe the main design aspects of SUSI, the instrument characterization and testing, and finally its operation, expected performance and data products.
Submitted 7 April, 2025;
originally announced April 2025.
-
The Cesàro Value Iteration
Authors:
Jonas Mair,
Lukas Schwenkel,
Matthias A. Müller,
Frank Allgöwer
Abstract:
In this paper, we address the problem of undiscounted infinite-horizon optimal control for deterministic systems where the classic value iteration does not converge. For such systems, we propose to use the Cesàro mean to define the infinite-horizon optimal control problem and the corresponding infinite-horizon value function. Moreover, for this value function, we introduce the Cesàro value iteration and prove its convergence for the special case of systems with periodic optimal operating behavior.
Submitted 23 May, 2025; v1 submitted 7 April, 2025;
originally announced April 2025.
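The Cesàro mean named in the abstract above can be illustrated on a toy sequence. The following is a minimal sketch, not the authors' algorithm: the function name `cesaro_means` and the alternating sequence are illustrative assumptions, chosen to mimic iterates that oscillate with period two and therefore have no ordinary limit.

```python
def cesaro_means(seq):
    """Return the running averages (Cesàro means) of a sequence."""
    means = []
    total = 0.0
    for k, value in enumerate(seq, start=1):
        total += value
        means.append(total / k)
    return means


# Hypothetical oscillating sequence 1, -1, 1, -1, ... (no ordinary limit).
oscillating = [(-1) ** k for k in range(1000)]
means = cesaro_means(oscillating)

# The raw sequence keeps jumping between 1 and -1,
# while its running average settles towards 0.
print(oscillating[-2:], means[-1])
```

The running average damps the periodic oscillation, which is the basic reason an averaged quantity can converge where the plain iteration does not.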
-
Distributed Model Predictive Control for Dynamic Cooperation of Multi-Agent Systems
Authors:
Matthias Köhler,
Matthias A. Müller,
Frank Allgöwer
Abstract:
We propose a distributed model predictive control (MPC) framework for coordinating heterogeneous, nonlinear multi-agent systems under individual and coupling constraints. The cooperative task is encoded as a shared objective function minimized collectively by the agents. Each agent optimizes an artificial reference as an intermediate step towards the cooperative objective, along with a control input to track it. We establish recursive feasibility, asymptotic stability, and transient performance bounds under suitable assumptions. The solution to the cooperative task is not predetermined but emerges from the optimized interactions of the agents. We demonstrate the framework on numerical examples inspired by satellite constellation control, collision-free narrow-passage traversal, and coordinated quadrotor flight.
Submitted 23 April, 2025; v1 submitted 31 March, 2025;
originally announced April 2025.
-
Output-feedback model predictive control under dynamic uncertainties using integral quadratic constraints
Authors:
Lukas Schwenkel,
Johannes Köhler,
Matthias A. Müller,
Frank Allgöwer
Abstract:
In this work, we propose an output-feedback tube-based model predictive control (MPC) scheme for linear systems under dynamic uncertainties that are described via integral quadratic constraints (IQC). By leveraging IQCs, a large class of nonlinear and dynamic uncertainties can be addressed. We leverage recent IQC synthesis tools to design a dynamic controller and an observer that are robust to these uncertainties and minimize the size of the resulting constraint tightening in the MPC. Thereby, we show that the robust estimation problem using IQCs with peak-to-peak performance can be convexified. We guarantee recursive feasibility, robust constraint satisfaction, and input-to-state stability of the resulting MPC scheme.
Submitted 31 March, 2025;
originally announced April 2025.
-
Simple general magnification of circuit lower bounds
Authors:
Albert Atserias,
Moritz Müller
Abstract:
We construct so-called distinguishers, sparse matrices that retain some properties of error correcting codes. They provide a technically and conceptually simple approach to magnification. We generalize and strengthen known general (not problem specific) magnification results and in particular achieve magnification thresholds below known lower bounds. For example, we show that fixed polynomial formula size lower bounds for NP are implied by slightly superlinear formula size lower bounds for approximating any sufficiently sparse problem in NP. We also show that the thresholds achieved are sharp. Additionally, our approach yields a uniform magnification result for the minimum circuit size problem. This seems to sidestep the localization barrier.
Submitted 31 March, 2025;
originally announced March 2025.
-
Multi-objective robust controller synthesis with integral quadratic constraints in discrete-time
Authors:
Lukas Schwenkel,
Johannes Köhler,
Matthias A. Müller,
Carsten W. Scherer,
Frank Allgöwer
Abstract:
This article presents a novel framework for the robust controller synthesis problem in discrete-time systems using dynamic Integral Quadratic Constraints (IQCs). We present an algorithm to minimize closed-loop performance measures such as the $\mathcal H_\infty$-norm, the energy-to-peak gain, the peak-to-peak gain, or a multi-objective mix thereof. While IQCs provide a powerful tool for modeling structured uncertainties and nonlinearities, existing synthesis methods are limited to the $\mathcal H_\infty$-norm, continuous-time systems, or special system structures. By minimizing the energy-to-peak and peak-to-peak gain, the proposed synthesis can be utilized to bound the peak of the output, which is crucial in many applications requiring robust constraint satisfaction, input-to-state stability, reachability analysis, or other pointwise-in-time bounds. Numerical examples demonstrate the robustness and performance of the controllers synthesized with the proposed algorithm.
Submitted 28 March, 2025;
originally announced March 2025.
-
Theory of polarization-dependent phonon pumping in ferromagnetic/non-magnetic bilayers
Authors:
Mikhail Cherkasskii,
Fabian Engelhardt,
Manuel Müller,
Johannes Weber,
Matthias Althammer,
Sebastian T. B. Goennenwein,
Hans Huebl,
Silvia Viola Kusminskiy
Abstract:
We develop a theoretical model for polarization-selective phonon pumping induced by magnon-phonon coupling in a ferromagnetic/non-magnetic acoustic bilayer structure, focusing on the effects arising from a misalignment between the magnetic and crystallographic symmetry axes. Our model considers the coupled equations of motion describing uniform magnetization dynamics (the Kittel mode) and elastic waves in both layers, incorporating phonon pumping and boundary conditions at the interface. We show that even small misalignments lift the degeneracy of transverse shear elastic modes, resulting in phononic birefringence characterized by distinct propagation velocities for linearly polarized modes. Furthermore, our analysis reveals that magnon-phonon hybridization gives magnetic-field-dependent properties to otherwise non-magnetic phonons. We show that the polarization transfer between linearly polarized phonons and the circularly polarized Kittel mode can be tuned with an external magnetic field. Our theoretical results quantitatively reproduce recent experimental findings [1].
Submitted 28 March, 2025;
originally announced March 2025.
-
Maximum number of edge colorings avoiding rainbow copies of $K_4$
Authors:
Hiêp Hàn,
Carlos Hoppen,
Nicolas Moro Müller,
Dionatan Ricardo Schmidt
Abstract:
In this paper we show that for $r\geq 12$ and any sufficiently large $n$-vertex graph $G$ the number of $r$-edge-colorings of $G$ with no rainbow $K_4$ is at most $r^{ex(n,K_4)}$, where $ex(n,K_4)$ denotes the Turán number of $K_4$. Moreover, $G$ attains equality if and only if it is the Turán graph $T_3(n)$.
The bound on the number of colors $r\geq 12$ is best possible. It improves upon a result of H. Lefmann, D.A. Nolibos, and the second author who showed the same result for $r \geq 5434$ and it confirms a conjecture by Gupta, Pehova, Powierski and Staden.
Submitted 30 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
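The bound $r^{ex(n,K_4)}$ in the abstract above can be evaluated directly, since by Turán's theorem $ex(n,K_4)$ equals the number of edges of the Turán graph $T_3(n)$. The following is a minimal sketch; the helper names `turan_edges` and `coloring_bound` are illustrative assumptions, and the code only evaluates the expression (the theorem itself is stated for sufficiently large $n$).

```python
def turan_edges(n: int, parts: int) -> int:
    """Number of edges of the Turán graph T_parts(n).

    The n vertices are split into `parts` classes of sizes as equal as
    possible, and two vertices are adjacent iff they lie in different classes.
    """
    q, rem = divmod(n, parts)
    sizes = [q + 1] * rem + [q] * (parts - rem)
    # All ordered pairs minus ordered pairs inside a class, halved.
    return (n * n - sum(s * s for s in sizes)) // 2


def coloring_bound(n: int, r: int) -> int:
    """Evaluate r ** ex(n, K_4), using ex(n, K_4) = |E(T_3(n))|."""
    return r ** turan_edges(n, 3)


print(turan_edges(7, 3), coloring_bound(6, 12))
```

Every $r$-edge-coloring of $T_3(n)$ is free of rainbow $K_4$ because $T_3(n)$ contains no $K_4$ at all, so the Turán graph trivially attains the count $r^{ex(n,K_4)}$.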
-
Insights into the explainability of Lasso-based DeePC for nonlinear systems
Authors:
Gianluca Giacomelli,
Simone Formentin,
Victor G. Lopez,
Matthias A. Müller,
Valentina Breschi
Abstract:
Data-enabled Predictive Control (DeePC) has recently gained the spotlight as an easy-to-use control technique that allows for constraint handling while relying on raw data only. Initially proposed for linear time-invariant systems, several DeePC extensions are now available to cope with nonlinear systems. Nonetheless, these solutions mainly focus on ensuring the controller's effectiveness, overlooking the explainability of the final result. As a step toward explaining the outcome of DeePC for the control of nonlinear systems, in this paper, we focus on analyzing the earliest and simplest DeePC approach proposed to cope with nonlinearities in the controlled system, using a Lasso regularization. Our theoretical analysis highlights that the decisions undertaken by DeePC with Lasso regularization are unexplainable, as control actions are determined by data incoherent with the system's local behavior. This result is true even when the available input/output samples are grouped according to the different operating conditions explored during data collection. Our numerical study confirms these findings, highlighting the benefits of data grouping in terms of performance while showing that explainability remains a challenge in control design via DeePC.
Submitted 11 April, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
Authors:
Jakob Abeßer,
Simon Schwär,
Meinard Müller
Abstract:
This study examines pitch contours as a unifying semantic construct prevalent across various audio domains including music, speech, bioacoustics, and everyday sounds. Analyzing pitch contours offers insights into the universal role of pitch in the perceptual processing of audio signals and contributes to a deeper understanding of auditory mechanisms in both humans and animals. Conventional pitch-tracking methods, while optimized for music and speech, face challenges in handling much broader frequency ranges and more rapid pitch variations found in other audio domains. This study introduces a vision-based approach to pitch contour analysis that eliminates the need for explicit pitch-tracking. The approach uses a convolutional neural network, pre-trained for object detection in natural images and fine-tuned with a dataset of synthetically generated pitch contours, to extract key contour parameters from the time-frequency representation of short audio segments. A diverse set of eight downstream tasks from four audio domains were selected to provide a challenging evaluation scenario for cross-domain pitch contour analysis. The results show that the proposed method consistently surpasses traditional techniques based on pitch-tracking on a wide range of tasks. This suggests that the vision-based approach establishes a foundation for comparative studies of pitch contour characteristics across diverse audio domains.
Submitted 24 March, 2025;
originally announced March 2025.
-
Control Lyapunov Function Design via Configuration-Constrained Polyhedral Computing
Authors:
Boris Houska,
Matthias A. Müller,
Mario E. Villanueva
Abstract:
This paper proposes novel approaches for designing control Lyapunov functions (CLFs) for constrained linear systems. We leverage recent configuration-constrained polyhedral computing techniques to devise piecewise affine convex CLFs. Additionally, we generalize these methods to uncertain systems with both additive and multiplicative disturbances. The proposed design methods are capable of approximating the infinite horizon cost function of both nominal and min-max optimal control problems by solving a single, one-stage, convex optimization problem. As such, these methods find practical applications in explicit controller design as well as in determining terminal regions and cost functions for nominal and min-max model predictive control (MPC). Numerical examples illustrate the effectiveness of this approach.
Submitted 20 March, 2025;
originally announced March 2025.
-
Performance of the spin qubit shuttling architecture for a surface code implementation
Authors:
Berat Yenilen,
Arnau Sala,
Hendrik Bluhm,
Markus Müller,
Manuel Rispler
Abstract:
Qubit shuttling promises to advance some quantum computing platforms to the qubit register sizes needed for effective quantum error correction (QEC), but also introduces additional errors whose impact must be evaluated. The established method to investigate the performance of QEC codes in a realistic scenario is to employ a standard noise model known as circuit-level noise, where all quantum operations are modeled as noisy. In the present work, we take this noise model and single out the effect of shuttling errors by introducing them as an additional so-called error location. This hardware abstraction is motivated by the SpinBus architecture and allows a systematic numerical investigation to map out the resulting two-dimensional parameter space. To this end, we take the surface code and perform large-scale simulations, most notably extracting the threshold across said two-dimensional parameter space. We study two scenarios for shuttling errors, depolarization on the one hand and dephasing on the other hand. For a purely dephasing shuttling error, we find a threshold of several percent, provided that all other operations have a high fidelity. The qubit overhead needed to reach a logical error rate of $10^{-12}$ (known as the "teraquop" regime \cite{Gidney2021Jul}) increases only moderately for shuttling error rates up to about 1% per shuttling operation. The error rates at which practically useful error correction, i.e., operation well below threshold, is predicted to be possible are comfortably higher than what is expected to be achievable for spin qubits. Our results thus show that it is reasonable to expect shuttling operations to fall below threshold already at surprisingly large error rates. With realistic efforts in the near term, this offers positive prospects for spin qubit based quantum processors as a viable avenue for scalable fault-tolerant error-corrected quantum computing.
Submitted 28 March, 2025; v1 submitted 13 March, 2025;
originally announced March 2025.
-
Low-Rank Matrix Regression via Least-Angle Regression
Authors:
Mingzhou Yin,
Matthias A. Müller
Abstract:
Low-rank matrix regression is a fundamental problem in data science with various applications in systems and control. Nuclear norm regularization has been widely applied to solve this problem due to its convexity. However, it suffers from high computational complexity and the inability to directly specify the rank. This work introduces a novel framework for low-rank matrix regression that addresses both unstructured and Hankel matrices. By decomposing the low-rank matrix into rank-1 bases, the problem is reformulated as an infinite-dimensional sparse learning problem. The least-angle regression (LAR) algorithm is then employed to solve this problem efficiently. For unstructured matrices, a closed-form LAR solution is derived with equivalence to a normalized nuclear norm regularization problem. For Hankel matrices, a real-valued polynomial basis reformulation enables effective LAR implementation. Two numerical examples in network modeling and system realization demonstrate that the proposed approach significantly outperforms the nuclear norm method in terms of estimation accuracy and computational efficiency.
Submitted 13 March, 2025;
originally announced March 2025.
-
Geometric realizations of $ν$-associahedra via brick polyhedra
Authors:
Cesar Ceballos,
Matthias Müller
Abstract:
Brick polytopes constitute a remarkable family of polytopes associated to the spherical subword complexes of Knutson and Miller. They were introduced for finite Coxeter groups by Pilaud and Stump, who used them to produce geometric realizations of generalized associahedra arising from the theory of cluster algebras of finite types. In this paper, we present an application of the vast generalization of brick polyhedra for general subword complexes (not necessarily spherical) recently introduced by Jahn and Stump.
More precisely, we show that the $ν$-associahedron, a polytopal complex whose edge graph is the Hasse diagram of the $ν$-Tamari lattice introduced by Préville-Ratelle and Viennot, can be geometrically realized as the complex of bounded faces of the brick polyhedron of a well chosen subword complex. We also present a suitable projection to the appropriate dimension, which leads to an elegant vertex-coordinate description.
Submitted 13 March, 2025;
originally announced March 2025.
-
Validity in Design Science
Authors:
K. Larsen,
R. Lukyanenko,
Roland M. Mueller,
V. Storey,
J. Parsons,
D. Vandermeer,
D. Hovorka
Abstract:
Researchers must ensure that the claims about the knowledge produced by their work are valid. However, validity is neither well-understood nor consistently established in design science, which involves the development and evaluation of artifacts (models, methods, instantiations, and theories) to solve problems. As a result, it is challenging to demonstrate and communicate the validity of knowledge claims about artifacts. This paper defines validity in design science and derives the Design Science Validity Framework and a process model for applying it. The framework comprises three high-level claim and validity types (criterion, causal, and context) as well as validity subtypes. The framework guides researchers in integrating validity considerations into projects employing design science and contributes to the growing body of research on design science methodology. It also provides a systematic way to articulate and validate the knowledge claims of design science projects. We apply the framework to examples from existing research and then use it to demonstrate the validity of knowledge claims about the framework itself.
Submitted 12 March, 2025;
originally announced March 2025.
-
Contrasting $c$-axis and in-plane uniaxial stress effects on superconductivity and stripe order in La$_{1.885}$Ba$_{0.115}$CuO$_4$
Authors:
S. S. Islam,
V. Sazgari,
J. N. Graham,
O. Gerguri,
P. Král,
I. Maetsu,
H. Gopakumar,
M. Müller,
R. Sarkar,
V. Grinenko,
G. Simutis,
T. Shiroka,
R. Khasanov,
M. Janoschek,
J. M. Tranquada,
H. H. Klauss,
T. Adachi,
H. Luetkens,
Z. Guguchia
Abstract:
The cuprate superconductor La$_{2-x}$Ba$_x$CuO$_4$ (LBCO) near $x=0.125$ is a striking example of intertwined electronic orders, where 3D superconductivity is anomalously suppressed, allowing spin and charge stripe order to develop, in a manner consistent with the emergence of a pair-density-wave (PDW) state. Understanding this interplay remains a key challenge in cuprates, highlighting the necessity of external tuning for deeper insight. While in-plane (within the CuO plane) uniaxial stress enhances superconductivity and suppresses stripe order, the effects of $c$-axis compression (perpendicular to the CuO plane) remain largely unexplored. Here, we use muon spin rotation ($μ$SR) and AC susceptibility with an in situ piezoelectric stress device to investigate the spin-stripe order and superconductivity in LBCO-0.115 under $c$-axis compression. The measurements reveal a gradual suppression of the superconducting transition temperature ($T_{\rm c}$) with increasing $c$-axis stress, in stark contrast to the strong enhancement observed under in-plane stress. We further show that while in-plane stress rapidly reduces both the magnetic volume fraction ($V_{\rm m}$) and the spin-stripe ordering temperature ($T_{\rm so}$), $c$-axis compression has no effect, with $V_{\rm m}$ and $T_{\rm so}$ exhibiting an almost unchanged behavior up to the highest applied stress of 0.21 GPa. These findings demonstrate a strong anisotropy in stress response, underscoring the critical role of crystallographic anisotropy in governing competing electronic phases in LBCO.
Submitted 12 March, 2025;
originally announced March 2025.
-
Ferroelectricity of Wurtzite Al$_{1-x}$Hf$_{x}$N Heterovalent Alloys
Authors:
Nate S. P. Bernstein,
Daniel Drury,
Cheng-Wei Lee,
Tatau Shimada,
Yuki Sakai,
Oliver Rehm,
Lutz Baumgarten,
Martina Müller,
Prashun Gorai,
Yoshiki Iwazaki,
Glen R. Fox,
Keisuke Yazawa,
Brendan Hanrahan,
Geoff L. Brennecka
Abstract:
Thin films of aluminum hafnium nitride (Al$_{1-x}$Hf$_{x}$N) were synthesized via reactive magnetron sputtering for Hf contents up to $x$ = 0.13. X-ray diffraction showed a single $c$-axis oriented wurtzite phase for all films. Hard X-ray photoelectron spectroscopy demonstrated homogeneous Al:Hf distribution through the thin films and confirmed their insulating character. A collection of complementary tests showed unambiguous polarization inversion, and thus ferroelectricity, in multiple samples. Current density vs. electric field hysteresis measurements showed distinct ferroelectric switching current peaks, the piezoelectric coefficient d$_{33,f,meas}$ measured using a double beam laser interferometer (DBLI) showed a reversal in sign with similar magnitude, and anisotropic wet etching confirmed field-induced polarization inversion. This demonstrates the possibility of using tetravalent, and not just trivalent, alloying elements to enable ferroelectricity in AlN-based thin films, highlighting the compositional flexibility of ferroelectricity in wurtzites and greatly expanding the chemistries that can be considered for future devices.
Submitted 11 March, 2025;
originally announced March 2025.
-
Automated Benchmark Generation for Repository-Level Coding Tasks
Authors:
Konstantinos Vergopoulos,
Mark Niklas Müller,
Martin Vechev
Abstract:
Code Agent development is an extremely active research area, where a reliable performance metric is critical for tracking progress and guiding new developments. This demand is underscored by the meteoric rise in popularity of SWE-Bench. This benchmark challenges code agents to generate patches addressing GitHub issues given the full repository as context. The correctness of generated patches is then evaluated by executing a human-written test suite extracted from the repository after the issue's resolution. However, constructing benchmarks like SWE-Bench requires substantial manual effort to set up historically accurate execution environments for testing. Crucially, this severely limits the number of considered repositories, e.g., just 12 for SWE-Bench. Considering so few repositories, selected for their popularity, runs the risk of leading to a distributional mismatch, i.e., the measured performance may not be representative of real-world scenarios, potentially misguiding development efforts. In this work, we address this challenge and introduce SetUpAgent, a fully automated system capable of historically accurate dependency setup, test execution, and result parsing. Using SetUpAgent, we generate two new datasets: (i) SWEE-Bench, an extended version of SWE-Bench encompassing hundreds of repositories, and (ii) SWA-Bench, a benchmark focusing on applications rather than libraries. Comparing these datasets to SWE-Bench with respect to their characteristics and code agent performance, we find significant distributional differences, including lower issue description quality and detail level, higher fix complexity, and, most importantly, up to 40% lower agent success rates.
Submitted 10 March, 2025;
originally announced March 2025.
-
Wanda++: Pruning Large Language Models via Regional Gradients
Authors:
Yifan Yang,
Kai Zhen,
Bhavana Ganesh,
Aram Galstyan,
Goeric Huybrechts,
Markus Müller,
Jonas M. Kübler,
Rupak Vignesh Swaminathan,
Athanasios Mouchtaris,
Sravan Babu Bodapati,
Nathan Susanj,
Zheng Zhang,
Jack FitzGerald,
Abhishek Kumar
Abstract:
Large Language Model (LLM) pruning seeks to remove unimportant weights for inference speedup with minimal accuracy impact. However, existing methods often suffer from accuracy degradation without full-model sparsity-aware fine-tuning. This paper presents Wanda++, a novel pruning framework that outperforms the state-of-the-art methods by utilizing decoder-block-level regional gradients. Specifically, Wanda++ improves the pruning score with regional gradients for the first time and proposes an efficient regional optimization method to minimize pruning-induced output discrepancies between the dense and sparse decoder outputs. Notably, Wanda++ improves perplexity by up to 32% over Wanda in the language modeling task and generalizes effectively to downstream tasks. Moreover, despite updating weights with regional optimization, Wanda++ remains orthogonal to sparsity-aware fine-tuning, further reducing perplexity with LoRA to a great extent. Our approach is lightweight, pruning a 7B LLaMA model in under 10 minutes on a single H100 GPU.
Submitted 27 May, 2025; v1 submitted 6 March, 2025;
originally announced March 2025.
-
On the interpretability of neural network decoders
Authors:
Lukas Bödeker,
Luc J. B. Kusters,
Markus Müller
Abstract:
Neural-network (NN) based decoders are becoming increasingly popular in the field of quantum error correction (QEC), including for decoding of state-of-the-art quantum computation experiments. In this work, we make use of established interpretability methods from the field of machine learning to introduce a toolbox for understanding the underlying decoding logic of NN decoders, which have been trained but otherwise typically operate as black-box models. To illustrate the capabilities of the employed interpretability method, based on the Shapley value approximation, we provide an exemplary case study of a NN decoder that is trained for flag-qubit based fault-tolerant (FT) QEC with the Steane code. We show how particular decoding decisions of the NN can be interpreted, and reveal how the NN learns to capture fundamental structures in the information gained from syndrome and flag qubit measurements, in order to come to a FT correction decision. Further, we show that the understanding of how the NN obtains a decoding decision can be used both to identify flawed processing of error syndrome information by the NN, which results in decreased decoding performance, and to make well-informed improvements to the NN architecture. The diagnostic capabilities of the interpretability method we present can help ensure successful application of machine learning for decoding of QEC protocols.
Submitted 27 February, 2025;
originally announced February 2025.
-
Experimentally Informed Decoding of Stabilizer Codes Based on Syndrome Correlations
Authors:
Ants Remm,
Nathan Lacroix,
Lukas Bödeker,
Elie Genois,
Christoph Hellings,
François Swiadek,
Graham J. Norris,
Christopher Eichler,
Alexandre Blais,
Markus Müller,
Sebastian Krinner,
Andreas Wallraff
Abstract:
High-fidelity decoding of quantum error correction codes relies on an accurate experimental model of the physical errors occurring in the device. Because error probabilities can depend on the context of the applied operations, the error model is ideally calibrated using the same circuit as is used for the error correction experiment. Here, we present an experimental approach guided by a novel analytical formula to characterize the probability of independent errors using correlations in the syndrome data generated by executing the error correction circuit. Using the method on a distance-three surface code, we analyze error channels that flip an arbitrary number of syndrome elements, including Pauli Y errors, hook errors, multi-qubit errors, and leakage, in addition to standard Pauli X and Z errors. We use the method to find the optimal weights for a minimum-weight perfect matching decoder without relying on a theoretical error model. Additionally, we investigate whether improved knowledge of the Pauli Y error channel, based on correlating the X- and Z-type error syndromes, can be exploited to enhance matching decoding. Furthermore, we find correlated errors that flip many syndrome elements over up to eight cycles, potentially caused by leakage of the data qubits out of the computational subspace. The presented method provides the tools for accurately calibrating a broad family of decoders, beyond the minimum-weight perfect matching decoder, without relying on prior knowledge of the error model.
Submitted 24 February, 2025;
originally announced February 2025.
-
Invariance under quantum permutations rules out parastatistics
Authors:
Manuel Mekonnen,
Thomas D. Galley,
Markus P. Mueller
Abstract:
Quantum systems invariant under particle exchange are either Bosons or Fermions, even though quantum theory could in principle admit further types of behavior under permutations. But why do we not observe such "paraparticles" in nature? The analysis of this question was previously limited primarily to specific quantum field theory models. Here we give two independent arguments that rule out parastatistics universally, originating in quantum information theory and recent research on internal quantum reference frames. First, we introduce a notion of complete invariance: quantum systems should not only preserve their local state under permutations, but also the quantum information that they carry about other systems, in analogy to the notion of complete positivity in quantum information theory. Second, we demand that quantum systems are invariant under quantum permutations, i.e. permutations that are conditioned on the values of permutation-invariant observables. For both, we show that the respective principle is fulfilled if and only if the particle is a Boson or Fermion. Our results show how quantum reference frames can shed light on a longstanding problem of quantum physics, they underline the crucial role played by the compositional structure of quantum information, and they demonstrate the explanatory power but also subtle limitations of recently proposed quantum covariance principles.
Submitted 24 February, 2025;
originally announced February 2025.
-
The MAGPI Survey: the kinematic morphology-density relation (or lack thereof) and the Hubble sequence at $z\sim0.3$
Authors:
Caroline Foster,
Mark W. Donoghoe,
Andrew Battisti,
Francesco D'Eugenio,
Katherine Harborne,
Thomas Venville,
Claudia Del P. Lagos,
J. Trevor Mendel,
Ryan Bagge,
Stefania Barsanti,
Sabine Bellstedt,
Alina Boecker,
Qianhui Chen,
Caro Derkenne,
Anna Ferre-Matteu,
Eda Gjergo,
Anshu Gupta,
Eric G. M. Muller,
Giulia Santucci,
Hye-Jin Park,
Rhea-Silvia Remus,
Sabine Thater,
Jesse van de Sande,
Sam Vaughan,
Sarah Brough
, et al. (4 additional authors not shown)
Abstract:
This work presents visual morphological and dynamical classifications for 637 spatially resolved galaxies, most of which are at intermediate redshift ($z\sim0.3$), in the Middle-Ages Galaxy Properties with Integral field spectroscopy (MAGPI) Survey. For each galaxy, we obtain a minimum of 11 independent visual classifications by knowledgeable classifiers. We use an extension of the standard Dawid-Skene Bayesian model introducing classifier-specific confidence parameters and galaxy-specific difficulty parameters to quantify classifier confidence and infer reliable statistical confidence estimates. Selecting sub-samples of 86 bright ($r<20$ mag) high-confidence ($>0.98$) morphological classifications at redshifts ($0.2 \le z \le0.4$), we confirm the full range of morphological types is represented in MAGPI as intended in the survey design. Similarly, with a sub-sample of 82 bright high-confidence stellar kinematic classifications, we find that the rotating and non-rotating galaxies seen at low redshift are already in place at intermediate redshifts. We \textit{do not} find evidence that the kinematic morphology-density relation seen at $z\sim0$ is established at $z\sim0.3$. We suggest that galaxies without obvious stellar rotation are dynamically pre-processed sometime before $z\sim0.3$ within lower mass groups before joining denser environments.
Submitted 23 February, 2025;
originally announced February 2025.
-
Dimension reduction for Willmore flows of tori: fixed conformal class and analysis of singularities
Authors:
Anna Dall'Acqua,
Marius Müller,
Fabian Rupp,
Manuel Schlierf
Abstract:
This work studies Willmore flows of tori and their singularities via a dimension reduction approach. We introduce a Willmore flow that preserves the degenerate constraint of prescribed conformal class and, for rotationally symmetric initial data, we establish a strong relation with the length-preserving elastic flow in the hyperbolic plane. We provide a necessary condition for singularities and a criterion for the initial datum that allows us to exclude them. Our results allow for initial data with arbitrarily large energy, in particular exceeding the usual Li-Yau threshold of $8π$. As an application, we obtain existence of a new class of conformally constrained Willmore tori. Moreover, we investigate singularities of the classical Willmore flow. For a class of tori, we identify a non-smooth object, the inverted catenoid, as the limit shape and we show that the flow can be restarted at this singular surface and converges to a round sphere.
Submitted 18 February, 2025;
originally announced February 2025.
-
Scintillation Light Detection in Polycrystalline Diamond Using Single Photon Detectors
Authors:
Niccolò Gallice,
Aleksey Bolotnikov,
Erik M. Muller,
Thomas Tsang
Abstract:
This study investigates the scintillation properties of polycrystalline diamond for particle detection applications, particularly in neutron and alpha radiation environments. Polycrystalline diamonds provide a cost-effective alternative to monocrystalline diamonds while retaining essential detection properties. Photoluminescence measurements were performed to analyze emission spectra, revealing distinct characteristics based on impurity content and crystallinity. Scintillation responses were assessed using Silicon Photomultipliers (SiPMs), demonstrating the capability of polycrystalline diamond powders to respond to alpha irradiation, albeit with reduced resolution compared to traditional scintillators. A prototype neutron detector was developed by combining diamond powder with neutron-sensitive ${}^6$LiF, and its performance was evaluated through experimental testing and Geant4 simulations. The findings indicate that polycrystalline diamond-based detectors can achieve significant detection efficiency while remaining insensitive to gamma radiation, offering potential for portable neutron detection applications.
Submitted 13 February, 2025;
originally announced February 2025.
-
Are all models wrong? Fundamental limits in distribution-free empirical model falsification
Authors:
Manuel M. Müller,
Yuetian Luo,
Rina Foygel Barber
Abstract:
In statistics and machine learning, when we train a fitted model on available data, we typically want to ensure that we are searching within a model class that contains at least one accurate model -- that is, we would like to ensure an upper bound on the model class risk (the lowest possible risk that can be attained by any model in the class). However, it is also of interest to establish lower bounds on the model class risk, for instance so that we can determine whether our fitted model is at least approximately optimal within the class, or so that we can decide whether the model class is unsuitable for the particular task at hand. Particularly in the setting of interpolation learning where machine learning models are trained to reach zero error on the training data, we might ask if, at the very least, a positive lower bound on the model class risk is possible -- or are we unable to detect that "all models are wrong"? In this work, we answer these questions in a distribution-free setting by establishing a model-agnostic, fundamental hardness result for the problem of constructing a lower bound on the best test error achievable over a model class, and examine its implications for specific model classes such as tree-based methods and linear regression.
Submitted 10 February, 2025;
originally announced February 2025.
-
Sunrise III: Overview of Observatory and Instruments
Authors:
Andreas Korpi-Lagg,
Achim Gandorfer,
Sami K. Solanki,
Jose Carlos del Toro Iniesta,
Yukio Katsukawa,
Pietro Bernasconi,
Thomas Berkefeld,
Alex Feller,
Tino L. Riethmüller,
Alberto Álvarez-Herrero,
Masahito Kubo,
Valentín Martínez Pillet,
H. N. Smitha,
David Orozco Suárez,
Bianca Grauf,
Michael Carpenter,
Alexander Bell,
María-Teresa Álvarez-Alonso,
Daniel Álvarez García,
Beatriz Aparicio del Moral,
Daniel Ayoub,
Francisco Javier Bailén,
Eduardo Bailón Martínez,
Maria Balaguer Jiménez,
Peter Barthol
, et al. (95 additional authors not shown)
Abstract:
In July 2024, Sunrise completed its third successful science flight. The Sunrise III observatory had been upgraded significantly after the two previous successful flights in 2009 and 2013. Three completely new instruments focus on the small-scale physical processes and their complex interaction from the deepest observable layers in the photosphere up to chromospheric heights. Previously poorly explored spectral regions and lines are exploited to paint a three-dimensional picture of the solar atmosphere with unprecedented completeness and level of detail. The full polarimetric information is captured by all three instruments to reveal the interaction between the magnetic fields and the hydrodynamic processes. Two slit-based spectropolarimeters, the Sunrise UV Spectropolarimeter and Imager (SUSI) and the Sunrise Chromospheric Infrared spectro-Polarimeter (SCIP), focus on the near-ultraviolet and the near-infrared regions respectively, and the imaging spectropolarimeter Tunable Magnetograph (TuMag) simultaneously obtains maps of the full field-of-view of $46 \times 46$ Mm$^2$ in the photosphere and the chromosphere in the visible. The instruments are operated in an orchestrated mode, benefiting from a new Image Stabilization and Light Distribution unit (ISLiD), with the Correlating Wavefront Sensor (CWS) providing the autofocus control and an image stability with a root-mean-square value smaller than 0.005''. A new gondola was constructed to significantly improve the telescope pointing stability, required to achieve uninterrupted observations over many hours. Sunrise III was launched successfully on July 10, 2024, from the Esrange Space Center near Kiruna (Sweden). It reached the landing site between the Mackenzie River and the Great Bear Lake in Canada after a flight duration of 6.5 days. In this paper, we give an overview of the Sunrise III observatory and its instruments.
Submitted 10 February, 2025;
originally announced February 2025.
-
Comparison of the detector response and calibration function of metallic microcalorimeters for X-ray photons and external electrons
Authors:
Neven Kovac,
Fabienne Adam,
Sebastian Kempf,
Marie-Christin Langer,
Michael Müller,
Rudolf Sack,
Magnus Schlösser,
Markus Steidl,
Kathrin Valerius
Abstract:
Metallic microcalorimeters (MMCs) are cryogenic single-particle detectors that rely on a calorimetric detection principle. Due to their excellent energy resolution, close-to-ideal linear detector response, fast signal rise time and the potential for 100% quantum efficiency, MMCs outperform conventional detectors by several orders of magnitude in resolution. These attributes make them particularly interesting for a broad spectrum of applications, including a next-generation neutrino mass experiment based on the measurement of the tritium beta-decay spectrum, with an objective of achieving a sensitivity surpassing that of the pioneering KATRIN experiment. However, although MMCs have been used in measurements of photons and heavy ions with great success, no information is currently available on the interaction between MMCs and external light charged particles such as electrons. This work aims to provide such missing information and to demonstrate that MMC-based detectors are suitable for high-resolution spectroscopy of external electron sources. In particular, we present the first-ever measurements of external electrons using a metallic microcalorimeter, comprehensively discuss the characteristics of the signal shape and the calibration function and give a direct comparison between well-defined conversion electron and X-ray photon signals from the same $^{83}$Rb/$^{83m}$Kr source.
Submitted 9 February, 2025;
originally announced February 2025.
-
Monitored interacting Dirac fermions
Authors:
Thomas Martin Müller,
Michael Buchhold,
Sebastian Diehl
Abstract:
We analytically study interacting Dirac fermions, described by the Thirring model, under weak local particle number measurements with monitoring rate $γ$. This system maps to a bosonic replica field theory, analyzed via the renormalization group. For a nonzero attractive interaction, a phase transition occurs at a critical measurement strength $γ_c$. When $γ>γ_c$, the system enters a localized phase characterized by exponentially decaying density-density correlations beyond a finite correlation length; for $γ<γ_c$, the correlations decay algebraically. The transition is of BKT-type, reflected by a characteristic scaling of the correlation length. In the non-interacting limit, $γ_c$ shifts to zero, reducing the algebraic phase to a single point in parameter space. This identifies weak measurements in the free case as an implicit double fine-tuning to the critical endpoint of the BKT phase transition. Along the non-interacting line, we compute the entanglement entropy from density-density correlation functions and find no entanglement transition at nonzero measurement strength in the thermodynamic limit.
Submitted 21 February, 2025; v1 submitted 4 February, 2025;
originally announced February 2025.
-
Performance guarantees for optimization-based state estimation using turnpike properties
Authors:
Julian D. Schiller,
Lars Grüne,
Matthias A. Müller
Abstract:
In this paper, we develop novel accuracy and performance guarantees for optimal state estimation of general nonlinear systems (in particular, moving horizon estimation, MHE). Our results rely on a turnpike property of the optimal state estimation problem, which essentially states that the omniscient infinite-horizon solution involving all past and future data serves as a turnpike for the solutions of finite-horizon estimation problems involving a subset of the data. This leads to the surprising observation that MHE problems naturally exhibit a leaving arc, which may have a strong negative impact on the estimation accuracy. To address this, we propose a delayed MHE scheme, and we show that the resulting performance (both averaged and non-averaged) is approximately optimal and achieves bounded dynamic regret with respect to the infinite-horizon solution, with error terms that can be made arbitrarily small by an appropriate choice of the delay. In various simulation examples, we observe that even a very small delay in the MHE scheme is sufficient to reduce the overall estimation error by 20-25% compared to standard MHE (without delay). This finding is of great importance for practical applications (especially for monitoring, fault detection, and parameter estimation) where a small delay in the estimation is largely irrelevant but may significantly improve the estimation results.
Submitted 30 January, 2025;
originally announced January 2025.
-
Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces
Authors:
Neetha Jambigi,
Bartosz Bogacz,
Moritz Mueller,
Thomas Bach,
Michael Felderer
Abstract:
Abrupt and unexpected terminations of software are termed software crashes. They can be challenging to analyze. Finding the root cause requires extensive manual effort and expertise to connect information sources like stack traces, source code, and logs. Typical approaches to fault localization require either test failures or source code. Crashes occurring in production environments, such as that of SAP HANA, provide solely crash logs and stack traces. We present a novel approach to localize faults based only on the stack trace information and no additional runtime information, by fine-tuning large language models (LLMs). We address complex cases where the root cause of a crash differs from the technical cause, and is not located in the innermost frame of the stack trace. As the number of historical crashes is insufficient to fine-tune LLMs, we augment our dataset by leveraging code mutators to inject synthetic crashes into the code base. By fine-tuning on 64,369 crashes resulting from 4.1 million mutations of the HANA code base, we can correctly predict the root cause location of a crash with an accuracy of 66.9% while baselines only achieve 12.6% and 10.6%. We substantiate the generalizability of our approach by evaluating on two additional open-source databases, SQLite and DuckDB, achieving accuracies of 63% and 74%, respectively. Across all our experiments, fine-tuning consistently outperformed prompting non-finetuned LLMs for localizing faults in our datasets.
Submitted 11 February, 2025; v1 submitted 29 January, 2025;
originally announced January 2025.
-
Controlling AI Agent Participation in Group Conversations: A Human-Centered Approach
Authors:
Stephanie Houde,
Kristina Brimijoin,
Michael Muller,
Steven I. Ross,
Dario Andres Silva Moran,
Gabriel Enrique Gonzalez,
Siya Kunde,
Morgan A. Foreman,
Justin D. Weisz
Abstract:
Conversational AI agents are commonly applied within single-user, turn-taking scenarios. The interaction mechanics of these scenarios are trivial: when the user enters a message, the AI agent produces a response. However, the interaction dynamics are more complex within group settings. How should an agent behave in these settings? We report on two experiments aimed at uncovering users' experiences of an AI agent's participation within a group, in the context of group ideation (brainstorming). In the first study, participants benefited from and preferred having the AI agent in the group, but participants disliked when the agent seemed to dominate the conversation and they desired various controls over its interactive behaviors. In the second study, we created functional controls over the agent's behavior, operable by group members, to validate their utility and probe for additional requirements. Integrating our findings across both studies, we developed a taxonomy of controls for when, what, and where a conversational AI agent in a group should respond, who can control its behavior, and how those controls are specified and implemented. Our taxonomy is intended to help AI creators think through important considerations in the design of mixed-initiative conversational agents.
Submitted 28 January, 2025;
originally announced January 2025.
-
Gaussian Process-Based Prediction and Control of Hammerstein-Wiener Systems
Authors:
Mingzhou Yin,
Matthias A. Müller
Abstract:
This work investigates data-driven prediction and control of Hammerstein-Wiener systems using physics-informed Gaussian process models. Data-driven prediction algorithms have been developed for structured nonlinear systems based on Willems' fundamental lemma. However, existing frameworks cannot treat output nonlinearities and require a dictionary of basis functions for Hammerstein systems. In this work, an implicit predictor structure is considered, leveraging the multi-step-ahead ARX structure for the linear part of the model. This implicit function is learned by Gaussian process regression with kernel functions designed from Gaussian process priors for the nonlinearities. The linear model parameters are estimated as hyperparameters by assuming a stable spline hyperprior. The implicit Gaussian process model provides explicit output prediction by optimizing selected optimality criteria. The model is also applied to receding horizon control with the expected control cost and chance constraint satisfaction guarantee. Numerical results demonstrate that the proposed prediction and control algorithms are superior to black-box Gaussian process models.
Submitted 27 January, 2025;
originally announced January 2025.
-
Stochastic bubble dynamics in phase-separated scalar active matter
Authors:
Mingqi Yan,
Erwin Frey,
Marcus Müller,
Stefan Klumpp
Abstract:
In active Brownian particle (ABP) systems, phase separation is accompanied by the emergence of vapor bubbles within liquid domains. Using large-scale particle-based simulations, we study the stochastic dynamics of these bubbles and find that most nucleate, grow, and dissolve within liquid domains. We show that their area dynamics can be described by a Langevin equation with a constant negative drift and noise proportional to the perimeter, fully characterizing bubble area and lifetime statistics. Additionally, we develop a lattice gas model that captures the morphological properties, including the decrease in bubble asphericity with increasing area. These findings provide new insights into phase separation in active matter and highlight limitations in current continuum theories.
Submitted 20 January, 2025;
originally announced January 2025.
-
DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids
Authors:
Iosif Tsangko,
Andreas Triantafyllopoulos,
Michael Müller,
Hendrik Schröter,
Björn W. Schuller
Abstract:
The DeepFilterNet (DFN) architecture was recently proposed as a deep learning model suited for hearing aid devices. Despite its competitive performance on numerous benchmarks, it still follows a 'one-size-fits-all' approach, which aims to train a single, monolithic architecture that generalises across different noises and environments. However, its limited size and computation budget can hamper its generalisability. Recent work has shown that in-context adaptation can improve performance by conditioning the denoising process on additional information extracted from background recordings to mitigate this. These recordings can be offloaded outside the hearing aid, thus improving performance while adding minimal computational overhead. We introduce these principles to the DFN model, thus proposing the DFingerNet (DFiN) model, which shows superior performance on various benchmarks inspired by the DNS Challenge.
Submitted 23 January, 2025; v1 submitted 17 January, 2025;
originally announced January 2025.