-
BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop
Authors:
Lucas Charpentier,
Leshem Choshen,
Ryan Cotterell,
Mustafa Omer Gul,
Michael Hu,
Jaap Jumelet,
Tal Linzen,
Jing Liu,
Aaron Mueller,
Candace Ross,
Raj Sanjay Shah,
Alex Warstadt,
Ethan Wilcox,
Adina Williams
Abstract:
BabyLM aims to dissolve the boundaries between cognitive modeling and language modeling. We call for both workshop papers and for researchers to join the 3rd BabyLM competition. As in previous years, we call for participants in the data-efficient pretraining challenge in the general track. This year, we also offer a new track: INTERACTION. This new track encourages interactive behavior, learning from a teacher, and adapting the teaching material to the student. We also call for papers outside the competition in any relevant areas. These include training efficiency, cognitively plausible research, weak model evaluation, and more.
Submitted 24 February, 2025; v1 submitted 14 February, 2025;
originally announced February 2025.
-
Consequences of a Heavy-Metal Scenario of Ultra-High-Energy Cosmic Rays
Authors:
Jakub Vícha,
Alena Bakalová,
Olena Tkachenko,
Ana Laura Müller,
Maximilian Stadelmaier
Abstract:
We assume an extreme scenario, in which the arriving cosmic rays are composed of only iron nuclei at energies above $10^{19.6}\,\text{eV}\simeq40\,\text{EeV}$, while allowing a freedom in the scale of the depth of shower maximum ($X_{\rm{max}}$) and preserving the elongation rate and fluctuations of $X_{\rm{max}}$ predicted by models of hadronic interactions. We derive the shift of the $X_{\rm{max}}$ scale for QGSJet II-04 and Sibyll 2.3d models using the public data from the Pierre Auger Observatory. We then propose a new mass-composition model for the energy evolution of four primary species at the ultra-high energies by fitting the publicly-available $X_{\rm{max}}$ distributions. We discuss the consequences of this Heavy-metal scenario on the energy spectrum of individual primary species, hadronic interaction studies, and the effect of the Galactic magnetic field on the arrival directions.
Submitted 30 April, 2025; v1 submitted 12 February, 2025;
originally announced February 2025.
-
Ideas and Requirements for the Global Cosmic-Ray Observatory (GCOS)
Authors:
Markus Ahlers,
Ingo Allekotte,
Jaime Alvarez-Muniz,
Gioacchino Alex Anastasi,
Luis Anchordoqui,
Rita de Cassia Dos Anjos,
Hari Haran Balakrishnan,
Rafael Alves Batista,
Jose Bellido,
Mario Bertaina,
Sonali Bhatnagar,
Pierre Billoir,
Kathrin Bismark,
Teresa Bister,
Martina Bohacova,
Carla Bonifazi,
Fraser Bradfield,
Antonella Castellina,
Lorenzo Cazon,
Kevin Almeida Cheminant,
Alan Coleman,
Fabio Convenga,
Darko Veberič,
Paramita Dasgupta,
Kai Daumiller
, et al. (114 additional authors not shown)
Abstract:
After a successful kick-off meeting in 2021, two workshops in 2022 and 2023 on the future Global Cosmic-Ray Observatory (GCOS) focused mainly on a straw-man design of the detector and science possibilities for astro- and particle physics. About 100 participants gathered for in-person and hybrid panel discussions. In this report, we summarize these discussions, present a preliminary straw-man design for GCOS, and collect short write-ups of the flash talks given during the focus sessions.
Submitted 8 February, 2025;
originally announced February 2025.
-
Open Challenges in Time Series Anomaly Detection: An Industry Perspective
Authors:
Andreas Mueller
Abstract:
Current research in time-series anomaly detection uses definitions that miss critical aspects of how anomaly detection is commonly used in practice. We list several areas that are of practical relevance and that we believe are either under-investigated or missing entirely from the current discourse. Based on an investigation of systems deployed in a cloud environment, we motivate the areas of streaming algorithms, human-in-the-loop scenarios, point processes, conditional anomalies and population analysis of time series. This paper serves as a motivation and call for action, including opportunities for theoretical and applied research, as well as for building new datasets and benchmarks.
Submitted 7 February, 2025;
originally announced February 2025.
-
Position-aware Automatic Circuit Discovery
Authors:
Tal Haklay,
Hadas Orgad,
David Bau,
Aaron Mueller,
Yonatan Belinkov
Abstract:
A widely used strategy to discover and understand language model mechanisms is circuit analysis. A circuit is a minimal subgraph of a model's computation graph that executes a specific task. We identify a gap in existing circuit discovery methods: they assume circuits are position-invariant, treating model components as equally relevant across input positions. This limits their ability to capture cross-positional interactions or mechanisms that vary across positions. To address this gap, we propose two improvements to incorporate positionality into circuits, even on tasks containing variable-length examples. First, we extend edge attribution patching, a gradient-based method for circuit discovery, to differentiate between token positions. Second, we introduce the concept of a dataset schema, which defines token spans with similar semantics across examples, enabling position-aware circuit discovery in datasets with variable length examples. We additionally develop an automated pipeline for schema generation and application using large language models. Our approach enables fully automated discovery of position-sensitive circuits, yielding better trade-offs between circuit size and faithfulness compared to prior work.
Submitted 6 February, 2025;
originally announced February 2025.
-
Ethical Considerations for the Military Use of Artificial Intelligence in Visual Reconnaissance
Authors:
Mathias Anneken,
Nadia Burkart,
Fabian Jeschke,
Achim Kuwertz-Wolf,
Almuth Mueller,
Arne Schumann,
Michael Teutsch
Abstract:
This white paper underscores the critical importance of responsibly deploying Artificial Intelligence (AI) in military contexts, emphasizing a commitment to ethical and legal standards. The evolving role of AI in the military goes beyond mere technical applications, necessitating a framework grounded in ethical principles. The discussion within the paper delves into ethical AI principles, particularly focusing on the Fairness, Accountability, Transparency, and Ethics (FATE) guidelines. Noteworthy considerations encompass transparency, justice, non-maleficence, and responsibility. Importantly, the paper extends its examination to military-specific ethical considerations, drawing insights from the Just War theory and principles established by prominent entities. In addition to the identified principles, the paper introduces further ethical considerations specifically tailored for military AI applications. These include traceability, proportionality, governability, responsibility, and reliability. The application of these ethical principles is discussed on the basis of three use cases in the domains of sea, air, and land. Methods of automated sensor data analysis, eXplainable AI (XAI), and intuitive user experience are utilized to specify the use cases close to real-world scenarios. This comprehensive approach to ethical considerations in military AI reflects a commitment to aligning technological advancements with established ethical frameworks. It recognizes the need for a balance between leveraging AI's potential benefits in military operations while upholding moral and legal standards. The inclusion of these ethical principles serves as a foundation for responsible and accountable use of AI in the complex and dynamic landscape of military scenarios.
Submitted 5 February, 2025;
originally announced February 2025.
-
Spectra-orthogonal optical anisotropy in wafer-scale molecular crystal monolayers
Authors:
Tomojit Chowdhury,
Fauzia Mujid,
Zehra Naqvi,
Ariana Ray,
Ce Liang,
David A. Muller,
Nathan P. Guisinger,
Jiwoong Park
Abstract:
Controlling the spectral and polarization responses of two-dimensional (2D) crystals is vital for developing ultra-thin platforms for compact optoelectronic devices. However, independently tuning optical anisotropy and spectral response remains challenging in conventional semiconductors due to the intertwined nature of their lattice and electronic structures. Here, we report spectra-orthogonal optical anisotropy, where polarization anisotropy is tuned independently of spectral response, in wafer-scale, one-atom-thick 2D molecular crystal (2DMC) monolayers synthesized on monolayer transition metal dichalcogenide (TMD) crystals. Utilizing the concomitant spectral consistency and structural tunability of perylene derivatives, we demonstrate tunable optical polarization anisotropy in 2DMCs with similar spectral profiles, as confirmed by room-temperature scanning tunneling microscopy and cross-polarized reflectance microscopy. Additional angle-dependent analysis of the single- and polycrystalline molecular domains reveals an epitaxial relationship between the 2DMC and the TMD. Our results establish a scalable, molecule-based 2D crystalline platform for unique and tunable functionalities unattainable in covalent 2D solids.
Submitted 3 February, 2025;
originally announced February 2025.
-
Performance guarantees for optimization-based state estimation using turnpike properties
Authors:
Julian D. Schiller,
Lars Grüne,
Matthias A. Müller
Abstract:
In this paper, we develop novel accuracy and performance guarantees for optimal state estimation of general nonlinear systems (in particular, moving horizon estimation, MHE). Our results rely on a turnpike property of the optimal state estimation problem, which essentially states that the omniscient infinite-horizon solution involving all past and future data serves as turnpike for the solutions of finite-horizon estimation problems involving a subset of the data. This leads to the surprising observation that MHE problems naturally exhibit a leaving arc, which may have a strong negative impact on the estimation accuracy. To address this, we propose a delayed MHE scheme, and we show that the resulting performance (both averaged and non-averaged) is approximately optimal and achieves bounded dynamic regret with respect to the infinite-horizon solution, with error terms that can be made arbitrarily small by an appropriate choice of the delay. In various simulation examples, we observe that already a very small delay in the MHE scheme is sufficient to significantly improve the overall estimation error by 20-25 % compared to standard MHE (without delay). This finding is of great importance for practical applications (especially for monitoring, fault detection, and parameter estimation) where a small delay in the estimation is rather irrelevant but may significantly improve the estimation results.
Submitted 30 January, 2025;
originally announced January 2025.
-
Quantum oscillations of holes in GaN
Authors:
Chuan F. C. Chang,
Joseph E. Dill,
Zexuan Zhang,
Jie-Cheng Chen,
Naomi Pieczulewski,
Samuel J. Bader,
Oscar Ayala Valenzuela,
Scott A. Crooker,
Fedor F. Balakirev,
Ross D. McDonald,
Jimy Encomendero,
David A. Muller,
Feliciano Giustino,
Debdeep Jena,
Huili Grace Xing
Abstract:
GaN has emerged as a major semiconductor akin to silicon due to its revolutionary impacts in solid state lighting, critically enabled by p-type doping, and high-performance radio-frequency and power electronics. Because of inefficient hole doping and low hole mobility, quantum oscillations in p-type GaN have not been observed, hindering fundamental studies of valence bands and hole transport in GaN. Here, we present the first observation of quantum oscillations of holes in GaN. Shubnikov-de Haas (SdH) oscillations in hole resistivity are observed in a quantum-confined two-dimensional hole gas at a GaN/AlN interface, where polarization-induced doping overcomes thermal freeze-out, and a sharp and clean interface boosts the hole mobility enough to unmask the quantum oscillations. These holes degenerately occupy the light and heavy hole bands of GaN and have record-high mobilities of ~1900 cm$^2$/Vs and ~400 cm$^2$/Vs at 3 K, respectively. We use magnetic fields up to 72 T to resolve SdH oscillations of holes from both valence bands to extract their respective sheet densities, quantum scattering times, and the effective masses of light holes (0.5-0.7 $m_0$) and heavy holes (1.9 $m_0$). SdH oscillations of heavy and light holes in GaN constitute a direct metrology of valence bands and open new avenues for quantum engineering in this technologically important semiconductor. Like strained silicon transistors, strain-engineering of the valence bands of GaN is predicted to dramatically improve hole mobilities by reducing the hole effective mass, a proposal that can now be explored experimentally, particularly in a fully fabricated transistor, using quantum oscillations. Furthermore, the findings of this work suggest a blueprint to create 2D hole gases and observe quantum oscillations of holes in related wide bandgap semiconductors such as SiC and ZnO in which such techniques are not yet possible.
Submitted 27 January, 2025;
originally announced January 2025.
-
Gaussian Process-Based Prediction and Control of Hammerstein-Wiener Systems
Authors:
Mingzhou Yin,
Matthias A. Müller
Abstract:
This work investigates data-driven prediction and control of Hammerstein-Wiener systems using physics-informed Gaussian process models. Data-driven prediction algorithms have been developed for structured nonlinear systems based on Willems' fundamental lemma. However, existing frameworks cannot treat output nonlinearities and require a dictionary of basis functions for Hammerstein systems. In this work, an implicit predictor structure is considered, leveraging the multi-step-ahead ARX structure for the linear part of the model. This implicit function is learned by Gaussian process regression with kernel functions designed from Gaussian process priors for the nonlinearities. The linear model parameters are estimated as hyperparameters by assuming a stable spline hyperprior. The implicit Gaussian process model provides explicit output prediction by optimizing selected optimality criteria. The model is also applied to receding horizon control with the expected control cost and chance constraint satisfaction guarantee. Numerical results demonstrate that the proposed prediction and control algorithms are superior to black-box Gaussian process models.
Submitted 27 January, 2025;
originally announced January 2025.
-
The SPHERE infrared survey for exoplanets (SHINE). V. Complete observations, data reduction and analysis, detection performances, and final results
Authors:
A. Chomez,
P. Delorme,
A. -M. Lagrange,
R. Gratton,
O. Flasseur,
G. Chauvin,
M. Langlois,
J. Mazoyer,
A. Zurlo,
S. Desidera,
D. Mesa,
M. Bonnefoy,
M. Feldt,
J. Hagelberg,
M. Meyer,
A. Vigan,
C. Ginski,
M. Kenworthy,
D. Albert,
S. Bergeon,
J. -L. Beuzit,
B. Biller,
T. Bhowmik,
A. Boccaletti,
M. Bonavita
, et al. (95 additional authors not shown)
Abstract:
During the past decade, state-of-the-art planet-finder instruments like SPHERE@VLT, coupling coronagraphic devices and extreme adaptive optics systems, unveiled, thanks to large surveys, around 20 planetary mass companions at semi-major axis greater than 10 astronomical units. Direct imaging being the only detection technique to be able to probe this outer region of planetary systems, the SPHERE infrared survey for exoplanets (SHINE) was designed and conducted from 2015 to 2021 to study the demographics of such young gas giant planets around 400 young nearby solar-type stars. In this paper, we present the observing strategy, the data quality, and the point sources analysis of the full SHINE statistical sample as well as snapSHINE. Both surveys used the SPHERE@VLT instrument with the IRDIS dual band imager in conjunction with the integral field spectrograph IFS and the angular differential imaging observing technique. All SHINE data (650 datasets), corresponding to 400 stars, including the targets of the F150 survey, are processed in a uniform manner with an advanced post-processing algorithm called PACO ASDI. An emphasis is put on the classification and identification of the most promising candidate companions. Compared to the previous early analysis SHINE F150, the use of advanced post-processing techniques significantly improved by one or 2 magnitudes (x3-x6) the contrast detection limits, which will allow us to put even tighter constraints on the radial distribution of young gas giants. This increased sensitivity directly places SHINE as the largest and deepest direct imaging survey ever conducted. We detected and classified more than 3500 physical sources. One additional substellar companion has been confirmed during the second phase of the survey (HIP 74865 B), and several new promising candidate companions are awaiting second epoch confirmations.
Submitted 21 January, 2025;
originally announced January 2025.
-
Human-like Nonverbal Behavior with MetaHumans in Real-World Interaction Studies: An Architecture Using Generative Methods and Motion Capture
Authors:
Oliver Chojnowski,
Alexander Eberhard,
Michael Schiffmann,
Ana Müller,
Anja Richert
Abstract:
Socially interactive agents are gaining prominence in domains like healthcare, education, and service contexts, particularly virtual agents due to their inherent scalability. To facilitate authentic interactions, these systems require verbal and nonverbal communication through e.g., facial expressions and gestures. While natural language processing technologies have rapidly advanced, incorporating human-like nonverbal behavior into real-world interaction contexts is crucial for enhancing the success of communication, yet this area remains underexplored. One barrier is creating autonomous systems with sophisticated conversational abilities that integrate human-like nonverbal behavior. This paper presents a distributed architecture using Epic Games MetaHuman, combined with advanced conversational AI and camera-based user management, that supports methods like motion capture, handcrafted animation, and generative approaches for nonverbal behavior. We share insights into a system architecture designed to investigate nonverbal behavior in socially interactive agents, deployed in a three-week field study in the Deutsches Museum Bonn, showcasing its potential in realistic nonverbal behavior research.
Submitted 18 January, 2025;
originally announced January 2025.
-
Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models
Authors:
Aruna Sankaranarayanan,
Dylan Hadfield-Menell,
Aaron Mueller
Abstract:
All natural languages are structured hierarchically. In humans, this structural restriction is neurologically coded: when two grammars are presented with identical vocabularies, brain areas responsible for language processing are only sensitive to hierarchical grammars. Using large language models (LLMs), we investigate whether such functionally distinct hierarchical processing regions can arise solely from exposure to large-scale language distributions. We generate inputs using English, Italian, Japanese, or nonce words, varying the underlying grammars to conform to either hierarchical or linear/positional rules. Using these grammars, we first observe that language models show distinct behaviors on hierarchical versus linearly structured inputs. Then, we find that the components responsible for processing hierarchical grammars are distinct from those that process linear grammars; we causally verify this in ablation experiments. Finally, we observe that hierarchy-selective components are also active on nonce grammars; this suggests that hierarchy sensitivity is not tied to meaning, nor in-distribution inputs.
Submitted 15 January, 2025;
originally announced January 2025.
-
Resolving Structural Origins for Superconductivity in Strain-Engineered La$_3$Ni$_2$O$_7$ Thin Films
Authors:
Lopa Bhatt,
Abigail Y. Jiang,
Eun Kyo Ko,
Noah Schnitzer,
Grace A. Pan,
Dan Ferenc Segedin,
Yidi Liu,
Yijun Yu,
Yi-Feng Zhao,
Edgar Abarca Morales,
Charles M. Brooks,
Antia S. Botana,
Harold Y. Hwang,
Julia A. Mundy,
David A. Muller,
Berit H. Goodge
Abstract:
The discovery of high-temperature superconductivity in bulk La$_3$Ni$_2$O$_7$ under high hydrostatic pressure and, more recently, biaxial compression in epitaxial thin films has ignited significant interest in understanding the interplay between atomic and electronic structure in these compounds. Subtle changes in the nickel-oxygen bonding environment are thought to be key drivers for stabilizing superconductivity, but specific details of which bonds and which modifications are most relevant remain so far unresolved. While direct, atomic-scale structural characterization under hydrostatic pressure is beyond current experimental capabilities, static stabilization of strained La$_3$Ni$_2$O$_7$ films provides a platform well-suited to investigation with new picometer-resolution electron microscopy methods. Here, we use multislice electron ptychography to directly measure the atomic-scale structural evolution of La$_3$Ni$_2$O$_7$ thin films across a wide range of biaxial strains tuned via substrate. By resolving both the cation and oxygen sublattices, we study strain-dependent evolution of atomic bonds, providing the opportunity to isolate and disentangle the effects of specific structural motifs for stabilizing superconductivity. We identify the lifting of crystalline symmetry through modification of the nickel-oxygen octahedral distortions under compressive strain as a key structural ingredient for superconductivity. Rather than previously supposed $c$-axis compression, our results highlight the importance of in-plane biaxial compression in superconducting thin films, which suggests an alternative -- possibly cuprate-like -- understanding of the electronic structure. Identifying local regions of inhomogeneous oxygen stoichiometry and high internal strain near crystalline defects, we suggest potential pathways for improving the sharpness and temperature of the superconducting transition.
Submitted 14 January, 2025;
originally announced January 2025.
-
Superconductivity and normal-state transport in compressively strained La$_2$PrNi$_2$O$_7$ thin films
Authors:
Yidi Liu,
Eun Kyo Ko,
Yaoju Tarn,
Lopa Bhatt,
Berit H. Goodge,
David A. Muller,
Srinivas Raghu,
Yijun Yu,
Harold Y. Hwang
Abstract:
The discovery of superconductivity under high pressure in Ruddlesden-Popper phases of bulk nickelates has sparked great interest in stabilizing ambient pressure superconductivity in thin-film form using epitaxial strain. Recently, signs of superconductivity have been observed in compressively strained bilayer nickelate thin films with an onset temperature exceeding 40 K, albeit with broad and two-step-like transitions. Here, we report intrinsic superconductivity and normal-state transport properties in compressively strained La$_2$PrNi$_2$O$_7$ thin films, achieved through a combination of isovalent Pr substitution, growth optimization, and precision ozone annealing. The superconducting onset occurs above 48 K, with zero resistance reached above 30 K, and the critical current density at 1.4 K is 100-fold larger than previous reports. The normal-state resistivity exhibits quadratic temperature dependence indicative of Fermi liquid behaviour, and other phenomenological similarities to transport in overdoped cuprates suggest parallels in their emergent properties.
Submitted 14 January, 2025;
originally announced January 2025.
-
Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages
Authors:
Jannik Brinkmann,
Chris Wendler,
Christian Bartelt,
Aaron Mueller
Abstract:
Human bilinguals often use similar brain regions to process multiple languages, depending on when they learned their second language and their proficiency. In large language models (LLMs), how are multiple languages learned and encoded? In this work, we explore the extent to which LLMs share representations of morphosyntactic concepts such as grammatical number, gender, and tense across languages. We train sparse autoencoders on Llama-3-8B and Aya-23-8B, and demonstrate that abstract grammatical concepts are often encoded in feature directions shared across many languages. We use causal interventions to verify the multilingual nature of these representations; specifically, we show that ablating only multilingual features decreases classifier performance to near-chance across languages. We then use these features to precisely modify model behavior in a machine translation task; this demonstrates both the generality and selectivity of these features' roles in the network. Our findings suggest that even models trained predominantly on English data can develop robust, cross-lingual abstractions of morphosyntactic concepts.
Submitted 23 May, 2025; v1 submitted 10 January, 2025;
originally announced January 2025.
-
Double-$K$-hole resonances in single photoionization of He-like B$^{3+}$ ions
Authors:
A. Müller,
P. -M. Hillenbrand,
S. -X. Wang,
S. Schippers,
E. Lindroth,
F. Trinter,
J. Seltmann,
S. Reinwardt,
M. Martins,
A. S. Kheifets,
I. Bray
Abstract:
Within a joint experimental and theoretical research project, single photoionization of He-like B$^{3+}$ ions was investigated in the energy range from approximately 250 to 1200~eV. With the parent-ion beam in the experiment containing both $1s^2~^1S$ ground-state and $1s2s~^3S$ metastable B$^{3+}$ ions, double-core-hole resonances could be studied. Two series of hollow resonant states were observed, one populated by $K$-shell double excitation $1s^2~^1S \to 2\ell n\ell'~^1P$ ($\ell=s,p$; $\ell'=p,s$; $n=2,3,\ldots,6$) at photon energies up to about 510~eV, the other by $K$-shell single excitation $1s2s~^3S \to 2\ell n\ell'~^3P$ ($\ell=s,p$; $\ell'=p,s$; $n=2,3,\ldots,6$) at energies up to about 310~eV. High resolving powers up to approximately 29000 were achieved. Relativistic many-body perturbation theory was employed to determine level-to-level cross sections for $K$-shell excitation with subsequent autoionization. The resonance energies were calculated with inclusion of electron correlation and radiative contributions. The energy uncertainties of the most prominent resonances are estimated to be below $\pm 1$ meV. Convergent close-coupling (CCC) calculations provided single-photoionization cross sections $σ_{34}$ for B$^{3+}$ including the resonant and non-resonant channels. Apart from the resonances, $σ_{34}$ is dominated by direct ionization in the investigated energy range. The contribution $σ_{34}^{\mathrm{dir}}$ of the latter process to $σ_{34}$ was separately determined by using the random-phase approximation with exchange and relativistic Hartree-Fock calculations, which agree very well with previous calculations. Direct ionization of one electron accompanied by excitation of the remaining electron was treated by the CCC theory and found to be a minor contribution to $σ_{34}$.
Submitted 6 January, 2025;
originally announced January 2025.
-
Analytically Informed Inverse Kinematics Solution at Singularities
Authors:
Andreas Mueller
Abstract:
Near kinematic singularities of a serial manipulator, the inverse kinematics (IK) problem becomes ill-conditioned, which poses computational problems for the numerical solution. Computational methods to tackle this issue are based on various forms of a pseudoinverse (PI) solution to the velocity IK problem. The damped least squares (DLS) method provides a robust solution with a controllable convergence rate. However, at singularities, it may not even be possible to solve the IK problem using any PI solution when certain end-effector motions are prescribed. To overcome this problem, an analytically informed inverse kinematics (AI-IK) method is proposed. The key step of the method is an explicit description of the tangent aspect of singular motions (the analytic part) to deduce a perturbation that yields a regular configuration. The latter serves as the start configuration for the iterative solution (the numeric part). Numerical results are reported for a 7-DOF Kuka iiwa.
Submitted 29 December, 2024;
originally announced December 2024.
-
Emittance Minimization for Aberration Correction I: Aberration correction of an electron microscope without knowing the aberration coefficients
Authors:
Desheng Ma,
Steven E. Zeltmann,
Chenyu Zhang,
Zhaslan Baraissov,
Yu-Tsun Shao,
Cameron Duncan,
Jared Maxson,
Auralee Edelen,
David A. Muller
Abstract:
Precise alignment of the electron beam is critical for successful application of scanning transmission electron microscopes (STEM) to understanding materials at the atomic level. Despite the success of aberration correctors, aberration correction is still a complex process. Here we approach aberration correction from the perspective of accelerator physics and show it is equivalent to minimizing the emittance growth of the beam, the span of the phase space distribution of the probe. We train a deep learning model to predict emittance growth from experimentally accessible Ronchigrams. Both simulation and experimental results show the model can capture the emittance variation with aberration coefficients accurately. We further demonstrate the model can act as a fast-executing function for the global optimization of the lens parameters. Our approach enables new ways to quickly quantify and automate aberration correction that takes advantage of the rapid measurements possible with high-speed electron cameras. In part II of the paper, we demonstrate how the emittance metric enables rapid online tuning of the aberration corrector using Bayesian optimization.
Submitted 29 December, 2024;
originally announced December 2024.
-
Emittance Minimization for Aberration Correction II: Physics-informed Bayesian Optimization of an Electron Microscope
Authors:
Desheng Ma,
Steven E. Zeltmann,
Chenyu Zhang,
Zhaslan Baraissov,
Yu-Tsun Shao,
Cameron Duncan,
Jared Maxson,
Auralee Edelen,
David A. Muller
Abstract:
Aberration-corrected Scanning Transmission Electron Microscopy (STEM) has become an essential tool in understanding materials at the atomic scale. However, tuning the aberration corrector to produce a sub-Ångström probe is a complex and time-costly procedure, largely due to the difficulty of precisely measuring the optical state of the system. When measurements are both costly and noisy, Bayesian methods provide rapid and efficient optimization. To this end, we develop a Bayesian approach to fully automate the process by minimizing a new quality metric, beam emittance, which is shown to be equivalent to performing aberration correction. In part I, we derived several important properties of the beam emittance metric and trained a deep neural network to predict beam emittance growth from a single Ronchigram. Here we use this as the black box function for Bayesian Optimization and demonstrate automated tuning of simulated and real electron microscopes. We explore different surrogate functions for the Bayesian optimizer and implement a deep neural network kernel to effectively learn the interactions between different control channels without the need to explicitly measure a full set of aberration coefficients. Both simulation and experimental results show the proposed method outperforms conventional approaches by achieving a better optical state with a higher convergence rate.
Submitted 24 January, 2025; v1 submitted 29 December, 2024;
originally announced December 2024.
-
Dynamics of Parallel Manipulators with Hybrid Complex Limbs -- Modular Modeling and Parallel Computing
Authors:
Andreas Mueller
Abstract:
Parallel manipulators, also called parallel kinematics machines (PKM), enable robotic solutions for highly dynamic handling and machining applications. Safe and accurate design and control necessitate high-fidelity dynamics models. Such modeling approaches have already been presented for PKM with simple limbs (i.e. each limb is a serial kinematic chain). A systematic modeling approach for PKM with complex limbs (i.e. limbs that possess kinematic loops) has not yet been proposed despite the fact that many successful PKM comprise complex limbs. This paper presents a systematic modular approach to the kinematics and dynamics modeling of PKM with complex limbs that are built as a serial arrangement of closed loops. The latter are referred to as hybrid limbs, and can be found in almost all PKM with complex limbs, such as the Delta robot. The proposed method generalizes the formulation for PKM with simple limbs by means of local resolution of loop constraints, which is known as constraint embedding in multibody dynamics. The constituent elements of the method are the kinematic and dynamic equations of motion (EOM), and the inverse kinematics solution of the limbs, i.e. the relation of platform motion and the motion of the limbs. While the approach is conceptually independent of the used kinematics and dynamics formulation, a Lie group formulation is employed for deriving the EOM. The frame invariance of the Lie group formulation is used for devising a modular modeling method where the EOM of a representative limb are used to derive the EOM of the limbs of a particular PKM. The PKM topology is exploited in a parallel computation scheme that shall allow for computationally efficient distributed evaluation of the overall EOM of the PKM. Finally, the method is applied to the IRSBot-2 and a 3\underline{R}R[2RR]R Delta robot, which is presented in detail.
Submitted 18 December, 2024;
originally announced December 2024.
-
A Constraint Embedding Approach for Dynamics Modeling of Parallel Kinematic Manipulators with Hybrid Limbs
Authors:
Andreas Mueller
Abstract:
Parallel kinematic manipulators (PKM) are characterized by closed kinematic loops, due to the parallel arrangement of limbs but also due to the existence of kinematic loops within the limbs. Moreover, many PKM are built with limbs constructed by serially combining kinematic loops. Such limbs are called hybrid, and they form a particular class of complex limbs. Design and model-based control require accurate dynamic PKM models, desirably without model simplifications. Dynamics modeling then necessitates kinematic relations of all members of the PKM, in contrast to the standard kinematics modeling of PKM, where only the forward and inverse kinematics solutions for the manipulator (relating input and output motions) are computed. This becomes more involved for PKM with hybrid limbs. In this paper a modular modeling approach is employed, where limbs are treated separately, and the individual dynamic equations of motion (EOM) are subsequently assembled to the overall model. Key to the kinematic modeling is the constraint resolution for the individual loops within the limbs. This local constraint resolution is a special case of the general \emph{constraint embedding} technique. The proposed method finally allows for a systematic modeling of general PKM. The method is demonstrated for the IRSBot-2, where each limb comprises two independent loops.
Submitted 18 December, 2024;
originally announced December 2024.
-
Polyhedral Control Design: Theory and Methods
Authors:
Boris Houska,
Matthias A. Müller,
Mario E. Villanueva
Abstract:
In this article, we survey the primary research on polyhedral computing methods for constrained linear control systems. Our focus is on the modeling power of convex optimization, used to design set-based robust and optimal controllers. In detail, we review the state-of-the-art techniques for computing geometric structures such as robust control invariant polytopes. Moreover, we survey recent methods for constructing control Lyapunov functions with polyhedral epigraphs as well as the extensive literature on robust model predictive control. The article concludes with a discussion of both the complexity and potential of polyhedral computing methods that rely on large-scale convex optimization.
Submitted 17 December, 2024;
originally announced December 2024.
-
Optimizing pulsed blowing parameters for active separation control in a one-sided diffuser using reinforcement learning
Authors:
Alexandra Müller,
Tobias Schesny,
Ben Steinfurth,
Julien Weiss
Abstract:
Reinforcement learning is employed to optimize the periodic forcing signal of a pulsed blowing system that controls flow separation in a fully-turbulent $Re_θ= 1000$ diffuser flow. Based on the state of the wind tunnel experiment that is determined with wall shear-stress measurements, Proximal Policy Optimization is used to iteratively adjust the forcing signal. Out of the reward functions investigated in this study, the incremental reduction of flow reversal per action is shown to be the most sample efficient. Fewer than 100 episodes are required to find the parameter combination that ensures the highest control authority for a fixed mass flow consumption. Fully consistent with recent studies, the algorithm suggests that the mass flow is used most efficiently when the actuation signal is characterized by a low duty cycle where the pulse duration is small compared to the pulsation period. The results presented in this paper promote the application of reinforcement learning for optimization tasks based on turbulent, experimental data.
Submitted 10 December, 2024;
originally announced December 2024.
-
Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models
Authors:
Michael Hanna,
Aaron Mueller
Abstract:
Autoregressive transformer language models (LMs) possess strong syntactic abilities, often successfully handling phenomena from agreement to NPI licensing. However, the features they use to incrementally process language inputs are not well understood. In this paper, we fill this gap by studying the mechanisms underlying garden path sentence processing in LMs. We ask: (1) Do LMs use syntactic features or shallow heuristics to perform incremental sentence processing? (2) Do LMs represent only one potential interpretation, or multiple? and (3) Do LMs reanalyze or repair their initial incorrect representations? To address these questions, we use sparse autoencoders to identify interpretable features that determine which continuation - and thus which reading - of a garden path sentence the LM prefers. We find that while many important features relate to syntactic structure, some reflect syntactically irrelevant heuristics. Moreover, while most active features correspond to one reading of the sentence, some features correspond to the other, suggesting that LMs assign weight to both possibilities simultaneously. Finally, LMs do not re-use features from garden path sentence processing to answer follow-up questions.
Submitted 6 December, 2024;
originally announced December 2024.
-
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Authors:
Michael Y. Hu,
Aaron Mueller,
Candace Ross,
Adina Williams,
Tal Linzen,
Chengxu Zhuang,
Ryan Cotterell,
Leshem Choshen,
Alex Warstadt,
Ethan Gotlieb Wilcox
Abstract:
The BabyLM Challenge is a community effort to close the data-efficiency gap between human and computational language learners. Participants compete to optimize language model training on a fixed language data budget of 100 million words or less. This year, we released improved text corpora, as well as a vision-and-language corpus to facilitate research into cognitively plausible vision language models. Submissions were compared on evaluation tasks targeting grammatical ability, (visual) question answering, pragmatic abilities, and grounding, among other abilities. Participants could submit to a 10M-word text-only track, a 100M-word text-only track, and/or a 100M-word and image multimodal track. From 31 submissions employing diverse methods, a hybrid causal-masked language model architecture outperformed other approaches. No submissions outperformed the baselines in the multimodal track. In follow-up analyses, we found a strong relationship between training FLOPs and average performance across tasks, and that the best-performing submissions proposed changes to the training data, training objective, and model architecture. This year's BabyLM Challenge shows that there is still significant room for innovation in this setting, in particular for image-text modeling, but community-driven research can yield actionable insights about effective strategies for small-scale language modeling.
Submitted 6 December, 2024;
originally announced December 2024.
-
Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models
Authors:
Andreas Müller,
Denis Lukovnikov,
Jonas Thietke,
Asja Fischer,
Erwin Quiring
Abstract:
Integrating watermarking into the generation process of latent diffusion models (LDMs) simplifies detection and attribution of generated content. Semantic watermarks, such as Tree-Rings and Gaussian Shading, represent a novel class of watermarking techniques that are easy to implement and highly robust against various perturbations. However, our work demonstrates a fundamental security vulnerability of semantic watermarks. We show that attackers can leverage unrelated models, even with different latent spaces and architectures (UNet vs DiT), to perform powerful and realistic forgery attacks. Specifically, we design two watermark forgery attacks. The first imprints a targeted watermark into real images by manipulating the latent representation of an arbitrary image in an unrelated LDM to get closer to the latent representation of a watermarked image. We also show that this technique can be used for watermark removal. The second attack generates new images with the target watermark by inverting a watermarked image and re-generating it with an arbitrary prompt. Both attacks just need a single reference image with the target watermark. Overall, our findings question the applicability of semantic watermarks by revealing that attackers can easily forge or remove these watermarks under realistic conditions.
Submitted 7 June, 2025; v1 submitted 4 December, 2024;
originally announced December 2024.
-
Cosmology and general relativity (GR) in upper secondary school through new targeted teaching materials: a study on student learning and motivation
Authors:
Alice Gasparini,
Andreas Mueller,
Florian Stern,
Laura Weiss
Abstract:
Cosmology and GR remain largely inaccessible to high-school teaching due to the advanced prerequisites to master these topics. Integrating them into upper secondary teaching is a significant challenge that remains unresolved. This contribution reports on an implementation study of a GR and cosmology course for upper secondary school students as part of an educational project launched during the centenary of GR and tested ever since for several years. The course aimed to expand students' knowledge to include current physics topics while highlighting their foundations in areas of classical physics such as Newtonian mechanics, electromagnetism, and waves. Targeted teaching and learning materials are focused on conceptual and qualitative understanding, while systematically combined with a mathematical treatment accessible at the upper secondary level, avoiding oversimplification. A key element is an active learning approach, incorporating activities and tasks such as engaging applications related to current research, reflective exercises, thought experiments, and hands-on tasks. The main research objective was to explore whether a conceptually deep and educationally effective GR and cosmology course could be successfully implemented for non-specialist upper secondary students. A pre-post study assessed both conceptual learning and affective outcomes, including interest, curiosity, self-concept, and perceived relevance of science. Results showed encouraging gains in both learning and motivation, with large to very large effect sizes for conceptual learning of core principles. Additionally, no or small effects of predictors such as gender were observed. We conclude that the integration of GR and cosmology into upper secondary physics teaching, in the form of courses and materials that are engaging, comprehensible, and impactful, is feasible.
Submitted 18 June, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
Online convex optimization for constrained control of nonlinear systems
Authors:
Marko Nonhoff,
Johannes Köhler,
Matthias A. Müller
Abstract:
This paper investigates the problem of controlling nonlinear dynamical systems subject to state and input constraints while minimizing time-varying and a priori unknown cost functions. We propose a modular approach that combines the online convex optimization framework and reference governors to solve this problem. Our method is general in the sense that we do not limit our analysis to a specific choice of online convex optimization algorithm or reference governor. We show that the dynamic regret of the proposed framework is bounded linearly in both the dynamic regret and the path length of the chosen online convex optimization algorithm, even though the online convex optimization algorithm does not account for the underlying dynamics. We prove that a linear bound with respect to the online convex optimization algorithm's dynamic regret is optimal, i.e., cannot be improved upon. Furthermore, for a standard class of online convex optimization algorithms, our proposed framework attains a bound on its dynamic regret that is linear only in the variation of the cost functions, which is known to be an optimal bound. Finally, we demonstrate the implementation and flexibility of the proposed framework by comparing different combinations of online convex optimization algorithms and reference governors to control a nonlinear chemical reactor in a numerical experiment.
Submitted 1 December, 2024;
originally announced December 2024.
-
Recitation tasks revamped? Students' perceptions of smartphone-based experimental and programming tasks in introductory mechanics
Authors:
Simon Zacharias Lahme,
Dominik Dorsel,
Heidrun Heinke,
Pascal Klein,
Andreas Müller,
Christoph Stampfer,
Sebastian Staacks
Abstract:
This exploratory field study investigates the integration of innovative forms of recitation tasks in a first-year introductory mechanics course, focusing on smartphone-based experimental tasks alongside programming and standard recitation tasks. Smartphones, combined with external sensor modules, serve as a gateway enabling students to conduct various low-cost and authentic physics experiments with first-hand data collection outside traditional lab settings. These tasks aim to enhance students' agency in independent physics experimentation and enrich homework assignments by dissolving boundaries between lectures, recitation sessions, and traditional labs, and thereby linking theoretical and experimental aspects of undergraduate physics education. To explore this potential, we implemented and evaluated a sample set of nine smartphone-based experimental tasks and, for comparison, three programming tasks as weekly exercises in a first-year physics course at RWTH Aachen University. We investigated students' perceptions of learning with these new tasks through twelve short surveys involving up to 188 participants. In two additional surveys with 108 and 78 participants, students assessed affective responses to the smartphone-based experimental tasks relative to the programming and standard recitation tasks. Our findings indicate that the smartphone-based experimental tasks were generally well-suited to the students and tended to outperform the programming tasks in terms of perceptions of learning with the tasks and affective responses. Overall, students responded positively to the new experimental tasks, with perceptions comparable to, or only partly below, those of long-established standard recitation tasks. These results suggest that smartphone-based experimental tasks can be successfully integrated into teaching and contribute to refining traditional recitation tasks.
Submitted 19 May, 2025; v1 submitted 20 November, 2024;
originally announced November 2024.
-
Some remarks on the effect of risk sharing and diversification for infinite mean risks
Authors:
Alfred Müller
Abstract:
The basic principle of any version of insurance is the paradigm that exchanging risk by sharing it in a pool is beneficial for the participants. In case of independent risks with a finite mean this is the case for risk averse decision makers. The situation may be very different in case of infinite mean models. In that case it is known that risk sharing may have a negative effect, which is sometimes called the nondiversification trap. This phenomenon is well known for infinite mean stable distributions. In a series of recent papers similar results for infinite mean Pareto and Fréchet distributions have been obtained. We further investigate this property by showing that many of these results can be obtained as special cases of a simple result demonstrating that this holds for any distribution that is more skewed than a Cauchy distribution. We also relate this to the situation of deadly catastrophic risks, where we assume a positive probability for an infinite value. That case gives a very simple intuition why this phenomenon can occur for such catastrophic risks. We also mention several open problems and conjectures in this context.
Submitted 28 March, 2025; v1 submitted 15 November, 2024;
originally announced November 2024.
-
Evaluating Gender Bias in Large Language Models
Authors:
Michael Döll,
Markus Döhring,
Andreas Müller
Abstract:
Gender bias in artificial intelligence has become an important issue, particularly in the context of language models used in communication-oriented applications. This study examines the extent to which Large Language Models (LLMs) exhibit gender bias in pronoun selection in occupational contexts. The analysis evaluates the models GPT-4, GPT-4o, PaLM 2 Text Bison and Gemini 1.0 Pro using a self-generated dataset. The jobs considered include a range of occupations, from those with a significant male presence to those with a notable female concentration, as well as jobs with a relatively equal gender distribution. Three different sentence processing methods were used to assess potential gender bias: masked tokens, unmasked sentences, and sentence completion. In addition, the LLMs suggested names of individuals in specific occupations, which were then examined for gender distribution. The results show a positive correlation between the models' pronoun choices and the gender distribution present in U.S. labor force data. Female pronouns were more often associated with female-dominated occupations, while male pronouns were more often associated with male-dominated occupations. Sentence completion showed the strongest correlation with actual gender distribution, while name generation resulted in a more balanced 'politically correct' gender distribution, albeit with notable variations in predominantly male or female occupations. Overall, the prompting method had a greater impact on gender distribution than the model selection itself, highlighting the complexity of addressing gender bias in LLMs. The findings highlight the importance of prompting in gender mapping.
Submitted 14 November, 2024;
originally announced November 2024.
-
Conceptualization and Quantitative study of Aesthetic and Affective Perception of Pictures in Physics Education
Authors:
Tatjana Zähringer,
Raimund Girwidz,
Andreas Müller
Abstract:
Pictures in physics education go beyond instructional functions and serve affective roles, such as attracting attention, creating fascination, and fostering engagement with the depicted content. Recognizing the importance of these affective functions highlights the need to understand and utilize aesthetic pictures in a research-based educational environment. Prior research suggests that aesthetic and affective attractiveness in pictures enhances enjoyment and engagement with the physics content. This paper offers three main contributions: Firstly, it conceptualizes and presents research-based criteria for selecting pictures perceived as aesthetically pleasing, drawing on insights from psychology and physics education research. Following these criteria, aesthetic pictures related to a given curricular content can be selected. Secondly, the paper applies these criteria to selecting pictures showing geometrical optics. It then delves into an evaluation of students' aesthetic and affective perception of the selected pictures. A validated instrument measured these responses, showing strong reliability (aesthetic perception: $α_C$ = 0.87 [0.85, 0.89]; affective perception: $α_C$ = 0.82 [0.80, 0.85]). Thirdly, it combines decorative and instructional functions in tasks and compares students' perceptions of aesthetic pictures (AP) and classroom experiment pictures (CEP) in junior high school ($N$ = 118), using a crossover design. Results indicated significantly better aesthetic and affective evaluations for APs, with large effect sizes (AP vs. CEP, aesthetic and affective perception: $d$ = 1.05 - 1.56 and 0.85 - 1.48, respectively). We conclude that the criteria developed and investigated here are useful for selecting aesthetic and affective pictures. This provides a basis for further leveraging their educational potential to create fascination and engagement in science education.
Submitted 7 November, 2024;
originally announced November 2024.
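The reliability values $α_C$ quoted in the abstract above are Cronbach's alpha coefficients. As a minimal sketch of how such a coefficient is computed from raw ratings (the function name, the data layout, and the toy numbers are assumptions for illustration, not the authors' instrument or data):

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a list of respondents, each a list of k item ratings.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(item_scores[0])
    # Transpose so that each entry holds all respondents' ratings of one item.
    items = list(zip(*item_scores))
    sum_item_variances = sum(variance(column) for column in items)
    total_variance = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum_item_variances / total_variance)


# Hypothetical toy ratings: three respondents, two items.
toy_ratings = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy_ratings)
```

The sketch uses sample variances throughout; it does not reproduce the confidence intervals reported in the abstract.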
-
KIT Superconducting Undulator Development -- Story of a successful industrial collaboration & future prospects
Authors:
B. Krasch,
A. Bernhard,
E. Bründermann,
S. Fatehi,
J. Gethmann,
N. Glamann,
A. Grau,
A. Hobl,
A. -S. Müller,
D. Saez de Jauregui,
E. Tan,
W. Walter
Abstract:
Undulators are X-ray sources widely used in synchrotron storage rings and free-electron laser facilities. With the commercial availability of low-temperature superconductors, a new type of undulator was born, the superconducting undulator (SCU). In this context, the industrial cooperation between the Karlsruhe Institute of Technology and Bilfinger Nuclear and Energy Transition GmbH started more than 15 years ago. Since then, many projects have been successfully completed, leading to the production of the world's leading full-scale commercial SCUs based on conduction cooling. These began with the SCU15, the first SCU of its kind installed to provide light to a beamline, and continued with the SCU20, which is installed and still in operation at the Karlsruhe Research Accelerator. The successful realisation of such SCUs has required the simultaneous development of appropriate measurement facilities such as CASPER I and CASPER II.
Submitted 5 November, 2024; v1 submitted 4 November, 2024;
originally announced November 2024.
-
Characterizing the Role of Similarity in the Property Inferences of Language Models
Authors:
Juan Diego Rodriguez,
Aaron Mueller,
Kanishka Misra
Abstract:
Property inheritance -- a phenomenon where novel properties are projected from higher level categories (e.g., birds) to lower level ones (e.g., sparrows) -- provides a unique window into how humans organize and deploy conceptual knowledge. It is debated whether this ability arises due to explicitly stored taxonomic knowledge vs. simple computations of similarity between mental representations. How are these mechanistic hypotheses manifested in contemporary language models? In this work, we investigate how LMs perform property inheritance with behavioral and causal representational analysis experiments. We find that taxonomy and categorical similarities are not mutually exclusive in LMs' property inheritance behavior. That is, LMs are more likely to project novel properties from one category to the other when they are taxonomically related and at the same time, highly similar. Our findings provide insight into the conceptual structure of language models and may suggest new psycholinguistic experiments for human subjects.
Submitted 9 March, 2025; v1 submitted 29 October, 2024;
originally announced October 2024.
-
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
Authors:
Yaniv Nikankin,
Anja Reusch,
Aaron Mueller,
Yonatan Belinkov
Abstract:
Do large language models (LLMs) solve reasoning tasks by learning robust generalizable algorithms, or do they memorize training data? To investigate this question, we use arithmetic reasoning as a representative task. Using causal analysis, we identify a subset of the model (a circuit) that explains most of the model's behavior for basic arithmetic logic and examine its functionality. By zooming in on the level of individual circuit neurons, we discover a sparse set of important neurons that implement simple heuristics. Each heuristic identifies a numerical input pattern and outputs corresponding answers. We hypothesize that the combination of these heuristic neurons is the mechanism used to produce correct arithmetic answers. To test this, we categorize each neuron into several heuristic types (such as neurons that activate when an operand falls within a certain range) and find that the unordered combination of these heuristic types is the mechanism that explains most of the model's accuracy on arithmetic prompts. Finally, we demonstrate that this mechanism appears as the main source of arithmetic accuracy early in training. Overall, our experimental results across several LLMs show that LLMs perform arithmetic using neither robust algorithms nor memorization; rather, they rely on a "bag of heuristics".
Submitted 20 May, 2025; v1 submitted 28 October, 2024;
originally announced October 2024.
-
An abstract structure determines the contextuality degree of observable-based Kochen-Specker proofs
Authors:
Axel Muller,
Alain Giorgetti
Abstract:
This article delves into the concept of quantum contextuality, specifically focusing on proofs of the Kochen-Specker theorem obtained by assigning Pauli observables to hypergraph vertices satisfying a given commutation relation. The abstract structure composed of this hypergraph and the graph of anticommutations is named a hypergram. Its labelings with Pauli observables generalize the well-known magic sets. A first result is that all these quantum labelings satisfying the conditions of a given hypergram inherently possess the same degree of contextuality. Then we provide a necessary and sufficient algebraic condition for the existence of such quantum labelings and an efficient algorithm to find one of them. We finally attach to each assignable hypergram an abstract notion of contextuality degree. By presenting the study of observable-based Kochen-Specker proofs from the perspective of graphs and matrices, this abstraction opens the way to new methods to search for original contextual configurations.
Submitted 18 October, 2024;
originally announced October 2024.
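The commutation relations between Pauli observables mentioned in the abstract above can be checked mechanically. The following is a minimal sketch, assuming observables are encoded as strings over `I`, `X`, `Y`, `Z` (an encoding chosen here for illustration, not taken from the paper). It uses the standard fact that two Pauli strings commute exactly when they differ, with both letters non-identity, at an even number of positions, and applies it to the Mermin-Peres magic square as a familiar example of a magic set.

```python
from itertools import combinations


def commute(p, q):
    """Return True if two equal-length Pauli strings commute."""
    # Single-qubit Paulis anticommute when both are non-identity and different.
    anticommuting_positions = sum(
        1 for a, b in zip(p, q) if a != "I" and b != "I" and a != b
    )
    return anticommuting_positions % 2 == 0


# Mermin-Peres magic square on two qubits (rows and columns are the contexts).
MAGIC_SQUARE = [
    ["XI", "IX", "XX"],
    ["IZ", "ZI", "ZZ"],
    ["XZ", "ZX", "YY"],
]


def contexts(square):
    """List the rows and columns of the square as contexts."""
    rows = [list(row) for row in square]
    columns = [list(column) for column in zip(*square)]
    return rows + columns


def all_contexts_commute(square):
    """Check that every pair of observables sharing a context commutes."""
    return all(
        commute(p, q)
        for context in contexts(square)
        for p, q in combinations(context, 2)
    )
```

This only verifies the commutation structure of one labeling; it does not implement the paper's existence condition or its algorithm for finding quantum labelings.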
-
Gravitational memory contributions to waveform and effective action
Authors:
Gabriel Luz Almeida,
Alan Müller,
Stefano Foffa,
Riccardo Sturani
Abstract:
We use Effective Field Theory techniques to derive the quadrupole-quadrupole part of the gravitational wave, obtaining a waveform in agreement with previous results found within the multipolar-post-Minkowskian method. In particular we emphasize the role of radiation-reaction terms, which affect the energy-momentum balance between source and radiation. An in-in effective action is then derived along the same principles and it is shown to provide energy and angular momentum balance equations in agreement with the corresponding fluxes carried at infinity by gravitational radiation.
Submitted 14 October, 2024;
originally announced October 2024.
-
Reversible long-range domain wall motion in an improper ferroelectric
Authors:
M. Zahn,
A. M. Müller,
K. P. Kelley,
S. M. Neumayer,
S. V. Kalinin,
I. Kézsmarki,
M. Fiebig,
Th. Lottermoser,
N. Domingo,
D. Meier,
J. Schultheiß
Abstract:
Reversible ferroelectric domain wall movements beyond the 10 nm range associated with Rayleigh behavior are usually restricted to specific defect-engineered systems. Here, we demonstrate that such long-range movements naturally occur in the improper ferroelectric ErMnO3 during electric-field-cycling. We study the electric-field-driven motion of domain walls, showing that they readily return to their initial position after having travelled distances exceeding 250 nm. By applying switching spectroscopy band-excitation piezoresponse force microscopy, we track the domain wall movement with nanometric spatial precision and analyze the local switching behavior. Phase field simulations show that the reversible long-range motion is intrinsic to the hexagonal manganites, linking it to their improper ferroelectricity and topologically protected structural vortex lines, which serve as anchor points for the ferroelectric domain walls. Our results give new insight into the local dynamics of domain walls in improper ferroelectrics and demonstrate the possibility to reversibly displace domain walls over much larger distances than commonly expected for ferroelectric systems in their pristine state, ensuring predictable device behavior for applications such as tunable capacitors or sensors.
Submitted 13 October, 2024;
originally announced October 2024.
-
Lattice-Matched Multiple Channel AlScN/GaN Heterostructures
Authors:
Thai-Son Nguyen,
Naomi Pieczulewsi,
Chandrashekhar Savant,
Joshua J. P. Cooper,
Joseph Casamento,
Rachel S. Goldman,
David A. Muller,
Huili G. Xing,
Debdeep Jena
Abstract:
AlScN is a new wide bandgap, high-k, ferroelectric material for RF, memory, and power applications. Successful integration of high quality AlScN with GaN in epitaxial layer stacks depends strongly on the ability to control lattice parameters and surface or interface through growth. This study investigates the molecular beam epitaxy growth and transport properties of AlScN/GaN multilayer heterostructures. Single layer Al$_{1-x}$Sc$_x$N/GaN heterostructures exhibited lattice-matched composition within $x$ = 0.09 -- 0.11 using substrate (thermocouple) growth temperatures between 330 $ ^\circ$C and 630 $ ^\circ$C. By targeting the lattice-matched Sc composition, pseudomorphic AlScN/GaN multilayer structures with ten and twenty periods were achieved, exhibiting excellent structural and interface properties as confirmed by X-ray diffraction (XRD) and scanning transmission electron microscopy (STEM). These multilayer heterostructures exhibited substantial polarization-induced net mobile charge densities of up to 8.24 $\times$ 10$^{14}$/cm$^2$ for twenty channels. The sheet density scales with the number of AlScN/GaN periods. By identifying lattice-matched growth conditions and using them to generate multiple conductive channels, this work enhances our understanding of the AlScN/GaN material platform.
Submitted 11 October, 2024;
originally announced October 2024.
-
Unclonable Functional Encryption
Authors:
Arthur Mehta,
Anne Müller
Abstract:
In a functional encryption (FE) scheme, a user that holds a ciphertext and a function key can learn the result of applying the function to the plaintext message. Security requires that the user does not learn anything beyond the function evaluation. We extend this notion to the quantum setting by providing definitions and a construction for a quantum functional encryption (QFE) scheme which allows for the evaluation of polynomially-sized circuits on arbitrary quantum messages. Our construction is built upon quantum garbled circuits [BY22]. We also investigate the relationship of QFE to the seemingly unrelated notion of unclonable encryption (UE) and find that any QFE scheme universally achieves the property of unclonable functional encryption (UFE). In particular we assume the existence of an unclonable encryption scheme with quantum decryption keys which was recently constructed by [AKY24]. Our UFE guarantees that two parties cannot simultaneously recover the correct function outputs using two independently sampled function secret keys. As an application we give the first construction for public-key UE with variable decryption keys. Lastly, we establish a connection between quantum indistinguishability obfuscation (qiO) and quantum functional encryption (QFE), showing that any multi-input indistinguishability-secure quantum functional encryption scheme unconditionally implies the existence of qiO.
Submitted 14 March, 2025; v1 submitted 8 October, 2024;
originally announced October 2024.
-
GAMformer: In-Context Learning for Generalized Additive Models
Authors:
Andreas Mueller,
Julien Siems,
Harsha Nori,
David Salinas,
Arber Zela,
Rich Caruana,
Frank Hutter
Abstract:
Generalized Additive Models (GAMs) are widely recognized for their ability to create fully interpretable machine learning models for tabular data. Traditionally, training GAMs involves iterative learning algorithms, such as splines, boosted trees, or neural networks, which refine the additive components through repeated error reduction. In this paper, we introduce GAMformer, the first method to leverage in-context learning to estimate shape functions of a GAM in a single forward pass, representing a significant departure from the conventional iterative approaches to GAM fitting. Building on previous research applying in-context learning to tabular data, we exclusively use complex, synthetic data to train GAMformer, yet find it extrapolates well to real-world data. Our experiments show that GAMformer performs on par with other leading GAMs across various classification benchmarks while generating highly interpretable shape functions.
Submitted 6 October, 2024;
originally announced October 2024.
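The interpretability claimed for GAMs in the abstract above comes from their additive form: a prediction is an intercept plus one shape function per feature, passed through a link function. The following is a minimal sketch of that structure for binary classification, assuming piecewise-constant shape functions stored as bin edges and bin values (a hypothetical representation chosen for illustration; it is not GAMformer's in-context estimation procedure).

```python
import math
from bisect import bisect_right


def shape_value(bin_edges, bin_values, x):
    """Evaluate a piecewise-constant shape function at x.

    bin_values must have len(bin_edges) + 1 entries, one per interval.
    """
    return bin_values[bisect_right(bin_edges, x)]


def gam_predict_proba(intercept, shapes, row):
    """Additive score through a logistic link: sigmoid(b + sum_j f_j(x_j))."""
    score = intercept + sum(
        shape_value(edges, values, x) for (edges, values), x in zip(shapes, row)
    )
    return 1.0 / (1.0 + math.exp(-score))


# Hypothetical two-feature model: each entry is (bin_edges, bin_values).
toy_shapes = [
    ([0.0], [-1.0, 1.0]),
    ([10.0, 20.0], [0.0, 0.5, -0.5]),
]
toy_probability = gam_predict_proba(0.0, toy_shapes, [1.0, 15.0])
```

Because each feature contributes through its own one-dimensional function, every `shape_value` curve can be plotted and inspected on its own.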
-
Superconductivity in the parent infinite-layer nickelate NdNiO$_2$
Authors:
C. T. Parzyck,
Y. Wu,
L. Bhatt,
M. Kang,
Z. Arthur,
T. M. Pedersen,
R. Sutarto,
S. Fan,
J. Pelliciari,
V. Bisogni,
G. Herranz,
A. B. Georgescu,
D. G. Hawthorn,
L. F. Kourkoutis,
D. A. Muller,
D. G. Schlom,
K. M. Shen
Abstract:
We report evidence for superconductivity with onset temperatures up to 11 K in thin films of the infinite-layer nickelate parent compound NdNiO$_2$. A combination of oxide molecular-beam epitaxy and atomic hydrogen reduction yields samples with high crystallinity and low residual resistivities, a substantial fraction of which exhibit superconducting transitions. We survey a large series of samples with a variety of techniques, including electrical transport, scanning transmission electron microscopy, x-ray absorption spectroscopy, and resonant inelastic x-ray scattering, to investigate the possible origins of superconductivity. We propose that superconductivity could be intrinsic to the undoped infinite-layer nickelates but suppressed by disorder due to its nodal order parameter, a finding which would necessitate a reconsideration of the nickelate phase diagram. Another possible hypothesis is that the parent materials can be hole doped from randomly dispersed apical oxygen atoms, which would suggest an alternative pathway for achieving superconductivity.
Submitted 2 October, 2024;
originally announced October 2024.
-
Search for proton decay via $p\rightarrow{e^+η}$ and $p\rightarrow{μ^+η}$ with a 0.37 Mton-year exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
N. Taniuchi,
K. Abe,
S. Abe,
Y. Asaoka,
C. Bronner,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
R. Kaneshima,
Y. Kashiwagi,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi
, et al. (267 additional authors not shown)
Abstract:
A search for proton decay into $e^+/μ^+$ and an $η$ meson has been performed using data from a 0.373 Mton$\cdot$year exposure (6050.3 live days) of Super-Kamiokande. Compared to previous searches this work introduces an improved model of the intranuclear $η$ interaction cross section, resulting in a factor of two reduction in uncertainties from this source and $\sim$10\% increase in signal efficiency. No significant data excess was found above the expected number of atmospheric neutrino background events resulting in no indication of proton decay into either mode. Lower limits on the proton partial lifetime of $1.4\times\mathrm{10^{34}~years}$ for $p\rightarrow e^+η$ and $7.3\times\mathrm{10^{33}~years}$ for $p\rightarrow μ^+η$ at the 90$\%$ C.L. were set. These limits are around 1.5 times longer than our previous study and are the most stringent to date.
Submitted 29 September, 2024;
originally announced September 2024.
-
Robust and efficient data-driven predictive control
Authors:
Mohammad Alsalti,
Manuel Barkey,
Victor G. Lopez,
Matthias A. Müller
Abstract:
We propose a robust and efficient data-driven predictive control (eDDPC) scheme which is more sample efficient (requires less offline data) compared to existing schemes, and is also computationally efficient. This is done by leveraging an alternative data-based representation of the trajectories of linear time-invariant (LTI) systems. The proposed scheme relies only on using (short and potentially irregularly measured) noisy input-output data, the amount of which is independent of the prediction horizon. To account for measurement noise, we provide a novel result that quantifies the uncertainty between the true (unknown) restricted behavior of the system and the estimated one from noisy data. Furthermore, we show that the robust eDDPC scheme is recursively feasible and that the resulting closed-loop system is practically stable. Finally, we compare the performance of this scheme to existing ones on a case study of a four tank system.
Submitted 27 September, 2024;
originally announced September 2024.
-
Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware
Authors:
Luca Scomparin,
Michele Caselle,
Andrea Santamaria Garcia,
Chenran Xu,
Edmund Blomley,
Timo Dritschler,
Akira Mochihashi,
Marcel Schuh,
Johannes L. Steinmann,
Erik Bründermann,
Andreas Kopmann,
Jürgen Becker,
Anke-Susanne Müller,
Marc Weber
Abstract:
The commissioning and operation of future large-scale scientific experiments will challenge current tuning and control methods. Reinforcement learning (RL) algorithms are a promising solution thanks to their capability of autonomously tackling a control problem based on a task parameterized by a reward function. The conventionally utilized machine learning (ML) libraries are not intended for microsecond latency applications, as they mostly optimize for throughput performance. On the other hand, most of the programmable logic implementations are meant for computation acceleration, not being intended to work in a real-time environment. To overcome these limitations of current implementations, RL needs to be deployed on-the-edge, i.e. onto the device gathering the training data. In this paper we present the design and deployment of an experience accumulator system in a particle accelerator. In this system deep-RL algorithms run using hardware acceleration and act within a few microseconds, enabling the use of RL for control of ultra-fast phenomena. The training is performed offline to reduce the number of operations carried out on the acceleration hardware. The proposed architecture was tested in real experimental conditions at the Karlsruhe research accelerator (KARA), serving also as a synchrotron light source, where the system was used to control induced horizontal betatron oscillations in real-time. The results showed a performance comparable to the commercial feedback system available at the accelerator, proving the viability and potential of this approach. Due to the self-learning and reconfiguration capability of this implementation, its seamless application to other control problems is possible. Applications range from particle accelerators to large-scale research and industrial facilities.
Submitted 24 September, 2024;
originally announced September 2024.
-
Optimal state estimation: Turnpike analysis and performance results
Authors:
Julian D. Schiller,
Lars Grüne,
Matthias A. Müller
Abstract:
In this paper, we introduce turnpike arguments in the context of optimal state estimation. In particular, we show that the optimal solution of the state estimation problem involving all available past data serves as a turnpike for the solutions of truncated problems involving only a subset of the data. We consider two different mathematical characterizations of this phenomenon and provide corresponding sufficient conditions that rely on strict dissipativity and decaying sensitivity. As a second contribution, we show how a specific turnpike property can be used to establish performance guarantees when approximating the optimal solution of the full problem by a sequence of truncated problems, and we show that the resulting performance (both averaged and non-averaged) is approximately optimal with error terms that can be made arbitrarily small by an appropriate choice of the horizon length. In addition, we discuss interesting implications of these results for the practically relevant case of moving horizon estimation and illustrate our results with a numerical example.
Submitted 23 September, 2024;
originally announced September 2024.
-
Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling
Authors:
Arthur Müller,
Lukas Vollenkemper
Abstract:
The integration of Reinforcement Learning (RL) with heuristic methods is an emerging trend for solving optimization problems, which leverages RL's ability to learn from the data generated during the search process. One promising approach is to train an RL agent as an improvement heuristic, starting with a suboptimal solution that is iteratively improved by applying small changes. We apply this approach to a real-world multiobjective production scheduling problem. Our approach utilizes a network architecture that includes Transformer encoding to learn the relationships between jobs. Afterwards, a probability matrix is generated from which pairs of jobs are sampled and then swapped to improve the solution. We benchmarked our approach against other heuristics using real data from our industry partner, demonstrating its superior performance.
Submitted 18 September, 2024;
originally announced September 2024.
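The swap-based improvement step described in the abstract above (sample a pair of jobs from a probability matrix, swap them, keep improving the solution) can be sketched as follows. This is an illustration under stated assumptions, not the authors' method: the probability matrix is a fixed hypothetical input rather than the output of a trained Transformer-based policy, the cost is a single toy objective (sum of completion times) rather than the paper's multiobjective criterion, the sampled indices are treated as schedule positions, and the greedy acceptance rule is an assumption.

```python
import random


def sample_pair(prob_matrix, rng):
    """Sample an ordered pair (i, j), i != j, with weight prob_matrix[i][j]."""
    n = len(prob_matrix)
    pairs = [(i, j) for i in range(n) for j in range(n) if i != j]
    weights = [prob_matrix[i][j] for i, j in pairs]
    return rng.choices(pairs, weights=weights, k=1)[0]


def total_completion_time(schedule, durations):
    """Toy objective: sum of job completion times on a single machine."""
    elapsed = 0
    total = 0
    for job in schedule:
        elapsed += durations[job]
        total += elapsed
    return total


def improve(schedule, durations, prob_matrix, steps, seed=0):
    """Repeatedly swap sampled positions, keeping only strict improvements."""
    rng = random.Random(seed)
    best = list(schedule)
    best_cost = total_completion_time(best, durations)
    for _ in range(steps):
        i, j = sample_pair(prob_matrix, rng)
        candidate = list(best)
        candidate[i], candidate[j] = candidate[j], candidate[i]
        cost = total_completion_time(candidate, durations)
        if cost < best_cost:
            best, best_cost = candidate, cost
    return best, best_cost
```

In the approach the abstract describes, the matrix would be produced by the learned network at each step; here a uniform matrix such as `[[0, 1, 1], [1, 0, 1], [1, 1, 0]]` can stand in for it.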
-
Anionic disorder and its impact on the surface electronic structure of oxynitride photoactive semiconductors
Authors:
Anna Hartl,
Ján Minár,
Procopios Constantinou,
Vladimir Roddatis,
Fatima Alarab,
Arnold M. Müller,
Christof Vockenhuber,
Thorsten Schmitt,
Daniele Pergolesi,
Thomas Lippert,
Vladimir N. Strocov,
Nick A. Shepelin
Abstract:
The conversion of solar energy into chemical energy, stored in the form of hydrogen, bears enormous potential as a sustainable fuel for powering emerging technologies. Photoactive oxynitrides are promising materials for splitting water into molecular oxygen and hydrogen. However, one of the issues limiting widespread commercial use of oxynitrides is the degradation during operation. While recent studies have shown the loss of nitrogen, its relation to the reduced efficiency has not been directly and systematically addressed with experiments. In this study, we demonstrate the impact of the anionic stoichiometry of BaTaO$_x$N$_y$ on its electronic structure and functional properties. Through experimental ion scattering, electron microscopy, and photoelectron spectroscopy investigations, we determine the anionic composition ranging from the bulk towards the surface of BaTaO$_x$N$_y$ thin films. This further serves as input for band structure computations modeling the substitutional disorder of the anion sites. Combining our experimental and computational approaches, we reveal the depth-dependent elemental composition of oxynitride films, resulting in downward band bending and the loss of semiconducting character towards the surface. Extending beyond idealized systems, we demonstrate the relation between the electronic properties of real oxynitride photoanodes and their performance, providing guidelines for engineering highly efficient photoelectrodes and photocatalysts for clean hydrogen production.
Submitted 18 September, 2024;
originally announced September 2024.
-
The SST-1M imaging atmospheric Cherenkov telescope for gamma-ray astrophysics
Authors:
C. Alispach,
A. Araudo,
M. Balbo,
V. Beshley,
A. Biland,
J. Blažek,
J. Borkowski,
T. Bulik,
F. Cadoux,
S. Casanova,
A. Christov,
J. Chudoba,
L. Chytka,
P. Dědič,
D. della Volpe,
Y. Favre,
M. Garczarczyk,
L. Gibaud,
T. Gieras,
P. Hamal,
M. Heller,
M. Hrabovský,
P. Janeček,
M. Jelínek,
V. Jílek, et al. (41 additional authors not shown)
Abstract:
The SST-1M is a Small-Sized Telescope (SST) designed to provide a cost-effective and high-performance solution for gamma-ray astrophysics, particularly for energies beyond a few TeV. The goal is to integrate this telescope into an array of similar instruments, leveraging its lightweight design, earthquake resistance, and established Davies-Cotton configuration. Additionally, its optical system is designed to function without a protective dome, allowing it to withstand the harsh atmospheric conditions typical of mountain environments above 2000 m. The SST-1M uses a fully digitizing camera system based on silicon photomultipliers (SiPMs). This camera digitizes all signals from the UV-optical light detectors, allowing for the implementation of various triggers and data analysis methods. We detail the process of designing, prototyping, and validating this system, ensuring that it meets the stringent requirements for gamma-ray detection and performance. An SST-1M stereo system is currently operational and collecting data at the Ondřejov observatory in the Czech Republic, situated at an altitude of 500 m. Preliminary results from this system are promising. A forthcoming paper will provide a comprehensive analysis of the performance of the telescopes in detecting gamma rays and operating under real-world conditions.
Submitted 17 March, 2025; v1 submitted 17 September, 2024;
originally announced September 2024.