-
An operator algebraic approach to fusion category symmetry on the lattice
Authors:
David E. Evans,
Corey Jones
Abstract:
We propose a framework for fusion category symmetry on the (1+1)D lattice in the thermodynamic limit by giving a formal interpretation of SymTFT decompositions. Our approach is based on axiomatizing physical boundary subalgebra of quasi-local observables, and applying ideas from algebraic quantum field theory to derive the expected categorical structures. We show that given a physical boundary sub…
▽ More
We propose a framework for fusion category symmetry on the (1+1)D lattice in the thermodynamic limit by giving a formal interpretation of SymTFT decompositions. Our approach is based on axiomatizing physical boundary subalgebra of quasi-local observables, and applying ideas from algebraic quantum field theory to derive the expected categorical structures. We show that given a physical boundary subalgebra $B$ of a quasi-local algebra $A$, there is a canonical fusion category $\mathcal{C}$ that acts on $A$ by bimodules and whose fusion ring acts by locality preserving quantum channels on the quasi-local algebra such that $B$ is recovered as the invariant operators. We show that a fusion category can be realized as symmetries of a tensor product spin chain if and only if all of its objects have integer dimensions, and that it admits an on-site action on a tensor product spin chain if and only if it admits a fiber functor. We give a formal definition of a topological symmetric state, and prove a Lieb-Schultz-Mattis type theorem. Using this, we show that for any fusion category $\mathcal{C}$ with no fiber functor there always exists gapless pure symmetric states on an anyon chain. Finally, we apply our framework to show that any state covariant under an anomalous Kramers-Wannier type duality must be gapless.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Strain-Gradient and Curvature-Induced Changes in Domain Morphology of BaTiO3 Nanorods: Experimental and Theoretical Studies
Authors:
Olha A. Kovalenko,
Eugene A. Eliseev,
Yuriy O. Zagorodniy,
Srečo Davor Škapin,
Marjeta Maček Kržmanc,
Lesya Demchenko,
Valentyn V. Laguta,
Zdravko Kutnjak,
Dean R. Evans,
Anna N. Morozovska
Abstract:
We investigate the impact of OH- ions incorporation on the lattice strain and spontaneous polarization of BaTiO3 nanorods synthesized under different conditions. It was confirmed that the lattice strain depends directly on Ba supersaturation, with higher supersaturation leading to an increase in the lattice strain. However, it was shown that crystal growth and observed lattice distortion are not p…
▽ More
We investigate the impact of OH- ions incorporation on the lattice strain and spontaneous polarization of BaTiO3 nanorods synthesized under different conditions. It was confirmed that the lattice strain depends directly on Ba supersaturation, with higher supersaturation leading to an increase in the lattice strain. However, it was shown that crystal growth and observed lattice distortion are not primarily influenced by external strain; rather, OH- ions incorporation plays a key role in generating internal chemical strains and driving these processes. By using the less reactive TiO2 precursor instead of TiOCl2 and controlling Ba supersaturation, the slower nucleation rate enables more effective regulation of OH- ions incorporation and crystal growth. This in turn effects both particle size and lattice distortion, leading to c/a ratio of 1.013 - 1.014. The incorporation of OH- ions induces lattice elongation along the c-axis, contributing to anisotropic growth, increasing of the rod diameter and their growth-induced bending. However, the possibility of the curvature-induced changes in domain morphology of BaTiO3 nanorods remains almost unexplored. To study the possibility, we perform analytical calculations and finite element modeling, which provide insights into the curvature-induced changes in the strain-gradient, polarization distribution, and domain morphology in BaTiO3 nanorods. Theoretical results reveal the appearance of the domain stripes in BaTiO3 nanorod when the curvature exceeds a critical angle. The physical origin of the domain stripes emergence is the tendency to minimize its elastic energy of the nanorod by the domain splitting. These findings suggest that BaTiO3 nanorods, with curvature-controllable amount of domain stripes, could serve as flexible race-track memory elements for flexo-tronics and domain-wall electronics.
△ Less
Submitted 18 May, 2025; v1 submitted 14 May, 2025;
originally announced May 2025.
-
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control
Authors:
Hannah Cyberey,
David Evans
Abstract:
Large language models (LLMs) have transformed the way we access information. These models are often tuned to refuse to comply with requests that are considered harmful and to produce responses that better align with the preferences of those who control the models. To understand how this "censorship" works. We use representation engineering techniques to study open-weights safety-tuned models. We p…
▽ More
Large language models (LLMs) have transformed the way we access information. These models are often tuned to refuse to comply with requests that are considered harmful and to produce responses that better align with the preferences of those who control the models. To understand how this "censorship" works. We use representation engineering techniques to study open-weights safety-tuned models. We present a method for finding a refusal--compliance vector that detects and controls the level of censorship in model outputs. We also analyze recent reasoning LLMs, distilled from DeepSeek-R1, and uncover an additional dimension of censorship through "thought suppression". We show a similar approach can be used to find a vector that suppresses the model's reasoning process, allowing us to remove censorship by applying the negative multiples of this vector. Our code is publicly available at: https://github.com/hannahxchen/llm-censorship-steering
△ Less
Submitted 26 April, 2025; v1 submitted 23 April, 2025;
originally announced April 2025.
-
The Role of Flexoelectric Coupling and Chemical Strains in the Emergence of Polar Chiral Nano-Structures
Authors:
Anna N. Morozovska,
Salia Cherifi-Hertel,
Eugene A. Eliseev,
Victoria V. Khist,
Riccardo Hertel,
Dean R. Evans
Abstract:
This review examines the conditions that lead to the formation of flexo-sensitive chiral polar structures in thin films and core-shell ferroelectric nanoparticles. It also analyzes possible mechanisms by which the flexoelectric effect impacts the polarization structure in core-shell ferroelectric nanoparticles. Special attention is given to the role of the anisotropic flexoelectric effect in formi…
▽ More
This review examines the conditions that lead to the formation of flexo-sensitive chiral polar structures in thin films and core-shell ferroelectric nanoparticles. It also analyzes possible mechanisms by which the flexoelectric effect impacts the polarization structure in core-shell ferroelectric nanoparticles. Special attention is given to the role of the anisotropic flexoelectric effect in forming a unique type of polarization states with distinct chiral properties, referred to as "flexons". In the first part of the review, we study the influence of the flexoelectric coupling on the polarity, chirality and branching of metastable labyrinthine domain structures in uniaxial ferroelectric core-shell nanoparticles. We reveal that the transition from sinuous branched domain stripes to spiral-like domains occurs gradually as the flexoelectric coupling strength is increased. Our findings indicate that the joint action of flexoelectric effect and chemical strains, termed as "flexo-chemical" coupling, can significantly influence the effective Curie temperature, polarization distribution, domain morphology, and chirality in multiaxial ferroelectric core-shell nanoparticles. Furthermore, we demonstrate that the combination of flexo-chemical coupling and screening effects leads to the appearance and stabilization of a chiral polarization morphology in nanoflakes of van der Waals ferrielectrics. In the second part of the review, we discuss several advanced applications of flexo-sensitive chiral polar structures in core-shell ferroelectric nanoparticles for nanoelectronics elements and cryptography. We underline the possibilities of the flexoelectric control of multiple-degenerated labyrinthine states, which may correspond to a differential negative capacitance (NC) state stabilized in the uniaxial ferroelectric core by the presence of a screening shell.
△ Less
Submitted 28 May, 2025; v1 submitted 19 April, 2025;
originally announced April 2025.
-
A Prototype Atom Interferometer to Detect Dark Matter and Gravitational Waves
Authors:
C. F. A. Baynham,
R. Hobson,
O. Buchmueller,
D. Evans,
L. Hawkins,
L. Iannizzotto-Venezze,
A. Josset,
D. Lee,
E. Pasatembou,
B. E. Sauer,
M. R. Tarbutt,
T. Walker,
O. Ennis,
U. Chauhan,
A. Brzakalik,
S. Dey,
S. Hedges,
B. Stray,
M. Langlois,
K. Bongs,
T. Hird,
S. Lellouch,
M. Holynski,
B. Bostwick,
J. Chen
, et al. (67 additional authors not shown)
Abstract:
The AION project has built a tabletop prototype of a single-photon long-baseline atom interferometer using the 87Sr clock transition - a type of quantum sensor designed to search for dark matter and gravitational waves. Our prototype detector operates at the Standard Quantum Limit (SQL), producing a signal with no unexpected noise beyond atom shot noise. Importantly, the detector remains at the SQ…
▽ More
The AION project has built a tabletop prototype of a single-photon long-baseline atom interferometer using the 87Sr clock transition - a type of quantum sensor designed to search for dark matter and gravitational waves. Our prototype detector operates at the Standard Quantum Limit (SQL), producing a signal with no unexpected noise beyond atom shot noise. Importantly, the detector remains at the SQL even when additional laser phase noise is introduced, emulating conditions in a long-baseline detector such as AION or AEDGE where significant laser phase deviations will accumulate during long atom interrogation times. Our results mark a key milestone in extending atom interferometers to long baselines. Such interferometers can complement laser-interferometer gravitational wave detectors by accessing the mid-frequency gravitational wave band around 1 Hz, and can search for physics beyond the Standard Model.
△ Less
Submitted 16 April, 2025; v1 submitted 12 April, 2025;
originally announced April 2025.
-
Inferring Events from Time Series using Language Models
Authors:
Mingtian Tan,
Mike A. Merrill,
Zack Gottesman,
Tim Althoff,
David Evans,
Tom Hartvigsen
Abstract:
Time series data measure how environments change over time and drive decision-making in critical domains like finance and healthcare. A common goal in analyzing time series data is to understand the underlying events that cause the observed variations. We conduct the first study of whether Large Language Models (LLMs) can infer events described with natural language from time series data. We evalu…
▽ More
Time series data measure how environments change over time and drive decision-making in critical domains like finance and healthcare. A common goal in analyzing time series data is to understand the underlying events that cause the observed variations. We conduct the first study of whether Large Language Models (LLMs) can infer events described with natural language from time series data. We evaluate 18 LLMs on a task to match event sequences with real-valued time series data using a new benchmark we develop using sports data. Several current LLMs demonstrate promising abilities, with OpenAI's o1 performing the best but with DS-R1-distill-Qwen-32B outperforming proprietary models such as GPT-4o. From insights derived from analyzing reasoning failures, we also find clear avenues to improve performance. By applying post-training optimizations, i.e., distillation and self-improvement, we significantly enhance the performance of the Qwen2.5 1.5B, achieving results second only to o1. All resources needed to reproduce our work are available: https://github.com/BennyTMT/GAMETime
△ Less
Submitted 22 May, 2025; v1 submitted 18 March, 2025;
originally announced March 2025.
-
Numerical methods for unraveling inter-particle potentials in colloidal suspensions: A comparative study for two-dimensional suspensions
Authors:
Clare R. Rees-Zimmerman,
José Martín-Roca,
David Evans,
Mark A. Miller,
Dirk G. A. L. Aarts,
Chantal Valeriani
Abstract:
We compare three model-free numerical methods for inverting structural data to obtain interaction potentials, namely iterative Boltzmann inversion (IBI), test-particle insertion (TPI), and a machine-learning (ML) approach called ActiveNet. Three archetypal models of two-dimensional colloidal systems are used as test cases: Weeks--Chandler--Anderson short-ranged repulsion, the Lennard-Jones potenti…
▽ More
We compare three model-free numerical methods for inverting structural data to obtain interaction potentials, namely iterative Boltzmann inversion (IBI), test-particle insertion (TPI), and a machine-learning (ML) approach called ActiveNet. Three archetypal models of two-dimensional colloidal systems are used as test cases: Weeks--Chandler--Anderson short-ranged repulsion, the Lennard-Jones potential, and a repulsive shoulder interaction with two length scales. Additionally, data on an experimental suspension of colloidal spheres are acquired by optical microscopy and used to test the inversion methods. The methods have different merits. IBI is the only choice when the radial distribution function is known but particle coordinates are unavailable. TPI requires snapshots with particle positions and can extract both pair- and higher-body potentials without the need for simulation. The ML approach can only be used when particles can be tracked in time and it returns the force rather than the potential. However, it can unravel pair interactions from any one-body forces (such as drag or propulsion) and does not rely on equilibrium distributions for its derivation. Our results may serve as a guide when a numerical method is needed for application to experimental data, and as a reference for further development of the methodology itself.
△ Less
Submitted 15 March, 2025;
originally announced March 2025.
-
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs
Authors:
Hannah Cyberey,
Yangfeng Ji,
David Evans
Abstract:
Large language models (LLMs) are known to perpetuate stereotypes and exhibit biases. Various strategies have been proposed to mitigate these biases, but most work studies biases in LLMs as a black-box problem without considering how concepts are represented within the model. We adapt techniques from representation engineering to study how the concept of "gender" is represented within LLMs. We intr…
▽ More
Large language models (LLMs) are known to perpetuate stereotypes and exhibit biases. Various strategies have been proposed to mitigate these biases, but most work studies biases in LLMs as a black-box problem without considering how concepts are represented within the model. We adapt techniques from representation engineering to study how the concept of "gender" is represented within LLMs. We introduce a new method that extracts concept representations via probability weighting without labeled data and efficiently selects a steering vector for measuring and manipulating the model's representation. We also present a projection-based method that enables precise steering of model predictions and demonstrate its effectiveness in mitigating gender bias in LLMs. Our code is available at: https://github.com/hannahxchen/gender-bias-steering
△ Less
Submitted 20 May, 2025; v1 submitted 26 February, 2025;
originally announced February 2025.
-
Statistical tests based on Renyi entropy estimation
Authors:
Mehmet Siddik Cadirci,
Dafydd Evans,
Nikolai Leonenko,
Vitali Makogin,
Oleg Seleznjev
Abstract:
Entropy and its various generalizations are important in many fields, including mathematical statistics, communication theory, physics and computer science, for characterizing the amount of information associated with a probability distribution. In this paper we propose goodness-of-fit statistics for the multivariate Student and multivariate Pearson type II distributions, based on the maximum entr…
▽ More
Entropy and its various generalizations are important in many fields, including mathematical statistics, communication theory, physics and computer science, for characterizing the amount of information associated with a probability distribution. In this paper we propose goodness-of-fit statistics for the multivariate Student and multivariate Pearson type II distributions, based on the maximum entropy principle and a class of estimators for Renyi entropy based on nearest neighbour distances. We prove the L2-consistency of these statistics using results on the subadditivity of Euclidean functionals on nearest neighbour graphs, and investigate their rate of convergence and asymptotic distribution using Monte Carlo methods.
△ Less
Submitted 23 January, 2025;
originally announced February 2025.
-
Local doping of an oxide semiconductor by voltage-driven splitting of anti-Frenkel defects
Authors:
Jiali He,
Ursula Ludacka,
Kasper A. Hunnestad,
Didrik R. Småbråten,
Konstantin Shapovalov,
Per Erik Vullum,
Constantinos Hatzoglou,
Donald M. Evans,
Erik D. Roede,
Zewu Yan,
Edith Bourret,
Sverre M. Selbach,
David Gao,
Jaakko Akola,
Dennis Meier
Abstract:
Layered oxides exhibit high ionic mobility and chemical flexibility, attracting interest as cathode materials for lithium-ion batteries and the pairing of hydrogen production and carbon capture. Recently, layered oxides emerged as highly tunable semiconductors. For example, by introducing anti-Frenkel defects, the electronic hopping conductance in hexagonal manganites was increased locally by orde…
▽ More
Layered oxides exhibit high ionic mobility and chemical flexibility, attracting interest as cathode materials for lithium-ion batteries and the pairing of hydrogen production and carbon capture. Recently, layered oxides emerged as highly tunable semiconductors. For example, by introducing anti-Frenkel defects, the electronic hopping conductance in hexagonal manganites was increased locally by orders of magnitude. Here, we demonstrate local acceptor and donor doping in Er(Mn,Ti)O$_3$, facilitated by the splitting of such anti-Frenkel defects under applied d.c. voltage. By combining density functional theory calculations, scanning probe microscopy, atom probe tomography, and scanning transmission electron microscopy, we show that the oxygen defects readily move through the layered crystal structure, leading to nano-sized interstitial-rich (p-type) and vacancy-rich (n-type) regions. The resulting pattern is comparable to dipolar npn-junctions and stable on the timescale of days. Our findings reveal the possibility of temporarily functionalizing oxide semiconductors at the nanoscale, giving additional opportunities for the field of oxide electronics and the development of transient electronics in general.
△ Less
Submitted 11 February, 2025;
originally announced February 2025.
-
Toward a Principled Framework for Disclosure Avoidance
Authors:
Michael B Hawes,
Evan M Brassell,
Anthony Caruso,
Ryan Cumings-Menon,
Jason Devine,
Cassandra Dorius,
David Evans,
Kenneth Haase,
Michele C Hedrick,
Alexandra Krause,
Philip Leclerc,
James Livsey,
Rolando A Rodriguez,
Luke T Rogers,
Matthew Spence,
Victoria Velkoff,
Michael Walsh,
James Whitehorne,
Sallie Ann Keller
Abstract:
Responsible disclosure limitation is an iterative exercise in risk assessment and mitigation. From time to time, as disclosure risks grow and evolve and as data users' needs change, agencies must consider redesigning the disclosure avoidance system(s) they use. Discussions about candidate systems often conflate inherent features of those systems with implementation decisions independent of those s…
▽ More
Responsible disclosure limitation is an iterative exercise in risk assessment and mitigation. From time to time, as disclosure risks grow and evolve and as data users' needs change, agencies must consider redesigning the disclosure avoidance system(s) they use. Discussions about candidate systems often conflate inherent features of those systems with implementation decisions independent of those systems. For example, a system's ability to calibrate the strength of protection to suit the underlying disclosure risk of the data (e.g., by varying suppression thresholds), is a worthwhile feature regardless of the independent decision about how much protection is actually necessary. Having a principled discussion of candidate disclosure avoidance systems requires a framework for distinguishing these inherent features of the systems from the implementation decisions that need to be made independent of the system selected. For statistical agencies, this framework must also reflect the applied nature of these systems, acknowledging that candidate systems need to be adaptable to requirements stemming from the legal, scientific, resource, and stakeholder environments within which they would be operating. This paper proposes such a framework. No approach will be perfectly adaptable to every potential system requirement. Because the selection of some methodologies over others may constrain the resulting systems' efficiency and flexibility to adapt to particular statistical product specifications, data user needs, or disclosure risks, agencies may approach these choices in an iterative fashion, adapting system requirements, product specifications, and implementation parameters as necessary to ensure the resulting quality of the statistical product.
△ Less
Submitted 29 May, 2025; v1 submitted 10 February, 2025;
originally announced February 2025.
-
Easy-cone state mediating the spin reorientation in topological kagome magnet Fe$_3$Sn$_2$
Authors:
L. Prodan,
D. M. Evans,
A. S. Sukhanov,
S. E. Nikitin,
A. A. Tsirlin,
L. Puntingam,
M. C. Rahn,
L. Chioncel,
V. Tsurkan,
I. Kezsmarki
Abstract:
We investigated temperature-driven spin reorientation (SR) in the itinerant kagome magnet Fe$_3$Sn$_2$ using high-resolution synchrotron x-ray diffraction, neutron diffraction, magnetometry, and magnetic force microscopy (MFM), further supported by phenomenological analysis. Our study reveals a crossover from the state with easy-plane anisotropy to the high-temperature state with uniaxial easy-axi…
▽ More
We investigated temperature-driven spin reorientation (SR) in the itinerant kagome magnet Fe$_3$Sn$_2$ using high-resolution synchrotron x-ray diffraction, neutron diffraction, magnetometry, and magnetic force microscopy (MFM), further supported by phenomenological analysis. Our study reveals a crossover from the state with easy-plane anisotropy to the high-temperature state with uniaxial easy-axis anisotropy taking place between $\sim40-130$~ K through an intermediate easy-cone (or tilted spin) state. This state, induced by the interplay between the anisotropy constants $K_1$ and $K_2$, is clearly manifested in the thermal evolution of the magnetic structure factor, which reveals a gradual change of the SR angle $\mathbfθ$ between $40-130$~K. We also found that the SR is accompanied by a magnetoelastic effect. Zero-field MFM images across the SR range show a transformation in surface magnetic patterns from a dendritic structure at 120~K, to domain wall dominated MFM contrast at 40~K.
△ Less
Submitted 5 February, 2025;
originally announced February 2025.
-
Diffraction of walking drops by a standing Faraday wave
Authors:
Bauyrzhan K. Primkulov,
Davis J. Evans,
Valeri Frumkin,
Pedro J. Sáenz,
John W. M. Bush
Abstract:
The Kapitza-Dirac effect is the diffraction of quantum particles by a standing wave of light. We here report an analogous phenomenon in pilot-wave hydrodynamics, wherein droplets walking across the surface of a vibrating liquid bath are deflected by a standing Faraday wave. We show that, in certain parameter regimes, the statistical distribution of the droplet deflection angles reveals a diffracti…
▽ More
The Kapitza-Dirac effect is the diffraction of quantum particles by a standing wave of light. We here report an analogous phenomenon in pilot-wave hydrodynamics, wherein droplets walking across the surface of a vibrating liquid bath are deflected by a standing Faraday wave. We show that, in certain parameter regimes, the statistical distribution of the droplet deflection angles reveals a diffraction pattern reminiscent of that observed in the Kapitza-Dirac effect. Through experiments and simulations, we show that the diffraction pattern results from the complex interactions of the droplets with the standing wave. Our study highlights non-resonant effects associated with the detuning of the droplet bouncing and the bath vibration, which are shown to lead to drop speed variations and droplet sorting according to the droplet's phase of impact. We discuss the similarities and differences between our hydrodynamic system and the discrete and continuum interpretations of the Kapitza-Dirac effect, and introduce the notion of ponderomotive effects in pilot-wave hydrodynamics.
△ Less
Submitted 25 December, 2024;
originally announced December 2024.
-
Archaeoscape: Bringing Aerial Laser Scanning Archaeology to the Deep Learning Era
Authors:
Yohann Perron,
Vladyslav Sydorov,
Adam P. Wijker,
Damian Evans,
Christophe Pottier,
Loic Landrieu
Abstract:
Airborne Laser Scanning (ALS) technology has transformed modern archaeology by unveiling hidden landscapes beneath dense vegetation. However, the lack of expert-annotated, open-access resources has hindered the analysis of ALS data using advanced deep learning techniques. We address this limitation with Archaeoscape (available at https://archaeoscape.ai/data/2024/), a novel large-scale archaeologi…
▽ More
Airborne Laser Scanning (ALS) technology has transformed modern archaeology by unveiling hidden landscapes beneath dense vegetation. However, the lack of expert-annotated, open-access resources has hindered the analysis of ALS data using advanced deep learning techniques. We address this limitation with Archaeoscape (available at https://archaeoscape.ai/data/2024/), a novel large-scale archaeological ALS dataset spanning 888 km$^2$ in Cambodia with 31,141 annotated archaeological features from the Angkorian period. Archaeoscape is over four times larger than comparable datasets, and the first ALS archaeology resource with open-access data, annotations, and models.
We benchmark several recent segmentation models to demonstrate the benefits of modern vision techniques for this problem and highlight the unique challenges of discovering subtle human-made structures under dense jungle canopies. By making Archaeoscape available in open access, we hope to bridge the gap between traditional archaeology and modern computer vision methods.
△ Less
Submitted 12 December, 2024; v1 submitted 6 December, 2024;
originally announced December 2024.
-
Non-resonant effects in pilot-wave hydrodynamics
Authors:
Bauyrzhan K. Primkulov,
Davis J. Evans,
Joel B. Been,
John W. M. Bush
Abstract:
Pilot-wave hydrodynamics concerns the dynamics of 'walkers,' droplets walking on a vibrating bath, and has provided the basis for the burgeoning field of hydrodynamic quantum analogs. We here explore a theoretical model of pilot-wave hydrodynamics that relaxes the simplifying assumption of resonance between the droplet and its pilot wave, specifically the assumption of a fixed impact phase between…
▽ More
Pilot-wave hydrodynamics concerns the dynamics of 'walkers,' droplets walking on a vibrating bath, and has provided the basis for the burgeoning field of hydrodynamic quantum analogs. We here explore a theoretical model of pilot-wave hydrodynamics that relaxes the simplifying assumption of resonance between the droplet and its pilot wave, specifically the assumption of a fixed impact phase between the bouncing drop and its wave field. The model captures both the vertical and horizontal dynamics of the drop, allowing one to examine non-resonant effects for both free and constrained walkers. The model provides new rationale for a number of previously reported but poorly understood features of free walker motion in pilot-wave hydrodynamics, including colinear swaying at the onset of motion, intermittent walking, and chaotic speed oscillations, all of which are accompanied by sporadic changes in the impact phase of the bouncing drop. The model also highlights the degeneracy in the droplets' vertical dynamics, specifically, the possibility of two distinct bouncing phases and of switching between the two. Consideration of this degeneracy is critical to understanding the droplet dynamics and statistics emerging in confined geometries at high memory and the interaction of walking droplets with standing Faraday waves.
△ Less
Submitted 3 January, 2025; v1 submitted 22 November, 2024;
originally announced November 2024.
-
Theory of Nonequilibrium Crystallization and the Phase Diagram of Active Brownian Spheres
Authors:
Daniel Evans,
Ahmad K. Omar
Abstract:
The crystallization of hard spheres at equilibrium is perhaps the most familiar example of an entropically-driven phase transition. In recent years, it has become clear that activity can dramatically alter this order-disorder transition in unexpected ways. The theoretical description of active crystallization has remained elusive as the traditional thermodynamic arguments that shape our understand…
▽ More
The crystallization of hard spheres at equilibrium is perhaps the most familiar example of an entropically-driven phase transition. In recent years, it has become clear that activity can dramatically alter this order-disorder transition in unexpected ways. The theoretical description of active crystallization has remained elusive as the traditional thermodynamic arguments that shape our understanding of passive freezing are inapplicable to active systems. Here, we develop a statistical mechanical description of the one-body density field and a nonconserved order parameter field that represents local crystalline order. We develop equations of state, guided by computer simulations, describing the crystallinity field which result in shifting the order-disorder transition to higher packing fractions with increasing activity. We then leverage our recent dynamical theory of coexistence to construct the full phase diagram of active Brownian spheres, quantitatively recapitulating both the solid-fluid and liquid-gas coexistence curves and the solid-liquid-gas triple point.
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
Lagrangian Klein bottles in $S^2 \times S^2$
Authors:
Nikolas Adaloglou,
Jonathan David Evans
Abstract:
We use Luttinger surgery to show that there are no Lagrangian Klein bottles in $S^2\times S^2$ in the $\mathbb{Z}_2$-homology class of an $S^2$-factor if the symplectic area of that factor is at least twice that of the other.
We use Luttinger surgery to show that there are no Lagrangian Klein bottles in $S^2\times S^2$ in the $\mathbb{Z}_2$-homology class of an $S^2$-factor if the symplectic area of that factor is at least twice that of the other.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models
Authors:
Ricardo Knauer,
Mario Koddenbrock,
Raphael Wallsberger,
Nicholas M. Brisson,
Georg N. Duda,
Deborah Falla,
David W. Evans,
Erik Rodner
Abstract:
Large language models (LLMs) provide powerful means to leverage prior knowledge for predictive modeling when data is limited. In this work, we demonstrate how LLMs can use their compressed world knowledge to generate intrinsically interpretable machine learning models, i.e., decision trees, without any training data. We find that these zero-shot decision trees can even surpass data-driven trees on…
▽ More
Large language models (LLMs) provide powerful means to leverage prior knowledge for predictive modeling when data is limited. In this work, we demonstrate how LLMs can use their compressed world knowledge to generate intrinsically interpretable machine learning models, i.e., decision trees, without any training data. We find that these zero-shot decision trees can even surpass data-driven trees on some small-sized tabular datasets and that embeddings derived from these trees perform better than data-driven tree-based embeddings on average. Our decision tree induction and embedding approaches can therefore serve as new knowledge-driven baselines for data-driven machine learning methods in the low-data regime. Furthermore, they offer ways to harness the rich world knowledge within LLMs for tabular machine learning tasks. Our code and results are available at https://github.com/ml-lab-htw/llm-trees.
△ Less
Submitted 27 May, 2025; v1 submitted 27 September, 2024;
originally announced September 2024.
-
On Amicable Numbers
Authors:
Leonhard Euler,
Jonathan David Evans
Abstract:
This is an English translation of Euler's 1750 paper "De numeris amicabilibus" (E152), the most substantial of his three works with this name. In it, he expounds at great length the ad hoc methods he has developed to search for pairs of amicable numbers, concluding with a list of around 60 new pairs.
This is an English translation of Euler's 1750 paper "De numeris amicabilibus" (E152), the most substantial of his three works with this name. In it, he expounds at great length the ad hoc methods he has developed to search for pairs of amicable numbers, concluding with a list of around 60 new pairs.
△ Less
Submitted 13 September, 2024;
originally announced September 2024.
-
Theory of Nonequilibrium Multicomponent Coexistence
Authors:
Yu-Jen Chiu,
Daniel Evans,
Ahmad K. Omar
Abstract:
Multicomponent phase separation is a routine occurrence in both living and synthetic systems. Thermodynamics provides a straightforward path to determine the phase boundaries that characterize these transitions for systems at equilibrium. The prevalence of phase separation in complex systems outside the confines of equilibrium motivates the need for a genuinely nonequilibrium theory of multicompon…
▽ More
Multicomponent phase separation is a routine occurrence in both living and synthetic systems. Thermodynamics provides a straightforward path to determine the phase boundaries that characterize these transitions for systems at equilibrium. The prevalence of phase separation in complex systems outside the confines of equilibrium motivates the need for a genuinely nonequilibrium theory of multicomponent phase coexistence. Here, we develop a mechanical theory for coexistence that casts coexistence criteria into the familiar form of equality of state functions. Our theory generalizes traditional equilibrium notions such as the species chemical potential and thermodynamic pressure to systems out of equilibrium. Crucially, while these notions may not be identifiable for all nonequilibrium systems, we numerically verify their existence for a variety of systems by introducing the phenomenological Multicomponent Active Model B+. Our work establishes an initial framework for understanding multicomponent coexistence that we hope can serve as the basis for a comprehensive theory for high-dimensional nonequilibrium phase transitions.
△ Less
Submitted 19 September, 2024; v1 submitted 11 September, 2024;
originally announced September 2024.
-
Re-entrant percolation in active Brownian hard disks
Authors:
David Evans,
José Martín-Roca,
Nathan J. Harmer,
Chantal Valeriani,
Mark A. Miller
Abstract:
Non-equilibrium clustering and percolation are investigated in an archetypal model of two-dimensional active matter using dynamic simulations of self-propelled Brownian repulsive particles. We concentrate on the single-phase region up to moderate levels of activity, before motility-induced phase separation (MIPS) sets in. Weak activity promotes cluster formation and lowers the percolation threshol…
▽ More
Non-equilibrium clustering and percolation are investigated in an archetypal model of two-dimensional active matter using dynamic simulations of self-propelled Brownian repulsive particles. We concentrate on the single-phase region up to moderate levels of activity, before motility-induced phase separation (MIPS) sets in. Weak activity promotes cluster formation and lowers the percolation threshold. However, driving the system further out of equilibrium partly reverses this effect, resulting in a minimum in the critical density for the formation of system-spanning clusters and introducing re-entrant percolation as a function of activity in the pre-MIPS regime. This non-monotonic behaviour arises from competition between activity-induced effective attraction (which eventually leads to MIPS) and activity-driven cluster breakup. Using an adapted iterative Boltzmann inversion method, we derive effective potentials to map weakly active cases onto a passive (equilibrium) model with conservative attraction, which can be characterised by Monte Carlo simulations. While the active and passive systems have practically identical radial distribution functions, we find decisive differences in higher-order structural correlations, to which the percolation threshold is highly sensitive. For sufficiently strong activity, no passive pairwise potential can reproduce the radial distribution function of the active system.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
Lagrangian Surplusection Phenomena
Authors:
Georgios Dimitroglou Rizell,
Jonathan David Evans
Abstract:
Suppose you have a family of Lagrangian submanifolds $L_t$ and an auxiliary Lagrangian $K$. Suppose that $K$ intersects some of the $L_t$ more than the minimal number of times. Can you eliminate surplus intersection (surplusection) with all fibres by performing a Hamiltonian isotopy of $K$? Or will any Lagrangian isotopic to $K$ surplusect some of the fibres? We argue that in several important sit…
▽ More
Suppose you have a family of Lagrangian submanifolds $L_t$ and an auxiliary Lagrangian $K$. Suppose that $K$ intersects some of the $L_t$ more than the minimal number of times. Can you eliminate surplus intersection (surplusection) with all fibres by performing a Hamiltonian isotopy of $K$? Or will any Lagrangian isotopic to $K$ surplusect some of the fibres? We argue that in several important situations, surplusection cannot be eliminated, and that a better understanding of surplusection phenomena (better bounds and a clearer understanding of how the surplusection is distributed in the family) would help to tackle some outstanding problems in different areas, including Oh's conjecture on the volume-minimising property of the Clifford torus and the concurrent normals conjecture in convex geometry. We pose many open questions.
△ Less
Submitted 6 December, 2024; v1 submitted 27 August, 2024;
originally announced August 2024.
-
The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models
Authors:
Hannah Chen,
Yangfeng Ji,
David Evans
Abstract:
Large language models (LLMs) are now being considered and even deployed for applications that support high-stakes decision-making, such as recruitment and clinical decisions. While several methods have been proposed for measuring bias, there remains a gap between predictions, which are what the proposed methods consider, and how they are used to make decisions. In this work, we introduce Rank-Allo…
▽ More
Large language models (LLMs) are now being considered and even deployed for applications that support high-stakes decision-making, such as recruitment and clinical decisions. While several methods have been proposed for measuring bias, there remains a gap between predictions, which are what the proposed methods consider, and how they are used to make decisions. In this work, we introduce Rank-Allocational-Based Bias Index (RABBI), a model-agnostic bias measure that assesses potential allocational harms arising from biases in LLM predictions. We compare RABBI and current bias metrics on two allocation decision tasks. We evaluate their predictive validity across ten LLMs and utility for model selection. Our results reveal that commonly-used bias metrics based on average performance gap and distribution distance fail to reliably capture group disparities in allocation outcomes, whereas RABBI exhibits a strong correlation with allocation disparities. Our work highlights the need to account for how models are used in contexts with limited resource constraints.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
The Chandra Source Catalog Release 2 Series
Authors:
Ian N. Evans,
Janet D. Evans,
J. Rafael Martínez-Galarza,
Joseph B. Miller,
Francis A. Primini,
Mojegan Azadi,
Douglas J. Burke,
Francesca M. Civano,
Raffaele D'Abrusco,
Giuseppina Fabbiano,
Dale E. Graessle,
John D. Grier,
John C. Houck,
Jennifer Lauer,
Michael L. McCollough,
Michael A. Nowak,
David A. Plummer,
Arnold H. Rots,
Aneta Siemiginowska,
Michael S. Tibbetts
Abstract:
The Chandra Source Catalog (CSC) is a virtual X-ray astrophysics facility that enables both detailed individual source studies and statistical studies of large samples of X-ray sources detected in ACIS and HRC-I imaging observations obtained by the Chandra X-ray Observatory. The catalog provides carefully-curated, high-quality, and uniformly calibrated and analyzed tabulated positional, spatial, p…
▽ More
The Chandra Source Catalog (CSC) is a virtual X-ray astrophysics facility that enables both detailed individual source studies and statistical studies of large samples of X-ray sources detected in ACIS and HRC-I imaging observations obtained by the Chandra X-ray Observatory. The catalog provides carefully-curated, high-quality, and uniformly calibrated and analyzed tabulated positional, spatial, photometric, spectral, and temporal source properties, as well as science-ready X-ray data products. The latter includes multiple types of source- and field-based FITS format products that can be used as a basis for further research, significantly simplifying followup analysis of scientifically meaningful source samples. We discuss in detail the algorithms used for the CSC Release 2 Series, including CSC 2.0, which includes 317,167 unique X-ray sources on the sky identified in observations released publicly through the end of 2014, and CSC 2.1, which adds Chandra data released through the end of 2021 and expands the catalog to 407,806 sources. Besides adding more recent observations, the CSC Release 2 Series includes multiple algorithmic enhancements that provide significant improvements over earlier releases. The compact source sensitivity limit for most observations is ~5 photons over most of the field of view, which is ~2x fainter than Release 1, achieved by co-adding observations and using an optimized source detection approach. A Bayesian X-ray aperture photometry code produces robust fluxes even in crowded fields and for low count sources. The current release, CSC 2.1, is tied to the Gaia-CRF3 astrometric reference frame for the best sky positions for catalog sources.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
The OPS-SAT benchmark for detecting anomalies in satellite telemetry
Authors:
Bogdan Ruszczak,
Krzysztof Kotowski,
David Evans,
Jakub Nalepa
Abstract:
Detecting anomalous events in satellite telemetry is a critical task in space operations. This task, however, is extremely time-consuming, error-prone and human dependent, thus automated data-driven anomaly detection algorithms have been emerging at a steady pace. However, there are no publicly available datasets of real satellite telemetry accompanied with the ground-truth annotations that could…
▽ More
Detecting anomalous events in satellite telemetry is a critical task in space operations. This task, however, is extremely time-consuming, error-prone and human dependent, thus automated data-driven anomaly detection algorithms have been emerging at a steady pace. However, there are no publicly available datasets of real satellite telemetry accompanied with the ground-truth annotations that could be used to train and verify anomaly detection supervised models. In this article, we address this research gap and introduce the AI-ready benchmark dataset (OPSSAT-AD) containing the telemetry data acquired on board OPS-SAT -- a CubeSat mission which has been operated by the European Space Agency which has come to an end during the night of 22--23 May 2024 (CEST). The dataset is accompanied with the baseline results obtained using 30 supervised and unsupervised classic and deep machine learning algorithms for anomaly detection. They were trained and validated using the training-test dataset split introduced in this work, and we present a suggested set of quality metrics which should be always calculated to confront the new algorithms for anomaly detection while exploiting OPSSAT-AD. We believe that this work may become an important step toward building a fair, reproducible and objective validation procedure that can be used to quantify the capabilities of the emerging anomaly detection techniques in an unbiased and fully transparent way.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Do Parameters Reveal More than Loss for Membership Inference?
Authors:
Anshuman Suri,
Xiao Zhang,
David Evans
Abstract:
Membership inference attacks are used as a key tool for disclosure auditing. They aim to infer whether an individual record was used to train a model. While such evaluations are useful to demonstrate risk, they are computationally expensive and often make strong assumptions about potential adversaries' access to models and training environments, and thus do not provide tight bounds on leakage from…
▽ More
Membership inference attacks are used as a key tool for disclosure auditing. They aim to infer whether an individual record was used to train a model. While such evaluations are useful to demonstrate risk, they are computationally expensive and often make strong assumptions about potential adversaries' access to models and training environments, and thus do not provide tight bounds on leakage from potential attacks. We show how prior claims around black-box access being sufficient for optimal membership inference do not hold for stochastic gradient descent, and that optimal membership inference indeed requires white-box access. Our theoretical results lead to a new white-box inference attack, IHA (Inverse Hessian Attack), that explicitly uses model parameters by taking advantage of computing inverse-Hessian vector products. Our results show that both auditors and adversaries may be able to benefit from access to model parameters, and we advocate for further research into white-box methods for membership inference.
△ Less
Submitted 19 December, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
The PLATO Mission
Authors:
Heike Rauer,
Conny Aerts,
Juan Cabrera,
Magali Deleuil,
Anders Erikson,
Laurent Gizon,
Mariejo Goupil,
Ana Heras,
Jose Lorenzo-Alvarez,
Filippo Marliani,
César Martin-Garcia,
J. Miguel Mas-Hesse,
Laurence O'Rourke,
Hugh Osborn,
Isabella Pagano,
Giampaolo Piotto,
Don Pollacco,
Roberto Ragazzoni,
Gavin Ramsay,
Stéphane Udry,
Thierry Appourchaux,
Willy Benz,
Alexis Brandeker,
Manuel Güdel,
Eduardo Janot-Pacheco
, et al. (820 additional authors not shown)
Abstract:
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observati…
▽ More
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution.
The payload instrument consists of 26 cameras with 12cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras. PLATO is scheduled for a launch date end 2026. This overview therefore provides a summary of the mission to the community in preparation of the upcoming operational phases.
△ Less
Submitted 18 November, 2024; v1 submitted 8 June, 2024;
originally announced June 2024.
-
Ferri-ionic Coupling in CuInP$_2$S$_6$ Nanoflakes: Polarization States and Controllable Negative Capacitance
Authors:
Anna N. Morozovska,
Sergei V. Kalinin,
Eugene. A. Eliseev,
Svitlana Kopyl,
Yulian M. Vysochanskii,
Dean R. Evans
Abstract:
We consider nanoflakes of van der Waals ferrielectric CuInP$_2$S$_6$ covered by an ionic surface charge and reveal the appearance of polar states with relatively high polarization ~5 microC/cm$^2$ and stored free charge ~10 microC/cm$%2$, which can mimic "mid-gap" states associated with a surface field-induced transfer of Cu and/or In ions in the van der Waals gap. The change in the ionic screenin…
▽ More
We consider nanoflakes of van der Waals ferrielectric CuInP$_2$S$_6$ covered by an ionic surface charge and reveal the appearance of polar states with relatively high polarization ~5 microC/cm$^2$ and stored free charge ~10 microC/cm$%2$, which can mimic "mid-gap" states associated with a surface field-induced transfer of Cu and/or In ions in the van der Waals gap. The change in the ionic screening degree and mismatch strains induce a broad range of the transitions between paraelectric phase, antiferroelectric, ferrielectric, and ferri-ionic states in CuInP$_2$S$_6$ nanoflakes. The states' stability and/or metastability is determined by the minimum of the system free energy consisting of electrostatic energy, elastic energy, and a Landau-type four-well potential of the ferrielectric dipole polarization. The possibility to govern the transitions by strain and ionic screening can be useful for controlling the tunneling barrier in thin film devices based on CuInP$_2$S$_6$ nanoflakes. Also, we predict that the CuInP$_2$S$_6$ nanoflakes reveal features of the controllable negative capacitance effect, which make them attractive for advanced electronic devices, such as nano-capacitors and gate oxide nanomaterials with reduced heat dissipation.
△ Less
Submitted 3 August, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
DP-RuL: Differentially-Private Rule Learning for Clinical Decision Support Systems
Authors:
Josephine Lamp,
Lu Feng,
David Evans
Abstract:
Serious privacy concerns arise with the use of patient data in rule-based clinical decision support systems (CDSS). The goal of a privacy-preserving CDSS is to learn a population ruleset from individual clients' local rulesets, while protecting the potentially sensitive information contained in the rulesets. We present the first work focused on this problem and develop a framework for learning pop…
▽ More
Serious privacy concerns arise with the use of patient data in rule-based clinical decision support systems (CDSS). The goal of a privacy-preserving CDSS is to learn a population ruleset from individual clients' local rulesets, while protecting the potentially sensitive information contained in the rulesets. We present the first work focused on this problem and develop a framework for learning population rulesets with local differential privacy (LDP), suitable for use within a distributed CDSS and other distributed settings. Our rule discovery protocol uses a Monte-Carlo Tree Search (MCTS) method integrated with LDP to search a rule grammar in a structured way and find rule structures clients are likely to have. Randomized response queries are sent to clients to determine promising paths to search within the rule grammar. In addition, we introduce an adaptive budget allocation method which dynamically determines how much privacy loss budget to use at each query, resulting in better privacy-utility trade-offs. We evaluate our approach using three clinical datasets and find that we are able to learn population rulesets with high coverage (breadth of rules) and clinical utility even at low privacy loss budgets.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Evaluating Google's Protected Audience Protocol
Authors:
Minjun Long,
David Evans
Abstract:
While third-party cookies have been a key component of the digital marketing ecosystem for years, they allow users to be tracked across web sites in ways that raise serious privacy concerns. Google has proposed the Privacy Sandbox initiative to enable ad targeting without third-party cookies. While there have been several studies focused on other aspects of this initiative, there has been little a…
▽ More
While third-party cookies have been a key component of the digital marketing ecosystem for years, they allow users to be tracked across web sites in ways that raise serious privacy concerns. Google has proposed the Privacy Sandbox initiative to enable ad targeting without third-party cookies. While there have been several studies focused on other aspects of this initiative, there has been little analysis to date as to how well the system achieves the intended goal of preventing request linking. This work focuses on analyzing linkage privacy risks for the reporting mechanisms proposed in the Protected Audience (PrAu) proposal (previously known as FLEDGE), which is intended to enable online remarketing without using third-party cookies. We summarize the overall workflow of PrAu and highlight potential privacy risks associated with its proposed design, focusing on scenarios in which adversaries attempt to link requests to different sites to the same user. We show how a realistic adversary would be still able to use the privacy-protected reporting mechanisms to link user requests and conduct mass surveillance, even with correct implementations of all the currently proposed privacy mechanisms.
△ Less
Submitted 20 May, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
Tropical methods for stable octic double planes
Authors:
Jonathan David Evans,
Angelica Simonetti,
Giancarlo Urzúa
Abstract:
This paper has been written to illustrate the power of techniques from tropical geometry and mirror symmetry for studying the KSBA moduli space of surfaces on or near the Noether line. We focus on the moduli space of octic double planes ($K^2 = 2$, $p_g = 3$) and use methods from tropical and toric geometry to classify the strata corresponding to normal KSBA-stable surfaces, focusing on the non-Go…
▽ More
This paper has been written to illustrate the power of techniques from tropical geometry and mirror symmetry for studying the KSBA moduli space of surfaces on or near the Noether line. We focus on the moduli space of octic double planes ($K^2 = 2$, $p_g = 3$) and use methods from tropical and toric geometry to classify the strata corresponding to normal KSBA-stable surfaces, focusing on the non-Gorenstein case.
△ Less
Submitted 13 December, 2024; v1 submitted 4 May, 2024;
originally announced May 2024.
-
Quantum symmetries of noncommutative tori
Authors:
David E. Evans,
Corey Jones
Abstract:
We consider the problem of building non-invertible quantum symmetries (as characterized by actions of unitary fusion categories) on noncommutative tori. We introduce a general method to construct actions of fusion categories on inductive limit C*-algberas using finite dimenionsal data, and then apply it to obtain AT-actions of arbitrary Haagerup-Izumi categories on noncommutative 2-tori, of the ev…
▽ More
We consider the problem of building non-invertible quantum symmetries (as characterized by actions of unitary fusion categories) on noncommutative tori. We introduce a general method to construct actions of fusion categories on inductive limit C*-algberas using finite dimenionsal data, and then apply it to obtain AT-actions of arbitrary Haagerup-Izumi categories on noncommutative 2-tori, of the even part of the $E_{8}$ subfactor on a noncommutative 3-torus, and of $\text{PSU}(2)_{15}$ on a noncommutative 4-torus.
△ Less
Submitted 8 January, 2025; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Adapting to time: Why nature may have evolved a diverse set of neurons
Authors:
Karim G. Habashy,
Benjamin D. Evans,
Dan F. M. Goodman,
Jeffrey S. Bowers
Abstract:
Brains have evolved diverse neurons with varying morphologies and dynamics that impact temporal information processing. In contrast, most neural network models use homogeneous units that vary only in spatial parameters (weights and biases). To explore the importance of temporal parameters, we trained spiking neural networks on tasks with varying temporal complexity, holding different parameter sub…
▽ More
Brains have evolved diverse neurons with varying morphologies and dynamics that impact temporal information processing. In contrast, most neural network models use homogeneous units that vary only in spatial parameters (weights and biases). To explore the importance of temporal parameters, we trained spiking neural networks on tasks with varying temporal complexity, holding different parameter subsets constant. We found that adapting conduction delays is crucial for solving all test conditions under tight resource constraints. Remarkably, these tasks can be solved using only temporal parameters (delays and time constants) with constant weights. In more complex spatio-temporal tasks, an adaptable bursting parameter was essential. Overall, allowing adaptation of both temporal and spatial parameters enhances network robustness to noise, a vital feature for biological brains and neuromorphic computing systems. Our findings suggest that rich and adaptable dynamics may be the key for solving temporally structured tasks efficiently in evolving organisms, which would help explain the diverse physiological properties of biological neurons.
△ Less
Submitted 12 January, 2025; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry
Authors:
Gaia Collaboration,
P. Panuzzo,
T. Mazeh,
F. Arenou,
B. Holl,
E. Caffau,
A. Jorissen,
C. Babusiaux,
P. Gavras,
J. Sahlmann,
U. Bastian,
Ł. Wyrzykowski,
L. Eyer,
N. Leclerc,
N. Bauchet,
A. Bombrun,
N. Mowlavi,
G. M. Seabroke,
D. Teyssier,
E. Balbinot,
A. Helmi,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne
, et al. (390 additional authors not shown)
Abstract:
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is exp…
▽ More
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is expected to uncover many Galactic wide-binary systems containing dormant BHs, which may not have been detected before. The study of this population will provide new information on the BH-mass distribution in binaries and shed light on their formation mechanisms and progenitors. As part of the validation efforts in preparation for the fourth Gaia data release (DR4), we analysed the preliminary astrometric binary solutions, obtained by the Gaia Non-Single Star pipeline, to verify their significance and to minimise false-detection rates in high-mass-function orbital solutions. The astrometric binary solution of one source, Gaia BH3, implies the presence of a 32.70 \pm 0.82 M\odot BH in a binary system with a period of 11.6 yr. Gaia radial velocities independently validate the astrometric orbit. Broad-band photometric and spectroscopic data show that the visible component is an old, very metal-poor giant of the Galactic halo, at a distance of 590 pc. The BH in the Gaia BH3 system is more massive than any other Galactic stellar-origin BH known thus far. The low metallicity of the star companion supports the scenario that metal-poor massive stars are progenitors of the high-mass BHs detected by gravitational-wave telescopes. The Galactic orbit of the system and its metallicity indicate that it might belong to the Sequoia halo substructure. Alternatively, and more plausibly, it could belong to the ED-2 stream, which likely originated from a globular cluster that had been disrupted by the Milky Way.
△ Less
Submitted 19 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Absolute dimensions of solar-type eclipsing binaries. NY Hya: A test for magnetic stellar evolution models
Authors:
T. C. Hinse,
O. Baştürk,
J. Southworth,
G. A. Feiden,
J. Tregloan-Reed,
V. B. Kostov,
J. Livingston,
E. M. Esmer,
Mesut Yılmaz,
Selçuk Yalçınkaya,
Şeyma Torun,
J. Vos,
D. F. Evans,
J. C. Morales,
J. C. A. Wolf,
E. H. Olsen,
J. V. Clausen,
B. E. Helt,
C. T. K. Lý,
O. Stahl,
R. Wells,
M. Herath,
U. G. Jørgensen,
M. Dominik,
J. Skottfelt
, et al. (7 additional authors not shown)
Abstract:
The binary star NY Hya is a bright, detached, double-lined eclipsing system with an orbital period of just under five days with two components each nearly identical to the Sun and located in the solar neighbourhood.
The objective of this study is to test and confront various stellar evolution models for solar-type stars based on accurate measurements of stellar mass and radius.
We present new…
▽ More
The binary star NY Hya is a bright, detached, double-lined eclipsing system with an orbital period of just under five days with two components each nearly identical to the Sun and located in the solar neighbourhood.
The objective of this study is to test and confront various stellar evolution models for solar-type stars based on accurate measurements of stellar mass and radius.
We present new ground-based spectroscopic and photometric as well as high-precision space-based photometric and astrometric data from which we derive orbital as well as physical properties of the components via the method of least-squares minimisation based on a standard binary model valid for two detached components. Classic statistical techniques were invoked to test the significance of model parameters. Additional empirical evidence was compiled from the public domain; the derived system properties were compared with archival broad-band photometry data enabling a measurement of the system's spectral energy distribution that allowed an independent estimate of stellar properties. We also utilised semi-empirical calibration methods to derive atmospheric properties from Strömgren photometry and related colour indices. Data was used to confront the observed physical properties with classic and magnetic stellar evolution models.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
MindSet: Vision. A toolbox for testing DNNs on key psychological experiments
Authors:
Valerio Biscione,
Dong Yin,
Gaurav Malhotra,
Marin Dujmovic,
Milton L. Montero,
Guillermo Puebla,
Federico Adolfi,
Rachel F. Heaton,
John E. Hummel,
Benjamin D. Evans,
Karim Habashy,
Jeffrey S. Bowers
Abstract:
Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbo…
▽ More
Multiple benchmarks have been developed to assess the alignment between deep neural networks (DNNs) and human vision. In almost all cases these benchmarks are observational in the sense they are composed of behavioural and brain responses to naturalistic images that have not been manipulated to test hypotheses regarding how DNNs or humans perceive and identify objects. Here we introduce the toolbox MindSet: Vision, consisting of a collection of image datasets and related scripts designed to test DNNs on 30 psychological findings. In all experimental conditions, the stimuli are systematically manipulated to test specific hypotheses regarding human visual perception and object recognition. In addition to providing pre-generated datasets of images, we provide code to regenerate these datasets, offering many configurable parameters which greatly extend the dataset versatility for different research contexts, and code to facilitate the testing of DNNs on these image datasets using three different methods (similarity judgments, out-of-distribution classification, and decoder method), accessible at https://github.com/MindSetVision/mindset-vision. We test ResNet-152 on each of these methods as an example of how the toolbox can be used.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Addressing Both Statistical and Causal Gender Fairness in NLP Models
Authors:
Hannah Chen,
Yangfeng Ji,
David Evans
Abstract:
Statistical fairness stipulates equivalent outcomes for every protected group, whereas causal fairness prescribes that a model makes the same prediction for an individual regardless of their protected characteristics. Counterfactual data augmentation (CDA) is effective for reducing bias in NLP models, yet models trained with CDA are often evaluated only on metrics that are closely tied to the caus…
▽ More
Statistical fairness stipulates equivalent outcomes for every protected group, whereas causal fairness prescribes that a model makes the same prediction for an individual regardless of their protected characteristics. Counterfactual data augmentation (CDA) is effective for reducing bias in NLP models, yet models trained with CDA are often evaluated only on metrics that are closely tied to the causal fairness notion; similarly, sampling-based methods designed to promote statistical fairness are rarely evaluated for causal fairness. In this work, we evaluate both statistical and causal debiasing methods for gender bias in NLP models, and find that while such methods are effective at reducing bias as measured by the targeted metric, they do not necessarily improve results on other bias metrics. We demonstrate that combinations of statistical and causal debiasing techniques are able to reduce bias measured through both types of metrics.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
Magnetoelectric coupling at the domain level in polycrystalline ErMnO3
Authors:
J. Schultheiß,
L. Puntigam,
M. Winkler,
S. Krohns,
D. Meier,
H. Das,
D. M. Evans,
I. Kézsmárki
Abstract:
We explore the impact of a magnetic field on the ferroelectric domain pattern in polycrystalline hexagonal ErMnO3 at cryogenic temperatures. Utilizing piezoelectric force microscopy measurements at 1.65 K, we observe modifications of the topologically protected ferroelectric domain structure induced by the magnetic field. These alterations likely result from strain induced by the magnetic field, f…
▽ More
We explore the impact of a magnetic field on the ferroelectric domain pattern in polycrystalline hexagonal ErMnO3 at cryogenic temperatures. Utilizing piezoelectric force microscopy measurements at 1.65 K, we observe modifications of the topologically protected ferroelectric domain structure induced by the magnetic field. These alterations likely result from strain induced by the magnetic field, facilitated by intergranular coupling in polycrystalline multiferroics. Our findings give insights into the interplay between electric and magnetic properties at the local scale and represent a so far unexplored pathway for manipulating topologically protected ferroelectric vortex patterns in hexagonal manganites.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
KIAS Lectures on Symplectic Aspects of Degenerations
Authors:
Jonathan David Evans
Abstract:
This is a series of three lectures I gave at the Korea Institute of Advanced Study in June 2019 at a workshop about "Algebraic and Symplectic Aspects of Degenerations of Complex Surfaces". I focus on the symplectic aspects, in particular on the case of cyclic quotient surface singularities. These notes have been available on a public Git repository since 2019, and I noticed that people occasionall…
▽ More
This is a series of three lectures I gave at the Korea Institute of Advanced Study in June 2019 at a workshop about "Algebraic and Symplectic Aspects of Degenerations of Complex Surfaces". I focus on the symplectic aspects, in particular on the case of cyclic quotient surface singularities. These notes have been available on a public Git repository since 2019, and I noticed that people occasionally cited them in the years since. For that reason, I decided to post them on arXiv for a more permanent record; I have made some small corrections and annotations but otherwise they are unchanged. These notes are a purely expository account of stuff I was thinking about 2016-2019, and are largely self-aggrandising.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Abundances of Neutron-Capture Elements in 62 Stars in the Globular Cluster Messier 15
Authors:
Jonathan Cabrera Garcia,
Charli M. Sakari,
Ian U. Roederer,
Donavon W. Evans,
Pedro Silva,
Mario Mateo,
Ying-Yi Song,
Anthony Kremin,
John I. Bailey III,
Matthew G. Walker
Abstract:
M15 is a globular cluster with a known spread in neutron-capture elements. This paper presents abundances of neutron-capture elements for 62 stars in M15. Spectra were obtained with the Michigan/Magellan Fiber System (M2FS) spectrograph, covering a wavelength range from ~4430-4630 A. Spectral lines from Fe I, Fe II, Sr I, Zr II, Ba II, La II, Ce II, Nd II, Sm II, Eu II, and Dy II, were measured, e…
▽ More
M15 is a globular cluster with a known spread in neutron-capture elements. This paper presents abundances of neutron-capture elements for 62 stars in M15. Spectra were obtained with the Michigan/Magellan Fiber System (M2FS) spectrograph, covering a wavelength range from ~4430-4630 A. Spectral lines from Fe I, Fe II, Sr I, Zr II, Ba II, La II, Ce II, Nd II, Sm II, Eu II, and Dy II, were measured, enabling classifications and neutron-capture abundance patterns for the stars. Of the 62 targets, 44 are found to be highly Eu-enhanced r-II stars, another 17 are moderately Eu-enhanced r-I stars, and one star is found to have an s-process signature. The neutron-capture patterns indicate that the majority of the stars are consistent with enrichment by the r-process. The 62 target stars are found to show significant star-to-star spreads in Sr, Zr, Ba, La, Ce, Nd, Sm, Eu, and Dy, but no significant spread in Fe. The neutron-capture abundances are further found to have slight correlations with sodium abundances from the literature, unlike what has been previously found; follow-up studies are needed to verify this result. The findings in this paper suggest that the Eu-enhanced stars in M15 were enhanced by the same process, that the nucleosynthetic source of this Eu pollution was the r-process, and that the r-process source occurred as the first generation of cluster stars was forming.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
Unifying F1TENTH Autonomous Racing: Survey, Methods and Benchmarks
Authors:
Benjamin David Evans,
Raphael Trumpp,
Marco Caccamo,
Felix Jahncke,
Johannes Betz,
Hendrik Willem Jordaan,
Herman Arnold Engelbrecht
Abstract:
The F1TENTH autonomous driving platform, consisting of 1:10-scale remote-controlled cars, has evolved into a well-established education and research platform. The many publications and real-world competitions span many domains, from classical path planning to novel learning-based algorithms. Consequently, the field is wide and disjointed, hindering direct comparison of developed methods and making…
▽ More
The F1TENTH autonomous driving platform, consisting of 1:10-scale remote-controlled cars, has evolved into a well-established education and research platform. The many publications and real-world competitions span many domains, from classical path planning to novel learning-based algorithms. Consequently, the field is wide and disjointed, hindering direct comparison of developed methods and making it difficult to assess the state-of-the-art. Therefore, we aim to unify the field by surveying current approaches, describing common methods, and providing benchmark results to facilitate clear comparisons and establish a baseline for future work. This research aims to survey past and current work with F1TENTH vehicles in the classical and learning categories and explain the different solution approaches. We describe particle filter localisation, trajectory optimisation and tracking, model predictive contouring control, follow-the-gap, and end-to-end reinforcement learning. We provide an open-source evaluation of benchmark methods and investigate overlooked factors of control frequency and localisation accuracy for classical methods as well as reward signal and training map for learning methods. The evaluation shows that the optimisation and tracking method achieves the fastest lap times, followed by the online planning approach. Finally, our work identifies and outlines the relevant research aspects to help motivate future work in the F1TENTH domain.
△ Less
Submitted 25 April, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Do Membership Inference Attacks Work on Large Language Models?
Authors:
Michael Duan,
Anshuman Suri,
Niloofar Mireshghallah,
Sewon Min,
Weijia Shi,
Luke Zettlemoyer,
Yulia Tsvetkov,
Yejin Choi,
David Evans,
Hannaneh Hajishirzi
Abstract:
Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile…
▽ More
Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile, ranging from 160M to 12B parameters. We find that MIAs barely outperform random guessing for most settings across varying LLM sizes and domains. Our further analyses reveal that this poor performance can be attributed to (1) the combination of a large dataset and few training iterations, and (2) an inherently fuzzy boundary between members and non-members. We identify specific settings where LLMs have been shown to be vulnerable to membership inference and show that the apparent success in such settings can be attributed to a distribution shift, such as when members and non-members are drawn from the seemingly identical domain but with different temporal ranges. We release our code and data as a unified benchmark package that includes all existing MIAs, supporting future work.
△ Less
Submitted 16 September, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
The Influence of Chemical Strains on the Electrocaloric Response, Polarization Morphology, Tetragonality and Negative Capacitance Effect of Ferroelectric Core-Shell Nanorods and Nanowires
Authors:
Anna N. Morozovska,
Eugene A. Eliseev,
Olha A. Kovalenko,
Dean R. Evans
Abstract:
Using Landau-Ginzburg-Devonshire (LGD) approach we proposed the analytical description of the chemical strains influence on the spontaneous polarization and electrocaloric response in ferroelectric core-shell nanorods. We postulate that the nanorod core presents a defect-free single-crystalline ferroelectric material, and the elastic defects are accumulated in the ultra-thin shell, where they can…
▽ More
Using Landau-Ginzburg-Devonshire (LGD) approach we proposed the analytical description of the chemical strains influence on the spontaneous polarization and electrocaloric response in ferroelectric core-shell nanorods. We postulate that the nanorod core presents a defect-free single-crystalline ferroelectric material, and the elastic defects are accumulated in the ultra-thin shell, where they can induce tensile or compressive chemical strains. The finite element modeling (FEM) based on the LGD approach reveals transitions of domain structure morphology induced by the chemical strains in the BaTiO3 nanorods. Namely, tensile chemical strains induce and support the single-domain state in the central part of the nanorod, while the curled domain structures appear near the unscreened or partially screened ends of the rod. The vortex-like domains propagate toward the central part of the rod and fill it entirely, when the rod is covered by a shell with compressive chemical strains above some critical value. The critical value depends on the nanorod sizes, aspect ratio, and screening conditions at its ends. Both analytical theory and FEM predict that the tensile chemical strains in the shell increase the nanorod polarization, lattice tetragonality, and electrocaloric response well-above the values corresponding to the bulk material. The physical reason of the increase is the strong electrostriction coupling between the mismatch-type elastic strains induced in the core by the chemical strains in the shell. Comparison with the earlier XRD data confirmed an increase of tetragonality ratio in tensiled BaTiO3 nanorods compared to the bulk material.
△ Less
Submitted 8 April, 2024; v1 submitted 9 February, 2024;
originally announced February 2024.
-
High-performance Racing on Unmapped Tracks using Local Maps
Authors:
Benjamin David Evans,
Hendrik Willem Jordaan,
Herman Arnold Engelbrecht
Abstract:
Map-based methods for autonomous racing estimate the vehicle's location, which is used to follow a high-level plan. While map-based optimisation methods demonstrate high-performance results, they are limited by requiring a map of the environment. In contrast, mapless methods can operate in unmapped contexts since they directly process raw sensor data (often LiDAR) to calculate commands. However, a…
▽ More
Map-based methods for autonomous racing estimate the vehicle's location, which is used to follow a high-level plan. While map-based optimisation methods demonstrate high-performance results, they are limited by requiring a map of the environment. In contrast, mapless methods can operate in unmapped contexts since they directly process raw sensor data (often LiDAR) to calculate commands. However, a major limitation in mapless methods is poor performance due to a lack of optimisation. In response, we propose the local map framework that uses easily extractable, low-level features to build local maps of the visible region that form the input to optimisation-based controllers. Our local map generation extracts the visible racetrack boundaries and calculates a centreline and track widths used for planning. We evaluate our method for simulated F1Tenth autonomous racing using a two-stage trajectory optimisation and tracking strategy and a model predictive controller. Our method achieves lap times that are 8.8% faster than the Follow-The-Gap method and 3.22% faster than end-to-end neural networks due to the optimisation resulting in a faster speed profile. The local map planner is 3.28% slower than global methods that have access to an entire map of the track that can be used for planning. Critically, our approach enables high-speed autonomous racing on unmapped tracks, achieving performance similar to global methods without requiring a track map.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Post-synthesis tuning of dielectric constant via ferroelectric domain wall engineering
Authors:
L. Zhou,
L. Puntigam,
P. Lunkenheimer,
E. Bourret,
Z. Yan,
I. Kézsmárki,
D. Meier,
S. Krohns,
J. Schultheiß,
D. M. Evans
Abstract:
A promising mechanism for achieving colossal dielectric constants is to use insulating internal barrier layers, which typically form during synthesis and then remain in the material. It has recently been shown that insulating domain walls in ferroelectrics can act as such barriers. One advantage domain walls have, in comparison to stationary interfaces, is that they can be moved, offering the pote…
▽ More
A promising mechanism for achieving colossal dielectric constants is to use insulating internal barrier layers, which typically form during synthesis and then remain in the material. It has recently been shown that insulating domain walls in ferroelectrics can act as such barriers. One advantage domain walls have, in comparison to stationary interfaces, is that they can be moved, offering the potential of post-synthesis control of the dielectric constant. However, to date, direct imaging of how changes in domain wall pattern cause a change in dielectric constant within a single sample has not been realized. In this work, we demonstrate that changing the domain wall density allows the engineering of the dielectric constant in hexagonal-ErMnO3 single crystals. The changes of the domain wall density are quantified via microscopy techniques, while the dielectric constant is determined via macroscopic dielectric spectroscopy measurements. The observed changes in the dielectric constant are quantitatively consistent with the observed variation in domain wall density, implying that the insulating domain walls behave as 'ideal' capacitors connected in series. Our approach to engineer the domain wall density can be readily extended to other control methods, e.g., electric fields or mechanical stresses, providing a novel degree of flexibility to in-situ tune the dielectric constant.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
Understanding Variation in Subpopulation Susceptibility to Poisoning Attacks
Authors:
Evan Rose,
Fnu Suya,
David Evans
Abstract:
Machine learning is susceptible to poisoning attacks, in which an attacker controls a small fraction of the training data and chooses that data with the goal of inducing some behavior unintended by the model developer in the trained model. We consider a realistic setting in which the adversary with the ability to insert a limited number of data points attempts to control the model's behavior on a…
▽ More
Machine learning is susceptible to poisoning attacks, in which an attacker controls a small fraction of the training data and chooses that data with the goal of inducing some behavior unintended by the model developer in the trained model. We consider a realistic setting in which the adversary with the ability to insert a limited number of data points attempts to control the model's behavior on a specific subpopulation. Inspired by previous observations on disparate effectiveness of random label-flipping attacks on different subpopulations, we investigate the properties that can impact the effectiveness of state-of-the-art poisoning attacks against different subpopulations. For a family of 2-dimensional synthetic datasets, we empirically find that dataset separability plays a dominant role in subpopulation vulnerability for less separable datasets. However, well-separated datasets exhibit more dependence on individual subpopulation properties. We further discover that a crucial subpopulation property is captured by the difference in loss on the clean dataset between the clean model and a target model that misclassifies the subpopulation, and a subpopulation is much easier to attack if the loss difference is small. This property also generalizes to high-dimensional benchmark datasets. For the Adult benchmark dataset, we show that we can find semantically-meaningful subpopulation properties that are related to the susceptibilities of a selected group of subpopulations. The results in this paper are accompanied by a fully interactive web-based visualization of subpopulation poisoning attacks found at https://uvasrg.github.io/visualizing-poisoning
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
The Future of Astronomical Data Infrastructure: Meeting Report
Authors:
Michael R. Blanton,
Janet D. Evans,
Dara Norman,
William O'Mullane,
Adrian Price-Whelan,
Luca Rizzi,
Alberto Accomazzi,
Megan Ansdell,
Stephen Bailey,
Paul Barrett,
Steven Berukoff,
Adam Bolton,
Julian Borrill,
Kelle Cruz,
Julianne Dalcanton,
Vandana Desai,
Gregory P. Dubois-Felsmann,
Frossie Economou,
Henry Ferguson,
Bryan Field,
Dan Foreman-Mackey,
Jaime Forero-Romero,
Niall Gaffney,
Kim Gillies,
Matthew J. Graham
, et al. (47 additional authors not shown)
Abstract:
The astronomical community is grappling with the increasing volume and complexity of data produced by modern telescopes, due to difficulties in reducing, accessing, analyzing, and combining archives of data. To address this challenge, we propose the establishment of a coordinating body, an "entity," with the specific mission of enhancing the interoperability, archiving, distribution, and productio…
▽ More
The astronomical community is grappling with the increasing volume and complexity of data produced by modern telescopes, due to difficulties in reducing, accessing, analyzing, and combining archives of data. To address this challenge, we propose the establishment of a coordinating body, an "entity," with the specific mission of enhancing the interoperability, archiving, distribution, and production of both astronomical data and software. This report is the culmination of a workshop held in February 2023 on the Future of Astronomical Data Infrastructure. Attended by 70 scientists and software professionals from ground-based and space-based missions and archives spanning the entire spectrum of astronomical research, the group deliberated on the prevailing state of software and data infrastructure in astronomy, identified pressing issues, and explored potential solutions. In this report, we describe the ecosystem of astronomical data, its existing flaws, and the many gaps, duplication, inconsistencies, barriers to access, drags on productivity, missed opportunities, and risks to the long-term integrity of essential data sets. We also highlight the successes and failures in a set of deep dives into several different illustrative components of the ecosystem, included as an appendix.
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Direct imaging of spatial heterogeneities in type II superconductors
Authors:
Donald M. Evans,
Michele Conroy,
Lukas Puntigam,
Dorina Croitori,
Lilian Prodan,
James O. Douglas,
Baptiste Gault,
Vladimir Tsurkan
Abstract:
Understanding the exotic properties of quantum materials, including high-temperature superconductors, remains a formidable challenge that demands direct insights into electronic conductivity. Current methodologies either capture a bulk average or near-atomically-resolved information, missing direct measurements at the critical intermediate length scales. Here, using the superconductor Fe(Se,Te) as…
▽ More
Understanding the exotic properties of quantum materials, including high-temperature superconductors, remains a formidable challenge that demands direct insights into electronic conductivity. Current methodologies either capture a bulk average or near-atomically-resolved information, missing direct measurements at the critical intermediate length scales. Here, using the superconductor Fe(Se,Te) as a model system, we use low-temperature conductive atomic force microscopy (cAFM) to bridge this gap. Contrary to the uniform superconductivity anticipated from bulk assessments, cAFM uncovers micron-scale conductive intrusions within a relatively insulating matrix. Subsequent compositional mapping through atom probe tomography, shows that differences in conductivity correlated with local changes in composition. cAFM, supported by advanced microscopy and microanalysis, represents a methodological breakthrough that can be used to navigate the intricate landscape of high-temperature superconductors and the broader realm of quantum materials. Such fundamental information is critical for theoretical understanding and future guided design.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
SoK: Memorization in General-Purpose Large Language Models
Authors:
Valentin Hartmann,
Anshuman Suri,
Vincent Bindschaedler,
David Evans,
Shruti Tople,
Robert West
Abstract:
Large Language Models (LLMs) are advancing at a remarkable pace, with myriad applications under development. Unlike most earlier machine learning models, they are no longer built for one specific application but are designed to excel in a wide range of tasks. A major part of this success is due to their huge training datasets and the unprecedented number of model parameters, which allow them to me…
▽ More
Large Language Models (LLMs) are advancing at a remarkable pace, with myriad applications under development. Unlike most earlier machine learning models, they are no longer built for one specific application but are designed to excel in a wide range of tasks. A major part of this success is due to their huge training datasets and the unprecedented number of model parameters, which allow them to memorize large amounts of information contained in the training data. This memorization goes beyond mere language, and encompasses information only present in a few documents. This is often desirable since it is necessary for performing tasks such as question answering, and therefore an important part of learning, but also brings a whole array of issues, from privacy and security to copyright and beyond. LLMs can memorize short secrets in the training data, but can also memorize concepts like facts or writing styles that can be expressed in text in many different ways. We propose a taxonomy for memorization in LLMs that covers verbatim text, facts, ideas and algorithms, writing styles, distributional properties, and alignment goals. We describe the implications of each type of memorization - both positive and negative - for model performance, privacy, security and confidentiality, copyright, and auditing, and ways to detect and prevent memorization. We further highlight the challenges that arise from the predominant way of defining memorization with respect to model behavior instead of model weights, due to LLM-specific phenomena such as reasoning capabilities or differences between decoding algorithms. Throughout the paper, we describe potential risks and opportunities arising from memorization in LLMs that we hope will motivate new research directions.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
SoK: Pitfalls in Evaluating Black-Box Attacks
Authors:
Fnu Suya,
Anshuman Suri,
Tingwei Zhang,
Jingtao Hong,
Yuan Tian,
David Evans
Abstract:
Numerous works study black-box attacks on image classifiers. However, these works make different assumptions on the adversary's knowledge and current literature lacks a cohesive organization centered around the threat model. To systematize knowledge in this area, we propose a taxonomy over the threat space spanning the axes of feedback granularity, the access of interactive queries, and the qualit…
▽ More
Numerous works study black-box attacks on image classifiers. However, these works make different assumptions on the adversary's knowledge and current literature lacks a cohesive organization centered around the threat model. To systematize knowledge in this area, we propose a taxonomy over the threat space spanning the axes of feedback granularity, the access of interactive queries, and the quality and quantity of the auxiliary data available to the attacker. Our new taxonomy provides three key insights. 1) Despite extensive literature, numerous under-explored threat spaces exist, which cannot be trivially solved by adapting techniques from well-explored settings. We demonstrate this by establishing a new state-of-the-art in the less-studied setting of access to top-k confidence scores by adapting techniques from well-explored settings of accessing the complete confidence vector, but show how it still falls short of the more restrictive setting that only obtains the prediction label, highlighting the need for more research. 2) Identification the threat model of different attacks uncovers stronger baselines that challenge prior state-of-the-art claims. We demonstrate this by enhancing an initially weaker baseline (under interactive query access) via surrogate models, effectively overturning claims in the respective paper. 3) Our taxonomy reveals interactions between attacker knowledge that connect well to related areas, such as model inversion and extraction attacks. We discuss how advances in other areas can enable potentially stronger black-box attacks. Finally, we emphasize the need for a more realistic assessment of attack success by factoring in local attack runtime. This approach reveals the potential for certain attacks to achieve notably higher success rates and the need to evaluate attacks in diverse and harder settings, highlighting the need for better selection criteria.
△ Less
Submitted 14 February, 2024; v1 submitted 26 October, 2023;
originally announced October 2023.