Search | arXiv e-print repository

Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue

Authors: Jannatun Naim, Jie Cao, Fareen Tasneem, Jennifer Jacobs, Brent Milne, James Martin, Tamara Sumner

Abstract: Effective feedback is essential for refining instructional practices in mathematics education, and researchers often turn to advanced natural language processing (NLP) models to analyze classroom dialogues from multiple perspectives. However, utterance-level discourse analysis encounters two primary challenges: (1) multifunctionality, where a single utterance may serve multiple purposes that a sin… ▽ More Effective feedback is essential for refining instructional practices in mathematics education, and researchers often turn to advanced natural language processing (NLP) models to analyze classroom dialogues from multiple perspectives. However, utterance-level discourse analysis encounters two primary challenges: (1) multifunctionality, where a single utterance may serve multiple purposes that a single tag cannot capture, and (2) the exclusion of many utterances from domain-specific discourse move classifications, leading to their omission in feedback. To address these challenges, we proposed a multi-perspective discourse analysis that integrates domain-specific talk moves with dialogue act (using the flattened multi-functional SWBD-MASL schema with 43 tags) and discourse relation (applying Segmented Discourse Representation Theory with 16 relations). Our top-down analysis framework enables a comprehensive understanding of utterances that contain talk moves, as well as utterances that do not contain talk moves. This is applied to two mathematics education datasets: TalkMoves (teaching) and SAGA22 (tutoring). Through distributional unigram analysis, sequential talk move analysis, and multi-view deep dive, we discovered meaningful discourse patterns, and revealed the vital role of utterances without talk moves, demonstrating that these utterances, far from being mere fillers, serve crucial functions in guiding, acknowledging, and structuring classroom discourse. These insights underscore the importance of incorporating discourse relations and dialogue acts into AI-assisted education systems to enhance feedback and create more responsive learning environments. Our framework may prove helpful for providing human educator feedback, but also aiding in the development of AI agents that can effectively emulate the roles of both educators and students. △ Less

Submitted 11 May, 2025; originally announced May 2025.

Comments: Accepted to EDM'2025

arXiv:2505.06154 [pdf, ps, other]

Coherent Generation and Protection of Anticoherent Spin States

Authors: J. Denis, C. Read, J. Martin

Abstract: We report the first protocol specifically designed to generate anticoherent spin-$j$ states at various orders. The protocol consists of cycles involving a rotation pulse about one axis, followed by a squeezing pulse along a perpendicular direction. To protect these states from decoherence, we develop dynamical decoupling techniques based on group-theoretic sequence design and the dynamically corre… ▽ More We report the first protocol specifically designed to generate anticoherent spin-$j$ states at various orders. The protocol consists of cycles involving a rotation pulse about one axis, followed by a squeezing pulse along a perpendicular direction. To protect these states from decoherence, we develop dynamical decoupling techniques based on group-theoretic sequence design and the dynamically corrected gate formalism. We analyze key sources of dephasing, disorder, and dipole-dipole interactions, and evaluate the effectiveness of our methods in preserving coherence. Potential applications of the generated anticoherent spin states include quantum sensing and studies of quantum entanglement. △ Less

Submitted 9 May, 2025; originally announced May 2025.

Comments: 34 pages, 12 figures

arXiv:2505.04319 [pdf, ps, other]

Sharp bounds for the growth and distortion of the analytic part of convex harmonic functions

Authors: María J. Martín

Abstract: We obtain the sharp upper and lower bounds for the growth and distortion of the analytic parts $h$ of orientation-preserving harmonic mappings $f=h+\overline g$ (normalized in the standard way) that map the unit disk onto a convex domain. We obtain the sharp upper and lower bounds for the growth and distortion of the analytic parts $h$ of orientation-preserving harmonic mappings $f=h+\overline g$ (normalized in the standard way) that map the unit disk onto a convex domain. △ Less

Submitted 7 May, 2025; originally announced May 2025.

MSC Class: 31A05; 30C45; 30C75

arXiv:2505.03951 [pdf, ps, other]

The Lie algebra $\mathfrak{sl}_4(\mathbb C)$ and the hypercubes

Authors: William J. Martin, Paul Terwilliger

Abstract: We describe a relationship between the Lie algebra $\mathfrak{sl}_4(\mathbb C)$ and the hypercube graphs. Consider the $\mathbb C$-algebra $P$ of polynomials in four commuting variables. We turn $P$ into an $\mathfrak{sl}_4(\mathbb C)$-module on which each element of $\mathfrak{sl}_4(\mathbb C)$ acts as a derivation. Then $P$ becomes a direct sum of irreducible $\mathfrak{sl}_4(\mathbb C)$-modules… ▽ More We describe a relationship between the Lie algebra $\mathfrak{sl}_4(\mathbb C)$ and the hypercube graphs. Consider the $\mathbb C$-algebra $P$ of polynomials in four commuting variables. We turn $P$ into an $\mathfrak{sl}_4(\mathbb C)$-module on which each element of $\mathfrak{sl}_4(\mathbb C)$ acts as a derivation. Then $P$ becomes a direct sum of irreducible $\mathfrak{sl}_4(\mathbb C)$-modules $P = \sum_{N\in \mathbb N} P_N$, where $P_N$ is the $N$th homogeneous component of $P$. For $N\in \mathbb N$ we construct some additional $\mathfrak{sl}_4(\mathbb C)$-modules ${\rm Fix}(G)$ and $T$. For these modules the underlying vector space is described as follows. Let $X$ denote the vertex set of the hypercube $H(N,2)$, and let $V$ denote the $\mathbb C$-vector space with basis $X$. For the automorphism group $G$ of $H(N,2)$, the action of $G$ on $X$ turns $V$ into a $G$-module. The vector space $V^{\otimes 3} = V \otimes V \otimes V$ becomes a $G$-module such that $g(u \otimes v \otimes w)= g(u) \otimes g(v) \otimes g(w)$ for $g\in G$ and $u,v,w \in V$. The subspace ${\rm Fix}(G)$ of $V^{\otimes 3}$ consists of the vectors in $V^{\otimes 3}$ that are fixed by every element in $G$. Pick $\varkappa \in X$. The corresponding subconstituent algebra $T$ of $H(N,2)$ is the subalgebra of ${\rm End}(V)$ generated by the adjacency map $\sf A$ of $H(N,2)$ and the dual adjacency map ${\sf A}^*$ of $H(N,2)$ with respect to $\varkappa$. In our main results, we turn ${\rm Fix}(G)$ and $T$ into $\mathfrak{sl}_4(\mathbb C)$-modules, and display $\mathfrak{sl}_4(\mathbb C)$-module isomorphisms $P_N \to {\rm Fix}(G) \to T$. We describe the $\mathfrak{sl}_4(\mathbb C)$-modules $P_N$, ${\rm Fix}(G)$, $T$ from multiple points of view. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: 85 pages

MSC Class: 05E30

arXiv:2505.03441 [pdf, other]

Simultaneous global and local clustering in multiplex networks with covariate information

Authors: Joshua Corneck, Edward A. K. Cohen, James S. Martin, Lekha Patel, Kurtis W. Shuler, Francesco Sanna Passino

Abstract: Understanding both global and layer-specific group structures is useful for uncovering complex patterns in networks with multiple interaction types. In this work, we introduce a new model, the hierarchical multiplex stochastic blockmodel (HMPSBM), that simultaneously detects communities within individual layers of a multiplex network while inferring a global node clustering across the layers. A st… ▽ More Understanding both global and layer-specific group structures is useful for uncovering complex patterns in networks with multiple interaction types. In this work, we introduce a new model, the hierarchical multiplex stochastic blockmodel (HMPSBM), that simultaneously detects communities within individual layers of a multiplex network while inferring a global node clustering across the layers. A stochastic blockmodel is assumed in each layer, with probabilities of layer-level group memberships determined by a node's global group assignment. Our model uses a Bayesian framework, employing a probit stick-breaking process to construct node-specific mixing proportions over a set of shared Griffiths-Engen-McCloseky (GEM) distributions. These proportions determine layer-level community assignment, allowing for an unknown and varying number of groups across layers, while incorporating nodal covariate information to inform the global clustering. We propose a scalable variational inference procedure with parallelisable updates for application to large networks. Extensive simulation studies demonstrate our model's ability to accurately recover both global and layer-level clusters in complicated settings, and applications to real data showcase the model's effectiveness in uncovering interesting latent network structure. △ Less

Submitted 6 May, 2025; originally announced May 2025.

arXiv:2504.19713 [pdf, ps, other]

doi 10.1103/PhysRevA.111.033111

Measurement of the total spin angular momentum <Fz> of alkali-metal atoms

Authors: Runa Yasuda, Kei Ishii, Wolfgang Klassen, Jeffery W. Martin, Atsushi Hatakeyama

Abstract: It is important to evaluate the total spin angular momentum of alkali-metal atoms if the atoms serve as a reservoir of angular momenta. We use an absorption-monitoring technique to measure <Fz>, i.e., the expectation values of the quantization (z) axis components of the total angular momentum of cesium (Cs) atoms in the electronic ground state in both uncoated and anti-relaxation-coated vacuum cel… ▽ More It is important to evaluate the total spin angular momentum of alkali-metal atoms if the atoms serve as a reservoir of angular momenta. We use an absorption-monitoring technique to measure <Fz>, i.e., the expectation values of the quantization (z) axis components of the total angular momentum of cesium (Cs) atoms in the electronic ground state in both uncoated and anti-relaxation-coated vacuum cells at room temperature. Cs atoms are polarized via optical pumping and probed using their D2 transitions. The probe laser frequency is varied across the Doppler-broadened D2 transition; the <Fz> values are derived using the integrated absorption coefficients. The largest <Fz> is 2.5 for the coated cell. We then use a simple model of spin flow through vapor cells to estimate the atomic spin relaxation probabilities after a single surface collision. △ Less

Submitted 28 April, 2025; originally announced April 2025.

Comments: 24 pages, 11 figures, 2 tables. Author prepared version

Journal ref: Physical Review A 111, 033111 (2025)

arXiv:2504.17604 [pdf, other]

Measurement of the Parity-Violating Asymmetry in the N to $Δ$ Transition at Low $Q^2$

Authors: D. Adhikari, T. Alshayeb, D. Androic, D. S. Armstrong, A. Asaturyan, K. Bartlett, R. S. Beminiwattha, J. Benesch, F. Benmokhtar, R. D. Carlini, J. C. Cornejo, S. Covrig Dusa, M. M. Dalton, C. A. Davis, W. Deconinck, J. A. Dunne, D. Dutta, W. S. Duvall, M. Elaasar, W. R. Falk, J. M. Finn, C. Gal, D. Gaskell, M. T. W. Gericke, J. R. Hoskins , et al. (48 additional authors not shown)

Abstract: We report the measurement of the parity-violating asymmetry in the N to $Δ$ transition via the $e^- + p \rightarrow e^- + Δ^+$ reaction at two different kinematic points with low four-momentum transfer Q$^2$. Measurements were made with incident electron beam energies of 0.877 and 1.16 GeV, corresponding to $Q^2$ values of 0.0111 and 0.0208 (GeV/c)$^2$, respectively. These measurements put constra… ▽ More We report the measurement of the parity-violating asymmetry in the N to $Δ$ transition via the $e^- + p \rightarrow e^- + Δ^+$ reaction at two different kinematic points with low four-momentum transfer Q$^2$. Measurements were made with incident electron beam energies of 0.877 and 1.16 GeV, corresponding to $Q^2$ values of 0.0111 and 0.0208 (GeV/c)$^2$, respectively. These measurements put constraints on a low-energy constant in the weak Lagrangian, $d_Δ$, corresponding to a parity-violating electric-dipole transition matrix element. This matrix element has been shown to be large in the strangeness-changing channel, via weak hyperon decays such as $Σ^+ \rightarrow pγ$. The measurements reported here constrain $d_Δ$ in the strangeness-conserving channel. The final asymmetries were -0.65 +- 1.00 (stat.) +- 1.02 (syst) ppm (parts per million) for 0.877 GeV and -3.59 +- 0.82 (stat.) +- 1.33 (syst.} ppm for 1.16 GeV. With these results we deduce a small value for $d_Δ$, consistent with zero, in the strangeness-conserving channel, in contrast to the large value for $d_Δ$ previously reported in the strangeness-changing channel. △ Less

Submitted 24 April, 2025; originally announced April 2025.

Comments: 6 pages, 1 figure

arXiv:2504.11317 [pdf, other]

The role of non-Markovian dissipation in quantum phase transitions: tricriticality, spin squeezing, and directional symmetry breaking

Authors: Baptiste Debecker, Lukas Pausch, Jonathan Louvet, Thierry Bastin, John Martin, François Damanet

Abstract: Understanding how to control phase transitions in quantum systems is at the forefront of research for the development of new quantum materials and technologies. Here, we study how the coupling of a quantum system to a non-Markovian environment, i.e., an environment with a frequency-dependent spectral density inducing memory effects, can be used to generate and reshape phase transitions and squeezi… ▽ More Understanding how to control phase transitions in quantum systems is at the forefront of research for the development of new quantum materials and technologies. Here, we study how the coupling of a quantum system to a non-Markovian environment, i.e., an environment with a frequency-dependent spectral density inducing memory effects, can be used to generate and reshape phase transitions and squeezing in matter phases. Focusing on a Lipkin-Meshkov-Glick model, we demonstrate that non-Markovian dissipation can be leveraged to engineer tricriticality via the fusion of $2^{\mathrm{nd}}$-order and $1^{\mathrm{st}}$-order critical points. We identify phases that arise from different ways of breaking the single weak symmetry of our model, which led us to introduce the concept of \textit{directional spontaneous symmetry breaking} (DSSB) as a general framework to understand this phenomenon. We show that signatures of DSSB can be seen in the emergence of spin squeezing along different directions, and that the latter is controllable via non-Markovian effects, opening up possibilities for applications in quantum metrology. Finally, we propose an experimental implementation of our non-Markovian model in cavity QED. Our work features non-Markovianity as a resource for controlling phase transitions in general systems, and highlights shortcomings of the Markovian limit in this context. △ Less

Submitted 15 April, 2025; originally announced April 2025.

Comments: 16 pages and 7 figures

arXiv:2504.08195 [pdf, other]

Graph Based Deep Reinforcement Learning Aided by Transformers for Multi-Agent Cooperation

Authors: Michael Elrod, Niloufar Mehrabi, Rahul Amin, Manveen Kaur, Long Cheng, Jim Martin, Abolfazl Razi

Abstract: Mission planning for a fleet of cooperative autonomous drones in applications that involve serving distributed target points, such as disaster response, environmental monitoring, and surveillance, is challenging, especially under partial observability, limited communication range, and uncertain environments. Traditional path-planning algorithms struggle in these scenarios, particularly when prior… ▽ More Mission planning for a fleet of cooperative autonomous drones in applications that involve serving distributed target points, such as disaster response, environmental monitoring, and surveillance, is challenging, especially under partial observability, limited communication range, and uncertain environments. Traditional path-planning algorithms struggle in these scenarios, particularly when prior information is not available. To address these challenges, we propose a novel framework that integrates Graph Neural Networks (GNNs), Deep Reinforcement Learning (DRL), and transformer-based mechanisms for enhanced multi-agent coordination and collective task execution. Our approach leverages GNNs to model agent-agent and agent-goal interactions through adaptive graph construction, enabling efficient information aggregation and decision-making under constrained communication. A transformer-based message-passing mechanism, augmented with edge-feature-enhanced attention, captures complex interaction patterns, while a Double Deep Q-Network (Double DQN) with prioritized experience replay optimizes agent policies in partially observable environments. This integration is carefully designed to address specific requirements of multi-agent navigation, such as scalability, adaptability, and efficient task execution. Experimental results demonstrate superior performance, with 90% service provisioning and 100% grid coverage (node discovery), while reducing the average steps per episode to 200, compared to 600 for benchmark methods such as particle swarm optimization (PSO), greedy algorithms and DQN. △ Less

Submitted 10 April, 2025; originally announced April 2025.

Comments: 6 pages, 7 figures, Accepted to the 2025 IEEE International Conference on Communications Workshops (ICC Workshops)

arXiv:2504.07571 [pdf, other]

The birth of Be star disks I. From localized ejection to circularization

Authors: J. Labadie-Bartz, A. C. Carciofi, A. C. Rubio, D. Baade, R. Siverd, C. Arcos, A. L. Figueiredo, Y. Nazé, C. Neiner, T. Rivinius, N. D. Richardson, S. Nova, M. L. Pinho, S. Bhattacharyya, R. Leadbeater, J. Guarro Fló, V. Lecocq, G. Piehler, J. Kozok, U. Sollecchia, E. Bryssinck, C. Buil, J. Martin, V. Desnoux, B. Heathcote , et al. (13 additional authors not shown)

Abstract: Classical Be stars are well known to eject mass, but the details governing the initial distribution and evolution of this matter into a disk are poorly constrained by observations. By combining high-cadence spectroscopy with contemporaneous space photometry from TESS, we have sampled about 30 mass ejection events in 13 Be stars. Our goal is to constrain the geometrical and kinematic properties of… ▽ More Classical Be stars are well known to eject mass, but the details governing the initial distribution and evolution of this matter into a disk are poorly constrained by observations. By combining high-cadence spectroscopy with contemporaneous space photometry from TESS, we have sampled about 30 mass ejection events in 13 Be stars. Our goal is to constrain the geometrical and kinematic properties of the ejecta, facilitating the investigation into the initial conditions and evolution, and understanding its interactions with preexisting material. The photometric variability is analyzed together with measurements of the rapidly changing emission features to identify the onset of outburst events and obtain information about the geometry of the ejecta and its evolution. All Be stars observed with sufficiently high cadence exhibit rapid oscillations of line asymmetry with a single frequency in the days following the start of the event. The emission asymmetry cycles break down after roughly 5 - 10 cycles, with the emission line profile converging toward approximate symmetry. In photometry, several frequencies typically emerge at relatively high amplitude at some point during the mass ejection process. In all observed cases, freshly ejected material was initially within a narrow azimuthal range, indicating it was launched from a localized region on the star. The material orbits the star with a frequency consistent with the near-surface Keplerian orbital frequency. This material circularizes into a disk configuration after several orbital timescales. This is true whether or not there was a preexisting disk. We find no evidence for precursor phases prior to the ejection of mass in our sample. The several photometric frequencies that emerge during outburst are at least partially stellar in origin. (Abstract abridged) △ Less

Submitted 10 April, 2025; originally announced April 2025.

Comments: 41 pages, 31 figures, 4 tables

arXiv:2504.02689 [pdf]

Plasmon-interband hybridization and anomalous production of hot electrons in aluminum nanoantennas

Authors: Jérôme Martin, Oscar Avalos-Ovando, Thomas Simon, Gabriel Arditi, Florian Lamaze, Julien Proust, Luiz H. G. Tizei, Zhiming Wang, Mathieu Kociak, Alexander O. Govorov, Odile Stéphan, Davy Gérard

Abstract: Strong coupling typically occurs between two separate objects or between an object and its environment (such as an atom and a cavity). However, it can also occur between two different excitations within the same object, a situation that has been much less studied. In this study, we observe strong coupling between localized surface plasmon resonances and the interband transition in aluminum nanorod… ▽ More Strong coupling typically occurs between two separate objects or between an object and its environment (such as an atom and a cavity). However, it can also occur between two different excitations within the same object, a situation that has been much less studied. In this study, we observe strong coupling between localized surface plasmon resonances and the interband transition in aluminum nanorods, as evidenced by optical spectroscopy and electron energy loss spectroscopy, and corroborated with numerical simulations. Strong coupling is observed between the interband transition and multiple orders of the surface plasmon mode, including dark ones. We also obtain experimental maps of the hybrid modes at the nanoscale. In each case, the associated Rabi energy, which corresponds to the energy splitting between the two polaritonic branches, is obtained. Finally, a dedicated numerical model was employed to calculate the hot electron generation rate in the nanorods. The calculations demonstrate that efficient generation of hot electrons can be achieved in the near-infrared region, when the interband transition is strongly coupled with a plasmon resonance. This high generation rate stems from the hybrid nature of the mode, as its plasmonic component provides a high absorption cross-section, while the IT part ensures efficient conversion to hot electrons. Consequently, aluminum nanorods represent an efficient source of hot electrons in the visible and near-infrared regions, with potential applications in local photochemistry, photodetection, and solar energy harvesting. △ Less

Submitted 3 April, 2025; originally announced April 2025.

Comments: 8 figures, 1 Supplementary Information

arXiv:2504.02434 [pdf, ps, other]

Generalised Hajłasz-Besov spaces on $RD$-spaces

Authors: Joaquim Martin, Walter A. Ortiz

Abstract: An $RD$ space is a doubling measure metric space $Ω$ with the additional property that it has a reverse doubling property. In this paper we introduce a new class of Hajłasz-Besov spaces on $Ω$ and extend several results from classical theory, such as embeddings and Sobolev-type embeddings. An $RD$ space is a doubling measure metric space $Ω$ with the additional property that it has a reverse doubling property. In this paper we introduce a new class of Hajłasz-Besov spaces on $Ω$ and extend several results from classical theory, such as embeddings and Sobolev-type embeddings. △ Less

Submitted 3 April, 2025; originally announced April 2025.

MSC Class: 46E35; 46E30

arXiv:2503.24049 [pdf, other]

The Linear Collider Facility (LCF) at CERN

Authors: H. Abramowicz, E. Adli, F. Alharthi, M. Almanza-Soto, M. M. Altakach, S Ampudia Castelazo, D. Angal-Kalinin, R. B. Appleby, O. Apsimon, A. Arbey, O. Arquero, D. Attié, J. L. Avila-Jimenez, H. Baer, Y. Bai, C. Balazs, T Barklow, J. Baudot, P. Bechtle, T. Behnke, A. B. Bellerive, S. Belomestnykh, Y. Benhammou, J. Berenguer-Antequera, M. Berger , et al. (359 additional authors not shown)

Abstract: In this paper we outline a proposal for a Linear Collider Facility as the next flagship project for CERN. It offers the opportunity for a timely, cost-effective and staged construction of a new collider that will be able to comprehensively map the Higgs boson's properties, including the Higgs field potential, thanks to a large span in centre-of-mass energies and polarised beams. A comprehensive pr… ▽ More In this paper we outline a proposal for a Linear Collider Facility as the next flagship project for CERN. It offers the opportunity for a timely, cost-effective and staged construction of a new collider that will be able to comprehensively map the Higgs boson's properties, including the Higgs field potential, thanks to a large span in centre-of-mass energies and polarised beams. A comprehensive programme to study the Higgs boson and its closest relatives with high precision requires data at centre-of-mass energies from the Z pole to at least 1 TeV. It should include measurements of the Higgs boson in both major production mechanisms, ee -> ZH and ee -> vvH, precision measurements of gauge boson interactions as well as of the W boson, Higgs boson and top-quark masses, measurement of the top-quark Yukawa coupling through ee ->ttH, measurement of the Higgs boson self-coupling through HH production, and precision measurements of the electroweak couplings of the top quark. In addition, ee collisions offer discovery potential for new particles complementary to HL-LHC. △ Less

Submitted 31 March, 2025; originally announced March 2025.

Comments: Submission to the EPPSU

arXiv:2503.21004 [pdf]

Evaluating Large Language Models for Automated Clinical Abstraction in Pulmonary Embolism Registries: Performance Across Model Sizes, Versions, and Parameters

Authors: Mahmoud Alwakeel, Emory Buck, Jonathan G. Martin, Imran Aslam, Sudarshan Rajagopal, Jian Pei, Mihai V. Podgoreanu, Christopher J. Lindsell, An-Kwok Ian Wong

Abstract: Pulmonary embolism (PE) is a leading cause of cardiovascular mortality, yet our understanding of optimal management remains limited due to heterogeneous and inaccessible radiology documentation. The PERT Consortium registry standardizes PE management data but depends on resource-intensive manual abstraction. Large language models (LLMs) offer a scalable alternative for automating concept extractio… ▽ More Pulmonary embolism (PE) is a leading cause of cardiovascular mortality, yet our understanding of optimal management remains limited due to heterogeneous and inaccessible radiology documentation. The PERT Consortium registry standardizes PE management data but depends on resource-intensive manual abstraction. Large language models (LLMs) offer a scalable alternative for automating concept extraction from computed tomography PE (CTPE) reports. This study evaluated the accuracy of LLMs in extracting PE-related concepts compared to a human-curated criterion standard. We retrospectively analyzed MIMIC-IV and Duke Health CTPE reports using multiple LLaMA models. Larger models (70B) outperformed smaller ones (8B), achieving kappa values of 0.98 (PE detection), 0.65-0.75 (PE location), 0.48-0.51 (right heart strain), and 0.65-0.70 (image artifacts). Moderate temperature tuning (0.2-0.5) improved accuracy, while excessive in-context examples reduced performance. A dual-model review framework achieved >80-90% precision. LLMs demonstrate strong potential for automating PE registry abstraction, minimizing manual workload while preserving accuracy. △ Less

Submitted 26 March, 2025; originally announced March 2025.

arXiv:2503.20021 [pdf, other]

Viscous Gubser flow with conserved charges to benchmark fluid simulations

Authors: Kevin Ingles, Jordi Salinas San Martín, Willian Serenone, Jacquelyn Noronha-Hostler

Abstract: We present semi-analytical solutions for the evolution of both the temperature and chemical potentials for viscous Gubser flow with conserved charges. Such a solution can be especially useful in testing numerical codes intended to simulate relativistic fluids with large chemical potentials. The freeze-out hypersurface profiles for constant energy density are calculated, along with the correspondin… ▽ More We present semi-analytical solutions for the evolution of both the temperature and chemical potentials for viscous Gubser flow with conserved charges. Such a solution can be especially useful in testing numerical codes intended to simulate relativistic fluids with large chemical potentials. The freeze-out hypersurface profiles for constant energy density are calculated, along with the corresponding normal vectors and presented as a new unit test for numerical codes. We also compare the influence of the equation of state on the semi-analytical solutions. We benchmark the newly developed Smoothed Particle Hydrodynamics (SPH) code CCAKE that includes both shear viscosity and three conserved charges. The numerical solutions are in excellent agreement with the semi-analytical solution and also are able to accurately reproduce the hypersurface at freeze-out. △ Less

Submitted 25 March, 2025; originally announced March 2025.

Comments: 13 pages, 5 figures

arXiv:2503.20006 [pdf]

doi 10.1021/acs.jpclett.5c00885

Exploiting a Shortcoming of Coupled-Cluster Theory: The Extent of non-Hermiticity as a Diagnostic Indicator of Computational Accuracy

Authors: Kaila E. Weflen, Megan R. Bentley, James H. Thorpe, Peter R. Franke, Jan M. L. Martin, Devin A. Matthews, John F. Stanton

Abstract: The fundamental non-Hermitian nature of the forms of coupled-cluster (CC) theory widely used in quantum chemistry has usually been viewed as a negative, but the present letter shows how this can be used to advantage. Specifically, the non-symmetric nature of the reduced one-particle density matrix (in the molecular orbital basis) is advocated as a diagnostic indicator of computational quality. In… ▽ More The fundamental non-Hermitian nature of the forms of coupled-cluster (CC) theory widely used in quantum chemistry has usually been viewed as a negative, but the present letter shows how this can be used to advantage. Specifically, the non-symmetric nature of the reduced one-particle density matrix (in the molecular orbital basis) is advocated as a diagnostic indicator of computational quality. In the limit of full coupled-cluster theory (which is equivalent to full configuration interaction (FCI)), the electronic wavefunction and correlation energy are exact within a given one-particle basis set and the symmetric character of the exact density matrix is recovered. The extent of the density matrix asymmetry is shown to provide a measure of ``how difficult the problem is'' (like the well-known T$_1$ diagnostic), but its variation with level of theory also gives information about ``how well this particular method works'', irrespective of the difficulty of the problem at hand. The proposed diagnostic is described and applied to a select group of small molecules, and an example of its overall utility for the practicing quantum chemist is illustrated through its application to the beryllium dimer (Be$_2$). Future applications of this idea to excited states, open-shell systems, symmetry-breaking problems and extension of the method to the two-particle density are then proposed. △ Less

Submitted 14 May, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

Comments: Published version, Open Access CC-BY 4.0. Senior author deceased March 21, 2025; last version approved by him, revised following referee comments

Journal ref: Journal of Physical Chemistry Letters 16, 5121-5127 (2025)

arXiv:2503.19983 [pdf, other]

A Linear Collider Vision for the Future of Particle Physics

Authors: H. Abramowicz, E. Adli, F. Alharthi, M. Almanza-Soto, M. M. Altakach, S Ampudia Castelazo, D. Angal-Kalinin, R. B. Appleby, O. Apsimon, A. Arbey, O. Arquero, A. Aryshev, S. Asai, D. Attié, J. L. Avila-Jimenez, H. Baer, J. A. Bagger, Y. Bai, I. R. Bailey, C. Balazs, T Barklow, J. Baudot, P. Bechtle, T. Behnke, A. B. Bellerive , et al. (391 additional authors not shown)

Abstract: In this paper we review the physics opportunities at linear $e^+e^-$ colliders with a special focus on high centre-of-mass energies and beam polarisation, take a fresh look at the various accelerator technologies available or under development and, for the first time, discuss how a facility first equipped with a technology mature today could be upgraded with technologies of tomorrow to reach much… ▽ More In this paper we review the physics opportunities at linear $e^+e^-$ colliders with a special focus on high centre-of-mass energies and beam polarisation, take a fresh look at the various accelerator technologies available or under development and, for the first time, discuss how a facility first equipped with a technology mature today could be upgraded with technologies of tomorrow to reach much higher energies and/or luminosities. In addition, we will discuss detectors and alternative collider modes, as well as opportunities for beyond-collider experiments and R\&D facilities as part of a linear collider facility (LCF). The material of this paper will support all plans for $e^+e^-$ linear colliders and additional opportunities they offer, independently of technology choice or proposed site, as well as R\&D for advanced accelerator technologies. This joint perspective on the physics goals, early technologies and upgrade strategies has been developed by the LCVision team based on an initial discussion at LCWS2024 in Tokyo and a follow-up at the LCVision Community Event at CERN in January 2025. It heavily builds on decades of achievements of the global linear collider community, in particular in the context of CLIC and ILC. △ Less

Submitted 31 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

Comments: Community document for EPPSU, will be updated several times

arXiv:2503.13463 [pdf, other]

doi 10.1007/978-3-031-49008-8_7

Completeness of Datasets Documentation on ML/AI repositories: an Empirical Investigation

Authors: Marco Rondina, Antonio Vetrò, Juan Carlos De Martin

Abstract: ML/AI is the field of computer science and computer engineering that arguably received the most attention and funding over the last decade. Data is the key element of ML/AI, so it is becoming increasingly important to ensure that users are fully aware of the quality of the datasets that they use, and of the process generating them, so that possible negative impacts on downstream effects can be tra… ▽ More ML/AI is the field of computer science and computer engineering that arguably received the most attention and funding over the last decade. Data is the key element of ML/AI, so it is becoming increasingly important to ensure that users are fully aware of the quality of the datasets that they use, and of the process generating them, so that possible negative impacts on downstream effects can be tracked, analysed, and, where possible, mitigated. One of the tools that can be useful in this perspective is dataset documentation. The aim of this work is to investigate the state of dataset documentation practices, measuring the completeness of the documentation of several popular datasets in ML/AI repositories. We created a dataset documentation schema -- the Documentation Test Sheet (DTS) -- that identifies the information that should always be attached to a dataset (to ensure proper dataset choice and informed use), according to relevant studies in the literature. We verified 100 popular datasets from four different repositories with the DTS to investigate which information was present. Overall, we observed a lack of relevant documentation, especially about the context of data collection and data processing, highlighting a paucity of transparency. △ Less

Submitted 10 February, 2025; originally announced March 2025.

Journal ref: Progress in Artificial Intelligence. EPIA 2023. Lecture Notes in Computer Science(), vol 14115. Springer, Cham

arXiv:2503.11881 [pdf]

GPT's Devastated and LLaMA's Content: Emotion Representation Alignment in LLMs for Keyword-based Generation

Authors: Shadab Choudhury, Asha Kumar, Lara J. Martin

Abstract: In controlled text generation using large language models (LLMs), gaps arise between the language model's interpretation and human expectations. We look at the problem of controlling emotions in keyword-based sentence generation for both GPT-4 and LLaMA-3. We selected four emotion representations: Words, Valence-Arousal-Dominance (VAD) dimensions expressed in both Lexical and Numeric forms, and Em… ▽ More In controlled text generation using large language models (LLMs), gaps arise between the language model's interpretation and human expectations. We look at the problem of controlling emotions in keyword-based sentence generation for both GPT-4 and LLaMA-3. We selected four emotion representations: Words, Valence-Arousal-Dominance (VAD) dimensions expressed in both Lexical and Numeric forms, and Emojis. Our human evaluation looked at the Human-LLM alignment for each representation, as well as the accuracy and realism of the generated sentences. While representations like VAD break emotions into easy-to-compute components, our findings show that people agree more with how LLMs generate when conditioned on English words (e.g., "angry") rather than VAD scales. This difference is especially visible when comparing Numeric VAD to words. However, we found that converting the originally-numeric VAD scales to Lexical scales (e.g., +4.0 becomes "High") dramatically improved agreement. Furthermore, the perception of how much a generated sentence conveys an emotion is highly dependent on the LLM, representation type, and which emotion it is. △ Less

Submitted 14 March, 2025; originally announced March 2025.

arXiv:2503.07767 [pdf, other]

Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration

Authors: Yehyun Suh, J. Ryan Martin, Daniel Moyer

Abstract: This paper presents an approach for improving 2D/3D pelvis registration in optimization-based pose estimators using a learned initialization function. Current methods often fail to converge to the optimal solution when initialized naively. We find that even a coarse initializer greatly improves pose estimator accuracy, and improves overall computational efficiency. This approach proves to be effec… ▽ More This paper presents an approach for improving 2D/3D pelvis registration in optimization-based pose estimators using a learned initialization function. Current methods often fail to converge to the optimal solution when initialized naively. We find that even a coarse initializer greatly improves pose estimator accuracy, and improves overall computational efficiency. This approach proves to be effective also in challenging cases under more extreme pose variation. Experimental validation demonstrates that our method consistently achieves robust and accurate registration, enhancing the reliability of 2D/3D registration for clinical applications. △ Less

Submitted 10 March, 2025; originally announced March 2025.

arXiv:2503.07763 [pdf, other]

2D/3D Registration of Acetabular Hip Implants Under Perspective Projection and Fully Differentiable Ellipse Fitting

Authors: Yehyun Suh, J. Ryan Martin, Daniel Moyer

Abstract: This paper presents a novel method for estimating the orientation and the position of acetabular hip implants in total hip arthroplasty using full anterior-posterior hip fluoroscopy images. Our method accounts for distortions induced in the fluoroscope geometry, estimating acetabular component pose by creating a forward model of the perspective projection and implementing differentiable ellipse fi… ▽ More This paper presents a novel method for estimating the orientation and the position of acetabular hip implants in total hip arthroplasty using full anterior-posterior hip fluoroscopy images. Our method accounts for distortions induced in the fluoroscope geometry, estimating acetabular component pose by creating a forward model of the perspective projection and implementing differentiable ellipse fitting for the similarity of our estimation from the ground truth. This approach enables precise estimation of the implant's rotation (anteversion, inclination) and the translation under the fluoroscope induced deformation. Experimental results from both numerically simulated and digitally reconstructed radiograph environments demonstrate high accuracy with minimal computational demands, offering enhanced precision and applicability in clinical and surgical settings. △ Less

Submitted 10 March, 2025; originally announced March 2025.

arXiv:2502.20472 [pdf, other]

doi 10.1051/0004-6361/202453385

Simple molecules and complex chemistry in a protoplanetary disk: A JWST investigation of the highly inclined disk d216-0939

Authors: Alexey Potapov, Hendrik Linz, Jeroen Bouwman, Will Rocha, Johannes Martin, Sebastian Wolf, Thomas Henning, Hiroshi Terada

Abstract: While the number of detected molecules, particularly complex organic molecules, in the solid-state in astrophysical environments is still rather limited, laboratory experiments and astrochemical models predict many potential candidates. Detection of molecules in protoplanetary disks provides a bridge between the chemical evolution of the interstellar medium and the chemistry of planets and their a… ▽ More While the number of detected molecules, particularly complex organic molecules, in the solid-state in astrophysical environments is still rather limited, laboratory experiments and astrochemical models predict many potential candidates. Detection of molecules in protoplanetary disks provides a bridge between the chemical evolution of the interstellar medium and the chemistry of planets and their atmospheres. The excellent spectral sensitivity, broad wavelength coverage and high spatial resolution of the James Webb Space Telescope (JWST) allows for making progress in exploring chemical compositions of various astrophysical environments including planet-forming disks. They are a prerequisite for probing the disk content by means of sensitive absorption studies. In this paper, we present initial results of the JWST Cycle 1 GO program 1741 on d216-0939, a highly inclined TTauri disk located in the outskirts of the Orion Nebula Cluster. We utilise the NIRSpec and MIRI integral field unit spectrographs to cover its spectrum from 1.7 to 28~$μ$m. In the d216-0939 disk, we give assignments of the composition of silicate grains. We unambiguously detect solid-state features of H$_2$O, CO$_2$, $^{13}$CO$_2$, CO, OCN$^-$, and tentatively OCS; species that had been detected recently also in other circumstellar disks. For the first time in disks, we provide unique detections of ices carrying NH$_4^+$ and the complex organic molecule ammonium carbamate (NH$_4^+$NH$_2$COO$^-$). The latter detections speak for a very efficient NH$_3$ chemistry in the disk. We also show the very important role of scattering in the analysis of observational spectra of highly inclined disks. △ Less

Submitted 27 February, 2025; originally announced February 2025.

Comments: 10 pages, 7 figures, 6 tables, accepted by A&A on February 20, 2025

Journal ref: A&A 697, A53 (2025)

arXiv:2502.17002 [pdf, other]

Neutron multiplicity measurement in muon capture on oxygen nuclei in the Gd-loaded Super-Kamiokande detector

Authors: The Super-Kamiokande Collaboration, :, S. Miki, K. Abe, S. Abe, Y. Asaoka, C. Bronner, M. Harada, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Mine, M. Miura, S. Moriyama, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto , et al. (265 additional authors not shown)

Abstract: In recent neutrino detectors, neutrons produced in neutrino reactions play an important role. Muon capture on oxygen nuclei is one of the processes that produce neutrons in water Cherenkov detectors. We measured neutron multiplicity in the process using cosmic ray muons that stop in the gadolinium-loaded Super-Kamiokande detector. For this measurement, neutron detection efficiency is obtained with… ▽ More In recent neutrino detectors, neutrons produced in neutrino reactions play an important role. Muon capture on oxygen nuclei is one of the processes that produce neutrons in water Cherenkov detectors. We measured neutron multiplicity in the process using cosmic ray muons that stop in the gadolinium-loaded Super-Kamiokande detector. For this measurement, neutron detection efficiency is obtained with the muon capture events followed by gamma rays to be $50.2^{+2.0}_{-2.1}\%$. By fitting the observed multiplicity considering the detection efficiency, we measure neutron multiplicity in muon capture as $P(0)=24\pm3\%$, $P(1)=70^{+3}_{-2}\%$, $P(2)=6.1\pm0.5\%$, $P(3)=0.38\pm0.09\%$. This is the first measurement of the multiplicity of neutrons associated with muon capture without neutron energy threshold. △ Less

Submitted 24 February, 2025; originally announced February 2025.

arXiv:2502.16884 [pdf, other]

Critical Dynamics of the Anderson Transition on Small-World Graphs

Authors: Weitao Chen, Ignacio García-Mata, John Martin, Jiangbin Gong, Bertrand Georgeot, Gabriel Lemarié

Abstract: The Anderson transition on random graphs draws interest through its resemblance to the many-body localization (MBL) transition with similarly debated properties. In this Letter, we construct a unitary Anderson model on Small-World graphs to characterize long time and large size wave-packet dynamics across the Anderson transition. We reveal the logarithmically slow non-ergodic dynamics in the criti… ▽ More The Anderson transition on random graphs draws interest through its resemblance to the many-body localization (MBL) transition with similarly debated properties. In this Letter, we construct a unitary Anderson model on Small-World graphs to characterize long time and large size wave-packet dynamics across the Anderson transition. We reveal the logarithmically slow non-ergodic dynamics in the critical regime, confirming recent random matrix predictions. Our data clearly indicate two localization times: an average localization time that diverges, while the typical one saturates. In the delocalized regime, the dynamics are initially non-ergodic but cross over to ergodic diffusion at long times and large distances. Finite-time scaling then allows us to characterize the critical dynamical properties: the logarithm of the average localization time diverges algebraically, while the ergodic time diverges exponentially. Our results could be used to clarify the dynamical properties of MBL and could guide future experiments with quantum simulators. △ Less

Submitted 24 February, 2025; originally announced February 2025.

Comments: 5 pages, 3 figures

arXiv:2502.06439 [pdf, other]

Testing software for non-discrimination: an updated and extended audit in the Italian car insurance domain

Authors: Marco Rondina, Antonio Vetrò, Riccardo Coppola, Oumaima Regragrui, Alessandro Fabris, Gianmaria Silvello, Gian Antonio Susto, Juan Carlos De Martin

Abstract: Context. As software systems become more integrated into society's infrastructure, the responsibility of software professionals to ensure compliance with various non-functional requirements increases. These requirements include security, safety, privacy, and, increasingly, non-discrimination. Motivation. Fairness in pricing algorithms grants equitable access to basic services without discriminat… ▽ More Context. As software systems become more integrated into society's infrastructure, the responsibility of software professionals to ensure compliance with various non-functional requirements increases. These requirements include security, safety, privacy, and, increasingly, non-discrimination. Motivation. Fairness in pricing algorithms grants equitable access to basic services without discriminating on the basis of protected attributes. Method. We replicate a previous empirical study that used black box testing to audit pricing algorithms used by Italian car insurance companies, accessible through a popular online system. With respect to the previous study, we enlarged the number of tests and the number of demographic variables under analysis. Results. Our work confirms and extends previous findings, highlighting the problematic permanence of discrimination across time: demographic variables significantly impact pricing to this day, with birthplace remaining the main discriminatory factor against individuals not born in Italian cities. We also found that driver profiles can determine the number of quotes available to the user, denying equal opportunities to all. Conclusion. The study underscores the importance of testing for non-discrimination in software systems that affect people's everyday lives. Performing algorithmic audits over time makes it possible to evaluate the evolution of such algorithms. It also demonstrates the role that empirical software engineering can play in making software systems more accountable. △ Less

Submitted 10 February, 2025; originally announced February 2025.

Comments: 14 pages, 1 figure

arXiv:2502.06341 [pdf, other]

doi 10.1007/978-3-031-74630-7_10

Facial Analysis Systems and Down Syndrome

Authors: Marco Rondina, Fabiana Vinci, Antonio Vetrò, Juan Carlos De Martin

Abstract: The ethical, social and legal issues surrounding facial analysis technologies have been widely debated in recent years. Key critics have argued that these technologies can perpetuate bias and discrimination, particularly against marginalized groups. We contribute to this field of research by reporting on the limitations of facial analysis systems with the faces of people with Down syndrome: this p… ▽ More The ethical, social and legal issues surrounding facial analysis technologies have been widely debated in recent years. Key critics have argued that these technologies can perpetuate bias and discrimination, particularly against marginalized groups. We contribute to this field of research by reporting on the limitations of facial analysis systems with the faces of people with Down syndrome: this particularly vulnerable group has received very little attention in the literature so far. This study involved the creation of a specific dataset of face images. An experimental group with faces of people with Down syndrome, and a control group with faces of people who are not affected by the syndrome. Two commercial tools were tested on the dataset, along three tasks: gender recognition, age prediction and face labelling. The results show an overall lower accuracy of prediction in the experimental group, and other specific patterns of performance differences: i) high error rates in gender recognition in the category of males with Down syndrome; ii) adults with Down syndrome were more often incorrectly labelled as children; iii) social stereotypes are propagated in both the control and experimental groups, with labels related to aesthetics more often associated with women, and labels related to education level and skills more often associated with men. These results, although limited in scope, shed new light on the biases that alter face classification when applied to faces of people with Down syndrome. They confirm the structural limitation of the technology, which is inherently dependent on the datasets used to train the models. △ Less

Submitted 10 February, 2025; originally announced February 2025.

Journal ref: Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2023. Communications in Computer and Information Science, vol 2133. Springer, Cham

arXiv:2502.04876 [pdf, ps, other]

Ultraviolet Renormalization of Spin Boson Models I. Normal and 2-Nilpotent Interactions

Authors: Benjamin Hinrichs, Jonas Lampart, Javier Valentín Martín

Abstract: We study the ultraviolet problem for models of a finite-dimensional quantum mechanical system linearly coupled to a bosonic quantum field, such as the (many-)spin boson model or its rotating-wave approximation. If the state change of the system upon emission or absorption of a boson is either given by a normal matrix or by a 2-nilpotent one, which is the case for the previously named examples, we… ▽ More We study the ultraviolet problem for models of a finite-dimensional quantum mechanical system linearly coupled to a bosonic quantum field, such as the (many-)spin boson model or its rotating-wave approximation. If the state change of the system upon emission or absorption of a boson is either given by a normal matrix or by a 2-nilpotent one, which is the case for the previously named examples, we prove an optimal renormalization result. We complement it, by proving the norm resolvent convergence of appropriately regularized models to the renormalized one. Our method consists of a dressing transformation argument in the normal case and an appropriate interior boundary condition for the 2-nilpotent case. △ Less

Submitted 7 February, 2025; originally announced February 2025.

Comments: 18 pages

arXiv:2502.04618 [pdf, other]

Robust Quantum Control for Bragg Pulse Design in Atom Interferometry

Authors: Luke S. Baker, Andre Luiz P. de Lima, Andrew Harter, Ceren Uzun, Jr-Shin Li, Anatoly Zlotnik, Michael J. Martin, Malcolm G. Boshier

Abstract: We formulate a robust optimal control algorithm to synthesize minimum energy pulses that can transfer a cold atom system into various momentum states. The algorithm uses adaptive linearization of the evolution operator and sequential quadratic programming to iterate the control towards a minimum energy signal that achieves optimal target state fidelity. Robustness to parameter variation is achieve… ▽ More We formulate a robust optimal control algorithm to synthesize minimum energy pulses that can transfer a cold atom system into various momentum states. The algorithm uses adaptive linearization of the evolution operator and sequential quadratic programming to iterate the control towards a minimum energy signal that achieves optimal target state fidelity. Robustness to parameter variation is achieved using Legendre polynomial approximation over the domain of variation. The method is applied to optimize the Bragg beamsplitting operation in ultra-cold atom interferometry. Even in the presence of 10-40% variability in the initial momentum dispersion of the atomic cloud and the intensity of the optical pulse, the algorithm reliably converges to a control protocol that robustly achieves unprecedented momentum levels with high fidelity for a single-frequency multi-photon Bragg diffraction scheme (e.g. $|\pm 40\hbar k\rangle$). Advantages of the proposed method are demonstrated by comparison to stochastic optimization using sampled parameter values. △ Less

Submitted 10 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

Report number: LA-UR-25-20049 MSC Class: 49M37; 78A37; 81V80

arXiv:2502.02345 [pdf, other]

Optimal Subspace Inference for the Laplace Approximation of Bayesian Neural Networks

Authors: Josua Faller, Jörg Martin

Abstract: Subspace inference for neural networks assumes that a subspace of their parameter space suffices to produce a reliable uncertainty quantification. In this work, we mathematically derive the optimal subspace model to a Bayesian inference scenario based on the Laplace approximation. We demonstrate empirically that, in the optimal case, often a fraction of parameters less than 1% is sufficient to obt… ▽ More Subspace inference for neural networks assumes that a subspace of their parameter space suffices to produce a reliable uncertainty quantification. In this work, we mathematically derive the optimal subspace model to a Bayesian inference scenario based on the Laplace approximation. We demonstrate empirically that, in the optimal case, often a fraction of parameters less than 1% is sufficient to obtain a reliable estimate of the full Laplace approximation. Since the optimal solution is derived, we can evaluate all other subspace models against a baseline. In addition, we give an approximation of our method that is applicable to larger problem settings, in which the optimal solution is not computable, and compare it to existing subspace models from the literature. In general, our approximation scheme outperforms previous work. Furthermore, we present a metric to qualitatively compare different subspace models even if the exact Laplace approximation is unknown. △ Less

Submitted 4 February, 2025; originally announced February 2025.

Comments: for associated code, see https://github.com/josh3142/LowRankLaplaceApproximation

arXiv:2501.19012 [pdf, ps, other]

Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities

Authors: Arjun Krishna, Erick Galinkin, Leon Derczynski, Jeffrey Martin

Abstract: Large Language Models (LLMs) have become an essential tool in the programmer's toolkit, but their tendency to hallucinate code can be used by malicious actors to introduce vulnerabilities to broad swathes of the software supply chain. In this work, we analyze package hallucination behaviour in LLMs across popular programming languages examining both existing package references and fictional depend… ▽ More Large Language Models (LLMs) have become an essential tool in the programmer's toolkit, but their tendency to hallucinate code can be used by malicious actors to introduce vulnerabilities to broad swathes of the software supply chain. In this work, we analyze package hallucination behaviour in LLMs across popular programming languages examining both existing package references and fictional dependencies. By analyzing this package hallucination behaviour we find potential attacks and suggest defensive strategies to defend against these attacks. We discover that package hallucination rate is predicated not only on model choice, but also programming language, model size, and specificity of the coding task request. The Pareto optimality boundary between code generation performance and package hallucination is sparsely populated, suggesting that coding models are not being optimized for secure code. Additionally, we find an inverse correlation between package hallucination rate and the HumanEval coding benchmark, offering a heuristic for evaluating the propensity of a model to hallucinate packages. Our metrics, findings and analyses provide a base for future models, securing AI-assisted software development workflows against package supply chain attacks. △ Less

Submitted 31 January, 2025; originally announced January 2025.

arXiv:2501.17951 [pdf, ps, other]

doi 10.1093/comnet/cnae016

An iterative spectral algorithm for digraph clustering

Authors: James Martin, Tim Rogers, Luca Zanetti

Abstract: Graph clustering is a fundamental technique in data analysis with applications in many different fields. While there is a large body of work on clustering undirected graphs, the problem of clustering directed graphs is much less understood. The analysis is more complex in the directed graph case for two reasons: the clustering must preserve directional information in the relationships between clus… ▽ More Graph clustering is a fundamental technique in data analysis with applications in many different fields. While there is a large body of work on clustering undirected graphs, the problem of clustering directed graphs is much less understood. The analysis is more complex in the directed graph case for two reasons: the clustering must preserve directional information in the relationships between clusters, and directed graphs have non-Hermitian adjacency matrices whose properties are less conducive to traditional spectral methods. Here we consider the problem of partitioning the vertex set of a directed graph into $k\ge 2$ clusters so that edges between different clusters tend to follow the same direction. We present an iterative algorithm based on spectral methods applied to new Hermitian representations of directed graphs. Our algorithm performs favourably against the state-of-the-art, both on synthetic and real-world data sets. Additionally, it is able to identify a "meta-graph" of $k$ vertices that represents the higher-order relations between clusters in a directed graph. We showcase this capability on data sets pertaining food webs, biological neural networks, and the online card game Hearthstone. △ Less

Submitted 29 January, 2025; originally announced January 2025.

Comments: 18 pages, 8 figures

MSC Class: 91C20 (Primary) 05C50 05C82 (Secondary)

Journal ref: Journal of Complex Networks, Volume 12, Issue 2, April 2024, cnae016

arXiv:2501.17774 [pdf, other]

Percolation and localisation: Sub-leading eigenvalues of the nonbacktracking matrix

Authors: James Martin, Tim Rogers, Luca Zanetti

Abstract: The spectrum of the nonbacktracking matrix associated to a network is known to contain fundamental information regarding percolation properties of the network. Indeed, the inverse of its leading eigenvalue is often used as an estimate for the percolation threshold. However, for many networks with nonbacktracking centrality localised on a few nodes, such as networks with a core-periphery structure,… ▽ More The spectrum of the nonbacktracking matrix associated to a network is known to contain fundamental information regarding percolation properties of the network. Indeed, the inverse of its leading eigenvalue is often used as an estimate for the percolation threshold. However, for many networks with nonbacktracking centrality localised on a few nodes, such as networks with a core-periphery structure, this spectral approach badly underestimates the threshold. In this work, we study networks that exhibit this localisation effect by looking beyond the leading eigenvalue and searching deeper into the spectrum of the nonbacktracking matrix. We identify that, when localisation is present, the threshold often more closely aligns with the inverse of one of the sub-leading real eigenvalues: the largest real eigenvalue with a "delocalised" corresponding eigenvector. We investigate a core-periphery network model and determine, both theoretically and experimentally, a regime of parameters for which our approach closely approximates the threshold, while the estimate derived using the leading eigenvalue does not. We further present experimental results on large scale real-world networks that showcase the usefulness of our approach. △ Less

Submitted 29 January, 2025; originally announced January 2025.

Comments: 20 pages, 9 figures

MSC Class: 60K35 (Primary) 05C50; 05C82 (Secondary)

arXiv:2501.17532 [pdf, other]

Wireless Network Topology Inference: A Markov Chains Approach

Authors: James Martin, Tristan Pryer, Luca Zanetti

Abstract: In this work, we address the problem of inferring the topology of a wireless network using limited observational data. Specifically, we assume that we can detect when a node is transmitting, but no further information regarding the transmission is available. We propose a novel network estimation procedure grounded in the following abstract problem: estimating the parameters of a finite discrete-ti… ▽ More In this work, we address the problem of inferring the topology of a wireless network using limited observational data. Specifically, we assume that we can detect when a node is transmitting, but no further information regarding the transmission is available. We propose a novel network estimation procedure grounded in the following abstract problem: estimating the parameters of a finite discrete-time Markov chain by observing, at each time step, which states are visited by multiple ``anonymous'' copies of the chain. We develop a consistent estimator that approximates the transition matrix of the chain in the operator norm, with the number of required samples scaling roughly linearly with the size of the state space. Applying this estimation procedure to wireless networks, our numerical experiments demonstrate that the proposed method accurately infers network topology across a wide range of parameters, consistently outperforming transfer entropy, particularly under conditions of high network congestion. △ Less

Submitted 29 January, 2025; originally announced January 2025.

arXiv:2501.16729 [pdf, other]

On the Interplay Between Sparsity and Training in Deep Reinforcement Learning

Authors: Fatima Davelouis, John D. Martin, Michael Bowling

Abstract: We study the benefits of different sparse architectures for deep reinforcement learning. In particular, we focus on image-based domains where spatially-biased and fully-connected architectures are common. Using these and several other architectures of equal capacity, we show that sparse structure has a significant effect on learning performance. We also observe that choosing the best sparse archit… ▽ More We study the benefits of different sparse architectures for deep reinforcement learning. In particular, we focus on image-based domains where spatially-biased and fully-connected architectures are common. Using these and several other architectures of equal capacity, we show that sparse structure has a significant effect on learning performance. We also observe that choosing the best sparse architecture for a given domain depends on whether the hidden layer weights are fixed or learned. △ Less

Submitted 1 February, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

arXiv:2501.15644 [pdf, other]

doi 10.3847/PSJ/ada560

Characterization of of (98943) 2001 CC$_{21}$, the target of Hayabusa2$\#$

Authors: Marcel M. Popescu, Eri Tatsumi, Javier Licandro, Miguel R. Alarcon, Javier Rodríguez Rodríguez, Miquel Serra-Ricart, Julia de León, Joaquín Fernandez Martin, David Morate, Gabriel N. Simion, Bogdan Alexandru Dumitru, Daniel Nicolae Berteşteanu, George Pantelimon Prodan, Masatoshi Hirabayashi

Abstract: The near-Earth asteroid (98943) Torifune, previously designated 2001 CC$_{21}$, is the flyby target of the Hayabusa2 extended mission, nicknamed Hayabusa2$\#$ (SHARP: Small Hazardous Asteroid Reconnaissance Probe). The ground-based telescope observations offer a key science input for the mission's scientific investigation. During 2022 - 2024 this asteroid was at visible apparent magnitudes brighte… ▽ More The near-Earth asteroid (98943) Torifune, previously designated 2001 CC$_{21}$, is the flyby target of the Hayabusa2 extended mission, nicknamed Hayabusa2$\#$ (SHARP: Small Hazardous Asteroid Reconnaissance Probe). The ground-based telescope observations offer a key science input for the mission's scientific investigation. During 2022 - 2024 this asteroid was at visible apparent magnitudes brighter than 18.5, allowing for a detailed characterization using ground-based telescope observations. We determined its rotation period $P~=~5.021516\pm0.000106$ h and its absolute magnitude H = 18.78 $\pm$ 0.14 and. The large number of lightcurves allows to estimate its axes ratio, its convex shape and its pole orientation $λ= 301^{\circ} \pm 35^{\circ}$, $β= {89^{+1}_{-6}}^{\circ}$ and $ε= 5^{\circ} \pm 3^{\circ}$ which indicate a prograde rotation. We report the semi-axis of the equivalent ellipsoid, $a$ = 0.42$^{+0.08}_{0.06}$ km, $b$ = 0.16$^{+0.05}_{0.04}$ km, and $c$ = $0.17\pm0.03$ km. Consequently, the volume equivalent diameter is $D_{eq}$ = $0.44 \pm 0.06$ km . Using observations conducted simultaneously with four broadband filters, we determined $(g-r) = 0.663 \pm 0.022$ mag, $(r-i) = 0.177 \pm 0.012$ mag, and $(i-z_s) = -0.061 \pm 0.032$ mag. Additionally, we found that Torifune exhibits no detectable large-scale heterogeneity. We classified the object using a high signal-to-noise ratio spectrum (over the visible and near-infrared region) as Sq-type in the Bus-DeMeo taxonomy. We estimate a mineralogy similar to LL/L ordinary chondrites, with an ol/(ol+px) = 0.60, a Fa content of 28.5 mol$\%$, and a Fs content of 23.4 mol$\%$. The spectral data indicate a surface affected by moderate space weathering effects. △ Less

Submitted 26 January, 2025; originally announced January 2025.

Comments: Accepted for publication in PSJ, 29 page, 15 figures

arXiv:2501.10853 [pdf, other]

Quasiconvex relaxation of planar Biot-type energies and the role of determinant constraints

Authors: Robert J. Martin, Ionel-Dumitrel Ghiba, Maximilian Köhler, Daniel Balzani, Oliver Sander, Patrizio Neff

Abstract: We derive the quasiconvex relaxation of the Biot-type energy density $\lVert\sqrt{\operatorname{D}\varphi^T \operatorname{D}\varphi}-I_2\rVert^2$ for planar mappings $\varphi\colon\mathbb{R}^2\to \mathbb{R}^2$ in two different scenarios. First, we consider the case $\operatorname{D}\varphi\in\textrm{GL}^+(2)$, in which the energy can be expressed as the squared Euclidean distance… ▽ More We derive the quasiconvex relaxation of the Biot-type energy density $\lVert\sqrt{\operatorname{D}\varphi^T \operatorname{D}\varphi}-I_2\rVert^2$ for planar mappings $\varphi\colon\mathbb{R}^2\to \mathbb{R}^2$ in two different scenarios. First, we consider the case $\operatorname{D}\varphi\in\textrm{GL}^+(2)$, in which the energy can be expressed as the squared Euclidean distance $\operatorname{dist}^2(\operatorname{D}\varphi,\textrm{SO}(2))$ to the special orthogonal group $\textrm{SO}(2)$. We then allow for planar mappings with arbitrary $\operatorname{D}\varphi\in\mathbb{R}^{2\times 2}$; in the context of solid mechanics, this lack of determinant constraints on the deformation gradient would allow for self-interpenetration of matter. We demonstrate that the two resulting relaxations do not coincide and compare the analytical findings to numerical results for different relaxation approaches, including a rank-one sequential lamination algorithm, trust-region FEM calculations of representative microstructures and physics-informed neural networks. △ Less

Submitted 18 January, 2025; originally announced January 2025.

MSC Class: 74A05; 74A60; 74B20; 74G65

arXiv:2501.10601 [pdf]

Understanding Computational Science and Domain Science Skills Development in National Laboratory Graduate Internships

Authors: Morgan M. Fong, Hilary Egan, Marc Day, Kristin Potter, Michael J. Martin

Abstract: Contribution: This study presents an evaluation of federally-funded graduate internship outcomes in computational science at a national laboratory. Additionally, we present a survey instrument that may be used for other internship programs with a similar focus. Background: There is ongoing demand for computational scientists to grapple with large-scale problems such as climate change. Internships… ▽ More Contribution: This study presents an evaluation of federally-funded graduate internship outcomes in computational science at a national laboratory. Additionally, we present a survey instrument that may be used for other internship programs with a similar focus. Background: There is ongoing demand for computational scientists to grapple with large-scale problems such as climate change. Internships may help provide additional training and access to greater compute capabilities for graduate students. However, little work has been done to quantify the learning outcomes of such internships. Background: There is ongoing demand for computational scientists to grapple with large-scale problems such as climate change. Internships may help provide additional training and access to greater compute capabilities for graduate students. However, little work has been done to quantify the learning outcomes of such internships. Research Questions: What computational skills, research skills, and professional skills do graduate students improve through their internships at NREL, the national laboratory selected for the study? What sustainability and renewable energy topics do graduate students gain more familiarity with through their internships at NREL? Do graduate students' career interests change after their internships at NREL? Methodology: We developed a survey and collected responses from past participants of five federally-funded internship programs and compare participant ratings of their prior experience to their internship experience. Findings: Our results indicate participants improve their computational skills, familiarity with sustainability and renewable energy topics, and are more interested in working at national labs. Additionally, participants go on to degree programs and positions related to sustainability and renewable energy after their internships. △ Less

Submitted 17 January, 2025; originally announced January 2025.

Comments: Submission to IEEE Transactions on Education pending

MSC Class: 97 ACM Class: K.3

arXiv:2501.10351 [pdf, other]

Purcell-Enhanced, Directional Light-Matter Interaction in a Waveguide-Coupled Nanocavity

Authors: Nicholas J. Martin, Dominic Hallett, Mateusz Duda, Luke Hallacy, Elena Callus, Luke Brunswick, René Dost, Edmund Clarke, Pallavi K. Patil, Pieter Kok, Maurice S. Skolnick, Luke R. Wilson

Abstract: We demonstrate electrically tunable, spin-dependent, directional coupling of single photons by embedding quantum dots (QDs) in a waveguide-coupled nanocavity. The directional behavior arises from direction-dependent interference between two cavity modes when coupled to the device waveguides. The small mode volume cavity enables simultaneous Purcell enhancement (${10.8\pm0.7}$) and peak directional… ▽ More We demonstrate electrically tunable, spin-dependent, directional coupling of single photons by embedding quantum dots (QDs) in a waveguide-coupled nanocavity. The directional behavior arises from direction-dependent interference between two cavity modes when coupled to the device waveguides. The small mode volume cavity enables simultaneous Purcell enhancement (${10.8\pm0.7}$) and peak directional contrast (${88\pm1\%}$), exceeding current state-of-the-art waveguide-only systems. We also present a scattering matrix model for the transmission through this structure, alongside a quantum trajectory-based model for predicting the system's directionality, which we use to explain the observed asymmetry in directional contrast seen in QD devices. Furthermore, the nanocavity enables wide-range electrical tuning of the emitter's directional contrast. We present results showing precise tuning of a QD emission line from a directional contrast of ${2\%}$ to ${96\%}$. In combination, these characteristics make this cavity-waveguide approach promising for use as a building block in directional nanophotonic circuits. △ Less

Submitted 17 January, 2025; originally announced January 2025.

arXiv:2501.04406 [pdf, other]

Classically Bound and Quantum Quasi-Bound States of an Electron on a Plane Adjacent to a Magnetic Monopole

Authors: J. Martin, A. Baskerville, V. L. Campo, J. Minns, J. Pooley, S. T. Carr, C. A. Hooley, G. Möller, J. Quintanilla

Abstract: In three-dimensional space an electron moving in the field of a magnetic monopole has no bound states. In this paper we explore the physics when the electron is restricted to a two-dimensional plane adjacent to a magnetic monopole. We find bound states in the classical version of the problem and quasi-bound states in the quantum one, in addition to a continuum of scattering states. We calculate th… ▽ More In three-dimensional space an electron moving in the field of a magnetic monopole has no bound states. In this paper we explore the physics when the electron is restricted to a two-dimensional plane adjacent to a magnetic monopole. We find bound states in the classical version of the problem and quasi-bound states in the quantum one, in addition to a continuum of scattering states. We calculate the lifetimes of the quasi-bound states using several complementary approximate methods, which agree well in the cases where the lifetimes are relatively short. The threshold monopole magnetic charge required to realise a single quasi-bound state is approximately $18Q_D$, where $Q_D$ is the magnetic charge of a Dirac monopole. We examine the feasibility of achieving this magnetic charge in currently available monopole analogues: spin ice, artificial spin ice, and magnetic needles. △ Less

Submitted 8 January, 2025; originally announced January 2025.

Comments: 26 pages

arXiv:2501.04365 [pdf, ps, other]

Characterization of subfields of adelic algebras by a product formula

Authors: Luis Manuel Navas Vicente, Francisco J. Plaza Martin

Abstract: We consider projective, irreducible, non-singular curves over an algebraically closed field $\k$. A cover $Y \to X$ of such curves corresponds to an extension $Ω/Σ$ of their function fields and yields an isomorphism $\A_{Y} \simeq \A_{X} \otimes_Σ Ω$ of their geometric adele rings. The primitive element theorem shows that $\A_{Y}$ is a quotient of $\A_{X}[T]$ by a polynomial. In general, we may… ▽ More We consider projective, irreducible, non-singular curves over an algebraically closed field $\k$. A cover $Y \to X$ of such curves corresponds to an extension $Ω/Σ$ of their function fields and yields an isomorphism $\A_{Y} \simeq \A_{X} \otimes_Σ Ω$ of their geometric adele rings. The primitive element theorem shows that $\A_{Y}$ is a quotient of $\A_{X}[T]$ by a polynomial. In general, we may look at quotient algebras $\AXp{\p} = \A_{X}[T]/(\p(T))$ where $\p(T) \in \A_{X}[T]$ is monic and separable over $\A_{X}$, and try to characterize the field extensions $Ω/Σ$ lying in $\AXp{\p}$ which arise from covers as above. We achieve this topologically, namely, as those $Ω$ which embed discretely in $\AXp{\p}$, and in terms of an additive analog of the product formula for global fields, a result which is reminiscent of classical work of Artin-Whaples and Iwasawa. The technical machinery requires studying which topology on $\AXp{\p}$ is natural for this problem. Local compactness no longer holds, but instead we have linear topologies defined by commensurability of $\k$-subspaces which coincide with the restricted direct product topology with respect to integral closures. The content function is given as an index measuring the discrepancy in commensurable subspaces. △ Less

Submitted 8 January, 2025; originally announced January 2025.

MSC Class: 14H05 (Primary); 12J20; 13B02; 13A18; 13J99 (Secondary)

arXiv:2501.04355 [pdf, ps, other]

Cyclic covers of an algebraic curve from an adelic viewpoint

Authors: Luis Manuel Navas Vicente, Francisco J. Plaza Martin

Abstract: We propose an algebraic method for the classification of branched Galois covers of a curve $X$ focused on studying Galois ring extensions of its geometric adele ring $\A_{X}$. As an application, we deal with cyclic covers; namely, we determine when a given cyclic ring extension of $\A_{X}$ comes from a corresponding cover of curves $Y \to X$, which is reminiscent of a Grunwald-Wang problem, and al… ▽ More We propose an algebraic method for the classification of branched Galois covers of a curve $X$ focused on studying Galois ring extensions of its geometric adele ring $\A_{X}$. As an application, we deal with cyclic covers; namely, we determine when a given cyclic ring extension of $\A_{X}$ comes from a corresponding cover of curves $Y \to X$, which is reminiscent of a Grunwald-Wang problem, and also determine when two covers yield isomorphic ring extensions, which is known in the literature as an equivalence problem. This completely algebraic method permits us to recover ramification, certain analytic data such as rotation numbers, and enumeration formulas for covers. △ Less

Submitted 8 January, 2025; originally announced January 2025.

MSC Class: 14H30 (Primary) 13B05 14H05; 11R56 (Secondary)

arXiv:2412.13395 [pdf, other]

Enhancing Talk Moves Analysis in Mathematics Tutoring through Classroom Teaching Discourse

Authors: Jie Cao, Abhijit Suresh, Jennifer Jacobs, Charis Clevenger, Amanda Howard, Chelsea Brown, Brent Milne, Tom Fischaber, Tamara Sumner, James H. Martin

Abstract: Human tutoring interventions play a crucial role in supporting student learning, improving academic performance, and promoting personal growth. This paper focuses on analyzing mathematics tutoring discourse using talk moves - a framework of dialogue acts grounded in Accountable Talk theory. However, scaling the collection, annotation, and analysis of extensive tutoring dialogues to develop machine… ▽ More Human tutoring interventions play a crucial role in supporting student learning, improving academic performance, and promoting personal growth. This paper focuses on analyzing mathematics tutoring discourse using talk moves - a framework of dialogue acts grounded in Accountable Talk theory. However, scaling the collection, annotation, and analysis of extensive tutoring dialogues to develop machine learning models is a challenging and resource-intensive task. To address this, we present SAGA22, a compact dataset, and explore various modeling strategies, including dialogue context, speaker information, pretraining datasets, and further fine-tuning. By leveraging existing datasets and models designed for classroom teaching, our results demonstrate that supplementary pretraining on classroom data enhances model performance in tutoring settings, particularly when incorporating longer context and speaker information. Additionally, we conduct extensive ablation studies to underscore the challenges in talk move modeling. △ Less

Submitted 17 December, 2024; originally announced December 2024.

Comments: Accepted to COLING'2025

arXiv:2412.12923 [pdf, other]

Generation of cosmic ray trajectories by a Diffusion Model trained on test particles in 3D magnetohydrodynamic turbulence

Authors: Johannes Martin, Jeremiah Lübke, Tianyi Li, Michele Buzzicotti, Rainer Grauer, Luca Biferale

Abstract: Models for the transport of high energy charged particles through strong magnetic turbulence play a key role in space and astrophysical studies, such as describing the propagation of solar energetic particles and high energy cosmic rays. Inspired by the recent advances in high-performance machine learning techniques, we investigate the application of generative diffusion models to synthesizing tes… ▽ More Models for the transport of high energy charged particles through strong magnetic turbulence play a key role in space and astrophysical studies, such as describing the propagation of solar energetic particles and high energy cosmic rays. Inspired by the recent advances in high-performance machine learning techniques, we investigate the application of generative diffusion models to synthesizing test particle trajectories obtained from a turbulent magnetohydrodynamics simulation. We consider velocity increment, spatial transport and curvature statistics, and find excellent agreement with the baseline trajectories for fixed particle energies. Additionally, we consider two synthetic turbulence models for comparison. Finally, challenges towards an application-ready transport model based on our approach are discussed. △ Less

Submitted 10 February, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

Comments: 19 pages, 12 figures, accepted for publication in The Astrophysical Journal Supplement Series

arXiv:2412.12355 [pdf]

doi 10.1109/MCSE.2025.3549359

Integrating Energy-Efficient Computing Research to Accelerate Energy Technology

Authors: Michael James Martin, Aaron Andersen, Charles Tripp, David Sickinger, Kristin Munch

Abstract: NREL's computational sciences center hosts the largest high-performance computing (HPC) capabilities dedicated to energy research while functioning as a living laboratory for energy-efficient computing. NREL's HPC capabilities support the research needs of the Department of Energy's Office of Energy Efficiency and Renewable Energy (EERE). In ten years of operation, HPC use in EERE-sponsored resear… ▽ More NREL's computational sciences center hosts the largest high-performance computing (HPC) capabilities dedicated to energy research while functioning as a living laboratory for energy-efficient computing. NREL's HPC capabilities support the research needs of the Department of Energy's Office of Energy Efficiency and Renewable Energy (EERE). In ten years of operation, HPC use in EERE-sponsored research has grown by a factor of 30, including work in electricity generation, energy efficiency, transportation, and energy system modeling. This paper analyzes this research portfolio, providing examples of individual use cases. The paper documents NREL's history of operating one of the world's most energy-efficient data centers while examining pathways to reduce economic and environmental impact beyond reduction of Power Usage Efficiency (PUE). This paper concludes by examining the unique opportunities created for accelerating improvements in data center efficiency created by combining an HPC system dedicated to energy research and a research program in energy-efficient computing. △ Less

Submitted 28 March, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

Comments: Invited submission to IEEE Computing in Science and Engineering

MSC Class: 00-02 ACM Class: K.4; J.2

arXiv:2412.10582 [pdf, other]

WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models

Authors: Runsheng "Anson" Huang, Lara J. Martin, Chris Callison-Burch

Abstract: WHAT-IF -- Writing a Hero's Alternate Timeline through Interactive Fiction -- is a system that uses zero-shot meta-prompting to create branching narratives from a prewritten story. Played as an interactive fiction (IF) game, WHAT-IF lets the player choose between decisions that the large language model (LLM) GPT-4 generates as possible branches in the story. Starting with an existing linear plot a… ▽ More WHAT-IF -- Writing a Hero's Alternate Timeline through Interactive Fiction -- is a system that uses zero-shot meta-prompting to create branching narratives from a prewritten story. Played as an interactive fiction (IF) game, WHAT-IF lets the player choose between decisions that the large language model (LLM) GPT-4 generates as possible branches in the story. Starting with an existing linear plot as input, a branch is created at each key decision taken by the main character. By meta-prompting the LLM to consider the major plot points from the story, the system produces coherent and well-structured alternate storylines. WHAT-IF stores the branching plot tree in a graph which helps it to both keep track of the story for prompting and maintain the structure for the final IF system. A video demo of our system can be found here: https://youtu.be/8vBqjqtupcc. △ Less

Submitted 17 December, 2024; v1 submitted 13 December, 2024; originally announced December 2024.

arXiv:2412.06508 [pdf, other]

Step Function in Momentum Space by a Metagrating

Authors: Mahmoud A. A. Abouelatta, Sergejs Boroviks, Olivier J. F. Martin, Karim Achouri

Abstract: Metasurface research has shown significant potential for controlling the polarization, amplitude, phase and propagation direction of light. Nevertheless, control over the angular response of incident light still remains a long-standing problem. In this work, we show the potential of diffractive systems for obtaining a step function in momentum space where the mirror symmetry of the angular transmi… ▽ More Metasurface research has shown significant potential for controlling the polarization, amplitude, phase and propagation direction of light. Nevertheless, control over the angular response of incident light still remains a long-standing problem. In this work, we show the potential of diffractive systems for obtaining a step function in momentum space where the mirror symmetry of the angular transmittance is broken. By engineering the scattering response of an asymmetric particles in a metagrating, we could obtain such a step function in a passive, reciprocal, and lossless fashion. More specifically, the metagrating performs filtering in the momentum space with an abrupt switching from reflection to transmission for an incident electromagnetic wave with an arbitrary spatial profile. This metagrating may find diverse applications in the context of optical spatial analog computing. Moreover, it paves the way for exploring the capabilities of diffractive systems for gaining full control over the angular response of light using arbitrary momentum transfer functions. △ Less

Submitted 9 December, 2024; originally announced December 2024.

arXiv:2412.04820 [pdf, other]

Assessing Similarity Measures for the Evaluation of Human-Robot Motion Correspondence

Authors: Charles Dietzel, Patrick J. Martin

Abstract: One key area of research in Human-Robot Interaction is solving the human-robot correspondence problem, which asks how a robot can learn to reproduce a human motion demonstration when the human and robot have different dynamics and kinematic structures. Evaluating these correspondence problem solutions often requires the use of qualitative surveys that can be time consuming to design and administer… ▽ More One key area of research in Human-Robot Interaction is solving the human-robot correspondence problem, which asks how a robot can learn to reproduce a human motion demonstration when the human and robot have different dynamics and kinematic structures. Evaluating these correspondence problem solutions often requires the use of qualitative surveys that can be time consuming to design and administer. Additionally, qualitative survey results vary depending on the population of survey participants. In this paper, we propose the use of heterogeneous time-series similarity measures as a quantitative evaluation metric for evaluating motion correspondence to complement these qualitative surveys. To assess the suitability of these measures, we develop a behavioral cloning-based motion correspondence model, and evaluate it with a qualitative survey as well as quantitative measures. By comparing the resulting similarity scores with the human survey results, we identify Gromov Dynamic Time Warping as a promising quantitative measure for evaluating motion correspondence. △ Less

Submitted 6 December, 2024; originally announced December 2024.

Comments: 8 pages, 4 figures

arXiv:2411.19633 [pdf, other]

doi 10.1016/j.spasta.2025.100898

Isotropy testing in spatial point patterns: nonparametric versus parametric replication under misspecification

Authors: Jakub J. Pypkowski, Adam M. Sykulski, James S. Martin

Abstract: Several hypothesis testing methods have been proposed to validate the assumption of isotropy in spatial point patterns. A majority of these methods are characterised by an unknown distribution of the test statistic under the null hypothesis of isotropy. Parametric approaches to approximating the distribution involve simulation of patterns from a user-specified isotropic model. Alternatively, nonpa… ▽ More Several hypothesis testing methods have been proposed to validate the assumption of isotropy in spatial point patterns. A majority of these methods are characterised by an unknown distribution of the test statistic under the null hypothesis of isotropy. Parametric approaches to approximating the distribution involve simulation of patterns from a user-specified isotropic model. Alternatively, nonparametric replicates of the test statistic under isotropy can be used to waive the need for specifying a model. In this paper, we first present a general framework which allows for the integration of a selected nonparametric replication method into isotropy testing. We then conduct a large simulation study comprising application-like scenarios to assess the performance of tests with different parametric and nonparametric replication methods. In particular, we explore distortions in test size and power caused by model misspecification, and demonstrate the advantages of nonparametric replication in such scenarios. △ Less

Submitted 8 April, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

Comments: 24 pages, 13 figures, 3 tables

arXiv:2411.16622 [pdf, other]

Imperceptible Adversarial Examples in the Physical World

Authors: Weilin Xu, Sebastian Szyller, Cory Cornelius, Luis Murillo Rojas, Marius Arvinte, Alvaro Velasquez, Jason Martin, Nageen Himayat

Abstract: Adversarial examples in the digital domain against deep learning-based computer vision models allow for perturbations that are imperceptible to human eyes. However, producing similar adversarial examples in the physical world has been difficult due to the non-differentiable image distortion functions in visual sensing systems. The existing algorithms for generating physically realizable adversaria… ▽ More Adversarial examples in the digital domain against deep learning-based computer vision models allow for perturbations that are imperceptible to human eyes. However, producing similar adversarial examples in the physical world has been difficult due to the non-differentiable image distortion functions in visual sensing systems. The existing algorithms for generating physically realizable adversarial examples often loosen their definition of adversarial examples by allowing unbounded perturbations, resulting in obvious or even strange visual patterns. In this work, we make adversarial examples imperceptible in the physical world using a straight-through estimator (STE, a.k.a. BPDA). We employ STE to overcome the non-differentiability -- applying exact, non-differentiable distortions in the forward pass of the backpropagation step, and using the identity function in the backward pass. Our differentiable rendering extension to STE also enables imperceptible adversarial patches in the physical world. Using printout photos, and experiments in the CARLA simulator, we show that STE enables fast generation of $\ell_\infty$ bounded adversarial examples despite the non-differentiable distortions. To the best of our knowledge, this is the first work demonstrating imperceptible adversarial examples bounded by small $\ell_\infty$ norms in the physical world that force zero classification accuracy in the global perturbation threat model and cause near-zero ($4.22\%$) AP50 in object detection in the patch perturbation threat model. We urge the community to re-evaluate the threat of adversarial examples in the physical world. △ Less

Submitted 25 November, 2024; originally announced November 2024.

arXiv:2411.16461 [pdf, other]

doi 10.1103/PhysRevA.111.042418

Nonequivalence between absolute separability and positive partial transposition in the symmetric subspace

Authors: Jonathan Louvet, Eduardo Serrano-Ensástiga, Thierry Bastin, John Martin

Abstract: The equivalence between absolutely separable states and absolutely positive partial transposed (PPT) states in general remains an open problem in quantum entanglement theory. In this work, we study an analogous question for symmetric multiqubit states. We show that symmetric absolutely PPT (SAPPT) states (symmetric states that remain PPT after any symmetry-preserving unitary evolution) are not alw… ▽ More The equivalence between absolutely separable states and absolutely positive partial transposed (PPT) states in general remains an open problem in quantum entanglement theory. In this work, we study an analogous question for symmetric multiqubit states. We show that symmetric absolutely PPT (SAPPT) states (symmetric states that remain PPT after any symmetry-preserving unitary evolution) are not always symmetric absolutely separable by providing explicit counterexamples. More precisely, we construct a family of entangled five-qubit SAPPT states. Similar counterexamples for larger odd numbers of qubits are identified. △ Less

Submitted 15 April, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

Comments: 9 pages, 2 figure

Journal ref: Phys. Rev. A 111, 042418 (2025)

Showing 1–50 of 1,416 results for author: Martín, J