Search | arXiv e-print repository

Constructive interference at the edge of quantum ergodic dynamics

Authors: Dmitry A. Abanin, Rajeev Acharya, Laleh Aghababaie-Beni, Georg Aigeldinger, Ashok Ajoy, Ross Alcaraz, Igor Aleiner, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Dave Bacon, Brian Ballard, Joseph C. Bardin, Christian Bengs, Andreas Bengtsson, Alexander Bilmes, Sergio Boixo, Gina Bortoli, Alexandre Bourassa, Jenna Bovaird , et al. (240 additional authors not shown)

Abstract: Quantum observables in the form of few-point correlators are the key to characterizing the dynamics of quantum many-body systems. In dynamics with fast entanglement generation, quantum observables generally become insensitive to the details of the underlying dynamics at long times due to the effects of scrambling. In experimental systems, repeated time-reversal protocols have been successfully imp… ▽ More Quantum observables in the form of few-point correlators are the key to characterizing the dynamics of quantum many-body systems. In dynamics with fast entanglement generation, quantum observables generally become insensitive to the details of the underlying dynamics at long times due to the effects of scrambling. In experimental systems, repeated time-reversal protocols have been successfully implemented to restore sensitivities of quantum observables. Using a 103-qubit superconducting quantum processor, we characterize ergodic dynamics using the second-order out-of-time-order correlators, OTOC$^{(2)}$. In contrast to dynamics without time reversal, OTOC$^{(2)}$ are observed to remain sensitive to the underlying dynamics at long time scales. Furthermore, by inserting Pauli operators during quantum evolution and randomizing the phases of Pauli strings in the Heisenberg picture, we observe substantial changes in OTOC$^{(2)}$ values. This indicates that OTOC$^{(2)}$ is dominated by constructive interference between Pauli strings that form large loops in configuration space. The observed interference mechanism endows OTOC$^{(2)}$ with a high degree of classical simulation complexity, which culminates in a set of large-scale OTOC$^{(2)}$ measurements exceeding the simulation capacity of known classical algorithms. Further supported by an example of Hamiltonian learning through OTOC$^{(2)}$, our results indicate a viable path to practical quantum advantage. △ Less

Submitted 11 June, 2025; originally announced June 2025.

Comments: See following link: https://zenodo.org/records/15640503, which includes: Circuits used in Fig. 3d, Fig. 3e, Fig. 4a, Fig. 4b of the main text. In addition, OTOC (C^(2)) circuits and data with 95, 40 and 31 qubits are also provided. For system sizes <= 40 qubits, we include exact simulation results. For system sizes > 40, we include experimental data

arXiv:2506.06579 [pdf, ps, other]

Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques

Authors: Adarsh Prasad Behera, Jaya Prakash Champati, Roberto Morabito, Sasu Tarkoma, James Gross

Abstract: Recent progress in Language Models (LMs) has dramatically advanced the field of natural language processing (NLP), excelling at tasks like text generation, summarization, and question answering. However, their inference remains computationally expensive and energy intensive, especially in settings with limited hardware, power, or bandwidth. This makes it difficult to deploy LMs in mobile, edge, or… ▽ More Recent progress in Language Models (LMs) has dramatically advanced the field of natural language processing (NLP), excelling at tasks like text generation, summarization, and question answering. However, their inference remains computationally expensive and energy intensive, especially in settings with limited hardware, power, or bandwidth. This makes it difficult to deploy LMs in mobile, edge, or cost sensitive environments. To address these challenges, recent approaches have introduced multi LLM intelligent model selection strategies that dynamically allocate computational resources based on query complexity -- using lightweight models for simpler queries and escalating to larger models only when necessary. This survey explores two complementary strategies for efficient LLM inference: (i) routing, which selects the most suitable model based on the query, and (ii) cascading or hierarchical inference (HI), which escalates queries through a sequence of models until a confident response is found. Both approaches aim to reduce computation by using lightweight models for simpler tasks while offloading only when needed. We provide a comparative analysis of these techniques across key performance metrics, discuss benchmarking efforts, and outline open challenges. Finally, we outline future research directions to enable faster response times, adaptive model selection based on task complexity, and scalable deployment across heterogeneous environments, making LLM based systems more efficient and accessible for real world applications. △ Less

Submitted 6 June, 2025; originally announced June 2025.

arXiv:2505.04972 [pdf, ps, other]

AI and Vision based Autonomous Navigation of Nano-Drones in Partially-Known Environments

Authors: Mattia Sartori, Chetna Singhal, Neelabhro Roy, Davide Brunelli, James Gross

Abstract: The miniaturisation of sensors and processors, the advancements in connected edge intelligence, and the exponential interest in Artificial Intelligence are boosting the affirmation of autonomous nano-size drones in the Internet of Robotic Things ecosystem. However, achieving safe autonomous navigation and high-level tasks such as exploration and surveillance with these tiny platforms is extremely… ▽ More The miniaturisation of sensors and processors, the advancements in connected edge intelligence, and the exponential interest in Artificial Intelligence are boosting the affirmation of autonomous nano-size drones in the Internet of Robotic Things ecosystem. However, achieving safe autonomous navigation and high-level tasks such as exploration and surveillance with these tiny platforms is extremely challenging due to their limited resources. This work focuses on enabling the safe and autonomous flight of a pocket-size, 30-gram platform called Crazyflie 2.1 in a partially known environment. We propose a novel AI-aided, vision-based reactive planning method for obstacle avoidance under the ambit of Integrated Sensing, Computing and Communication paradigm. We deal with the constraints of the nano-drone by splitting the navigation task into two parts: a deep learning-based object detector runs on the edge (external hardware) while the planning algorithm is executed onboard. The results show the ability to command the drone at $\sim8$ frames-per-second and a model performance reaching a COCO mean-average-precision of $60.8$. Field experiments demonstrate the feasibility of the solution with the drone flying at a top speed of $1$ m/s while steering away from an obstacle placed in an unknown position and reaching the target destination. The outcome highlights the compatibility of the communication delay and the model performance with the requirements of the real-time navigation task. We provide a feasible alternative to a fully onboard implementation that can be extended to autonomous exploration with nano-drones. △ Less

Submitted 8 May, 2025; originally announced May 2025.

Comments: in DCOSS-IoT 2025, Wi-DroIT 2025

arXiv:2504.07843 [pdf, other]

Experimental Analysis of Quadcopter Drone Hover Constraints for Localization Improvements

Authors: Uthman Olawoye, David Akhihiero, Jason N. Gross

Abstract: In this work, we evaluate the use of aerial drone hover constraints in a multisensor fusion of ground robot and drone data to improve the localization performance of a drone. In particular, we build upon our prior work on cooperative localization between an aerial drone and ground robot that fuses data from LiDAR, inertial navigation, peer-to-peer ranging, altimeter, and stereo-vision and evaluate… ▽ More In this work, we evaluate the use of aerial drone hover constraints in a multisensor fusion of ground robot and drone data to improve the localization performance of a drone. In particular, we build upon our prior work on cooperative localization between an aerial drone and ground robot that fuses data from LiDAR, inertial navigation, peer-to-peer ranging, altimeter, and stereo-vision and evaluate the incorporation knowledge from the autopilot regarding when the drone is hovering. This control command data is leveraged to add constraints on the velocity state. Hover constraints can be considered important dynamic model information, such as the exploitation of zero-velocity updates in pedestrian navigation. We analyze the benefits of these constraints using an incremental factor graph optimization. Experimental data collected in a motion capture faculty is used to provide performance insights and assess the benefits of hover constraints. △ Less

Submitted 10 April, 2025; originally announced April 2025.

arXiv:2504.07242 [pdf, other]

Analysis of the Unscented Transform for Cooperative Localization with Ranging-Only Information

Authors: Uthman Olawoye, Cagri Kilic, Jason N Gross

Abstract: Cooperative localization in multi-agent robotic systems is challenging, especially when agents rely on limited information, such as only peer-to-peer range measurements. Two key challenges arise: utilizing this limited information to improve position estimation; handling uncertainties from sensor noise, nonlinearity, and unknown correlations between agents measurements; and avoiding information re… ▽ More Cooperative localization in multi-agent robotic systems is challenging, especially when agents rely on limited information, such as only peer-to-peer range measurements. Two key challenges arise: utilizing this limited information to improve position estimation; handling uncertainties from sensor noise, nonlinearity, and unknown correlations between agents measurements; and avoiding information reuse. This paper examines the use of the Unscented Transform (UT) for state estimation for a case in which range measurement between agents and covariance intersection (CI) is used to handle unknown correlations. Unlike Kalman Filter approaches, CI methods fuse complete state and covariance estimates. This makes formulating a CI approach with ranging-only measurements a challenge. To overcome this, UT is used to handle uncertainties and formulate a cooperative state update using range measurements and current cooperative state estimates. This introduces information reuse in the measurement update. Therefore, this work aims to evaluate the limitations and utility of this formulation when faced with various levels of state measurement uncertainty and errors. △ Less

Submitted 5 May, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

Comments: 8 pages, 8 figures. The paper will be presented at the 2025 IEEE/ION Position, Location and Navigation Symposium (PLANS)

arXiv:2504.07231 [pdf, other]

A Pointcloud Registration Framework for Relocalization in Subterranean Environments

Authors: David Akhihiero, Jason N. Gross

Abstract: Relocalization, the process of re-establishing a robot's position within an environment, is crucial for ensuring accurate navigation and task execution when external positioning information, such as GPS, is unavailable or has been lost. Subterranean environments present significant challenges for relocalization due to limited external positioning information, poor lighting that affects camera loca… ▽ More Relocalization, the process of re-establishing a robot's position within an environment, is crucial for ensuring accurate navigation and task execution when external positioning information, such as GPS, is unavailable or has been lost. Subterranean environments present significant challenges for relocalization due to limited external positioning information, poor lighting that affects camera localization, irregular and often non-distinct surfaces, and dust, which can introduce noise and occlusion in sensor data. In this work, we propose a robust, computationally friendly framework for relocalization through point cloud registration utilizing a prior point cloud map. The framework employs Intrinsic Shape Signatures (ISS) to select feature points in both the target and prior point clouds. The Fast Point Feature Histogram (FPFH) algorithm is utilized to create descriptors for these feature points, and matching these descriptors yields correspondences between the point clouds. A 3D transformation is estimated using the matched points, which initializes a Normal Distribution Transform (NDT) registration. The transformation result from NDT is further refined using the Iterative Closest Point (ICP) registration algorithm. This framework enhances registration accuracy even in challenging conditions, such as dust interference and significant initial transformations between the target and source, making it suitable for autonomous robots operating in underground mines and tunnels. This framework was validated with experiments in simulated and real-world mine datasets, demonstrating its potential for improving relocalization. △ Less

Submitted 9 April, 2025; originally announced April 2025.

arXiv:2504.07028 [pdf, other]

doi 10.1109/PLANS53410.2023.10139979

UAV Position Estimation using a LiDAR-based 3D Object Detection Method

Authors: Uthman Olawoye, Jason N. Gross

Abstract: This paper explores the use of applying a deep learning approach for 3D object detection to compute the relative position of an Unmanned Aerial Vehicle (UAV) from an Unmanned Ground Vehicle (UGV) equipped with a LiDAR sensor in a GPS-denied environment. This was achieved by evaluating the LiDAR sensor's data through a 3D detection algorithm (PointPillars). The PointPillars algorithm incorporates a… ▽ More This paper explores the use of applying a deep learning approach for 3D object detection to compute the relative position of an Unmanned Aerial Vehicle (UAV) from an Unmanned Ground Vehicle (UGV) equipped with a LiDAR sensor in a GPS-denied environment. This was achieved by evaluating the LiDAR sensor's data through a 3D detection algorithm (PointPillars). The PointPillars algorithm incorporates a column voxel point-cloud representation and a 2D Convolutional Neural Network (CNN) to generate distinctive point-cloud features representing the object to be identified, in this case, the UAV. The current localization method utilizes point-cloud segmentation, Euclidean clustering, and predefined heuristics to obtain the relative position of the UAV. Results from the two methods were then compared to a reference truth solution. △ Less

Submitted 9 April, 2025; originally announced April 2025.

Journal ref: IEEE/ION Position, Location and Navigation Symposium (PLANS) (2023)

arXiv:2504.03032 [pdf, other]

Modelling Interfacial Dynamics Using Hydrodynamic Density Functional Theory: Dynamic Contact Angles and the Role of Local Viscosity

Authors: Benjamin Bursik, Rolf Stierle, Hamza Oukili, Martin Schneider, Gernot Bauer, Joachim Gross

Abstract: Hydrodynamic density functional theory (DFT) is applied to analyse dynamic contact angles of droplets in order to assess its predictive capability regarding wetting phenomena at the microscopic scale and to evaluate its feasibility for multiscale modelling. Hydrodynamic DFT incorporates the influence of fluid-fluid and solid-fluid interfaces into a hydrodynamic theory by including a thermodynamic… ▽ More Hydrodynamic density functional theory (DFT) is applied to analyse dynamic contact angles of droplets in order to assess its predictive capability regarding wetting phenomena at the microscopic scale and to evaluate its feasibility for multiscale modelling. Hydrodynamic DFT incorporates the influence of fluid-fluid and solid-fluid interfaces into a hydrodynamic theory by including a thermodynamic model based on classical DFT for the chemical potential of inhomogeneous fluids. It simplifies to the isothermal Navier-Stokes equations far away from interfaces, thus connecting microscopic molecular modelling and continuum fluid dynamics. In this work we use a Helmholtz energy functional based on the perturbed-chain statistical associating fluid theory (PC-SAFT) and the viscosity is obtained from generalised entropy scaling, a one-parameter model which takes microscopic information of the fluid and solid phase into account. Deterministic (noise-free) density and velocity profiles reveal wetting phenomena including different advancing and receding contact angles, the transition from equilibrium to steady state and the rolling motion of droplets. Compared to a viscosity model based on bulk values, generalised entropy scaling provides more accurate results, which stresses the importance of including microscopic information in the local viscosity model. Hydrodynamic DFT is transferable as it captures the influence of different external forces, wetting strengths and (molecular) solid roughness. For all results good quantitative agreement with non-equilibrium molecular dynamics simulations is found, which emphasises that hydrodynamic DFT is able to predict wetting phenomena at the microscopic scale. △ Less

Submitted 3 April, 2025; originally announced April 2025.

Comments: 31 pages, 27 figures

arXiv:2503.20631 [pdf, other]

Robust Flower Cluster Matching Using The Unscented Transform

Authors: Andy Chu, Rashik Shrestha, Yu Gu, Jason N. Gross

Abstract: Monitoring flowers over time is essential for precision robotic pollination in agriculture. To accomplish this, a continuous spatial-temporal observation of plant growth can be done using stationary RGB-D cameras. However, image registration becomes a serious challenge due to changes in the visual appearance of the plant caused by the pollination process and occlusions from growth and camera angle… ▽ More Monitoring flowers over time is essential for precision robotic pollination in agriculture. To accomplish this, a continuous spatial-temporal observation of plant growth can be done using stationary RGB-D cameras. However, image registration becomes a serious challenge due to changes in the visual appearance of the plant caused by the pollination process and occlusions from growth and camera angles. Plants flower in a manner that produces distinct clusters on branches. This paper presents a method for matching flower clusters using descriptors generated from RGB-D data and considers allowing for spatial uncertainty within the cluster. The proposed approach leverages the Unscented Transform to efficiently estimate plant descriptor uncertainty tolerances, enabling a robust image-registration process despite temporal changes. The Unscented Transform is used to handle the nonlinear transformations by propagating the uncertainty of flower positions to determine the variations in the descriptor domain. A Monte Carlo simulation is used to validate the Unscented Transform results, confirming our method's effectiveness for flower cluster matching. Therefore, it can facilitate improved robotics pollination in dynamic environments. △ Less

Submitted 26 March, 2025; originally announced March 2025.

Comments: *CASE2025 Under Review*

arXiv:2503.15297 [pdf, other]

Probabilistic Delay Forecasting in 5G Using Recurrent and Attention-Based Architectures

Authors: Samie Mostafavi, Gourav Prateek Sharma, Ahmad Traboulsi, James Gross

Abstract: With the emergence of new application areas such as cyber-physical systems and human-in-the-loop applications ensuring a specific level of end-to-end network latency with high reliability (e.g., 99.9%) is becoming increasingly critical. To align wireless links with these reliability requirements, it is essential to analyze and control network latency in terms of its full probability distribution.… ▽ More With the emergence of new application areas such as cyber-physical systems and human-in-the-loop applications ensuring a specific level of end-to-end network latency with high reliability (e.g., 99.9%) is becoming increasingly critical. To align wireless links with these reliability requirements, it is essential to analyze and control network latency in terms of its full probability distribution. However, in a wireless link, the distribution may vary over time, making this task particularly challenging. We propose predicting the latency distribution using state-of-the-art data-driven techniques that leverage historical network information. Our approach tokenizes network state information and processes it using temporal deep-learning architectures-namely LSTM and Transformer models-to capture both short- and long-term delay dependencies. These models output parameters for a chosen parametric density via a mixture density network with Gaussian mixtures, yielding multi-step probabilistic forecasts of future delays. To validate our proposed approach, we implemented and tested these methods using a time-synchronized, SDR-based OpenAirInterface 5G testbed to collect and preprocess network-delay data. Our experiments show that the Transformer model achieves lower negative log-likelihood and mean absolute error than both LSTM and feed-forward baselines in challenging scenarios, while also providing insights into model complexity and training/inference overhead. This framework enables more informed decision-making for adaptive scheduling and resource allocation, paving the way toward enhanced QoS in evolving 5G and 6G networks. △ Less

Submitted 19 March, 2025; originally announced March 2025.

arXiv:2503.15081 [pdf, other]

doi 10.1051/0004-6361/202451124

Orbits of very distant asteroid satellites

Authors: K. Minker, B. Carry, F. Vachier, P. Scheirich, P. Pravec, T. Müller, A. Moór, C. Arcidiacono, A. Conrad, C. Veillet, S. A. Jacobson, M. Marsset, W. J. Merline, P. Tamblyn, M. E. Brown, D. Pray, R. Montaigut, A. Leroy, C. Gillier, P. Kušnirák, K. Hornoch, M. Husárik, V. Benishek, W. Cooney, J. Gross , et al. (14 additional authors not shown)

Abstract: The very wide binary asteroid (VWBA) population is a small subset of the population of known binary and multiple asteroids made of systems with very widely orbiting satellites and long orbital periods, on the order of tens to hundreds of days. The origin of these systems is debatable, and most members of this population are poorly characterized. We have compiled all available high-angular resoluti… ▽ More The very wide binary asteroid (VWBA) population is a small subset of the population of known binary and multiple asteroids made of systems with very widely orbiting satellites and long orbital periods, on the order of tens to hundreds of days. The origin of these systems is debatable, and most members of this population are poorly characterized. We have compiled all available high-angular resolution imaging archival data of VWBA systems from large ground- and space-based telescopes. We measure the astrometric positions of the satellite relative to the primary and analyze the dynamics of the satellites using the Genoid genetic algorithm. Additionally, we use a NEATM thermal model to estimate the diameters of two systems, and we model the orbit of Litva's inner satellite using photometric lightcurve observations. We determine the effective diameters of binary systems Christophedumas and Alconrad to be 4.7 + 0.4km and 5.2 + 0.3km respectively. We determine new orbital solutions for five systems, Huenna, Litva, (3548) Eurybates, Pauling, and Alconrad. We find a significantly eccentric best-fit orbital solution for the outer satellite of Litva, a moderately eccentric solution for Alconrad, and a nearly circular solution for Pauling. We also confirm previously reported orbital solutions for (379) Huenna and Eurybates. It is unlikely that BYORP expansion could be solely responsible for the formation of VWBAs. It is possible that the satellites of these systems were formed through YORP spin-up and then later scattered onto very wide orbits. Additionally, we find that some members of the population are unlikely to have formed satellites through YORP spin-up, and a collisional formation history is favored. In particular, this applies to VWBAs within large dynamical families, or large VWBA systems such as Huenna and NASA's Lucy mission target Eurybates. △ Less

Submitted 19 March, 2025; originally announced March 2025.

Comments: 22 pages, 8 figures, 16 tables, accepted for publication at A&A

Journal ref: A&A 698, A136 (2025)

arXiv:2502.11595 [pdf, other]

End-to-End Reliability in Wireless IEEE 802.1Qbv Time-Sensitive Networks

Authors: S. Egger, J. Gross, J. Sachs, G. P. Sharma, C. Becker, F. Dürr

Abstract: Industrial cyber-physical systems require dependable network communication with formal end-to-end reliability guarantees. Striving towards this goal, recent efforts aim to advance the integration of 5G into Time-Sensitive Networking (TSN). However, we show that IEEE 802.1Qbv TSN schedulers that are unattuned to 5G packet delay variations may jeopardize any reliability guarantees provided by the 5G… ▽ More Industrial cyber-physical systems require dependable network communication with formal end-to-end reliability guarantees. Striving towards this goal, recent efforts aim to advance the integration of 5G into Time-Sensitive Networking (TSN). However, we show that IEEE 802.1Qbv TSN schedulers that are unattuned to 5G packet delay variations may jeopardize any reliability guarantees provided by the 5G system. We demonstrate this on a case where a 99.99% reliability in the inner 5G network diminishes to below 10% when looking at end-to-end communication in TSN. In this paper, we overcome this shortcoming by introducing Full Interleaving Packet Scheduling (FIPS) as a wireless-friendly IEEE 802.1Qbv scheduler. To the best of our knowledge, FIPS is the first to provide formal end-to-end QoS guarantees in wireless TSN. FIPS allows a controlled batching of TSN streams, which improves schedulability in terms of the number of wireless TSN streams by a factor of up to x45. Even in failure cases, FIPS isolates the otherwise cascading QoS violations to the affected streams and protects all other streams. With formal end-to-end reliability, improved schedulability, and fault isolation, FIPS makes a substantial advance towards dependability in wireless TSN. △ Less

Submitted 17 February, 2025; originally announced February 2025.

Comments: Preprint with extended appendix

arXiv:2502.08789 [pdf, ps, other]

Delay Analysis of 5G HARQ in the Presence of Decoding and Feedback Latencies

Authors: Vishnu N Moothedath, Sangwon Seo, Neda Petreska, Bernhard Kloiber, James Gross

Abstract: The growing demand for stringent quality of service (QoS) guarantees in 5G networks requires accurate characterisation of delay performance, often measured using Delay Violation Probability (DVP) for a given target delay. Widely used retransmission schemes like Automatic Repeat reQuest (ARQ) and Hybrid ARQ (HARQ) improve QoS through effective feedback, incremental redundancy (IR), and parallel ret… ▽ More The growing demand for stringent quality of service (QoS) guarantees in 5G networks requires accurate characterisation of delay performance, often measured using Delay Violation Probability (DVP) for a given target delay. Widely used retransmission schemes like Automatic Repeat reQuest (ARQ) and Hybrid ARQ (HARQ) improve QoS through effective feedback, incremental redundancy (IR), and parallel retransmission processes. However, existing works to quantify the DVP under these retransmission schemes overlook practical aspects such as decoding complexity, feedback delays, and the resulting need for multiple parallel ARQ/HARQ processes that enable packet transmissions without waiting for previous feedback, thus exploiting valuable transmission opportunities. This work proposes a comprehensive multi-server delay model for ARQ/HARQ that incorporates these aspects. Using a finite blocklength error model, we derive closed-form expressions and algorithms for accurate DVP evaluation under realistic 5G configurations aligned with 3GPP standards. Our numerical evaluations demonstrate notable improvements in DVP accuracy over the state-of-the-art, highlight the impact of parameter tuning and resource allocation, and reveal how DVP affects system throughput. △ Less

Submitted 12 February, 2025; originally announced February 2025.

arXiv:2502.05050 [pdf, other]

Calibration of a $Δ$E-E telescope based on CeBr$_3$ scintillator for secondary charged particles measurements in hadron therapy

Authors: L. Gesson, J. Gross, C. Mozzi, C. Reibel, Ch. Finck, S. Higueret, T. D. Le, E. Traykov, J. C. Thomas, N. Arbor, M. Pullia, G. Harmant, M. Vanstalle

Abstract: Hadrontherapy is a promising cancer treatment method that offers better dose conformity and reduces damage to healthy tissues compared to conventional radiotherapy. However, one major challenge remaining is the precise characterization of secondary particles generated by nuclear interactions of the primary beam with tissues. Current data on secondary charged particles, such as protons and light io… ▽ More Hadrontherapy is a promising cancer treatment method that offers better dose conformity and reduces damage to healthy tissues compared to conventional radiotherapy. However, one major challenge remaining is the precise characterization of secondary particles generated by nuclear interactions of the primary beam with tissues. Current data on secondary charged particles, such as protons and light ions, remain insufficient, particularly in the clinically relevant energy ranges. This lack of experimental data introduces uncertainties in treatment planning softwares and Monte Carlo calculations, thus compromising the accuracy of dose delivery to the patients. This work consists in the characterization of secondary charged particles generated in hadron therapy using a $Δ$E-E telescope comprising a CeBr$_3$ crystal scintillator and a plastic scintillator. The calibration and response of this telescope to ions commonly used in clinical settings is presented in this work, highlighting adherence to Birks law for accurate energy measurements. This study is the first to optimize a $Δ$E-E telescope combining CeBr$_3$ and plastic scintillators specifically for secondary particle detection in hadrontherapy. This represents an important step in the exploitation of the system for nuclear data acquisition, as it enables both the measurement of energy and the discrimination of secondary particles. The objective is to develop a system compatible with clinical use, allowing for the most precise possible comparison with treatment planning software calculations. △ Less

Submitted 7 February, 2025; originally announced February 2025.

arXiv:2501.14586 [pdf, other]

doi 10.1016/j.ymssp.2025.112482

A sub-structuring approach for model reduction of frictionally clamped thin-walled structures

Authors: Patrick Hippold, Johann Gross, Malte Krack

Abstract: Thin-walled structures clamped by friction joints, such as aircraft skin panels are exposed to bending-stretching coupling and frictional contact. We propose an original sub-structuring approach, where the system is divided into thin-walled and support regions, so that geometrically nonlinear behavior is relevant only in the former, and nonlinear contact behavior only in the latter. This permits t… ▽ More Thin-walled structures clamped by friction joints, such as aircraft skin panels are exposed to bending-stretching coupling and frictional contact. We propose an original sub-structuring approach, where the system is divided into thin-walled and support regions, so that geometrically nonlinear behavior is relevant only in the former, and nonlinear contact behavior only in the latter. This permits to derive reduced component models, in principle, with available techniques. The Hurty-/Craig-Bampton method, combined with an interface reduction relying on an orthogonal polynomial series, is used to construct the reduction basis for each component. To model geometrically nonlinear behavior, implicit condensation is used, where an original, engineering-oriented proposition is made for the delicate scaling of the static load cases required to estimate the coefficients of the nonlinear terms. The proposed method is validated and its computational performance is assessed for the example of a plate with frictional clamping, using finite element analysis as reference. The numerical results shed light into an interesting mutual interaction: The extent of geometric hardening is limited by the reduced boundary stiffness when more sliding occurs in the clamping. On the other hand, the frictional dissipation is increased by the tangential loading induced by membrane stretching. △ Less

Submitted 24 January, 2025; originally announced January 2025.

arXiv:2501.14249 [pdf, other]

Humanity's Last Exam

Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage. HLE consists of 2,500 questions across dozens of subjects, including mathematics, humanities, and the natural sciences. HLE is developed globally by subject-matter experts and consists of multiple-choice and short-answer questions suitable for automated grading. Each question has a known solution that is unambiguous and easily verifiable, but cannot be quickly answered via internet retrieval. State-of-the-art LLMs demonstrate low accuracy and calibration on HLE, highlighting a significant gap between current LLM capabilities and the expert human frontier on closed-ended academic questions. To inform research and policymaking upon a clear understanding of model capabilities, we publicly release HLE at https://lastexam.ai. △ Less

Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

Comments: 29 pages, 6 figures

arXiv:2501.12833 [pdf, other]

A coupled FE-BE multi-scale method for the dynamics of jointed structures

Authors: Hendrik D. Linder, Johann Gross, Malte Krack

Abstract: The damping of built-up structures stems largely from the microscopic dry frictional interactions in the contact interfaces. The accurate prediction of friction damping has been an important scientific aim of the past several decades. Recent research indicates that very good agreement with vibration measurements is to be expected if the actual contact surface topography is sufficiently well known… ▽ More The damping of built-up structures stems largely from the microscopic dry frictional interactions in the contact interfaces. The accurate prediction of friction damping has been an important scientific aim of the past several decades. Recent research indicates that very good agreement with vibration measurements is to be expected if the actual contact surface topography is sufficiently well known and finely resolved, and frictional-unilateral interactions are modeled in terms of the Coulomb-Signorini conditions. Resolving all relevant length scales in one finite element model leads to enormous or even prohibitive computation effort and regularization of the set-valued contact laws might be needed to ensure numerical stability. In this work, we propose a multi-scale approach: The stress and deformation field in the contact region is modeled using elastic half-space theory, implemented on a regular and fine grid of boundary elements (BE), so that the compliance matrix can be expressed in closed form. The vibration behavior of the remaining region is described using a relatively coarse finite element (FE) model, which is further reduced via component mode synthesis. The two models are coupled by enforcing compatibility and equilibrium conditions in the far field. The set-valued Coulomb-Signorini conditions are enforced robustly and efficiently using a projected over-relaxation scheme in conjunction with an appropriate active-set strategy. For the S4 beam benchmark, very good agreement with regard to the amplitude-dependent frequency and damping ratio of the first few modes is achieved, while the computation effort is reduced by several orders of magnitude compared to the full-FE reference. The proposed multi-scale method permits a very fine resolution of the contact surface topography without suffering from numerical instability. △ Less

Submitted 22 January, 2025; originally announced January 2025.

arXiv:2501.01398 [pdf, other]

A Proof of Concept Resource Management Scheme for Augmented Reality Applications in 5G Systems

Authors: Panagiotis Nikolaidis, Samie Mostafavi, James Gross, John Baras

Abstract: Augmented reality applications are bitrate intensive, delay-sensitive, and computationally demanding. To support them, mobile edge computing systems need to carefully manage both their networking and computing resources. To this end, we present a proof of concept resource management scheme that adapts the bandwidth at the base station and the GPU frequency at the edge to efficiently fulfill roundt… ▽ More Augmented reality applications are bitrate intensive, delay-sensitive, and computationally demanding. To support them, mobile edge computing systems need to carefully manage both their networking and computing resources. To this end, we present a proof of concept resource management scheme that adapts the bandwidth at the base station and the GPU frequency at the edge to efficiently fulfill roundtrip delay constrains. Resource adaptation is performed using a Multi-Armed Bandit algorithm that accounts for the monotonic relationship between allocated resources and performance. We evaluate our scheme by experimentation on an OpenAirInterface 5G testbed where the considered application is OpenRTiST. The results indicate that our resource management scheme can substantially reduce both bandwidth usage and power consumption while delivering high quality of service. Overall, this work demonstrates that intelligent resource control can potentially establish systems that are not only more efficient but also more sustainable. △ Less

Submitted 2 January, 2025; originally announced January 2025.

arXiv:2501.00980 [pdf]

Strain Mediated Voltage Control of Magnetic Anisotropy and Magnetization Reversal in Bismuth Substituted Yttrium Iron Garnet Films and Meso-structures

Authors: Walid Al Misba, Miela Josephine Gross, Kensuke Hayashi, Daniel B. Gopman, Caroline A. Ross, Jayasimha Atulasimha

Abstract: We report on magnetic anisotropy modulation in Bismuth substituted Yttrium Iron Garnet (Bi-YIG) thin films and mesoscale patterned structures deposited on a PMN-PT substrate with the application of voltage-induced strain. The Bi content is selected for low coercivity and higher magnetostriction than that of YIG, yielding significant changes in the hysteresis loops through the magnetoelastic effect… ▽ More We report on magnetic anisotropy modulation in Bismuth substituted Yttrium Iron Garnet (Bi-YIG) thin films and mesoscale patterned structures deposited on a PMN-PT substrate with the application of voltage-induced strain. The Bi content is selected for low coercivity and higher magnetostriction than that of YIG, yielding significant changes in the hysteresis loops through the magnetoelastic effect. The piezoelectric substrate is poled along its thickness, which is the [011] direction, by applying a voltage across the PMN-PT/SiO2/Bi-YIG/Pt heterostructure. In-situ magneto-optical Kerr effect microscopy (MOKE) shows the modulation of magnetic anisotropy with voltage-induced strain. Furthermore, voltage control of the magnetic domain state of the Bi-YIG film at a fixed magnetic field produces a 90° switching of the magnetization easy axis above a threshold voltage. The magnetoelectric coefficient of the heterostructure is 1.05x10^(-7)s/m which is competitive with that of other ferromagnetic oxide films on ferroelectric substrates such as La0.67Sr0.33MnO3/PMNPT and YIG/PMN-PZT. Voltage-control of magnetization reversal fields in 5-30 microns wide dots and racetracks of Bi-YIG show potential for energy efficient non-volatile memory and neuromorphic computing devices. △ Less

Submitted 1 January, 2025; originally announced January 2025.

arXiv:2412.14360 [pdf, other]

Demonstrating dynamic surface codes

Authors: Alec Eickbusch, Matt McEwen, Volodymyr Sivak, Alexandre Bourassa, Juan Atalaya, Jahan Claes, Dvir Kafri, Craig Gidney, Christopher W. Warren, Jonathan Gross, Alex Opremcak, Nicholas Zobrist Kevin C. Miao, Gabrielle Roberts, Kevin J. Satzinger, Andreas Bengtsson, Matthew Neeley, William P. Livingston, Alex Greene, Rajeev, Acharya, Laleh Aghababaie Beni, Georg Aigeldinger, Ross Alcaraz, Trond I. Andersen, Markus Ansmann , et al. (193 additional authors not shown)

Abstract: A remarkable characteristic of quantum computing is the potential for reliable computation despite faulty qubits. This can be achieved through quantum error correction, which is typically implemented by repeatedly applying static syndrome checks, permitting correction of logical information. Recently, the development of time-dynamic approaches to error correction has uncovered new codes and new co… ▽ More A remarkable characteristic of quantum computing is the potential for reliable computation despite faulty qubits. This can be achieved through quantum error correction, which is typically implemented by repeatedly applying static syndrome checks, permitting correction of logical information. Recently, the development of time-dynamic approaches to error correction has uncovered new codes and new code implementations. In this work, we experimentally demonstrate three time-dynamic implementations of the surface code, each offering a unique solution to hardware design challenges and introducing flexibility in surface code realization. First, we embed the surface code on a hexagonal lattice, reducing the necessary couplings per qubit from four to three. Second, we walk a surface code, swapping the role of data and measure qubits each round, achieving error correction with built-in removal of accumulated non-computational errors. Finally, we realize the surface code using iSWAP gates instead of the traditional CNOT, extending the set of viable gates for error correction without additional overhead. We measure the error suppression factor when scaling from distance-3 to distance-5 codes of $Λ_{35,\text{hex}} = 2.15(2)$, $Λ_{35,\text{walk}} = 1.69(6)$, and $Λ_{35,\text{iSWAP}} = 1.56(2)$, achieving state-of-the-art error suppression for each. With detailed error budgeting, we explore their performance trade-offs and implications for hardware design. This work demonstrates that dynamic circuit approaches satisfy the demands for fault-tolerance and opens new alternative avenues for scalable hardware design. △ Less

Submitted 18 December, 2024; originally announced December 2024.

Comments: 11 pages, 5 figures, Supplementary Information

arXiv:2412.14256 [pdf, other]

Scaling and logic in the color code on a superconducting quantum processor

Authors: Nathan Lacroix, Alexandre Bourassa, Francisco J. H. Heras, Lei M. Zhang, Johannes Bausch, Andrew W. Senior, Thomas Edlich, Noah Shutty, Volodymyr Sivak, Andreas Bengtsson, Matt McEwen, Oscar Higgott, Dvir Kafri, Jahan Claes, Alexis Morvan, Zijun Chen, Adam Zalcman, Sid Madhuk, Rajeev Acharya, Laleh Aghababaie Beni, Georg Aigeldinger, Ross Alcaraz, Trond I. Andersen, Markus Ansmann, Frank Arute , et al. (190 additional authors not shown)

Abstract: Quantum error correction is essential for bridging the gap between the error rates of physical devices and the extremely low logical error rates required for quantum algorithms. Recent error-correction demonstrations on superconducting processors have focused primarily on the surface code, which offers a high error threshold but poses limitations for logical operations. In contrast, the color code… ▽ More Quantum error correction is essential for bridging the gap between the error rates of physical devices and the extremely low logical error rates required for quantum algorithms. Recent error-correction demonstrations on superconducting processors have focused primarily on the surface code, which offers a high error threshold but poses limitations for logical operations. In contrast, the color code enables much more efficient logic, although it requires more complex stabilizer measurements and decoding techniques. Measuring these stabilizers in planar architectures such as superconducting qubits is challenging, and so far, realizations of color codes have not addressed performance scaling with code size on any platform. Here, we present a comprehensive demonstration of the color code on a superconducting processor, achieving logical error suppression and performing logical operations. Scaling the code distance from three to five suppresses logical errors by a factor of $Λ_{3/5}$ = 1.56(4). Simulations indicate this performance is below the threshold of the color code, and furthermore that the color code may be more efficient than the surface code with modest device improvements. Using logical randomized benchmarking, we find that transversal Clifford gates add an error of only 0.0027(3), which is substantially less than the error of an idling error correction cycle. We inject magic states, a key resource for universal computation, achieving fidelities exceeding 99% with post-selection (retaining about 75% of the data). Finally, we successfully teleport logical states between distance-three color codes using lattice surgery, with teleported state fidelities between 86.5(1)% and 90.7(1)%. This work establishes the color code as a compelling research direction to realize fault-tolerant quantum computation on superconducting processors in the near future. △ Less

Submitted 18 December, 2024; originally announced December 2024.

arXiv:2412.04333 [pdf, other]

Beta delayed neutron emission of $N=84$ $^{132}$Cd

Authors: M. Madurga, Z. Y. Xu, 1 R. Grzywacz, A. Andreyev, G. Benzoni, M. J. G. Borge, C. Costache, I. Cox, B. Dimitrov, P. Van Duppen, L. M. Fraile, S. Franchoo, H. Fynbo, B. Gonsalves, A. Gottardo, P. T. Greenless, C. J. Gross, L. J. Harkness-Brennan, M. Hyuse, D. S. Judson, S. Kisyov, K. Kolos, J. Konki, J. Kurzewicz, I. Lazarus , et al. (29 additional authors not shown)

Abstract: Using the time-of-flight technique, we measured the beta-delayed neutron emission of $^{132}$Cd. From our large-scale shell model (LSSM) calculation using the N$^3$LO interaction [Z.Y. Xu et al., Phys. Rev. Lett. 131, 022501 (2023)], we suggest the decay is dominated by the transformation of a neutron in the $g_{7/2}$ orbital, deep below the Fermi surface, into a proton in the $g_{9/2}$ orbital. W… ▽ More Using the time-of-flight technique, we measured the beta-delayed neutron emission of $^{132}$Cd. From our large-scale shell model (LSSM) calculation using the N$^3$LO interaction [Z.Y. Xu et al., Phys. Rev. Lett. 131, 022501 (2023)], we suggest the decay is dominated by the transformation of a neutron in the $g_{7/2}$ orbital, deep below the Fermi surface, into a proton in the $g_{9/2}$ orbital. We compare the beta-decay half-lives and neutron branching ratios of nuclei with $Z<50$ and $N\geq82$ obtained with our LSSM with those of leading "global" models. Our calculations match known half-lives and neutron branching ratios well and suggest that current leading models overestimate the yet-to-be-measured half-lives. Our model, backed by the $^{132}$Cd decay data presented here, offers robust predictive power for nuclei of astrophysical interest such as $r$-process waiting points. △ Less

Submitted 5 December, 2024; originally announced December 2024.

Comments: 7 pages, 5 figures

arXiv:2412.03773 [pdf, other]

Modular addition without black-boxes: Compressing explanations of MLPs that compute numerical integration

Authors: Chun Hei Yip, Rajashree Agrawal, Lawrence Chan, Jason Gross

Abstract: The goal of mechanistic interpretability is discovering simpler, low-rank algorithms implemented by models. While we can compress activations into features, compressing nonlinear feature-maps -- like MLP layers -- is an open problem. In this work, we present the first case study in rigorously compressing nonlinear feature-maps, which are the leading asymptotic bottleneck to compressing small trans… ▽ More The goal of mechanistic interpretability is discovering simpler, low-rank algorithms implemented by models. While we can compress activations into features, compressing nonlinear feature-maps -- like MLP layers -- is an open problem. In this work, we present the first case study in rigorously compressing nonlinear feature-maps, which are the leading asymptotic bottleneck to compressing small transformer models. We work in the classic setting of the modular addition models, and target a non-vacuous bound on the behaviour of the ReLU MLP in time linear in the parameter-count of the circuit. To study the ReLU MLP analytically, we use the infinite-width lens, which turns post-activation matrix multiplications into approximate integrals. We discover a novel interpretation of} the MLP layer in one-layer transformers implementing the ``pizza'' algorithm: the MLP can be understood as evaluating a quadrature scheme, where each neuron computes the area of a rectangle under the curve of a trigonometric integral identity. Our code is available at https://tinyurl.com/mod-add-integration. △ Less

Submitted 4 December, 2024; originally announced December 2024.

arXiv:2411.07405 [pdf, other]

Quality of Control based Resource Dimensioning for Collaborative Edge Robotics

Authors: Neelabhro Roy, Mani H. Dhullipalla, Gourav Prateek Sharma, Dimos V. Dimarogonas, James Gross

Abstract: With the increasing focus on flexible automation, which emphasizes systems capable of adapting to varied tasks and conditions, exploring future deployments of cloud and edge-based network infrastructures in robotic systems becomes crucial. This work, examines how wireless solutions could support the shift from rigid, wired setups toward more adaptive, flexible automation in industrial environments… ▽ More With the increasing focus on flexible automation, which emphasizes systems capable of adapting to varied tasks and conditions, exploring future deployments of cloud and edge-based network infrastructures in robotic systems becomes crucial. This work, examines how wireless solutions could support the shift from rigid, wired setups toward more adaptive, flexible automation in industrial environments. We provide a quality of control (QoC) based abstraction for robotic workloads, parameterized on loop latency and reliability, and jointly optimize system performance. The setup involves collaborative robots working on distributed tasks, underscoring how wireless communication can enable more dynamic coordination in flexible automation systems. We use our abstraction to optimally maximize the QoC ensuring efficient operation even under varying network conditions. Additionally, our solution allocates the communication resources in time slots, optimizing the balance between communication and control costs. Our simulation results highlight that minimizing the delay in the system may not always ensure the best QoC but can lead to substantial gains in QoC if delays are sometimes relaxed, allowing more packets to be delivered reliably. △ Less

Submitted 11 November, 2024; originally announced November 2024.

Comments: Accepted in IEEE CCNC 2025

arXiv:2410.21276 [pdf, other]

GPT-4o System Card

Authors: OpenAI, :, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander Mądry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis , et al. (395 additional authors not shown)

Abstract: GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 mil… ▽ More GPT-4o is an autoregressive omni model that accepts as input any combination of text, audio, image, and video, and generates any combination of text, audio, and image outputs. It's trained end-to-end across text, vision, and audio, meaning all inputs and outputs are processed by the same neural network. GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50\% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models. In line with our commitment to building AI safely and consistent with our voluntary commitments to the White House, we are sharing the GPT-4o System Card, which includes our Preparedness Framework evaluations. In this System Card, we provide a detailed look at GPT-4o's capabilities, limitations, and safety evaluations across multiple categories, focusing on speech-to-speech while also evaluating text and image capabilities, and measures we've implemented to ensure the model is safe and aligned. We also include third-party assessments on dangerous capabilities, as well as discussion of potential societal impacts of GPT-4o's text and vision capabilities. △ Less

Submitted 25 October, 2024; originally announced October 2024.

arXiv:2410.07476 [pdf, other]

Towards a unified and verified understanding of group-operation networks

Authors: Wilson Wu, Louis Jaburi, Jacob Drori, Jason Gross

Abstract: A recent line of work in mechanistic interpretability has focused on reverse-engineering the computation performed by neural networks trained on the binary operation of finite groups. We investigate the internals of one-hidden-layer neural networks trained on this task, revealing previously unidentified structure and producing a more complete description of such models in a step towards unifying t… ▽ More A recent line of work in mechanistic interpretability has focused on reverse-engineering the computation performed by neural networks trained on the binary operation of finite groups. We investigate the internals of one-hidden-layer neural networks trained on this task, revealing previously unidentified structure and producing a more complete description of such models in a step towards unifying the explanations of previous works (Chughtai et al., 2023; Stander et al., 2024). Notably, these models approximate equivariance in each input argument. We verify that our explanation applies to a large fraction of networks trained on this task by translating it into a compact proof of model performance, a quantitative evaluation of the extent to which we faithfully and concisely explain model internals. In the main text, we focus on the symmetric group S5. For models trained on this group, our explanation yields a guarantee of model accuracy that runs 3x faster than brute force and gives a >=95% accuracy bound for 45% of the models we trained. We were unable to obtain nontrivial non-vacuous accuracy bounds using only explanations from previous works. △ Less

Submitted 24 January, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

Comments: ICLR 2025 camera ready. 32 pages, 11 figures

arXiv:2410.06557 [pdf, other]

Observation of disorder-free localization and efficient disorder averaging on a quantum processor

Authors: Gaurav Gyawali, Tyler Cochran, Yuri Lensky, Eliott Rosenberg, Amir H. Karamlou, Kostyantyn Kechedzhi, Julia Berndtsson, Tom Westerhout, Abraham Asfaw, Dmitry Abanin, Rajeev Acharya, Laleh Aghababaie Beni, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Brian Ballard, Joseph C. Bardin, Andreas Bengtsson, Alexander Bilmes, Gina Bortoli, Alexandre Bourassa , et al. (195 additional authors not shown)

Abstract: One of the most challenging problems in the computational study of localization in quantum manybody systems is to capture the effects of rare events, which requires sampling over exponentially many disorder realizations. We implement an efficient procedure on a quantum processor, leveraging quantum parallelism, to efficiently sample over all disorder realizations. We observe localization without d… ▽ More One of the most challenging problems in the computational study of localization in quantum manybody systems is to capture the effects of rare events, which requires sampling over exponentially many disorder realizations. We implement an efficient procedure on a quantum processor, leveraging quantum parallelism, to efficiently sample over all disorder realizations. We observe localization without disorder in quantum many-body dynamics in one and two dimensions: perturbations do not diffuse even though both the generator of evolution and the initial states are fully translationally invariant. The disorder strength as well as its density can be readily tuned using the initial state. Furthermore, we demonstrate the versatility of our platform by measuring Renyi entropies. Our method could also be extended to higher moments of the physical observables and disorder learning. △ Less

Submitted 9 October, 2024; originally announced October 2024.

arXiv:2409.17142 [pdf, other]

Visualizing Dynamics of Charges and Strings in (2+1)D Lattice Gauge Theories

Authors: Tyler A. Cochran, Bernhard Jobst, Eliott Rosenberg, Yuri D. Lensky, Gaurav Gyawali, Norhan Eassa, Melissa Will, Dmitry Abanin, Rajeev Acharya, Laleh Aghababaie Beni, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Juan Atalaya, Ryan Babbush, Brian Ballard, Joseph C. Bardin, Andreas Bengtsson, Alexander Bilmes, Alexandre Bourassa, Jenna Bovaird, Michael Broughton, David A. Browne , et al. (167 additional authors not shown)

Abstract: Lattice gauge theories (LGTs) can be employed to understand a wide range of phenomena, from elementary particle scattering in high-energy physics to effective descriptions of many-body interactions in materials. Studying dynamical properties of emergent phases can be challenging as it requires solving many-body problems that are generally beyond perturbative limits. We investigate the dynamics of… ▽ More Lattice gauge theories (LGTs) can be employed to understand a wide range of phenomena, from elementary particle scattering in high-energy physics to effective descriptions of many-body interactions in materials. Studying dynamical properties of emergent phases can be challenging as it requires solving many-body problems that are generally beyond perturbative limits. We investigate the dynamics of local excitations in a $\mathbb{Z}_2$ LGT using a two-dimensional lattice of superconducting qubits. We first construct a simple variational circuit which prepares low-energy states that have a large overlap with the ground state; then we create particles with local gates and simulate their quantum dynamics via a discretized time evolution. As the effective magnetic field is increased, our measurements show signatures of transitioning from deconfined to confined dynamics. For confined excitations, the magnetic field induces a tension in the string connecting them. Our method allows us to experimentally image string dynamics in a (2+1)D LGT from which we uncover two distinct regimes inside the confining phase: for weak confinement the string fluctuates strongly in the transverse direction, while for strong confinement transverse fluctuations are effectively frozen. In addition, we demonstrate a resonance condition at which dynamical string breaking is facilitated. Our LGT implementation on a quantum processor presents a novel set of techniques for investigating emergent particle and string dynamics. △ Less

Submitted 25 September, 2024; originally announced September 2024.

arXiv:2409.16515 [pdf, ps, other]

Quantum error correction-inspired multiparameter quantum metrology

Authors: Sivaprasad Omanakuttan, Jonathan A. Gross, T. J. Volkoff

Abstract: We present a novel strategy for obtaining optimal probe states and measurement schemes in a class of noiseless multiparameter estimation problems with symmetry among the generators. The key to the framework is the introduction of a set of quantum metrology conditions, analogous to the quantum error correction conditions of Knill and Laflamme, which are utilized to identify probe states that satura… ▽ More We present a novel strategy for obtaining optimal probe states and measurement schemes in a class of noiseless multiparameter estimation problems with symmetry among the generators. The key to the framework is the introduction of a set of quantum metrology conditions, analogous to the quantum error correction conditions of Knill and Laflamme, which are utilized to identify probe states that saturate the multiparameter quantum Cramér-Rao bound. Similar to finding two-dimensional irreps for encoding a logical qubit in error correction, we identify trivial irreps of finite groups that guarantee the satisfaction of the quantum metrology conditions. To demonstrate our framework, we analyze the SU(2) estimation with symmetric states in which three parameters define a global rotation of an ensemble of $N$ qubits. For even $N$, we find that tetrahedral symmetry and, with fine-tuning, $S_{3}$ symmetry, are minimal symmetry groups providing optimal probe states for SU(2) estimation, but that the quantum metrology conditions can also be satisfied in an entanglement-assisted setting by using a maximally entangled state of two spin-$N/2$ representations for any $N$. By extending the multiparameter method of moments to non-commuting observables, we use the quantum metrology conditions to construct a measurement scheme that saturates the multiparameter quantum Cramér-Rao bound for small rotation angles. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Comments: Comments are Welcome!

Report number: LA-UR-24-27714

arXiv:2409.15671 [pdf, other]

Autonomous Hiking Trail Navigation via Semantic Segmentation and Geometric Analysis

Authors: Camndon Reed, Christopher Tatsch, Jason N. Gross, Yu Gu

Abstract: Natural environments pose significant challenges for autonomous robot navigation, particularly due to their unstructured and ever-changing nature. Hiking trails, with their dynamic conditions influenced by weather, vegetation, and human traffic, represent one such challenge. This work introduces a novel approach to autonomous hiking trail navigation that balances trail adherence with the flexibili… ▽ More Natural environments pose significant challenges for autonomous robot navigation, particularly due to their unstructured and ever-changing nature. Hiking trails, with their dynamic conditions influenced by weather, vegetation, and human traffic, represent one such challenge. This work introduces a novel approach to autonomous hiking trail navigation that balances trail adherence with the flexibility to adapt to off-trail routes when necessary. The solution is a Traversability Analysis module that integrates semantic data from camera images with geometric information from LiDAR to create a comprehensive understanding of the surrounding terrain. A planner uses this traversability map to navigate safely, adhering to trails while allowing off-trail movement when necessary to avoid on-trail hazards or for safe off-trail shortcuts. The method is evaluated through simulation to determine the balance between semantic and geometric information in traversability estimation. These simulations tested various weights to assess their impact on navigation performance across different trail scenarios. Weights were then validated through field tests at the West Virginia University Core Arboretum, demonstrating the method's effectiveness in a real-world environment. △ Less

Submitted 23 September, 2024; originally announced September 2024.

arXiv:2408.13687 [pdf, other]

doi 10.1038/s41586-024-08449-y

Quantum error correction below the surface code threshold

Authors: Rajeev Acharya, Laleh Aghababaie-Beni, Igor Aleiner, Trond I. Andersen, Markus Ansmann, Frank Arute, Kunal Arya, Abraham Asfaw, Nikita Astrakhantsev, Juan Atalaya, Ryan Babbush, Dave Bacon, Brian Ballard, Joseph C. Bardin, Johannes Bausch, Andreas Bengtsson, Alexander Bilmes, Sam Blackwell, Sergio Boixo, Gina Bortoli, Alexandre Bourassa, Jenna Bovaird, Leon Brill, Michael Broughton, David A. Browne , et al. (224 additional authors not shown)

Abstract: Quantum error correction provides a path to reach practical quantum computing by combining multiple physical qubits into a logical qubit, where the logical error rate is suppressed exponentially as more qubits are added. However, this exponential suppression only occurs if the physical error rate is below a critical threshold. In this work, we present two surface code memories operating below this… ▽ More Quantum error correction provides a path to reach practical quantum computing by combining multiple physical qubits into a logical qubit, where the logical error rate is suppressed exponentially as more qubits are added. However, this exponential suppression only occurs if the physical error rate is below a critical threshold. In this work, we present two surface code memories operating below this threshold: a distance-7 code and a distance-5 code integrated with a real-time decoder. The logical error rate of our larger quantum memory is suppressed by a factor of $Λ$ = 2.14 $\pm$ 0.02 when increasing the code distance by two, culminating in a 101-qubit distance-7 code with 0.143% $\pm$ 0.003% error per cycle of error correction. This logical memory is also beyond break-even, exceeding its best physical qubit's lifetime by a factor of 2.4 $\pm$ 0.3. We maintain below-threshold performance when decoding in real time, achieving an average decoder latency of 63 $μ$s at distance-5 up to a million cycles, with a cycle time of 1.1 $μ$s. To probe the limits of our error-correction performance, we run repetition codes up to distance-29 and find that logical performance is limited by rare correlated error events occurring approximately once every hour, or 3 $\times$ 10$^9$ cycles. Our results present device performance that, if scaled, could realize the operational requirements of large scale fault-tolerant quantum algorithms. △ Less

Submitted 24 August, 2024; originally announced August 2024.

Comments: 10 pages, 4 figures, Supplementary Information

Journal ref: Nature 638 (2025) 920-926

arXiv:2408.13196 [pdf, other]

Predictability of Performance in Communication Networks Under Markovian Dynamics

Authors: Samie Mostafavi, Simon Egger, György Dán, James Gross

Abstract: With the emergence of time-critical applications in modern communication networks, there is a growing demand for proactive network adaptation and quality of service (QoS) prediction. However, a fundamental question remains largely unexplored: how can we quantify and achieve more predictable communication systems in terms of performance? To address this gap, this paper introduces a theoretical fram… ▽ More With the emergence of time-critical applications in modern communication networks, there is a growing demand for proactive network adaptation and quality of service (QoS) prediction. However, a fundamental question remains largely unexplored: how can we quantify and achieve more predictable communication systems in terms of performance? To address this gap, this paper introduces a theoretical framework for defining and analyzing predictability in communication systems, with a focus on the impact of observations for performance forecasting. We establish a mathematical definition of predictability based on the total variation distance between forecast and marginal performance distributions. A system is deemed unpredictable when the forecast distribution, providing the most comprehensive characterization of future states using all accessible information, is indistinguishable from the marginal distribution, which depicts the system's behavior without any observational input. This framework is applied to multi-hop systems under Markovian conditions, with a detailed analysis of Geo/Geo/1 queuing models in both single-hop and multi-hop scenarios. We derive exact and approximate expressions for predictability in these systems, as well as upper bounds based on spectral analysis of the underlying Markov chains. Our results have implications for the design of efficient monitoring and prediction mechanisms in future communication networks aiming to provide deterministic services. △ Less

Submitted 25 April, 2025; v1 submitted 23 August, 2024; originally announced August 2024.

arXiv:2408.00913 [pdf, ps, other]

Design and Implementation of ARA Wireless Living Lab for Rural Broadband and Applications

Authors: Taimoor Ul Islam, Joshua Ofori Boateng, Md Nadim, Guoying Zu, Mukaram Shahid, Xun Li, Tianyi Zhang, Salil Reddy, Wei Xu, Ataberk Atalar, Vincent Lee, Yung-Fu Chen, Evan Gosling, Elisabeth Permatasari, Christ Somiah, Owen Perrin, Zhibo Meng, Reshal Afzal, Sarath Babu, Mohammed Soliman, Ali Hussain, Daji Qiao, Mai Zheng, Ozdal Boyraz, Yong Guan , et al. (9 additional authors not shown)

Abstract: Addressing the broadband gap between rural and urban regions requires rural-focused wireless research and innovation. In the meantime, rural regions provide rich, diverse use cases of advanced wireless, and they offer unique real-world settings for piloting applications that advance the frontiers of wireless systems (e.g., teleoperation of ground and aerial vehicles). To fill the broadband gap and… ▽ More Addressing the broadband gap between rural and urban regions requires rural-focused wireless research and innovation. In the meantime, rural regions provide rich, diverse use cases of advanced wireless, and they offer unique real-world settings for piloting applications that advance the frontiers of wireless systems (e.g., teleoperation of ground and aerial vehicles). To fill the broadband gap and to leverage the unique opportunities that rural regions provide for piloting advanced wireless applications, we design and implement the ARA wireless living lab for research and innovation in rural wireless systems and their applications in precision agriculture, community services, and so on. ARA focuses on the unique community, application, and economic context of rural regions, and it features the first-of-its-kind, real-world deployment of long-distance, high-capacity terrestrial wireless x-haul and access platforms as well as low-earth-orbit (LEO) satellite communications platforms across a rural area of diameter over 30 km. With both software-defined radios and programmable COTS systems, and through effective orchestration of these wireless resources with fiber as well as compute resources embedded end-to-end across user equipment (UE), base stations (BS), edge, and cloud, including support for Bring Your Own Device (BYOD), ARA offers programmability, performance, robustness, and heterogeneity at the same time, thus enabling rural-focused co-evolution of wireless and applications while helping advance the frontiers of wireless systems in domains such as Open RAN, NextG, and agriculture applications. △ Less

Submitted 28 May, 2025; v1 submitted 1 August, 2024; originally announced August 2024.

Comments: 47 pages, 18 figures

arXiv:2407.11387 [pdf, other]

A Framework for Evaluating Appropriateness, Trustworthiness, and Safety in Mental Wellness AI Chatbots

Authors: Lucia Chen, David A. Preece, Pilleriin Sikka, James J. Gross, Ben Krause

Abstract: Large language model (LLM) chatbots are susceptible to biases and hallucinations, but current evaluations of mental wellness technologies lack comprehensive case studies to evaluate their practical applications. Here, we address this gap by introducing the MHealth-EVAL framework, a new role-play based interactive evaluation method designed specifically for evaluating the appropriateness, trustwort… ▽ More Large language model (LLM) chatbots are susceptible to biases and hallucinations, but current evaluations of mental wellness technologies lack comprehensive case studies to evaluate their practical applications. Here, we address this gap by introducing the MHealth-EVAL framework, a new role-play based interactive evaluation method designed specifically for evaluating the appropriateness, trustworthiness, and safety of mental wellness chatbots. We also introduce Psyfy, a new chatbot leveraging LLMs to facilitate transdiagnostic Cognitive Behavioral Therapy (CBT). We demonstrate the MHealth-EVAL framework's utility through a comparative study of two versions of Psyfy against standard baseline chatbots. Our results showed that Psyfy chatbots outperformed the baseline chatbots in delivering appropriate responses, engaging users, and avoiding untrustworthy responses. However, both Psyfy and the baseline chatbots exhibited some limitations, such as providing predominantly US-centric resources. While Psyfy chatbots were able to identify most unsafe situations and avoid giving unsafe responses, they sometimes struggled to recognize subtle harmful intentions when prompted in role play scenarios. Our study demonstrates a practical application of the MHealth-EVAL framework and showcases Psyfy's utility in harnessing LLMs to enhance user engagement and provide flexible and appropriate responses aligned with an evidence-based CBT approach. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.08887 [pdf, other]

Automatic Pruning of Fine-tuning Datasets for Transformer-based Language Models

Authors: Mohammadreza Tayaranian, Seyyed Hasan Mozafari, Brett H. Meyer, James J. Clark, Warren J. Gross

Abstract: Transformer-based language models have shown state-of-the-art performance on a variety of natural language understanding tasks. To achieve this performance, these models are first pre-trained on general corpus and then fine-tuned on downstream tasks. Previous work studied the effect of pruning the training set of the downstream tasks on the performance of the model on its evaluation set. In this w… ▽ More Transformer-based language models have shown state-of-the-art performance on a variety of natural language understanding tasks. To achieve this performance, these models are first pre-trained on general corpus and then fine-tuned on downstream tasks. Previous work studied the effect of pruning the training set of the downstream tasks on the performance of the model on its evaluation set. In this work, we propose an automatic dataset pruning method for the training set of fine-tuning tasks. Our method is based on the model's success rate in correctly classifying each training data point. Unlike previous work which relies on user feedback to determine subset size, our method automatically extracts training subsets that are adapted for each pair of model and fine-tuning task. Our method provides multiple subsets for use in dataset pruning that navigate the trade-off between subset size and evaluation accuracy. Our largest subset, which we also refer to as the winning ticket subset, is on average $3 \times$ smaller than the original training set of the fine-tuning task. Our experiments on 5 downstream tasks and 2 language models show that, on average, fine-tuning on the winning ticket subsets results in a $0.1 \%$ increase in the evaluation performance of the model. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 28 pages, 17 figures. Accepted at the Third Conference on Lifelong Learning Agents (CoLLAs 2024)

arXiv:2407.01583 [pdf, other]

doi 10.1038/s41467-025-56724-x

Optimal Low-Depth Quantum Signal-Processing Phase Estimation

Authors: Yulong Dong, Jonathan A. Gross, Murphy Yuezhen Niu

Abstract: Quantum effects like entanglement and coherent amplification can be used to drastically enhance the accuracy of quantum parameter estimation beyond classical limits. However, challenges such as decoherence and time-dependent errors hinder Heisenberg-limited amplification. We introduce Quantum Signal-Processing Phase Estimation algorithms that are robust against these challenges and achieve optimal… ▽ More Quantum effects like entanglement and coherent amplification can be used to drastically enhance the accuracy of quantum parameter estimation beyond classical limits. However, challenges such as decoherence and time-dependent errors hinder Heisenberg-limited amplification. We introduce Quantum Signal-Processing Phase Estimation algorithms that are robust against these challenges and achieve optimal performance as dictated by the Cramér-Rao bound. These algorithms use quantum signal transformation to decouple interdependent phase parameters into largely orthogonal ones, ensuring that time-dependent errors in one do not compromise the accuracy of learning the other. Combining provably optimal classical estimation with near-optimal quantum circuit design, our approach achieves a standard deviation accuracy of $10^{-4}$ radians for estimating unwanted swap angles in superconducting two-qubit experiments, using low-depth ($<10$) circuits. This represents up to two orders of magnitude improvement over existing methods. Theoretically and numerically, we demonstrate the optimality of our algorithm against time-dependent phase errors, observing that the variance of the time-sensitive parameter $\varphi$ scales faster than the asymptotic Heisenberg scaling in the small-depth regime. Our results are rigorously validated against the quantum Fisher information, confirming our protocol's ability to achieve unmatched precision for two-qubit gate learning. △ Less

Submitted 16 February, 2025; v1 submitted 17 June, 2024; originally announced July 2024.

Comments: 58 pages, 22 figures. arXiv admin note: substantial text overlap with arXiv:2209.11207

Journal ref: Nature Communications 16, no. 1 (2025): 1504

arXiv:2406.13500 [pdf, other]

Gradient-Boosted Generalized Linear Models for Conditional Vine Copulas

Authors: David Jobst, Annette Möller, Jürgen Groß

Abstract: Vine copulas are flexible dependence models using bivariate copulas as building blocks. If the parameters of the bivariate copulas in the vine copula depend on covariates, one obtains a conditional vine copula. We propose an extension for the estimation of continuous conditional vine copulas, where the parameters of continuous conditional bivariate copulas are estimated sequentially and separately… ▽ More Vine copulas are flexible dependence models using bivariate copulas as building blocks. If the parameters of the bivariate copulas in the vine copula depend on covariates, one obtains a conditional vine copula. We propose an extension for the estimation of continuous conditional vine copulas, where the parameters of continuous conditional bivariate copulas are estimated sequentially and separately via gradient-boosting. For this purpose, we link covariates via generalized linear models (GLMs) to Kendall's $τ$ correlation coefficient from which the corresponding copula parameter can be obtained. Consequently, the gradient-boosting algorithm estimates the copula parameters providing a natural covariate selection. In a second step, an additional covariate deselection procedure is applied. The performance of the gradient-boosted conditional vine copulas is illustrated in a simulation study. Linear covariate effects in low- and high-dimensional settings are investigated for the conditional bivariate copulas separately and for conditional vine copulas. Moreover, the gradient-boosted conditional vine copulas are applied to the temporal postprocessing of ensemble weather forecasts in a low-dimensional setting. The results show, that our suggested method is able to outperform the benchmark methods and identifies temporal correlations better. Eventually, we provide an R-package called boostCopula for this method. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.11779 [pdf, other]

Compact Proofs of Model Performance via Mechanistic Interpretability

Authors: Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan

Abstract: We propose using mechanistic interpretability -- techniques for reverse engineering model weights into human-interpretable algorithms -- to derive and compactly prove formal guarantees on model performance. We prototype this approach by formally proving accuracy lower bounds for a small transformer trained on Max-of-K, validating proof transferability across 151 random seeds and four values of K.… ▽ More We propose using mechanistic interpretability -- techniques for reverse engineering model weights into human-interpretable algorithms -- to derive and compactly prove formal guarantees on model performance. We prototype this approach by formally proving accuracy lower bounds for a small transformer trained on Max-of-K, validating proof transferability across 151 random seeds and four values of K. We create 102 different computer-assisted proof strategies and assess their length and tightness of bound on each of our models. Using quantitative metrics, we find that shorter proofs seem to require and provide more mechanistic understanding. Moreover, we find that more faithful mechanistic understanding leads to tighter performance bounds. We confirm these connections by qualitatively examining a subset of our proofs. Finally, we identify compounding structureless errors as a key challenge for using mechanistic interpretability to generate compact proofs on model performance. △ Less

Submitted 24 December, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: accepted to the 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

arXiv:2406.01121 [pdf, other]

$K^+Λ(1520)$ photoproduction at forward angles near threshold with the BGOOD experiment

Authors: E. O. Rosanowski, T. C. Jude, S. Alef, A. J. Clara Figueiredo, D. D Burdeinyi, P. L. Cole, R. Di Salvo, D. Elsner, A. Fantini, O. Freyermuth, F. Frommberger, V. B Ganenko, F. Ghio, J. Groß, K. Kohl, P. Levi Sandri, G. Mandaglio, R. Messi, D. Moricciani, P. Pedroni, B. -E. Reitz, M. Romaniuk, G. Scheluchin, H. Schmieden, A. Sonnenschein

Abstract: The differential cross section for $γp\rightarrow K^+Λ(1520)$ was measured from threshold to a centre-of-mass energy of 2090\,MeV at forward angles at the BGOOD experiment. The high statistical precision and resolution in centre-of-mass energy and angle allows a detailed characterisation of this low-momentum transfer kinematic region. The data agree with a previous LEPS measurement and support eff… ▽ More The differential cross section for $γp\rightarrow K^+Λ(1520)$ was measured from threshold to a centre-of-mass energy of 2090\,MeV at forward angles at the BGOOD experiment. The high statistical precision and resolution in centre-of-mass energy and angle allows a detailed characterisation of this low-momentum transfer kinematic region. The data agree with a previous LEPS measurement and support effective Lagrangian models that indicate that the contact term dominates the cross section near threshold. △ Less

Submitted 29 October, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 6 pages, 7 figures

arXiv:2405.17385 [pdf, other]

Thermalization and Criticality on an Analog-Digital Quantum Simulator

Authors: Trond I. Andersen, Nikita Astrakhantsev, Amir H. Karamlou, Julia Berndtsson, Johannes Motruk, Aaron Szasz, Jonathan A. Gross, Alexander Schuckert, Tom Westerhout, Yaxing Zhang, Ebrahim Forati, Dario Rossi, Bryce Kobrin, Agustin Di Paolo, Andrey R. Klots, Ilya Drozdov, Vladislav D. Kurilovich, Andre Petukhov, Lev B. Ioffe, Andreas Elben, Aniket Rath, Vittorio Vitale, Benoit Vermersch, Rajeev Acharya, Laleh Aghababaie Beni , et al. (202 additional authors not shown)

Abstract: Understanding how interacting particles approach thermal equilibrium is a major challenge of quantum simulators. Unlocking the full potential of such systems toward this goal requires flexible initial state preparation, precise time evolution, and extensive probes for final state characterization. We present a quantum simulator comprising 69 superconducting qubits which supports both universal qua… ▽ More Understanding how interacting particles approach thermal equilibrium is a major challenge of quantum simulators. Unlocking the full potential of such systems toward this goal requires flexible initial state preparation, precise time evolution, and extensive probes for final state characterization. We present a quantum simulator comprising 69 superconducting qubits which supports both universal quantum gates and high-fidelity analog evolution, with performance beyond the reach of classical simulation in cross-entropy benchmarking experiments. Emulating a two-dimensional (2D) XY quantum magnet, we leverage a wide range of measurement techniques to study quantum states after ramps from an antiferromagnetic initial state. We observe signatures of the classical Kosterlitz-Thouless phase transition, as well as strong deviations from Kibble-Zurek scaling predictions attributed to the interplay between quantum and classical coarsening of the correlated domains. This interpretation is corroborated by injecting variable energy density into the initial state, which enables studying the effects of the eigenstate thermalization hypothesis (ETH) in targeted parts of the eigenspectrum. Finally, we digitally prepare the system in pairwise-entangled dimer states and image the transport of energy and vorticity during thermalization. These results establish the efficacy of superconducting analog-digital quantum processors for preparing states across many-body spectra and unveiling their thermalization dynamics. △ Less

Submitted 8 July, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.15637 [pdf]

Clearing the Path for Software Sustainability

Authors: Jennifer Gross, Sofia Ouhbi

Abstract: The advancement of software sustainability encounters notable challenges, underscoring the necessity for understanding these challenges to facilitate significant progress and pave the way for effective solutions to advance software sustainability. This paper outlines key challenges identified in literature based on findings from a tertiary study. Challenges identified include: confusion regarding… ▽ More The advancement of software sustainability encounters notable challenges, underscoring the necessity for understanding these challenges to facilitate significant progress and pave the way for effective solutions to advance software sustainability. This paper outlines key challenges identified in literature based on findings from a tertiary study. Challenges identified include: confusion regarding the definition of software sustainability, uncertainty about when to consider sustainability in software development, lack of assessment metrics and tools, narrow perspectives on sustainability in software systems, insufficient awareness and education, and a lack of serious considerations in practice. The paper aims at clarifying the confusion surrounding software sustainability to motivate effective solutions. The provided recommendations aim to give a more organized approach towards advancing sustainable software development, emphasizing comprehensive strategies, the integration of sustainability as a fundamental aspect of software development, actionable research directions, and the cultivation of a common understanding of sustainable software. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.09392 [pdf, other]

Coherent $π^0ηd$ photoproduction at forward deuteron angles measured at BGOOD

Authors: A. J. Clara Figueiredo, T. C. Jude, S. Alef, P. L. Cole, R. Di Salvo, D. Elsner, A. Fantini, O. Freyermuth, F. Frommberger, F. Ghio, J. Groß, K. Kohl, P. Levi Sandri, G. Mandaglio, P. Pedroni, B. -E. Reitz, M. Romaniuk, G. Scheluchin, H. Schmieden, A. Sonnenschein, C. Tillmanns

Abstract: The coherent reaction, $γd \rightarrow π^0ηd$ was studied with the BGOOD experiment at ELSA from threshold to a centre-of-mass energy of 3200\,MeV. A full kinematic reconstruction was made, with final state deuterons identified in the forward spectrometer and $π^0$ and $η$ decays in the central BGO Rugby Ball. The strength of the differential cross section exceeds what can be described by models o… ▽ More The coherent reaction, $γd \rightarrow π^0ηd$ was studied with the BGOOD experiment at ELSA from threshold to a centre-of-mass energy of 3200\,MeV. A full kinematic reconstruction was made, with final state deuterons identified in the forward spectrometer and $π^0$ and $η$ decays in the central BGO Rugby Ball. The strength of the differential cross section exceeds what can be described by models of coherent photoproduction at forward angles by orders of magnitude. The distribution of the differential cross section has an excellent agreement with a model including quasi-free $Δπ$ photoproduction, pion re-scattering and $N(1535)$ formation and subsequent nucleon coalescence to the deuteron. This also gives a reasonable description of the two-body invariant mass distributions and naturally explains the similar magnitudes of this channel and $π^0π^0 d$ coherent photoproduction. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: 7 pages, 5 figures

arXiv:2404.12550 [pdf, other]

doi 10.1038/s41534-024-00917-7

Characterizing Coherent Errors using Matrix-Element Amplification

Authors: Jonathan A. Gross, Elie Genois, Dripto M. Debroy, Yaxing Zhang, Wojciech Mruczkiewicz, Ze-Pei Cian, Zhang Jiang

Abstract: Repeating a gate sequence multiple times amplifies systematic errors coherently, making it a useful tool for characterizing quantum gates. However, the precision of such an approach is limited by low-frequency noises, while its efficiency hindered by time-consuming scans required to match up the phases of the off-diagonal matrix elements being amplified. Here, we overcome both challenges by interl… ▽ More Repeating a gate sequence multiple times amplifies systematic errors coherently, making it a useful tool for characterizing quantum gates. However, the precision of such an approach is limited by low-frequency noises, while its efficiency hindered by time-consuming scans required to match up the phases of the off-diagonal matrix elements being amplified. Here, we overcome both challenges by interleaving the gate of interest with dynamical decoupling sequences in a protocol we call Matrix-Element Amplification using Dynamical Decoupling (MEADD). Using frequency-tunable superconducting qubits from a Google Sycamore quantum processor, we experimentally demonstrate that MEADD surpasses the accuracy and precision of existing characterization protocols for estimating systematic errors in single- and two-qubit gates. In particular, MEADD yields factors of 5 to 10 improvements in estimating coherent parameters of the $\mathrm{CZ}$ gates compared to existing methods, reaching a precision below one milliradian. We also use it to characterize coherent crosstalk in the processor which was previously too small to detect reliably. △ Less

Submitted 2 December, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Journal ref: npj Quantum Inf 10, 123 (2024)

arXiv:2404.04964 [pdf, other]

The zero degree of freedom non-central chi squared distribution for ensemble postprocessing

Authors: Jürgen Groß, Annette Möller

Abstract: In this note the use of the zero degree non-central chi squared distribution as predictive distribution for ensemble postprocessing is investigated. It has a point mass at zero by definition, and is thus particularly suited for postprocessing weather variables naturally exhibiting large numbers of zeros, such as precipitation, solar radiation or lightnings. Due to the properties of the distributio… ▽ More In this note the use of the zero degree non-central chi squared distribution as predictive distribution for ensemble postprocessing is investigated. It has a point mass at zero by definition, and is thus particularly suited for postprocessing weather variables naturally exhibiting large numbers of zeros, such as precipitation, solar radiation or lightnings. Due to the properties of the distribution no additional truncation or censoring is required to obtain a positive probability at zero. The presented study investigates its performance compared to that of the censored generalized extreme value distribution and the censored and shifted gamma distribution for postprocessing 24h accumulated precipitation using an EMOS (ensemble model output statistics) approach with a rolling training period. The obtained results support the conclusion that it serves well as a predictive distribution in postprocessing precipitation and thus may also be considered in future analyses of other weather variables having substantial zero observations. △ Less

Submitted 7 April, 2024; originally announced April 2024.

MSC Class: 62P12

arXiv:2404.03489 [pdf, other]

Design of Stickbug: a Six-Armed Precision Pollination Robot

Authors: Trevor Smith, Madhav Rijal, Christopher Tatsch, R. Michael Butts, Jared Beard, R. Tyler Cook, Andy Chu, Jason Gross, Yu Gu

Abstract: This work presents the design of Stickbug, a six-armed, multi-agent, precision pollination robot that combines the accuracy of single-agent systems with swarm parallelization in greenhouses. Precision pollination robots have often been proposed to offset the effects of a decreasing population of natural pollinators, but they frequently lack the required parallelization and scalability. Stickbug ac… ▽ More This work presents the design of Stickbug, a six-armed, multi-agent, precision pollination robot that combines the accuracy of single-agent systems with swarm parallelization in greenhouses. Precision pollination robots have often been proposed to offset the effects of a decreasing population of natural pollinators, but they frequently lack the required parallelization and scalability. Stickbug achieves this by allowing each arm and drive base to act as an individual agent, significantly reducing planning complexity. Stickbug uses a compact holonomic Kiwi drive to navigate narrow greenhouse rows, a tall mast to support multiple manipulators and reach plant heights, a detection model and classifier to identify Bramble flowers, and a felt-tipped end-effector for contact-based pollination. Initial experimental validation demonstrates that Stickbug can attempt over 1.5 pollinations per minute with a 50% success rate. Additionally, a Bramble flower perception dataset was created and is publicly available alongside Stickbug's software and design files. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: 7 pages, 7 figures

arXiv:2402.00555 [pdf, other]

Time Series based Ensemble Model Output Statistics for Temperature Forecasts Postprocessing

Authors: David Jobst, Annette Möller, Jürgen Groß

Abstract: Nowadays, weather prediction is based on numerical weather prediction (NWP) models to produce an ensemble of forecasts. Despite of large improvements over the last few decades, they still tend to exhibit systematic bias and dispersion errors. Consequently, these forecasts may be improved by statistical postprocessing. This work proposes an extension of the ensemble model output statistics (EMOS) m… ▽ More Nowadays, weather prediction is based on numerical weather prediction (NWP) models to produce an ensemble of forecasts. Despite of large improvements over the last few decades, they still tend to exhibit systematic bias and dispersion errors. Consequently, these forecasts may be improved by statistical postprocessing. This work proposes an extension of the ensemble model output statistics (EMOS) method in a time series framework. Besides of taking account of seasonality and trend in the location and scale parameter of the predictive distribution, the autoregressive process in the mean forecast errors or the standardized forecast errors is considered. The models can be further extended by allowing generalized autoregressive conditional heteroscedasticity (GARCH). Last but not least, it is outlined how to use these models for arbitrary forecast horizons. To illustrate the performance of the suggested EMOS models in time series fashion, we present a case study for the postprocessing of 2 m surface temperature forecasts using five different lead times and a set of observation stations in Germany. The results indicate that the time series EMOS extensions are able to significantly outperform the benchmark EMOS and autoregressive adjusted EMOS (AR-EMOS) in most of the lead time-station cases. To complement this article, our method is accompanied by an R-package called tsEMOS. △ Less

Submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.09856 [pdf, other]

EDAF: An End-to-End Delay Analytics Framework for 5G-and-Beyond Networks

Authors: Samie Mostafavi, Marius Tillner, Gourav Prateek Sharma, James Gross

Abstract: Supporting applications in emerging domains like cyber-physical systems and human-in-the-loop scenarios typically requires adherence to strict end-to-end delay guarantees. Contributions of many tandem processes unfolding layer by layer within the wireless network result in violations of delay constraints, thereby severely degrading application performance. Meeting the application's stringent requi… ▽ More Supporting applications in emerging domains like cyber-physical systems and human-in-the-loop scenarios typically requires adherence to strict end-to-end delay guarantees. Contributions of many tandem processes unfolding layer by layer within the wireless network result in violations of delay constraints, thereby severely degrading application performance. Meeting the application's stringent requirements necessitates coordinated optimization of the end-to-end delay by fine-tuning all contributing processes. To achieve this task, we designed and implemented EDAF, a framework to decompose packets' end-to-end delays and determine each component's significance for 5G network. We showcase EDAF on OpenAirInterface 5G uplink, modified to report timestamps across the data plane. By applying the obtained insights, we optimized end-to-end uplink delay by eliminating segmentation and frame-alignment delays, decreasing average delay from 12ms to 4ms. △ Less

Submitted 18 January, 2024; originally announced January 2024.

Comments: Submitted to the 11th International Workshop on Computer and Networking Experimental Research using Testbeds (CNERT 2024)

arXiv:2401.04271 [pdf, ps, other]

doi 10.1103/PRXQuantum.5.020355

Fault-tolerant quantum computation using large spin cat-codes

Authors: Sivaprasad Omanakuttan, Vikas Buchemmavari, Jonathan A. Gross, Ivan H Deutsch, Milad Marvian

Abstract: We construct a fault-tolerant quantum error-correcting protocol based on a qubit encoded in a large spin qudit using a spin-cat code, analogous to the continuous variable cat encoding. With this, we can correct the dominant error sources, namely processes that can be expressed as error operators that are linear or quadratic in the components of angular momentum. Such codes tailored to dominant err… ▽ More We construct a fault-tolerant quantum error-correcting protocol based on a qubit encoded in a large spin qudit using a spin-cat code, analogous to the continuous variable cat encoding. With this, we can correct the dominant error sources, namely processes that can be expressed as error operators that are linear or quadratic in the components of angular momentum. Such codes tailored to dominant error sources {can} exhibit superior thresholds and lower resource overheads when compared to those designed for unstructured noise models. To preserve the dominant errors during gate operations, we identify a suitable universal gate set. A key component is the CNOT gate that preserves the rank of spherical tensor operators. Categorizing the dominant errors as phase and amplitude errors, we demonstrate how phase errors, analogous to phase-flip errors for qubits, can be effectively corrected. Furthermore, we propose a measurement-free error correction scheme to address amplitude errors without relying on syndrome measurements. Through an in-depth analysis of logical CNOT gate errors, we establish that the fault-tolerant threshold for error correction in the spin-cat encoding surpasses that of standard qubit-based encodings. We consider a specific implementation based on neutral-atom quantum computing, with qudits encoded in the nuclear spin of $^{87}$Sr, and show how to generate the universal gate set, including the rank-preserving CNOT gate, using quantum control and the Rydberg blockade. These findings pave the way for encoding a qubit in a large spin with the potential to achieve fault tolerance, high threshold, and reduced resource overhead in quantum information processing. △ Less

Submitted 11 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: Published in PRX Quantum

Journal ref: PRX Quantum 2024

arXiv:2312.04917 [pdf]

doi 10.1007/978-3-031-49266-2_10

Operationalizing Assurance Cases for Data Scientists: A Showcase of Concepts and Tooling in the Context of Test Data Quality for Machine Learning

Authors: Lisa Jöckel, Michael Kläs, Janek Groß, Pascal Gerber, Markus Scholz, Jonathan Eberle, Marc Teschner, Daniel Seifert, Richard Hawkins, John Molloy, Jens Ottnad

Abstract: Assurance Cases (ACs) are an established approach in safety engineering to argue quality claims in a structured way. In the context of quality assurance for Machine Learning (ML)-based software components, ACs are also being discussed and appear promising. Tools for operationalizing ACs do exist, yet mainly focus on supporting safety engineers on the system level. However, assuring the quality of… ▽ More Assurance Cases (ACs) are an established approach in safety engineering to argue quality claims in a structured way. In the context of quality assurance for Machine Learning (ML)-based software components, ACs are also being discussed and appear promising. Tools for operationalizing ACs do exist, yet mainly focus on supporting safety engineers on the system level. However, assuring the quality of an ML component within the system is commonly the responsibility of data scientists, who are usually less familiar with these tools. To address this gap, we propose a framework to support the operationalization of ACs for ML components based on technologies that data scientists use on a daily basis: Python and Jupyter Notebook. Our aim is to make the process of creating ML-related evidence in ACs more effective. Results from the application of the framework, documented through notebooks, can be integrated into existing AC tools. We illustrate the application of the framework on an example excerpt concerned with the quality of the test data. △ Less

Submitted 8 December, 2023; originally announced December 2023.

Comments: Accepted for publication at International Conference on Product-Focused Software Process Improvement (Profes 2023), https://conf.researchr.org/home/profes-2023

arXiv:2312.00162 [pdf, other]

Qutrit codes within representations of SU(3)

Authors: Xzavier Herbert, Jonathan Gross, Michael Newman

Abstract: We describe a quantum error-detecting and error-correcting code embedded within irreducible representations of SU(3). These logical qutrits inherit the He(3) symmetries induced by the representation, while protecting against small SU(3) displacements. We explore the general methodology for finding codes from structure-inducing representations of groups, together with symmetries inherited from fini… ▽ More We describe a quantum error-detecting and error-correcting code embedded within irreducible representations of SU(3). These logical qutrits inherit the He(3) symmetries induced by the representation, while protecting against small SU(3) displacements. We explore the general methodology for finding codes from structure-inducing representations of groups, together with symmetries inherited from finite subgroups, extending the case of spin representations of SU(2). △ Less

Submitted 30 November, 2023; originally announced December 2023.

Comments: 14 pages

Showing 1–50 of 364 results for author: Groß, J