Search | arXiv e-print repository

Getting More from Less: Transfer Learning Improves Sleep Stage Decoding Accuracy in Peripheral Wearable Devices

Authors: William G Coon, Diego Luna, Akshita Panagrahi, Matthew Reid, Mattson Ogg

Abstract: Transfer learning, a technique commonly used in generative artificial intelligence, allows neural network models to bring prior knowledge to bear when learning a new task. This study demonstrates that transfer learning significantly enhances the accuracy of sleep-stage decoding from peripheral wearable devices by leveraging neural network models pretrained on electroencephalographic (EEG) signals.… ▽ More Transfer learning, a technique commonly used in generative artificial intelligence, allows neural network models to bring prior knowledge to bear when learning a new task. This study demonstrates that transfer learning significantly enhances the accuracy of sleep-stage decoding from peripheral wearable devices by leveraging neural network models pretrained on electroencephalographic (EEG) signals. Consumer wearable technologies typically rely on peripheral physiological signals such as pulse plethysmography (PPG) and respiratory data, which, while convenient, lack the fidelity of clinical electroencephalography (EEG) for detailed sleep-stage classification. We pretrained a transformer-based neural network on a large, publicly available EEG dataset and subsequently fine-tuned this model on noisier peripheral signals. Our transfer learning approach improved overall classification accuracy from 67.6\% (baseline model trained solely on peripheral signals) to 76.6\%. Notable accuracy improvements were observed across sleep stages, particularly lighter sleep stages such as REM and N1. These results highlight transfer learning's potential to substantially enhance the accuracy and utility of consumer wearable devices without altering existing hardware. Future integration of self-supervised learning methods may further boost performance, facilitating more precise, longitudinal sleep monitoring for personalized health applications. △ Less

Submitted 31 May, 2025; originally announced June 2025.

arXiv:2505.01860 [pdf, other]

Performing all-atom molecular dynamics simulations of intrinsically disordered proteins with replica exchange solute tempering

Authors: Jaya Krishna Koneru, Korey M. Reid, Paul Robustelli

Abstract: All-atom molecular dynamics (MD) computer simulations are a valuable tool for characterizing the conformational ensembles of intrinsically disordered proteins (IDPs). IDP conformational ensembles are highly heterogeneous and contain structures with many distinct topologies separated by large free-energy barriers. Sampling the vast conformational space of IDPs in explicit solvent all-atom MD simula… ▽ More All-atom molecular dynamics (MD) computer simulations are a valuable tool for characterizing the conformational ensembles of intrinsically disordered proteins (IDPs). IDP conformational ensembles are highly heterogeneous and contain structures with many distinct topologies separated by large free-energy barriers. Sampling the vast conformational space of IDPs in explicit solvent all-atom MD simulations is extremely challenging, and enhanced sampling methods are generally required to obtain statistically meaningful descriptions of IDP conformational ensembles. Replica exchange solute tempering (REST) methods, where multiple coupled simulations of a system are performed in parallel with selectively modified potential energy functions, are a powerful approach for efficiently sampling the conformational space of IDPs. In this chapter, we demonstrate how to set-up, perform and analyze all-atom MD simulations of IDPs with REST enhanced sampling methods. △ Less

Submitted 3 May, 2025; originally announced May 2025.

Comments: 30 pages, 4 figures

arXiv:2504.01669 [pdf, other]

The CosmoVerse White Paper: Addressing observational tensions in cosmology with systematics and fundamental physics

Authors: Eleonora Di Valentino, Jackson Levi Said, Adam Riess, Agnieszka Pollo, Vivian Poulin, Adrià Gómez-Valent, Amanda Weltman, Antonella Palmese, Caroline D. Huang, Carsten van de Bruck, Chandra Shekhar Saraf, Cheng-Yu Kuo, Cora Uhlemann, Daniela Grandón, Dante Paz, Dominique Eckert, Elsa M. Teixeira, Emmanuel N. Saridakis, Eoin Ó Colgáin, Florian Beutler, Florian Niedermann, Francesco Bajardi, Gabriela Barenboim, Giulia Gubitosi, Ilaria Musella , et al. (513 additional authors not shown)

Abstract: The standard model of cosmology has provided a good phenomenological description of a wide range of observations both at astrophysical and cosmological scales for several decades. This concordance model is constructed by a universal cosmological constant and supported by a matter sector described by the standard model of particle physics and a cold dark matter contribution, as well as very early-t… ▽ More The standard model of cosmology has provided a good phenomenological description of a wide range of observations both at astrophysical and cosmological scales for several decades. This concordance model is constructed by a universal cosmological constant and supported by a matter sector described by the standard model of particle physics and a cold dark matter contribution, as well as very early-time inflationary physics, and underpinned by gravitation through general relativity. There have always been open questions about the soundness of the foundations of the standard model. However, recent years have shown that there may also be questions from the observational sector with the emergence of differences between certain cosmological probes. In this White Paper, we identify the key objectives that need to be addressed over the coming decade together with the core science projects that aim to meet these challenges. These discordances primarily rest on the divergence in the measurement of core cosmological parameters with varying levels of statistical confidence. These possible statistical tensions may be partially accounted for by systematics in various measurements or cosmological probes but there is also a growing indication of potential new physics beyond the standard model. After reviewing the principal probes used in the measurement of cosmological parameters, as well as potential systematics, we discuss the most promising array of potential new physics that may be observable in upcoming surveys. We also discuss the growing set of novel data analysis approaches that go beyond traditional methods to test physical models. [Abridged] △ Less

Submitted 15 May, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

Comments: 416 pages, 81 figures, accepted in PotDU

arXiv:2503.12749 [pdf, other]

Matrix phase-space representations in quantum optics

Authors: Peter D. Drummond, Alexander S. Dellios, Margaret D. Reid

Abstract: We introduce matrix quantum phase-space distributions. These extend the idea of a quantum phase-space representation via projections onto a density matrix of global symmetry variables. The method is applied to verification of low-loss Gaussian boson sampling (GBS) quantum computational advantage experiments with up to 10,000 modes, where classically generating photon-number counts is exponentially… ▽ More We introduce matrix quantum phase-space distributions. These extend the idea of a quantum phase-space representation via projections onto a density matrix of global symmetry variables. The method is applied to verification of low-loss Gaussian boson sampling (GBS) quantum computational advantage experiments with up to 10,000 modes, where classically generating photon-number counts is exponentially hard. We demonstrate improvements in sampling error by a factor of 1000 or more compared to unprojected methods, which are infeasible for such cases. △ Less

Submitted 16 March, 2025; originally announced March 2025.

Comments: Six figures, results presented at 2025 APS global summit

arXiv:2502.07550 [pdf, other]

doi 10.3847/1538-4357/adb70f

The Expanding 3 kpc Arms Are Neither Expanding nor Spiral Arms but X1 Orbits Driven by the Galactic Bar

Authors: Jayender Kumar, Mark J. Reid, T. M. Dame, Simon P. Ellingsen, Lucas J. Hyland, Andreas Brunthaler, Karl M. Menten, Xing-Wu Zheng, Alberto Sanna

Abstract: Near the center of our Milky Way is a bar-like structure and the so-called Expanding 3-kpc arms. We currently have limited knowledge of this important region, since we are about 8.2 kpc from the center and cannot directly observe it at optical wavelengths, owing to strong extinction from interstellar dust. Here we present extremely precise VLBI measurements of water maser sources from the BeSSeL S… ▽ More Near the center of our Milky Way is a bar-like structure and the so-called Expanding 3-kpc arms. We currently have limited knowledge of this important region, since we are about 8.2 kpc from the center and cannot directly observe it at optical wavelengths, owing to strong extinction from interstellar dust. Here we present extremely precise VLBI measurements of water maser sources from the BeSSeL Survey, where extinction is not a problem, which accurately determine the 3-dimensional locations and motions of three massive young stars. Combined with previous measurements, these stars delineate a trail of orbits outlining the Milky Way's Galactic Bar. We present the first measurements capturing the dynamics of quasi-elliptical (X1) orbits around the Galactic Bar. Our findings provide evidence substantiating the existence of such orbits populated by massive young stars. Our measurements of the position and velocity of a number of massive young stars, previously identified with the Expanding 3-kpc arms, show that they are more likely located in the X1 orbits about the Galactic Bar. Also, some stars previously assigned to the Norma spiral arm appear to be in these orbits, which suggests that this spiral arm does not extend past the end of the bar. △ Less

Submitted 11 April, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

Journal ref: The Astrophysical Journal, 982:185 (17pp), 2025 April 1

arXiv:2501.02681 [pdf, other]

Quest for quantum advantage: Monte Carlo wave-function simulations of the Coherent Ising Machine

Authors: Manushan Thenabadu, Run Yan Teh, Jia Wang, Simon Kiesewetter, Margaret D Reid, Peter D Drummond

Abstract: The Coherent Ising Machine (CIM) is a quantum network of optical parametric oscillators (OPOs) intended to find ground states of the Ising model. This is an NP-hard problem, related to several important minimization problems, including the max-cut graph problem, and many similar problems. In order to enhance its potential performance, we analyze the coherent coupling strategy for the CIM in a high… ▽ More The Coherent Ising Machine (CIM) is a quantum network of optical parametric oscillators (OPOs) intended to find ground states of the Ising model. This is an NP-hard problem, related to several important minimization problems, including the max-cut graph problem, and many similar problems. In order to enhance its potential performance, we analyze the coherent coupling strategy for the CIM in a highly quantum regime. To explore this limit we employ accurate numerical simulations. Due to the inherent complexity of the system, the maximum network size is limited. While master equation methods can be used, their scalability diminishes rapidly for larger systems. Instead, we use Monte Carlo wave-function methods, which scale as the wave-function dimension, and use large numbers of samples. These simulations involve Hilbert spaces exceeding $10^{7}$ dimensions. To evaluate success probabilities, we use quadrature probabilities. We demonstrate the potential for quantum computational advantage through improved simulation times and success rates in a low-dissipation regime, by using quantum superpositions and time varying couplings to give enhanced quantum effects. △ Less

Submitted 5 January, 2025; originally announced January 2025.

Comments: 10 pages

arXiv:2412.00406 [pdf, other]

Resolving Schrödinger's analysis of the Einstein-Podolsky-Rosen paradox: an incompleteness criterion and weak elements of reality

Authors: C. McGuigan, R. Y. Teh, P. D. Drummond, M. D Reid

Abstract: The Einstein-Podolsky-Rosen (EPR) paradox was presented as an argument that quantum mechanics is an incomplete description of physical reality. However, the premises on which the argument is based are falsifiable by Bell experiments. In this paper, we examine the EPR paradox from the perspective of Schrodinger's reply to EPR. Schrodinger pointed out that the correlated states of the paradox enable… ▽ More The Einstein-Podolsky-Rosen (EPR) paradox was presented as an argument that quantum mechanics is an incomplete description of physical reality. However, the premises on which the argument is based are falsifiable by Bell experiments. In this paper, we examine the EPR paradox from the perspective of Schrodinger's reply to EPR. Schrodinger pointed out that the correlated states of the paradox enable the simultaneous measurement of $\hat{x}$ and $\hat{p}$, one by direct, the other by indirect measurement. Schrodinger's analysis takes on a timely importance because a recent experiment realizes these correlations for macroscopic atomic systems. Different to the original argument, Schrodinger's analysis applies to the experiment at the time when the measurement settings have been fixed. In this context, a subset of local realistic assumptions (not negated by Bell's theorem) implies that $x$ and $p$ are simultaneously precisely defined. Hence, an alternative EPR argument can be presented that quantum mechanics is incomplete, based on a set of (arguably) nonfalsifiable premises. As systems are amplified, macroscopic realism can be invoked, and the premises are referred to as weak macroscopic realism (wMR). In this paper, we propose a realization of Schrodinger's gedanken experiment where field quadrature phase amplitudes $\hat{X}$ and $\hat{P}$ replace position and momentum. Assuming wMR, we derive a criterion for the incompleteness of quantum mechanics, showing that the criterion is feasible for current experiments. Questions raised by Schrodinger are resolved. By performing simulations based on an objective-field ($Q$-based) model for quantum mechanics, we illustrate the emergence on amplification of simultaneous predetermined values for $\hat{X}$ and $\hat{P}$. The values can be regarded as weak elements of reality, along the lines of Bell's macroscopic beables. △ Less

Submitted 30 November, 2024; originally announced December 2024.

arXiv:2411.12456 [pdf, ps, other]

Ichnos: A Carbon Footprint Estimator for Scientific Workflows

Authors: Kathleen West, Magnus Reid, Yehia Elkhatib, Lauritz Thamsen

Abstract: Scientific workflows facilitate the automation of data analysis, and are used to process increasing amounts of data. Therefore, they tend to be resource-intensive and long-running, leading to significant energy consumption and carbon emissions. With ever-increasing emissions from the ICT sector, it is crucial to quantify and understand the carbon footprint of scientific workflows. However, existin… ▽ More Scientific workflows facilitate the automation of data analysis, and are used to process increasing amounts of data. Therefore, they tend to be resource-intensive and long-running, leading to significant energy consumption and carbon emissions. With ever-increasing emissions from the ICT sector, it is crucial to quantify and understand the carbon footprint of scientific workflows. However, existing tooling requires significant effort from users - such as setting up power monitoring before executing workloads, or translating monitored metrics into the carbon footprints post-execution. In this paper, we introduce a system to estimate the carbon footprint of Nextflow scientific workflows that enables post-hoc estimation based on existing workflow traces, power models for computational resources utilised, and carbon intensity data aligned with the execution time. We discuss our automated power modelling approach, and compare it with commonly used estimation methodologies. Furthermore, we exemplify several potential use cases and evaluate our energy consumption estimation approach, finding its estimation error to be between 3.9-10.3%, outperforming both baseline methodologies. △ Less

Submitted 3 June, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

Comments: GitHub Repository: https://github.com/westkath/ichnos

arXiv:2411.11228 [pdf, other]

Validation tests of Gaussian boson samplers with photon-number resolving detectors

Authors: Alexander S. Dellios, Margaret D. Reid, Peter D. Drummond

Abstract: An important challenge with the current generation of noisy, large-scale quantum computers is the question of validation. Does the hardware generate correct answers? If not, what are the errors? This issue is often combined with questions of computational advantage, but it is a fundamentally distinct issue. In current experiments, complete validation of the output statistics is generally not possi… ▽ More An important challenge with the current generation of noisy, large-scale quantum computers is the question of validation. Does the hardware generate correct answers? If not, what are the errors? This issue is often combined with questions of computational advantage, but it is a fundamentally distinct issue. In current experiments, complete validation of the output statistics is generally not possible because it is exponentially hard to do so. Here, we apply phase-space simulation methods to partially verify recent experiments on Gaussian boson sampling (GBS) implementing photon-number resolving (PNR) detectors. The positive-P phase-space distribution is employed, as it uses probabilistic sampling to reduce complexity. It is $10^{18}$ times faster than direct classical simulation for experiments on $288$ modes where quantum computational advantage is claimed. When combined with binning and marginalization to improve statistics, multiple validation tests are efficiently computable, of which some tests can be carried out on experimental data. We show that the data as a whole shows discrepancies with theoretical predictions for perfect squeezing. However, a small modification of the GBS parameters greatly improves agreement. Hence, we suggest that such validation tests could form the basis of feedback methods to improve GBS quantum computer experiments. △ Less

Submitted 17 November, 2024; originally announced November 2024.

Comments: Total of 16 pages, 6 figures and 3 tables

arXiv:2409.20489 [pdf, other]

Online Decision Deferral under Budget Constraints

Authors: Mirabel Reid, Tom Sühr, Claire Vernade, Samira Samadi

Abstract: Machine Learning (ML) models are increasingly used to support or substitute decision making. In applications where skilled experts are a limited resource, it is crucial to reduce their burden and automate decisions when the performance of an ML model is at least of equal quality. However, models are often pre-trained and fixed, while tasks arrive sequentially and their distribution may shift. In t… ▽ More Machine Learning (ML) models are increasingly used to support or substitute decision making. In applications where skilled experts are a limited resource, it is crucial to reduce their burden and automate decisions when the performance of an ML model is at least of equal quality. However, models are often pre-trained and fixed, while tasks arrive sequentially and their distribution may shift. In that case, the respective performance of the decision makers may change, and the deferral algorithm must remain adaptive. We propose a contextual bandit model of this online decision making problem. Our framework includes budget constraints and different types of partial feedback models. Beyond the theoretical guarantees of our algorithm, we propose efficient extensions that achieve remarkable performance on real-world datasets. △ Less

Submitted 30 September, 2024; originally announced September 2024.

Comments: 15 pages, 9 figures

arXiv:2409.16580 [pdf, other]

doi 10.1016/j.optmat.2023.114093

Growth and Spectroscopy of Lanthanide Doped Y$_2$SiO$_5$ Microcrystals for Quantum Information Processing

Authors: Jamin L. B. Martin, Lily F. Williams, Michael F. Reid, Jon-Paul R. Wells

Abstract: Lanthanide-doped Y$_{2}$SiO$_{5}$ microcrystals were prepared using the solution combustion, solid state and sol-gel synthesis techniques. Of these, the sol-gel method yields the most reliable and high-quality X2 phase Y$_{2}$SiO$_{5}$ microcrystals. Absorption and laser site-selective fluorescence measurements of Nd$^{3+}$, Eu$^{3+}$ and Er$^{3+}$ doped material, performed at cryogenic temperatur… ▽ More Lanthanide-doped Y$_{2}$SiO$_{5}$ microcrystals were prepared using the solution combustion, solid state and sol-gel synthesis techniques. Of these, the sol-gel method yields the most reliable and high-quality X2 phase Y$_{2}$SiO$_{5}$ microcrystals. Absorption and laser site-selective fluorescence measurements of Nd$^{3+}$, Eu$^{3+}$ and Er$^{3+}$ doped material, performed at cryogenic temperatures, indicate that the as-grown microcrystals are of high optical quality with inhomogeneously broadened optical linewidths that are comparable to bulk crystals at similar dopant concentrations. △ Less

Submitted 24 September, 2024; originally announced September 2024.

Journal ref: Optical Materials 142, 114093 (2023)

arXiv:2409.15630 [pdf, other]

doi 10.1016/j.omx.2024.100356

Spectroscopy and Crystal-Field Analysis of Low -Symmetry Er$^{3+}$ Centres in K$_2$YF$_5$ Microparticles

Authors: Pratik S. Solanki, Michael F. Reid, Jon-Paul R. Wells

Abstract: K$_2$YF$_5$ crystals doped with lanthanide ions have a variety of possible optical applications. Owing to the low symmetry of the system, the crystal structure cannot be unambiguously determined by x-ray diffraction. However, electron-paramagnetic resonance studies have demonstrated that lanthanide ions substitute for yttrium in sites of C$_{\rm s}$ local symmetry. In this work, we use high-resolu… ▽ More K$_2$YF$_5$ crystals doped with lanthanide ions have a variety of possible optical applications. Owing to the low symmetry of the system, the crystal structure cannot be unambiguously determined by x-ray diffraction. However, electron-paramagnetic resonance studies have demonstrated that lanthanide ions substitute for yttrium in sites of C$_{\rm s}$ local symmetry. In this work, we use high-resolution absorption and laser spectroscopy to determine electronic energy levels for Er$^{3+}$ ions in K$_2$YF$_5$ microparticles. A total of 39 crystal-field energy levels, distributed among 7 multiplets of the Er$^{3+}$ ion, have been assigned. This optical data is used for crystal-field modelling of the electronic structure of Er$^{3+}$ in K$_2$YF$_5$. Our model is fitted not only to the electronic energy levels, but also to the ground-state g-tensor. This magnetic-splitting data defines the axis system of the calculation, avoiding ambiguities associated with low-symmetry crystal-field fits. △ Less

Submitted 25 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

Journal ref: Optical Materials: X 24 , 100356 (2024)

arXiv:2409.15625 [pdf, other]

doi 10.1016/j.jlumin.2024.120705

Laser Site-Selective Spectroscopy and Magnetic Hyperfine Splittings of Ho$^{3+}$ doped Y$_{2}$SiO$_{5}$

Authors: Sagar Mothkuri, Michael F. Reid, Jon-Paul R. Wells, Eloïse Lafitte-Houssat, Alban Ferrier, Philippe Goldner

Abstract: Laser site-selective spectroscopy and high-resolution absorption measurements have been used to determine 51 crystal-field energy levels for one of the Ho$^{3+}$ centres in Y$_{2}$SiO$_{5}$. This centre is denoted as Site 2 and has been tentatively assigned as the seven-fold coordinated centre. High resolution absorption measurements reveal complex hyperfine patterns that obey and approximate sele… ▽ More Laser site-selective spectroscopy and high-resolution absorption measurements have been used to determine 51 crystal-field energy levels for one of the Ho$^{3+}$ centres in Y$_{2}$SiO$_{5}$. This centre is denoted as Site 2 and has been tentatively assigned as the seven-fold coordinated centre. High resolution absorption measurements reveal complex hyperfine patterns that obey and approximate selection rule. The application of a magnetic field along the three optical axes reveals the presence of avoided crossings below 0.5 Tesla, in both the ground and excited states. △ Less

Submitted 25 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

Journal ref: Journal of Luminescence 275, 120705 (2024)

arXiv:2409.15622 [pdf, other]

Spectroscopy, Crystal-Field, and Transition Intensity Analyses of the C$_{\rm 3v}$(O$^{2-}$) Centre in Er$^{3+}$ Doped CaF$_{2}$ Crystals

Authors: M. D. Moull, J. B. L. Martin, T. G. M. Newman, A. L. Jeffery, J. G. Bartholomew, J. -P. R. Wells, M. F. Reid

Abstract: Erbium ions in crystals show considerable promise for the technologies that will form the backbone of future networked quantum information technology. Despite advances in leveraging erbium's fibre-compatible infrared transition for classical and quantum applications, the transitions are, in general, not well understood. We present detailed absorption and laser site-selective spectroscopy of the C… ▽ More Erbium ions in crystals show considerable promise for the technologies that will form the backbone of future networked quantum information technology. Despite advances in leveraging erbium's fibre-compatible infrared transition for classical and quantum applications, the transitions are, in general, not well understood. We present detailed absorption and laser site-selective spectroscopy of the C$_{\rm 3v}$(O$^{2-}$) centre in CaF$_2$:Er$^{3+}$ as an interesting erbium site case study. The $^{4}$I$_{15/2}$Z$_1 \rightarrow {^{4}}$I$_{13/2}$Y$_1$ transition has a low-temperature inhomogeneous linewidth of 1 GHz with hyperfine structure observable from the $^{167}$Er isotope. A parametrized crystal-field Hamiltonian is fitted to 34 energy levels and the two ground state magnetic splitting factors. The wavefunctions are used to perform a transition intensity analysis and electric-dipole parameters are fitted to absorption oscillator strengths. Simulated spectra for the $^{4}$I$_{11/2}\rightarrow {^{4}}$I$_{15/2}$ and $^{4}$I$_{13/2} \rightarrow {^{4}}$I$_{15/2}$ inter-multiplet transitions are in excellent agreement with the experimentally measured spectra. The $^{4}$I$_{13/2}$ excited state lifetime is 25.0\,ms and the intensity calculation is in excellent agreement with this value. △ Less

Submitted 25 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

arXiv:2409.09228 [pdf, other]

Exploring code portability solutions for HEP with a particle tracking test code

Authors: Hammad Ather, Sophie Berkman, Giuseppe Cerati, Matti Kortelainen, Ka Hei Martin Kwok, Steven Lantz, Seyong Lee, Boyana Norris, Michael Reid, Allison Reinsvold Hall, Daniel Riley, Alexei Strelchenko, Cong Wang

Abstract: Traditionally, high energy physics (HEP) experiments have relied on x86 CPUs for the majority of their significant computing needs. As the field looks ahead to the next generation of experiments such as DUNE and the High-Luminosity LHC, the computing demands are expected to increase dramatically. To cope with this increase, it will be necessary to take advantage of all available computing resource… ▽ More Traditionally, high energy physics (HEP) experiments have relied on x86 CPUs for the majority of their significant computing needs. As the field looks ahead to the next generation of experiments such as DUNE and the High-Luminosity LHC, the computing demands are expected to increase dramatically. To cope with this increase, it will be necessary to take advantage of all available computing resources, including GPUs from different vendors. A broad landscape of code portability tools -- including compiler pragma-based approaches, abstraction libraries, and other tools -- allow the same source code to run efficiently on multiple architectures. In this paper, we use a test code taken from a HEP tracking algorithm to compare the performance and experience of implementing different portability solutions. △ Less

Submitted 13 September, 2024; originally announced September 2024.

Report number: FERMILAB-PUB-24-0556-CSAID

arXiv:2408.12655 [pdf, other]

Improving Radiography Machine Learning Workflows via Metadata Management for Training Data Selection

Authors: Mirabel Reid, Christine Sweeney, Oleg Korobkin

Abstract: Most machine learning models require many iterations of hyper-parameter tuning, feature engineering, and debugging to produce effective results. As machine learning models become more complicated, this pipeline becomes more difficult to manage effectively. In the physical sciences, there is an ever-increasing pool of metadata that is generated by the scientific research cycle. Tracking this metada… ▽ More Most machine learning models require many iterations of hyper-parameter tuning, feature engineering, and debugging to produce effective results. As machine learning models become more complicated, this pipeline becomes more difficult to manage effectively. In the physical sciences, there is an ever-increasing pool of metadata that is generated by the scientific research cycle. Tracking this metadata can reduce redundant work, improve reproducibility, and aid in the feature and training dataset engineering process. In this case study, we present a tool for machine learning metadata management in dynamic radiography. We evaluate the efficacy of this tool against the initial research workflow and discuss extensions to general machine learning pipelines in the physical sciences. △ Less

Submitted 22 August, 2024; originally announced August 2024.

Comments: 14 pages, 9 figures

arXiv:2408.00118 [pdf, other]

Gemma 2: Improving Open Language Models at a Practical Size

Authors: Gemma Team, Morgane Riviere, Shreya Pathak, Pier Giuseppe Sessa, Cassidy Hardin, Surya Bhupatiraju, Léonard Hussenot, Thomas Mesnard, Bobak Shahriari, Alexandre Ramé, Johan Ferret, Peter Liu, Pouya Tafti, Abe Friesen, Michelle Casbon, Sabela Ramos, Ravin Kumar, Charline Le Lan, Sammy Jerome, Anton Tsitsulin, Nino Vieillard, Piotr Stanczyk, Sertan Girgin, Nikola Momchev, Matt Hoffman , et al. (173 additional authors not shown)

Abstract: In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We al… ▽ More In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We also train the 2B and 9B models with knowledge distillation (Hinton et al., 2015) instead of next token prediction. The resulting models deliver the best performance for their size, and even offer competitive alternatives to models that are 2-3 times bigger. We release all our models to the community. △ Less

Submitted 2 October, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

arXiv:2407.18231 [pdf, other]

Line Segment Tracking: Improving the Phase 2 CMS High Level Trigger Tracking with a Novel, Hardware-Agnostic Pattern Recognition Algorithm

Authors: Emmanouil Vourliotis, Philip Chang, Peter Elmer, Yanxi Gu, Jonathan Guiang, Vyacheslav Krutelyov, Balaji Venkat Sathia Narayanan, Gavin Niendorf, Michael Reid, Mayra Silva, Andres Rios Tascon, Matevž Tadel, Peter Wittich, Avraham Yagil

Abstract: Charged particle reconstruction is one the most computationally heavy components of the full event reconstruction of Large Hadron Collider (LHC) experiments. Looking to the future, projections for the High Luminosity LHC (HL-LHC) indicate a superlinear growth for required computing resources for single-threaded CPU algorithms that surpass the computing resources that are expected to be available.… ▽ More Charged particle reconstruction is one the most computationally heavy components of the full event reconstruction of Large Hadron Collider (LHC) experiments. Looking to the future, projections for the High Luminosity LHC (HL-LHC) indicate a superlinear growth for required computing resources for single-threaded CPU algorithms that surpass the computing resources that are expected to be available. The combination of these facts creates the need for efficient and computationally performant pattern recognition algorithms that will be able to run in parallel and possibly on other hardware, such as GPUs, given that these become more and more available in LHC experiments and high-performance computing centres. Line Segment Tracking (LST) is a novel such algorithm which has been developed to be fully parallelizable and hardware agnostic. The latter is achieved through the usage of the Alpaka library. The LST algorithm has been tested with the CMS central software as an external package and has been used in the context of the CMS HL-LHC High Level Trigger (HLT). When employing LST for pattern recognition in the HLT tracking, the physics and timing performances are shown to improve with respect to the ones utilizing the current pattern recognition algorithms. The latest results on the usage of the LST algorithm within the CMS HL-LHC HLT are presented, along with prospects for further improvements of the algorithm and its CMS central software integration. △ Less

Submitted 25 July, 2024; originally announced July 2024.

Report number: CMS-CR-2024-141

arXiv:2406.14722 [pdf, other]

Does GPT Really Get It? A Hierarchical Scale to Quantify Human vs AI's Understanding of Algorithms

Authors: Mirabel Reid, Santosh S. Vempala

Abstract: As Large Language Models (LLMs) perform (and sometimes excel at) more and more complex cognitive tasks, a natural question is whether AI really understands. The study of understanding in LLMs is in its infancy, and the community has yet to incorporate well-trodden research in philosophy, psychology, and education. We initiate this, specifically focusing on understanding algorithms, and propose a h… ▽ More As Large Language Models (LLMs) perform (and sometimes excel at) more and more complex cognitive tasks, a natural question is whether AI really understands. The study of understanding in LLMs is in its infancy, and the community has yet to incorporate well-trodden research in philosophy, psychology, and education. We initiate this, specifically focusing on understanding algorithms, and propose a hierarchy of levels of understanding. We use the hierarchy to design and conduct a study with human subjects (undergraduate and graduate students) as well as large language models (generations of GPT), revealing interesting similarities and differences. We expect that our rigorous criteria will be useful to keep track of AI's progress in such cognitive domains. △ Less

Submitted 18 January, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

Comments: 13 pages, 10 figures. To be published at AAAI 2025

ACM Class: I.2.m; F.1.1

arXiv:2405.11439 [pdf, other]

doi 10.3847/1538-3881/ad4030

On the Structure of the Sagittarius Spiral Arm in the Inner Milky Way

Authors: S. B. Bian, Y. W. Wu, Y. Xu, M. J. Reid, J. J. Li, B. Zhang, K. M. Menten, L. Moscadelli, A. Brunthaler

Abstract: We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G04… ▽ More We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G043.89$-$00.78 are $9.9\pm0.5$, $10.2\pm0.6$, $7.6\pm0.5$, and $7.5\pm0.3$ kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. (2023) that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 14 pages, 5 figures, accepted to AJ

Journal ref: 2024 AJ 167:267

arXiv:2403.08295 [pdf, other]

Gemma: Open Models Based on Gemini Research and Technology

Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari , et al. (83 additional authors not shown)

Abstract: This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge… ▽ More This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Gemma outperforms similarly sized open models on 11 out of 18 text-based tasks, and we present comprehensive evaluations of safety and responsibility aspects of the models, alongside a detailed description of model development. We believe the responsible release of LLMs is critical for improving the safety of frontier models, and for enabling the next wave of LLM innovations. △ Less

Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

arXiv:2403.05530 [pdf, other]

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content. △ Less

Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

arXiv:2401.14221 [pdf]

Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels

Authors: Ka Hei Martin Kwok, Matti Kortelainen, Giuseppe Cerati, Alexei Strelchenko, Oliver Gutsche, Allison Reinsvold Hall, Steve Lantz, Michael Reid, Daniel Riley, Sophie Berkman, Seyong Lee, Hammad Ather, Boyana Norris, Cong Wang

Abstract: Next generation High-Energy Physics (HEP) experiments are presented with significant computational challenges, both in terms of data volume and processing power. Using compute accelerators, such as GPUs, is one of the promising ways to provide the necessary computational power to meet the challenge. The current programming models for compute accelerators often involve using architecture-specific p… ▽ More Next generation High-Energy Physics (HEP) experiments are presented with significant computational challenges, both in terms of data volume and processing power. Using compute accelerators, such as GPUs, is one of the promising ways to provide the necessary computational power to meet the challenge. The current programming models for compute accelerators often involve using architecture-specific programming languages promoted by the hardware vendors and hence limit the set of platforms that the code can run on. Developing software with platform restrictions is especially unfeasible for HEP communities as it takes significant effort to convert typical HEP algorithms into ones that are efficient for compute accelerators. Multiple performance portability solutions have recently emerged and provide an alternative path for using compute accelerators, which allow the code to be executed on hardware from different vendors. We apply several portability solutions, such as Kokkos, SYCL, C++17 std::execution::par and Alpaka, on two mini-apps extracted from the mkFit project: p2z and p2r. These apps include basic kernels for a Kalman filter track fit, such as propagation and update of track parameters, for detectors at a fixed z or fixed r position, respectively. The two mini-apps explore different memory layout formats. We report on the development experience with different portability solutions, as well as their performance on GPUs and many-core CPUs, measured as the throughput of the kernels from different GPU and CPU vendors such as NVIDIA, AMD and Intel. △ Less

Submitted 25 January, 2024; originally announced January 2024.

Comments: 26th Intl Conf Computing High Energy & Nuclear Phys (CHEP 2023)

Report number: FERMILAB-CONF-23-535-CMS-CSAID

arXiv:2312.16382 [pdf, ps, other]

What Determines the Boundaries of H2O Maser Emission in an X-ray Illuminated Gas Disk ?

Authors: C. Y. Kuo, F. Gao, J. A. Braatz, D. W. Pesce, E. M. L. Humphreys, M. J. Reid, C. M. V. Impellizzeri, C. Henkel, J. Wagner, C. E. Wu

Abstract: High precision mapping of H2O megamaser emission from active galaxies has revealed more than a dozen Keplerian H2O maser disks, which enable a ~4% uncertainty estimate of the Hubble constant as well as providing accurate masses for the central black holes. These disks often have well-defined inner and outer boundaries of maser emission on sub-parsec scales. In order to better understand the physic… ▽ More High precision mapping of H2O megamaser emission from active galaxies has revealed more than a dozen Keplerian H2O maser disks, which enable a ~4% uncertainty estimate of the Hubble constant as well as providing accurate masses for the central black holes. These disks often have well-defined inner and outer boundaries of maser emission on sub-parsec scales. In order to better understand the physical conditions that determine the inner and outer radii of a maser disk, we examine the distributions of gas density and X-ray heating rate in a warped molecular disk described by a power-law surface density profile. For a suitable choice of the disk mass, we find that the outer radius R_out of the maser disk predicted from our model can match the observed value, with R_out mainly determined by the maximum heating rate or the minimum density for efficient maser action, depending on the combination of the Eddington ratio, black hole mass, and disk mass. Our analysis also indicates that the inner radius for maser action is comparable to the dust sublimation radius, suggesting that dust may play a role in determining the inner radius of a maser disk. Finally, our model predicts that H2O gigamaser disks could exist at the centers of high-z quasars, with disk sizes of >~ 10-30 pc. △ Less

Submitted 10 July, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

Comments: Accepted by MNRAS, 17 pages, 8 figures, 2 tables

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2309.15027 [pdf, other]

On the Distances to the X-ray Binaries Cygnus X-3 and GRS 1915+105

Authors: M. J. Reid, J. C. A. Miller-Jones

Abstract: In this paper we significantly improve estimates of distance to the X-ray binary systems Cyg X-3 and GRS 1915+105. We report a highly accurate trigonometric parallax measurement for Cyg X-3 using the VLBA at 43 GHz, placing the source at a distance of 9.67+0.53-0.48 kpc. We also use Galactic proper motions and line-of-sight radial velocity measurements to determine 3-dimensional (3D) kinematic dis… ▽ More In this paper we significantly improve estimates of distance to the X-ray binary systems Cyg X-3 and GRS 1915+105. We report a highly accurate trigonometric parallax measurement for Cyg X-3 using the VLBA at 43 GHz, placing the source at a distance of 9.67+0.53-0.48 kpc. We also use Galactic proper motions and line-of-sight radial velocity measurements to determine 3-dimensional (3D) kinematic distances to both systems, under the assumption that they have low peculiar velocities. This yields distances of 8.95+-0.96 kpc for Cyg X-3 and 9.4+-0.6 (statistical)+-0.8 (systematic) for GRS 1915+105. The good agreement between parallax and 3D kinematic distances validates the assumption of low peculiar velocities, and hence small natal kicks, for both of the systems. For a source with a low peculiar velocity, given its parallax distance, Cyg X-3 should have a Vlsr near -64+-5 km/s. Our measurements imply a slightly higher inclination angle, and hence lower black hole mass for GRS 1915+105 than found from previous work by Reid et al (2014) and strengthen arguments from X-ray polarization that Cyg X-3 would be an ultraluminous X-ray source if viewed face-on. △ Less

Submitted 26 September, 2023; originally announced September 2023.

Comments: 14 pages, 7 figures, 1 table

arXiv:2308.03147 [pdf, other]

An Updated Reference Frame for the Galactic Inner Parsec

Authors: Jeremy Darling, Jennie Paine, Mark J. Reid, Karl M. Menten, Shoko Sakai, Andrea Ghez

Abstract: Infrared observations of stellar orbits about Sgr A* probe the mass distribution in the inner parsec of the Galaxy and provide definitive evidence for the existence of a massive black hole. However, the infrared astrometry is relative and is tied to the radio emission from Sgr A* using stellar SiO masers that coincide with infrared-bright stars. To support and improve this two-step astrometry, we… ▽ More Infrared observations of stellar orbits about Sgr A* probe the mass distribution in the inner parsec of the Galaxy and provide definitive evidence for the existence of a massive black hole. However, the infrared astrometry is relative and is tied to the radio emission from Sgr A* using stellar SiO masers that coincide with infrared-bright stars. To support and improve this two-step astrometry, we present new astrometric observations of 15 stellar SiO masers within 2 pc of Sgr A*. Combined with legacy observations spanning 25.8 years, we re-analyze the relative offsets of these masers from Sgr A* and measure positions and proper motions that are significantly improved compared to the previously published reference frame. Maser positions are corrected for epoch-specific differential aberration, precession, nutation, and solar gravitational deflection. Omitting the supergiant IRS 7, the mean position uncertainties are 0.46 mas and 0.84 mas in RA and Dec., and the mean proper motion uncertainties are 0.07 mas yr$^{-1}$ and 0.12 mas yr$^{-1}$, respectively. At a distance of 8.2 kpc, these correspond to position uncertainties of 3.7 AU and 6.9 AU and proper motion uncertainties of 2.7 km s$^{-1}$ and 4.6 km s$^{-1}$. The reference frame stability, the uncertainty in the variance-weighted mean proper motion of the maser ensemble, is 8 $μ$as yr$^{-1}$ (0.30 km s$^{-1}$) in RA and 11 $μ$as yr$^{-1}$ (0.44 km s$^{-1}$) in Dec., which represents a 2.3-fold improvement over previous work and a new benchmark for the maser-based reference frame. △ Less

Submitted 6 August, 2023; originally announced August 2023.

Comments: 18 pages, 7 figures, 3 tables. Accepted by ApJ

arXiv:2308.00908 [pdf, other]

Simulating Gaussian boson sampling quantum computers

Authors: Alexander S. Dellios, Margaret D. Reid, Peter D. Drummond

Abstract: A growing cohort of experimental linear photonic networks implementing Gaussian boson sampling (GBS) have now claimed quantum advantage. However, many open questions remain on how to effectively verify these experimental results, as scalable methods are needed that fully capture the rich array of quantum correlations generated by these photonic quantum computers. In this paper, we briefly review r… ▽ More A growing cohort of experimental linear photonic networks implementing Gaussian boson sampling (GBS) have now claimed quantum advantage. However, many open questions remain on how to effectively verify these experimental results, as scalable methods are needed that fully capture the rich array of quantum correlations generated by these photonic quantum computers. In this paper, we briefly review recent theoretical methods to simulate experimental GBS networks. We focus mostly on methods that use phase-space representations of quantum mechanics, as these methods are highly scalable and can be used to validate experimental outputs and claims of quantum advantage for a variety of input states, ranging from the ideal pure squeezed vacuum state to more realistic thermalized squeezed states. A brief overview of the theory of GBS, recent experiments and other types of methods are also presented. Although this is not an exhaustive review, we aim to provide a brief introduction to phase-space methods applied to linear photonic networks to encourage further theoretical investigations. △ Less

Submitted 1 August, 2023; originally announced August 2023.

Comments: A brief topical review on GBS simulation methods and verification techniques

arXiv:2308.00379 [pdf, other]

doi 10.3390/e25121620

A macroscopic quantum three-box paradox: finding consistency with weak macroscopic realism

Authors: C. Hatharasinghe, M. Thenabadu, P. D. Drummond, M. D. Reid

Abstract: The quantum three-box paradox considers a ball prepared in a superposition of being in one of three Boxes. Bob makes measurements by opening either Box 1 or Box 2. After performing some unitary operations (shuffling), Alice can infer with certainty that the ball was detected by Bob, regardless of which box he opened, if she detects the ball after opening Box 3. The paradox is that the ball would h… ▽ More The quantum three-box paradox considers a ball prepared in a superposition of being in one of three Boxes. Bob makes measurements by opening either Box 1 or Box 2. After performing some unitary operations (shuffling), Alice can infer with certainty that the ball was detected by Bob, regardless of which box he opened, if she detects the ball after opening Box 3. The paradox is that the ball would have been found with certainty in either box, if that box had been opened. Resolutions of the paradox include that Bob's measurement cannot be made non-invasively, or else that realism cannot be assumed at the quantum level. Here, we strengthen the case for the former argument, by constructing macroscopic versions of the paradox. Macroscopic realism implies that the ball is in one of the boxes, prior to Bob or Alice opening any boxes. We demonstrate consistency of the paradox with macroscopic realism, if carefully defined (as weak macroscopic realism, wMR) to apply to the system at the times prior to Alice or Bob opening any Boxes, but after the unitary operations associated with preparation or shuffling. By solving for the dynamics of the unitary operations, and comparing with mixed states, we demonstrate agreement between the predictions of wMR and quantum mechanics: The paradox only manifests if Alice's shuffling combines both local operations (on Box 3) and nonlocal operations, on the other Boxes. Following previous work, the macroscopic paradox is shown to correspond to a violation of a Leggett-Garg inequality, which implies non-invasive measurability, if wMR holds. △ Less

Submitted 1 August, 2023; originally announced August 2023.

Journal ref: Entropy 25(12), 1620 (2023)

arXiv:2305.14857 [pdf, other]

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

Authors: Akari Asai, Sneha Kudugunta, Xinyan Velocity Yu, Terra Blevins, Hila Gonen, Machel Reid, Yulia Tsvetkov, Sebastian Ruder, Hannaneh Hajishirzi

Abstract: Despite remarkable advancements in few-shot generalization in natural language processing, most models are developed and evaluated primarily in English. To facilitate research on few-shot cross-lingual transfer, we introduce a new benchmark, called BUFFET, which unifies 15 diverse tasks across 54 languages in a sequence-to-sequence format and provides a fixed set of few-shot examples and instructi… ▽ More Despite remarkable advancements in few-shot generalization in natural language processing, most models are developed and evaluated primarily in English. To facilitate research on few-shot cross-lingual transfer, we introduce a new benchmark, called BUFFET, which unifies 15 diverse tasks across 54 languages in a sequence-to-sequence format and provides a fixed set of few-shot examples and instructions. BUFFET is designed to establish a rigorous and equitable evaluation framework for few-shot cross-lingual transfer across a broad range of tasks and languages. Using BUFFET, we perform thorough evaluations of state-of-the-art multilingual large language models with different transfer methods, namely in-context learning and fine-tuning. Our findings reveal significant room for improvement in few-shot in-context cross-lingual transfer. In particular, ChatGPT with in-context learning often performs worse than much smaller mT5-base models fine-tuned on English task data and few-shot in-language examples. Our analysis suggests various avenues for future research in few-shot cross-lingual transfer, such as improved pretraining, understanding, and future evaluations. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: The data and code is available at https://buffetfs.github.io/

arXiv:2305.14224 [pdf, other]

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations

Authors: Jonas Pfeiffer, Francesco Piccinno, Massimo Nicosia, Xinyi Wang, Machel Reid, Sebastian Ruder

Abstract: Multilingual sequence-to-sequence models perform poorly with increased language coverage and fail to consistently generate text in the correct target language in few-shot settings. To address these challenges, we propose mmT5, a modular multilingual sequence-to-sequence model. mmT5 utilizes language-specific modules during pre-training, which disentangle language-specific information from language… ▽ More Multilingual sequence-to-sequence models perform poorly with increased language coverage and fail to consistently generate text in the correct target language in few-shot settings. To address these challenges, we propose mmT5, a modular multilingual sequence-to-sequence model. mmT5 utilizes language-specific modules during pre-training, which disentangle language-specific information from language-agnostic information. We identify representation drift during fine-tuning as a key limitation of modular generative models and develop strategies that enable effective zero-shot transfer. Our model outperforms mT5 at the same parameter sizes by a large margin on representative natural language understanding and generation tasks in 40+ languages. Compared to mT5, mmT5 raises the rate of generating text in the correct language under zero-shot settings from 7% to 99%, thereby greatly alleviating the source language hallucination problem. △ Less

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2304.05853 [pdf, other]

Speeding up the CMS track reconstruction with a parallelized and vectorized Kalman-filter-based algorithm during the LHC Run 3

Authors: Sophie Berkman, Giuseppe Cerati, Peter Elmer, Patrick Gartung, Leonardo Giannini, Brian Gravelle, Allison R. Hall, Matti Kortelainen, Vyacheslav Krutelyov, Steve R. Lantz, Mario Masciovecchio, Kevin McDermott, Boyana Norris, Michael Reid, Daniel S. Riley, Matevž Tadel, Emmanouil Vourliotis, Bei Wang, Peter Wittich, Avraham Yagil

Abstract: One of the most challenging computational problems in the Run 3 of the Large Hadron Collider (LHC) and more so in the High-Luminosity LHC (HL-LHC) is expected to be finding and fitting charged-particle tracks during event reconstruction. The methods used so far at the LHC and in particular at the CMS experiment are based on the Kalman filter technique. Such methods have shown to be robust and to p… ▽ More One of the most challenging computational problems in the Run 3 of the Large Hadron Collider (LHC) and more so in the High-Luminosity LHC (HL-LHC) is expected to be finding and fitting charged-particle tracks during event reconstruction. The methods used so far at the LHC and in particular at the CMS experiment are based on the Kalman filter technique. Such methods have shown to be robust and to provide good physics performance, both in the trigger and offline. In order to improve computational performance, we explored Kalman-filter-based methods for track finding and fitting, adapted for many-core SIMD architectures. This adapted Kalman-filter-based software, called "mkFit", was shown to provide a significant speedup compared to the traditional algorithm, thanks to its parallelized and vectorized implementation. The mkFit software was recently integrated into the offline CMS software framework, in view of its exploitation during the Run 3 of the LHC. At the start of the LHC Run 3, mkFit will be used for track finding in a subset of the CMS offline track reconstruction iterations, allowing for significant improvements over the existing framework in terms of computational performance, while retaining comparable physics performance. The performance of the CMS track reconstruction using mkFit at the start of the LHC Run 3 is presented, together with prospects of further improvement in the upcoming years of data taking. △ Less

Submitted 12 April, 2023; originally announced April 2023.

Comments: Contribution to the ACAT 2022

arXiv:2303.17502 [pdf]

Synchrotron Science for Sustainability: Life Cycle of Metals in the Environment

Authors: Louisa Smieska, Mary Lou Guerinot, Karin Olson Hoal, Matthew Reid, Olena Vatamaniuk

Abstract: The movement of metals through the environment links together a wide range of scientific fields: from earth sciences and geology as weathering releases minerals; to environmental sciences as metals are mobilized and transformed, cycling through soil and water; to biology as living things take up metals from their surroundings. Studies of these fundamental processes all require quantitative analysi… ▽ More The movement of metals through the environment links together a wide range of scientific fields: from earth sciences and geology as weathering releases minerals; to environmental sciences as metals are mobilized and transformed, cycling through soil and water; to biology as living things take up metals from their surroundings. Studies of these fundamental processes all require quantitative analysis of metal concentrations, locations, and chemical states. Synchrotron x-ray tools can address these requirements with high sensitivity, high spatial resolution, and minimal sample preparation. This perspective describes the state of fundamental scientific questions in the lifecycle of metals, from rocks to ecosystems, from soils to plants, and from environment to animals. Key x-ray capabilities and facility infrastructure for future synchrotron-based analytical resources serving these areas are summarized, and potential opportunities for future experiments are explored. △ Less

Submitted 30 March, 2023; originally announced March 2023.

Comments: 25 pages with references, 8 figures, submitted to Metallomics

arXiv:2303.09129 [pdf, other]

doi 10.3847/1538-4357/acc52a

The parallax and 3D kinematics of water masers in the massive star-forming region G034.43+0.24

Authors: Xiaofeng Mai, Bo Zhang, M. J. Reid, L. Moscadelli, Shuangjing Xu, Yan Sun, Jingdong Zhang, Wen Chen, Shiming Wen, Qiuyi Luo, Karl M. Menten, Xingwu Zheng, Andreas Brunthaler, Ye Xu, Guangli Wang

Abstract: We report a trigonometric parallax measurement of 22 GHz water masers in the massive star-forming region G034.43+0.24 as part of the Bar and Spiral Structure Legacy (BeSSeL) Survey using the Very Long Baseline Array. The parallax is 0.330$\pm$50.018 mas, corresponding to a distance of $3.03^{+0.17}_{-0.16}$ kpc. This locates G034.43+0.24 near the inner edge of the Sagittarius spiral arm and at one… ▽ More We report a trigonometric parallax measurement of 22 GHz water masers in the massive star-forming region G034.43+0.24 as part of the Bar and Spiral Structure Legacy (BeSSeL) Survey using the Very Long Baseline Array. The parallax is 0.330$\pm$50.018 mas, corresponding to a distance of $3.03^{+0.17}_{-0.16}$ kpc. This locates G034.43+0.24 near the inner edge of the Sagittarius spiral arm and at one end of a linear distribution of massive young stars which cross nearly the full width of the arm. The measured 3-dimensional motion of G034.43+0.24 indicates a near-circular Galactic orbit. The water masers display arc-like distributions, possibly bow shocks, associated with winds from one or more massive young stars. △ Less

Submitted 16 March, 2023; originally announced March 2023.

arXiv:2303.04448 [pdf, other]

The Quantum and Stochastic Toolbox: xSPDE4.2

Authors: Peter D. Drummond, Run Yan Teh, Manushan Thenabadu, Channa Hatharasinghe, Chris McGuigan, Alex Dellios, Ned Goodman, Margaret D. Reid

Abstract: This is the fourth major release of the xSPDE toolbox, which solves stochastic partial and ordinary differential equations, with applications in biology, chemistry, engineering, medicine, physics and quantum technologies. It computes statistical averages, including time-step and sampling error estimation. xSPDE can provide higher order convergence, Fourier spectra and probability densities. The to… ▽ More This is the fourth major release of the xSPDE toolbox, which solves stochastic partial and ordinary differential equations, with applications in biology, chemistry, engineering, medicine, physics and quantum technologies. It computes statistical averages, including time-step and sampling error estimation. xSPDE can provide higher order convergence, Fourier spectra and probability densities. The toolbox has graphical output and $χ^{2}$ statistics, as well as weighted, projected, or forward-backward equations. It can generate input-output quantum spectra. The equations can have independent periodic, Dirichlet, and Neumann or Robin boundary conditions in any dimension, for any vector component, and at either end of any interval. xSPDE has functions that can numerically solve both ordinary and partial differential stochastic equations of any type, obtaining correlations, probabilities and averages. The toolbox has a core treating stochastic differential equations, with averages, probability distributions and full error estimates. There are stochastic extensions treating applications to partial differential equations, projected equations, quantum stochastic equations, master equations and quantum phase-space simulations including Gaussian boson sampling experiments. △ Less

Submitted 26 December, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

Comments: Fourth major release of the user manual for xSPDE software on Github, at https://github.com/peterddrummond/xspde_matlab

arXiv:2303.02373 [pdf, other]

Hidden causal loops, macroscopic realism and Einstein-Podolsky-Rosen-Bell nonlocality: forward-backward stochastic simulations

Authors: M. D. Reid, P. D. Drummond

Abstract: We analyze quantum measurement and entanglement by solving the dynamics of stochastic amplitudes that propagate both forward and backward in time. The model allows simulation of Einstein-Podolsky-Rosen and Bell correlations, and reveals consistency with a weak form of local realism defined after the unitary interactions determining the measurement settings. Bell violations emerge due to a breakdow… ▽ More We analyze quantum measurement and entanglement by solving the dynamics of stochastic amplitudes that propagate both forward and backward in time. The model allows simulation of Einstein-Podolsky-Rosen and Bell correlations, and reveals consistency with a weak form of local realism defined after the unitary interactions determining the measurement settings. Bell violations emerge due to a breakdown of a subset of Bell's local-realism conditions. Our results elucidate how hidden causal loops can explain Bell nonlocality, without requiring retrocausality at a macroscopic level. △ Less

Submitted 12 December, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

Comments: 5 pages, 4 figures

arXiv:2302.03676 [pdf, other]

doi 10.3847/1538-4365/acdc9f

Open data from the third observing run of LIGO, Virgo, KAGRA and GEO

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1719 additional authors not shown)

Abstract: The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti… ▽ More The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages. △ Less

Submitted 7 February, 2023; originally announced February 2023.

Comments: 27 pages, 3 figures

Report number: LIGO-P2200316

arXiv:2301.04756 [pdf, ps, other]

Techniques for Measuring Parallax and Proper Motion with VLBI

Authors: Mark J. Reid

Abstract: Astrometry at centimeter wavelengths using Very Long Baseline Interferometry is approaching accuracies of ~1 uas for the angle between a target and a calibrator source separated by <1 degree on the sky. The BeSSeL Survey and the Japanese VERA project are using this to map the spiral structure of the Milky Way by measuring trigonometric parallaxes of hundreds of maser sources associated with massiv… ▽ More Astrometry at centimeter wavelengths using Very Long Baseline Interferometry is approaching accuracies of ~1 uas for the angle between a target and a calibrator source separated by <1 degree on the sky. The BeSSeL Survey and the Japanese VERA project are using this to map the spiral structure of the Milky Way by measuring trigonometric parallaxes of hundreds of maser sources associated with massive, young stars. This paper outlines how micro-arcsecond astrometry is done, including details regarding the scheduling of observations, calibration of data, and measuring positions. △ Less

Submitted 11 January, 2023; originally announced January 2023.

Comments: 18 pages; 5 figures

arXiv:2212.10173 [pdf, other]

On the Role of Parallel Data in Cross-lingual Transfer Learning

Authors: Machel Reid, Mikel Artetxe

Abstract: While prior work has established that the use of parallel data is conducive for cross-lingual learning, it is unclear if the improvements come from the data itself, or if it is the modeling of parallel interactions that matters. Exploring this, we examine the usage of unsupervised machine translation to generate synthetic parallel data, and compare it to supervised machine translation and gold par… ▽ More While prior work has established that the use of parallel data is conducive for cross-lingual learning, it is unclear if the improvements come from the data itself, or if it is the modeling of parallel interactions that matters. Exploring this, we examine the usage of unsupervised machine translation to generate synthetic parallel data, and compare it to supervised machine translation and gold parallel data. We find that even model generated parallel data can be useful for downstream tasks, in both a general setting (continued pretraining) as well as the task-specific setting (translate-train), although our best results are still obtained using real parallel data. Our findings suggest that existing multilingual models do not exploit the full potential of monolingual data, and prompt the community to reconsider the traditional categorization of cross-lingual learning approaches. △ Less

Submitted 20 December, 2022; originally announced December 2022.

Comments: Preprint

arXiv:2212.03555 [pdf, other]

doi 10.3847/1538-4357/acdbc5

Inverse MultiView II: Microarcsecond Trigonometric Parallaxes for Southern Hemisphere 6.7~GHz Methanol Masers G232.62+00.99 and G323.74$-$00.26

Authors: Lucas J. Hyland, Mark J. Reid, Gabor Orosz, Simon P. Ellingsen, Stuart D. Weston, Jayendar Kumar, Richard Dodson, Maria J. Rioja, Warren J. Hankey, Patrick M. Yates-Jones, Tim Natusch, Sergei Gulyaev, Karl M. Menten, Andreas Brunthaler

Abstract: We present the first results from the Southern Hemisphere Parallax Interferometric Radio Astrometry Legacy Survey (\spirals): $10μ$as-accurate parallaxes and proper motions for two southern hemisphere 6.7 GHz methanol masers obtained using the inverse MultiView calibration method. Using an array of radio telescopes in Australia and New Zealand, we measured the trigonometric parallax and proper mot… ▽ More We present the first results from the Southern Hemisphere Parallax Interferometric Radio Astrometry Legacy Survey (\spirals): $10μ$as-accurate parallaxes and proper motions for two southern hemisphere 6.7 GHz methanol masers obtained using the inverse MultiView calibration method. Using an array of radio telescopes in Australia and New Zealand, we measured the trigonometric parallax and proper motions for the masers associated with the star formation region G232.62+00.99 of $π= 0.610\pm0.011$~mas, $μ_x=-2.266\pm0.021$~mas~y$^{-1}$ and $μ_y=2.249\pm0.049$~mas~y$^{-1}$, which implies its distance to be $d=1.637\pm0.029$~kpc. These measurements represent an improvement in accuracy by more than a factor of 3 over the previous measurements obtained through Very Long Baseline Array observations of the 12~GHz methanol masers associated with this region. We also measure the trigonometric parallax and proper motion for G323.74--00.26 as $π= 0.364\pm0.009$~mas, $μ_x=-3.239\pm0.025$~mas~y$^{-1}$ and $μ_y=-3.976\pm0.039$~mas~y$^{-1}$, which implies a distance of $d=2.747\pm0.068$~kpc. These are the most accurate measurements of trigonometric parallax obtained for 6.7~GHz class II methanol masers to date. We confirm that G232.62+00.99 is in the Local arm and find that G323.74--00.26 is in the Scutum-Centaurus arm. We also investigate the structure and internal dynamics of both masers. △ Less

Submitted 16 May, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

Comments: 13 pages, 9 figures, 3 tables. Accepted for publication in ApJ

arXiv:2211.03480 [pdf, other]

Validation tests of GBS quantum computers give evidence for quantum advantage with a decoherent target

Authors: Alexander S. Dellios, Bogdan Opanchuk, Margaret D. Reid, Peter D. Drummond

Abstract: Computational validation is vital for all large-scale quantum computers. One needs computers that are both fast and accurate. Here we apply precise, scalable, high order statistical tests to data from large Gaussian boson sampling (GBS) quantum computers that claim quantum computational advantage. These tests can be used to validate the output results for such technologies. Our method allows inves… ▽ More Computational validation is vital for all large-scale quantum computers. One needs computers that are both fast and accurate. Here we apply precise, scalable, high order statistical tests to data from large Gaussian boson sampling (GBS) quantum computers that claim quantum computational advantage. These tests can be used to validate the output results for such technologies. Our method allows investigation of accuracy as well as quantum advantage. Such issues have not been investigated in detail before. Our highly scalable technique is also applicable to other applications of linear bosonic networks. We utilize positive-P phase-space simulations of grouped count probabilities (GCP) as a fingerprint for verifying multi-mode data. This is exponentially more efficient than other phase-space methods, due to much lower sampling errors. We randomly generate tests from exponentially many high-order, grouped count tests. Each of these can be efficiently measured and simulated, providing a quantum verification method that is hard to replicate classically. We give a detailed comparison of theory with a 144-channel GBS experiment, including grouped correlations up to the largest order measured. We show how one can disprove faked data, and apply this to a classical count algorithm. There are multiple distance measures for evaluating the fidelity and computational complexity of a distribution. We compute these and explain them. The best fit to the data is a partly thermalized Gaussian model, which is neither the ideal case, nor the model that gives classically computable counts. Even with this model, discrepancies of $Z>100$ were observed from some $χ^2$ tests, indicating likely parameter estimation errors. Total count distributions were much closer to a thermalized quantum model than the classical model, giving evidence consistent with quantum computational advantage for a modified target problem. △ Less

Submitted 1 August, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: Added titles to references, changed main paper title slightly

arXiv:2211.02877 [pdf, other]

Wigner's Friend paradoxes: consistency with weak-contextual and weak-macroscopic realism models

Authors: Ria Joseph, Manushan Thenabadu, Channa Hatharasinghe, Jesse Fulton, Run-Yan Teh, P. D. Drummond, M. D. Reid

Abstract: Wigner's friend paradoxes highlight contradictions between measurements made by Friends inside a laboratory and superobservers outside a laboratory, who have access to an entangled state of the measurement apparatus. The contradictions lead to no-go theorems for observer-independent facts, thus challenging concepts of objectivity. Here, we examine the paradoxes from the perspective of establishing… ▽ More Wigner's friend paradoxes highlight contradictions between measurements made by Friends inside a laboratory and superobservers outside a laboratory, who have access to an entangled state of the measurement apparatus. The contradictions lead to no-go theorems for observer-independent facts, thus challenging concepts of objectivity. Here, we examine the paradoxes from the perspective of establishing consistency with macroscopic realism. We present versions of the Brukner-Wigner-friend and Frauchiger-Renner paradoxes in which the spin-$1/2$ system measured by the Friends corresponds to two macroscopically distinct states. The local unitary operations $U_θ$ that determine the measurement setting $θ$ are carried out using nonlinear interactions, thereby ensuring measurements need only distinguish between the macroscopically distinct states. The macroscopic paradoxes are perplexing, seemingly suggesting there is no objectivity in a macroscopic limit. However, we demonstrate consistency with a contextual weak form of macroscopic realism (wMR): The premise wMR asserts that the system can be considered to have a definite spin outcome $λ_θ$, at the time after the system has undergone the unitary rotation $U_θ$ to prepare it in a suitable pointer basis. We further show that the paradoxical outcomes imply failure of deterministic macroscopic local realism, and arise when there are unitary interactions $U_θ$ occurring due to a change of measurement setting at both sites, with respect to the state prepared by each Friend. In models which validate wMR, there is a breakdown of a subset of the assumptions that constitute the Bell-Locality premise. A similar interpretation involving a weak contextual form of realism exists for the original paradoxes. △ Less

Submitted 5 November, 2022; originally announced November 2022.

Journal ref: Phys. Rev. A 110, 022219 (2024)

arXiv:2210.16886 [pdf, other]

DiffusER: Discrete Diffusion via Edit-based Reconstruction

Authors: Machel Reid, Vincent J. Hellendoorn, Graham Neubig

Abstract: In text generation, models that generate text from scratch one token at a time are currently the dominant paradigm. Despite being performant, these models lack the ability to revise existing text, which limits their usability in many practical scenarios. We look to address this, with DiffusER (Diffusion via Edit-based Reconstruction), a new edit-based generative model for text based on denoising d… ▽ More In text generation, models that generate text from scratch one token at a time are currently the dominant paradigm. Despite being performant, these models lack the ability to revise existing text, which limits their usability in many practical scenarios. We look to address this, with DiffusER (Diffusion via Edit-based Reconstruction), a new edit-based generative model for text based on denoising diffusion models -- a class of models that use a Markov chain of denoising steps to incrementally generate data. DiffusER is not only a strong generative model in general, rivalling autoregressive models on several tasks spanning machine translation, summarization, and style transfer; it can also perform other varieties of generation that standard autoregressive models are not well-suited for. For instance, we demonstrate that DiffusER makes it possible for a user to condition generation on a prototype, or an incomplete sequence, and continue revising based on previous edit steps. △ Less

Submitted 30 October, 2022; originally announced October 2022.

Comments: Preprint. Work in progress

arXiv:2210.07370 [pdf, other]

M2D2: A Massively Multi-domain Language Modeling Dataset

Authors: Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer

Abstract: We present M2D2, a fine-grained, massively multi-domain corpus for studying domain adaptation in language models (LMs). M2D2 consists of 8.5B tokens and spans 145 domains extracted from Wikipedia and Semantic Scholar. Using ontologies derived from Wikipedia and ArXiv categories, we organize the domains in each data source into 22 groups. This two-level hierarchy enables the study of relationships… ▽ More We present M2D2, a fine-grained, massively multi-domain corpus for studying domain adaptation in language models (LMs). M2D2 consists of 8.5B tokens and spans 145 domains extracted from Wikipedia and Semantic Scholar. Using ontologies derived from Wikipedia and ArXiv categories, we organize the domains in each data source into 22 groups. This two-level hierarchy enables the study of relationships between domains and their effects on in- and out-of-domain performance after adaptation. We also present a number of insights into the nature of effective domain adaptation in LMs, as examples of the new types of studies M2D2 enables. To improve in-domain performance, we show the benefits of adapting the LM along a domain hierarchy; adapting to smaller amounts of fine-grained domain-specific data can lead to larger in-domain performance gains than larger amounts of weakly relevant data. We further demonstrate a trade-off between in-domain specialization and out-of-domain generalization within and across ontologies, as well as a strong correlation between out-of-domain performance and lexical overlap between domains. △ Less

Submitted 13 October, 2022; originally announced October 2022.

Comments: EMNLP 2022

arXiv:2210.03390 [pdf, other]

doi 10.3847/1538-4357/ac98b9

A Milliarcsecond-accurate Position for Sagittarius A*

Authors: Shuangjing Xu, Bo Zhang, Mark J. Reid, Xingwu Zheng, Guangli Wang, Taehyun Jung

Abstract: The absolute position of Sgr A*, the compact radio source at the center of the Milky Way, had been uncertain by several tens of milliarcseconds. Here we report improved astrometric measurements of the absolute position and proper motion of Sgr A*. Three epochs of phase-referencing observations were conducted with the Very Long Baseline Array for Sgr A* at 22 and 43 GHz in 2019 and 2020. Using extr… ▽ More The absolute position of Sgr A*, the compact radio source at the center of the Milky Way, had been uncertain by several tens of milliarcseconds. Here we report improved astrometric measurements of the absolute position and proper motion of Sgr A*. Three epochs of phase-referencing observations were conducted with the Very Long Baseline Array for Sgr A* at 22 and 43 GHz in 2019 and 2020. Using extragalactic radio sources with submilliarcsecond-accurate positions as reference, we determined the absolute position of Sgr A* at a reference epoch 2020.0 to be at $α$(J2000) = $17^{\rm h} 45^{\rm m}40.^{\rm s}032863~\pm~0.^{\rm s}000016$ and $δ$(J2000) = $-29^{\circ} 00^{\prime} 28.^{''}24260~\pm~0.^{''}00047$, with an updated proper motion $-3.152~\pm~0.011$ and $-5.586~\pm~0.006$ mas yr$^{-1}$ in the easterly and northerly directions, respectively. △ Less

Submitted 22 November, 2022; v1 submitted 7 October, 2022; originally announced October 2022.

Comments: 9 pages, 2 figures, accepted by ApJ

arXiv:2209.13711 [pdf, other]

doi 10.1088/1742-6596/2375/1/012005

Segment Linking: A Highly Parallelizable Track Reconstruction Algorithm for HL-LHC

Authors: Philip Chang, Peter Elmer, Yanxi Gu, Vyacheslav Krutelyov, Gavin Niendorf, Michael Reid, Balaji Venkat Sathia Narayanan, Matevž Tadel, Emmanouil Vourliotis, Bei Wang, Peter Wittich, Avraham Yagil

Abstract: The High Luminosity upgrade of the Large Hadron Collider (HL-LHC) will produce particle collisions with up to 200 simultaneous proton-proton interactions. These unprecedented conditions will create a combinatorial complexity for charged-particle track reconstruction that demands a computational cost that is expected to surpass the projected computing budget using conventional CPUs. Motivated by th… ▽ More The High Luminosity upgrade of the Large Hadron Collider (HL-LHC) will produce particle collisions with up to 200 simultaneous proton-proton interactions. These unprecedented conditions will create a combinatorial complexity for charged-particle track reconstruction that demands a computational cost that is expected to surpass the projected computing budget using conventional CPUs. Motivated by this and taking into account the prevalence of heterogeneous computing in cutting-edge High Performance Computing centers, we propose an efficient, fast and highly parallelizable bottom-up approach to track reconstruction for the HL-LHC, along with an associated implementation on GPUs, in the context of the Phase 2 CMS outer tracker. Our algorithm, called Segment Linking (or Line Segment Tracking), takes advantage of localized track stub creation, combining individual stubs to progressively form higher level objects that are subject to kinematical and geometrical requirements compatible with genuine physics tracks. The local nature of the algorithm makes it ideal for parallelization under the Single Instruction, Multiple Data paradigm, as hundreds of objects can be built simultaneously. The computing and physics performance of the algorithm has been tested on an NVIDIA Tesla V100 GPU, already yielding efficiency and timing measurements that are on par with the latest, multi-CPU versions of existing CMS tracking algorithms. △ Less

Submitted 27 September, 2022; originally announced September 2022.

Comments: Contribution to the HEP 2022 - 39th Conference on Recent Developments in High Energy Physics and Cosmology, 15-18 June 2022, Thessaloniki, Greece

Journal ref: 2022 J. Phys.: Conf. Ser. 2375 012005

arXiv:2208.01225 [pdf, other]

An Einstein-Podolsky-Rosen argument based on weak forms of local realism not falsifiable by GHZ or Bell experiments

Authors: Jesse Fulton, Run Yan Teh, M. D. Reid

Abstract: The Einstein-Podolsky-Rosen (EPR) paradox gives an argument for the incompleteness of quantum mechanics based on the premises of local realism. A general view is that the argument is compromised, because EPR's premises are falsified by Greenberger-Horne-Zeilinger (GHZ) and Bell experiments. In this paper, we present an EPR argument based on premises not falsifiable by these experiments. We propose… ▽ More The Einstein-Podolsky-Rosen (EPR) paradox gives an argument for the incompleteness of quantum mechanics based on the premises of local realism. A general view is that the argument is compromised, because EPR's premises are falsified by Greenberger-Horne-Zeilinger (GHZ) and Bell experiments. In this paper, we present an EPR argument based on premises not falsifiable by these experiments. We propose macroscopic EPR and GHZ experiments using spins $S_θ$ defined by two macroscopically distinct states. The analyzers that realize the unitary operations $U_θ$ determining the measurement settings $θ$ are devices that create macroscopic superposition states. For a system with two macroscopically distinct states available, macroscopic realism (MR) posits a predetermined outcome for a measurement $S_θ$ distinguishing between the states. Deterministic macroscopic realism (dMR) posits MR for the system prior to the interaction $U_θ$. Weak macroscopic realism (wMR) posits MR for the system after $U_θ$, at the time $t_f$ (when the system is prepared for a final "pointer" measurement), the outcome of $S_θ$ not being changed by interactions that might occur at a remote system $B$. The premise also posits that if the outcome for $S_θ^A$ of a system $A$ can be predicted by a pointer measurement on a system $B$ defined after the interaction fixing the setting at $B$, then the outcome for $S_θ^A$ is determined at this time. The GHZ predictions negate dMR but are consistent with wMR. Yet, an EPR paradox arises based on wMR for the set-up proposed by Schrödinger, where one measures two complementary spins simultaneously, "one by direct, the other by indirect" measurement. We revisit the original EPR paradox and find similarly that an EPR argument can be based on a weak form of local realism not falsifiable by GHZ or Bell tests. △ Less

Submitted 26 September, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

Journal ref: Phys. Rev. A 110, 022218 (2024)

arXiv:2206.10597 [pdf, ps, other]

doi 10.1088/1361-648X/ac711e

Reproduction of the electronic and magnetic structure of the low symmetry sites of Y$_{2}$SiO$_{5}$ doped with Sm$^{3+}$ via a parameterized crystal-field model

Authors: N. L. Jobbitt, J. -P. R. Wells, M. F. Reid

Abstract: Parametrized crystal-field analyses are presented for both the six and seven fold coordinated, C$_{1}$ symmetry Sm$^{3+}$ centers in Y$_{2}$SiO$_{5}$, based on extensive spectroscopic data spanning the infrared to optical regions. Laser site-selective excitation and fluorescence spectroscopy as well as Zeeman absorption spectroscopy performed along multiple crystallographic directions has been uti… ▽ More Parametrized crystal-field analyses are presented for both the six and seven fold coordinated, C$_{1}$ symmetry Sm$^{3+}$ centers in Y$_{2}$SiO$_{5}$, based on extensive spectroscopic data spanning the infrared to optical regions. Laser site-selective excitation and fluorescence spectroscopy as well as Zeeman absorption spectroscopy performed along multiple crystallographic directions has been utilised, in addition to previously determined $g$ tensors for the $^{6}$H$_{5/2}$Z$_{1}$ and $^{4}$G$_{5/2}$A$_{1}$ states. The resultant analyses give good approximation to the experimental energy levels and magnetic splittings, yielding crystal-field parameters consistent with the few other lanthanide ions for which such analyses are available. △ Less

Submitted 17 June, 2022; originally announced June 2022.

Comments: arXiv admin note: text overlap with arXiv:2206.09080

Journal ref: J. Phys.: Condens. Matter 34 325502 (2022)

arXiv:2206.09080 [pdf, ps, other]

doi 10.1103/PhysRevB.104.155121

Prediction of the Optical Polarization and High Field Hyperfine Structure Via a Parametrized Crystal-Field Model for the Low Symmetry Centers in Er$^{3+}$ Doped Y$_{2}$SiO$_{5}$

Authors: N. L. Jobbitt, J. -P. R. Wells, M. F. Reid, S. P. Horvath, P. Goldner, A. Ferrier

Abstract: We report on the development and application of a parametrized crystal-field model for both C$_{1}$ symmetry centers in trivalent erbium-doped Y$_{2}$SiO$_{5}$. High resolution Zeeman and temperature dependent absorption spectroscopy was performed to acquire the necessary experimental data. The obtained data, in addition to the ground ($^{4}$I$_{15/2}$Z$_{1}$) state and exited ($^{4}$I$_{13/2}$Y… ▽ More We report on the development and application of a parametrized crystal-field model for both C$_{1}$ symmetry centers in trivalent erbium-doped Y$_{2}$SiO$_{5}$. High resolution Zeeman and temperature dependent absorption spectroscopy was performed to acquire the necessary experimental data. The obtained data, in addition to the ground ($^{4}$I$_{15/2}$Z$_{1}$) state and exited ($^{4}$I$_{13/2}$Y$_{1}$) state Zeeman and hyperfine structure, was simultaneously fitted in order to refine an existing crystal-field interpretation of the Er$^{3+}$:Y$_{2}$SiO$_{5}$ system. We demonstrate that it is possible to account for the electronic, magnetic and hyperfine structure of the full 4f$^{11}$ configuration of Er$^{3+}$:Y$_{2}$SiO$_{5}$ and further, that it is possible to predict both optical polarization behavior and high magnetic field hyperfine structure of transitions in the 1.5 $μ$m telecommunications band. △ Less

Submitted 17 June, 2022; originally announced June 2022.

Journal ref: Phys. Rev. B 104, 155121 (2021)

arXiv:2206.09047 [pdf, other]

doi 10.21883/EOS.2022.01.52984.40-21

Zeeman-Hyperfine Measurements of a Pseudo-Degenerate Quadruplet in CaF$_2$:Ho$^{3+}$

Authors: Kieran M. Smith, Michael F. Reid, Jon-Paul R. Wells

Abstract: We report Zeeman infra-red spectroscopy of electronic-nuclear levels of $^5$I$_8 \rightarrow ^5$I$_7$ transitions of Ho$^{3+}$ in the C$_{\rm 4v}$(F$^-$) centre in CaF$_2$ with the magnetic field along the $\langle 111\rangle$ direction of the crystal. Transitions to the lowest $^5$I$_7$ state, an isolated electronic doublet, and the next group of states, a pseudo-quadruplet consisting of a double… ▽ More We report Zeeman infra-red spectroscopy of electronic-nuclear levels of $^5$I$_8 \rightarrow ^5$I$_7$ transitions of Ho$^{3+}$ in the C$_{\rm 4v}$(F$^-$) centre in CaF$_2$ with the magnetic field along the $\langle 111\rangle$ direction of the crystal. Transitions to the lowest $^5$I$_7$ state, an isolated electronic doublet, and the next group of states, a pseudo-quadruplet consisting of a doublet and two nearby singlets, exhibit strongly non-linear Zeeman splittings and intensity variations. Simulated spectra based upon a crystal-field analysis give an excellent approximation to the data, illustrating the strong predictive ability of the parametrised crystal-field approach. Anti-crossings in the hyperfine splittings, the basis of quantum information storage in rare-earth doped insulating dielectrics, are also predicted. △ Less

Submitted 17 June, 2022; originally announced June 2022.

Journal ref: Optics and Spectroscopy, 130:28 (2022)

Showing 1–50 of 772 results for author: Reid, M