-
Van der Waals waveguide quantum electrodynamics probed by infrared nano-photoluminescence
Authors:
Samuel L. Moore,
Hae Yeon Lee,
Nicholas Rivera,
Yuzuka Karube,
Mark Ziffer,
Emanuil S. Yanev,
Thomas P. Darlington,
Aaron J. Sternbach,
Madisen A. Holbrook,
Jordan Pack,
Xiaodong Xu,
Cory R. Dean,
Jonathan S. Owen,
P. James Schuck,
Milan Delor,
Xiaoyang Zhu,
James Hone,
Dmitri N. Basov
Abstract:
Atomically layered van der Waals (vdW) materials exhibit remarkable properties, including highly-confined infrared waveguide modes and the capacity for infrared emission in the monolayer limit. Here, we engineered structures that leverage both of these nano-optical functionalities. Specifically, we encased a photoluminescing atomic sheet of MoTe2 within two bulk crystals of WSe2, forming a vdW wav…
▽ More
Atomically layered van der Waals (vdW) materials exhibit remarkable properties, including highly-confined infrared waveguide modes and the capacity for infrared emission in the monolayer limit. Here, we engineered structures that leverage both of these nano-optical functionalities. Specifically, we encased a photoluminescing atomic sheet of MoTe2 within two bulk crystals of WSe2, forming a vdW waveguide for the embedded light-emitting monolayer. The modified electromagnetic environment offered by the WSe2 waveguide alters MoTe2 spontaneous emission, a phenomenon we directly image with our interferometric nano-photoluminescence technique. We captured spatially-oscillating nanoscale patterns prompted by spontaneous emission from MoTe2 into waveguide modes of WSe2 slabs. We quantify the resulting Purcell-enhanced emission rate within the framework of a waveguide quantum electrodynamics (QED) model, relating the MoTe2 spontaneous emission rate to the measured waveguide dispersion. Our work marks a significant advance in the implementation of all-vdW QED waveguides.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
-
Splashback radius and the mass accretion rate of RASS MCMF galaxy clusters
Authors:
Jitendra Joshi,
Divya Rana,
Surhud More,
Matthias Klein
Abstract:
We present measurements of the radial profile of mass and galaxy number density around X-ray selected ROSAT All Sky Survey-Multi-Component Matched Filter galaxy clusters using Year 3 data from the Dark Energy Survey. We measure the projected cross-correlation signal of the RedMaGiC "high density" galaxies around an approximately volume-limited sample of 255 galaxy clusters at a median redshift of…
▽ More
We present measurements of the radial profile of mass and galaxy number density around X-ray selected ROSAT All Sky Survey-Multi-Component Matched Filter galaxy clusters using Year 3 data from the Dark Energy Survey. We measure the projected cross-correlation signal of the RedMaGiC "high density" galaxies around an approximately volume-limited sample of 255 galaxy clusters at a median redshift of $z=0.4$ and an X-ray luminosity $L_X > 10^{44} \,\text{ergs} \, \text{s}^{-1} \, \text{h}^{-2}$. This cross-correlation signal measured with a signal-to-noise ratio of 16.41 allows us to infer a 3D number density profile which shows a significant steepening at the edges of these galaxy clusters, namely the splashback radius of $r_{sp}$ /$h^{-1} \mathrm{M_{\odot}} = 2.19^{+0.50}_{-0.43}$. We present the dependence of the splashback radius value over a range of absolute galaxy magnitude cuts to look for any evidence of dynamical friction affecting these results. The weak lensing signal around our galaxy clusters measured with a signal-to-noise ratio of 32.19 allows us to infer a halo mass $\text{log} (M_{\rm 200m} / h^{-1} \text{Mpc}) = 14.68_{-0.04}^{+0.04}$. Comparison of the location of the splashback radius with the spherical overdensity boundary $r_{\rm 200m}$ shows consistency with the $\mathrm{ΛCDM}$ predictions. We present the first inference of the average mass accretion rate of galaxy clusters using our measurements of the splashback radius.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
Rod Bustall: In Memoriam
Authors:
J Strother Moore,
Gordon Plotkin,
David Rydeheard,
Don Sannella
Abstract:
This is an obituary of Rod Burstall, written in his honour. Rod was a prominent computer scientist whose contributions span over forty years. Most of his career was spent at Edinburgh University. He lead the team programming Freddy, the first hand-eye assembly robot, with much of his effort being devoted to the development of the POP-2 programming language. He became interested in a mathematical a…
▽ More
This is an obituary of Rod Burstall, written in his honour. Rod was a prominent computer scientist whose contributions span over forty years. Most of his career was spent at Edinburgh University. He lead the team programming Freddy, the first hand-eye assembly robot, with much of his effort being devoted to the development of the POP-2 programming language. He became interested in a mathematical approach to software development: he recognised the central role of structural induction; his work on reasoning about mutable data structures was an influential precursor of separation logic; he was the first to point out the connection between program proof and modal logic; and he was responsible for the idea that stores are mappings from locations to their contents.
As part of his quest for correctness of programs, Rod, with John Darlington, undertook the first major work on program transformation. His interest in novel programming languages continued with the experimental language HOPE, developed with Don Sannella and David MacQueen. Robin Milner's Standard ML and its relatives integrated ideas from Hope, and Rod was an active member of the Standard ML design team. Rod pioneered the use of algebraic and categorical techniques in programming. He and Joseph Goguen proposed the first algebraic specification language. Rod was an early user of categorical ideas, in particular developing computational category theory with David Rydeheard.
The interplay between programming and correctness proofs is again evident in Rod's contributions to automated proof support systems. For example, he led Randy Pollack, Zhaohui Luo, and others in the Lego proof assistant, which implemented type systems supporting interactive proof development. Finally, Rod, with James McKinna, investigated notions of programs packaged with proofs of their correctness, anticipating the topic of proof-carrying code.
△ Less
Submitted 9 May, 2025;
originally announced May 2025.
-
LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning
Authors:
Pingcheng Jian,
Xiao Wei,
Yanbaihui Liu,
Samuel A. Moore,
Michael M. Zavlanos,
Boyuan Chen
Abstract:
We introduce Large Language Model-Assisted Preference Prediction (LAPP), a novel framework for robot learning that enables efficient, customizable, and expressive behavior acquisition with minimum human effort. Unlike prior approaches that rely heavily on reward engineering, human demonstrations, motion capture, or expensive pairwise preference labels, LAPP leverages large language models (LLMs) t…
▽ More
We introduce Large Language Model-Assisted Preference Prediction (LAPP), a novel framework for robot learning that enables efficient, customizable, and expressive behavior acquisition with minimum human effort. Unlike prior approaches that rely heavily on reward engineering, human demonstrations, motion capture, or expensive pairwise preference labels, LAPP leverages large language models (LLMs) to automatically generate preference labels from raw state-action trajectories collected during reinforcement learning (RL). These labels are used to train an online preference predictor, which in turn guides the policy optimization process toward satisfying high-level behavioral specifications provided by humans. Our key technical contribution is the integration of LLMs into the RL feedback loop through trajectory-level preference prediction, enabling robots to acquire complex skills including subtle control over gait patterns and rhythmic timing. We evaluate LAPP on a diverse set of quadruped locomotion and dexterous manipulation tasks and show that it achieves efficient learning, higher final performance, faster adaptation, and precise control of high-level behaviors. Notably, LAPP enables robots to master highly dynamic and expressive tasks such as quadruped backflips, which remain out of reach for standard LLM-generated or handcrafted rewards. Our results highlight LAPP as a promising direction for scalable preference-driven robot learning.
△ Less
Submitted 21 April, 2025;
originally announced April 2025.
-
Quantum-assured magnetic navigation achieves positioning accuracy better than a strategic-grade INS in airborne and ground-based field trials
Authors:
Murat Muradoglu,
Mattias T. Johnsson,
Nathanial M. Wilson,
Yuval Cohen,
Dongki Shin,
Tomas Navickas,
Tadas Pyragius,
Divya Thomas,
Daniel Thompson,
Steven I. Moore,
Md Tanvir Rahman,
Adrian Walker,
Indranil Dutta,
Suraj Bijjahalli,
Jacob Berlocher,
Michael R. Hush,
Russell P. Anderson,
Stuart S. Szigeti,
Michael J. Biercuk
Abstract:
Modern navigation systems rely critically on GNSS, which in many cases is unavailable or unreliable (e.g. due to jamming or spoofing). For this reason there is great interest in augmenting backup navigation systems such as inertial navigation systems (INS) with additional modalities that reduce positioning error in the absence of reliable GNSS. Magnetic-anomaly navigation is one such approach, pro…
▽ More
Modern navigation systems rely critically on GNSS, which in many cases is unavailable or unreliable (e.g. due to jamming or spoofing). For this reason there is great interest in augmenting backup navigation systems such as inertial navigation systems (INS) with additional modalities that reduce positioning error in the absence of reliable GNSS. Magnetic-anomaly navigation is one such approach, providing passive, non-jammable navigation through periodic position fixes obtained by comparing local measurements of Earth's crustal field against known anomaly maps. Despite its potential, existing MagNav efforts have been limited by magnetometer performance and platform noise; solutions addressing these problems have proven either too brittle or impractical for realistic deployment. Here we demonstrate a quantum-assured MagNav solution based on proprietary quantum magnetometers with by a novel denoising and map-matching algorithms. The system fits on fixed-wing drones or in the avionics bay of a commercial airliner. We present trials at altitudes up to 19000 feet, testing onboard and outboard quantum magnetometers comparing against a strategic-grade INS. Our MagNav solution achieves superior performance, delivering up to 46x better positioning error than the velocity-aided INS; the best final positioning accuracy we achieve is 22m or 0.006% of the flight distance. Airborne trials consistently achieve at least 11x advantage over the INS across varying conditions, altitudes, and flight patterns. The system learns model parameters online without special vehicle maneuvers providing robustness to various configuration changes (e.g. changing payload or latitude). Our trials also include the first successful MagNav performed in a ground vehicle using publicly-available anomaly maps, delivering bounded positioning error 7x lower than the INS, with both systems in strapdown configuration.
△ Less
Submitted 10 April, 2025;
originally announced April 2025.
-
Subaru Hyper-Supreme Cam observations of IC 1396: Source catalogue, member population, and sub-clusters of the complex
Authors:
Swagat R Das,
Saumya Gupta,
Jessy Jose,
Manash Samal,
Gregory J. Herczeg,
Zhen Guo,
Surhud More,
Prem Prakash
Abstract:
To identify member populations of IC 1396, we employ the random forest (RF) classifier of machine learning technique. Random forest classifier is an ensemble of individual decision trees suitable for large, high-dimensional datasets. The training set used in this work is derived from previous Gaia-based studies, where the member stars are younger than $\sim$ 10~Myr. However, its sensitivity is lim…
▽ More
To identify member populations of IC 1396, we employ the random forest (RF) classifier of machine learning technique. Random forest classifier is an ensemble of individual decision trees suitable for large, high-dimensional datasets. The training set used in this work is derived from previous Gaia-based studies, where the member stars are younger than $\sim$ 10~Myr. However, its sensitivity is limited to $\sim$ 20~mag in the $\rm r_2$ band, making it challenging to identify candidates at the fainter end. In this analysis, in addition to magnitudes and colours, we incorporate several derived parameters from the magnitude and colour of the sources to identify candidate members of the star-forming complex. By employing this method, we are able to identify promising candidate member populations of the star-forming complex. We discuss the associated limitations and caveats in the method and for improvment in future studies.
In this analysis, we identify 2425 high-probability low-mass stars distributed within the entire star-forming complex, of which 1331 are new detections. Comparison of these identified member populations shows a high retrieval rate with Gaia-based literature sources, as well as sources detected through methods based on optical spectroscopy, Spitzer, $\rm H_α/X-ray$ emissions, optical, and 2MASS photometry. The mean age of the member populations is $\rm \sim 2-4~Myr$, consistent with findings from previous studies. Considering the identified member populations, we present preliminary results by exploring the presence of sub-clusters within IC 1396, assessing the possible mass limit of the member populations, and providing a brief discussion on the star formation history of the complex.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
Reducing Communication Overhead in Federated Learning for Network Anomaly Detection with Adaptive Client Selection
Authors:
William Marfo,
Deepak Tosh,
Shirley Moore,
Joshua Suetterlein,
Joseph Manzano
Abstract:
Communication overhead in federated learning (FL) poses a significant challenge for network anomaly detection systems, where diverse client configurations and network conditions impact efficiency and detection accuracy. Existing approaches attempt optimization individually but struggle to balance reduced overhead with performance. This paper presents an adaptive FL framework combining batch size o…
▽ More
Communication overhead in federated learning (FL) poses a significant challenge for network anomaly detection systems, where diverse client configurations and network conditions impact efficiency and detection accuracy. Existing approaches attempt optimization individually but struggle to balance reduced overhead with performance. This paper presents an adaptive FL framework combining batch size optimization, client selection, and asynchronous updates for efficient anomaly detection. Using UNSW-NB15 for general network traffic and ROAD for automotive networks, our framework reduces communication overhead by 97.6% (700.0s to 16.8s) while maintaining comparable accuracy (95.10% vs. 95.12%). The Mann-Whitney U test confirms significant improvements (p < 0.05). Profiling analysis reveals efficiency gains via reduced GPU operations and memory transfers, ensuring robust detection across varying client conditions.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory
Authors:
Robin Schmucker,
Steven Moore
Abstract:
High-quality test items are essential for educational assessments, particularly within Item Response Theory (IRT). Traditional validation methods rely on resource-intensive pilot testing to estimate item difficulty and discrimination. More recently, Item-Writing Flaw (IWF) rubrics emerged as a domain-general approach for evaluating test items based on textual features. However, their relationship…
▽ More
High-quality test items are essential for educational assessments, particularly within Item Response Theory (IRT). Traditional validation methods rely on resource-intensive pilot testing to estimate item difficulty and discrimination. More recently, Item-Writing Flaw (IWF) rubrics emerged as a domain-general approach for evaluating test items based on textual features. However, their relationship to IRT parameters remains underexplored. To address this gap, we conducted a study involving over 7,000 multiple-choice questions across various STEM subjects (e.g., math and biology). Using an automated approach, we annotated each question with a 19-criteria IWF rubric and studied relationships to data-driven IRT parameters. Our analysis revealed statistically significant links between the number of IWFs and IRT difficulty and discrimination parameters, particularly in life and physical science domains. We further observed how specific IWF criteria can impact item quality more and less severely (e.g., negative wording vs. implausible distractors). Overall, while IWFs are useful for predicting IRT parameters--particularly for screening low-difficulty MCQs--they cannot replace traditional data-driven validation methods. Our findings highlight the need for further research on domain-general evaluation rubrics and algorithms that understand domain-specific content for robust item validation.
△ Less
Submitted 13 March, 2025;
originally announced March 2025.
-
Network Anomaly Detection in Distributed Edge Computing Infrastructure
Authors:
William Marfo,
Enrique A. Rico,
Deepak K. Tosh,
Shirley V. Moore
Abstract:
As networks continue to grow in complexity and scale, detecting anomalies has become increasingly challenging, particularly in diverse and geographically dispersed environments. Traditional approaches often struggle with managing the computational burden associated with analyzing large-scale network traffic to identify anomalies. This paper introduces a distributed edge computing framework that in…
▽ More
As networks continue to grow in complexity and scale, detecting anomalies has become increasingly challenging, particularly in diverse and geographically dispersed environments. Traditional approaches often struggle with managing the computational burden associated with analyzing large-scale network traffic to identify anomalies. This paper introduces a distributed edge computing framework that integrates federated learning with Apache Spark and Kubernetes to address these challenges. We hypothesize that our approach, which enables collaborative model training across distributed nodes, significantly enhances the detection accuracy of network anomalies across different network types. By leveraging distributed computing and containerization technologies, our framework not only improves scalability and fault tolerance but also achieves superior detection performance compared to state-of-the-art methods. Extensive experiments on the UNSW-NB15 and ROAD datasets validate the effectiveness of our approach, demonstrating statistically significant improvements in detection accuracy and training efficiency over baseline models, as confirmed by Mann-Whitney U and Kolmogorov-Smirnov tests (p < 0.05).
△ Less
Submitted 24 January, 2025;
originally announced March 2025.
-
Luminosity and stellar mass functions of faint photometric satellites around spectroscopic central galaxies from DESI Year-1 Bright Galaxy Survey
Authors:
Wenting Wang,
Xiaohu Yang,
Yipeng Jing,
Ashley J. Ross,
Malgorzata Siudek,
John Moustakas,
Samuel G. Moore,
Shaun Cole,
Carlos Frenk,
Jiaxi Yu,
Sergey E. Koposov,
Jiaxin Han,
Zhenlin Tan,
Kun Xu,
Yizhou Gu,
Yirong Wang,
Oleg Y. Gnedin,
Jessica Nicole Aguilar,
Steven Ahlen,
Davide Bianchi,
David Brooks,
Todd Claybaugh,
Axel de la Macorra,
Arjun Dey,
Peter Doel
, et al. (25 additional authors not shown)
Abstract:
We measure the luminosity functions (LFs) and stellar mass functions (SMFs) of photometric satellite galaxies around spectroscopically identified isolated central galaxies (ICGs). The photometric satellites are from the DESI Legacy Imaging Surveys (DR9), while the spectroscopic ICGs are selected from the DESI Year-1 BGS sample. We can measure satellite LFs down to $r$-band absolute magnitudes of…
▽ More
We measure the luminosity functions (LFs) and stellar mass functions (SMFs) of photometric satellite galaxies around spectroscopically identified isolated central galaxies (ICGs). The photometric satellites are from the DESI Legacy Imaging Surveys (DR9), while the spectroscopic ICGs are selected from the DESI Year-1 BGS sample. We can measure satellite LFs down to $r$-band absolute magnitudes of $M_{r,\mathrm{sat}}\sim-7$, around ICGs as small as $7.1<\log_{10}M_{\ast,\mathrm{ICG}}/\mathrm{M_\odot}<7.8$, with the stellar mass of ICGs measured by the DESI Fastspecfit pipeline. The satellite SMF can be measured down to $\log_{10}M_{\ast,\mathrm{sat}}/\mathrm{M_\odot}\sim 5.5$. Interestingly, we discover that the faint/low-mass end slopes of satellite LFs/SMFs become steeper with the decrease in the stellar masses of host ICGs, with smaller and nearby host ICGs capable of being used to probe their fainter satellites.. The steepest slopes are $-2.298\pm0.656$ and $-$2.888$\pm$0.916 for satellite LF and SMF, respectively. Detailed comparisons are performed between the satellite LFs around ICGs selected from DESI BGS or from the SDSS NYU-VAGC spectroscopic Main galaxies over $7.1<\log_{10}M_{\ast,\mathrm{ICG}}/\mathrm{M_\odot}<11.7$, showing reasonable agreements, but we show that the differences between DESI and SDSS stellar masses for ICGs play a role to affect the results. We also compare measurements based on DESI Fastspecfit and Cigale stellar masses used to bin ICGs, with the latter including the modeling of AGN based on WISE photometry, and we find good agreements in the measured satellite LFs by using either of the DESI stellar mass catalogs.
△ Less
Submitted 4 May, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
-
PAC in DESI. I. Galaxy Stellar Mass Function into the $10^{6}{\rm M}_{\odot}$ Frontier
Authors:
Kun Xu,
Y. P. Jing,
Shaun Cole,
Carlos S. Frenk,
Sownak Bose,
Willem Elbers,
Wenting Wang,
Yirong Wang,
Samuel Moore,
J. Aguilar,
S. Ahlen,
D. Bianchi,
D. Brooks,
T. Claybaugh,
A. de la Macorra,
Arjun Dey,
J. E. Forero-Romero,
E. Gaztañaga,
S. Gontcho A Gontcho,
G. Gutierrez,
K. Honscheid,
M. Ishak,
T. Kisner,
S. E. Koposov,
M. Landriau
, et al. (12 additional authors not shown)
Abstract:
The Photometric Objects Around Cosmic Webs (PAC) method integrates cosmological photometric and spectroscopic surveys, offering valuable insights into galaxy formation. PAC measures the excess surface density of photometric objects, $\bar{n}_2w_{\rm{p}}$, with specific physical properties around spectroscopic tracers. In this study, we improve the PAC method to make it more rigorous and eliminate…
▽ More
The Photometric Objects Around Cosmic Webs (PAC) method integrates cosmological photometric and spectroscopic surveys, offering valuable insights into galaxy formation. PAC measures the excess surface density of photometric objects, $\bar{n}_2w_{\rm{p}}$, with specific physical properties around spectroscopic tracers. In this study, we improve the PAC method to make it more rigorous and eliminate the need for redshift bins. We apply the enhanced PAC method to the DESI Y1 BGS Bright spectroscopic sample and the deep DECaLS photometric sample, obtaining $\bar{n}_2w_{\rm{p}}$ measurements across the complete stellar mass range, from $10^{5.3}{\rm M}_{\odot}$ to $10^{11.5}{\rm M}_{\odot}$ for blue galaxies, and from $10^{6.3}{\rm M}_{\odot}$ to $10^{11.9}{\rm M}_{\odot}$ for red galaxies. We combine $\bar{n}_2w_{\rm{p}}$ with $w_{\rm{p}}$ measurements from the BGS sample, which is not necessarily complete in stellar mass. Assuming that galaxy bias is primarily determined by stellar mass and colour, we derive the galaxy stellar mass functions (GSMFs) down to $10^{5.3}{\rm M}_{\odot}$ for blue galaxies and $10^{6.3}{\rm M}_{\odot}$ for red galaxies, while also setting lower limits for smaller masses. The blue and red GSMFs are well described by single and double Schechter functions, respectively, with low-mass end slopes of $α_{\rm{blue}}=-1.54^{+0.02}_{-0.02}$ and $α_{\rm{red}}=-2.50^{+0.08}_{-0.08}$, resulting in the dominance of red galaxies below $10^{7.6}{\rm M}_{\odot}$. Stage-IV cosmological photometric surveys, capable of reaching 2-3 magnitudes deeper than DECaLS, present an opportunity to explore the entire galaxy population in the local universe with PAC. This advancement allows us to address critical questions regarding the nature of dark matter, the physics of reionization, and the formation of dwarf galaxies.
△ Less
Submitted 9 May, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Efficient Client Selection in Federated Learning
Authors:
William Marfo,
Deepak K. Tosh,
Shirley V. Moore
Abstract:
Federated Learning (FL) enables decentralized machine learning while preserving data privacy. This paper proposes a novel client selection framework that integrates differential privacy and fault tolerance. The adaptive client selection adjusts the number of clients based on performance and system constraints, with noise added to protect privacy. Evaluated on the UNSW-NB15 and ROAD datasets for ne…
▽ More
Federated Learning (FL) enables decentralized machine learning while preserving data privacy. This paper proposes a novel client selection framework that integrates differential privacy and fault tolerance. The adaptive client selection adjusts the number of clients based on performance and system constraints, with noise added to protect privacy. Evaluated on the UNSW-NB15 and ROAD datasets for network anomaly detection, the method improves accuracy by 7% and reduces training time by 25% compared to baselines. Fault tolerance enhances robustness with minimal performance trade-offs.
△ Less
Submitted 24 January, 2025;
originally announced February 2025.
-
Federated Learning for Efficient Condition Monitoring and Anomaly Detection in Industrial Cyber-Physical Systems
Authors:
William Marfo,
Deepak K. Tosh,
Shirley V. Moore
Abstract:
Detecting and localizing anomalies in cyber-physical systems (CPS) has become increasingly challenging as systems grow in complexity, particularly due to varying sensor reliability and node failures in distributed environments. While federated learning (FL) provides a foundation for distributed model training, existing approaches often lack mechanisms to address these CPS-specific challenges. This…
▽ More
Detecting and localizing anomalies in cyber-physical systems (CPS) has become increasingly challenging as systems grow in complexity, particularly due to varying sensor reliability and node failures in distributed environments. While federated learning (FL) provides a foundation for distributed model training, existing approaches often lack mechanisms to address these CPS-specific challenges. This paper introduces an enhanced FL framework with three key innovations: adaptive model aggregation based on sensor reliability, dynamic node selection for resource optimization, and Weibull-based checkpointing for fault tolerance. The proposed framework ensures reliable condition monitoring while tackling the computational and reliability challenges of industrial CPS deployments. Experiments on the NASA Bearing and Hydraulic System datasets demonstrate superior performance compared to state-of-the-art FL methods, achieving 99.5% AUC-ROC in anomaly detection and maintaining accuracy even under node failures. Statistical validation using the Mann-Whitney U test confirms significant improvements, with a p-value less than 0.05, in both detection accuracy and computational efficiency across various operational scenarios.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
Adaptive Client Selection in Federated Learning: A Network Anomaly Detection Use Case
Authors:
William Marfo,
Deepak K. Tosh,
Shirley V. Moore
Abstract:
Federated Learning (FL) has become a widely used approach for training machine learning models on decentralized data, addressing the significant privacy concerns associated with traditional centralized methods. However, the efficiency of FL relies on effective client selection and robust privacy preservation mechanisms. Ineffective client selection can result in suboptimal model performance, while…
▽ More
Federated Learning (FL) has become a widely used approach for training machine learning models on decentralized data, addressing the significant privacy concerns associated with traditional centralized methods. However, the efficiency of FL relies on effective client selection and robust privacy preservation mechanisms. Ineffective client selection can result in suboptimal model performance, while inadequate privacy measures risk exposing sensitive data.
This paper introduces a client selection framework for FL that incorporates differential privacy and fault tolerance. The proposed adaptive approach dynamically adjusts the number of selected clients based on model performance and system constraints, ensuring privacy through the addition of calibrated noise.
The method is evaluated on a network anomaly detection use case using the UNSW-NB15 and ROAD datasets. Results demonstrate up to a 7% improvement in accuracy and a 25% reduction in training time compared to the FedL2P approach. Additionally, the study highlights trade-offs between privacy budgets and model performance, with higher privacy budgets leading to reduced noise and improved accuracy. While the fault tolerance mechanism introduces a slight performance decrease, it enhances robustness against client failures. Statistical validation using the Mann-Whitney U test confirms the significance of these improvements, with results achieving a p-value of less than 0.05.
△ Less
Submitted 24 January, 2025;
originally announced January 2025.
-
Region-wise stacking ensembles for estimating brain-age using MRI
Authors:
Georgios Antonopoulos,
Shammi More,
Simon B. Eickhoff,
Federico Raimondo,
Kaustubh R. Patil
Abstract:
Predictive modeling using structural magnetic resonance imaging (MRI) data is a prominent approach to study brain-aging. Machine learning algorithms and feature extraction methods have been employed to improve predictions and explore healthy and accelerated aging e.g. neurodegenerative and psychiatric disorders. The high-dimensional MRI data pose challenges to building generalizable and interpreta…
▽ More
Predictive modeling using structural magnetic resonance imaging (MRI) data is a prominent approach to study brain-aging. Machine learning algorithms and feature extraction methods have been employed to improve predictions and explore healthy and accelerated aging e.g. neurodegenerative and psychiatric disorders. The high-dimensional MRI data pose challenges to building generalizable and interpretable models as well as for data privacy. Common practices are resampling or averaging voxels within predefined parcels, which reduces anatomical specificity and biological interpretability as voxels within a region may differently relate to aging. Effectively, naive fusion by averaging can result in information loss and reduced accuracy. We present a conceptually novel two-level stacking ensemble (SE) approach. The first level comprises regional models for predicting individuals' age based on voxel-wise information, fused by a second-level model yielding final predictions. Eight data fusion scenarios were explored using as input Gray matter volume (GMV) estimates from four datasets covering the adult lifespan. Performance, measured using mean absolute error (MAE), R2, correlation and prediction bias, showed that SE outperformed the region-wise averages. The best performance was obtained when first-level regional predictions were obtained as out-of-sample predictions on the application site with second-level models trained on independent and site-specific data (MAE=4.75 vs baseline regional mean GMV MAE=5.68). Performance improved as more datasets were used for training. First-level predictions showed improved and more robust aging signal providing new biological insights and enhanced data privacy. Overall, the SE improves accuracy compared to the baseline while preserving or enhancing data privacy.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Astronomy and Society: The Road Ahead
Authors:
Aniket Sule,
Niruj Mohan Ramanujam,
Moupiya Maji,
Surhud More,
Virendra Yadav,
Anand Narayanan,
Samir Dhurde,
Jayant Ganguly,
S. Seetha,
Ajit Mohan Srivastava,
B. S. Shylaja,
Yogesh Wadadekar
Abstract:
Astronomy, of all the sciences, is possibly the one with the most public appeal across all age groups. This is also evidenced by the existence of a large number of planetaria and amateur astronomy societies, which is unique to the field. Astronomy is known as a `gateway science', with an ability to attract students who then proceed to explore their interest in other STEM fields too. Astronomy's li…
▽ More
Astronomy, of all the sciences, is possibly the one with the most public appeal across all age groups. This is also evidenced by the existence of a large number of planetaria and amateur astronomy societies, which is unique to the field. Astronomy is known as a `gateway science', with an ability to attract students who then proceed to explore their interest in other STEM fields too. Astronomy's link to society is therefore substantive and diverse. In this white paper, six key areas are analysed, namely outreach and communication, astronomy education, history and heritage, astronomy for development, diversity, and hiring practices for outreach personnel.
The current status of each of these areas is described, followed by an analysis of what is needed for the future. A set of recommendations for institutions, funding agencies, and individuals are evolved for each specific area. This work charts out the vision for how the astronomy-society connection should take shape in the future, and attempts to provide a road-map for the various stakeholders involved.
△ Less
Submitted 14 January, 2025; v1 submitted 11 January, 2025;
originally announced January 2025.
-
Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1794 additional authors not shown)
Abstract:
Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent ana…
▽ More
Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent analysis methods considering the single-harmonic and the dual-harmonic emission models. We find no evidence of a CW signal in O4a data for both models and set upper limits on the signal amplitude and on the ellipticity, which quantifies the asymmetry in the neutron star mass distribution. For the single-harmonic emission model, 29 targets have the upper limit on the amplitude below the theoretical spin-down limit. The lowest upper limit on the amplitude is $6.4\!\times\!10^{-27}$ for the young energetic pulsar J0537-6910, while the lowest constraint on the ellipticity is $8.8\!\times\!10^{-9}$ for the bright nearby millisecond pulsar J0437-4715. Additionally, for a subset of 16 targets we performed a narrowband search that is more robust regarding the emission model, with no evidence of a signal. We also found no evidence of non-standard polarizations as predicted by the Brans-Dicke theory.
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset
Authors:
Tobi Olatunji,
Charles Nimo,
Abraham Owodunni,
Tassallah Abdullahi,
Emmanuel Ayodele,
Mardhiyah Sanni,
Chinemelu Aka,
Folafunmi Omofoye,
Foutse Yuehgoh,
Timothy Faniran,
Bonaventure F. P. Dossou,
Moshood Yekini,
Jonas Kemp,
Katherine Heller,
Jude Chidubem Omeke,
Chidi Asuzu MD,
Naome A. Etori,
Aimérou Ndiaye,
Ifeoma Okoh,
Evans Doe Ocansey,
Wendy Kinara,
Michael Best,
Irfan Essa,
Stephen Edward Moore,
Chris Fourie
, et al. (1 additional authors not shown)
Abstract:
Recent advancements in large language model(LLM) performance on medical multiple choice question (MCQ) benchmarks have stimulated interest from healthcare providers and patients globally. Particularly in low-and middle-income countries (LMICs) facing acute physician shortages and lack of specialists, LLMs offer a potentially scalable pathway to enhance healthcare access and reduce costs. However,…
▽ More
Recent advancements in large language model(LLM) performance on medical multiple choice question (MCQ) benchmarks have stimulated interest from healthcare providers and patients globally. Particularly in low-and middle-income countries (LMICs) facing acute physician shortages and lack of specialists, LLMs offer a potentially scalable pathway to enhance healthcare access and reduce costs. However, their effectiveness in the Global South, especially across the African continent, remains to be established. In this work, we introduce AfriMed-QA, the first large scale Pan-African English multi-specialty medical Question-Answering (QA) dataset, 15,000 questions (open and closed-ended) sourced from over 60 medical schools across 16 countries, covering 32 medical specialties. We further evaluate 30 LLMs across multiple axes including correctness and demographic bias. Our findings show significant performance variation across specialties and geographies, MCQ performance clearly lags USMLE (MedQA). We find that biomedical LLMs underperform general models and smaller edge-friendly LLMs struggle to achieve a passing score. Interestingly, human evaluations show a consistent consumer preference for LLM answers and explanations when compared with clinician answers.
△ Less
Submitted 14 January, 2025; v1 submitted 23 November, 2024;
originally announced November 2024.
-
DESI 2024 VII: Cosmological Constraints from the Full-Shape Modeling of Clustering Measurements
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
C. Allende Prieto,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
B. Bahr-Kalus,
S. Bailey,
C. Baltay,
A. Bault,
J. Behera,
S. BenZvi,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum
, et al. (188 additional authors not shown)
Abstract:
We present cosmological results from the measurement of clustering of galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). We adopt the full-shape (FS) modeling of the power spectrum, including the effects of redshift-space distortions, in an analysis which has been validated in a series of supporting p…
▽ More
We present cosmological results from the measurement of clustering of galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). We adopt the full-shape (FS) modeling of the power spectrum, including the effects of redshift-space distortions, in an analysis which has been validated in a series of supporting papers. In the flat $Λ$CDM cosmological model, DESI (FS+BAO), combined with a baryon density prior from Big Bang Nucleosynthesis and a weak prior on the scalar spectral index, determines matter density to $Ω_\mathrm{m}=0.2962\pm 0.0095$, and the amplitude of mass fluctuations to $σ_8=0.842\pm 0.034$. The addition of the cosmic microwave background (CMB) data tightens these constraints to $Ω_\mathrm{m}=0.3056\pm 0.0049$ and $σ_8=0.8121\pm 0.0053$, while further addition of the the joint clustering and lensing analysis from the Dark Energy Survey Year-3 (DESY3) data leads to a 0.4% determination of the Hubble constant, $H_0 = (68.40\pm 0.27)\,{\rm km\,s^{-1}\,Mpc^{-1}}$. In models with a time-varying dark energy equation of state, combinations of DESI (FS+BAO) with CMB and type Ia supernovae continue to show the preference, previously found in the DESI DR1 BAO analysis, for $w_0>-1$ and $w_a<0$ with similar levels of significance. DESI data, in combination with the CMB, impose the upper limits on the sum of the neutrino masses of $\sum m_ν< 0.071\,{\rm eV}$ at 95% confidence. DESI data alone measure the modified-gravity parameter that controls the clustering of massive particles, $μ_0=0.11^{+0.45}_{-0.54}$, while the combination of DESI with the CMB and the clustering and lensing analysis from DESY3 constrains both modified-gravity parameters, giving $μ_0 = 0.04\pm 0.22$ and $Σ_0 = 0.044\pm 0.047$, in agreement with general relativity. [Abridged.]
△ Less
Submitted 21 November, 2024; v1 submitted 18 November, 2024;
originally announced November 2024.
-
DESI 2024 V: Full-Shape Galaxy Clustering from Galaxies and Quasars
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
S. Bailey,
C. Baltay,
A. Bault,
J. Behera,
S. BenZvi,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum,
S. Brieden,
A. Brodzeller
, et al. (174 additional authors not shown)
Abstract:
We present the measurements and cosmological implications of the galaxy two-point clustering using over 4.7 million unique galaxy and quasar redshifts in the range $0.1<z<2.1$ divided into six redshift bins over a $\sim 7,500$ square degree footprint, from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). By fitting the full power spectrum, we exte…
▽ More
We present the measurements and cosmological implications of the galaxy two-point clustering using over 4.7 million unique galaxy and quasar redshifts in the range $0.1<z<2.1$ divided into six redshift bins over a $\sim 7,500$ square degree footprint, from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). By fitting the full power spectrum, we extend previous DESI DR1 baryon acoustic oscillation (BAO) measurements to include redshift-space distortions and signals from the matter-radiation equality scale. For the first time, this Full-Shape analysis is blinded at the catalogue-level to avoid confirmation bias and the systematic errors are accounted for at the two-point clustering level, which automatically propagates them into any cosmological parameter. When analysing the data in terms of compressed model-agnostic variables, we obtain a combined precision of 4.7\% on the amplitude of the redshift space distortion signal reaching similar precision with just one year of DESI data than with 20 years of observation from previous generation surveys. We analyse the data to directly constrain the cosmological parameters within the $Λ$CDM model using perturbation theory and combine this information with the reconstructed DESI DR1 galaxy BAO. Using a Big Bang Nucleosynthesis Gaussian prior on the baryon density parameter, and a Gaussian prior on the spectral index, we constrain the matter density is $Ω_m=0.296\pm 0.010 $ and the Hubble constant $H_0=(68.63 \pm 0.79)[{\rm km\, s^{-1}Mpc^{-1}}]$. Additionally, we measure the amplitude of clustering $σ_8=0.841 \pm 0.034$. The DESI DR1 results are in agreement with the $Λ$CDM model based on general relativity with parameters consistent with those from Planck. The cosmological interpretation of these results in combination with external datasets are presented in a companion paper.
△ Less
Submitted 11 March, 2025; v1 submitted 18 November, 2024;
originally announced November 2024.
-
DESI 2024 II: Sample Definitions, Characteristics, and Two-point Clustering Statistics
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
S. Bailey,
C. Baltay,
A. Bault,
J. Behera,
S. BenZvi,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum,
S. Brieden,
A. Brodzeller
, et al. (178 additional authors not shown)
Abstract:
We present the samples of galaxies and quasars used for DESI 2024 cosmological analyses, drawn from the DESI Data Release 1 (DR1). We describe the construction of large-scale structure (LSS) catalogs from these samples, which include matched sets of synthetic reference `randoms' and weights that account for variations in the observed density of the samples due to experimental design and varying in…
▽ More
We present the samples of galaxies and quasars used for DESI 2024 cosmological analyses, drawn from the DESI Data Release 1 (DR1). We describe the construction of large-scale structure (LSS) catalogs from these samples, which include matched sets of synthetic reference `randoms' and weights that account for variations in the observed density of the samples due to experimental design and varying instrument performance. We detail how we correct for variations in observational completeness, the input `target' densities due to imaging systematics, and the ability to confidently measure redshifts from DESI spectra. We then summarize how remaining uncertainties in the corrections can be translated to systematic uncertainties for particular analyses. We describe the weights added to maximize the signal-to-noise of DESI DR1 2-point clustering measurements. We detail measurement pipelines applied to the LSS catalogs that obtain 2-point clustering measurements in configuration and Fourier space. The resulting 2-point measurements depend on window functions and normalization constraints particular to each sample, and we present the corrections required to match models to the data. We compare the configuration- and Fourier-space 2-point clustering of the data samples to that recovered from simulations of DESI DR1 and find they are, generally, in statistical agreement to within 2\% in the inferred real-space over-density field. The LSS catalogs, 2-point measurements, and their covariance matrices will be released publicly with DESI DR1.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Breaking the mold: overcoming the time constraints of molecular dynamics on general-purpose hardware
Authors:
Danny Perez,
Aidan Thompson,
Stan Moore,
Tomas Oppelstrup,
Ilya Sharapov,
Kylee Santos,
Amirali Sharifian,
Delyan Z. Kalchev,
Robert Schreiber,
Scott Pakin,
Edgar A. Leon,
James H. Laros III,
Michael James,
Sivasankaran Rajamanickam
Abstract:
The evolution of molecular dynamics (MD) simulations has been intimately linked to that of computing hardware. For decades following the creation of MD, simulations have improved with computing power along the three principal dimensions of accuracy, atom count (spatial scale), and duration (temporal scale). Since the mid-2000s, computer platforms have however failed to provide strong scaling for M…
▽ More
The evolution of molecular dynamics (MD) simulations has been intimately linked to that of computing hardware. For decades following the creation of MD, simulations have improved with computing power along the three principal dimensions of accuracy, atom count (spatial scale), and duration (temporal scale). Since the mid-2000s, computer platforms have however failed to provide strong scaling for MD as scale-out CPU and GPU platforms that provide substantial increases to spatial scale do not lead to proportional increases in temporal scale. Important scientific problems therefore remained inaccessible to direct simulation, prompting the development of increasingly sophisticated algorithms that present significant complexity, accuracy, and efficiency challenges. While bespoke MD-only hardware solutions have provided a path to longer timescales for specific physical systems, their impact on the broader community has been mitigated by their limited adaptability to new methods and potentials. In this work, we show that a novel computing architecture, the Cerebras Wafer Scale Engine, completely alters the scaling path by delivering unprecedentedly high simulation rates up to 1.144M steps/second for 200,000 atoms whose interactions are described by an Embedded Atom Method potential. This enables direct simulations of the evolution of materials using general-purpose programmable hardware over millisecond timescales, dramatically increasing the space of direct MD simulations that can be carried out.
△ Less
Submitted 15 November, 2024;
originally announced November 2024.
-
Rapid identification of lensed type Ia supernovae with color-magnitude selection
Authors:
Prajakta Mane,
Anupreeta More,
Surhud More
Abstract:
Strongly lensed type Ia supernovae (SNe Ia) provide a unique cosmological probe to address the Hubble tension problem in cosmology. In addition to the sensitivity of the time delays to the value of the Hubble constant, the transient and standard candle nature of SNe Ia also enable valuable joint constraints on the model of the lens and the cosmological parameters. The upcoming Legacy Survey of Spa…
▽ More
Strongly lensed type Ia supernovae (SNe Ia) provide a unique cosmological probe to address the Hubble tension problem in cosmology. In addition to the sensitivity of the time delays to the value of the Hubble constant, the transient and standard candle nature of SNe Ia also enable valuable joint constraints on the model of the lens and the cosmological parameters. The upcoming Legacy Survey of Space and Time (LSST) with the Vera C. Rubin Observatory is expected to increase the number of observed SNe Ia by an order of magnitude in ten years of its lifetime. However, finding such systems in the LSST data is a challenge. In this work, we revisit the color-magnitude (CM) diagram used previously as a means to identify lensed SNe Ia and extend the work further as follows. We simulate LSST-like photometric data ($rizy$~bands) of lensed SNe Ia and analyze it in the CM parameter space. We find that a subset of lensed SNe Ia are redder compared to unlensed SNe Ia at a given magnitude, both in the rising and falling phases of their light curves and for SNe up to $z=3$. We propose a modified selection criterion based on these new results. We show that the contamination coming from the unlensed core-collapse (CC) SNe is negligible, whereas a small fraction of lensed CC SNe types Ib and Ic may get selected by this criterion as potential lensed SNe. Finally, we demonstrate that our criterion works well on a wide sample of observed unlensed SNe Ia, a handful of known multiply-imaged lensed SNe systems, and a representative sample of observed super-luminous supernovae.
△ Less
Submitted 14 November, 2024;
originally announced November 2024.
-
Automated Global Analysis of Experimental Dynamics through Low-Dimensional Linear Embeddings
Authors:
Samuel A. Moore,
Brian P. Mann,
Boyuan Chen
Abstract:
Dynamical systems theory has long provided a foundation for understanding evolving phenomena across scientific domains. Yet, the application of this theory to complex real-world systems remains challenging due to issues in mathematical modeling, nonlinearity, and high dimensionality. In this work, we introduce a data-driven computational framework to derive low-dimensional linear models for nonlin…
▽ More
Dynamical systems theory has long provided a foundation for understanding evolving phenomena across scientific domains. Yet, the application of this theory to complex real-world systems remains challenging due to issues in mathematical modeling, nonlinearity, and high dimensionality. In this work, we introduce a data-driven computational framework to derive low-dimensional linear models for nonlinear dynamical systems directly from raw experimental data. This framework enables global stability analysis through interpretable linear models that capture the underlying system structure. Our approach employs time-delay embedding, physics-informed deep autoencoders, and annealing-based regularization to identify novel low-dimensional coordinate representations, unlocking insights across a variety of simulated and previously unstudied experimental dynamical systems. These new coordinate representations enable accurate long-horizon predictions and automatic identification of intricate invariant sets while providing empirical stability guarantees. Our method offers a promising pathway to analyze complex dynamical behaviors across fields such as physics, climate science, and engineering, with broad implications for understanding nonlinear systems in the real world.
△ Less
Submitted 1 November, 2024;
originally announced November 2024.
-
Search for gravitational waves emitted from SN 2023ixf
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1758 additional authors not shown)
Abstract:
We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been…
▽ More
We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been identified in data when at least two gravitational-wave observatories were operating, which covered $\sim 14\%$ of this five-day window. We report the search detection efficiency for various possible gravitational-wave emission models. Considering the distance to M101 (6.7 Mpc), we derive constraints on the gravitational-wave emission mechanism of core-collapse supernovae across a broad frequency spectrum, ranging from 50 Hz to 2 kHz where we assume the gravitational-wave emission occurred when coincident data are available in the on-source window. Considering an ellipsoid model for a rotating proto-neutron star, our search is sensitive to gravitational-wave energy $1 \times 10^{-4} M_{\odot} c^2$ and luminosity $2.6 \times 10^{-4} M_{\odot} c^2/s$ for a source emitting at 82 Hz. These constraints are around an order of magnitude more stringent than those obtained so far with gravitational-wave data. The constraint on the ellipticity of the proto-neutron star that is formed is as low as 1.08, at frequencies above 1200 Hz, surpassing past results.
△ Less
Submitted 11 March, 2025; v1 submitted 21 October, 2024;
originally announced October 2024.
-
A search using GEO600 for gravitational waves coincident with fast radio bursts from SGR 1935+2154
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1758 additional authors not shown)
Abstract:
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by…
▽ More
The magnetar SGR 1935+2154 is the only known Galactic source of fast radio bursts (FRBs). FRBs from SGR 1935+2154 were first detected by CHIME/FRB and STARE2 in 2020 April, after the conclusion of the LIGO, Virgo, and KAGRA Collaborations' O3 observing run. Here we analyze four periods of gravitational wave (GW) data from the GEO600 detector coincident with four periods of FRB activity detected by CHIME/FRB, as well as X-ray glitches and X-ray bursts detected by NICER and NuSTAR close to the time of one of the FRBs. We do not detect any significant GW emission from any of the events. Instead, using a short-duration GW search (for bursts $\leq$ 1 s) we derive 50\% (90\%) upper limits of $10^{48}$ ($10^{49}$) erg for GWs at 300 Hz and $10^{49}$ ($10^{50}$) erg at 2 kHz, and constrain the GW-to-radio energy ratio to $\leq 10^{14} - 10^{16}$. We also derive upper limits from a long-duration search for bursts with durations between 1 and 10 s. These represent the strictest upper limits on concurrent GW emission from FRBs.
△ Less
Submitted 21 May, 2025; v1 submitted 11 October, 2024;
originally announced October 2024.
-
AI generated annotations for Breast, Brain, Liver, Lungs and Prostate cancer collections in National Cancer Institute Imaging Data Commons
Authors:
Gowtham Krishnan Murugesan,
Diana McCrumb,
Rahul Soni,
Jithendra Kumar,
Leonard Nuernberg,
Linmin Pei,
Ulrike Wagner,
Sutton Granger,
Andrey Y. Fedorov,
Stephen Moore,
Jeff Van Oss
Abstract:
AI in Medical Imaging project aims to enhance the National Cancer Institute's (NCI) Image Data Commons (IDC) by developing nnU-Net models and providing AI-assisted segmentations for cancer radiology images. We created high-quality, AI-annotated imaging datasets for 11 IDC collections. These datasets include images from various modalities, such as computed tomography (CT) and magnetic resonance ima…
▽ More
AI in Medical Imaging project aims to enhance the National Cancer Institute's (NCI) Image Data Commons (IDC) by developing nnU-Net models and providing AI-assisted segmentations for cancer radiology images. We created high-quality, AI-annotated imaging datasets for 11 IDC collections. These datasets include images from various modalities, such as computed tomography (CT) and magnetic resonance imaging (MRI), covering the lungs, breast, brain, kidneys, prostate, and liver. The nnU-Net models were trained using open-source datasets. A portion of the AI-generated annotations was reviewed and corrected by radiologists. Both the AI and radiologist annotations were encoded in compliance with the the Digital Imaging and Communications in Medicine (DICOM) standard, ensuring seamless integration into the IDC collections. All models, images, and annotations are publicly accessible, facilitating further research and development in cancer imaging. This work supports the advancement of imaging tools and algorithms by providing comprehensive and accurate annotated datasets.
△ Less
Submitted 30 September, 2024;
originally announced September 2024.
-
Environmental dependence on galaxy-halo connections for satellites using HSC weak lensing
Authors:
Amit Kumar,
Surhud More
Abstract:
We present the luminosity-halo mass relations of satellite (sLHMRs) galaxies in the SDSS redMaPPer cluster catalogue and the effects of the dense cluster environment on subhalo mass evolution. We use data from the Subaru Hyper Suprime-Cam survey Year-3 catalogue of galaxy shapes to measure the weak lensing signal around these satellites. This signal serves as a probe of the matter distribution aro…
▽ More
We present the luminosity-halo mass relations of satellite (sLHMRs) galaxies in the SDSS redMaPPer cluster catalogue and the effects of the dense cluster environment on subhalo mass evolution. We use data from the Subaru Hyper Suprime-Cam survey Year-3 catalogue of galaxy shapes to measure the weak lensing signal around these satellites. This signal serves as a probe of the matter distribution around the satellites, thereby providing the masses of their associated subhalos. We bin our satellites based on physical observable quantities such as their luminosity or the host cluster's richness, combined with their cluster-centric radial separations. Our results indicate that although more luminous satellites tend to reside in more massive halos, the sLHMRs depend on the distance of the satellite from the cluster centre. Subhalos near the cluster centre (within $<0.3 h^{-1}Mpc$) are stripped of mass. Consequently, the ratio of subhalo mass to luminosity decreases near the cluster centre. For low luminosity galaxies ($L < 10^{10} h^{-2}L_{\odot}$), the lack of evidence of increasing subhalo masses with luminosity shows the impact of tidal stripping. We also present stellar-to-subhalo mass relations (sSHMRs) for our satellite sample evolving at different cluster-centric separations. Inferred sSHMRs in the outer radial bin appear to match that observed for the field galaxies. We show that the sSHMRs from the mock-redMaPPer run on galaxy catalogues generated by the empirical UniverseMachine galaxy formation model are in good agreement with our observational results. Satellites, when binned based on the host cluster's richness, show very little dependence of the subhalo mass on the richness.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Double Machine Learning at Scale to Predict Causal Impact of Customer Actions
Authors:
Sushant More,
Priya Kotwal,
Sujith Chappidi,
Dinesh Mandalapu,
Chris Khawand
Abstract:
Causal Impact (CI) of customer actions are broadly used across the industry to inform both short- and long-term investment decisions of various types. In this paper, we apply the double machine learning (DML) methodology to estimate the CI values across 100s of customer actions of business interest and 100s of millions of customers. We operationalize DML through a causal ML library based on Spark…
▽ More
Causal Impact (CI) of customer actions are broadly used across the industry to inform both short- and long-term investment decisions of various types. In this paper, we apply the double machine learning (DML) methodology to estimate the CI values across 100s of customer actions of business interest and 100s of millions of customers. We operationalize DML through a causal ML library based on Spark with a flexible, JSON-driven model configuration approach to estimate CI at scale (i.e., across hundred of actions and millions of customers). We outline the DML methodology and implementation, and associated benefits over the traditional potential outcomes based CI model. We show population-level as well as customer-level CI values along with confidence intervals. The validation metrics show a 2.2% gain over the baseline methods and a 2.5X gain in the computational time. Our contribution is to advance the scalable application of CI, while also providing an interface that allows faster experimentation, cross-platform support, ability to onboard new use cases, and improves accessibility of underlying code for partner teams.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Valuing an Engagement Surface using a Large Scale Dynamic Causal Model
Authors:
Abhimanyu Mukerji,
Sushant More,
Ashwin Viswanathan Kannan,
Lakshmi Ravi,
Hua Chen,
Naman Kohli,
Chris Khawand,
Dinesh Mandalapu
Abstract:
With recent rapid growth in online shopping, AI-powered Engagement Surfaces (ES) have become ubiquitous across retail services. These engagement surfaces perform an increasing range of functions, including recommending new products for purchase, reminding customers of their orders and providing delivery notifications. Understanding the causal effect of engagement surfaces on value driven for custo…
▽ More
With recent rapid growth in online shopping, AI-powered Engagement Surfaces (ES) have become ubiquitous across retail services. These engagement surfaces perform an increasing range of functions, including recommending new products for purchase, reminding customers of their orders and providing delivery notifications. Understanding the causal effect of engagement surfaces on value driven for customers and businesses remains an open scientific question. In this paper, we develop a dynamic causal model at scale to disentangle value attributable to an ES, and to assess its effectiveness. We demonstrate the application of this model to inform business decision-making by understanding returns on investment in the ES, and identifying product lines and features where the ES adds the most value.
△ Less
Submitted 21 August, 2024;
originally announced August 2024.
-
Detecting Masquerade Attacks in Controller Area Networks Using Graph Machine Learning
Authors:
William Marfo,
Pablo Moriano,
Deepak K. Tosh,
Shirley V. Moore
Abstract:
Modern vehicles rely on a myriad of electronic control units (ECUs) interconnected via controller area networks (CANs) for critical operations. Despite their ubiquitous use and reliability, CANs are susceptible to sophisticated cyberattacks, particularly masquerade attacks, which inject false data that mimic legitimate messages at the expected frequency. These attacks pose severe risks such as uni…
▽ More
Modern vehicles rely on a myriad of electronic control units (ECUs) interconnected via controller area networks (CANs) for critical operations. Despite their ubiquitous use and reliability, CANs are susceptible to sophisticated cyberattacks, particularly masquerade attacks, which inject false data that mimic legitimate messages at the expected frequency. These attacks pose severe risks such as unintended acceleration, brake deactivation, and rogue steering. Traditional intrusion detection systems (IDS) often struggle to detect these subtle intrusions due to their seamless integration into normal traffic. This paper introduces a novel framework for detecting masquerade attacks in the CAN bus using graph machine learning (ML). We hypothesize that the integration of shallow graph embeddings with time series features derived from CAN frames enhances the detection of masquerade attacks. We show that by representing CAN bus frames as message sequence graphs (MSGs) and enriching each node with contextual statistical attributes from time series, we can enhance detection capabilities across various attack patterns compared to using only graph-based features. Our method ensures a comprehensive and dynamic analysis of CAN frame interactions, improving robustness and efficiency. Extensive experiments on the ROAD dataset validate the effectiveness of our approach, demonstrating statistically significant improvements in the detection rates of masquerade attacks compared to a baseline that uses only graph-based features, as confirmed by Mann-Whitney U and Kolmogorov-Smirnov tests (p < 0.05).
△ Less
Submitted 10 August, 2024;
originally announced August 2024.
-
Probing the nature of dark matter using strongly lensed gravitational waves from binary black holes
Authors:
Souvik Jana,
Shasvath J. Kapadia,
Tejaswi Venumadhav,
Surhud More,
Parameswaran Ajith
Abstract:
Next-generation ground-based gravitational-wave (GW) detectors are expected to detect millions of binary black hole mergers during their operation period. A small fraction ($\sim 0.1 - 1\%$) of them will be strongly lensed by intervening galaxies and clusters, producing multiple copies of the GW signals. The expected number of lensed events and the distribution of the time delay between lensed ima…
▽ More
Next-generation ground-based gravitational-wave (GW) detectors are expected to detect millions of binary black hole mergers during their operation period. A small fraction ($\sim 0.1 - 1\%$) of them will be strongly lensed by intervening galaxies and clusters, producing multiple copies of the GW signals. The expected number of lensed events and the distribution of the time delay between lensed images will depend on the mass distribution of the lenses at different redshifts. Warm dark matter or fuzzy dark matter models predict lower abundances of small mass dark matter halos as compared to the standard cold dark matter. This will result in a reduction in the number of strongly lensed GW events, especially at small time delays. Using the number of lensed events and the lensing time delay distribution, we can put a lower bound on the mass of the warm/fuzzy dark matter particle from a catalog of lensed GW events. The expected bounds from GW strong lensing from next-generation detectors are significantly better than the current constraints.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
Galaxy-dark matter connection from weak lensing in imaging surveys: Impact of photometric redshift errors
Authors:
Navin Chaurasiya,
Surhud More,
Daichi Kashino,
Shogo Masaki,
Shogo Ishikawa
Abstract:
The uncertainties in photometric redshifts and stellar masses from imaging surveys affect galaxy sample selection, their abundance measurements, as well as the measured weak lensing signals. We develop a framework to assess the systematic effects arising from the use of redshifts and stellar masses derived from photometric data, and explore their impact on the inferred galaxy-dark matter connectio…
▽ More
The uncertainties in photometric redshifts and stellar masses from imaging surveys affect galaxy sample selection, their abundance measurements, as well as the measured weak lensing signals. We develop a framework to assess the systematic effects arising from the use of redshifts and stellar masses derived from photometric data, and explore their impact on the inferred galaxy-dark matter connection. We use galaxy catalogues from the UniverseMachine (UM) galaxy formation model to create Pz-mock galaxy samples that approximately follow the redshift errors in the Subaru HSC survey. We focus on galaxy stellar-mass thresholds ranging from $\log\left[M_*/(h^{-2}M_\odot)\right]$ from $8.6$ to $11.2$ in steps of 0.2 dex within two redshift bins $0.30-0.55$ and $0.55-0.80$. A comparison of the Pz-mock samples to true galaxy samples in UM shows a relatively mild sample contamination for thresholds with $\log\left[M_{*,\rm limit}/(h^{-2}M_\odot)\right]<10.6$, while an increasing contamination towards the more massive end. We show how such contamination affects the measured abundance and the lensing signal. A joint HOD modelling of the observables from the Pz-mock compared to the truth in the UM informs the systematic biases on the average halo masses of central galaxies in the HSC survey. Even with a reasonably conservative choice of photo-$z$ errors in Pz-mock, we show that the inferred halo masses deduced from the HSC galaxies for low-mass thresholds will have a systematic bias smaller than 0.05 dex. Beyond $\log\left[M_{*,\rm limit}/(h^{-2}M_\odot)\right]=10.6$, the inferred halo masses show an increasing systematic bias with stellar mass, reaching values of order $0.2$ dex, larger than the statistical error.
△ Less
Submitted 9 August, 2024;
originally announced August 2024.
-
Achieving Human Level Competitive Robot Table Tennis
Authors:
David B. D'Ambrosio,
Saminda Abeyruwan,
Laura Graesser,
Atil Iscen,
Heni Ben Amor,
Alex Bewley,
Barney J. Reed,
Krista Reymann,
Leila Takayama,
Yuval Tassa,
Krzysztof Choromanski,
Erwin Coumans,
Deepali Jain,
Navdeep Jaitly,
Natasha Jaques,
Satoshi Kataoka,
Yuheng Kuang,
Nevena Lazic,
Reza Mahjourian,
Sherry Moore,
Kenneth Oslund,
Anish Shankar,
Vikas Sindhwani,
Vincent Vanhoucke,
Grace Vesom
, et al. (2 additional authors not shown)
Abstract:
Achieving human-level speed and performance on real world tasks is a north star for the robotics research community. This work takes a step towards that goal and presents the first learned robot agent that reaches amateur human-level performance in competitive table tennis. Table tennis is a physically demanding sport which requires human players to undergo years of training to achieve an advanced…
▽ More
Achieving human-level speed and performance on real world tasks is a north star for the robotics research community. This work takes a step towards that goal and presents the first learned robot agent that reaches amateur human-level performance in competitive table tennis. Table tennis is a physically demanding sport which requires human players to undergo years of training to achieve an advanced level of proficiency. In this paper, we contribute (1) a hierarchical and modular policy architecture consisting of (i) low level controllers with their detailed skill descriptors which model the agent's capabilities and help to bridge the sim-to-real gap and (ii) a high level controller that chooses the low level skills, (2) techniques for enabling zero-shot sim-to-real including an iterative approach to defining the task distribution that is grounded in the real-world and defines an automatic curriculum, and (3) real time adaptation to unseen opponents. Policy performance was assessed through 29 robot vs. human matches of which the robot won 45% (13/29). All humans were unseen players and their skill level varied from beginner to tournament level. Whilst the robot lost all matches vs. the most advanced players it won 100% matches vs. beginners and 55% matches vs. intermediate players, demonstrating solidly amateur human-level performance. Videos of the matches can be viewed at https://sites.google.com/view/competitive-robot-table-tennis
△ Less
Submitted 1 May, 2025; v1 submitted 7 August, 2024;
originally announced August 2024.
-
DRAM Errors and Cosmic Rays: Space Invaders or Science Fiction?
Authors:
Isaac Boixaderas,
Jorge Amaya,
Sergi Moré,
Javier Bartolome,
David Vicente,
Osman Unsal,
Dimitris Gizopoulos,
Paul M. Carpenter,
Petar Radojković,
Eduard Ayguadé
Abstract:
It is widely accepted that cosmic rays are a plausible cause of DRAM errors in high-performance computing (HPC) systems, and various studies suggest that they could explain some aspects of the observed DRAM error behavior. However, this phenomenon is insufficiently studied in production environments. We analyze the correlations between cosmic rays and DRAM errors on two HPC clusters: a production…
▽ More
It is widely accepted that cosmic rays are a plausible cause of DRAM errors in high-performance computing (HPC) systems, and various studies suggest that they could explain some aspects of the observed DRAM error behavior. However, this phenomenon is insufficiently studied in production environments. We analyze the correlations between cosmic rays and DRAM errors on two HPC clusters: a production supercomputer with server-class DDR3-1600 and a prototype with LPDDR3-1600 and no hardware error correction. Our error logs cover 2000 billion MB-hours for the MareNostrum 3 supercomputer and 135 million MB-hours for the Mont-Blanc prototype. Our analysis combines quantitative analysis, formal statistical methods and machine learning. We detect no indications that cosmic rays have any influence on the DRAM errors. To understand whether the findings are specific to systems under study, located at 100 meters above the sea level, the analysis should be repeated on other HPC clusters, especially the ones located on higher altitudes. Also, analysis can (and should) be applied to revisit and extend numerous previous studies which use cosmic rays as a hypothetical explanation for some aspects of the observed DRAM error behaviors.
△ Less
Submitted 1 November, 2024; v1 submitted 23 July, 2024;
originally announced July 2024.
-
Reinforcement Learning-based Adaptive Mitigation of Uncorrected DRAM Errors in the Field
Authors:
Isaac Boixaderas,
Sergi Moré,
Javier Bartolome,
David Vicente,
Petar Radojković,
Paul M. Carpenter,
Eduard Ayguadé
Abstract:
Scaling to larger systems, with current levels of reliability, requires cost-effective methods to mitigate hardware failures. One of the main causes of hardware failure is an uncorrected error in memory, which terminates the current job and wastes all computation since the last checkpoint. This paper presents the first adaptive method for triggering uncorrected error mitigation. It uses a predicti…
▽ More
Scaling to larger systems, with current levels of reliability, requires cost-effective methods to mitigate hardware failures. One of the main causes of hardware failure is an uncorrected error in memory, which terminates the current job and wastes all computation since the last checkpoint. This paper presents the first adaptive method for triggering uncorrected error mitigation. It uses a prediction approach that considers the likelihood of an uncorrected error and its current potential cost. The method is based on reinforcement learning, and the only user-defined parameters are the mitigation cost and whether the job can be restarted from a mitigation point. We evaluate our method using classical machine learning metrics together with a cost-benefit analysis, which compares the cost of mitigation actions with the benefits from mitigating some of the errors. On two years of production logs from the MareNostrum supercomputer, our method reduces lost compute time by 54% compared with no mitigation and is just 6% below the optimal Oracle method. All source code is open source.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run
Authors:
Gayathri Raman,
Samuele Ronchini,
James Delaunay,
Aaron Tohuvavohu,
Jamie A. Kennea,
Tyler Parsotan,
Elena Ambrosi,
Maria Grazia Bernardini,
Sergio Campana,
Giancarlo Cusumano,
Antonino D'Ai,
Paolo D'Avanzo,
Valerio D'Elia,
Massimiliano De Pasquale,
Simone Dichiara,
Phil Evans,
Dieter Hartmann,
Paul Kuin,
Andrea Melandri,
Paul O'Brien,
Julian P. Osborne,
Kim Page,
David M. Palmer,
Boris Sbarufatti,
Gianpiero Tagliaferri
, et al. (1797 additional authors not shown)
Abstract:
We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav…
▽ More
We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers.
△ Less
Submitted 27 March, 2025; v1 submitted 13 July, 2024;
originally announced July 2024.
-
Representation theory of the Reflection Equation Algebra II: Theory of shapes
Authors:
Kenny De Commer,
Stephen T. Moore
Abstract:
We continue our study of the representations of the Reflection Equation Algebra (=REA) on Hilbert spaces, focusing again on the REA constructed from the $R$-matrix associated to the standard $q$-deformation of $GL(N,\mathbb{C})$ for $0<q<1$. We consider the Poisson structure appearing as the classical limit of the $R$-matrix, and parametrize the symplectic leaves explicitly in terms of a type of m…
▽ More
We continue our study of the representations of the Reflection Equation Algebra (=REA) on Hilbert spaces, focusing again on the REA constructed from the $R$-matrix associated to the standard $q$-deformation of $GL(N,\mathbb{C})$ for $0<q<1$. We consider the Poisson structure appearing as the classical limit of the $R$-matrix, and parametrize the symplectic leaves explicitly in terms of a type of matrix we call a shape matrix. We then introduce a quantized version of the shape matrix for the REA, and show that each irreducible representation of the REA has a unique shape.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Secure quantum-enhanced measurements on a network of sensors
Authors:
Sean William Moore,
Jacob A. Dunningham
Abstract:
Two-party secure quantum remote sensing (SQRS) protocols enable quantum-enhanced measurements at remote locations with guaranteed security against eavesdroppers. This idea can be scaled up to networks of nodes where one party can directly measure functions of parameters at the different nodes using entangled states. However, the security on such networks decreases exponentially with the number of…
▽ More
Two-party secure quantum remote sensing (SQRS) protocols enable quantum-enhanced measurements at remote locations with guaranteed security against eavesdroppers. This idea can be scaled up to networks of nodes where one party can directly measure functions of parameters at the different nodes using entangled states. However, the security on such networks decreases exponentially with the number of nodes. Here we show how this problem can be overcome in a hybrid protocol that utilises both entangled and separable states to achieve quantum-enhanced measurement precision and security on networks of any size.
△ Less
Submitted 24 October, 2024; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Plasmonic polarization sensing of electrostatic superlattice potentials
Authors:
Shuai Zhang,
Jordan Fonseca,
Daniel Bennett,
Zhiyuan Sun,
Junhe Zhang,
Ran Jing,
Suheng Xu,
Leo He,
S. L. Moore,
S. E. Rossi,
Dmitry Ovchinnikov,
David Cobden,
Pablo. Jarillo-Herrero,
M. M. Fogler,
Philip Kim,
Efthimios Kaxiras,
Xiaodong Xu,
D. N. Basov
Abstract:
Plasmon polaritons are formed by coupling light with delocalized electrons. The half-light and half-matter nature of plasmon polaritons endows them with unparalleled tunability via a range of parameters, such as dielectric environments and carrier density. Therefore, plasmon polaritons are expected to be tuned when in proximity to polar materials since the carrier density is tuned by an electrosta…
▽ More
Plasmon polaritons are formed by coupling light with delocalized electrons. The half-light and half-matter nature of plasmon polaritons endows them with unparalleled tunability via a range of parameters, such as dielectric environments and carrier density. Therefore, plasmon polaritons are expected to be tuned when in proximity to polar materials since the carrier density is tuned by an electrostatic potential; conversely, the plasmon polariton response might enable the sensing of polarization. Here, we use infrared nano-imaging and nano-photocurrent measurements to investigate heterostructures composed of graphene and twisted hexagonal boron nitride (t-BN), with alternating polarization in a triangular network of moiré stacking domains. We observe that the carrier density and the corresponding plasmonic response of graphene are modulated by polar domains in t-BN. In addition, we demonstrate that the nanometer-wide domain walls of graphene moirés superlattices, created by the polar domains of t-BN, provide momenta to assist the plasmonic excitations. Furthermore, our studies establish that the plasmon of graphene could function as a delicate sensor for polarization textures. The evolution of polarization textures in t-BN under uniform electric fields is tomographically examined via plasmonic imaging. Strikingly, no noticeable polarization switching is observed under applied electric fields up to 0.23 V/nm, at variance with transport reports. Our nano-images unambiguously reveal that t-BN with triangular domains acts like a ferrielectric, rather than ferroelectric claimed by many previous studies.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Status of Astronomy Education in India: A Baseline Survey
Authors:
Moupiya Maji,
Surhud More,
Aniket Sule,
Vishaak Balasubramanya,
Ankit Bhandari,
Hum Chand,
Kshitij Chavan,
Avik Dasgupta,
Anindya De,
Jayant Gangopadhyay,
Mamta Gulati,
Priya Hasan,
Syed Ishtiyaq,
Meraj Madani,
Kuntal Misra,
Amoghavarsha N,
Divya Oberoi,
Subhendu Pattnaik,
Mayuri Patwardhan,
Niruj Mohan Ramanujam,
Pritesh Ranadive,
Disha Sawant,
Paryag Sharma,
Twinkle Sharma,
Sai Shetye
, et al. (6 additional authors not shown)
Abstract:
We present the results of a nation-wide baseline survey, conducted by us, for the status of Astronomy education among secondary school students in India. The survey was administered in 10 different languages to over 2000 students from diverse backgrounds, and it explored multiple facets of their perspectives on astronomy. The topics included students' views on the incorporation of astronomy in cur…
▽ More
We present the results of a nation-wide baseline survey, conducted by us, for the status of Astronomy education among secondary school students in India. The survey was administered in 10 different languages to over 2000 students from diverse backgrounds, and it explored multiple facets of their perspectives on astronomy. The topics included students' views on the incorporation of astronomy in curricula, their grasp of fundamental astronomical concepts, access to educational resources, cultural connections to astronomy, and their levels of interest and aspirations in the subject. We find notable deficiencies in students' knowledge of basic astronomical principles, with only a minority demonstrating proficiency in key areas such as celestial sizes, distances, and lunar phases. Furthermore, access to resources such as telescopes and planetariums remain limited across the country. Despite these challenges, a significant majority of students expressed a keen interest in astronomy. We further analyze the data along socioeconomic and gender lines. Particularly striking were the socioeconomic disparities, with students from resource-poor backgrounds often having lower levels of access and proficiency. Some differences were observed between genders, although not very pronounced. The insights gleaned from this study hold valuable implications for the development of a more robust astronomy curriculum and the design of effective teacher training programs in the future.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Weak-Lensing Shear-Selected Galaxy Clusters from the Hyper Suprime-Cam Subaru Strategic Program: II. Cosmological Constraints from the Cluster Abundance
Authors:
I-Non Chiu,
Kai-Feng Chen,
Masamune Oguri,
Markus M. Rau,
Takashi Hamana,
Yen-Ting Lin,
Hironao Miyatake,
Satoshi Miyazaki,
Surhud More,
Tomomi Sunayama,
Sunao Sugiyama,
Masahiro Takada
Abstract:
We present cosmological constraints using the abundance of weak-lensing shear-selected galaxy clusters in the Hyper Suprime-Cam (HSC) Subaru Strategic Program. The clusters are selected on the mass maps constructed using the three-year (Y3) weak-lensing data with an area of $\approx500~$deg$^2$, resulting in a sample size of $129$ clusters with high signal-to-noise ratios $ν$ of $ν\geq4.7$. Owing…
▽ More
We present cosmological constraints using the abundance of weak-lensing shear-selected galaxy clusters in the Hyper Suprime-Cam (HSC) Subaru Strategic Program. The clusters are selected on the mass maps constructed using the three-year (Y3) weak-lensing data with an area of $\approx500~$deg$^2$, resulting in a sample size of $129$ clusters with high signal-to-noise ratios $ν$ of $ν\geq4.7$. Owing to the deep, wide-field, and uniform imaging of the HSC survey, this is by far the largest sample of shear-selected clusters, in which the selection solely depends on gravity and is free from any assumptions about the dynamical state. Informed by the optical counterparts, the shear-selected clusters span a redshift range of $z\lesssim0.7$ with a median of $z\approx0.3$. The lensing sources are securely selected at $z\gtrsim0.7$ with a median of $z\approx1.3$, leading to nearly zero cluster member contamination. We carefully account for (1) the bias in the photometric redshift of sources, (2) the bias and scatter in the weak-lensing mass using a simulation-based calibration, and (3) the measurement uncertainty that is directly estimated on the mass maps using an injection-based method developed in a companion paper (Chen et al. submitted). In a blind analysis, the fully marginalized posteriors of the cosmological parameters are obtained as $Ω_{\mathrm{m}} = 0.50^{+0.28}_{-0.24}$, $σ_8 = 0.685^{+0.161}_{-0.088}$, $\hat{S}_{8}\equivσ_8\left(Ω_{\mathrm{m}}/0.3\right)^{0.25} = 0.835^{+0.041}_{-0.044}$, and $σ_8\left(Ω_{\mathrm{m}}/0.3\right)^{0.5} = 0.993^{+0.084}_{-0.126}$ in a flat $Λ$CDM model. We compare our cosmological constraints with other studies, including those based on cluster abundances, galaxy-galaxy lensing and clustering, and Cosmic Microwave Background observed by $Planck$, and find good agreement at levels of $\lesssim2σ$. [abridged]
△ Less
Submitted 14 October, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Weak-Lensing Shear-Selected Galaxy Clusters from the Hyper Suprime-Cam Subaru Strategic Program: I. Cluster Catalog, Selection Function and Mass--Observable Relation
Authors:
Kai-Feng Chen,
I-Non Chiu,
Masamune Oguri,
Yen-Ting Lin,
Hironao Miyatake,
Satoshi Miyazaki,
Surhud More,
Takashi Hamana,
Markus M. Rau,
Tomomi Sunayama,
Sunao Sugiyama,
Masahiro Takada
Abstract:
We present the first step toward deriving cosmological constraints through the abundances of galaxy clusters selected in a $510\,\mathrm{deg}^2$ weak-lensing aperture mass map, constructed with the Year-Three shear catalog from the Hyper Suprime-Cam Subaru Strategic Program. We adopt a conservative source galaxy selection to construct a sample of $129$ weak-lensing peaks with a signal-to-noise rat…
▽ More
We present the first step toward deriving cosmological constraints through the abundances of galaxy clusters selected in a $510\,\mathrm{deg}^2$ weak-lensing aperture mass map, constructed with the Year-Three shear catalog from the Hyper Suprime-Cam Subaru Strategic Program. We adopt a conservative source galaxy selection to construct a sample of $129$ weak-lensing peaks with a signal-to-noise ratio above $4.7$. We use semi-analytical injection simulations to derive the selection function and the mass--observable relation of our sample. These results take into account complicated uncertainties associated with weak-lensing measurements, such as the non-uniform survey depth and the complex survey geometry, projection effects from uncorrelated large-scale structures, and the intrinsic alignment of source galaxies. We also propose a novel modeling framework to make parts of the mass--observable relation insensitive to assumed cosmological parameters. Such a framework not only offers a great computational advantage to cosmological studies, but can also benefit future astrophysical studies using shear-selected clusters. Our results are an important step toward utilizing these cluster samples that are constructed nearly independent of any baryonic assumptions in upcoming deep-and-wide lensing surveys from the Vera Rubin Observatory, Euclid, and the Nancy Grace Roman Space Telescope.
△ Less
Submitted 3 January, 2025; v1 submitted 17 June, 2024;
originally announced June 2024.
-
BISON: Blind Identification with Stateless scOped pseudoNyms
Authors:
Jakob Heher,
Stefan More,
Lena Heimberger
Abstract:
Delegating authentication to identity providers like Google or Facebook, while convenient, compromises user privacy. These identity providers can record users' every move; the global identifiers they provide also enable internet-wide tracking.
We show that neither is a necessary evil by presenting the BISON pseudonym derivation protocol, inspired by Oblivious Pseudorandom Functions. It hides the…
▽ More
Delegating authentication to identity providers like Google or Facebook, while convenient, compromises user privacy. These identity providers can record users' every move; the global identifiers they provide also enable internet-wide tracking.
We show that neither is a necessary evil by presenting the BISON pseudonym derivation protocol, inspired by Oblivious Pseudorandom Functions. It hides the service provider's identity from the identity provider yet produces a trusted, scoped, immutable pseudonym. Colluding service providers cannot link BISON pseudonyms; this prevents user tracking. BISON does not require a long-lived state on the user device and does not add additional actors to the authentication process.
BISON is practical. It is easy to understand, implement, and reason about, and is designed to integrate into existing authentication protocols. To demonstrate this, we provide an OpenID Connect extension that allows OIDC's PPID pseudonyms to be derived using BISON. Additionally, BISON uses lightweight cryptography. Pseudonym derivation requires a total of four elliptic curve scalar-point multiplications and four hash function evaluations, taking ~3 ms in our proof of concept implementation. Thus, BISON's privacy guarantees can be realized in practice.
This makes BISON a crucial stepping stone towards the privacy-preserving internet of tomorrow.
△ Less
Submitted 17 July, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
An Automatic Question Usability Evaluation Toolkit
Authors:
Steven Moore,
Eamon Costello,
Huy A. Nguyen,
John Stamper
Abstract:
Evaluating multiple-choice questions (MCQs) involves either labor intensive human assessments or automated methods that prioritize readability, often overlooking deeper question design flaws. To address this issue, we introduce the Scalable Automatic Question Usability Evaluation Toolkit (SAQUET), an open-source tool that leverages the Item-Writing Flaws (IWF) rubric for a comprehensive and automa…
▽ More
Evaluating multiple-choice questions (MCQs) involves either labor intensive human assessments or automated methods that prioritize readability, often overlooking deeper question design flaws. To address this issue, we introduce the Scalable Automatic Question Usability Evaluation Toolkit (SAQUET), an open-source tool that leverages the Item-Writing Flaws (IWF) rubric for a comprehensive and automated quality evaluation of MCQs. By harnessing the latest in large language models such as GPT-4, advanced word embeddings, and Transformers designed to analyze textual complexity, SAQUET effectively pinpoints and assesses a wide array of flaws in MCQs. We first demonstrate the discrepancy between commonly used automated evaluation metrics and the human assessment of MCQ quality. Then we evaluate SAQUET on a diverse dataset of MCQs across the five domains of Chemistry, Statistics, Computer Science, Humanities, and Healthcare, showing how it effectively distinguishes between flawed and flawless questions, providing a level of analysis beyond what is achievable with traditional metrics. With an accuracy rate of over 94% in detecting the presence of flaws identified by human evaluators, our findings emphasize the limitations of existing evaluation methods and showcase potential in improving the quality of educational assessments.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions
Authors:
Steven Moore,
Robin Schmucker,
Tom Mitchell,
John Stamper
Abstract:
Knowledge Components (KCs) linked to assessments enhance the measurement of student learning, enrich analytics, and facilitate adaptivity. However, generating and linking KCs to assessment items requires significant effort and domain-specific knowledge. To streamline this process for higher-education courses, we employed GPT-4 to generate KCs for multiple-choice questions (MCQs) in Chemistry and E…
▽ More
Knowledge Components (KCs) linked to assessments enhance the measurement of student learning, enrich analytics, and facilitate adaptivity. However, generating and linking KCs to assessment items requires significant effort and domain-specific knowledge. To streamline this process for higher-education courses, we employed GPT-4 to generate KCs for multiple-choice questions (MCQs) in Chemistry and E-Learning. We analyzed discrepancies between the KCs generated by the Large Language Model (LLM) and those made by humans through evaluation from three domain experts in each subject area. This evaluation aimed to determine whether, in instances of non-matching KCs, evaluators showed a preference for the LLM-generated KCs over their human-created counterparts. We also developed an ontology induction algorithm to cluster questions that assess similar KCs based on their content. Our most effective LLM strategy accurately matched KCs for 56% of Chemistry and 35% of E-Learning MCQs, with even higher success when considering the top five KC suggestions. Human evaluators favored LLM-generated KCs, choosing them over human-assigned ones approximately two-thirds of the time, a preference that was statistically significant across both domains. Our clustering algorithm successfully grouped questions by their underlying KCs without needing explicit labels or contextual information. This research advances the automation of KC generation and classification for assessment items, alleviating the need for student data or predefined KC labels.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Strong-lensing cosmography using third-generation gravitational-wave detectors
Authors:
Souvik Jana,
Shasvath J Kapadia,
Tejaswi Venumadhav,
Surhud More,
Parameswaran Ajith
Abstract:
We present a detailed exposition of a statistical method for estimating cosmological parameters from the observation of a large number of strongly lensed binary-black-hole (BBH) mergers observable by next (third) generation (XG) gravitational-wave (GW) detectors. This method, first presented in Jana (2023 Phys. Rev. Lett. 130 261401), compares the observed number of strongly lensed GW events and t…
▽ More
We present a detailed exposition of a statistical method for estimating cosmological parameters from the observation of a large number of strongly lensed binary-black-hole (BBH) mergers observable by next (third) generation (XG) gravitational-wave (GW) detectors. This method, first presented in Jana (2023 Phys. Rev. Lett. 130 261401), compares the observed number of strongly lensed GW events and their time delay distribution (between lensed images) with observed events to infer cosmological parameters. We show that the precision of the estimation of the cosmological parameters does not have a strong dependance on the assumed BBH redshift distribution model. Using the large number of unlensed mergers, XG detectors are expected to measure the BBH redshift distribution with sufficient precision for the cosmological inference. However, a biased inference of the BBH redshift distribution will bias the estimation of cosmological parameters. An incorrect model for the distribution of lens properties can also lead to a biased cosmological inference. However, Bayesian model selection can assist in selecting the right model from a set of available parametric models for the lens distribution. We also present a way to incorporate the effect of contamination in the data due to the limited efficiency of lensing identification methods, so that it will not bias the cosmological inference.
△ Less
Submitted 19 November, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
Breaking the Molecular Dynamics Timescale Barrier Using a Wafer-Scale System
Authors:
Kylee Santos,
Stan Moore,
Tomas Oppelstrup,
Amirali Sharifian,
Ilya Sharapov,
Aidan Thompson,
Delyan Z Kalchev,
Danny Perez,
Robert Schreiber,
Scott Pakin,
Edgar A Leon,
James H Laros III,
Michael James,
Sivasankaran Rajamanickam
Abstract:
Molecular dynamics (MD) simulations have transformed our understanding of the nanoscale, driving breakthroughs in materials science, computational chemistry, and several other fields, including biophysics and drug design. Even on exascale supercomputers, however, runtimes are excessive for systems and timescales of scientific interest. Here, we demonstrate strong scaling of MD simulations on the C…
▽ More
Molecular dynamics (MD) simulations have transformed our understanding of the nanoscale, driving breakthroughs in materials science, computational chemistry, and several other fields, including biophysics and drug design. Even on exascale supercomputers, however, runtimes are excessive for systems and timescales of scientific interest. Here, we demonstrate strong scaling of MD simulations on the Cerebras Wafer-Scale Engine. By dedicating a processor core for each simulated atom, we demonstrate a 179-fold improvement in timesteps per second versus the Frontier GPU-based Exascale platform, along with a large improvement in timesteps per unit energy. Reducing every year of runtime to two days unlocks currently inaccessible timescales of slow microstructure transformation processes that are critical for understanding material behavior and function. Our dataflow algorithm runs Embedded Atom Method (EAM) simulations at rates over 270,000 timesteps per second for problems with up to 800k atoms. This demonstrated performance is unprecedented for general-purpose processing cores.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Neural network prediction of model parameters for strong lensing samples from Hyper Suprime-Cam Survey
Authors:
Priyanka Gawade,
Anupreeta More,
Surhud More,
Akisato Kimura,
Alessandro Sonnenfeld,
Masamune Oguri,
Naoki Yoshida
Abstract:
Strong lensing of background galaxies provides important information about the matter distribution around lens galaxies. Traditional modelling of such strong lenses is both time and resource intensive. Fast and automated analysis methods are the need of the hour given large upcoming surveys. In this work, we build and train a simple convolutional neural network with an aim of rapidly predicting mo…
▽ More
Strong lensing of background galaxies provides important information about the matter distribution around lens galaxies. Traditional modelling of such strong lenses is both time and resource intensive. Fast and automated analysis methods are the need of the hour given large upcoming surveys. In this work, we build and train a simple convolutional neural network with an aim of rapidly predicting model parameters of gravitational lenses. We focus on the inference of the Einstein radius, and ellipticity components of the mass distribution. We train our network on a variety of simulated data with increasing degree of realism and compare its performance on simulated test data in a quantitative manner. We also model 182 gravitational lenses from the HSC survey using {\sc YattaLens} pipeline to infer their model parameters, which allow a benchmark to compare the predictions of the network. Given all considerations, we conclude that the network trained on simulated samples with lensed sources injected in empty HSC cutouts is the most robust, reproducing Einstein radii with an accuracy of about $10-20$ percent, a bias less than 5 percent, and an outlier fraction of the order of 10 percent. We argue in favour of the subtraction of the lens light before modelling the lens mass distribution. Our comparisons of the inferred parameters of 10 HSC lenses previously modelled in the literature, demonstrate agreement on the Einstein radius. However, the ellipticity components from the network as well as the individual modelling methods, seem to have systematic uncertainties beyond the quoted errors.
△ Less
Submitted 6 June, 2025; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
S. Akçay,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah
, et al. (1771 additional authors not shown)
Abstract:
We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the so…
▽ More
We report the observation of a coalescing compact binary with component masses $2.5\text{-}4.5~M_\odot$ and $1.2\text{-}2.0~M_\odot$ (all measurements quoted at the 90% credible level). The gravitational-wave signal GW230529_181500 was observed during the fourth observing run of the LIGO-Virgo-KAGRA detector network on 2023 May 29 by the LIGO Livingston Observatory. The primary component of the source has a mass less than $5~M_\odot$ at 99% credibility. We cannot definitively determine from gravitational-wave data alone whether either component of the source is a neutron star or a black hole. However, given existing estimates of the maximum neutron star mass, we find the most probable interpretation of the source to be the coalescence of a neutron star with a black hole that has a mass between the most massive neutron stars and the least massive black holes observed in the Galaxy. We provisionally estimate a merger rate density of $55^{+127}_{-47}~\text{Gpc}^{-3}\,\text{yr}^{-1}$ for compact binary coalescences with properties similar to the source of GW230529_181500; assuming that the source is a neutron star-black hole merger, GW230529_181500-like sources constitute about 60% of the total merger rate inferred for neutron star-black hole coalescences. The discovery of this system implies an increase in the expected rate of neutron star-black hole mergers with electromagnetic counterparts and provides further evidence for compact objects existing within the purported lower mass gap.
△ Less
Submitted 26 July, 2024; v1 submitted 5 April, 2024;
originally announced April 2024.