-
Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image Mining
Authors:
Tianyuan Yao,
Yuzhe Lu,
Jun Long,
Aadarsh Jha,
Zheyu Zhu,
Zuhayr Asad,
Haichun Yang,
Agnes B. Fogo,
Yuankai Huo
Abstract:
The quantitative detection, segmentation, and characterization of glomeruli from high-resolution whole slide imaging (WSI) play essential roles in the computer-assisted diagnosis and scientific research in digital renal pathology. Historically, such comprehensive quantification requires extensive programming skills in order to be able to handle heterogeneous and customized computational tools. To bridge the gap of performing glomerular quantification for non-technical users, we develop the Glo-In-One toolkit to achieve holistic glomerular detection, segmentation, and characterization via a single line of command. Additionally, we release a large-scale collection of 30,000 unlabeled glomerular images to further facilitate the algorithmic development of self-supervised deep learning. The inputs of the Glo-In-One toolkit are WSIs, while the outputs are (1) WSI-level multi-class circle glomerular detection results (which can be directly manipulated with ImageScope), (2) glomerular image patches with segmentation masks, and (3) different lesion types. To leverage the performance of the Glo-In-One toolkit, we introduce self-supervised deep learning to glomerular quantification via large-scale web image mining. The GGS fine-grained classification model achieved a decent performance compared with baseline supervised methods while only using 10% of the annotated data. The glomerular detection achieved an average precision of 0.627 with circle representations, while the glomerular segmentation achieved a 0.955 patch-wise Dice Similarity Coefficient (DSC).
Submitted 31 May, 2022;
originally announced June 2022.
-
Faster Optimization on Sparse Graphs via Neural Reparametrization
Authors:
Nima Dehmamy,
Csaba Both,
Jianzhi Long,
Rose Yu
Abstract:
In mathematical optimization, second-order Newton's methods generally converge faster than first-order methods, but they require the inverse of the Hessian and are hence computationally expensive. However, we discover that on sparse graphs, graph neural networks (GNN) can implement an efficient Quasi-Newton method that can speed up optimization by a factor of 10-100x. Our method, neural reparametrization, modifies the optimization parameters as the output of a GNN to reshape the optimization landscape. Using a precomputed Hessian as the propagation rule, the GNN can effectively utilize the second-order information, achieving an effect similar to that of adaptive gradient methods. As our method solves optimization through architecture design, it can be used in conjunction with any optimizer, such as Adam or RMSProp. We show the application of our method on scientifically relevant problems including heat diffusion, synchronization and persistent homology.
Submitted 26 May, 2022;
originally announced May 2022.
-
Solving Optimal Control Problems of Rigid-Body Dynamics with Collisions Using the Hybrid Minimum Principle
Authors:
Wei Hu,
Jihao Long,
Yaohua Zang,
Weinan E,
Jiequn Han
Abstract:
Collisions are common in many dynamical systems with real applications. They can be formulated as hybrid dynamical systems with discontinuities automatically triggered when states traverse certain manifolds. We present an algorithm for the optimal control problem of such hybrid dynamical systems based on solving the equations derived from the hybrid minimum principle (HMP). The algorithm is an iterative scheme following the spirit of the method of successive approximations (MSA), and it is robust to undesired collisions observed in the initial guesses. We propose several techniques to address the additional numerical challenges introduced by the presence of discontinuities. The algorithm is tested on disc collision problems whose optimal solutions exhibit one or multiple collisions. Linear convergence in terms of iteration steps and asymptotic first-order accuracy in terms of time discretization are observed when the algorithm is implemented with the forward-Euler scheme. The numerical results demonstrate that the proposed algorithm has better accuracy and convergence than direct methods based on gradient descent. Furthermore, the algorithm is also simpler, more accurate, and more stable than a deep reinforcement learning method.
Submitted 17 January, 2025; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Empowering Optimal Control with Machine Learning: A Perspective from Model Predictive Control
Authors:
Weinan E,
Jiequn Han,
Jihao Long
Abstract:
Solving complex optimal control problems has long posed computational challenges. Recent advances in machine learning have provided us with new opportunities to address these challenges. This paper takes model predictive control, a popular optimal control method, as the primary example to survey recent progress that leverages machine learning techniques to empower optimal control solvers. We also discuss some of the main challenges encountered when applying machine learning to develop more robust optimal control algorithms.
Submitted 20 July, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Photoelastic Stress Response of Complex 3D-Printed Particle Shapes
Authors:
Negin Amini,
Josh Tuohey,
John M. Long,
Jun Zhang,
David A. V. Morton,
Karen Daniels,
Farnaz Fazelpour,
Karen P. Hapgood
Abstract:
While stress visualization within 3-dimensional particles would greatly advance our understanding of the behaviors of complex particles, traditional photoelastic methods suffer from a lack of available technology for producing suitable complex particles. Recently, 3D-printing has created new possibilities for enhancing the scope of stress analysis within physically representative granules. Here, we investigate and evaluate opportunities offered by 3D-printing a single particle with a complex external shape and photoelastic properties. We report the results of X-ray computed tomography and 3D-printing, combined with traditional photoelastic analysis, to visualize strain for particles ranging from simple 2D discs to complex 3D-printed coffee beans, including ones with internal voids. We find that the relative orientation of the print layers and the loading force affects the optical response of the discs, but without a significant difference in their mechanical properties. Furthermore, we present semi-quantitative measurements of stresses within 3D-printed complex particles. The paper outlines the potential limitations and areas of future interest for stress visualization of 3-dimensional particles.
Submitted 16 May, 2022;
originally announced May 2022.
-
Towards on-sky adaptive optics control using reinforcement learning
Authors:
J. Nousiainen,
C. Rajani,
M. Kasper,
T. Helin,
S. Y. Haffert,
C. Vérinaud,
J. R. Males,
K. Van Gorkom,
L. M. Close,
J. D. Long,
A. D. Hedglen,
O. Guyon,
L. Schatz,
M. Kautz,
J. Lumbres,
A. Rodack,
J. M. Knight,
K. Miller
Abstract:
The direct imaging of potentially habitable exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the habitable exoplanets are located at small angular separations from their host stars, where the current XAO systems' control laws leave strong residuals. Current AO control strategies like static matrix-based wavefront reconstruction and integrator control suffer from temporal delay error and are sensitive to mis-registration, i.e., to dynamic variations of the control system geometry. We aim to produce control methods that cope with these limitations, provide a significantly improved AO correction and, therefore, reduce the residual flux in the coronagraphic point spread function.
We extend previous work in Reinforcement Learning for AO. The improved method, called PO4AO, learns a dynamics model and optimizes a control neural network, called a policy. We introduce the method and study it through numerical simulations of XAO with Pyramid wavefront sensing for the 8-m and 40-m telescope aperture cases. We further implemented PO4AO and carried out experiments in a laboratory environment using MagAO-X at the Steward laboratory. PO4AO provides the desired performance by improving the coronagraphic contrast by factors of 3-5 within the control region of the DM and Pyramid WFS, in simulation and in the laboratory. The presented method is also quick to train, i.e., on timescales of typically 5-10 seconds, and the inference time is sufficiently small (< ms) to be used in real-time control for XAO with currently available hardware even for extremely large telescopes.
Submitted 16 May, 2022;
originally announced May 2022.
-
Reductive MDPs: A Perspective Beyond Temporal Horizons
Authors:
Thomas Spooner,
Rui Silva,
Joshua Lockhart,
Jason Long,
Vacslav Glukhov
Abstract:
Solving general Markov decision processes (MDPs) is a computationally hard problem. Solving finite-horizon MDPs, on the other hand, is highly tractable with well-known polynomial-time algorithms. What drives this extreme disparity, and do problems exist that lie between these diametrically opposed complexities? In this paper we identify and analyse a sub-class of stochastic shortest path problems (SSPs) for general state-action spaces whose dynamics satisfy a particular drift condition. This construction generalises the traditional, temporal notion of a horizon via decreasing reachability: a property called reductivity. It is shown that optimal policies can be recovered in polynomial time for reductive SSPs -- via an extension of backwards induction -- with an efficient analogue in reductive MDPs. The practical considerations of the proposed approach are discussed, and numerical verification is provided on a canonical optimal liquidation problem.
Submitted 15 May, 2022;
originally announced May 2022.
-
Double-Weak Decays of $^{124}$Xe and $^{136}$Xe in the XENON1T and XENONnT Experiments
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
B. Andrieu,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Cai,
C. Capelli,
J. M. R. Cardoso,
D. Cichon
, et al. (135 additional authors not shown)
Abstract:
We present results on the search for double-electron capture ($2ν\text{ECEC}$) of $^{124}$Xe and neutrinoless double-$β$ decay ($0νββ$) of $^{136}$Xe in XENON1T. We consider captures from the K- up to the N-shell in the $2ν\text{ECEC}$ signal model and measure a total half-life of $T_{1/2}^{2ν\text{ECEC}}=(1.1\pm0.2_\text{stat}\pm0.1_\text{sys})\times 10^{22}\;\text{yr}$ with a $0.87\;\text{kg}\times\text{yr}$ isotope exposure. The statistical significance of the signal is $7.0\,σ$. We use XENON1T data with $36.16\;\text{kg}\times\text{yr}$ of $^{136}$Xe exposure to search for $0νββ$. We find no evidence of a signal and set a lower limit on the half-life of $T_{1/2}^{0νββ} > 1.2 \times 10^{24}\;\text{yr}\; \text{at}\; 90\,\%\;\text{CL}$. This is the best result from a dark matter detector without an enriched target to date. We also report projections on the sensitivity of XENONnT to $0νββ$. Assuming a $275\;\text{kg}\times\text{yr}$ $^{136}$Xe exposure, the expected sensitivity is $T_{1/2}^{0νββ} > 2.1 \times 10^{25}\;\text{yr}\; \text{at}\; 90\,\%\;\text{CL}$, corresponding to an effective Majorana mass range of $\langle m_{ββ} \rangle < (0.19 - 0.59)\;\text{eV/c}^2$.
Submitted 6 September, 2022; v1 submitted 9 May, 2022;
originally announced May 2022.
-
Learning High-Dimensional McKean-Vlasov Forward-Backward Stochastic Differential Equations with General Distribution Dependence
Authors:
Jiequn Han,
Ruimeng Hu,
Jihao Long
Abstract:
One of the core problems in mean-field control and mean-field games is to solve the corresponding McKean-Vlasov forward-backward stochastic differential equations (MV-FBSDEs). Most existing methods are tailored to special cases in which the mean-field interaction only depends on expectation or other moments, and are thus inadequate for problems in which the mean-field interaction has full distribution dependence.
In this paper, we propose a novel deep learning method for computing MV-FBSDEs with a general form of mean-field interactions. Specifically, built on fictitious play, we recast the problem into repeatedly solving standard FBSDEs with explicit coefficient functions. These coefficient functions are used to approximate the MV-FBSDEs' model coefficients with full distribution dependence, and are updated by solving another supervised learning problem using training data simulated from the last iteration's FBSDE solutions. We use deep neural networks to solve standard BSDEs and approximate coefficient functions in order to solve high-dimensional MV-FBSDEs. Under proper assumptions on the learned functions, we prove that the convergence of the proposed method is free of the curse of dimensionality (CoD) by using a class of integral probability metrics previously developed in [Han, Hu and Long, arXiv:2104.12036]. The proved theorem shows the advantage of the method in high dimensions. We present the numerical performance in high-dimensional MV-FBSDE problems, including a mean-field game example of the well-known Cucker-Smale model whose cost depends on the full distribution of the forward process.
Submitted 18 September, 2023; v1 submitted 25 April, 2022;
originally announced April 2022.
-
High-precision half-life determination of $^{14}$O via direct $β$ counting
Authors:
S. Sharma,
G. F. Grinyer,
G. C. Ball,
J. R. Leslie,
C. E. Svensson,
F. A. Ali,
C. Andreoiu,
N. Bernier,
S. S. Bhattacharjee,
V. Bildstein,
C. Burbadge,
R. Caballero-Folch,
R. Coleman,
A. Diaz Varela,
M. R. Dunlop,
R. Dunlop,
A. B. Garnsworthy,
E. Gyabeng Fuakye,
G. M. Huber,
B. Jigmeddorj,
K. Kapoor,
A. T. Laffoley,
K. G. Leach,
J. Long,
A. D. MacLean
, et al. (8 additional authors not shown)
Abstract:
The half-life of the superallowed Fermi $β^+$ emitter $^{14}$O was determined to high precision via a direct $β$ counting experiment performed at the Isotope Separator and Accelerator (ISAC) facility at TRIUMF. The result, $T_{1/2}$($^{14}$O) = 70619.2(76) ms, is consistent with, but is more precise than, the world average obtained from 11 previous measurements. Combining the $^{14}$O half-life deduced in the present work with the previous most precise measurements of this quantity leads to a reduction in the overall uncertainty, by nearly a factor of 2. The new world average is $T_{1/2}$($^{14}$O) = 70619.6(63) ms with a reduced $χ^2$ value of 0.87 obtained from 8 degrees of freedom.
Submitted 13 April, 2022;
originally announced April 2022.
-
Decentralized Collaborative Learning Framework for Next POI Recommendation
Authors:
Jing Long,
Tong Chen,
Nguyen Quoc Viet Hung,
Hongzhi Yin
Abstract:
Next Point-of-Interest (POI) recommendation has become an indispensable functionality in Location-based Social Networks (LBSNs) due to its effectiveness in helping people decide the next POI to visit. However, accurate recommendation requires a vast amount of historical check-in data, thus threatening user privacy as the location-sensitive data needs to be handled by cloud servers. Although there have been several on-device frameworks for privacy-preserving POI recommendations, they are still resource-intensive when it comes to storage and computation, and show limited robustness to the high sparsity of user-POI interactions. On this basis, we propose a novel decentralized collaborative learning framework for POI recommendation (DCLR), which allows users to train their personalized models locally in a collaborative manner. DCLR significantly reduces the local models' dependence on the cloud for training, and can be used to expand arbitrary centralized recommendation models. To counteract the sparsity of on-device user data when learning each local model, we design two self-supervision signals to pretrain the POI representations on the server with geographical and categorical correlations of POIs. To facilitate collaborative learning, we innovatively propose to incorporate knowledge from either geographically or semantically similar users into each local model with attentive aggregation and mutual information maximization. The collaborative learning process makes use of communications between devices while requiring only minor engagement from the central server for identifying user groups, and is compatible with common privacy preservation mechanisms like differential privacy. We evaluate DCLR with two real-world datasets, where the results show that DCLR outperforms state-of-the-art on-device frameworks and yields competitive results compared with centralized counterparts.
Submitted 31 July, 2022; v1 submitted 30 March, 2022;
originally announced April 2022.
-
Magnet-free nonreciprocal metasurface for on-demand bi-directional phase modulation
Authors:
Weihao Yang,
Jun Qin,
Jiawei Long,
Wei Yan,
Yucong Yang,
Chaoyang Li,
En Li,
Juejun Hu,
Longjiang Deng,
Qingyang Du,
Lei Bi
Abstract:
Unconstrained by Lorentz reciprocity, nonreciprocal metasurfaces are uniquely capable of encoding distinctive optical functions on forward- and backward-propagating waves. The nonreciprocal metasurfaces reported to date require external electric or magnetic field biasing or rely on nonlinear effects, both of which are challenging to implement in practice. Here, we propose and experimentally realize a magnet-free, linear, and passive nonreciprocal metasurface based on self-biased magnetic meta-atoms. Record transmittance up to 77% and an operation angle reaching 64 degrees are experimentally demonstrated. Moreover, on-demand bidirectional phase modulation in a "LEGO-like" manner is theoretically proposed and experimentally demonstrated, enabling a cohort of nonreciprocal functionalities such as microwave isolation, nonreciprocal beam steering, nonreciprocal focusing, and nonreciprocal holography. The design can also be extended to MHz and optical frequencies, taking advantage of the wide variety of self-biased gyrotropic materials available. We foresee that the nonreciprocal metasurfaces demonstrated in this work will have a significant practical impact for applications ranging from nonreciprocal antennas and radomes to full-duplex wireless communication and radar systems.
Submitted 6 April, 2022;
originally announced April 2022.
-
Thinking inside The Box: Learning Hypercube Representations for Group Recommendation
Authors:
Tong Chen,
Hongzhi Yin,
Jing Long,
Quoc Viet Hung Nguyen,
Yang Wang,
Meng Wang
Abstract:
As a step beyond traditional personalized recommendation, group recommendation is the task of suggesting items that can satisfy a group of users. In group recommendation, the core is to design preference aggregation functions to obtain a quality summary of all group members' preferences. Such user and group preferences are commonly represented as points in the vector space (i.e., embeddings), where multiple user embeddings are compressed into one to facilitate ranking for group-item pairs. However, the resulting group representations, as points, lack adequate flexibility and capacity to account for the multi-faceted user preferences. Also, the point embedding-based preference aggregation is a less faithful reflection of a group's decision-making process, where all users have to agree on a certain value in each embedding dimension instead of a negotiable interval. In this paper, we propose a novel representation of groups via the notion of hypercubes, which are subspaces containing innumerable points in the vector space. Specifically, we design the hypercube recommender (CubeRec) to adaptively learn group hypercubes from user embeddings with minimal information loss during preference aggregation, and to leverage a revamped distance metric to measure the affinity between group hypercubes and item points. Moreover, to counteract the long-standing issue of data sparsity in group recommendation, we make full use of the geometric expressiveness of hypercubes and innovatively incorporate self-supervision by intersecting two groups. Experiments on four real-world datasets have validated the superiority of CubeRec over state-of-the-art baselines.
Submitted 4 December, 2022; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Super-Eddington accretion of the first Galactic Ultra-luminous X-ray pulsar Swift J0243.6+6124
Authors:
Liu Jiren,
Peter A Jenke,
Ji Long,
Zhang Shuang-Nan,
Zhang Shu,
Ge Mingyu,
Liao Jinyuan,
Li Xiaobo,
Song Liming
Abstract:
We present a detailed timing study of the pulse profile of Swift J0243.6+6124 with HXMT and Fermi/GBM data during its 2017 giant outburst. The double-peak profile at luminosity above $5\times10^{38}\,\text{erg}\,\text{s}^{-1}$ is found to be offset by 0.25 in phase from that below $1.5\times10^{38}\,\text{erg}\,\text{s}^{-1}$, which strongly supports a transition from a pencil beam to a fan beam, and thus the formation of a shock-dominated accretion column. During the rising stage of the high double-peak regime, the faint peak saturated in the 10-100 keV band above a luminosity of $L_t\sim1.3\times10^{39}\,\text{erg}\,\text{s}^{-1}$, which coincides with sudden spectral changes of both the main and faint peaks. These imply a sudden change of emission pattern around $L_t$. The spin-up rate ($\dot{ν}$) is linearly correlated with luminosity ($L$) below $L_t$, consistent with the prediction of a radiation pressure dominated (RPD) disk. The $\dot{ν}-L$ relation flattens above $L_t$, indicating a less efficient transfer of angular momentum and a change of accretion disk geometry above $L_t$. It is likely due to irradiation of the disk by the central accretion column and indicates significant radiation feedback before the inner disk radius reaches the spherization radius.
Submitted 23 March, 2022;
originally announced March 2022.
-
A portable atom gravimeter operating in noisy urban environments
Authors:
Bin Chen,
Jinbao Long,
Hongtai Xie,
Chenyang Li,
Luokan Chen,
Bonan Jiang,
Shuai Chen
Abstract:
Gravimeters based on atom interferometry have potentially wide applications in building gravity networks, geophysics, and gravity-assisted navigation. Here, we experimentally demonstrate a portable atom gravimeter operating in a noisy urban environment. Despite the influence of noisy external vibrations, our portable atom gravimeter reaches a sensitivity as good as $65\;\mu\text{Gal}/\sqrt{\text{Hz}}$ and a resolution of $1.1\;\mu\text{Gal}$ after 4000 s integration time, comparable to state-of-the-art atom gravimeters. Our achievement paves the way for bringing the portable atom gravimeter to field applications, such as gravity surveys on a moving platform.
Submitted 20 March, 2022;
originally announced March 2022.
-
Electric dipole moments and the search for new physics
Authors:
Ricardo Alarcon,
Jim Alexander,
Vassilis Anastassopoulos,
Takatoshi Aoki,
Rick Baartman,
Stefan Baeßler,
Larry Bartoszek,
Douglas H. Beck,
Franco Bedeschi,
Robert Berger,
Martin Berz,
Hendrick L. Bethlem,
Tanmoy Bhattacharya,
Michael Blaskiewicz,
Thomas Blum,
Themis Bowcock,
Anastasia Borschevsky,
Kevin Brown,
Dmitry Budker,
Sergey Burdin,
Brendan C. Casey,
Gianluigi Casse,
Giovanni Cantatore,
Lan Cheng,
Timothy Chupp
, et al. (118 additional authors not shown)
Abstract:
Static electric dipole moments of nondegenerate systems probe mass scales for physics beyond the Standard Model well beyond those reached directly at high energy colliders. Discrimination between different physics models, however, requires complementary searches in atomic-molecular-and-optical, nuclear and particle physics. In this report, we discuss the current status and prospects in the near future for a compelling suite of such experiments, along with developments needed in the encompassing theoretical framework.
Submitted 4 April, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Optimal Admission Control for Multiclass Queues with Time-Varying Arrival Rates via State Abstraction
Authors:
Marc Rigter,
Danial Dervovic,
Parisa Hassanzadeh,
Jason Long,
Parisa Zehtabi,
Daniele Magazzeni
Abstract:
We consider a novel queuing problem where the decision-maker must choose to accept or reject randomly arriving tasks into a no-buffer queue, where they are processed by $N$ identical servers. Each task has a price, which is a positive real number, and a class. Each class of task has a different price distribution and service rate, and arrives according to an inhomogeneous Poisson process. The objective is to decide which tasks to accept so that the total price of tasks processed is maximised over a finite horizon. We formulate the problem as a discrete time Markov Decision Process (MDP) with a hybrid state space. We show that the optimal value function has a specific structure, which enables us to solve the hybrid MDP exactly. Moreover, we prove that as the time step is reduced, the discrete time solution approaches the optimal solution to the original continuous time problem. To improve the scalability of our approach to a greater number of task classes, we present an approximation based on state abstraction. We validate our approach on synthetic data, as well as a real financial fraud data set, which is the motivating application for this work.
Submitted 14 March, 2022;
originally announced March 2022.
-
A Machine Learning Enhanced Algorithm for the Optimal Landing Problem
Authors:
Yaohua Zang,
Jihao Long,
Xuanxi Zhang,
Wei Hu,
Weinan E,
Jiequn Han
Abstract:
We propose a machine learning enhanced algorithm for solving the optimal landing problem. Using Pontryagin's minimum principle, we derive a two-point boundary value problem for the landing problem. The proposed algorithm uses deep learning to predict the optimal landing time and a space-marching technique to provide good initial guesses for the boundary value problem solver. The performance of the proposed method is studied using the quadrotor example, a reasonably high dimensional and strongly nonlinear system. Drastic improvement in reliability and efficiency is observed.
Submitted 13 March, 2022;
originally announced March 2022.
-
Early-Universe Model Building
Authors:
Pouya Asadi,
Saurabh Bansal,
Asher Berlin,
Raymond T. Co,
Djuna Croon,
Yanou Cui,
David Curtin,
Francis-Yan Cyr-Racine,
Hooman Davoudiasl,
Luigi Delle Rose,
Marco Drewes,
Jeff A. Dror,
Gilly Elor,
Oliver Gould,
Keisuke Harigaya,
Saniya Heeba,
Yonit Hochberg,
Anson Hook,
Seyda Ipek,
Eric Kuflik,
Andrew J. Long,
Robert McGehee,
Nadav Joseph Outmezguine,
Giuliano Panico,
Vivian Poulin
, et al. (15 additional authors not shown)
Abstract:
Theoretical investigations into the evolution of the early universe are an essential part of particle physics that allow us to identify viable extensions to the Standard Model as well as motivated parameter space that can be probed by various experiments and observations. In this white paper, we review particle physics models of the early universe. First, we outline various models that explain two essential ingredients of the early universe (dark matter and baryon asymmetry) and those that seek to address current observational anomalies. We then discuss dynamics of the early universe in models of neutrino masses, axions, and several solutions to the electroweak hierarchy problem. Finally, we review solutions to naturalness problems of the Standard Model that employ cosmological dynamics.
Submitted 7 September, 2022; v1 submitted 13 March, 2022;
originally announced March 2022.
-
Snowmass2021 Cosmic Frontier White Paper: Ultraheavy particle dark matter
Authors:
Daniel Carney,
Nirmal Raj,
Yang Bai,
Joshua Berger,
Carlos Blanco,
Joseph Bramante,
Christopher Cappiello,
Maíra Dutra,
Reza Ebadi,
Kristi Engel,
Edward Kolb,
J. Patrick Harding,
Jason Kumar,
Gordan Krnjaic,
Rafael F. Lang,
Rebecca K. Leane,
Benjamin V. Lehmann,
Shengchao Li,
Andrew J. Long,
Gopolang Mohlabeng,
Ibles Olcina,
Elisa Pueschel,
Nicholas L. Rodd,
Carsten Rott,
Dipan Sengupta
, et al. (3 additional authors not shown)
Abstract:
We outline the unique opportunities and challenges in the search for "ultraheavy" dark matter candidates with masses between roughly $10~{\rm TeV}$ and the Planck scale $m_{\rm pl} \approx 10^{16}~{\rm TeV}$. This mass range presents a wide and relatively unexplored dark matter parameter space, with a rich space of possible models and cosmic histories. We emphasize that both current detectors and new, targeted search techniques, via both direct and indirect detection, are poised to contribute to searches for ultraheavy particle dark matter in the coming decade. We highlight the need for new developments in this space, including new analyses of current and imminent direct and indirect experiments targeting ultraheavy dark matter and development of new, ultra-sensitive detector technologies like next-generation liquid noble detectors, neutrino experiments, and specialized quantum sensing techniques.
Submitted 27 April, 2023; v1 submitted 12 March, 2022;
originally announced March 2022.
-
A Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics
Authors:
J. Aalbers,
K. Abe,
V. Aerne,
F. Agostini,
S. Ahmed Maouloud,
D. S. Akerib,
D. Yu. Akimov,
J. Akshat,
A. K. Al Musalhi,
F. Alder,
S. K. Alsum,
L. Althueser,
C. S. Amarasinghe,
F. D. Amaro,
A. Ames,
T. J. Anderson,
B. Andrieu,
N. Angelides,
E. Angelino,
J. Angevaare,
V. C. Antochi,
D. Antón Martin,
B. Antunovic,
E. Aprile,
H. M. Araújo
, et al. (572 additional authors not shown)
Abstract:
The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neutrinos through neutrinoless double-beta decay and through a variety of astrophysical sources. A next-generation xenon-based detector will therefore be a true multi-purpose observatory to significantly advance particle physics, nuclear physics, astrophysics, solar physics, and cosmology. This review article presents the science cases for such a detector.
Submitted 4 March, 2022;
originally announced March 2022.
-
CheXstray: Real-time Multi-Modal Data Concordance for Drift Detection in Medical Imaging AI
Authors:
Arjun Soin,
Jameson Merkow,
Jin Long,
Joseph Paul Cohen,
Smitha Saligrama,
Stephen Kaiser,
Steven Borg,
Ivan Tarapov,
Matthew P Lungren
Abstract:
Clinical Artificial Intelligence (AI) applications are rapidly expanding worldwide, and have the potential to impact all areas of medical practice. Medical imaging applications constitute a vast majority of approved clinical AI applications. Though healthcare systems are eager to adopt AI solutions, a fundamental question remains: "what happens after the AI model goes into production?" We use the CheXpert and PadChest public datasets to build and test a medical imaging AI drift monitoring workflow to track data and model drift without contemporaneous ground truth. We simulate drift in multiple experiments to compare model performance with our novel multi-modal drift metric, which uses DICOM metadata, image appearance representation from a variational autoencoder (VAE), and model output probabilities as input. Through experimentation, we demonstrate a strong proxy for ground truth performance using unsupervised distributional shifts in relevant metadata, predicted probabilities, and VAE latent representation. Our key contributions include (1) proof-of-concept for medical imaging drift detection that includes the use of VAE and domain specific statistical methods, (2) a multi-modal methodology to measure and unify drift metrics, (3) new insights into the challenges and solutions to observe deployed medical imaging AI, and (4) creation of open-source tools that enable others to easily run their own workflows and scenarios. This work has important implications. It addresses the concerning translation gap found in continuous medical imaging AI model monitoring common in dynamic healthcare environments.
Submitted 17 March, 2022; v1 submitted 6 February, 2022;
originally announced February 2022.
-
Design considerations and performance analysis of fiber laser array system for structuring orbital angular momentum beams
Authors:
Tianyue Hou,
Qi Chang,
Jinhu Long,
Pengfei Ma,
Pu Zhou
Abstract:
Since the advent of optical orbital angular momentum (OAM), advances in the generation and manipulation of OAM beams have continuously impacted applications including optical communication, optical tweezers, and remote sensing. Coherent combining of fiber lasers offers a promising way to generate high-power, fast switchable OAM beams. Here, we comprehensively investigate the coherent fiber laser array system for structuring OAM beams in terms of design considerations and performance analysis. The performance metric and evaluation method of the laser array system are presented. Accordingly, the effect of the main sections of the laser array system, namely the high-power laser sources, emitting array configuration, and dynamic control system, on the performance of the output coherently combined OAM beams is evaluated, which reveals the system's tolerance to perturbative factors and provides guidance on system design and optimization. This work could provide a useful reference for the practical implementation of spatially structuring high-power, fast switchable OAM beams with fiber laser arrays.
Submitted 26 January, 2022;
originally announced January 2022.
-
Interactive Data Analysis with Next-step Natural Language Query Recommendation
Authors:
Xingbo Wang,
Furui Cheng,
Yong Wang,
Ke Xu,
Jiang Long,
Hong Lu,
Huamin Qu
Abstract:
Natural language interfaces (NLIs) provide users with a convenient way to interactively analyze data through natural language queries. Nevertheless, interactive data analysis is a demanding process, especially for novice data analysts. When exploring large and complex SQL databases from different domains, data analysts do not necessarily have sufficient knowledge about different data tables and application domains. It makes them unable to systematically elicit a series of topically-related and meaningful queries for insight discovery in target domains. We develop a NLI with a step-wise query recommendation module to assist users in choosing appropriate next-step exploration actions. The system adopts a data-driven approach to suggest semantically relevant and context-aware queries for application domains of users' interest based on their query logs. Also, the system helps users organize query histories and results into a dashboard to communicate the discovered data insights. With a comparative user study, we show that our system can facilitate a more effective and systematic data analysis process than a baseline without the recommendation module.
Submitted 1 November, 2022; v1 submitted 13 January, 2022;
originally announced January 2022.
-
Application and modeling of an online distillation method to reduce krypton and argon in XENON1T
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (129 additional authors not shown)
Abstract:
A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of $(360 \pm 60)$ ppq was achieved. It is the lowest concentration measured in the fiducial volume of an operating dark matter detector to date. A model was developed and fit to the data to describe the krypton evolution in the liquid and gas volumes of the detector system for several operation modes over the time span of 550 days, including the commissioning and science runs of XENON1T. The online distillation was also successfully applied to remove Ar-37 after its injection for a low energy calibration in XENON1T. This makes the usage of Ar-37 as a regular calibration source possible in the future. The online distillation can be applied to next-generation experiments to remove krypton prior to, or during, any science run. The model developed here allows further optimization of the distillation strategy for future large scale detectors.
Submitted 14 June, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Emission of Single and Few Electrons in XENON1T and Limits on Light Dark Matter
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
A. Bernard,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino
, et al. (130 additional authors not shown)
Abstract:
Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effectively be vetoed. In this work we extend previous S2-only analyses down to a single electron. From this analysis, after removing the correlated backgrounds, we observe rates < 30 events/(electron*kg*day) in the region of interest spanning 1 to 5 electrons. We derive 90% confidence upper limits for dark matter-electron scattering, first direct limits on the electric dipole, magnetic dipole, and anapole interactions, and bosonic dark matter models, where we exclude new parameter space for dark photons and solar dark photons.
Submitted 2 September, 2024; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Laser array of coherent beam combination system revisited: angular domain perspective and fractal-based optimization
Authors:
Tianyue Hou,
Qi Chang,
Pengfei Ma,
Jinhu Long,
Pu Zhou
Abstract:
Coherent beam combination (CBC) of fiber lasers holds promise for achieving high brightness laser systems, which have given rise to widespread applications such as particle accelerators, space debris removal, and industrial fabrication. The emitting laser array of CBC systems offers intriguing features in terms of agile beam steering, flexible beam shaping, and high scalability for output power and array elements. However, the theoretical model of the laser array in CBC systems is less well explored beyond the routine angular-spectrum method, and methods for optimizing the laser array configuration are more limited still. Here, we explore the theory for the laser array of CBC systems from the perspective of the angular domain. The laser array is represented by the composition of angular harmonics, the orthogonal basis over the azimuthal plane, and we elucidate the formation of the mainlobe and sidelobes of the far-field interference pattern by using the orbital angular momentum spectrum analysis and azimuthal decomposition. Based on our findings, a fractal-based laser array configuration is proposed to enhance the performance of the combining system. Our work offers a deeper insight into the theoretical study and application of laser beam combination and opens opportunities for the further optimization of CBC implementations.
Submitted 11 December, 2021;
originally announced December 2021.
-
Material radiopurity control in the XENONnT experiment
Authors:
E. Aprile,
K. Abe,
F. Agostini,
S. Ahmed Maouloud,
M. Alfonsi,
L. Althueser,
E. Angelino,
J. R. Angevaare,
V. C. Antochi,
D. Antón Martin,
F. Arneodo,
L. Baudis,
A. L. Baxter,
L. Bellagamba,
R. Biondi,
A. Bismark,
A. Brown,
S. Bruenner,
G. Bruno,
R. Budnik,
C. Capelli,
J. M. R. Cardoso,
D. Cichon,
B. Cimmino,
M. Clark
, et al. (128 additional authors not shown)
Abstract:
The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove or mitigate surface contamination of detector materials are described. Screening results, used as inputs for a XENONnT Monte Carlo simulation, predict a reduction of materials background ($\sim$17%) with respect to its predecessor XENON1T. Through radon emanation measurements, the expected $^{222}$Rn activity concentration in XENONnT is determined to be 4.2$\,(^{+0.5}_{-0.7})\,μ$Bq/kg, a factor of three lower than in XENON1T. This radon concentration will be further suppressed by means of the novel radon distillation system.
Submitted 26 January, 2023; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Upper Limit on the QCD Axion Mass from Isolated Neutron Star Cooling
Authors:
Malte Buschmann,
Christopher Dessert,
Joshua W. Foster,
Andrew J. Long,
Benjamin R. Safdi
Abstract:
The quantum chromodynamics (QCD) axion may modify the cooling rates of neutron stars (NSs). The axions are produced within the NS cores from nucleon bremsstrahlung and, when the nucleons are in superfluid states, Cooper pair breaking and formation processes. We show that four of the nearby isolated Magnificent Seven NSs along with PSR J0659 are prime candidates for axion cooling studies because they are coeval, with ages of a few hundred thousand years known from kinematic considerations, and they have well-measured surface luminosities. We compare these data to dedicated NS cooling simulations incorporating axions, profiling over uncertainties related to the equation of state, NS masses, surface compositions, and superfluidity. Our calculations of the axion and neutrino emissivities include high-density suppression factors that also affect SN 1987A and previous NS cooling limits on axions. We find no evidence for axions in the isolated NS data, and within the context of the KSVZ QCD axion model we constrain $m_a \lesssim 16$ meV at 95% confidence. An improved understanding of NS cooling and nucleon superfluidity could further improve these limits or lead to the discovery of the axion at weaker couplings.
Submitted 18 November, 2021;
originally announced November 2021.
-
Peta-electron volt gamma-ray emission from the Crab Nebula
Authors:
The LHAASO Collaboration,
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
H. Cai,
J. T. Cai,
Zhe Cao,
J. Chang,
J. F. Chang,
B. M. Chen,
E. S. Chen,
J. Chen,
Liang Chen,
Liang Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen
, et al. (250 additional authors not shown)
Abstract:
The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ petaelectronvolt (PeV). The ultra-high-energy photons exhibit the presence of a PeV electron accelerator (a pevatron) with an acceleration rate exceeding 15% of the absolute theoretical limit. Assuming that unpulsed $γ$-rays are produced at the termination of the pulsar's wind, we constrain the pevatron's size, between $0.025$ and $0.1$ pc, and the magnetic field $\approx 110 μ$G. The production rate of PeV electrons, $2.5 \times 10^{36}$ erg $\rm s^{-1}$, constitutes 0.5% of the pulsar's spin-down luminosity, although we do not exclude a non-negligible contribution of PeV protons to the production of the highest energy $γ$-rays.
Submitted 11 November, 2021;
originally announced November 2021.
-
Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space
Authors:
Jihao Long,
Jiequn Han
Abstract:
Most existing theoretical analysis of reinforcement learning (RL) is limited to the tabular setting or linear models due to the difficulty in dealing with function approximation in high dimensional space with an uncertain environment. This work offers a fresh perspective into this challenge by analyzing RL in a general reproducing kernel Hilbert space (RKHS). We consider a family of Markov decision processes $\mathcal{M}$ of which the reward functions lie in the unit ball of an RKHS and transition probabilities lie in a given arbitrary set. We define a quantity called perturbational complexity by distribution mismatch $Δ_{\mathcal{M}}(ε)$ to characterize the complexity of the admissible state-action distribution space in response to a perturbation in the RKHS with scale $ε$. We show that $Δ_{\mathcal{M}}(ε)$ gives both the lower bound of the error of all possible algorithms and the upper bound of two specific algorithms (fitted reward and fitted Q-iteration) for the RL problem. Hence, the decay of $Δ_\mathcal{M}(ε)$ with respect to $ε$ measures the difficulty of the RL problem on $\mathcal{M}$. We further provide some concrete examples and discuss whether $Δ_{\mathcal{M}}(ε)$ decays fast or not in these examples. As a byproduct, we show that when the reward functions lie in a high dimensional RKHS, even if the transition probability is known and the action space is finite, it is still possible for RL problems to suffer from the curse of dimensionality.
Submitted 27 March, 2022; v1 submitted 5 November, 2021;
originally announced November 2021.
-
A Survey on the Robustness of Feature Importance and Counterfactual Explanations
Authors:
Saumitra Mishra,
Sanghamitra Dutta,
Jason Long,
Daniele Magazzeni
Abstract:
There exist several methods that aim to address the crucial task of understanding the behaviour of AI/ML models. Arguably, the most popular among them are local explanations that focus on investigating model behaviour for individual instances. Several methods have been proposed for local analysis, but relatively less effort has gone into understanding whether the explanations are robust and accurately reflect the behaviour of underlying models. In this work, we present a survey of the works that analysed the robustness of two classes of local explanations (feature importance and counterfactual explanations) that are popularly used in analysing AI/ML models in finance. The survey aims to unify existing definitions of robustness, introduces a taxonomy to classify different robustness approaches, and discusses some interesting results. Finally, the survey introduces some pointers about extending current robustness analysis approaches so as to identify reliable explainability methods.
Submitted 3 January, 2023; v1 submitted 30 October, 2021;
originally announced November 2021.
-
Counterfactual Shapley Additive Explanations
Authors:
Emanuele Albini,
Jason Long,
Danial Dervovic,
Daniele Magazzeni
Abstract:
Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify the link between actionable recourse and feature attributions. Concretely, we propose a variant of SHAP, Counterfactual SHAP (CF-SHAP), that incorporates counterfactual information to produce a background dataset for use within the marginal (a.k.a. interventional) Shapley value framework. We motivate the need within the actionable recourse setting for careful consideration of background datasets when using Shapley values for feature attributions with numerous synthetic examples. Moreover, we demonstrate the efficacy of CF-SHAP by proposing and justifying a quantitative score for feature attributions, counterfactual-ability, showing that as measured by this metric, CF-SHAP is superior to existing methods when evaluated on public datasets using tree ensembles.
Submitted 16 May, 2022; v1 submitted 27 October, 2021;
originally announced October 2021.
-
LTD064402+245919: A Subgiant with a 1-3 M$_{\odot}$ Undetected Companion Identified from LAMOST-TD Data
Authors:
Fan Yang,
Bo Zhang,
Richard J. Long,
You-Jun Lu,
Su-Su Shan,
Xing Wei,
Jian-Ning Fu,
Xian-Fei Zhang,
Zhi-Chao Zhao,
Yu Bai,
Tuan Yi,
Ling-Lin Zheng,
Ze-Ming Zhou,
Ji-Feng Liu
Abstract:
Single-line spectroscopic binaries have recently contributed to the discovery of stellar-mass black holes, independently of the X-ray transient method. We report the identification of a single-line binary system LTD064402+245919, with an orbital period of 14.50 days. The observed component is a subgiant with a mass of 2.77$\pm$0.68M$_{\odot}$, radius 15.5$\pm$2.5R$_{\odot}$, effective temperature $T_{\rm eff}$ 4500$\pm$200K, and surface gravity $\log g$ 2.5$\pm$0.25dex. The discovery makes use of the LAMOST time-domain (LAMOST-TD) and ZTF surveys. Our general-purpose software pipeline applies the Lomb-Scargle periodogram to determine the orbital period and uses machine learning to classify the variable type from the folded light curves. We apply a combined model to estimate the orbital parameters from both the light and radial velocity curves, taking constraints on the primary star mass, mass function, and detection limit of secondary luminosity into consideration. We obtain a radial velocity semi-amplitude of 44.6$\pm$1.5 km s$^{-1}$, mass ratio of 0.73$\pm$0.07, and an undetected component mass of 2.02$\pm$0.49M$_{\odot}$ when the type of the undetected component is not set. We conclude that the inclination is not well constrained, and that the secondary mass is larger than 1M$_{\odot}$ when the undetected component is modelled as a compact object. According to our investigations using an MCMC simulation, increasing the spectra SNR by a factor of 3 would enable the secondary light to be distinguished (if present). The algorithm and software in this work are able to serve as general-purpose tools for the identification of compact objects quiescent in X-rays.
Submitted 24 October, 2021; v1 submitted 15 October, 2021;
originally announced October 2021.
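The abstract above lists the mass function among the constraints on the orbital fit. As a rough illustration only, the standard spectroscopic binary mass function, f(M) = P K^3 (1 - e^2)^{3/2} / (2πG), can be evaluated from the quoted period and semi-amplitude. The sketch below assumes a circular orbit (e = 0), which the abstract does not state; the function name and constants are illustrative and are not taken from the authors' pipeline.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
DAY = 86400.0      # seconds per day


def mass_function(period_days, k_km_s, ecc=0.0):
    """Binary mass function in solar masses.

    period_days: orbital period in days
    k_km_s: radial velocity semi-amplitude of the visible star in km/s
    ecc: orbital eccentricity (assumed 0 here; not given in the abstract)
    """
    period_s = period_days * DAY
    k_m_s = k_km_s * 1.0e3
    f_kg = period_s * k_m_s**3 * (1.0 - ecc**2) ** 1.5 / (2.0 * math.pi * G)
    return f_kg / M_SUN


# Values quoted in the abstract: P = 14.50 d, K = 44.6 km/s.
f_m = mass_function(14.50, 44.6)
print(f"f(M) = {f_m:.3f} Msun")
```

Under these assumptions the mass function comes out at roughly 0.13 solar masses. It is a lower bound on the unseen companion's mass and does not by itself fix the inclination.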
-
The Three-Sided Pyramid Wavefront Sensor. I. Simulations and Analysis for Astronomical Adaptive Optics
Authors:
Lauren Schatz,
Jared R. Males,
Carlos Correia,
Benoit Neichel,
Vincent Chambouleyron,
Johanan Codona,
Olivier Fauvarque,
Jean-François Sauvage,
Thierry Fusco,
Michael Hart,
Pierre Janin-Potiron,
Robert Johnson,
Joseph Long,
Mala Mateen
Abstract:
For ExAO instruments for the Giant Segmented Mirror Telescopes (GSMTs), alternative architectures of WFS are under consideration because there is a tradeoff between detector size, speed, and noise that reduces the performance of GSMT-ExAO wavefront control. One option under consideration for a GSMT-ExAO wavefront sensor is a three-sided PWFS (3PWFS). The 3PWFS creates three copies of the telescope pupil for wavefront sensing, compared to the conventional four-sided PWFS (4PWFS) which uses four pupils. The 3PWFS uses fewer detector pixels than the 4PWFS and should therefore be less sensitive to read noise. Here we develop a mathematical formalism based on the diffraction theory description of the Foucault knife edge test that predicts the intensity pattern after the PWFS. Our formalism allows us to calculate the intensity in the pupil images formed by the PWFS in the presence of phase errors corresponding to arbitrary Fourier modes. We then use the Object Oriented MATLAB Adaptive Optics toolbox (OOMAO) to simulate an end-to-end model of an adaptive optics system using a PWFS with modulation and compare the performance of the 3PWFS to the 4PWFS. In the case of a low read noise detector, the Strehl ratios of the 3PWFS and 4PWFS are within 0.01. When we included higher read noise in the simulation, we found a Strehl ratio gain of 0.036 for the 3PWFS using Raw Intensity over the 4PWFS using Slopes Maps at a stellar magnitude of 10. At the same magnitude, the 4PWFS RI also outperformed the 4PWFS SM, but the gain was only 0.012 Strehl. This is significant because 4PWFS using Slopes Maps is how the PWFS is conventionally used for AO wavefront sensing. We have found that the 3PWFS is a viable wavefront sensor that can fully reconstruct a wavefront and produce a stable closed-loop with correction comparable to that of a 4PWFS, with modestly better performance for high read-noise detectors.
Submitted 13 September, 2021;
originally announced September 2021.
-
SDSS-IV MaNGA: Stellar M/L gradients and the M/L-colour relation in galaxies
Authors:
Junqiang Ge,
Shude Mao,
Youjun Lu,
Michele Cappellari,
Richard J. Long,
Renbin Yan
Abstract:
The stellar mass-to-light ratio gradient in SDSS $r-$band $\nabla (M_*/L_r)$ of a galaxy depends on its mass assembly history, which is imprinted in its morphology and gradients of age, metallicity, and stellar initial mass function (IMF). Taking a MaNGA sample of 2051 galaxies with stellar masses ranging from $10^9$ to $10^{12}M_\odot$ released in SDSS DR15, we focus on face-on galaxies, without merger and bar signatures, and investigate the dependence of the 2D $\nabla (M_*/L_r)$ on other galaxy properties, including $M_*/L_r$-colour relationships by assuming a fixed Salpeter IMF as the mass normalization reference. The median gradient is $\nabla M_*/L_r\sim -0.1$ (i.e., the $M_*/L_r$ is larger at the centre) for massive galaxies, becomes flat around $M_*\sim 10^{10} M_{\odot}$ and changes sign to $\nabla M_*/L_r\sim 0.1$ at the lowest masses. The $M_*/L_r$ inside a half light radius increases with increasing galaxy stellar mass; in each mass bin, early-type galaxies have the highest value, while pure-disk late-type galaxies have the smallest. Correlation analyses suggest that the mass-weighted stellar age is the dominant parameter influencing the $M_*/L_r$ profile, since a luminosity-weighted age is easily affected by star formation when the specific star formation rate (sSFR) inside the half light radius is higher than $10^{-3} {\rm Gyr}^{-1}$. With increased sSFR gradient, one can obtain a steeper negative $\nabla (M_*/L_r)$. The scatter in the slopes of $M_*/L$-colour relations increases with increasing sSFR; for example, the slope for post-starburst galaxies can be flattened to $0.45$ from the global value $0.87$ in the $M_*/L$ vs. $g-r$ diagram. Hence converting galaxy colours to $M_*/L$ should be done carefully, especially for those galaxies with young luminosity-weighted stellar ages, which can have quite different star formation histories.
Submitted 11 August, 2021;
originally announced August 2021.
-
A spectral-based analysis of the separation between two-layer neural networks and linear methods
Authors:
Lei Wu,
Jihao Long
Abstract:
We propose a spectral-based approach to analyze how two-layer neural networks separate from linear methods in terms of approximating high-dimensional functions. We show that quantifying this separation can be reduced to estimating the Kolmogorov width of two-layer neural networks, and the latter can be further characterized by using the spectrum of an associated kernel. Unlike previous work, our approach yields upper bounds, lower bounds, and explicit hard functions in a unified manner. We provide a systematic study of how the choice of activation functions affects the separation, in particular the dependence on the input dimension. Specifically, for nonsmooth activation functions, we extend known results to more activation functions with sharper bounds. As concrete examples, we prove that any single neuron can instantiate the separation between neural networks and random feature models. For smooth activation functions, one surprising finding is that the separation is negligible unless the norms of inner-layer weights are polynomially large with respect to the input dimension. By contrast, the separation for nonsmooth activation functions is independent of the norms of inner-layer weights.
Submitted 23 February, 2022; v1 submitted 10 August, 2021;
originally announced August 2021.
-
The Secret Higgstory of the Highest Temperature during Reheating
Authors:
Samuel Passaglia,
Wayne Hu,
Andrew J. Long,
David Zegeye
Abstract:
We study the role of the Standard Model Higgs condensate, formed during cosmological inflation, in the epoch of reheating that follows. We focus on the scenario where the inflaton decays slowly and perturbatively, so that there is a long period between the end of inflation and the beginning of radiation domination. The Higgs condensate decays non-perturbatively during this period, and we show that it heats the primordial plasma to much higher temperatures than would result from the slowly-decaying inflaton alone. We discuss the effect of this hot plasma on the thermalization of the inflaton's decay products, and study its phenomenological implications for the formation of cosmological relics like dark matter, with associated isocurvature fluctuations, and the restoration of the electroweak and Peccei-Quinn symmetries.
Submitted 22 November, 2021; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Characterizing deformable mirrors for the MagAO-X instrument
Authors:
Kyle Van Gorkom,
Jared R. Males,
Laird M. Close,
Jennifer Lumbres,
Alex Hedglen,
Joseph D. Long,
Sebastiaan Y. Haffert,
Olivier Guyon,
Maggie Kautz,
Lauren Schatz,
Kelsey Miller,
Alexander T. Rodack,
Justin M. Knight,
Katie M. Morzinski
Abstract:
The MagAO-X instrument is a new extreme adaptive optics system for high-contrast imaging at visible and near-infrared wavelengths on the Magellan Clay Telescope. A central component of this system is a 2040-actuator microelectromechanical deformable mirror (DM) from Boston Micromachines Corp. that operates at 3.63 kHz for high-order wavefront control (the tweeter). Two additional DMs from ALPAO perform the low-order (the woofer) and non-common-path science-arm wavefront correction (the NCPC DM). Prior to integration with the instrument, we characterized these devices using a Zygo Verifire Interferometer to measure each DM surface. We present the results of the characterization effort here, demonstrating the ability to drive the tweeter to a flat of 6.9 nm root mean square (RMS) surface (and 0.56 nm RMS surface within its control bandwidth), the woofer to 2.2 nm RMS surface, and the NCPC DM to 2.1 nm RMS surface over the MagAO-X beam footprint on each device. Using focus-diversity phase retrieval on the MagAO-X science cameras to estimate the internal instrument wavefront error (WFE), we further show that the integrated DMs correct the instrument WFE to 18.7 nm RMS, which, combined with an 11.7% pupil amplitude RMS, produces a Strehl ratio of 0.94 at H$α$.
Submitted 15 July, 2021;
originally announced July 2021.
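The abstract above relates a residual wavefront error to a Strehl ratio. One common way to get a feel for that relation is the extended Maréchal approximation, S ≈ exp(-(2πσ/λ)^2), sketched below. This is an assumption made for illustration, not necessarily how the authors computed their value, and it covers the phase term only: the reported 0.94 also accounts for the 11.7% pupil amplitude RMS, which this sketch leaves out. The function name is hypothetical.

```python
import math


def marechal_strehl(wfe_rms_nm, wavelength_nm):
    """Phase-only Strehl estimate from the extended Marechal approximation.

    wfe_rms_nm: RMS wavefront error in nanometres
    wavelength_nm: observing wavelength in nanometres
    """
    sigma_phi = 2.0 * math.pi * wfe_rms_nm / wavelength_nm  # RMS phase error, radians
    return math.exp(-sigma_phi**2)


# 18.7 nm RMS WFE (from the abstract) at H-alpha, 656.3 nm.
strehl_phase_only = marechal_strehl(18.7, 656.3)
print(f"phase-only Strehl estimate: {strehl_phase_only:.3f}")
```

The phase-only estimate is about 0.97, so it sits above the quoted 0.94, as expected when amplitude errors are ignored.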
-
Spatial beam self-cleaning in bi-tapered multimode fibers
Authors:
Xiao-Jun Lin,
Yu-Xin Gao,
Jin-Gan Long,
Jia-Wen Wu,
Xiang-Yue Li,
Wei-Yi Hong,
Hu Cui,
Zhi-Chao Luo,
Wen-Cheng Xu,
Ai-Ping Luo
Abstract:
We report spatial beam self-cleaning in bi-tapered conventional multimode fibers (MMFs) with different tapered lengths. Through the introduction of the bi-tapered structure in MMFs, an input beam with poor beam quality from a high-power fiber laser can be converted to a centered, bell-shaped beam in a short length, due to the strengthened nonlinear mode coupling. It is found that the bi-tapered MMF with a longer tapered length at the same waist diameter shows a better beam self-cleaning effect and larger spectral broadening. The obtained results offer a new method to improve the beam quality of high-power lasers at low cost. In addition, they may be of interest for manufacturing bi-tapered MMF-based devices to obtain a quasi-fundamental mode beam in spatiotemporal mode-locked fiber lasers.
Submitted 8 July, 2021;
originally announced July 2021.
-
Counterfactual Explanations for Arbitrary Regression Models
Authors:
Thomas Spooner,
Danial Dervovic,
Jason Long,
Jon Shepard,
Jiahao Chen,
Daniele Magazzeni
Abstract:
We present a new method for counterfactual explanations (CFEs) based on Bayesian optimisation that applies to both classification and regression models. Our method is a globally convergent search algorithm with support for arbitrary regression models and constraints like feature sparsity and actionable recourse, and furthermore can answer multiple counterfactual questions in parallel while learning from previous queries. We formulate CFE search for regression models in a rigorous mathematical framework using differentiable potentials, which resolves robustness issues in threshold-based objectives. We prove that in this framework, (a) verifying the existence of counterfactuals is NP-complete; and (b) finding instances using such potentials is CLS-complete. We describe a unified algorithm for CFEs using a specialised acquisition function that composes both expected improvement and an exponential-polynomial (EP) family with desirable properties. Our evaluation on real-world benchmark domains demonstrates high sample-efficiency and precision.
Submitted 29 June, 2021;
originally announced June 2021.
-
Subdivergence-free gluings of trees
Authors:
Xinle Dai,
Jordan Long,
Karen Yeats
Abstract:
A gluing of two rooted trees is an identification of their leaves and un-subdivision of the resulting 2-valent vertices. A gluing of two rooted trees is subdivergence-free if it has no 2-edge cuts with both roots on the same side of the cut. The problem and its language are motivated by quantum field theory. We enumerate subdivergence-free gluings for certain families of trees, showing a connection with connected permutations, and we give algorithms to compute subdivergence-free gluings.
Submitted 10 January, 2025; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Classification of radial Kerr geodesic motion
Authors:
Geoffrey Compère,
Yan Liu,
Jiang Long
Abstract:
We classify radial timelike geodesic motion of the exterior non-extremal Kerr spacetime by performing a taxonomy of inequivalent root structures of the first order radial geodesic equation using a novel compact notation and by implementing the constraints from polar, time and azimuthal motion. Four generic root structures with only simple roots give rise to eight non-generic root structures when either one root becomes coincident with the horizon, one root vanishes or two roots become coincident. We derive the explicit phase space of all such root systems in the basis of energy, angular momentum and Carter's constant and classify whether each corresponding radial geodesic motion is allowed or disallowed from the existence of polar, time and azimuthal motion. The classification of radial motion within the ergoregion for both positive and negative energies leads to 6 distinguished values of the Kerr angular momentum. The classifications of null radial motion and near-horizon extremal Kerr radial motion are obtained as limiting cases and compared with the literature. We explicitly parametrize the separatrix describing root systems with double roots as the union of the following three regions that are described by the same quartic, respectively obtained when (1) the pericenter of bound motion becomes a double root; (2) the eccentricity of bound motion becomes zero; (3) the turning point of unbound motion becomes a double root.
Submitted 2 February, 2022; v1 submitted 6 June, 2021;
originally announced June 2021.
-
Sample Selection Bias in Evaluation of Prediction Performance of Causal Models
Authors:
James P. Long,
Min Jin Ha
Abstract:
Causal models are notoriously difficult to validate because they make untestable assumptions regarding confounding. New scientific experiments offer the possibility of evaluating causal models using prediction performance. Prediction performance measures are typically robust to violations in causal assumptions. However, prediction performance does depend on the selection of training and test sets. Biased training sets can lead to optimistic assessments of model performance. In this work, we revisit the prediction performance of several recently proposed causal models tested on a genetic perturbation data set of Kemmeren. We find that sample selection bias is likely a key driver of model performance. We propose using a less-biased evaluation set for assessing prediction performance and compare models on this new set. In this setting, the causal models have similar or worse performance compared to standard association-based estimators such as Lasso. Finally, we compare the performance of causal estimators in simulation studies that reproduce the Kemmeren structure of genetic knockout experiments but without any sample selection bias. These results provide an improved understanding of the performance of several causal models and offer guidance on how future studies should use Kemmeren.
Submitted 26 October, 2021; v1 submitted 3 June, 2021;
originally announced June 2021.
-
No evidence for axions from Chandra observation of magnetic white dwarf
Authors:
Christopher Dessert,
Andrew J. Long,
Benjamin R. Safdi
Abstract:
Ultralight axions with axion-photon couplings $g_{aγγ} \sim {\rm few} \times 10^{-11}$ GeV$^{-1}$ may resolve a number of astrophysical anomalies, such as unexpected ~TeV transparency, anomalous stellar cooling, and X-ray excesses from nearby neutron stars. We show, however, that such axions are severely constrained by the non-observation of X-rays from the magnetic white dwarf (MWD) RE J0317-853 using ~40 ks of data acquired from a dedicated observation with the Chandra X-ray Observatory. Axions may be produced in the core of the MWD through electron bremsstrahlung and then convert to X-rays in the magnetosphere. The non-observation of X-rays constrains the axion-photon coupling to $g_{aγγ} \lesssim 5.5 \times 10^{-13} \sqrt{C_{aγγ}/C_{aee}}$ GeV$^{-1}$ at 95% confidence for axion masses $m_a \lesssim 5 \times 10^{-6}$ eV, with $C_{aee}$ and $C_{aγγ}$ the dimensionless coupling constants to electrons and photons. Considering that $C_{aee}$ is generated from the renormalization group, our results robustly disfavor $g_{aγγ} \gtrsim 4.4 \times 10^{-11}$ GeV$^{-1}$ even for models with no ultraviolet contribution to $C_{aee}$.
Submitted 26 April, 2021;
originally announced April 2021.
-
A Class of Dimension-free Metrics for the Convergence of Empirical Measures
Authors:
Jiequn Han,
Ruimeng Hu,
Jihao Long
Abstract:
This paper concerns the convergence of empirical measures in high dimensions. We propose a new class of probability metrics and show that under such metrics, the convergence is free of the curse of dimensionality (CoD). Such a feature is critical for high-dimensional analysis and stands in contrast to classical metrics ({\it e.g.}, the Wasserstein metric). The proposed metrics fall into the category of integral probability metrics, for which we specify criteria of test function spaces to guarantee the property of being free of CoD. Examples of the selected test function spaces include the reproducing kernel Hilbert spaces, Barron space, and flow-induced function spaces. Three applications of the proposed metrics are presented: 1. The convergence of the empirical measure in the case of random variables; 2. The convergence of an $n$-particle system to the solution of the McKean-Vlasov stochastic differential equation; 3. The construction of an $\varepsilon$-Nash equilibrium for a homogeneous $n$-player game by its mean-field limit. As a byproduct, we prove that, given a distribution close to the target distribution measured by our metric and a certain representation of the target distribution, we can generate a distribution close to the target one in terms of the Wasserstein metric and relative entropy. Overall, we show that the proposed class of metrics is a powerful tool to analyze the convergence of empirical measures in high dimensions without CoD.
Submitted 16 September, 2023; v1 submitted 24 April, 2021;
originally announced April 2021.
-
The vector-apodizing phase plate coronagraph: design, current performance, and future development
Authors:
D. S. Doelman,
F. Snik,
E. H. Por,
S. P. Bos,
G. P. P. L. Otten,
M. Kenworthy,
S. Y. Haffert,
M. Wilby,
A. J. Bohn,
B. J. Sutlieff,
K. Miller,
M. Ouellet,
J. de Boer,
C. U. Keller,
M. J. Escuti,
S. Shi,
N. Z. Warriner,
K. J. Hornburg,
J. L. Birkby,
J. Males,
K. M. Morzinski,
L. M. Close,
J. Codona,
J. Long,
L. Schatz
, et al. (28 additional authors not shown)
Abstract:
Over the last decade, the vector-apodizing phase plate (vAPP) coronagraph has been developed from concept to on-sky application in many high-contrast imaging systems on 8-m class telescopes. The vAPP is a geometric-phase patterned coronagraph that is inherently broadband, and its manufacturing is enabled only by direct-write technology for liquid-crystal patterns. The vAPP generates two coronagraphic PSFs that cancel starlight on opposite sides of the point spread function (PSF) and have opposite circular polarization states. The efficiency, that is, the amount of light in these PSFs, depends on the retardance offset from half-wave of the liquid-crystal retarder. Using different liquid-crystal recipes to tune the retardance, different vAPPs operate with high efficiencies ($>96\%$) in the visible and thermal infrared (0.55 $μ$m to 5 $μ$m). Since 2015, seven vAPPs have been installed in a total of six different instruments, including Magellan/MagAO, Magellan/MagAO-X, Subaru/SCExAO, and LBT/LMIRcam. Using two integral field spectrographs installed on the latter two instruments, these vAPPs can provide low-resolution spectra (R$\sim$30) between 1 $μ$m and 5 $μ$m. We review the design process, development, commissioning, on-sky performance, and first scientific results of all commissioned vAPPs. We report on the lessons learned and conclude with perspectives for future developments and applications.
Submitted 4 November, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
An Empirical Bayesian Approach to Limb-darkening in Modeling WASP-121b Transit Light Curves
Authors:
Fan Yang,
Richard J. Long,
Ji-Feng Liu,
Su-Su Shan,
Rui Guo,
Bo Zhang,
Tuan Yi,
Ling-Lin Zheng,
Zhi-Chao Zhao
Abstract:
We present a novel, iterative method using an empirical Bayesian approach for modeling the limb darkened WASP-121b transit from the TESS light curve. Our method is motivated by the need to improve $R_{p}/R_{\ast}$ estimates for exoplanet atmosphere modeling, and is particularly effective with the limb darkening (LD) quadratic law requiring no prior central value from stellar atmospheric models. With the non-linear LD law, the method has all the advantages of not needing atmospheric models but does not converge. The iterative method gives a different $R_{p}/R_{\ast}$ for WASP-121b at a significance level of 1$σ$ when compared with existing non-iterative methods. To assess the origins and implications of this difference, we generate and analyze light curves with known values of the limb darkening coefficients (LDCs). We find that non-iterative modeling with LDC priors from stellar atmospheric models results in an inconsistent $R_{p}/R_{\ast}$ at the 1.5$σ$ level when the known LDC values are set to those previously found when modeling real data by the iterative method. In contrast, the LDC values from the iterative modeling yield the correct value of $R_{p}/R_{\ast}$ to within 0.25$σ$. For more general cases with different known inputs, Monte Carlo simulations show that the iterative method obtains unbiased LDCs and correct $R_{p}/R_{\ast}$ to within a significance level of 0.3$σ$. Biased LDC priors can cause biased LDC posteriors and lead to bias in the $R_{p}/R_{\ast}$ of up to 0.82$\%$, 2.5$σ$ for the quadratic law and 0.32$\%$, 1.0$σ$ for the non-linear law. Our improvement in $R_{p}/R_{\ast}$ estimation is important when analyzing exoplanet atmospheres.
Submitted 15 April, 2021;
originally announced April 2021.
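For readers unfamiliar with the quadratic limb darkening law mentioned in the abstract above, its standard form is I(μ)/I(1) = 1 - u1(1 - μ) - u2(1 - μ)^2, where μ is the cosine of the angle between the line of sight and the stellar surface normal. The sketch below only evaluates this profile. The coefficient values are hypothetical placeholders, not the LDCs fitted for WASP-121, and the function name is illustrative.

```python
def quadratic_limb_darkening(mu, u1, u2):
    """Normalised intensity I(mu)/I(1) under the quadratic limb darkening law.

    mu: cosine of the angle from disk centre (1 at centre, 0 at the limb)
    u1, u2: quadratic limb darkening coefficients (LDCs)
    """
    return 1.0 - u1 * (1.0 - mu) - u2 * (1.0 - mu) ** 2


# Hypothetical coefficients chosen only to show the shape of the profile.
U1, U2 = 0.4, 0.2
for mu in (1.0, 0.5, 0.0):
    print(f"mu = {mu:.1f}: I/I(1) = {quadratic_limb_darkening(mu, U1, U2):.3f}")
```

Because the transit depth depends on how bright the occulted part of the disk is, errors in u1 and u2 feed into the fitted $R_{p}/R_{\ast}$, which is the sensitivity the abstract quantifies.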
-
An $L^2$ Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation
Authors:
Jihao Long,
Jiequn Han,
Weinan E
Abstract:
Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives rise to error bounds that involve either the number of states or the number of features. This paper considers the situation where the function approximation is made either using the kernel method or the two-layer neural network model, in the context of a fitted Q-iteration algorithm with explicit regularization. We establish an $\tilde{O}(H^3|\mathcal {A}|^{\frac14}n^{-\frac14})$ bound for the optimal policy with $Hn$ samples, where $H$ is the length of each episode and $|\mathcal {A}|$ is the size of action space. Our analysis hinges on analyzing the $L^2$ error of the approximated Q-function using $n$ data points. Even though this result still requires a finite-sized action space, the error bound is independent of the dimensionality of the state space.
Submitted 15 February, 2022; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Calibration of the Air Shower Energy Scale of the Water and Air Cherenkov Techniques in the LHAASO experiment
Authors:
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
H. Cai,
J. T. Cai,
Z. Cao Z. Cao,
J. Chang,
J. F. Chang,
X. C. Chang,
B. M. Chen,
J. Chen,
L. Chen,
L. Chen,
L. Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (233 additional authors not shown)
Abstract:
The Wide Field-of-View Cherenkov Telescope Array (WFCTA) and the Water Cherenkov Detector Arrays (WCDA) of LHAASO are designed to work in combination for measuring the energy spectra of various cosmic ray species over a very wide energy range from a few TeV to 10 PeV. The energy calibration of WCDA can be achieved with a proven technique of measuring the westward shift of the Moon shadow of galactic cosmic rays due to the geomagnetic field. This deflection angle $Δ$ is inversely proportional to the energy of the cosmic rays. The precise measurements of the shifts by WCDA allow us to calibrate its energy scale for energies as high as 35 TeV. The energy scale measured by WCDA can be used to cross-calibrate the energy reconstructed by WFCTA, which spans the whole energy range up to 10 PeV. In this work, we demonstrate the feasibility of the method using the data collected from April 2019 to January 2020 by the WFCTA array and the WCDA-1 detector, the first of the three water Cherenkov ponds, already commissioned at the LHAASO site.
Submitted 13 April, 2021; v1 submitted 11 April, 2021;
originally announced April 2021.