Search | arXiv e-print repository

Glo-In-One: Holistic Glomerular Detection, Segmentation, and Lesion Characterization with Large-scale Web Image Mining

Authors: Tianyuan Yao, Yuzhe Lu, Jun Long, Aadarsh Jha, Zheyu Zhu, Zuhayr Asad, Haichun Yang, Agnes B. Fogo, Yuankai Huo

Abstract: The quantitative detection, segmentation, and characterization of glomeruli from high-resolution whole slide imaging (WSI) play essential roles in the computer-assisted diagnosis and scientific research in digital renal pathology. Historically, such comprehensive quantification requires extensive programming skills in order to be able to handle heterogeneous and customized computational tools. To… ▽ More The quantitative detection, segmentation, and characterization of glomeruli from high-resolution whole slide imaging (WSI) play essential roles in the computer-assisted diagnosis and scientific research in digital renal pathology. Historically, such comprehensive quantification requires extensive programming skills in order to be able to handle heterogeneous and customized computational tools. To bridge the gap of performing glomerular quantification for non-technical users, we develop the Glo-In-One toolkit to achieve holistic glomerular detection, segmentation, and characterization via a single line of command. Additionally, we release a large-scale collection of 30,000 unlabeled glomerular images to further facilitate the algorithmic development of self-supervised deep learning. The inputs of the Glo-In-One toolkit are WSIs, while the outputs are (1) WSI-level multi-class circle glomerular detection results (which can be directly manipulated with ImageScope), (2) glomerular image patches with segmentation masks, and (3) different lesion types. To leverage the performance of the Glo-In-One toolkit, we introduce self-supervised deep learning to glomerular quantification via large-scale web image mining. The GGS fine-grained classification model achieved a decent performance compared with baseline supervised methods while only using 10% of the annotated data. The glomerular detection achieved an average precision of 0.627 with circle representations, while the glomerular segmentation achieved a 0.955 patch-wise Dice Similarity Coefficient (DSC). △ Less

Submitted 31 May, 2022; originally announced June 2022.

arXiv:2205.13624 [pdf, other]

Faster Optimization on Sparse Graphs via Neural Reparametrization

Authors: Nima Dehmamy, Csaba Both, Jianzhi Long, Rose Yu

Abstract: In mathematical optimization, second-order Newton's methods generally converge faster than first-order methods, but they require the inverse of the Hessian, hence are computationally expensive. However, we discover that on sparse graphs, graph neural networks (GNN) can implement an efficient Quasi-Newton method that can speed up optimization by a factor of 10-100x. Our method, neural reparametriza… ▽ More In mathematical optimization, second-order Newton's methods generally converge faster than first-order methods, but they require the inverse of the Hessian, hence are computationally expensive. However, we discover that on sparse graphs, graph neural networks (GNN) can implement an efficient Quasi-Newton method that can speed up optimization by a factor of 10-100x. Our method, neural reparametrization, modifies the optimization parameters as the output of a GNN to reshape the optimization landscape. Using a precomputed Hessian as the propagation rule, the GNN can effectively utilize the second-order information, reaching a similar effect as adaptive gradient methods. As our method solves optimization through architecture design, it can be used in conjunction with any optimizers such as Adam and RMSProp. We show the application of our method on scientifically relevant problems including heat diffusion, synchronization and persistent homology. △ Less

Submitted 26 May, 2022; originally announced May 2022.

arXiv:2205.08622 [pdf, other]

Solving Optimal Control Problems of Rigid-Body Dynamics with Collisions Using the Hybrid Minimum Principle

Authors: Wei Hu, Jihao Long, Yaohua Zang, Weinan E, Jiequn Han

Abstract: Collisions are common in many dynamical systems with real applications. They can be formulated as hybrid dynamical systems with discontinuities automatically triggered when states transverse certain manifolds. We present an algorithm for the optimal control problem of such hybrid dynamical systems based on solving the equations derived from the hybrid minimum principle (HMP). The algorithm is an i… ▽ More Collisions are common in many dynamical systems with real applications. They can be formulated as hybrid dynamical systems with discontinuities automatically triggered when states transverse certain manifolds. We present an algorithm for the optimal control problem of such hybrid dynamical systems based on solving the equations derived from the hybrid minimum principle (HMP). The algorithm is an iterative scheme following the spirit of the method of successive approximations (MSA), and it is robust to undesired collisions observed in the initial guesses. We propose several techniques to address the additional numerical challenges introduced by the presence of discontinuities. The algorithm is tested on disc collision problems whose optimal solutions exhibit one or multiple collisions. Linear convergence in terms of iteration steps and asymptotic first-order accuracy in terms of time discretization are observed when the algorithm is implemented with the forward-Euler scheme. The numerical results demonstrate that the proposed algorithm has better accuracy and convergence than direct methods based on gradient descent. Furthermore, the algorithm is also simpler, more accurate, and more stable than a deep reinforcement learning method. △ Less

Submitted 17 January, 2025; v1 submitted 17 May, 2022; originally announced May 2022.

MSC Class: 49Mxx

arXiv:2205.07990 [pdf, other]

Empowering Optimal Control with Machine Learning: A Perspective from Model Predictive Control

Authors: Weinan E, Jiequn Han, Jihao Long

Abstract: Solving complex optimal control problems have confronted computational challenges for a long time. Recent advances in machine learning have provided us with new opportunities to address these challenges. This paper takes model predictive control, a popular optimal control method, as the primary example to survey recent progress that leverages machine learning techniques to empower optimal control… ▽ More Solving complex optimal control problems have confronted computational challenges for a long time. Recent advances in machine learning have provided us with new opportunities to address these challenges. This paper takes model predictive control, a popular optimal control method, as the primary example to survey recent progress that leverages machine learning techniques to empower optimal control solvers. We also discuss some of the main challenges encountered when applying machine learning to develop more robust optimal control algorithms. △ Less

Submitted 20 July, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

arXiv:2205.07622 [pdf]

Photoelastic Stress Response of Complex 3D-Printed Particle Shapes

Authors: Negin Amini, Josh Tuohey, John M. Long, Jun Zhang, David A. V. Morton, Karen Daniels, Farnaz Fazelpour, Karen P. Hapgood

Abstract: While stress visualization within 3-dimensional particles would greatly advance our understanding of the behaviors of complex particles, traditional photoelastic methods suffer from a lack of available technology for producing suitable complex particles. Recently, 3D-printing has created new possibilities for enhancing the scope of stress analysis within physically representative granules. Here, w… ▽ More While stress visualization within 3-dimensional particles would greatly advance our understanding of the behaviors of complex particles, traditional photoelastic methods suffer from a lack of available technology for producing suitable complex particles. Recently, 3D-printing has created new possibilities for enhancing the scope of stress analysis within physically representative granules. Here, we investigate and evaluate opportunities offered by 3D-printing a single particle with a complex external shape with photoelastic properties. We report the results of X-ray computed tomography and 3D-printing, combined with traditional photoelastic analysis, to visualize strain for particles ranging from simple 2D discs to complex 3D printed coffee beans, including with internal voids. We find that the relative orientation of the print layers and the loading force affects the optical response of the discs, but without a significant difference in their mechanical properties. Furthermore, we present semi-quantitative measurements of stresses within 3D-printed complex particles. The paper outlines the potential limitations and areas of future interest for stress visualization of 3-dimensional particles. △ Less

Submitted 16 May, 2022; originally announced May 2022.

arXiv:2205.07554 [pdf, other]

doi 10.1051/0004-6361/202243311

Towards on-sky adaptive optics control using reinforcement learning

Authors: J. Nousiainen, C. Rajani, M. Kasper, T. Helin, S. Y. Haffert, C. Vérinaud, J. R. Males, K. Van Gorkom, L. M. Close, J. D. Long, A. D. Hedglen, O. Guyon, L. Schatz, M. Kautz, J. Lumbres, A. Rodack, J. M. Knight, K. Miller

Abstract: The direct imaging of potentially habitable Exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the… ▽ More The direct imaging of potentially habitable Exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the habitable exoplanets are located at small angular separations from their host stars, where the current XAO systems' control laws leave strong residuals.Current AO control strategies like static matrix-based wavefront reconstruction and integrator control suffer from temporal delay error and are sensitive to mis-registration, i.e., to dynamic variations of the control system geometry. We aim to produce control methods that cope with these limitations, provide a significantly improved AO correction and, therefore, reduce the residual flux in the coronagraphic point spread function. We extend previous work in Reinforcement Learning for AO. The improved method, called PO4AO, learns a dynamics model and optimizes a control neural network, called a policy. We introduce the method and study it through numerical simulations of XAO with Pyramid wavefront sensing for the 8-m and 40-m telescope aperture cases. We further implemented PO4AO and carried out experiments in a laboratory environment using MagAO-X at the Steward laboratory. PO4AO provides the desired performance by improving the coronagraphic contrast in numerical simulations by factors 3-5 within the control region of DM and Pyramid WFS, in simulation and in the laboratory. The presented method is also quick to train, i.e., on timescales of typically 5-10 seconds, and the inference time is sufficiently small (< ms) to be used in real-time control for XAO with currently available hardware even for extremely large telescopes. △ Less

Submitted 16 May, 2022; originally announced May 2022.

Journal ref: A&A 664, A71 (2022)

arXiv:2205.07338 [pdf, other]

Reductive MDPs: A Perspective Beyond Temporal Horizons

Authors: Thomas Spooner, Rui Silva, Joshua Lockhart, Jason Long, Vacslav Glukhov

Abstract: Solving general Markov decision processes (MDPs) is a computationally hard problem. Solving finite-horizon MDPs, on the other hand, is highly tractable with well known polynomial-time algorithms. What drives this extreme disparity, and do problems exist that lie between these diametrically opposed complexities? In this paper we identify and analyse a sub-class of stochastic shortest path problems… ▽ More Solving general Markov decision processes (MDPs) is a computationally hard problem. Solving finite-horizon MDPs, on the other hand, is highly tractable with well known polynomial-time algorithms. What drives this extreme disparity, and do problems exist that lie between these diametrically opposed complexities? In this paper we identify and analyse a sub-class of stochastic shortest path problems (SSPs) for general state-action spaces whose dynamics satisfy a particular drift condition. This construction generalises the traditional, temporal notion of a horizon via decreasing reachability: a property called reductivity. It is shown that optimal policies can be recovered in polynomial-time for reductive SSPs -- via an extension of backwards induction -- with an efficient analogue in reductive MDPs. The practical considerations of the proposed approach are discussed, and numerical verification provided on a canonical optimal liquidation problem. △ Less

Submitted 15 May, 2022; originally announced May 2022.

Comments: 15 pages, 10 figures, 1 algorithm

arXiv:2205.04158 [pdf, other]

doi 10.1103/PhysRevC.106.024328

Double-Weak Decays of $^{124}$Xe and $^{136}$Xe in the XENON1T and XENONnT Experiments

Authors: E. Aprile, K. Abe, F. Agostini, S. Ahmed Maouloud, M. Alfonsi, L. Althueser, B. Andrieu, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, L. Bellagamba, R. Biondi, A. Bismark, A. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Cai, C. Capelli, J. M. R. Cardoso, D. Cichon , et al. (135 additional authors not shown)

Abstract: We present results on the search for double-electron capture ($2ν\text{ECEC}$) of $^{124}$Xe and neutrinoless double-$β$ decay ($0νββ$) of $^{136}$Xe in XENON1T. We consider captures from the K- up to the N-shell in the $2ν\text{ECEC}$ signal model and measure a total half-life of $T_{1/2}^{2ν\text{ECEC}}=(1.1\pm0.2_\text{stat}\pm0.1_\text{sys})\times 10^{22}\;\text{yr}$ with a… ▽ More We present results on the search for double-electron capture ($2ν\text{ECEC}$) of $^{124}$Xe and neutrinoless double-$β$ decay ($0νββ$) of $^{136}$Xe in XENON1T. We consider captures from the K- up to the N-shell in the $2ν\text{ECEC}$ signal model and measure a total half-life of $T_{1/2}^{2ν\text{ECEC}}=(1.1\pm0.2_\text{stat}\pm0.1_\text{sys})\times 10^{22}\;\text{yr}$ with a $0.87\;\text{kg}\times\text{yr}$ isotope exposure. The statistical significance of the signal is $7.0\,σ$. We use XENON1T data with $36.16\;\text{kg}\times\text{yr}$ of $^{136}$Xe exposure to search for $0νββ$. We find no evidence of a signal and set a lower limit on the half-life of $T_{1/2}^{0νββ} > 1.2 \times 10^{24}\;\text{yr}\; \text{at}\; 90\,\%\;\text{CL}$. This is the best result from a dark matter detector without an enriched target to date. We also report projections on the sensitivity of XENONnT to $0νββ$. Assuming a $275\;\text{kg}\times\text{yr}$ $^{136}$Xe exposure, the expected sensitivity is $T_{1/2}^{0νββ} > 2.1 \times 10^{25}\;\text{yr}\; \text{at}\; 90\,\%\;\text{CL}$, corresponding to an effective Majorana mass range of $\langle m_{ββ} \rangle < (0.19 - 0.59)\;\text{eV/c}^2$. △ Less

Submitted 6 September, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: 23 pages, 14 figures, 9 tables, version accepted for publication in Phys. Rev. C

Journal ref: Phys. Rev. C 106, 024328 (2022)

arXiv:2204.11924 [pdf, other]

Learning High-Dimensional McKean-Vlasov Forward-Backward Stochastic Differential Equations with General Distribution Dependence

Authors: Jiequn Han, Ruimeng Hu, Jihao Long

Abstract: One of the core problems in mean-field control and mean-field games is to solve the corresponding McKean-Vlasov forward-backward stochastic differential equations (MV-FBSDEs). Most existing methods are tailored to special cases in which the mean-field interaction only depends on expectation or other moments and thus inadequate to solve problems when the mean-field interaction has full distribution… ▽ More One of the core problems in mean-field control and mean-field games is to solve the corresponding McKean-Vlasov forward-backward stochastic differential equations (MV-FBSDEs). Most existing methods are tailored to special cases in which the mean-field interaction only depends on expectation or other moments and thus inadequate to solve problems when the mean-field interaction has full distribution dependence. In this paper, we propose a novel deep learning method for computing MV-FBSDEs with a general form of mean-field interactions. Specifically, built on fictitious play, we recast the problem into repeatedly solving standard FBSDEs with explicit coefficient functions. These coefficient functions are used to approximate the MV-FBSDEs' model coefficients with full distribution dependence, and are updated by solving another supervising learning problem using training data simulated from the last iteration's FBSDE solutions. We use deep neural networks to solve standard BSDEs and approximate coefficient functions in order to solve high-dimensional MV-FBSDEs. Under proper assumptions on the learned functions, we prove that the convergence of the proposed method is free of the curse of dimensionality (CoD) by using a class of integral probability metrics previously developed in [Han, Hu and Long, arXiv:2104.12036]. The proved theorem shows the advantage of the method in high dimensions. We present the numerical performance in high-dimensional MV-FBSDE problems, including a mean-field game example of the well-known Cucker-Smale model whose cost depends on the full distribution of the forward process. △ Less

Submitted 18 September, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

MSC Class: 65C30; 68T07; 49N80; 68Q25

arXiv:2204.06668 [pdf, other]

doi 10.1140/epja/s10050-022-00730-w

High-precision half-life determination of $^{14}$O via direct $β$ counting

Authors: S. Sharma, G. F. Grinyer, G. C. Ball, J. R. Leslie, C. E. Svensson, F. A. Ali, C. Andreoiu, N. Bernier, S. S. Bhattacharjee, V. Bildstein, C. Burbadge, R. Caballero-Folch, R. Coleman, A. Diaz Varela, M. R. Dunlop, R. Dunlop, A. B. Garnsworthy, E. Gyabeng Fuakye, G. M. Huber, B. Jigmeddorj, K. Kapoor, A. T. Laffoley, K. G. Leach, J. Long, A. D. MacLean , et al. (8 additional authors not shown)

Abstract: The half-life of the superallowed Fermi $β^+$ emitter $^{14}$O was determined to high precision via a direct $β$ counting experiment performed at the Isotope Separator and Accelerator (ISAC) facility at TRIUMF. The result, $T_{1/2}$($^{14}$O) = 70619.2(76) ms, is consistent with, but is more precise than, the world average obtained from 11 previous measurements. Combining the $^{14}$O half-life de… ▽ More The half-life of the superallowed Fermi $β^+$ emitter $^{14}$O was determined to high precision via a direct $β$ counting experiment performed at the Isotope Separator and Accelerator (ISAC) facility at TRIUMF. The result, $T_{1/2}$($^{14}$O) = 70619.2(76) ms, is consistent with, but is more precise than, the world average obtained from 11 previous measurements. Combining the $^{14}$O half-life deduced in the present work with the previous most precise measurements of this quantity leads to a reduction in the overall uncertainty, by nearly a factor of 2. The new world average is $T_{1/2}$($^{14}$O) = 70619.6(63) ms with a reduced $χ^2$ value of 0.87 obtained from 8 degrees of freedom. △ Less

Submitted 13 April, 2022; originally announced April 2022.

Comments: 8 pages, 6 figures, accepted for publication in the European Physical Journal A

arXiv:2204.06516 [pdf, other]

Decentralized Collaborative Learning Framework for Next POI Recommendation

Authors: Jing Long, Tong Chen, Nguyen Quoc Viet Hung, Hongzhi Yin

Abstract: Next Point-of-Interest (POI) recommendation has become an indispensable functionality in Location-based Social Networks (LBSNs) due to its effectiveness in helping people decide the next POI to visit. However, accurate recommendation requires a vast amount of historical check-in data, thus threatening user privacy as the location-sensitive data needs to be handled by cloud servers. Although there… ▽ More Next Point-of-Interest (POI) recommendation has become an indispensable functionality in Location-based Social Networks (LBSNs) due to its effectiveness in helping people decide the next POI to visit. However, accurate recommendation requires a vast amount of historical check-in data, thus threatening user privacy as the location-sensitive data needs to be handled by cloud servers. Although there have been several on-device frameworks for privacy-preserving POI recommendations, they are still resource-intensive when it comes to storage and computation, and show limited robustness to the high sparsity of user-POI interactions. On this basis, we propose a novel decentralized collaborative learning framework for POI recommendation (DCLR), which allows users to train their personalized models locally in a collaborative manner. DCLR significantly reduces the local models' dependence on the cloud for training, and can be used to expand arbitrary centralized recommendation models. To counteract the sparsity of on-device user data when learning each local model, we design two self-supervision signals to pretrain the POI representations on the server with geographical and categorical correlations of POIs. To facilitate collaborative learning, we innovatively propose to incorporate knowledge from either geographically or semantically similar users into each local model with attentive aggregation and mutual information maximization. The collaborative learning process makes use of communications between devices while requiring only minor engagement from the central server for identifying user groups, and is compatible with common privacy preservation mechanisms like differential privacy. We evaluate DCLR with two real-world datasets, where the results show that DCLR outperforms state-of-the-art on-device frameworks and yields competitive results compared with centralized counterparts. △ Less

Submitted 31 July, 2022; v1 submitted 30 March, 2022; originally announced April 2022.

Comments: 21 Pages, 3 figures, 4 tables

arXiv:2204.03169 [pdf]

doi 10.1038/s41928-023-00936-w

Magnet-free nonreciprocal metasurface for on-demand bi-directional phase modulation

Authors: Weihao Yang, Jun Qin, Jiawei Long, Wei Yan, Yucong Yang, Chaoyang Li, En Li, Juejun Hu, Longjiang Deng, Qingyang Du, Lei Bi

Abstract: Unconstrained by Lorentz reciprocity, nonreciprocal metasurfaces are uniquely capable of encoding distinctive optical functions on forward- and backward-propagating waves. The nonreciprocal metasurfaces reported to date require external electric or magnetic field biasing or rely on nonlinear effects, both of which are challenging to practically implement. Here, we propose and experimentally realiz… ▽ More Unconstrained by Lorentz reciprocity, nonreciprocal metasurfaces are uniquely capable of encoding distinctive optical functions on forward- and backward-propagating waves. The nonreciprocal metasurfaces reported to date require external electric or magnetic field biasing or rely on nonlinear effects, both of which are challenging to practically implement. Here, we propose and experimentally realize a magnet-free, linear, and passive nonreciprocal metasurface based on self-biased magnetic meta-atoms. Record transmittance up to 77% and operation angle reaching 64 degree are experimentally demonstrated. Moreover, on-demand bidirectional phase modulation in a "LEGO-like" manner is theoretically proposed and experimentally demonstrated, enabling a cohort of nonreciprocal functionalities such as microwave isolation, nonreciprocal beam steering, nonreciprocal focusing, and nonreciprocal holography. The design can also be extended to MHz and optical frequencies, taking advantage of the wide variety of self-biased gyrotropic materials available. We foresee that the nonreciprocal metasurfaces demonstrated in this work will have a significant practical impact for applications ranging from nonreciprocal antennas and radomes to full-duplex wireless communication and radar systems. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: 18 pages, 5 figures

arXiv:2204.02592 [pdf, other]

doi 10.1145/3477495.3532066

Thinking inside The Box: Learning Hypercube Representations for Group Recommendation

Authors: Tong Chen, Hongzhi Yin, Jing Long, Quoc Viet Hung Nguyen, Yang Wang, Meng Wang

Abstract: As a step beyond traditional personalized recommendation, group recommendation is the task of suggesting items that can satisfy a group of users. In group recommendation, the core is to design preference aggregation functions to obtain a quality summary of all group members' preferences. Such user and group preferences are commonly represented as points in the vector space (i.e., embeddings), wher… ▽ More As a step beyond traditional personalized recommendation, group recommendation is the task of suggesting items that can satisfy a group of users. In group recommendation, the core is to design preference aggregation functions to obtain a quality summary of all group members' preferences. Such user and group preferences are commonly represented as points in the vector space (i.e., embeddings), where multiple user embeddings are compressed into one to facilitate ranking for group-item pairs. However, the resulted group representations, as points, lack adequate flexibility and capacity to account for the multi-faceted user preferences. Also, the point embedding-based preference aggregation is a less faithful reflection of a group's decision-making process, where all users have to agree on a certain value in each embedding dimension instead of a negotiable interval. In this paper, we propose a novel representation of groups via the notion of hypercubes, which are subspaces containing innumerable points in the vector space. Specifically, we design the hypercube recommender (CubeRec) to adaptively learn group hypercubes from user embeddings with minimal information loss during preference aggregation, and to leverage a revamped distance metric to measure the affinity between group hypercubes and item points. Moreover, to counteract the long-standing issue of data sparsity in group recommendation, we make full use of the geometric expressiveness of hypercubes and innovatively incorporate self-supervision by intersecting two groups. Experiments on four real-world datasets have validated the superiority of CubeRec over state-of-the-art baselines. △ Less

Submitted 4 December, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

Comments: SIGIR'22

arXiv:2203.12227 [pdf, ps, other]

Super-Eddington accretion of the first Galactic Ultra-luminous X-ray pulsar Swift J0243.6+6124

Authors: Liu Jiren, Peter A Jenke, Ji Long, Zhang Shuang-Nan, Zhang Shu, Ge Mingyu, Liao Jinyuan, Li Xiaobo, Song Liming

Abstract: We present a detailed timing study of the pulse profile of Swift J0243.6+6124 with HXMT and Fermi/GBM data during its 2017 giant outburst. The double-peak profile at luminosity above $5\times10^{38}$erg\,s$^{-1}$ is found to be 0.25 phase offset from that below $1.5\times10^{38}$erg\,s$^{-1}$, which strongly supports for a transition from a pencil beam to a fan beam, and thus for the formation of… ▽ More We present a detailed timing study of the pulse profile of Swift J0243.6+6124 with HXMT and Fermi/GBM data during its 2017 giant outburst. The double-peak profile at luminosity above $5\times10^{38}$erg\,s$^{-1}$ is found to be 0.25 phase offset from that below $1.5\times10^{38}$erg\,s$^{-1}$, which strongly supports for a transition from a pencil beam to a fan beam, and thus for the formation of shock dominated accretion column. During the rising stage of the high double-peak regime, the faint peak got saturated in 10-100 keV band above a luminosity of $L_t\sim1.3\times10^{39}$erg\,s$^{-1}$, which is coincident with sudden spectral changes of both the main and faint peaks. They imply a sudden change of emission pattern around $L_t$. The spin-up rate ($\dotν$) is linearly correlated with luminosity ($L$) below $L_t$, consistent with the prediction of a radiation pressure dominated (RPD) disk. The $\dotν-L$ relation flattens above $L_t$, indicating a less efficient transfer of angular momentum and a change of accretion disk geometry above $L_t$. It is likely due to irradiation of the disk by the central accretion column and indicates significant radiation feedback before the inner disk radius reaching the spherization radius. △ Less

Submitted 23 March, 2022; originally announced March 2022.

Comments: 8 pages, 5 figs, to appear on MNRAS

arXiv:2203.10696 [pdf, other]

A portable atom gravimeter operating in noisy urban environments

Authors: Bin Chen, Jinbao Long, Hongtai Xie, Chenyang Li, Luokan Chen, Bonan Jiang, Shuai Chen

Abstract: The gravimeter based on atom interferometry has potentially wide applications on building the gravity networks, geophysics as well as gravity assisted navigation. Here, we demonstrate experimentally a portable atom gravimeter operating in the noisy urban environment. Despite the influence of noisy external vibrations, our portable atom gravimeter reaches a sensitivity as good as 65 uGal/\sqrt{Hz}… ▽ More The gravimeter based on atom interferometry has potentially wide applications on building the gravity networks, geophysics as well as gravity assisted navigation. Here, we demonstrate experimentally a portable atom gravimeter operating in the noisy urban environment. Despite the influence of noisy external vibrations, our portable atom gravimeter reaches a sensitivity as good as 65 uGal/\sqrt{Hz} and a resolution of 1.1 uGal after 4000 s integration time, being comparable to state-of-the-art atom gravimeters. Our achievement paves the way for bring the portable atom gravimeter to field applications, such as gravity survey on a moving platform. △ Less

Submitted 20 March, 2022; originally announced March 2022.

Journal ref: COL 18(9), 090201(2020)

arXiv:2203.08103 [pdf, other]

Electric dipole moments and the search for new physics

Authors: Ricardo Alarcon, Jim Alexander, Vassilis Anastassopoulos, Takatoshi Aoki, Rick Baartman, Stefan Baeßler, Larry Bartoszek, Douglas H. Beck, Franco Bedeschi, Robert Berger, Martin Berz, Hendrick L. Bethlem, Tanmoy Bhattacharya, Michael Blaskiewicz, Thomas Blum, Themis Bowcock, Anastasia Borschevsky, Kevin Brown, Dmitry Budker, Sergey Burdin, Brendan C. Casey, Gianluigi Casse, Giovanni Cantatore, Lan Cheng, Timothy Chupp , et al. (118 additional authors not shown)

Abstract: Static electric dipole moments of nondegenerate systems probe mass scales for physics beyond the Standard Model well beyond those reached directly at high energy colliders. Discrimination between different physics models, however, requires complementary searches in atomic-molecular-and-optical, nuclear and particle physics. In this report, we discuss the current status and prospects in the near fu… ▽ More Static electric dipole moments of nondegenerate systems probe mass scales for physics beyond the Standard Model well beyond those reached directly at high energy colliders. Discrimination between different physics models, however, requires complementary searches in atomic-molecular-and-optical, nuclear and particle physics. In this report, we discuss the current status and prospects in the near future for a compelling suite of such experiments, along with developments needed in the encompassing theoretical framework. △ Less

Submitted 4 April, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021; updated with community edits and endorsements

arXiv:2203.08019 [pdf, other]

Optimal Admission Control for Multiclass Queues with Time-Varying Arrival Rates via State Abstraction

Authors: Marc Rigter, Danial Dervovic, Parisa Hassanzadeh, Jason Long, Parisa Zehtabi, Daniele Magazzeni

Abstract: We consider a novel queuing problem where the decision-maker must choose to accept or reject randomly arriving tasks into a no buffer queue which are processed by $N$ identical servers. Each task has a price, which is a positive real number, and a class. Each class of task has a different price distribution and service rate, and arrives according to an inhomogenous Poisson process. The objective i… ▽ More We consider a novel queuing problem where the decision-maker must choose to accept or reject randomly arriving tasks into a no buffer queue which are processed by $N$ identical servers. Each task has a price, which is a positive real number, and a class. Each class of task has a different price distribution and service rate, and arrives according to an inhomogenous Poisson process. The objective is to decide which tasks to accept so that the total price of tasks processed is maximised over a finite horizon. We formulate the problem as a discrete time Markov Decision Process (MDP) with a hybrid state space. We show that the optimal value function has a specific structure, which enables us to solve the hybrid MDP exactly. Moreover, we prove that as the time step is reduced, the discrete time solution approaches the optimal solution to the original continuous time problem. To improve the scalability of our approach to a greater number of task classes, we present an approximation based on state abstraction. We validate our approach on synthetic data, as well as a real financial fraud data set, which is the motivating application for this work. △ Less

Submitted 14 March, 2022; originally announced March 2022.

Comments: 7+1 pages main text, 16 pages supplementary material, accepted to AAAI 2022

arXiv:2203.06753 [pdf, other]

A Machine Learning Enhanced Algorithm for the Optimal Landing Problem

Authors: Yaohua Zang, Jihao Long, Xuanxi Zhang, Wei Hu, Weinan E, Jiequn Han

Abstract: We propose a machine learning enhanced algorithm for solving the optimal landing problem. Using Pontryagin's minimum principle, we derive a two-point boundary value problem for the landing problem. The proposed algorithm uses deep learning to predict the optimal landing time and a space-marching technique to provide good initial guesses for the boundary value problem solver. The performance of the… ▽ More We propose a machine learning enhanced algorithm for solving the optimal landing problem. Using Pontryagin's minimum principle, we derive a two-point boundary value problem for the landing problem. The proposed algorithm uses deep learning to predict the optimal landing time and a space-marching technique to provide good initial guesses for the boundary value problem solver. The performance of the proposed method is studied using the quadrotor example, a reasonably high dimensional and strongly nonlinear system. Drastic improvement in reliability and efficiency is observed. △ Less

Submitted 13 March, 2022; originally announced March 2022.

arXiv:2203.06680 [pdf, other]

Early-Universe Model Building

Authors: Pouya Asadi, Saurabh Bansal, Asher Berlin, Raymond T. Co, Djuna Croon, Yanou Cui, David Curtin, Francis-Yan Cyr-Racine, Hooman Davoudiasl, Luigi Delle Rose, Marco Drewes, Jeff A. Dror, Gilly Elor, Oliver Gould, Keisuke Harigaya, Saniya Heeba, Yonit Hochberg, Anson Hook, Seyda Ipek, Eric Kuflik, Andrew J. Long, Robert McGehee, Nadav Joseph Outmezguine, Giuliano Panico, Vivian Poulin , et al. (15 additional authors not shown)

Abstract: Theoretical investigations into the evolution of the early universe are an essential part of particle physics that allow us to identify viable extensions to the Standard Model as well as motivated parameter space that can be probed by various experiments and observations. In this white paper, we review particle physics models of the early universe. First, we outline various models that explain two… ▽ More Theoretical investigations into the evolution of the early universe are an essential part of particle physics that allow us to identify viable extensions to the Standard Model as well as motivated parameter space that can be probed by various experiments and observations. In this white paper, we review particle physics models of the early universe. First, we outline various models that explain two essential ingredients of the early universe (dark matter and baryon asymmetry) and those that seek to address current observational anomalies. We then discuss dynamics of the early universe in models of neutrino masses, axions, and several solutions to the electroweak hierarchy problem. Finally, we review solutions to naturalness problems of the Standard Model that employ cosmological dynamics. △ Less

Submitted 7 September, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

Comments: contribution to Snowmass 2021

arXiv:2203.06508 [pdf, other]

doi 10.21468/SciPostPhysCore.6.4.075

Snowmass2021 Cosmic Frontier White Paper: Ultraheavy particle dark matter

Authors: Daniel Carney, Nirmal Raj, Yang Bai, Joshua Berger, Carlos Blanco, Joseph Bramante, Christopher Cappiello, Maíra Dutra, Reza Ebadi, Kristi Engel, Edward Kolb, J. Patrick Harding, Jason Kumar, Gordan Krnjaic, Rafael F. Lang, Rebecca K. Leane, Benjamin V. Lehmann, Shengchao Li, Andrew J. Long, Gopolang Mohlabeng, Ibles Olcina, Elisa Pueschel, Nicholas L. Rodd, Carsten Rott, Dipan Sengupta , et al. (3 additional authors not shown)

Abstract: We outline the unique opportunities and challenges in the search for "ultraheavy" dark matter candidates with masses between roughly $10~{\rm TeV}$ and the Planck scale $m_{\rm pl} \approx 10^{16}~{\rm TeV}$. This mass range presents a wide and relatively unexplored dark matter parameter space, with a rich space of possible models and cosmic histories. We emphasize that both current detectors and… ▽ More We outline the unique opportunities and challenges in the search for "ultraheavy" dark matter candidates with masses between roughly $10~{\rm TeV}$ and the Planck scale $m_{\rm pl} \approx 10^{16}~{\rm TeV}$. This mass range presents a wide and relatively unexplored dark matter parameter space, with a rich space of possible models and cosmic histories. We emphasize that both current detectors and new, targeted search techniques, via both direct and indirect detection, are poised to contribute to searches for ultraheavy particle dark matter in the coming decade. We highlight the need for new developments in this space, including new analyses of current and imminent direct and indirect experiments targeting ultraheavy dark matter and development of new, ultra-sensitive detector technologies like next-generation liquid noble detectors, neutrino experiments, and specialized quantum sensing techniques. △ Less

Submitted 27 April, 2023; v1 submitted 12 March, 2022; originally announced March 2022.

Comments: Solicited community whitepaper for the Snowmass2021 process (Cosmic frontier, particle dark matter working group). 10 pages, 3 figures, many references. Comments welcome. v2: minor revisions based on comments

Journal ref: SciPost Phys. Core 6, 075 (2023), published 6 November 2023

arXiv:2203.02309 [pdf, other]

doi 10.1088/1361-6471/ac841a

A Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics

Authors: J. Aalbers, K. Abe, V. Aerne, F. Agostini, S. Ahmed Maouloud, D. S. Akerib, D. Yu. Akimov, J. Akshat, A. K. Al Musalhi, F. Alder, S. K. Alsum, L. Althueser, C. S. Amarasinghe, F. D. Amaro, A. Ames, T. J. Anderson, B. Andrieu, N. Angelides, E. Angelino, J. Angevaare, V. C. Antochi, D. Antón Martin, B. Antunovic, E. Aprile, H. M. Araújo , et al. (572 additional authors not shown)

Abstract: The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neut… ▽ More The nature of dark matter and properties of neutrinos are among the most pressing issues in contemporary particle physics. The dual-phase xenon time-projection chamber is the leading technology to cover the available parameter space for Weakly Interacting Massive Particles (WIMPs), while featuring extensive sensitivity to many alternative dark matter candidates. These detectors can also study neutrinos through neutrinoless double-beta decay and through a variety of astrophysical sources. A next-generation xenon-based detector will therefore be a true multi-purpose observatory to significantly advance particle physics, nuclear physics, astrophysics, solar physics, and cosmology. This review article presents the science cases for such a detector. △ Less

Submitted 4 March, 2022; originally announced March 2022.

Comments: 77 pages, 40 figures, 1262 references

Report number: INT-PUB-22-003

Journal ref: J. Phys. G: Nucl. Part. Phys. 50 (2023) 013001

arXiv:2202.02833 [pdf, other]

CheXstray: Real-time Multi-Modal Data Concordance for Drift Detection in Medical Imaging AI

Authors: Arjun Soin, Jameson Merkow, Jin Long, Joseph Paul Cohen, Smitha Saligrama, Stephen Kaiser, Steven Borg, Ivan Tarapov, Matthew P Lungren

Abstract: Clinical Artificial lntelligence (AI) applications are rapidly expanding worldwide, and have the potential to impact to all areas of medical practice. Medical imaging applications constitute a vast majority of approved clinical AI applications. Though healthcare systems are eager to adopt AI solutions a fundamental question remains: \textit{what happens after the AI model goes into production?} We… ▽ More Clinical Artificial lntelligence (AI) applications are rapidly expanding worldwide, and have the potential to impact to all areas of medical practice. Medical imaging applications constitute a vast majority of approved clinical AI applications. Though healthcare systems are eager to adopt AI solutions a fundamental question remains: \textit{what happens after the AI model goes into production?} We use the CheXpert and PadChest public datasets to build and test a medical imaging AI drift monitoring workflow to track data and model drift without contemporaneous ground truth. We simulate drift in multiple experiments to compare model performance with our novel multi-modal drift metric, which uses DICOM metadata, image appearance representation from a variational autoencoder (VAE), and model output probabilities as input. Through experimentation, we demonstrate a strong proxy for ground truth performance using unsupervised distributional shifts in relevant metadata, predicted probabilities, and VAE latent representation. Our key contributions include (1) proof-of-concept for medical imaging drift detection that includes the use of VAE and domain specific statistical methods, (2) a multi-modal methodology to measure and unify drift metrics, (3) new insights into the challenges and solutions to observe deployed medical imaging AI, and (4) creation of open-source tools that enable others to easily run their own workflows and scenarios. This work has important implications. It addresses the concerning translation gap found in continuous medical imaging AI model monitoring common in dynamic healthcare environments. △ Less

Submitted 17 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

Comments: Added code url

arXiv:2201.11032 [pdf]

doi 10.1364/OE.456930

Design considerations and performance analysis of fiber laser array system for structuring orbital angular momentum beams

Authors: Tianyue Hou, Qi Chang, Jinhu Long, Pengfei Ma, Pu Zhou

Abstract: Since the advent of optical orbital angular momentum (OAM), advances in the generation and manipulation of OAM beams have continuously impacted on intriguing applications including optical communication, optical tweezers, and remote sensing. To realize the generation of high-power and fast switchable OAM beams, coherent combining of fiber lasers offers a promising way. Here in this contribution, w… ▽ More Since the advent of optical orbital angular momentum (OAM), advances in the generation and manipulation of OAM beams have continuously impacted on intriguing applications including optical communication, optical tweezers, and remote sensing. To realize the generation of high-power and fast switchable OAM beams, coherent combining of fiber lasers offers a promising way. Here in this contribution, we comprehensively investigate the coherent fiber laser array system for structuring OAM beams in terms of the design considerations and performance analysis. The performance metric and evaluation method of the laser array system are presented and introduced. Accordingly, the effect of the main sections of the laser array system, namely the high-power laser sources, emitting array configuration, and dynamic control system, on the performance of the output coherently combined OAM beams is evaluated, which reveals the system tolerance of perturbative factors and provides the guidance on system design and optimization. This work could provide beneficial reference on the practical implementation of spatially structuring high-power, fast switchable OAM beams with fiber laser arrays. △ Less

Submitted 26 January, 2022; originally announced January 2022.

arXiv:2201.04868 [pdf, other]

Interactive Data Analysis with Next-step Natural Language Query Recommendation

Authors: Xingbo Wang, Furui Cheng, Yong Wang, Ke Xu, Jiang Long, Hong Lu, Huamin Qu

Abstract: Natural language interfaces (NLIs) provide users with a convenient way to interactively analyze data through natural language queries. Nevertheless, interactive data analysis is a demanding process, especially for novice data analysts. When exploring large and complex SQL databases from different domains, data analysts do not necessarily have sufficient knowledge about different data tables and ap… ▽ More Natural language interfaces (NLIs) provide users with a convenient way to interactively analyze data through natural language queries. Nevertheless, interactive data analysis is a demanding process, especially for novice data analysts. When exploring large and complex SQL databases from different domains, data analysts do not necessarily have sufficient knowledge about different data tables and application domains. It makes them unable to systematically elicit a series of topically-related and meaningful queries for insight discovery in target domains. We develop a NLI with a step-wise query recommendation module to assist users in choosing appropriate next-step exploration actions. The system adopts a data-driven approach to suggest semantically relevant and context-aware queries for application domains of users' interest based on their query logs. Also, the system helps users organize query histories and results into a dashboard to communicate the discovered data insights. With a comparative user study, we show that our system can facilitate a more effective and systematic data analysis process than a baseline without the recommendation module. △ Less

Submitted 1 November, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

Comments: 14 pages, 6 figures

arXiv:2112.12231 [pdf, other]

doi 10.1093/ptep/ptac074

Application and modeling of an online distillation method to reduce krypton and argon in XENON1T

Authors: E. Aprile, K. Abe, F. Agostini, S. Ahmed Maouloud, M. Alfonsi, L. Althueser, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, L. Bellagamba, A. Bernard, R. Biondi, A. Bismark, A. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso, D. Cichon, B. Cimmino , et al. (129 additional authors not shown)

Abstract: A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of… ▽ More A novel online distillation technique was developed for the XENON1T dark matter experiment to reduce intrinsic background components more volatile than xenon, such as krypton or argon, while the detector was operating. The method is based on a continuous purification of the gaseous volume of the detector system using the XENON1T cryogenic distillation column. A krypton-in-xenon concentration of $(360 \pm 60)$ ppq was achieved. It is the lowest concentration measured in the fiducial volume of an operating dark matter detector to date. A model was developed and fit to the data to describe the krypton evolution in the liquid and gas volumes of the detector system for several operation modes over the time span of 550 days, including the commissioning and science runs of XENON1T. The online distillation was also successfully applied to remove Ar-37 after its injection for a low energy calibration in XENON1T. This makes the usage of Ar-37 as a regular calibration source possible in the future. The online distillation can be applied to next-generation experiments to remove krypton prior to, or during, any science run. The model developed here allows further optimization of the distillation strategy for future large scale detectors. △ Less

Submitted 14 June, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

Journal ref: Prog Theor Exp Phys (2022)

arXiv:2112.12116 [pdf, other]

doi 10.1103/PhysRevD.106.022001

Emission of Single and Few Electrons in XENON1T and Limits on Light Dark Matter

Authors: E. Aprile, K. Abe, F. Agostini, S. Ahmed Maouloud, M. Alfonsi, L. Althueser, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, L. Bellagamba, A. Bernard, R. Biondi, A. Bismark, A. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso, D. Cichon, B. Cimmino , et al. (130 additional authors not shown)

Abstract: Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effe… ▽ More Delayed single- and few-electron emissions plague dual-phase time projection chambers, limiting their potential to search for light-mass dark matter. This paper examines the origins of these events in the XENON1T experiment. Characterization of the intensity of delayed electron backgrounds shows that the resulting emissions are correlated, in time and position, with high-energy events and can effectively be vetoed. In this work we extend previous S2-only analyses down to a single electron. From this analysis, after removing the correlated backgrounds, we observe rates < 30 events/(electron*kg*day) in the region of interest spanning 1 to 5 electrons. We derive 90% confidence upper limits for dark matter-electron scattering, first direct limits on the electric dipole, magnetic dipole, and anapole interactions, and bosonic dark matter models, where we exclude new parameter space for dark photons and solar dark photons. △ Less

Submitted 2 September, 2024; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: 20 pages, 17 figures, Updated to correct published Solar Dark Photon limit

Journal ref: Phys. Rev. D 106, 022001 (2022)

arXiv:2112.06012 [pdf]

Laser array of coherent beam combination system revisited: angular domain perspective and fractal-based optimization

Authors: Tianyue Hou, Qi Chang, Pengfei Ma, Jinhu Long, Pu Zhou

Abstract: Coherent beam combination (CBC) of fiber lasers holds promise for achieving high brightness laser systems, which have given rise to widespread applications such as particle accelerator, space debris removal, and industrial fabrication. The emitting laser array of CBC systems offers intriguing features in terms of agile beam steering, flexible beam shaping, and high scalability for output power and… ▽ More Coherent beam combination (CBC) of fiber lasers holds promise for achieving high brightness laser systems, which have given rise to widespread applications such as particle accelerator, space debris removal, and industrial fabrication. The emitting laser array of CBC systems offers intriguing features in terms of agile beam steering, flexible beam shaping, and high scalability for output power and array elements. However, the theoretical model of the laser array in CBC systems is less well explored beyond the routine angular-spectrum method, where methods for optimizing the laser array configuration are more limited. Here, we explore the theory for the laser array of CBC systems in the view of angular domain. The laser array is represented by the composition of angular harmonics, the orthogonal basis over the azimuthal plane, and we elucidate the formation of mainlobe and sidelobes of the far-field interference pattern by using the orbital angular momentum spectrum analysis and azimuthal decomposition. Based on our findings, a fractal-based laser array configuration is proposed to enhance the performance of the combining system. Our work offers a deeper insight into the theoretical study and application of laser beam combination and opens opportunities for the further optimization of CBC implementations. △ Less

Submitted 11 December, 2021; originally announced December 2021.

arXiv:2112.05629 [pdf, other]

doi 10.1140/epjc/s10052-022-10345-6

Material radiopurity control in the XENONnT experiment

Authors: E. Aprile, K. Abe, F. Agostini, S. Ahmed Maouloud, M. Alfonsi, L. Althueser, E. Angelino, J. R. Angevaare, V. C. Antochi, D. Antón Martin, F. Arneodo, L. Baudis, A. L. Baxter, L. Bellagamba, R. Biondi, A. Bismark, A. Brown, S. Bruenner, G. Bruno, R. Budnik, C. Capelli, J. M. R. Cardoso, D. Cichon, B. Cimmino, M. Clark , et al. (128 additional authors not shown)

Abstract: The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove… ▽ More The selection of low-radioactive construction materials is of the utmost importance for rare-event searches and thus critical to the XENONnT experiment. Results of an extensive radioassay program are reported, in which material samples have been screened with gamma-ray spectroscopy, mass spectrometry, and $^{222}$Rn emanation measurements. Furthermore, the cleanliness procedures applied to remove or mitigate surface contamination of detector materials are described. Screening results, used as inputs for a XENONnT Monte Carlo simulation, predict a reduction of materials background ($\sim$17%) with respect to its predecessor XENON1T. Through radon emanation measurements, the expected $^{222}$Rn activity concentration in XENONnT is determined to be 4.2$\,(^{+0.5}_{-0.7})\,μ$Bq/kg, a factor three lower with respect to XENON1T. This radon concentration will be further suppressed by means of the novel radon distillation system. △ Less

Submitted 26 January, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

arXiv:2111.09892 [pdf, other]

doi 10.1103/PhysRevLett.128.091102

Upper Limit on the QCD Axion Mass from Isolated Neutron Star Cooling

Authors: Malte Buschmann, Christopher Dessert, Joshua W. Foster, Andrew J. Long, Benjamin R. Safdi

Abstract: The quantum chromodynamics (QCD) axion may modify the cooling rates of neutron stars (NSs). The axions are produced within the NS cores from nucleon bremsstrahlung and, when the nucleons are in superfluid states, Cooper pair breaking and formation processes. We show that four of the nearby isolated Magnificent Seven NSs along with PSR J0659 are prime candidates for axion cooling studies because th… ▽ More The quantum chromodynamics (QCD) axion may modify the cooling rates of neutron stars (NSs). The axions are produced within the NS cores from nucleon bremsstrahlung and, when the nucleons are in superfluid states, Cooper pair breaking and formation processes. We show that four of the nearby isolated Magnificent Seven NSs along with PSR J0659 are prime candidates for axion cooling studies because they are coeval, with ages of a few hundred thousand years known from kinematic considerations, and they have well-measured surface luminosities. We compare these data to dedicated NS cooling simulations incorporating axions, profiling over uncertainties related to the equation of state, NS masses, surface compositions, and superfluidity. Our calculations of the axion and neutrino emissivities include high-density suppression factors that also affect SN 1987A and previous NS cooling limits on axions. We find no evidence for axions in the isolated NS data, and within the context of the KSVZ QCD axion model we constrain $m_a \lesssim 16$ meV at 95% confidence. An improved understanding of NS cooling and nucleon superfluidity could further improve these limits or lead to the discovery of the axion at weaker couplings. △ Less

Submitted 18 November, 2021; originally announced November 2021.

Comments: 9+16 pages, 3+11 figures

arXiv:2111.06545 [pdf, ps, other]

doi 10.1126/science.abg5137

Peta-electron volt gamma-ray emission from the Crab Nebula

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, H. Cai, J. T. Cai, Zhe Cao, J. Chang, J. F. Chang, B. M. Chen, E. S. Chen, J. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen , et al. (250 additional authors not shown)

Abstract: The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ pet… ▽ More The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ petaelectronvolt (PeV). The ultra-high-energy photons exhibit the presence of a PeV electron accelerator (a pevatron) with an acceleration rate exceeding 15% of the absolute theoretical limit. Assuming that unpulsed $γ$-rays are produced at the termination of the pulsar's wind, we constrain the pevatron's size, between $0.025$ and $0.1$ pc, and the magnetic field $\approx 110 μ$G. The production rate of PeV electrons, $2.5 \times 10^{36}$ erg $\rm s^{-1}$, constitutes 0.5% of the pulsar's spin-down luminosity, although we do not exclude a non-negligible contribution of PeV protons to the production of the highest energy $γ$-rays. △ Less

Submitted 11 November, 2021; originally announced November 2021.

Comments: 43 pages, 13 figures, 2 tables; Published in Science

Journal ref: Science, 2021, Vol 373, Issue 6553, pp. 425-430

arXiv:2111.03469 [pdf, ps, other]

Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space

Authors: Jihao Long, Jiequn Han

Abstract: Most existing theoretical analysis of reinforcement learning (RL) is limited to the tabular setting or linear models due to the difficulty in dealing with function approximation in high dimensional space with an uncertain environment. This work offers a fresh perspective into this challenge by analyzing RL in a general reproducing kernel Hilbert space (RKHS). We consider a family of Markov decisio… ▽ More Most existing theoretical analysis of reinforcement learning (RL) is limited to the tabular setting or linear models due to the difficulty in dealing with function approximation in high dimensional space with an uncertain environment. This work offers a fresh perspective into this challenge by analyzing RL in a general reproducing kernel Hilbert space (RKHS). We consider a family of Markov decision processes $\mathcal{M}$ of which the reward functions lie in the unit ball of an RKHS and transition probabilities lie in a given arbitrary set. We define a quantity called perturbational complexity by distribution mismatch $Δ_{\mathcal{M}}(ε)$ to characterize the complexity of the admissible state-action distribution space in response to a perturbation in the RKHS with scale $ε$. We show that $Δ_{\mathcal{M}}(ε)$ gives both the lower bound of the error of all possible algorithms and the upper bound of two specific algorithms (fitted reward and fitted Q-iteration) for the RL problem. Hence, the decay of $Δ_\mathcal{M}(ε)$ with respect to $ε$ measures the difficulty of the RL problem on $\mathcal{M}$. We further provide some concrete examples and discuss whether $Δ_{\mathcal{M}}(ε)$ decays fast or not in these examples. As a byproduct, we show that when the reward functions lie in a high dimensional RKHS, even if the transition probability is known and the action space is finite, it is still possible for RL problems to suffer from the curse of dimensionality. △ Less

Submitted 27 March, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

arXiv:2111.00358 [pdf, ps, other]

A Survey on the Robustness of Feature Importance and Counterfactual Explanations

Authors: Saumitra Mishra, Sanghamitra Dutta, Jason Long, Daniele Magazzeni

Abstract: There exist several methods that aim to address the crucial task of understanding the behaviour of AI/ML models. Arguably, the most popular among them are local explanations that focus on investigating model behaviour for individual instances. Several methods have been proposed for local analysis, but relatively lesser effort has gone into understanding if the explanations are robust and accuratel… ▽ More There exist several methods that aim to address the crucial task of understanding the behaviour of AI/ML models. Arguably, the most popular among them are local explanations that focus on investigating model behaviour for individual instances. Several methods have been proposed for local analysis, but relatively lesser effort has gone into understanding if the explanations are robust and accurately reflect the behaviour of underlying models. In this work, we present a survey of the works that analysed the robustness of two classes of local explanations (feature importance and counterfactual explanations) that are popularly used in analysing AI/ML models in finance. The survey aims to unify existing definitions of robustness, introduces a taxonomy to classify different robustness approaches, and discusses some interesting results. Finally, the survey introduces some pointers about extending current robustness analysis approaches so as to identify reliable explainability methods. △ Less

Submitted 3 January, 2023; v1 submitted 30 October, 2021; originally announced November 2021.

Comments: 4 pages plus references. Accepted at the workshop on Explainable AI in Finance (XAI-FIN21). Camera-ready version. V2: Added more references and expanded robust explanations for counterfactuals

arXiv:2110.14270 [pdf, other]

doi 10.1145/3531146.3533168

Counterfactual Shapley Additive Explanations

Authors: Emanuele Albini, Jason Long, Danial Dervovic, Daniele Magazzeni

Abstract: Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify th… ▽ More Feature attributions are a common paradigm for model explanations due to their simplicity in assigning a single numeric score for each input feature to a model. In the actionable recourse setting, wherein the goal of the explanations is to improve outcomes for model consumers, it is often unclear how feature attributions should be correctly used. With this work, we aim to strengthen and clarify the link between actionable recourse and feature attributions. Concretely, we propose a variant of SHAP, Counterfactual SHAP (CF-SHAP), that incorporates counterfactual information to produce a background dataset for use within the marginal (a.k.a. interventional) Shapley value framework. We motivate the need within the actionable recourse setting for careful consideration of background datasets when using Shapley values for feature attributions with numerous synthetic examples. Moreover, we demonstrate the efficacy of CF-SHAP by proposing and justifying a quantitative score for feature attributions, counterfactual-ability, showing that as measured by this metric, CF-SHAP is superior to existing methods when evaluated on public datasets using tree ensembles. △ Less

Submitted 16 May, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

Comments: Accepted at FAccT '22 (2022 ACM Conference on Fairness, Accountability, and Transparency)

ACM Class: I.2; I.5; H.5

arXiv:2110.07944 [pdf, other]

doi 10.3847/1538-4357/ac31b3

LTD064402+245919: A Subgiant with a 1-3 M$_{\odot}$ Undetected Companion Identified from LAMOST-TD Data

Authors: Fan Yang, Bo Zhang, Richard J. Long, You-Jun Lu, Su-Su Shan, Xing Wei, Jian-Ning Fu, Xian-Fei Zhang, Zhi-Chao Zhao, Yu Bai, Tuan Yi, Ling-Lin Zheng, Ze-Ming Zhou, Ji-Feng Liu

Abstract: Single-line spectroscopic binaries recently contribute to the stellar-mass black hole discovery, independently of the X-ray transient method. We report the identification of a single-line binary system LTD064402+245919, with an orbital period of 14.50 days. The observed component is a subgiant with a mass of 2.77$\pm$0.68M$_{\odot}$, radius 15.5$\pm$2.5R$_{\odot}$, effective temperature… ▽ More Single-line spectroscopic binaries recently contribute to the stellar-mass black hole discovery, independently of the X-ray transient method. We report the identification of a single-line binary system LTD064402+245919, with an orbital period of 14.50 days. The observed component is a subgiant with a mass of 2.77$\pm$0.68M$_{\odot}$, radius 15.5$\pm$2.5R$_{\odot}$, effective temperature $T_{\rm eff}$ 4500$\pm$200K, and surface gravity log\emph{g} 2.5$\pm$0.25dex. The discovery makes use of the LAMOST time-domain (LAMOST-TD) and ZTF survey. Our general-purpose software pipeline applies the Lomb-Scargle periodogram to determine the orbital period and uses machine-learning to classify the variable type from the folded light curves. We apply a combined model to estimate the orbital parameters from both the light and radial velocity curves, taking constraints on the primary star mass, mass function, and detection limit of secondary luminosity into consideration. We obtain a radial velocity semi-amplitude of 44.6$\pm$1.5 km s$^{-1}$, mass ratio of 0.73$\pm$0.07, and an undetected component mass of 2.02$\pm$0.49M$_{\odot}$ when the type of the undetected component is not set. We conclude that the inclination is not well constrained, and that the secondary mass is larger than 1M$_{\odot}$ when the undetected component is modelled as a compact object. According to our investigations using an MCMC simulation, increasing the spectra SNR by a factor of 3 would enable the secondary light to be distinguished (if present). The algorithm and software in this work are able to serve as general-purpose tools for the identification of compact objects quiescent in X-rays. △ Less

Submitted 24 October, 2021; v1 submitted 15 October, 2021; originally announced October 2021.

Comments: Accepted by ApJ

arXiv:2109.06386 [pdf, other]

The Three-Sided PyramidWavefront Sensor. I. Simulations and Analysis for Astronomical Adaptive Optics

Authors: Lauren Schatz, Jared R. Males, Carlos Correia, Benoit Neichel, Vincent Chambouleyron, Johanan Codona, Olivier Fauvarque, Jean-François Sauvage, Thierry Fusco, Michael Hart, Pierre Janin-Potiron, Robert Johnson, Joseph Long, Mala Mateen

Abstract: For ExAO instruments for the Giant Segmented Mirror Telescopes (GSMTs), alternative architectures of WFS are under consideration because there is a tradeoff between detector size, speed, and noise that reduces the performance of GSMT-ExAO wavefront control. One option under consideration for a GSMT-ExAO wavefront sensor is a three-sided PWFS (3PWFS). The 3PWFS creates three copies of the telescope… ▽ More For ExAO instruments for the Giant Segmented Mirror Telescopes (GSMTs), alternative architectures of WFS are under consideration because there is a tradeoff between detector size, speed, and noise that reduces the performance of GSMT-ExAO wavefront control. One option under consideration for a GSMT-ExAO wavefront sensor is a three-sided PWFS (3PWFS). The 3PWFS creates three copies of the telescope pupil for wavefront sensing, compared to the conventional four-sided PWFS (4PWFS) which uses four pupils. The 3PWFS uses fewer detector pixels than the 4PWFS and should therefore be less sensitive to read noise. Here we develop a mathematical formalism based on the diffraction theory description of the Foucault knife edge test that predicts the intensity pattern after the PWFS. Our formalism allows us to calculate the intensity in the pupil images formed by the PWFS in the presence of phase errors corresponding to arbitrary Fourier modes. We then use the Object Oriented MATLAB Adaptive Optics toolbox (OOMAO) to simulate an end-to-end model of an adaptive optics system using a PWFS with modulation and compare the performance of the 3PWFS to the 4PWFS. In the case of a low read noise detector, the Strehl ratios of the 3PWFS and 4PWFS are within 0.01. When we included higher read noise in the simulation, we found a Strehl ratio gain of 0.036 for the 3PWFS using Raw Intensity over the 4PWFS using Slopes Maps at a stellar magnitude of 10. At the same magnitude, the 4PWFS RI also outperformed the 4PWFS SM, but the gain was only 0.012 Strehl. This is significant because 4PWFS using Slopes Maps is how the PWFS is conventionally used for AO wavefront sensing. We have found that the 3PWFS is a viable wavefront sensor that can fully reconstruct a wavefront and produce a stable closed-loop with correction comparable to that of a 4PWFS, with modestly better performance for high read-noise detectors. △ Less

Submitted 13 September, 2021; originally announced September 2021.

Comments: 39 pages, 15 figures

arXiv:2108.05487 [pdf, other]

doi 10.1093/mnras/stab2341

SDSS-IV MaNGA: Stellar M/L gradients and the M/L-colour relation in galaxies

Authors: Junqiang Ge, Shude Mao, Youjun Lu, Michele Cappellari, Richard J. Long, Renbin Yan

Abstract: The stellar mass-to-light ratio gradient in SDSS $r-$band $\nabla (M_*/L_r)$ of a galaxy depends on its mass assembly history, which is imprinted in its morphology and gradients of age, metallicity, and stellar initial mass function (IMF). Taking a MaNGA sample of 2051 galaxies with stellar masses ranging from $10^9$ to $10^{12}M_\odot$ released in SDSS DR15, we focus on face-on galaxies, without… ▽ More The stellar mass-to-light ratio gradient in SDSS $r-$band $\nabla (M_*/L_r)$ of a galaxy depends on its mass assembly history, which is imprinted in its morphology and gradients of age, metallicity, and stellar initial mass function (IMF). Taking a MaNGA sample of 2051 galaxies with stellar masses ranging from $10^9$ to $10^{12}M_\odot$ released in SDSS DR15, we focus on face-on galaxies, without merger and bar signatures, and investigate the dependence of the 2D $\nabla (M_*/L_r)$ on other galaxy properties, including $M_*/L_r$-colour relationships by assuming a fixed Salpeter IMF as the mass normalization reference. The median gradient is $\nabla M_*/L_r\sim -0.1$ (i.e., the $M_*/L_r$ is larger at the centre) for massive galaxies, becomes flat around $M_*\sim 10^{10} M_{\odot}$ and change sign to $\nabla M_*/L_r\sim 0.1$ at the lowest masses. The $M_*/L_r$ inside a half light radius increases with increasing galaxy stellar mass; in each mass bin, early-type galaxies have the highest value, while pure-disk late-type galaxies have the smallest. Correlation analyses suggest that the mass-weighted stellar age is the dominant parameter influencing the $M_*/L_r$ profile, since a luminosity-weighted age is easily affected by star formation when the specific star formation rate (sSFR) inside the half light radius is higher than $10^{-3} {\rm Gyr}^{-1}$. With increased sSFR gradient, one can obtain a steeper negative $\nabla (M_*/L_r)$. The scatter in the slopes of $M_*/L$-colour relations increases with increasing sSFR, for example, the slope for post-starburst galaxies can be flattened to $0.45$ from the global value $0.87$ in the $M_*/L$ vs. $g-r$ diagram. Hence converting galaxy colours to $M_*/L$ should be done carefully, especially for those galaxies with young luminosity-weighted stellar ages, which can have quite different star formation histories. △ Less

Submitted 11 August, 2021; originally announced August 2021.

Comments: 12 pages, 10 figures. Accepted for publication in MNRAS

arXiv:2108.04964 [pdf, other]

A spectral-based analysis of the separation between two-layer neural networks and linear methods

Authors: Lei Wu, Jihao Long

Abstract: We propose a spectral-based approach to analyze how two-layer neural networks separate from linear methods in terms of approximating high-dimensional functions. We show that quantifying this separation can be reduced to estimating the Kolmogorov width of two-layer neural networks, and the latter can be further characterized by using the spectrum of an associated kernel. Different from previous wor… ▽ More We propose a spectral-based approach to analyze how two-layer neural networks separate from linear methods in terms of approximating high-dimensional functions. We show that quantifying this separation can be reduced to estimating the Kolmogorov width of two-layer neural networks, and the latter can be further characterized by using the spectrum of an associated kernel. Different from previous work, our approach allows obtaining upper bounds, lower bounds, and identifying explicit hard functions in a united manner. We provide a systematic study of how the choice of activation functions affects the separation, in particular the dependence on the input dimension. Specifically, for nonsmooth activation functions, we extend known results to more activation functions with sharper bounds. As concrete examples, we prove that any single neuron can instantiate the separation between neural networks and random feature models. For smooth activation functions, one surprising finding is that the separation is negligible unless the norms of inner-layer weights are polynomially large with respect to the input dimension. By contrast, the separation for nonsmooth activation functions is independent of the norms of inner-layer weights. △ Less

Submitted 23 February, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

Comments: Accepted by Journal of Machine Learning Research

arXiv:2108.00962 [pdf, other]

doi 10.1103/PhysRevD.104.083540

The Secret Higgstory of the Highest Temperature during Reheating

Authors: Samuel Passaglia, Wayne Hu, Andrew J. Long, David Zegeye

Abstract: We study the role of the Standard Model Higgs condensate, formed during cosmological inflation, in the epoch of reheating that follows. We focus on the scenario where the inflaton decays slowly and perturbatively, so that there is a long period between the end of inflation and the beginning of radiation domination. The Higgs condensate decays non-perturbatively during this period, and we show that… ▽ More We study the role of the Standard Model Higgs condensate, formed during cosmological inflation, in the epoch of reheating that follows. We focus on the scenario where the inflaton decays slowly and perturbatively, so that there is a long period between the end of inflation and the beginning of radiation domination. The Higgs condensate decays non-perturbatively during this period, and we show that it heats the primordial plasma to much higher temperatures than would result from the slowly-decaying inflaton alone. We discuss the effect of this hot plasma on the thermalization of the inflaton's decay products, and study its phenomenological implications for the formation of cosmological relics like dark matter, with associated isocurvature fluctuations, and the restoration of the electroweak and Peccei-Quinn symmetries. △ Less

Submitted 22 November, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

Comments: 17 pages, 4 figures; matches published version

Journal ref: Phys.Rev.D 104 (2021) 8, 083540

arXiv:2107.07523 [pdf, other]

Characterizing deformable mirrors for the MagAO-X instrument

Authors: Kyle Van Gorkom, Jared R. Males, Laird M. Close, Jennifer Lumbres, Alex Hedglen, Joseph D. Long, Sebastiaan Y. Haffert, Olivier Guyon, Maggie Kautz, Lauren Schatz, Kelsey Miller, Alexander T. Rodack, Justin M. Knight, Katie M. Morzinski

Abstract: The MagAO-X instrument is a new extreme adaptive optics system for high-contrast imaging at visible and near-infrared wavelengths on the Magellan Clay Telescope. A central component of this system is a 2040-actuator microelectromechanical deformable mirror (DM) from Boston Micromachines Corp. that operates at 3.63 kHz for high-order wavefront control (the tweeter). Two additional DMs from ALPAO pe… ▽ More The MagAO-X instrument is a new extreme adaptive optics system for high-contrast imaging at visible and near-infrared wavelengths on the Magellan Clay Telescope. A central component of this system is a 2040-actuator microelectromechanical deformable mirror (DM) from Boston Micromachines Corp. that operates at 3.63 kHz for high-order wavefront control (the tweeter). Two additional DMs from ALPAO perform the low-order (the woofer) and non-common-path science-arm wavefront correction (the NCPC DM). Prior to integration with the instrument, we characterized these devices using a Zygo Verifire Interferometer to measure each DM surface. We present the results of the characterization effort here, demonstrating the ability to drive tweeter to a flat of 6.9 nm root mean square (RMS) surface (and 0.56 nm RMS surface within its control bandwidth), the woofer to 2.2 nm RMS surface, and the NCPC DM to 2.1 nm RMS surface over the MagAO-X beam footprint on each device. Using focus-diversity phase retrieval on the MagAO-X science cameras to estimate the internal instrument wavefront error (WFE), we further show that the integrated DMs correct the instrument WFE to 18.7 nm RMS, which, combined with a 11.7% pupil amplitude RMS, produces a Strehl ratio of 0.94 at H$α$. △ Less

Submitted 15 July, 2021; originally announced July 2021.

Comments: Accepted for publication in JATIS

arXiv:2107.03871 [pdf]

Spatial beam self-cleaning in bi-tapered multimode fibers

Authors: Xiao-Jun Lin, Yu-Xin Gao, Jin-Gan Long, Jia-Wen Wu, Xiang-Yue Li, Wei-Yi Hong, Hu Cui, Zhi-Chao Luo, Wen-Cheng Xu, Ai-Ping Luo

Abstract: We report the spatial beam self-cleaning in bi-tapered conventional multimode fibers (MMFs) with different tapered lengths. Through the introduction of the bi-tapered structure in MMFs, the input beam with poor beam quality from a high-power fiber laser can be converted to a centered, bell-shaped beam in a short length, due to the strengthened nonlinear modes coupling. It is found that the bi-tape… ▽ More We report the spatial beam self-cleaning in bi-tapered conventional multimode fibers (MMFs) with different tapered lengths. Through the introduction of the bi-tapered structure in MMFs, the input beam with poor beam quality from a high-power fiber laser can be converted to a centered, bell-shaped beam in a short length, due to the strengthened nonlinear modes coupling. It is found that the bi-tapered MMF with longer tapered length at the same waist diameter shows better beam self-cleaning effect and larger spectral broadening. The obtained results offer a new method to improve the beam quality of high-power laser at low cost. Besides, it may be interesting for manufacturing bi-tapered MMF-based devices to obtain the quasi-fundamental mode beam in spatiotemporal mode-locked fiber lasers. △ Less

Submitted 8 July, 2021; originally announced July 2021.

arXiv:2106.15212 [pdf, other]

Counterfactual Explanations for Arbitrary Regression Models

Authors: Thomas Spooner, Danial Dervovic, Jason Long, Jon Shepard, Jiahao Chen, Daniele Magazzeni

Abstract: We present a new method for counterfactual explanations (CFEs) based on Bayesian optimisation that applies to both classification and regression models. Our method is a globally convergent search algorithm with support for arbitrary regression models and constraints like feature sparsity and actionable recourse, and furthermore can answer multiple counterfactual questions in parallel while learnin… ▽ More We present a new method for counterfactual explanations (CFEs) based on Bayesian optimisation that applies to both classification and regression models. Our method is a globally convergent search algorithm with support for arbitrary regression models and constraints like feature sparsity and actionable recourse, and furthermore can answer multiple counterfactual questions in parallel while learning from previous queries. We formulate CFE search for regression models in a rigorous mathematical framework using differentiable potentials, which resolves robustness issues in threshold-based objectives. We prove that in this framework, (a) verifying the existence of counterfactuals is NP-complete; and (b) that finding instances using such potentials is CLS-complete. We describe a unified algorithm for CFEs using a specialised acquisition function that composes both expected improvement and an exponential-polynomial (EP) family with desirable properties. Our evaluation on real-world benchmark domains demonstrate high sample-efficiency and precision. △ Less

Submitted 29 June, 2021; originally announced June 2021.

Comments: 20 pages, 5 figures, 3 tables

arXiv:2106.07494 [pdf, other]

Subdivergence-free gluings of trees

Authors: Xinle Dai, Jordan Long, Karen Yeats

Abstract: A gluing of two rooted trees is an identification of their leaves and un-subdivision of the resulting 2-valent vertices. A gluing of two rooted trees is subdivergence free if it has no 2-edge cuts with both roots on the same side of the cut. The problem and language is motivated by quantum field theory. We enumerate subdivergence-free gluings for certain families of trees, showing a connection wit… ▽ More A gluing of two rooted trees is an identification of their leaves and un-subdivision of the resulting 2-valent vertices. A gluing of two rooted trees is subdivergence free if it has no 2-edge cuts with both roots on the same side of the cut. The problem and language is motivated by quantum field theory. We enumerate subdivergence-free gluings for certain families of trees, showing a connection with connected permutations, and we give algorithms to compute subdivergence-free gluings. △ Less

Submitted 10 January, 2025; v1 submitted 14 June, 2021; originally announced June 2021.

Comments: minor edits according to referee comments, 20 pages

MSC Class: Primary: 05A15; Secondary: 05C05; 81T18

arXiv:2106.03141 [pdf, other]

doi 10.1103/PhysRevD.105.024075

Classification of radial Kerr geodesic motion

Authors: Geoffrey Compère, Yan Liu, Jiang Long

Abstract: We classify radial timelike geodesic motion of the exterior non-extremal Kerr spacetime by performing a taxonomy of inequivalent root structures of the first order radial geodesic equation using a novel compact notation and by implementing the constraints from polar, time and azimuthal motion. Four generic root structures with only simple roots give rise to eight non-generic root structures when e… ▽ More We classify radial timelike geodesic motion of the exterior non-extremal Kerr spacetime by performing a taxonomy of inequivalent root structures of the first order radial geodesic equation using a novel compact notation and by implementing the constraints from polar, time and azimuthal motion. Four generic root structures with only simple roots give rise to eight non-generic root structures when either one root becomes coincident with the horizon, one root vanishes or two roots becomes coincident. We derive the explicit phase space of all such root systems in the basis of energy, angular momentum and Carter's constant and classify whether each corresponding radial geodesic motion is allowed or disallowed from existence of polar, time and azimuthal motion. The classification of radial motion within the ergoregion for both positive and negative energies leads to 6 distinguished values of the Kerr angular momentum. The classification of null radial motion and near-horizon extremal Kerr radial motion are obtained as limiting cases and compared with the literature. We explicitly parametrize the separatrix describing root systems with double roots as the union of the following three regions that are described by the same quartic respectively obtained when (1) the pericenter of bound motion becomes a double root; (2) the eccentricity of bound motion becomes zero; (3) the turning point of unbound motion becomes a double root. △ Less

Submitted 2 February, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

Comments: 80 pages, 12 tables, 41 figures, matches the published version up to editorial changes of PRD

arXiv:2106.01921 [pdf, ps, other]

doi 10.1002/sam.11559

Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

Authors: James P. Long, Min Jin Ha

Abstract: Causal models are notoriously difficult to validate because they make untestable assumptions regarding confounding. New scientific experiments offer the possibility of evaluating causal models using prediction performance. Prediction performance measures are typically robust to violations in causal assumptions. However, prediction performance does depend on the selection of training and test sets.… ▽ More Causal models are notoriously difficult to validate because they make untestable assumptions regarding confounding. New scientific experiments offer the possibility of evaluating causal models using prediction performance. Prediction performance measures are typically robust to violations in causal assumptions. However, prediction performance does depend on the selection of training and test sets. Biased training sets can lead to optimistic assessments of model performance. In this work, we revisit the prediction performance of several recently proposed causal models tested on a genetic perturbation data set of Kemmeren. We find that sample selection bias is likely a key driver of model performance. We propose using a less-biased evaluation set for assessing prediction performance and compare models on this new set. In this setting, the causal models have similar or worse performance compared to standard association-based estimators such as Lasso. Finally, we compare the performance of causal estimators in simulation studies that reproduce the Kemmeren structure of genetic knockout experiments but without any sample selection bias. These results provide an improved understanding of the performance of several causal models and offer guidance on how future studies should use Kemmeren. △ Less

Submitted 26 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

Comments: 12 pages, 4 figures, 2 tables

arXiv:2104.12772 [pdf, other]

doi 10.1103/PhysRevLett.128.071102

No evidence for axions from Chandra observation of magnetic white dwarf

Authors: Christopher Dessert, Andrew J. Long, Benjamin R. Safdi

Abstract: Ultralight axions with axion-photon couplings $g_{aγγ} \sim {\rm few} \times 10^{-11}$ GeV$^{-1}$ may resolve a number of astrophysical anomalies, such as unexpected ~TeV transparency, anomalous stellar cooling, and X-ray excesses from nearby neutron stars. We show, however, that such axions are severely constrained by the non-observation of X-rays from the magnetic white dwarf (MWD) RE J0317-853… ▽ More Ultralight axions with axion-photon couplings $g_{aγγ} \sim {\rm few} \times 10^{-11}$ GeV$^{-1}$ may resolve a number of astrophysical anomalies, such as unexpected ~TeV transparency, anomalous stellar cooling, and X-ray excesses from nearby neutron stars. We show, however, that such axions are severely constrained by the non-observation of X-rays from the magnetic white dwarf (MWD) RE J0317-853 using ~40 ks of data acquired from a dedicated observation with the Chandra X-ray Observatory. Axions may be produced in the core of the MWD through electron bremsstrahlung and then convert to X-rays in the magnetosphere. The non-observation of X-rays constrains the axion-photon coupling to $g_{aγγ} \lesssim 5.5 \times 10^{-13} \sqrt{C_{aγγ}/C_{aee}}$ GeV$^{-1}$ at 95% confidence for axion masses $m_a \lesssim 5 \times 10^{-6}$ eV, with $C_{aee}$ and $C_{aγγ}$ the dimensionless coupling constants to electrons and photons. Considering that $C_{aee}$ is generated from the renormalization group, our results robustly disfavor $g_{aγγ} \gtrsim 4.4 \times 10^{-11}$ GeV$^{-1}$ even for models with no ultraviolet contribution to $C_{aee}$. △ Less

Submitted 26 April, 2021; originally announced April 2021.

Comments: 7+11 pages, 4+7 figures

arXiv:2104.12036 [pdf, ps, other]

A Class of Dimension-free Metrics for the Convergence of Empirical Measures

Authors: Jiequn Han, Ruimeng Hu, Jihao Long

Abstract: This paper concerns the convergence of empirical measures in high dimensions. We propose a new class of probability metrics and show that under such metrics, the convergence is free of the curse of dimensionality (CoD). Such a feature is critical for high-dimensional analysis and stands in contrast to classical metrics ({\it e.g.}, the Wasserstein metric). The proposed metrics fall into the catego… ▽ More This paper concerns the convergence of empirical measures in high dimensions. We propose a new class of probability metrics and show that under such metrics, the convergence is free of the curse of dimensionality (CoD). Such a feature is critical for high-dimensional analysis and stands in contrast to classical metrics ({\it e.g.}, the Wasserstein metric). The proposed metrics fall into the category of integral probability metrics, for which we specify criteria of test function spaces to guarantee the property of being free of CoD. Examples of the selected test function spaces include the reproducing kernel Hilbert spaces, Barron space, and flow-induced function spaces. Three applications of the proposed metrics are presented: 1. The convergence of empirical measure in the case of random variables; 2. The convergence of $n$-particle system to the solution to McKean-Vlasov stochastic differential equation; 3. The construction of an $\varepsilon$-Nash equilibrium for a homogeneous $n$-player game by its mean-field limit. As a byproduct, we prove that, given a distribution close to the target distribution measured by our metric and a certain representation of the target distribution, we can generate a distribution close to the target one in terms of the Wasserstein metric and relative entropy. Overall, we show that the proposed class of metrics is a powerful tool to analyze the convergence of empirical measures in high dimensions without CoD. △ Less

Submitted 16 September, 2023; v1 submitted 24 April, 2021; originally announced April 2021.

MSC Class: 60B10; 60E15; 60K35; 91A16; 60H10

arXiv:2104.11211 [pdf, other]

doi 10.1364/AO.422155

The vector-apodizing phase plate coronagraph: design, current performance, and future development

Authors: D. S. Doelman, F. Snik, E. H. Por, S. P. Bos, G. P. P. L. Otten, M. Kenworthy, S. Y. Haffert, M. Wilby, A. J. Bohn, B. J. Sutlieff, K. Miller, M. Ouellet, J. de Boer, C. U. Keller, M. J. Escuti, S. Shi, N. Z. Warriner, K. J. Hornburg, J. L. Birkby, J. Males, K. M. Morzinski, L. M. Close, J. Codona, J. Long, L. Schatz , et al. (28 additional authors not shown)

Abstract: Over the last decade, the vector-apodizing phase plate (vAPP) coronagraph has been developed from concept to on-sky application in many high-contrast imaging systems on 8-m class telescopes. The vAPP is an geometric-phase patterned coronagraph that is inherently broadband, and its manufacturing is enabled only by direct-write technology for liquid-crystal patterns. The vAPP generates two coronagra… ▽ More Over the last decade, the vector-apodizing phase plate (vAPP) coronagraph has been developed from concept to on-sky application in many high-contrast imaging systems on 8-m class telescopes. The vAPP is an geometric-phase patterned coronagraph that is inherently broadband, and its manufacturing is enabled only by direct-write technology for liquid-crystal patterns. The vAPP generates two coronagraphic PSFs that cancel starlight on opposite sides of the point spread function (PSF) and have opposite circular polarization states. The efficiency, that is the amount of light in these PSFs, depends on the retardance offset from half-wave of the liquid-crystal retarder. Using different liquid-crystal recipes to tune the retardance, different vAPPs operate with high efficiencies ($>96\%$) in the visible and thermal infrared (0.55 $μ$m to 5 $μ$m). Since 2015, seven vAPPs have been installed in a total of six different instruments, including Magellan/MagAO, Magellan/MagAO-X, Subaru/SCExAO, and LBT/LMIRcam. Using two integral field spectrographs installed on the latter two instruments, these vAPPs can provide low-resolution spectra (R$\sim$30) between 1 $μ$m and 5 $μ$m. We review the design process, development, commissioning, on-sky performance, and first scientific results of all commissioned vAPPs. We report on the lessons learned and conclude with perspectives for future developments and applications. △ Less

Submitted 4 November, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

Comments: 38 pages, 17 figures, accepted for publication in Applied Optics, added NSF grant acknowledgement

arXiv:2104.07864 [pdf, other]

doi 10.3847/1538-3881/abf92f

An Empirical Bayesian Approach to Limb-darkening in Modeling WASP-121b Transit Light Curves

Authors: Fan Yang, Richard J. Long, Ji-Feng Liu, Su-Su Shan, Rui Guo, Bo Zhang, Tuan Yi, Ling-Lin Zheng, Zhi-Chao Zhao

Abstract: We present a novel, iterative method using an empirical Bayesian approach for modeling the limb darkened WASP-121b transit from the TESS light curve. Our method is motivated by the need to improve $R_{p}/R_{\ast}$ estimates for exoplanet atmosphere modeling, and is particularly effective with the limb darkening (LD) quadratic law requiring no prior central value from stellar atmospheric models. Wi… ▽ More We present a novel, iterative method using an empirical Bayesian approach for modeling the limb darkened WASP-121b transit from the TESS light curve. Our method is motivated by the need to improve $R_{p}/R_{\ast}$ estimates for exoplanet atmosphere modeling, and is particularly effective with the limb darkening (LD) quadratic law requiring no prior central value from stellar atmospheric models. With the non-linear LD law, the method has all the advantages of not needing atmospheric models but does not converge. The iterative method gives a different $R_{p}/R_{\ast}$ for WASP-121b at a significance level of 1$σ$ when compared with existing non-iterative methods. To assess the origins and implications of this difference, we generate and analyze light curves with known values of the limb darkening coefficients (LDCs). We find that non-iterative modeling with LDC priors from stellar atmospheric models results in an inconsistent $R_{p}/R_{\ast}$ at 1.5$σ$ level when the known LDC values are as those previously found when modeling real data by the iterative method. In contrast, the LDC values from the iterative modeling yields the correct value of $R_{p}/R_{\ast}$ to within 0.25$σ$. For more general cases with different known inputs, Monte Carlo simulations show that the iterative method obtains unbiased LDCs and correct $R_{p}/R_{\ast}$ to within a significance level of 0.3$σ$. Biased LDC priors can cause biased LDC posteriors and lead to bias in the $R_{p}/R_{\ast}$ of up to 0.82$\%$, 2.5$σ$ for the quadratic law and 0.32$\%$, 1.0$σ$ for the non-linear law. Our improvement in $R_{p}/R_{\ast}$ estimation is important when analyzing exoplanet atmospheres. △ Less

Submitted 15 April, 2021; originally announced April 2021.

Comments: Accepted by AJ

arXiv:2104.07794 [pdf, ps, other]

An $L^2$ Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation

Authors: Jihao Long, Jiequn Han, Weinan E

Abstract: Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives rise to error bounds that involve either the number of states or the number of features. This paper considers the situation where the function approximation is ma… ▽ More Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives rise to error bounds that involve either the number of states or the number of features. This paper considers the situation where the function approximation is made either using the kernel method or the two-layer neural network model, in the context of a fitted Q-iteration algorithm with explicit regularization. We establish an $\tilde{O}(H^3|\mathcal {A}|^{\frac14}n^{-\frac14})$ bound for the optimal policy with $Hn$ samples, where $H$ is the length of each episode and $|\mathcal {A}|$ is the size of action space. Our analysis hinges on analyzing the $L^2$ error of the approximated Q-function using $n$ data points. Even though this result still requires a finite-sized action space, the error bound is independent of the dimensionality of the state space. △ Less

Submitted 15 February, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

arXiv:2104.04965 [pdf, ps, other]

doi 10.1103/PhysRevD.104.062007

Calibration of the Air Shower Energy Scale of the Water and Air Cherenkov Techniques in the LHAASO experiment

Authors: F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, H. Cai, J. T. Cai, Z. Cao Z. Cao, J. Chang, J. F. Chang, X. C. Chang, B. M. Chen, J. Chen, L. Chen, L. Chen, L. Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (233 additional authors not shown)

Abstract: The Wide Field-of-View Cherenkov Telescope Array (WFCTA) and the Water Cherenkov Detector Arrays (WCDA) of LHAASO are designed to work in combination for measuring the energy spectra of various cosmic ray species over a very wide energy range from a few TeV to 10 PeV. The energy calibration of WCDA can be achieved with a proven technique of measuring the westward shift of the Moon shadow of galact… ▽ More The Wide Field-of-View Cherenkov Telescope Array (WFCTA) and the Water Cherenkov Detector Arrays (WCDA) of LHAASO are designed to work in combination for measuring the energy spectra of various cosmic ray species over a very wide energy range from a few TeV to 10 PeV. The energy calibration of WCDA can be achieved with a proven technique of measuring the westward shift of the Moon shadow of galactic cosmic rays due to the geomagnetic field. This deflection angle $Δ$ is inversely proportional to the energy of the cosmic rays. The precise measurements of the shifts by WCDA allows us to calibrate its energy scale for energies as high as 35 TeV. The energy scale measured by WCDA can be used to cross calibrate the energy reconstructed by WFCTA, which spans the whole energy range up to 10 PeV. In this work, we will demonstrate the feasibility of the method using the data collected from April 2019 to January 2020 by the WFCTA array and WCDA-1 detector, the first of the three water Cherenkov ponds, already commissioned at LHAASO site. △ Less

Submitted 13 April, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

Journal ref: Phys. Rev. D 104, 062007 (2021)

Showing 201–250 of 532 results for author: Long, J