-
CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer
Authors:
Linfeng Wen,
Chengying Gao,
Changqing Zou
Abstract:
Content affinity loss including feature and pixel affinity is a main problem which leads to artifacts in photorealistic and video style transfer. This paper proposes a new framework named CAP-VSTNet, which consists of a new reversible residual network and an unbiased linear transform module, for versatile style transfer. This reversible residual network can not only preserve content affinity but n…
▽ More
Content affinity loss including feature and pixel affinity is a main problem which leads to artifacts in photorealistic and video style transfer. This paper proposes a new framework named CAP-VSTNet, which consists of a new reversible residual network and an unbiased linear transform module, for versatile style transfer. This reversible residual network can not only preserve content affinity but not introduce redundant information as traditional reversible networks, and hence facilitate better stylization. Empowered by Matting Laplacian training loss which can address the pixel affinity loss problem led by the linear transform, the proposed framework is applicable and effective on versatile style transfer. Extensive experiments show that CAP-VSTNet can produce better qualitative and quantitative results in comparison with the state-of-the-art methods.
△ Less
Submitted 31 March, 2023;
originally announced March 2023.
-
Parameter estimation from aggregate observations: A Wasserstein distance based sequential Monte Carlo sampler
Authors:
Chen Cheng,
Linjie Wen,
Jinglai Li
Abstract:
In this work we study systems consisting of a group of moving particles. In such systems, often some important parameters are unknown and have to be estimated from observed data. Such parameter estimation problems can often be solved via a Bayesian inference framework. However in many practical problems, only data at the aggregate level is available and as a result the likelihood function is not a…
▽ More
In this work we study systems consisting of a group of moving particles. In such systems, often some important parameters are unknown and have to be estimated from observed data. Such parameter estimation problems can often be solved via a Bayesian inference framework. However in many practical problems, only data at the aggregate level is available and as a result the likelihood function is not available, which poses challenge for Bayesian methods. In particular, we consider the situation where the distributions of the particles are observed. We propose a Wasserstein distance based sequential Monte Carlo sampler to solve the problem: the Wasserstein distance is used to measure the similarity between the observed and the simulated particle distributions and the sequential Monte Carlo samplers is used to deal with the sequentially available observations. Two real-world examples are provided to demonstrate the performance of the proposed method.
△ Less
Submitted 8 July, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability
Authors:
Aiwei Liu,
Xuming Hu,
Lijie Wen,
Philip S. Yu
Abstract:
This paper presents the first comprehensive analysis of ChatGPT's Text-to-SQL ability. Given the recent emergence of large-scale conversational language model ChatGPT and its impressive capabilities in both conversational abilities and code generation, we sought to evaluate its Text-to-SQL performance. We conducted experiments on 12 benchmark datasets with different languages, settings, or scenari…
▽ More
This paper presents the first comprehensive analysis of ChatGPT's Text-to-SQL ability. Given the recent emergence of large-scale conversational language model ChatGPT and its impressive capabilities in both conversational abilities and code generation, we sought to evaluate its Text-to-SQL performance. We conducted experiments on 12 benchmark datasets with different languages, settings, or scenarios, and the results demonstrate that ChatGPT has strong text-to-SQL abilities. Although there is still a gap from the current state-of-the-art (SOTA) model performance, considering that the experiment was conducted in a zero-shot scenario, ChatGPT's performance is still impressive. Notably, in the ADVETA (RPL) scenario, the zero-shot ChatGPT even outperforms the SOTA model that requires fine-tuning on the Spider dataset by 4.1\%, demonstrating its potential for use in practical applications. To support further research in related fields, we have made the data generated by ChatGPT publicly available at https://github.com/THU-BPM/chatgpt-sql.
△ Less
Submitted 11 March, 2023;
originally announced March 2023.
-
Text with Knowledge Graph Augmented Transformer for Video Captioning
Authors:
Xin Gu,
Guang Chen,
Yufei Wang,
Libo Zhang,
Tiejian Luo,
Longyin Wen
Abstract:
Video captioning aims to describe the content of videos using natural language. Although significant progress has been made, there is still much room to improve the performance for real-world applications, mainly due to the long-tail words challenge. In this paper, we propose a text with knowledge graph augmented transformer (TextKG) for video captioning. Notably, TextKG is a two-stream transforme…
▽ More
Video captioning aims to describe the content of videos using natural language. Although significant progress has been made, there is still much room to improve the performance for real-world applications, mainly due to the long-tail words challenge. In this paper, we propose a text with knowledge graph augmented transformer (TextKG) for video captioning. Notably, TextKG is a two-stream transformer, formed by the external stream and internal stream. The external stream is designed to absorb additional knowledge, which models the interactions between the additional knowledge, e.g., pre-built knowledge graph, and the built-in information of videos, e.g., the salient object regions, speech transcripts, and video captions, to mitigate the long-tail words challenge. Meanwhile, the internal stream is designed to exploit the multi-modality information in videos (e.g., the appearance of video frames, speech transcripts, and video captions) to ensure the quality of caption results. In addition, the cross attention mechanism is also used in between the two streams for sharing information. In this way, the two streams can help each other for more accurate results. Extensive experiments conducted on four challenging video captioning datasets, i.e., YouCookII, ActivityNet Captions, MSRVTT, and MSVD, demonstrate that the proposed method performs favorably against the state-of-the-art methods. Specifically, the proposed TextKG method outperforms the best published results by improving 18.7% absolute CIDEr scores on the YouCookII dataset.
△ Less
Submitted 25 March, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
The JUNO experiment Top Tracker
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (592 additional authors not shown)
Abstract:
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector…
▽ More
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector, covering about 60% of the surface above them. The JUNO Top Tracker is constituted by the decommissioned OPERA experiment Target Tracker modules. The technology used consists in walls of two planes of plastic scintillator strips, one per transverse direction. Wavelength shifting fibres collect the light signal emitted by the scintillator strips and guide it to both ends where it is read by multianode photomultiplier tubes. Compared to the OPERA Target Tracker, the JUNO Top Tracker uses new electronics able to cope with the high rate produced by the high rock radioactivity compared to the one in Gran Sasso underground laboratory. This paper will present the new electronics and mechanical structure developed for the Top Tracker of JUNO along with its expected performance based on the current detector simulation.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide estimation of the JUNO sensitivity to 7Be, pep, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most opti mistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on 7Be, pep, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Authors:
Wei Li,
Linchao Zhu,
Longyin Wen,
Yi Yang
Abstract:
Large-scale pre-trained multi-modal models (e.g., CLIP) demonstrate strong zero-shot transfer capability in many discriminative tasks. Their adaptation to zero-shot image-conditioned text generation tasks has drawn increasing interest. Prior arts approach to zero-shot captioning by either utilizing the existing large language models (e.g., GPT-2) or pre-training the encoder-decoder network in an e…
▽ More
Large-scale pre-trained multi-modal models (e.g., CLIP) demonstrate strong zero-shot transfer capability in many discriminative tasks. Their adaptation to zero-shot image-conditioned text generation tasks has drawn increasing interest. Prior arts approach to zero-shot captioning by either utilizing the existing large language models (e.g., GPT-2) or pre-training the encoder-decoder network in an end-to-end manner. In this work, we propose a simple framework, named DeCap, for zero-shot captioning. We introduce a lightweight visual-aware language decoder. This decoder is both data-efficient and computation-efficient: 1) it only requires the text data for training, easing the burden on the collection of paired data. 2) it does not require end-to-end training. When trained with text-only data, the decoder takes the text embedding extracted from the off-the-shelf CLIP encoder as a prefix embedding. The challenge is that the decoder is trained on the text corpus but at the inference stage, it needs to generate captions based on visual inputs. The modality gap issue is widely observed in multi-modal contrastive models that prevents us from directly taking the visual embedding as the prefix embedding. We propose a training-free mechanism to reduce the modality gap. We project the visual embedding into the CLIP text embedding space, while the projected embedding retains the information of the visual input. Taking the projected embedding as the prefix embedding, the decoder generates high-quality descriptions that match the visual input. The experiments show that DeCap outperforms other zero-shot captioning methods and unpaired captioning methods on the typical image captioning benchmarks, i.e., MSCOCO and NoCaps.
△ Less
Submitted 6 March, 2023;
originally announced March 2023.
-
Search for Two-neutrino Double-Beta Decay of $^{136}\rm Xe$ to the $0^+_1$ excited state of $^{136}\rm Ba$ with the Complete EXO-200 Dataset
Authors:
EXO-200 Collaboration,
:,
S. Al Kharusi,
G. Anton,
I. Badhrees,
P. S. Barbeau,
D. Beck,
V. Belov,
T. Bhatta,
M. Breidenbach,
T. Brunner,
G. F. Cao,
W. R. Cen,
C. Chambers,
B. Cleveland,
M. Coon,
A. Craycraft,
T. Daniels,
L. Darroch,
S. J. Daugherty,
J. Davis,
S. Delaquis,
A. Der Mesrobian-Kabakian,
R. DeVoe,
J. Dilling
, et al. (83 additional authors not shown)
Abstract:
A new search for two-neutrino double-beta ($2νββ$) decay of $^{136}\rm Xe$ to the $0^+_1$ excited state of $^{136}\rm Ba$ is performed with the full EXO-200 dataset. A deep learning-based convolutional neural network is used to discriminate signal from background events. Signal detection efficiency is increased relative to previous searches by EXO-200 by more than a factor of two. With the additio…
▽ More
A new search for two-neutrino double-beta ($2νββ$) decay of $^{136}\rm Xe$ to the $0^+_1$ excited state of $^{136}\rm Ba$ is performed with the full EXO-200 dataset. A deep learning-based convolutional neural network is used to discriminate signal from background events. Signal detection efficiency is increased relative to previous searches by EXO-200 by more than a factor of two. With the addition of the Phase II dataset taken with an upgraded detector, the median 90$\%$ confidence level half-life sensitivity of $2νββ$ decay to the $0^+_1$ state of $^{136}\rm Ba$ is $2.9 \times 10^{24}~\rm yr$ using a total $^{136}\rm Xe$ exposure of $234.1~\rm kg~yr$. No statistically significant evidence for $2νββ$ decay to the $0^+_1$ state is observed, leading to a lower limit of $T^{2ν}_{1/2}(0^+ \rightarrow 0^+_1) > 1.4\times10^{24}~\rm yr$ at 90$\%$ confidence level, improved by 70$\%$ relative to the current world's best constraint.
△ Less
Submitted 16 October, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Generative Oversampling for Imbalanced Data via Majority-Guided VAE
Authors:
Qingzhong Ai,
Pengyun Wang,
Lirong He,
Liangjian Wen,
Lujia Pan,
Zenglin Xu
Abstract:
Learning with imbalanced data is a challenging problem in deep learning. Over-sampling is a widely used technique to re-balance the sampling distribution of training data. However, most existing over-sampling methods only use intra-class information of minority classes to augment the data but ignore the inter-class relationships with the majority ones, which is prone to overfitting, especially whe…
▽ More
Learning with imbalanced data is a challenging problem in deep learning. Over-sampling is a widely used technique to re-balance the sampling distribution of training data. However, most existing over-sampling methods only use intra-class information of minority classes to augment the data but ignore the inter-class relationships with the majority ones, which is prone to overfitting, especially when the imbalance ratio is large. To address this issue, we propose a novel over-sampling model, called Majority-Guided VAE~(MGVAE), which generates new minority samples under the guidance of a majority-based prior. In this way, the newly generated minority samples can inherit the diversity and richness of the majority ones, thus mitigating overfitting in downstream tasks. Furthermore, to prevent model collapse under limited data, we first pre-train MGVAE on sufficient majority samples and then fine-tune based on minority samples with Elastic Weight Consolidation(EWC) regularization. Experimental results on benchmark image datasets and real-world tabular data show that MGVAE achieves competitive improvements over other over-sampling methods in downstream classification tasks, demonstrating the effectiveness of our method.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Bringing Diversity to Autonomous Vehicles: An Interpretable Multi-vehicle Decision-making and Planning Framework
Authors:
Licheng Wen,
Pinlong Cai,
Daocheng Fu,
Song Mao,
Yikang Li
Abstract:
With the development of autonomous driving, it is becoming increasingly common for autonomous vehicles (AVs) and human-driven vehicles (HVs) to travel on the same roads. Existing single-vehicle planning algorithms on board struggle to handle sophisticated social interactions in the real world. Decisions made by these methods are difficult to understand for humans, raising the risk of crashes and m…
▽ More
With the development of autonomous driving, it is becoming increasingly common for autonomous vehicles (AVs) and human-driven vehicles (HVs) to travel on the same roads. Existing single-vehicle planning algorithms on board struggle to handle sophisticated social interactions in the real world. Decisions made by these methods are difficult to understand for humans, raising the risk of crashes and making them unlikely to be applied in practice. Moreover, vehicle flows produced by open-source traffic simulators suffer from being overly conservative and lacking behavioral diversity. We propose a hierarchical multi-vehicle decision-making and planning framework with several advantages. The framework jointly makes decisions for all vehicles within the flow and reacts promptly to the dynamic environment through a high-frequency planning module. The decision module produces interpretable action sequences that can explicitly communicate self-intent to the surrounding HVs. We also present the cooperation factor and trajectory weight set, bringing diversity to autonomous vehicles in traffic at both the social and individual levels. The superiority of our proposed framework is validated through experiments with multiple scenarios, and the diverse behaviors in the generated vehicle trajectories are demonstrated through closed-loop simulations.
△ Less
Submitted 13 February, 2023;
originally announced February 2023.
-
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné,
A. Allocca
, et al. (1719 additional authors not shown)
Abstract:
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti…
▽ More
The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
Study on U/Th residual radioactivity in acrylic from surface treatment
Authors:
Yuanxia Li,
Xiaohui Qian,
Xiaolan Luo,
Jie Zhao,
Gaofeng Zhang,
Xiaoyan Ma,
Yuekun Heng,
Liangjian Wen,
Monica Sisti,
Frédéric Perrot,
Hongqiang Tang
Abstract:
Acrylic is widely used as material for the target container in low background experiments due to its high light transparency and low intrinsic radioactivity. However, its surface can be easily contaminated during production, so careful treatment of the surface is essential to avoid direct contamination of the target. The Jiangmen Underground Neutrino Observatory will use about 600~t of acrylic to…
▽ More
Acrylic is widely used as material for the target container in low background experiments due to its high light transparency and low intrinsic radioactivity. However, its surface can be easily contaminated during production, so careful treatment of the surface is essential to avoid direct contamination of the target. The Jiangmen Underground Neutrino Observatory will use about 600~t of acrylic to build the spherical vessel of 35.4~m in diameter for a 20~kt liquid scintillator (LS). Since acrylic will contact the LS directly, the cleanliness of the its surface is quite important for the radiopurity of the LS. A new method for measuring the radioactivity of $^{238}$U and $^{232}$Th in acrylic to sub-ppt ($<10^{-12}$~g/g) was developed, and it is crucial for the acrylic radioactivity screening in this study. We performed many background tests on different surface treatments, and the recommended procedure for the treatment of acrylic to achieve low radioactivity and high light transparency could be applicable to other low background experiments.
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Pre-merger sky localization of gravitational waves from binary neutron star mergers using deep learning
Authors:
Chayan Chatterjee,
Linqing Wen
Abstract:
The simultaneous observation of gravitational waves (GW) and prompt electromagnetic counterparts from the merger of two neutron stars can help reveal the properties of extreme matter and gravity during and immediately after the final plunge. Rapid sky localization of these sources is crucial to facilitate such multi-messenger observations. Since GWs from binary neutron star (BNS) mergers can spend…
▽ More
The simultaneous observation of gravitational waves (GW) and prompt electromagnetic counterparts from the merger of two neutron stars can help reveal the properties of extreme matter and gravity during and immediately after the final plunge. Rapid sky localization of these sources is crucial to facilitate such multi-messenger observations. Since GWs from binary neutron star (BNS) mergers can spend up to 10-15 mins in the frequency bands of the detectors at design sensitivity, early warning alerts and pre-merger sky localization can be achieved for sufficiently bright sources, as demonstrated in recent studies. In this work, we present pre-merger BNS sky localization results using CBC-SkyNet, a deep learning model capable of inferring sky location posterior distributions of GW sources at orders of magnitude faster speeds than standard Markov Chain Monte Carlo methods. We test our model's performance on a catalog of simulated injections from Sachdev et al. (2020), recovered at 0-60 secs before merger, and obtain comparable sky localization areas to the rapid localization tool BAYESTAR. These results show the feasibility of our model for rapid pre-merger sky localization and the possibility of follow-up observations for precursor emissions from BNS mergers.
△ Less
Submitted 7 December, 2023; v1 submitted 30 December, 2022;
originally announced January 2023.
-
Grace periods in comparative effectiveness studies of sustained treatments
Authors:
Kerollos Nashat Wanis,
Aaron L. Sarvet,
Lan Wen,
Jason P. Block,
Sheryl L. Rifas-Shiman,
James M. Robins,
Jessica G. Young
Abstract:
Researchers are often interested in estimating the effect of sustained use of a treatment on a health outcome. However, adherence to strict treatment protocols can be challenging for individuals in practice and, when non-adherence is expected, estimates of the effect of sustained use may not be useful for decision making. As an alternative, more relaxed treatment protocols which allow for periods…
▽ More
Researchers are often interested in estimating the effect of sustained use of a treatment on a health outcome. However, adherence to strict treatment protocols can be challenging for individuals in practice and, when non-adherence is expected, estimates of the effect of sustained use may not be useful for decision making. As an alternative, more relaxed treatment protocols which allow for periods of time off treatment (i.e. grace periods) have been considered in pragmatic randomized trials and observational studies. In this article, we consider the interpretation, identification, and estimation of treatment strategies which include grace periods. We contrast natural grace period strategies which allow individuals the flexibility to take treatment as they would naturally do, with stochastic grace period strategies in which the investigator specifies the distribution of treatment utilization. We estimate the effect of initiation of a thiazide diuretic or an angiotensin-converting enzyme inhibitor in hypertensive individuals under various strategies which include grace periods.
△ Less
Submitted 23 January, 2024; v1 submitted 21 December, 2022;
originally announced December 2022.
-
Neutrinoless Double Beta Decay
Authors:
C. Adams,
K. Alfonso,
C. Andreoiu,
E. Angelico,
I. J. Arnquist,
J. A. A. Asaadi,
F. T. Avignone,
S. N. Axani,
A. S. Barabash,
P. S. Barbeau,
L. Baudis,
F. Bellini,
M. Beretta,
T. Bhatta,
V. Biancacci,
M. Biassoni,
E. Bossio,
P. A. Breur,
J. P. Brodsky,
C. Brofferio,
E. Brown,
R. Brugnera,
T. Brunner,
N. Burlac,
E. Caden
, et al. (207 additional authors not shown)
Abstract:
This White Paper, prepared for the Fundamental Symmetries, Neutrons, and Neutrinos Town Meeting related to the 2023 Nuclear Physics Long Range Plan, makes the case for double beta decay as a critical component of the future nuclear physics program. The major experimental collaborations and many theorists have endorsed this white paper.
This White Paper, prepared for the Fundamental Symmetries, Neutrons, and Neutrinos Town Meeting related to the 2023 Nuclear Physics Long Range Plan, makes the case for double beta decay as a critical component of the future nuclear physics program. The major experimental collaborations and many theorists have endorsed this white paper.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Simulation Software of the JUNO Experiment
Authors:
Tao Lin,
Yuxiang Hu,
Miao Yu,
Haosen Zhang,
Simon Charles Blyth,
Yaoguang Wang,
Haoqi Lu,
Cecile Jollet,
João Pedro Athayde Marcondes de André,
Ziyan Deng,
Guofu Cao,
Fengpeng An,
Pietro Chimenti,
Xiao Fang,
Yuhang Guo,
Wenhao Huang,
Xingtao Huang,
Rui Li,
Teng Li,
Weidong Li,
Xinying Li,
Yankai Liu,
Anselmo Meregaglia,
Zhen Qian,
Yuhan Ren
, et al. (9 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO) is a multi-purpose experiment, under construction in southeast China, that is designed to determine the neutrino mass ordering and precisely measure neutrino oscillation parameters. Monte Carlo simulation plays an important role for JUNO detector design, detector commissioning, offline data processing, and physics processing. The JUNO experiment…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO) is a multi-purpose experiment, under construction in southeast China, that is designed to determine the neutrino mass ordering and precisely measure neutrino oscillation parameters. Monte Carlo simulation plays an important role for JUNO detector design, detector commissioning, offline data processing, and physics processing. The JUNO experiment has the world's largest liquid scintillator detector instrumented with many thousands of PMTs. The broad energy range of interest, long lifetime, and the large scale present data processing challenges across all areas. This paper describes the JUNO simulation software, highlighting the challenges of JUNO simulation and solutions to meet these challenges, including such issues as support for time-correlated analysis, event mixing, event correlation and handling the simulation of many millions of optical photons.
△ Less
Submitted 17 May, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
JUNO Sensitivity on Proton Decay $p\to \barνK^+$ Searches
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli,
Thilo Birkenfeld,
Sylvie Blin
, et al. (586 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential on searching for proton decay in $p\to \barνK^+$ mode with JUNO is investigated.The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreov…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential on searching for proton decay in $p\to \barνK^+$ mode with JUNO is investigated.The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreover, the excellent energy resolution of JUNO permits to suppress the sizable background caused by other delayed signals. Based on these advantages, the detection efficiency for the proton decay via $p\to \barνK^+$ is 36.9% with a background level of 0.2 events after 10 years of data taking. The estimated sensitivity based on 200 kton-years exposure is $9.6 \times 10^{33}$ years, competitive with the current best limits on the proton lifetime in this channel.
△ Less
Submitted 26 October, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
The long-time asymptotic of the derivative nonlinear Schr$\ddot{o}$dinger equation with step-like initial value
Authors:
Lili Wen,
Yong Chen,
Jian Xu
Abstract:
Consideration in this present paper is the long-time asymptotic of solutions to the derivative nonlinear Schr$\ddot{o}$dinger equation with the step-like initial value \begin{eqnarray} q(x,0)=q_{0}(x)=\begin{cases} \begin{split} A_{1}e^{iφ}e^{2iBx}, \quad\quad x<0,\\ A_{2}e^{-2iBx}, \quad\quad~~ x>0. \end{split}\nonumber \end{cases} \end{eqnarray} by Deift-Zhou method. The step-like initial proble…
▽ More
Consideration in this present paper is the long-time asymptotic of solutions to the derivative nonlinear Schr$\ddot{o}$dinger equation with the step-like initial value \begin{eqnarray} q(x,0)=q_{0}(x)=\begin{cases} \begin{split} A_{1}e^{iφ}e^{2iBx}, \quad\quad x<0,\\ A_{2}e^{-2iBx}, \quad\quad~~ x>0. \end{split}\nonumber \end{cases} \end{eqnarray} by Deift-Zhou method. The step-like initial problem described by a matrix Riemann-Hilbert problem. A crucial ingredient used in this paper is to introduce $g$-function mechanism for solving the problem of the entries of the jump matrix growing exponentially as $t\rightarrow\infty$. It is shown that the leading order term of the asymptotic solution of the DNLS equation expressed by the Theta function $Θ$ about the Riemann-surface of genus 3 and the subleading order term expressed by parabolic cylinder and Airy functions.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Search for subsolar-mass black hole binaries in the second part of Advanced LIGO's and Advanced Virgo's third observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
C. Alléné,
A. Allocca,
P. A. Altin
, et al. (1680 additional authors not shown)
Abstract:
We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate t…
▽ More
We describe a search for gravitational waves from compact binaries with at least one component with mass 0.2 $M_\odot$ -- $1.0 M_\odot$ and mass ratio $q \geq 0.1$ in Advanced LIGO and Advanced Virgo data collected between 1 November 2019, 15:00 UTC and 27 March 2020, 17:00 UTC. No signals were detected. The most significant candidate has a false alarm rate of 0.2 $\mathrm{yr}^{-1}$. We estimate the sensitivity of our search over the entirety of Advanced LIGO's and Advanced Virgo's third observing run, and present the most stringent limits to date on the merger rate of binary black holes with at least one subsolar-mass component. We use the upper limits to constrain two fiducial scenarios that could produce subsolar-mass black holes: primordial black holes (PBH) and a model of dissipative dark matter. The PBH model uses recent prescriptions for the merger rate of PBH binaries that include a rate suppression factor to effectively account for PBH early binary disruptions. If the PBHs are monochromatically distributed, we can exclude a dark matter fraction in PBHs $f_\mathrm{PBH} \gtrsim 0.6$ (at 90% confidence) in the probed subsolar-mass range. However, if we allow for broad PBH mass distributions we are unable to rule out $f_\mathrm{PBH} = 1$. For the dissipative model, where the dark matter has chemistry that allows a small fraction to cool and collapse into black holes, we find an upper bound $f_{\mathrm{DBH}} < 10^{-5}$ on the fraction of atomic dark matter collapsed into black holes.
△ Less
Submitted 26 January, 2024; v1 submitted 2 December, 2022;
originally announced December 2022.
-
The most probable host of CHIME FRB 190425A, associated with binary neutron star merger GW190425, and a late-time transient search
Authors:
Fiona H. Panther,
Gemma E. Anderson,
Shivani Bhandari,
Adelle J. Goodwin,
Natasha Hurley-Walker,
Clancy W. James,
Adela Kawka,
Shunke Ai,
Manoj Kovalam,
Alexandra Moroianu,
Linqing Wen,
Bing Zhang
Abstract:
The identification and localization of Fast Radio Bursts to their host galaxies has revealed important details about the progenitors of these mysterious, millisecond-long bursts of coherent radio emission. In this work we study the most probable host galaxy of the apparently non-repeating CHIME/FRB event FRB 20190425A -- a particularly high luminosity, low dispersion measure event that was demonst…
▽ More
The identification and localization of Fast Radio Bursts to their host galaxies has revealed important details about the progenitors of these mysterious, millisecond-long bursts of coherent radio emission. In this work we study the most probable host galaxy of the apparently non-repeating CHIME/FRB event FRB 20190425A -- a particularly high luminosity, low dispersion measure event that was demonstrated by Moroianu et al. 2022 to be temporally and spatially coincident with the LIGO-Virgo-KAGRA binary neutron star merger GW190425, suggesting an astrophysical association (p-value 0.0052). In this paper we remain agnostic to this result, and we confirm UGC10667 as the most probable host galaxy of FRB 20190425A, demonstrating that the host galaxies of low dispersion measure, one-off CHIME FRBs can be plausibly identified. We then perform multi-wavelength observations to characterize the galaxy and search for any afterglow emission associated with the FRB and its putative GW counterpart. We find no radio or optical transient emission in our observations $2.5\,\mathrm{yr}$ post-burst. UGC10667 is a spiral galaxy at $z\sim0.03$, dominated by an old stellar population. We find no evidence of a large population of young stars, with nebular emission dominated by star formation at a rate of $1-2\,\mathrm{M_\odot\,yr^{-1}}$. While we cannot rule out a young magnetar as the origin of FRB 20190425A, our observations are consistent with an origin in a long delay-time neutron star binary merger as posited by Moroianu et al. 2022.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
An assessment of the Association Between a Fast Radio Burst and Binary Neutron Star Merger
Authors:
Alexandra Moroianu,
Linqing Wen,
Clancy W. James,
Shunke Ai,
Manoj Kovalam,
Fiona Panther,
Bing Zhang
Abstract:
Fast radio bursts (FRBs) are mysterious bright millisecond-duration radio bursts at cosmological distances. While young magnetars have been put forward as the leading source candidate, recent observations suggest there may be multiple FRB progenitor classes. It has long been theorised that FRBs could be emitted from compact object mergers - cataclysmic events such as binary neutron star (BNS) merg…
▽ More
Fast radio bursts (FRBs) are mysterious bright millisecond-duration radio bursts at cosmological distances. While young magnetars have been put forward as the leading source candidate, recent observations suggest there may be multiple FRB progenitor classes. It has long been theorised that FRBs could be emitted from compact object mergers - cataclysmic events such as binary neutron star (BNS) mergers that may be detectable in gravitational waves (GWs) by the ground-based Laser Interferometer Gravitational Wave Observatory (LIGO)and Virgo. Here we report a potential coincidence between the only BNS merger event GW190425 out of 21 GW sources detected during the first six months of LIGO-Virgo's 3rd Science Run and a bright, non-repeating FRB event, FRB 20190425A, from a search using public GW and CHIME FRB data. The FRB is located within the GW's sky localization area, occurred 2.5 hours after the GW event, and has a dispersion measure consistent with the distance inferred from GW parameter estimation. The chance probability of a coincidence between unrelated FRB and GW events in the databases is estimated to be 0.0052 ($2.8 σ$). We estimate the chance of CHIME detecting such an event to range from 0.4% for a beam-centre detection to 68% if a bright burst is detectable in a far sidelobe. This potential association is consistent with the theory that the BNS merger leaves behind a supramassive, highly magnetized compact object, which collapses to form a black hole after losing angular momentum due to spindown and makes an FRB through ejecting the magnetosphere. If such a physical association is established, the equation of state of the post-merger compact object is likely stiff, with a Tolman-Oppenheimer-Volkoff non-spinning maximum mass $M_{TOV} > 2.63_{-0.23}^{+0.39} M_\odot$ for a neutron star remnant, or $M_{TOV} > 2.31_{-0.08}^{+0.24} M_\odot$ for a quark star remnant.
△ Less
Submitted 30 November, 2022;
originally announced December 2022.
-
Data-driven simultaneous vertex and energy reconstruction for large liquid scintillator detectors
Authors:
Gui-hong Huang,
Wei Jiang,
Liang-jian Wen,
Yi-fang Wang,
Wu-Ming Luo
Abstract:
High precision vertex and energy reconstruction is crucial for large liquid scintillator detectors such as JUNO, especially for the determination of the neutrino mass ordering by analyzing the energy spectrum of reactor neutrinos. This paper presents a data-driven method to obtain more realistic and more accurate expected PMT response of positron events in JUNO, and develops a simultaneous vertex…
▽ More
High precision vertex and energy reconstruction is crucial for large liquid scintillator detectors such as JUNO, especially for the determination of the neutrino mass ordering by analyzing the energy spectrum of reactor neutrinos. This paper presents a data-driven method to obtain more realistic and more accurate expected PMT response of positron events in JUNO, and develops a simultaneous vertex and energy reconstruction method that combines the charge and time information of PMTs. For the JUNO detector, the impact of vertex inaccuracy on the energy resolution is about 0.6\%.
△ Less
Submitted 30 November, 2022;
originally announced November 2022.
-
Precision measurement of reactor antineutrino oscillation at kilometer-scale baselines by Daya Bay
Authors:
Daya Bay collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
Y. Y. Ding,
X. Y. Ding
, et al. (176 additional authors not shown)
Abstract:
We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Comp…
▽ More
We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Compared to the previous Daya Bay results, selection of IBD candidates has been optimized, energy calibration refined, and treatment of backgrounds further improved. The resulting oscillation parameters are ${\rm sin}^{2}2θ_{13} = 0.0851 \pm 0.0024$, $Δ{\rm m}^{2}_{32} = (2.466 \pm 0.060) \times 10^{-3}{\rm eV}^{2}$ for the normal mass ordering or $Δ{\rm m}^{2}_{32} = -(2.571 \pm 0.060) \times 10^{-3} {\rm eV}^{2}$ for the inverted mass ordering.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
In situ tuning of dynamical Coulomb blockade on Andreev bound states in hybrid nanowire devices
Authors:
Shan Zhang,
Zhichuan Wang,
Dong Pan,
Zhaoyu Wang,
Zonglin Li,
Zitong Zhang,
Yichun Gao,
Zhan Cao,
Gu Zhang,
Lei Liu,
Lianjun Wen,
Ran Zhuo,
Dong E. Liu,
Ke He,
Runan Shang,
Jianhua Zhao,
Hao Zhang
Abstract:
Electron interactions in quantum devices can exhibit intriguing phenomena. One example is assembling an electronic device in series with an on-chip resistor. The quantum laws of electricity of the device is modified at low energies and temperatures by dissipative interactions induced by the resistor, a phenomenon known as dynamical Coulomb blockade (DCB). The DCB strength is usually non-adjustable…
▽ More
Electron interactions in quantum devices can exhibit intriguing phenomena. One example is assembling an electronic device in series with an on-chip resistor. The quantum laws of electricity of the device is modified at low energies and temperatures by dissipative interactions induced by the resistor, a phenomenon known as dynamical Coulomb blockade (DCB). The DCB strength is usually non-adjustable in a fixed environment defined by the resistor. Here, we design an on-chip circuit for InAs-Al hybrid nanowires where the DCB strength can be gate-tuned in situ. InAs-Al nanowires could host Andreev or Majorana zero-energy states. This technique enables tracking the evolution of the same state while tuning the DCB strength from weak to strong. We observe the transition from a zero-bias conductance peak to split peaks for Andreev zero-energy states. Our technique opens the door to in situ tuning interaction strength on zero-energy states.
△ Less
Submitted 12 December, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.
-
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction
Authors:
Xuming Hu,
Shiao Meng,
Chenwei Zhang,
Xiangli Yang,
Lijie Wen,
Irwin King,
Philip S. Yu
Abstract:
Information Extraction (IE) aims to extract structured information from heterogeneous sources. IE from natural language texts include sub-tasks such as Named Entity Recognition (NER), Relation Extraction (RE), and Event Extraction (EE). Most IE systems require comprehensive understandings of sentence structure, implied semantics, and domain knowledge to perform well; thus, IE tasks always need ade…
▽ More
Information Extraction (IE) aims to extract structured information from heterogeneous sources. IE from natural language texts include sub-tasks such as Named Entity Recognition (NER), Relation Extraction (RE), and Event Extraction (EE). Most IE systems require comprehensive understandings of sentence structure, implied semantics, and domain knowledge to perform well; thus, IE tasks always need adequate external resources and annotations. However, it takes time and effort to obtain more human annotations. Low-Resource Information Extraction (LRIE) strives to use unsupervised data, reducing the required resources and human annotation. In practice, existing systems either utilize self-training schemes to generate pseudo labels that will cause the gradual drift problem, or leverage consistency regularization methods which inevitably possess confirmation bias. To alleviate confirmation bias due to the lack of feedback loops in existing LRIE learning paradigms, we develop a Gradient Imitation Reinforcement Learning (GIRL) method to encourage pseudo-labeled data to imitate the gradient descent direction on labeled data, which can force pseudo-labeled data to achieve better optimization capabilities similar to labeled data. Based on how well the pseudo-labeled data imitates the instructive gradient descent direction obtained from labeled data, we design a reward to quantify the imitation process and bootstrap the optimization capability of pseudo-labeled data through trial and error. In addition to learning paradigms, GIRL is not limited to specific sub-tasks, and we leverage GIRL to solve all IE sub-tasks (named entity recognition, relation extraction, and event extraction) in low-resource settings (semi-supervised IE and few-shot IE).
△ Less
Submitted 14 November, 2022; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Determine Energy Nonlinearity and Resolution of $e^{\pm}$ and $γ$ in Liquid Scintillator Detectors by A Universal Energy Response Model
Authors:
Miao Yu,
Liangjian Wen,
Xiang Zhou,
Wuming Luo
Abstract:
Energy nonlinearity and resolution in liquid scintillator (LS) detectors are correlated and particle-dependent. A unified energy response model for liquid scintillator detectors has been presented in details. This model has advanced a data-driven approach to calibrate the particle-dependent energy response, using both the monoenergetic $γ$-ray sources and the continuous $β$ spectra of…
▽ More
Energy nonlinearity and resolution in liquid scintillator (LS) detectors are correlated and particle-dependent. A unified energy response model for liquid scintillator detectors has been presented in details. This model has advanced a data-driven approach to calibrate the particle-dependent energy response, using both the monoenergetic $γ$-ray sources and the continuous $β$ spectra of $^\mathrm{12}\mathrm{B}$ and Michel $e^-$ induced by cosmic muons. Monte Carlo studies have demonstrated the effectiveness and robustness of the proposed model, in particular, the positron energy resolution can be extracted in the absence of positron sources. This work will provide a feasible approach of simultaneous calibration of energy nonlinearity and resolution for the running and future LS detectors.
△ Less
Submitted 10 November, 2022; v1 submitted 4 November, 2022;
originally announced November 2022.
-
A centimeter-scale achromatic hybrid metalens with polarization-insensitivity in the visible
Authors:
Tie Hu,
Shengqi Wang,
Yunxuan Wei,
Liqing Wen,
Xing Feng,
Ming Zhao,
Zhenyu Yang
Abstract:
Metalenses, featuring ultra-compactness and CMOS compatibility, are limited by the compromise between the diameter, numerical aperture, and working waveband. To address this problem, we propose and numerically demonstrate a centimeter-scale metasurface-refractive hybrid metalens working in the band of 440 - 700 nm. Revisiting the general Snell law, we present the phase profile of a chromatic aberr…
▽ More
Metalenses, featuring ultra-compactness and CMOS compatibility, are limited by the compromise between the diameter, numerical aperture, and working waveband. To address this problem, we propose and numerically demonstrate a centimeter-scale metasurface-refractive hybrid metalens working in the band of 440 - 700 nm. Revisiting the general Snell law, we present the phase profile of a chromatic aberration correction metasurface that can apply to a plano-convex refractive lens of an arbitrary surface type. Simulated by our semi-vector method, the designed achromatic hybrid metalens achieves 81% chromatic aberration suppression and polarization insensitivity. Broadband imaging results of the hybrid metalens are further provided, verifying the achromatism of the designed hybrid metalens. It can find applications in camera lenses and other optical systems that need compact, high-performance lenses.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution
Authors:
Aiwei Liu,
Honghai Yu,
Xuming Hu,
Shu'ang Li,
Li Lin,
Fukun Ma,
Yawen Yang,
Lijie Wen
Abstract:
We propose the first character-level white-box adversarial attack method against transformer models. The intuition of our method comes from the observation that words are split into subtokens before being fed into the transformer models and the substitution between two close subtokens has a similar effect to the character modification. Our method mainly contains three steps. First, a gradient-base…
▽ More
We propose the first character-level white-box adversarial attack method against transformer models. The intuition of our method comes from the observation that words are split into subtokens before being fed into the transformer models and the substitution between two close subtokens has a similar effect to the character modification. Our method mainly contains three steps. First, a gradient-based method is adopted to find the most vulnerable words in the sentence. Then we split the selected words into subtokens to replace the origin tokenization result from the transformer tokenizer. Finally, we utilize an adversarial loss to guide the substitution of attachable subtokens in which the Gumbel-softmax trick is introduced to ensure gradient propagation. Meanwhile, we introduce the visual and length constraint in the optimization process to achieve minimum character modifications. Extensive experiments on both sentence-level and token-level tasks demonstrate that our method could outperform the previous attack methods in terms of success rate and edit distance. Furthermore, human evaluation verifies our adversarial examples could preserve their origin labels.
△ Less
Submitted 30 October, 2022;
originally announced October 2022.
-
Search for gravitational-wave transients associated with magnetar bursts in Advanced LIGO and Advanced Virgo data from the third observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Allocca,
P. A. Altin
, et al. (1645 additional authors not shown)
Abstract:
Gravitational waves are expected to be produced from neutron star oscillations associated with magnetar giant flares and short bursts. We present the results of a search for short-duration (milliseconds to seconds) and long-duration ($\sim$ 100 s) transient gravitational waves from 13 magnetar short bursts observed during Advanced LIGO, Advanced Virgo and KAGRA's third observation run. These 13 bu…
▽ More
Gravitational waves are expected to be produced from neutron star oscillations associated with magnetar giant flares and short bursts. We present the results of a search for short-duration (milliseconds to seconds) and long-duration ($\sim$ 100 s) transient gravitational waves from 13 magnetar short bursts observed during Advanced LIGO, Advanced Virgo and KAGRA's third observation run. These 13 bursts come from two magnetars, SGR 1935$+$2154 and Swift J1818.0$-$1607. We also include three other electromagnetic burst events detected by Fermi GBM which were identified as likely coming from one or more magnetars, but they have no association with a known magnetar. No magnetar giant flares were detected during the analysis period. We find no evidence of gravitational waves associated with any of these 16 bursts. We place upper bounds on the root-sum-square of the integrated gravitational-wave strain that reach $2.2 \times 10^{-23}$ $/\sqrt{\text{Hz}}$ at 100 Hz for the short-duration search and $8.7 \times 10^{-23}$ $/\sqrt{\text{Hz}}$ at $450$ Hz for the long-duration search, given a detection efficiency of 50%. For a ringdown signal at 1590 Hz targeted by the short-duration search the limit is set to $1.8 \times 10^{-22}$ $/\sqrt{\text{Hz}}$. Using the estimated distance to each magnetar, we derive upper bounds on the emitted gravitational-wave energy of $3.2 \times 10^{43}$ erg ($7.3 \times 10^{43}$ erg) for SGR 1935$+$2154 and $8.2 \times 10^{42}$ erg ($2.8 \times 10^{43}$ erg) for Swift J1818.0$-$1607, for the short-duration (long-duration) search. Assuming isotropic emission of electromagnetic radiation of the burst fluences, we constrain the ratio of gravitational-wave energy to electromagnetic energy for bursts from SGR 1935$+$2154 with available fluence information. The lowest of these ratios is $3 \times 10^3$.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
Authors:
Xuming Hu,
Yong Jiang,
Aiwei Liu,
Zhongqiang Huang,
Pengjun Xie,
Fei Huang,
Lijie Wen,
Philip S. Yu
Abstract:
Data augmentation techniques have been used to alleviate the problem of scarce labeled data in various NER tasks (flat, nested, and discontinuous NER tasks). Existing augmentation techniques either manipulate the words in the original text that break the semantic coherence of the text, or exploit generative models that ignore preserving entities in the original text, which impedes the use of augme…
▽ More
Data augmentation techniques have been used to alleviate the problem of scarce labeled data in various NER tasks (flat, nested, and discontinuous NER tasks). Existing augmentation techniques either manipulate the words in the original text that break the semantic coherence of the text, or exploit generative models that ignore preserving entities in the original text, which impedes the use of augmentation techniques on nested and discontinuous NER tasks. In this work, we propose a novel Entity-to-Text based data augmentation technique named EnTDA to add, delete, replace or swap entities in the entity list of the original texts, and adopt these augmented entity lists to generate semantically coherent and entity preserving texts for various NER tasks. Furthermore, we introduce a diversity beam search to increase the diversity during the text generation process. Experiments on thirteen NER datasets across three tasks (flat, nested, and discontinuous NER tasks) and two settings (full data and low resource settings) show that EnTDA could bring more performance improvements compared to the baseline augmentation techniques.
△ Less
Submitted 26 May, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Expected geoneutrino signal at JUNO using local integrated 3-D refined crustal model
Authors:
Ran Han,
ZhiWei Li,
Ruohan Gao,
Yao Sun,
Ya Xu,
Yufei Xi,
Guangzheng Jiang,
Andong Wang,
Yaping Cheng,
Yao Sun,
Jie Pang,
Qi Hua,
Liangjian Wen,
Liang Zhan,
Yu-Feng Li
Abstract:
Geoneutrinos serve as a potent tool for comprehending the radiogenic power and composition of Earth. Although geoneutrinos have been observed in prior experiments, the forthcoming generation of experiments,such as JUNO, will be necessary for fully harnessing their potential. Precise prediction of the crustal contribution is vital for interpreting particlephysics measurements in the context of geo-…
▽ More
Geoneutrinos serve as a potent tool for comprehending the radiogenic power and composition of Earth. Although geoneutrinos have been observed in prior experiments, the forthcoming generation of experiments,such as JUNO, will be necessary for fully harnessing their potential. Precise prediction of the crustal contribution is vital for interpreting particlephysics measurements in the context of geo-scientific inquiries. Nonetheless, existing models such as JULOC and GIGJ have limitations in accurately forecasting the crustal contribution. This paper introduces JULOCI, the novel 3-D integrated crustal model of JUNO, which employs seismic, gravity, rock sample, and heat flow data to precisely estimate the geoneutrino signal of the lithosphere. The model indicates elevated concentrations of uranium and thorium in southern China, resulting in unexpectedly strong geoneutrino signals.The accuracy of JULOC-I, coupled with a decade of experimental data, affords JUNO the opportunity to test multiple mantle models. Once operational, JUNO can validate the model predictions and enhance the precision of mantle measurements. All in all, the improved accuracy ofJULOC-I represents a substantial stride towards comprehending the geochemical distribution of the South China crust, offering a valuable tool for investigating the composition and evolution of the Earth through geoneutrinos.
△ Less
Submitted 6 March, 2024; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Model Independent Approach of the JUNO $^8$B Solar Neutrino Program
Authors:
JUNO Collaboration,
Jie Zhao,
Baobiao Yue,
Haoqi Lu,
Yufeng Li,
Jiajie Ling,
Zeyuan Yu,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai
, et al. (579 additional authors not shown)
Abstract:
The physics potential of detecting $^8$B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model independent manner by using three distinct channels of the charged-current (CC), neutral-current (NC) and elastic scattering (ES) interactions. Due to the largest-ever mass of $^{13}$C nuclei in the liquid-scintillator detectors and the {expected} low backg…
▽ More
The physics potential of detecting $^8$B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model independent manner by using three distinct channels of the charged-current (CC), neutral-current (NC) and elastic scattering (ES) interactions. Due to the largest-ever mass of $^{13}$C nuclei in the liquid-scintillator detectors and the {expected} low background level, $^8$B solar neutrinos would be observable in the CC and NC interactions on $^{13}$C for the first time. By virtue of optimized event selections and muon veto strategies, backgrounds from the accidental coincidence, muon-induced isotopes, and external backgrounds can be greatly suppressed. Excellent signal-to-background ratios can be achieved in the CC, NC and ES channels to guarantee the $^8$B solar neutrino observation. From the sensitivity studies performed in this work, we show that JUNO, with ten years of data, can reach the {1$σ$} precision levels of 5%, 8% and 20% for the $^8$B neutrino flux, $\sin^2θ_{12}$, and $Δm^2_{21}$, respectively. It would be unique and helpful to probe the details of both solar physics and neutrino physics. In addition, when combined with SNO, the world-best precision of 3% is expected for the $^8$B neutrino flux measurement.
△ Less
Submitted 6 March, 2024; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Energy Dissipation and Asymmetric Excitation in Hybrid Waveguides for Routing and Coloring
Authors:
Xianguang Yang,
Long Wen,
Jiahao Yan,
Yanjun Bao,
Qin Chen,
Andrea Camposeo,
Dario Pisignano,
Baojun Li
Abstract:
The delivery of optical signals from an external light source to a nanoscale waveguide is highly important for the development of nanophotonic circuits. However, the efficient coupling of external light energy into nanophotonic components is difficult and still remains a challenge. Herein, we use an external silica nanofiber to light up an organic-inorganic hybrid nano-waveguide, namely a system c…
▽ More
The delivery of optical signals from an external light source to a nanoscale waveguide is highly important for the development of nanophotonic circuits. However, the efficient coupling of external light energy into nanophotonic components is difficult and still remains a challenge. Herein, we use an external silica nanofiber to light up an organic-inorganic hybrid nano-waveguide, namely a system composed of a polymer filament doped with MoS$_{2}$ quantum dots. Nanofiber-excited nano-waveguides in a crossed geometry are found to asymmetrically couple excitation signals along two opposite directions, with different energy dissipation resulting in different colors of the light emitted by MoS$_{2}$ quantum dots and collected from the waveguide terminals. Interestingly, rainbow-like light in the hybrid waveguide is achieved by three-in-one mixing of red, green, and blue components. This hetero-dimensional system of dots-in-waveguide represents a significant advance towards all-optical routing and full-color display in integrated nanophotonic devices.
△ Less
Submitted 24 September, 2022;
originally announced September 2022.
-
Scene Graph Modification as Incremental Structure Expanding
Authors:
Xuming Hu,
Zhijiang Guo,
Yu Fu,
Lijie Wen,
Philip S. Yu
Abstract:
A scene graph is a semantic representation that expresses the objects, attributes, and relationships between objects in a scene. Scene graphs play an important role in many cross modality tasks, as they are able to capture the interactions between images and texts. In this paper, we focus on scene graph modification (SGM), where the system is required to learn how to update an existing scene graph…
▽ More
A scene graph is a semantic representation that expresses the objects, attributes, and relationships between objects in a scene. Scene graphs play an important role in many cross modality tasks, as they are able to capture the interactions between images and texts. In this paper, we focus on scene graph modification (SGM), where the system is required to learn how to update an existing scene graph based on a natural language query. Unlike previous approaches that rebuilt the entire scene graph, we frame SGM as a graph expansion task by introducing the incremental structure expanding (ISE). ISE constructs the target graph by incrementally expanding the source graph without changing the unmodified structure. Based on ISE, we further propose a model that iterates between nodes prediction and edges prediction, inferring more accurate and harmonious expansion decisions progressively. In addition, we construct a challenging dataset that contains more complicated queries and larger scene graphs than existing datasets. Experiments on four benchmarks demonstrate the effectiveness of our approach, which surpasses the previous state-of-the-art model by large margins.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Performance of novel VUV-sensitive Silicon Photo-Multipliers for nEXO
Authors:
G. Gallina,
Y. Guan,
F. Retiere,
G. Cao,
A. Bolotnikov,
I. Kotov,
S. Rescia,
A. K. Soma,
T. Tsang,
L. Darroch,
T. Brunner,
J. Bolster,
J. R. Cohen,
T. Pinto Franco,
W. C. Gillis,
H. Peltz Smalley,
S. Thibado,
A. Pocar,
A. Bhat,
A. Jamil,
D. C. Moore,
G. Adhikari,
S. Al Kharusi,
E. Angelico,
I. J. Arnquist
, et al. (140 additional authors not shown)
Abstract:
Liquid xenon time projection chambers are promising detectors to search for neutrinoless double beta decay (0$νββ$), due to their response uniformity, monolithic sensitive volume, scalability to large target masses, and suitability for extremely low background operations. The nEXO collaboration has designed a tonne-scale time projection chamber that aims to search for 0$νββ$ of \ce{^{136}Xe} with…
▽ More
Liquid xenon time projection chambers are promising detectors to search for neutrinoless double beta decay (0$νββ$), due to their response uniformity, monolithic sensitive volume, scalability to large target masses, and suitability for extremely low background operations. The nEXO collaboration has designed a tonne-scale time projection chamber that aims to search for 0$νββ$ of \ce{^{136}Xe} with projected half-life sensitivity of $1.35\times 10^{28}$~yr. To reach this sensitivity, the design goal for nEXO is $\leq$1\% energy resolution at the decay $Q$-value ($2458.07\pm 0.31$~keV). Reaching this resolution requires the efficient collection of both the ionization and scintillation produced in the detector. The nEXO design employs Silicon Photo-Multipliers (SiPMs) to detect the vacuum ultra-violet, 175 nm scintillation light of liquid xenon. This paper reports on the characterization of the newest vacuum ultra-violet sensitive Fondazione Bruno Kessler VUVHD3 SiPMs specifically designed for nEXO, as well as new measurements on new test samples of previously characterised Hamamatsu VUV4 Multi Pixel Photon Counters (MPPCs). Various SiPM and MPPC parameters, such as dark noise, gain, direct crosstalk, correlated avalanches and photon detection efficiency were measured as a function of the applied over voltage and wavelength at liquid xenon temperature (163~K). The results from this study are used to provide updated estimates of the achievable energy resolution at the decay $Q$-value for the nEXO design.
△ Less
Submitted 25 November, 2022; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Model-based cross-correlation search for gravitational waves from the low-mass X-ray binary Scorpius X-1 in LIGO O3 data
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
C. Alléné,
A. Allocca,
P. A. Altin
, et al. (1670 additional authors not shown)
Abstract:
We present the results of a model-based search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1 using LIGO detector data from the third observing run of Advanced LIGO, Advanced Virgo and KAGRA. This is a semicoherent search which uses details of the signal model to coherently combine data separated by less than a specified coherence time, which can be adjusted to bala…
▽ More
We present the results of a model-based search for continuous gravitational waves from the low-mass X-ray binary Scorpius X-1 using LIGO detector data from the third observing run of Advanced LIGO, Advanced Virgo and KAGRA. This is a semicoherent search which uses details of the signal model to coherently combine data separated by less than a specified coherence time, which can be adjusted to balance sensitivity with computing cost. The search covered a range of gravitational-wave frequencies from 25Hz to 1600Hz, as well as ranges in orbital speed, frequency and phase determined from observational constraints. No significant detection candidates were found, and upper limits were set as a function of frequency. The most stringent limits, between 100Hz and 200Hz, correspond to an amplitude h0 of about 1e-25 when marginalized isotropically over the unknown inclination angle of the neutron star's rotation axis, or less than 4e-26 assuming the optimal orientation. The sensitivity of this search is now probing amplitudes predicted by models of torque balance equilibrium. For the usual conservative model assuming accretion at the surface of the neutron star, our isotropically-marginalized upper limits are close to the predicted amplitude from about 70Hz to 100Hz; the limits assuming the neutron star spin is aligned with the most likely orbital angular momentum are below the conservative torque balance predictions from 40Hz to 200Hz. Assuming a broader range of accretion models, our direct limits on gravitational-wave amplitude delve into the relevant parameter space over a wide range of frequencies, to 500Hz or more.
△ Less
Submitted 2 January, 2023; v1 submitted 6 September, 2022;
originally announced September 2022.
-
Structure-Preserving Graph Representation Learning
Authors:
Ruiyi Fang,
Liangjian Wen,
Zhao Kang,
Jianzhuang Liu
Abstract:
Though graph representation learning (GRL) has made significant progress, it is still a challenge to extract and embed the rich topological structure and feature information in an adequate way. Most existing methods focus on local structure and fail to fully incorporate the global topological structure. To this end, we propose a novel Structure-Preserving Graph Representation Learning (SPGRL) meth…
▽ More
Though graph representation learning (GRL) has made significant progress, it is still a challenge to extract and embed the rich topological structure and feature information in an adequate way. Most existing methods focus on local structure and fail to fully incorporate the global topological structure. To this end, we propose a novel Structure-Preserving Graph Representation Learning (SPGRL) method, to fully capture the structure information of graphs. Specifically, to reduce the uncertainty and misinformation of the original graph, we construct a feature graph as a complementary view via k-Nearest Neighbor method. The feature graph can be used to contrast at node-level to capture the local relation. Besides, we retain the global topological structure information by maximizing the mutual information (MI) of the whole graph and feature embeddings, which is theoretically reduced to exchanging the feature embeddings of the feature and the original graphs to reconstruct themselves. Extensive experiments show that our method has quite superior performance on semi-supervised node classification task and excellent robustness under noise perturbation on graph structure or node features.
△ Less
Submitted 7 December, 2022; v1 submitted 1 September, 2022;
originally announced September 2022.
-
The alignment between brightest cluster galaxies and host clusters
Authors:
Z. S. Yuan,
Z. L. Wen
Abstract:
The alignment between brightest cluster galaxies (BCGs) and host clusters can reveal the mystery of formation and evolution for galaxy clusters. We measure cluster orientations in optical based on the projected distribution of member galaxies and in X-ray by fitting the morphology of intra-cluster medium (ICM). Cluster orientations determined in the two wavelengths are generally consistent. The or…
▽ More
The alignment between brightest cluster galaxies (BCGs) and host clusters can reveal the mystery of formation and evolution for galaxy clusters. We measure cluster orientations in optical based on the projected distribution of member galaxies and in X-ray by fitting the morphology of intra-cluster medium (ICM). Cluster orientations determined in the two wavelengths are generally consistent. The orientation alignment between BCGs and host clusters is confirmed and more significant than previous works. We find that BCGs are more aligned with cluster orientations measured in X-ray than those from optical data. Clusters with a brighter BCG generally show a stronger alignment. We argue that the detected redshift evolution of the alignment is probably caused by observational bias rather than intrinsic evolution. The alignment is not related to the ellipticity of BCGs, and the richness, ellipticity and dynamical state of host clusters. The strong alignment between BCGs and morphology of ICMs may be the consequence of the co-evolution between the central massive galaxy and host clusters.
△ Less
Submitted 31 August, 2022;
originally announced September 2022.
-
Bottomonium sequential suppression and strong heavy-quark potential in heavy-ion collisions
Authors:
Liuyuan Wen,
Baoyi Chen
Abstract:
We employ the time-dependent Schrödinger equation with different complex potentials to study the bottomonium sequential suppression in Pb-Pb collisions at $\sqrt{s_{NN}}=2.76$ TeV and 5.02 TeV and Au-Au collisions at $\sqrt{s_{NN}}=200$ GeV. Both color screening effect and the random scatterings with thermal partons are considered in the real and imaginary parts of the heavy-quark potentials. As t…
▽ More
We employ the time-dependent Schrödinger equation with different complex potentials to study the bottomonium sequential suppression in Pb-Pb collisions at $\sqrt{s_{NN}}=2.76$ TeV and 5.02 TeV and Au-Au collisions at $\sqrt{s_{NN}}=200$ GeV. Both color screening effect and the random scatterings with thermal partons are considered in the real and imaginary parts of the heavy-quark potentials. As the real part of the heavy-quark potential is between the free energy $F(T,r)$ and the internal energy $U(T,r)$ of heavy quarkonium, we parametrize different potentials with a function of $F$ and $U$ to evolve the bottomonium wave packages in the medium. We find that when the real part of the potential is close to $U(T,r)$, it can explain well the pattern of bottomonium sequential suppression where their nuclear modification factors satisfy the relation $R_{AA}(1s)>R_{AA}(2s)>R_{AA}(3s)$ observed in experiments. In the other limit of $F(T,r)$, bottomonium wave packages tend to expand due to weak attracted force, which results in evident transitions from $Υ(2s)$ to $Υ(3s)$ components and does not satisfy the sequential suppression pattern. We suggest that the bottomonium sequential suppression can be a probe of strong heavy-quark potential in the medium.
△ Less
Submitted 9 March, 2023; v1 submitted 22 August, 2022;
originally announced August 2022.
-
Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph
Authors:
Aiwei Liu,
Xuming Hu,
Li Lin,
Lijie Wen
Abstract:
The generalizability to new databases is of vital importance to Text-to-SQL systems which aim to parse human utterances into SQL statements. Existing works achieve this goal by leveraging the exact matching method to identify the lexical matching between the question words and the schema items. However, these methods fail in other challenging scenarios, such as the synonym substitution in which th…
▽ More
The generalizability to new databases is of vital importance to Text-to-SQL systems which aim to parse human utterances into SQL statements. Existing works achieve this goal by leveraging the exact matching method to identify the lexical matching between the question words and the schema items. However, these methods fail in other challenging scenarios, such as the synonym substitution in which the surface form differs between the corresponding question words and schema items. In this paper, we propose a framework named ISESL-SQL to iteratively build a semantic enhanced schema-linking graph between question tokens and database schemas. First, we extract a schema linking graph from PLMs through a probing procedure in an unsupervised manner. Then the schema linking graph is further optimized during the training process through a deep graph learning method. Meanwhile, we also design an auxiliary task called graph regularization to improve the schema information mentioned in the schema-linking graph. Extensive experiments on three benchmarks demonstrate that ISESL-SQL could consistently outperform the baselines and further investigations show its generalizability and robustness.
△ Less
Submitted 7 August, 2022;
originally announced August 2022.
-
Rapid localization of gravitational wave sources from compact binary coalescences using deep learning
Authors:
Chayan Chatterjee,
Manoj Kovalam,
Linqing Wen,
Damon Beveridge,
Foivos Diakogiannis,
Kevin Vinsen
Abstract:
The mergers of neutron star-neutron star and neutron star-black hole binaries are the most promising gravitational wave events with electromagnetic counterparts. The rapid detection, localization and simultaneous multi-messenger follow-up of these sources is of primary importance in the upcoming science runs of the LIGO-Virgo-KAGRA Collaboration. While prompt electromagnetic counterparts during bi…
▽ More
The mergers of neutron star-neutron star and neutron star-black hole binaries are the most promising gravitational wave events with electromagnetic counterparts. The rapid detection, localization and simultaneous multi-messenger follow-up of these sources is of primary importance in the upcoming science runs of the LIGO-Virgo-KAGRA Collaboration. While prompt electromagnetic counterparts during binary mergers can last less than two seconds, the time scales of existing localization methods that use Bayesian techniques, varies from seconds to days. In this paper, we propose the first deep learning-based approach for rapid and accurate sky localization of all types of binary coalescences, including neutron star-neutron star and neutron star-black hole binaries for the first time. Specifically, we train and test a normalizing flow model on matched-filtering output from gravitational wave searches. Our model produces sky direction posteriors in milliseconds using a single P100 GPU, which is three to six orders of magnitude faster than Bayesian techniques.
△ Less
Submitted 5 December, 2023; v1 submitted 29 July, 2022;
originally announced July 2022.
-
Self-Supervision Can Be a Good Few-Shot Learner
Authors:
Yuning Lu,
Liangjian Wen,
Jianzhuang Liu,
Yajing Liu,
Xinmei Tian
Abstract:
Existing few-shot learning (FSL) methods rely on training with a large labeled dataset, which prevents them from leveraging abundant unlabeled data. From an information-theoretic perspective, we propose an effective unsupervised FSL method, learning representations with self-supervision. Following the InfoMax principle, our method learns comprehensive representations by capturing the intrinsic str…
▽ More
Existing few-shot learning (FSL) methods rely on training with a large labeled dataset, which prevents them from leveraging abundant unlabeled data. From an information-theoretic perspective, we propose an effective unsupervised FSL method, learning representations with self-supervision. Following the InfoMax principle, our method learns comprehensive representations by capturing the intrinsic structure of the data. Specifically, we maximize the mutual information (MI) of instances and their representations with a low-bias MI estimator to perform self-supervised pre-training. Rather than supervised pre-training focusing on the discriminable features of the seen classes, our self-supervised model has less bias toward the seen classes, resulting in better generalization for unseen classes. We explain that supervised pre-training and self-supervised pre-training are actually maximizing different MI objectives. Extensive experiments are further conducted to analyze their FSL performance with various training settings. Surprisingly, the results show that self-supervised pre-training can outperform supervised pre-training under the appropriate conditions. Compared with state-of-the-art FSL methods, our approach achieves comparable performance on widely used FSL benchmarks without any labels of the base classes.
△ Less
Submitted 19 July, 2022;
originally announced July 2022.
-
Development of silicon interposer: towards an ultralow radioactivity background photodetector system
Authors:
Haibo Yang,
Qidong Wang,
Guofu Cao,
Kali M. Melby,
Khadouja Harouaka,
Isaac J. Arnquist,
Fengwei Dai,
Liqiang Cao,
Liangjian Wen
Abstract:
It is of great importance to develop a photodetector system with an ultralow radioactivity background in rare event searches. Silicon photomultipliers (SiPMs) and application-specific integrated circuits (ASICs) are two ideal candidates for low background photosensors and readout electronics, respectively, because they are mainly composed of silicon, which can achieve good radio-purity without con…
▽ More
It is of great importance to develop a photodetector system with an ultralow radioactivity background in rare event searches. Silicon photomultipliers (SiPMs) and application-specific integrated circuits (ASICs) are two ideal candidates for low background photosensors and readout electronics, respectively, because they are mainly composed of silicon, which can achieve good radio-purity without considerable extra effort. However, interposers, used to provide mechanical support and signal routes between the photosensor and the electronics, are a bottleneck in building ultralow background photodetectors. Silicon and quartz are two candidates to construct the low background interposer because of their good radio-purity; nevertheless, it is non-trivial to produce through silicon vias (TSV) or through quartz vias (TQV) on the large area silicon or quartz wafer. In this work, based on double-sided TSV interconnect technology, we developed the first prototype of a silicon interposer with a size of 10~cm$\times$10~cm and a thickness of 320~$μ$m. The electrical properties of the interposer are carefully evaluated at room temperature, and its performance is also examined at -110~$^\circ$C with an integrated SiPM on the interposer. The testing results reveal quite promising performance of the prototype, and the single photoelectron signals can be clearly observed from the SiPM. The features of the observed signals are comparable with those from the SiPM mounted on a normal FR4-based PCB. Based on the success of the silicon interposer prototype, we started the follow-up studies that aimed to further improve the performance and yield of the silicon interposer, and eventually to provide a solution for building an ultralow background photodetector system.
△ Less
Submitted 19 July, 2022;
originally announced July 2022.
-
Dual-Stream Transformer for Generic Event Boundary Captioning
Authors:
Xin Gu,
Hanhua Ye,
Guang Chen,
Yufei Wang,
Libo Zhang,
Longyin Wen
Abstract:
This paper describes our champion solution for the CVPR2022 Generic Event Boundary Captioning (GEBC) competition. GEBC requires the captioning model to have a comprehension of instantaneous status changes around the given video boundary, which makes it much more challenging than conventional video captioning task. In this paper, a Dual-Stream Transformer with improvements on both video content enc…
▽ More
This paper describes our champion solution for the CVPR2022 Generic Event Boundary Captioning (GEBC) competition. GEBC requires the captioning model to have a comprehension of instantaneous status changes around the given video boundary, which makes it much more challenging than conventional video captioning task. In this paper, a Dual-Stream Transformer with improvements on both video content encoding and captions generation is proposed: (1) We utilize three pre-trained models to extract the video features from different granularities. Moreover, we exploit the types of boundary as hints to help the model generate captions. (2) We particularly design an model, termed as Dual-Stream Transformer, to learn discriminative representations for boundary captioning. (3) Towards generating content-relevant and human-like captions, we improve the description quality by designing a word-level ensemble strategy. The promising results on the GEBC test split demonstrate the efficacy of our proposed model.
△ Less
Submitted 24 March, 2023; v1 submitted 6 July, 2022;
originally announced July 2022.
-
Search for MeV Electron Recoils from Dark Matter in EXO-200
Authors:
EXO-200 Collaboration,
:,
S. Al Kharusi,
G. Anton,
I. Badhrees,
P. S. Barbeau,
D. Beck,
V. Belov,
T. Bhatta,
M. Breidenbach,
T. Brunner,
G. F. Cao,
W. R. Cen,
C. Chambers,
B. Cleveland,
M. Coon,
A. Craycraft,
T. Daniels,
L. Darroch,
S. J. Daugherty,
J. Davis,
S. Delaquis,
A. Der Mesrobian-Kabakian,
R. DeVoe,
J. Dilling
, et al. (83 additional authors not shown)
Abstract:
We present a search for electron-recoil signatures from the charged-current absorption of fermionic dark matter using the EXO-200 detector. We report an average electron recoil background rate of $6.8 \times 10^{-4}\, \mathrm{cts}\,\mathrm{kg}^{-1}\mathrm{yr}^{-1}\mathrm{keV}^{-1}$ above $4\,\mathrm{MeV}$ and find no statistically significant excess over our background projection. Using a total…
▽ More
We present a search for electron-recoil signatures from the charged-current absorption of fermionic dark matter using the EXO-200 detector. We report an average electron recoil background rate of $6.8 \times 10^{-4}\, \mathrm{cts}\,\mathrm{kg}^{-1}\mathrm{yr}^{-1}\mathrm{keV}^{-1}$ above $4\,\mathrm{MeV}$ and find no statistically significant excess over our background projection. Using a total ${}^{136}\mathrm{Xe}$ exposure of $234.1\,\mathrm{kg}\,\mathrm{yr}$ we exclude new parameter space for the charged-current absorption cross-section for dark matter masses between $m_χ= 2.6\,\mathrm{MeV} - 11.6\,\mathrm{MeV}$ with a minimum of $6\times 10^{-51}\,\mathrm{cm}^2$ at $8.3\,\mathrm{MeV}$ at the $90\%$ confidence level.
△ Less
Submitted 20 February, 2023; v1 submitted 2 July, 2022;
originally announced July 2022.
-
SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection
Authors:
Dexiang Hong,
Xiaoqi Ma,
Xinyao Wang,
Congcong Li,
Yufei Wang,
Longyin Wen
Abstract:
This report presents the algorithm used in the submission of Generic Event Boundary Detection (GEBD) Challenge at CVPR 2022. In this work, we improve the existing Structured Context Transformer (SC-Transformer) method for GEBD. Specifically, a transformer decoder module is added after transformer encoders to extract high quality frame features. The final classification is performed jointly on the…
▽ More
This report presents the algorithm used in the submission of Generic Event Boundary Detection (GEBD) Challenge at CVPR 2022. In this work, we improve the existing Structured Context Transformer (SC-Transformer) method for GEBD. Specifically, a transformer decoder module is added after transformer encoders to extract high quality frame features. The final classification is performed jointly on the results of the original binary classifier and a newly introduced multi-class classifier branch. To enrich motion information, optical flow is introduced as a new modality. Finally, model ensemble is used to further boost performance. The proposed method achieves 86.49% F1 score on Kinetics-GEBD test set. which improves 2.86% F1 score compared to the previous SOTA method.
△ Less
Submitted 25 June, 2022;
originally announced June 2022.
-
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking
Authors:
Xuming Hu,
Zhijiang Guo,
Guanyu Wu,
Aiwei Liu,
Lijie Wen,
Philip S. Yu
Abstract:
The explosion of misinformation spreading in the media ecosystem urges for automated fact-checking. While misinformation spans both geographic and linguistic boundaries, most work in the field has focused on English. Datasets and tools available in other languages, such as Chinese, are limited. In order to bridge this gap, we construct CHEF, the first CHinese Evidence-based Fact-checking dataset o…
▽ More
The explosion of misinformation spreading in the media ecosystem urges for automated fact-checking. While misinformation spans both geographic and linguistic boundaries, most work in the field has focused on English. Datasets and tools available in other languages, such as Chinese, are limited. In order to bridge this gap, we construct CHEF, the first CHinese Evidence-based Fact-checking dataset of 10K real-world claims. The dataset covers multiple domains, ranging from politics to public health, and provides annotated evidence retrieved from the Internet. Further, we develop established baselines and a novel approach that is able to model the evidence retrieval as a latent variable, allowing jointly training with the veracity prediction model in an end-to-end fashion. Extensive experiments show that CHEF will provide a challenging testbed for the development of fact-checking systems designed to retrieve and reason over non-English claims.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
ET White Paper: To Find the First Earth 2.0
Authors:
Jian Ge,
Hui Zhang,
Weicheng Zang,
Hongping Deng,
Shude Mao,
Ji-Wei Xie,
Hui-Gen Liu,
Ji-Lin Zhou,
Kevin Willis,
Chelsea Huang,
Steve B. Howell,
Fabo Feng,
Jiapeng Zhu,
Xinyu Yao,
Beibei Liu,
Masataka Aizawa,
Wei Zhu,
Ya-Ping Li,
Bo Ma,
Quanzhi Ye,
Jie Yu,
Maosheng Xiang,
Cong Yu,
Shangfei Liu,
Ming Yang
, et al. (142 additional authors not shown)
Abstract:
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500…
▽ More
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500 square degrees. Staring in the direction that encompasses the original Kepler field for four continuous years, this monitoring will return tens of thousands of transiting planets, including the elusive Earth twins orbiting solar-type stars. The seventh telescope is a 30cm microlensing telescope that will monitor an area of 4 square degrees toward the galactic bulge. This, combined with simultaneous ground-based KMTNet observations, will measure masses for hundreds of long-period and free-floating planets. Together, the transit and the microlensing telescopes will revolutionize our understandings of terrestrial planets across a large swath of orbital distances and free space. In addition, the survey data will also facilitate studies in the fields of asteroseismology, Galactic archeology, time-domain sciences, and black holes in binaries.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
Structured Context Transformer for Generic Event Boundary Detection
Authors:
Congcong Li,
Xinyao Wang,
Dexiang Hong,
Yufei Wang,
Libo Zhang,
Tiejian Luo,
Longyin Wen
Abstract:
Generic Event Boundary Detection (GEBD) aims to detect moments where humans naturally perceive as event boundaries. In this paper, we present Structured Context Transformer (or SC-Transformer) to solve the GEBD task, which can be trained in an end-to-end fashion. Specifically, we use the backbone convolutional neural network (CNN) to extract the features of each video frame. To capture temporal co…
▽ More
Generic Event Boundary Detection (GEBD) aims to detect moments where humans naturally perceive as event boundaries. In this paper, we present Structured Context Transformer (or SC-Transformer) to solve the GEBD task, which can be trained in an end-to-end fashion. Specifically, we use the backbone convolutional neural network (CNN) to extract the features of each video frame. To capture temporal context information of each frame, we design the structure context transformer (SC-Transformer) by re-partitioning input frame sequence. Note that, the overall computation complexity of SC-Transformer is linear to the video length. After that, the group similarities are computed to capture the differences between frames. Then, a lightweight fully convolutional network is used to determine the event boundaries based on the grouped similarity maps. To remedy the ambiguities of boundary annotations, the Gaussian kernel is adopted to preprocess the ground-truth event boundaries to further boost the accuracy. Extensive experiments conducted on the challenging Kinetics-GEBD and TAPOS datasets demonstrate the effectiveness of the proposed method compared to the state-of-the-art methods.
△ Less
Submitted 6 June, 2022;
originally announced June 2022.
-
A Multi-level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference
Authors:
Shu'ang Li,
Xuming Hu,
Li Lin,
Aiwei Liu,
Lijie Wen,
Philip S. Yu
Abstract:
Natural Language Inference (NLI) is a growingly essential task in natural language understanding, which requires inferring the relationship between the sentence pairs (premise and hypothesis). Recently, low-resource natural language inference has gained increasing attention, due to significant savings in manual annotation costs and a better fit with real-world scenarios. Existing works fail to cha…
▽ More
Natural Language Inference (NLI) is a growingly essential task in natural language understanding, which requires inferring the relationship between the sentence pairs (premise and hypothesis). Recently, low-resource natural language inference has gained increasing attention, due to significant savings in manual annotation costs and a better fit with real-world scenarios. Existing works fail to characterize discriminative representations between different classes with limited training data, which may cause faults in label prediction. Here we propose a multi-level supervised contrastive learning framework named MultiSCL for low-resource natural language inference. MultiSCL leverages a sentence-level and pair-level contrastive learning objective to discriminate between different classes of sentence pairs by bringing those in one class together and pushing away those in different classes. MultiSCL adopts a data augmentation module that generates different views for input samples to better learn the latent representation. The pair-level representation is obtained from a cross attention module. We conduct extensive experiments on two public NLI datasets in low-resource settings, and the accuracy of MultiSCL exceeds other models by 3.1% on average. Moreover, our method outperforms the previous state-of-the-art method on cross-domain tasks of text classification.
△ Less
Submitted 31 May, 2022;
originally announced May 2022.