Search | arXiv e-print repository

Multi-MLLM Knowledge Distillation for Out-of-Context News Detection

Authors: Yimeng Gu, Zhao Tong, Ignacio Castro, Shu Wu, Gareth Tyson

Abstract: Multimodal out-of-context news is a type of misinformation in which the image is used outside of its original context. Many existing works have leveraged multimodal large language models (MLLMs) for detecting out-of-context news. However, observing the limited zero-shot performance of smaller MLLMs, they generally require label-rich fine-tuning and/or expensive API calls to GPT models to improve t… ▽ More Multimodal out-of-context news is a type of misinformation in which the image is used outside of its original context. Many existing works have leveraged multimodal large language models (MLLMs) for detecting out-of-context news. However, observing the limited zero-shot performance of smaller MLLMs, they generally require label-rich fine-tuning and/or expensive API calls to GPT models to improve the performance, which is impractical in low-resource scenarios. In contrast, we aim to improve the performance of small MLLMs in a more label-efficient and cost-effective manner. To this end, we first prompt multiple teacher MLLMs to generate both label predictions and corresponding rationales, which collectively serve as the teachers' knowledge. We then introduce a two-stage knowledge distillation framework to transfer this knowledge to a student MLLM. In Stage 1, we apply LoRA fine-tuning to the student model using all training data. In Stage 2, we further fine-tune the student model using both LoRA fine-tuning and DPO on the data points where teachers' predictions conflict. This two-stage strategy reduces annotation costs and helps the student model uncover subtle patterns in more challenging cases. Experimental results demonstrate that our approach achieves state-of-the-art performance using less than 10% labeled data. △ Less

Submitted 28 May, 2025; originally announced May 2025.

arXiv:2505.20837 [pdf, ps, other]

Observation of charge density wave excitonic order parameter in topological insulator monolayer WTe2

Authors: Liam Watson, Joan Ripoll, Zhengjue Tong, Amit Kumar, Yande Que, Yang-Hao Chan, Hsin Lin, Shantanu Mukherjee, Manuela Garnica, Mark T. Edmonds, Michał Papaj, Amadeo L. Vazquez de Parga, Bent Weber, Iolanda Di Bernardo, Michael S. Fuhrer

Abstract: Strong electron-hole interactions in a semimetal or narrow-gap semiconductor may drive a ground state of condensed excitons. Monolayer WTe2 has been proposed as a host material for such an exciton condensate, but the order parameter - the key signature of a macroscopic quantum-coherent condensate - has not been observed. Here we use Fourier-transform scanning tunnelling spectroscopy (FT-STS) to st… ▽ More Strong electron-hole interactions in a semimetal or narrow-gap semiconductor may drive a ground state of condensed excitons. Monolayer WTe2 has been proposed as a host material for such an exciton condensate, but the order parameter - the key signature of a macroscopic quantum-coherent condensate - has not been observed. Here we use Fourier-transform scanning tunnelling spectroscopy (FT-STS) to study quasi-particle interference (QPI) and periodic modulations of the local density of states (LDOS) in monolayer WTe2. In WTe2 on graphene, in which the carrier density can be varied via back-gating, FT-STS shows QPI features in the 2D bulk bands, confirming the interacting nature of the bandgap in neutral WTe2 and the semi-metallic nature of highly n- and p-doped WTe2. We observe additional non-dispersive spatial modulations in the LDOS imprinted on the topological edge mode of neutral WTe2 on metallic substrates (graphene and graphite), which we interpret as the interaction of the topological edge mode with the expected charge density wave order parameter of the excitonic condensate in WTe2 at low interaction strength due to screening by the metallic substrates. △ Less

Submitted 27 May, 2025; originally announced May 2025.

Comments: 14 pages, 4 figures in the main text

arXiv:2505.20371 [pdf]

Theoretical study on charge transfer properties of triphenylamino-ethynyl Polycyclic Aromatic Hydrocarbon derivatives

Authors: Zhipeng Tong, Xiaoqi Sun, Guiya Qin, Jinpu Bai, Qi Zhao, Aimin Ren, Jingfu Guo

Abstract: This study systematically investigates the regulation mechanisms of backbone topology (tri-/tetracyclic arenes), substitution positions, and functional groups on charge transport properties through molecular design of triphenylamine-ethynylene fused acene derivatives. By integrating Marcus charge transfer theory with kinetic Monte Carlo simulations, we demonstrate that sulfur-doped tricyclic arene… ▽ More This study systematically investigates the regulation mechanisms of backbone topology (tri-/tetracyclic arenes), substitution positions, and functional groups on charge transport properties through molecular design of triphenylamine-ethynylene fused acene derivatives. By integrating Marcus charge transfer theory with kinetic Monte Carlo simulations, we demonstrate that sulfur-doped tricyclic arene backbones (benzodithiophene and anthracene) effectively suppress high-frequency vibrational modes reducing reorganization energy to 146.1 meV. Concurrent optimization of intermolecular $π$-$π$ slippage enhances 2D hole mobility. Notably, asymmetric charge transport pathways in 2,7-disubstituted pyrene(27DTEP) decrease transfer integrals by 34%, while 1,6-substitution (16DTEP)reconstructs HOMO orbital distribution and induces rotational stacking, boosting transfer integrals by 28% and improving mobility isotropy. We further propose a "backbone-functional group synergy" strategy, revealing that concentrated orbital localization on the backbone amplifies transfer integral gains, outweighing the 38% increase in reorganization energy and significantly enhancing mobility. These findings establish a theoretical framework and quantitative model for the rational design of high-mobility organic ultraviolet photodetectors. △ Less

Submitted 26 May, 2025; originally announced May 2025.

Comments: in Chinese language

arXiv:2505.20228 [pdf]

Theoretical Study of Charge Transport Properties of Curved PAH Organic Semiconductors

Authors: Hengyu Jin, Xiaoqi Sun, Guiya Qin, Zhipeng Tong, Rui Wang, Qi Zhao, Ai-Min Ren, Jingfu Guo

Abstract: Curved polycyclic aromatic hydrocarbons (PAHs) exhibit distinctive geometric and electronic structures, rendering them highly promising in addressing issues of solubility and air stability, which are faced for large linear arene $π$-conjugated organic semiconductors. In this study, a series of surface-curved PAHs and the heteroatom doped derivatives are selected and designed, and the relationship… ▽ More Curved polycyclic aromatic hydrocarbons (PAHs) exhibit distinctive geometric and electronic structures, rendering them highly promising in addressing issues of solubility and air stability, which are faced for large linear arene $π$-conjugated organic semiconductors. In this study, a series of surface-curved PAHs and the heteroatom doped derivatives are selected and designed, and the relationship between electronic structure and charge transport properties of these molecules is investigated by using density functional theory (DFT). And the effects of sulfur/oxygen, nitrogen and boron doping on the charge transport performance of curved PAH semiconductors are explored. The results show that curved PAHs exhibit improved solubility and stability, with the degree of molecular curvature significantly affecting the material's transport properties. △ Less

Submitted 26 May, 2025; originally announced May 2025.

arXiv:2505.11066 [pdf, ps, other]

A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware

Authors: Rui Wang, Shichun Yang, Yuyi Chen, Zhuoyang Li, Zexiang Tong, Jianyi Xu, Jiayi Lu, Xinjie Feng, Yaoguang Cao

Abstract: Road terrains play a crucial role in ensuring the driving safety of autonomous vehicles (AVs). However, existing sensors of AVs, including cameras and Lidars, are susceptible to variations in lighting and weather conditions, making it challenging to achieve real-time perception of road conditions. In this paper, we propose an illumination-aware multi-modal fusion network (IMF), which leverages bot… ▽ More Road terrains play a crucial role in ensuring the driving safety of autonomous vehicles (AVs). However, existing sensors of AVs, including cameras and Lidars, are susceptible to variations in lighting and weather conditions, making it challenging to achieve real-time perception of road conditions. In this paper, we propose an illumination-aware multi-modal fusion network (IMF), which leverages both exteroceptive and proprioceptive perception and optimizes the fusion process based on illumination features. We introduce an illumination-perception sub-network to accurately estimate illumination features. Moreover, we design a multi-modal fusion network which is able to dynamically adjust weights of different modalities according to illumination features. We enhance the optimization process by pre-training of the illumination-perception sub-network and incorporating illumination loss as one of the training constraints. Extensive experiments demonstrate that the IMF shows a superior performance compared to state-of-the-art methods. The comparison results with single modality perception methods highlight the comprehensive advantages of multi-modal fusion in accurately perceiving road terrains under varying lighting conditions. Our dataset is available at: https://github.com/lindawang2016/IMF. △ Less

Submitted 16 May, 2025; originally announced May 2025.

arXiv:2505.03210 [pdf, other]

Weighted Birkhoff averages: Deterministic and probabilistic perspectives

Authors: Zhicheng Tong, Yong Li

Abstract: In this paper, we present physically related applications of a class of weighted quasi-Monte Carlo methods from a deterministic perspective, and establish quantitative universal rapid convergence results via various regularity assumptions. Specifically, we introduce weighting with compact support to the Birkhoff ergodic averages of quasi-periodic, almost periodic, and periodic systems, thereby ach… ▽ More In this paper, we present physically related applications of a class of weighted quasi-Monte Carlo methods from a deterministic perspective, and establish quantitative universal rapid convergence results via various regularity assumptions. Specifically, we introduce weighting with compact support to the Birkhoff ergodic averages of quasi-periodic, almost periodic, and periodic systems, thereby achieving universal rapid convergence, including both arbitrary polynomial and exponential types. This is in stark contrast to the typically slow convergence in classical ergodic theory. These results, as new contributions, not only discuss more general weighting functions but also provide quantitative improvements to existing results, and the explicit regularity settings facilitate the application of these methods to specific problems. We also revisit the physically related problems and, for the first time, establish universal exponential convergence results for the weighted computation of Fourier coefficients, in both finite-dimensional and infinite-dimensional cases. In addition to the above, we explore results from a probabilistic perspective, including the weighted strong law of large numbers and the weighted central limit theorem, by building upon the historical results. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: 37 pages, 5 figures

MSC Class: 37A25; 37A30; 37A44; 37A46; 60F05

arXiv:2504.00730 [pdf, other]

Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example

Authors: Jiayuan She, Lin Shi, Peiqi Li, Ziling Dong, Renxing Li, Shengkai Li, Liping Gu, Zhao Tong, Zhuochang Yang, Yajie Ji, Liang Feng, Jiangang Chen

Abstract: Background. Infectious diseases, particularly COVID-19, continue to be a significant global health issue. Although many countries have reduced or stopped large-scale testing measures, the detection of such diseases remains a propriety. Objective. This study aims to develop a novel, lightweight deep neural network for efficient, accurate, and cost-effective detection of COVID-19 using a nasal breat… ▽ More Background. Infectious diseases, particularly COVID-19, continue to be a significant global health issue. Although many countries have reduced or stopped large-scale testing measures, the detection of such diseases remains a propriety. Objective. This study aims to develop a novel, lightweight deep neural network for efficient, accurate, and cost-effective detection of COVID-19 using a nasal breathing audio data collected via smartphones. Methodology. Nasal breathing audio from 128 patients diagnosed with the Omicron variant was collected. Mel-Frequency Cepstral Coefficients (MFCCs), a widely used feature in speech and sound analysis, were employed for extracting important characteristics from the audio signals. Additional feature selection was performed using Random Forest (RF) and Principal Component Analysis (PCA) for dimensionality reduction. A Dense-ReLU-Dropout model was trained with K-fold cross-validation (K=3), and performance metrics like accuracy, precision, recall, and F1-score were used to evaluate the model. Results. The proposed model achieved 97% accuracy in detecting COVID-19 from nasal breathing sounds, outperforming state-of-the-art methods such as those by [23] and [13]. Our Dense-ReLU-Dropout model, using RF and PCA for feature selection, achieves high accuracy with greater computational efficiency compared to existing methods that require more complex models or larger datasets. Conclusion. The findings suggest that the proposed method holds significant potential for clinical implementation, advancing smartphone-based diagnostics in infectious diseases. The Dense-ReLU-Dropout model, combined with innovative feature processing techniques, offers a promising approach for efficient and accurate COVID-19 detection, showcasing the capabilities of mobile device-based diagnostics △ Less

Submitted 19 April, 2025; v1 submitted 1 April, 2025; originally announced April 2025.

Comments: 14 pages, 5 figures, 6 tables

arXiv:2503.18158 [pdf, ps, other]

doi 10.1103/nr92-jvt3

New constraints on cosmic ray-boosted dark matter from the LUX-ZEPLIN experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araujo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, K. Beattie, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer, C. A. J. Brew , et al. (179 additional authors not shown)

Abstract: While dual-phase xenon time projection chambers (TPCs) have driven the sensitivity towards weakly interacting massive particles (WIMPs) at the GeV/c^2 to TeV/c^2 mass scale, the scope for sub-GeV/c^2 dark matter particles is hindered by a limited nuclear recoil energy detection threshold. One approach to probe for lighter candidates is to consider cases where they have been boosted by collisions w… ▽ More While dual-phase xenon time projection chambers (TPCs) have driven the sensitivity towards weakly interacting massive particles (WIMPs) at the GeV/c^2 to TeV/c^2 mass scale, the scope for sub-GeV/c^2 dark matter particles is hindered by a limited nuclear recoil energy detection threshold. One approach to probe for lighter candidates is to consider cases where they have been boosted by collisions with cosmic rays in the Milky Way, such that the additional kinetic energy lifts their induced signatures above the nominal threshold. In this Letter, we report first results of a search for cosmic ray-boosted dark matter (CRDM) with a combined 4.2 tonne-year exposure from the LUX-ZEPLIN (LZ) experiment. We observe no excess above the expected backgrounds and establish world-leading constraints on the spin-independent CRDM-nucleon cross section as small as 3.9 * 10^{-33} cm^2 at 90% confidence level for sub-GeV/c^2 masses. △ Less

Submitted 2 June, 2025; v1 submitted 23 March, 2025; originally announced March 2025.

Journal ref: Physical Review Letters 134, 241801 (2025)

arXiv:2503.05679 [pdf, ps, other]

Measurements and models of enhanced recombination following inner-shell vacancies in liquid xenon

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, D. Bauer, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger , et al. (193 additional authors not shown)

Abstract: Electron-capture decays of $^{125}$Xe and $^{127}$Xe, and double-electron-capture decays of $^{124}$Xe, are backgrounds in searches for weakly interacting massive particles (WIMPs) conducted by dual-phase xenon time projection chambers such as LUX-ZEPLIN (LZ). These decays produce signals with more light and less charge than equivalent-energy $β$ decays, and correspondingly overlap more with WIMP… ▽ More Electron-capture decays of $^{125}$Xe and $^{127}$Xe, and double-electron-capture decays of $^{124}$Xe, are backgrounds in searches for weakly interacting massive particles (WIMPs) conducted by dual-phase xenon time projection chambers such as LUX-ZEPLIN (LZ). These decays produce signals with more light and less charge than equivalent-energy $β$ decays, and correspondingly overlap more with WIMP signals. We measure three electron-capture charge yields in LZ: the 1.1~keV M-shell, 5.2~keV L-shell, and 33.2~keV K-shell at drift fields of 193 and 96.5~V/cm. The LL double-electron-capture decay of $^{124}$Xe exhibits even more pronounced shifts in charge and light. We provide a first model of double-electron-capture charge yields using the link between ionization density and electron-ion recombination, and identify a need for more accurate calculations. Finally, we discuss the implications of the reduced charge yield of these decays and other interactions creating inner-shell vacancies for future dark matter searches. △ Less

Submitted 17 June, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

Comments: 16 pages, 11 figures

arXiv:2501.11082 [pdf]

Ultrahigh interfacial thermal conductance for cooling gallium oxide electronics using cubic boron arsenide

Authors: Wenjiang Zhou, Nianjie Liang, Wei Xiao, Zhaofei Tong, Fei Tian, Bai Song

Abstract: Gallium oxide (Ga$_2$O$_3$) has attracted significant interest for its unique potential especially in power electronics. However, its low and anisotropic thermal conductivity poses a major challenge for heat dissipation. Here, we explore an effective cooling strategy centering on the heterogeneous integration of $β$-Ga$_2$O$_3$ devices with cubic boron arsenide (cBAs), an emerging material with an… ▽ More Gallium oxide (Ga$_2$O$_3$) has attracted significant interest for its unique potential especially in power electronics. However, its low and anisotropic thermal conductivity poses a major challenge for heat dissipation. Here, we explore an effective cooling strategy centering on the heterogeneous integration of $β$-Ga$_2$O$_3$ devices with cubic boron arsenide (cBAs), an emerging material with an ultrahigh thermal conductivity $κ$ of ~1300 Wm$^{-1}$K$^{-1}$. Machine-learned potentials for representative $β$-Ga$_2$O$_3$/cBAs interfaces are trained, enabling accurate and efficient calculation of the interfacial thermal conductance $G$ via nonequilibrium molecular dynamics. At 300 K, remarkable $G$ values of 749$\pm$33 MWm$^{-2}$K$^{-1}$ and 824$\pm$35 MWm$^{-2}$K$^{-1}$ are predicted for Ga-As and O-B bonding across the interface, respectively, which are primarily attributed to the well-matched phonon density of states considering the similar Debye temperatures of $β$-Ga$_2$O$_3$ and cBAs. Moreover, finite-element simulations directly show a notable device temperature reduction when comparing cBAs with other substrates. The simultaneously ultrahigh $κ$ and $G$ highlight cBAs as an ideal substrate for Ga$_2$O$_3$ electronics. △ Less

Submitted 11 February, 2025; v1 submitted 19 January, 2025; originally announced January 2025.

arXiv:2412.04854 [pdf, ps, other]

doi 10.1103/zhs9-65ds

First constraint for atmospheric millicharged particles with the LUX-ZEPLIN experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, D. Bauer, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger , et al. (193 additional authors not shown)

Abstract: We report on a search for millicharged particles (mCPs) produced in cosmic ray proton atmospheric interactions using data collected during the first science run of the LUX-ZEPLIN experiment. The mCPs produced by two processes -- meson decay and proton bremsstrahlung -- are considered in this study. This search utilized a novel signature unique to liquid xenon (LXe) time projection chambers (TPCs),… ▽ More We report on a search for millicharged particles (mCPs) produced in cosmic ray proton atmospheric interactions using data collected during the first science run of the LUX-ZEPLIN experiment. The mCPs produced by two processes -- meson decay and proton bremsstrahlung -- are considered in this study. This search utilized a novel signature unique to liquid xenon (LXe) time projection chambers (TPCs), allowing sensitivity to mCPs with masses ranging from 10 to 1000 MeV/c$^2$ and fractional charges between 0.001 and 0.02 of the electron charge e. With an exposure of 60 live days and a 5.5 tonne fiducial mass, we observed no significant excess over background. This represents the first experimental search for atmospheric mCPs and the first search for mCPs using an underground LXe experiment. △ Less

Submitted 9 June, 2025; v1 submitted 6 December, 2024; originally announced December 2024.

Journal ref: Physical Review Letters 134, 241802 (2025)

arXiv:2411.13140 [pdf, other]

Robust Convergency Indicator using High-dimension PID Controller in the presence of disturbance

Authors: Sheng Zimao, Yang Hongan, Wang Jiakang, Song Peng, Zhang Tong

Abstract: The PID controller currently occupies a prominent position as the most prevalent control architecture, which has achieved groundbreaking success across extensive implications. However, its parameters online regulation remains a formidable challenge. The majority of existing theories hinge on the linear constant system structure, contemplating only Single-Input, Single-Output (SISO) scenarios. Rest… ▽ More The PID controller currently occupies a prominent position as the most prevalent control architecture, which has achieved groundbreaking success across extensive implications. However, its parameters online regulation remains a formidable challenge. The majority of existing theories hinge on the linear constant system structure, contemplating only Single-Input, Single-Output (SISO) scenarios. Restricted research has been conducted on the intricate PID control problem within high-dimensional, Multi-Input, Multi-Output (MIMO) nonlinear systems that incorporate disturbances. This research, providing insights on the velocity form of nonlinear system, aims to bolster the controller's robustness. It establishes a quantitative metric to assess the robustness of high-dimensional PID controller, elucidates the pivotal theory regarding robustness's impact on error exponential convergence, and introduces a localized compensation strategy to optimize the robustness indicator. Guided by these theoretical insights, we exploit a robust high-dimensional PID (RH-PID) controller without the crutch of oversimplifying assumptions. Experimental results demonstrate the controller's commendable exponential stabilization efficacy and the controller exhibits exceptional robustness under the robust indicator's guidance. Notably, the robust convergence indicator can also effectively evaluate the comprehensive performance. △ Less

Submitted 20 November, 2024; originally announced November 2024.

Comments: 12 pages, 11 figures

arXiv:2410.19576 [pdf, other]

Unlocking high hole mobility in diamond over a wide temperature range via efficient shear strain

Authors: Jianshi Sun, Shouhang Li, Cheng Shao, Zhen Tong, Meng An, Yuhang Yao, Yue Hu, Xiongfei Zhu, Yifan Liu, Renzong Wang, Xiangjun Liu, Thomas Frauenheim

Abstract: As a wide bandgap semiconductor, diamond holds both excellent electrical and thermal properties, making it highly promising in the electrical industry. However, its hole mobility is relatively low and dramatically decreases with increasing temperature, which severely limits further applications. Herein, we proposed that the hole mobility can be efficiently enhanced via slight compressive shear str… ▽ More As a wide bandgap semiconductor, diamond holds both excellent electrical and thermal properties, making it highly promising in the electrical industry. However, its hole mobility is relatively low and dramatically decreases with increasing temperature, which severely limits further applications. Herein, we proposed that the hole mobility can be efficiently enhanced via slight compressive shear strain along the [100] direction, while the improvement via shear strain along the [111] direction is marginal. This impressive distinction is attributed to the deformation potential and the elastic compliance matrix. The shear strain breaks the symmetry of the crystalline structure and lifts the band degeneracy near the valence band edge, resulting in a significant suppression of interband electron-phonon scattering. Moreover, the hole mobility becomes less temperature-dependent due to the decrease of electron scatterings from high-frequency acoustic phonons. Remarkably, the in-plane hole mobility of diamond is increased by approximately 800% at 800 K with a 2% compressive shear strain along the [100] direction. The efficient shear strain strategy can be further extended to other semiconductors with face-centered cubic geometry. △ Less

Submitted 25 October, 2024; originally announced October 2024.

Comments: 7 pages, 4 figures

arXiv:2410.19016 [pdf, other]

doi 10.1088/1361-6471/adb900

Neutrinoless Double Beta Decay Sensitivity of the XLZD Rare Event Observatory

Authors: XLZD Collaboration, J. Aalbers, K. Abe, M. Adrover, S. Ahmed Maouloud, D. S. Akerib, A. K. Al Musalhi, F. Alder, L. Althueser, D. W. P. Amaral, C. S. Amarasinghe, A. Ames, B. Andrieu, N. Angelides, E. Angelino, B. Antunovic, E. Aprile, H. M. Araújo, J. E. Armstrong, M. Arthurs, M. Babicz, D. Bajpai, A. Baker, M. Balzer, J. Bang , et al. (419 additional authors not shown)

Abstract: The XLZD collaboration is developing a two-phase xenon time projection chamber with an active mass of 60 to 80 t capable of probing the remaining WIMP-nucleon interaction parameter space down to the so-called neutrino fog. In this work we show that, based on the performance of currently operating detectors using the same technology and a realistic reduction of radioactivity in detector materials,… ▽ More The XLZD collaboration is developing a two-phase xenon time projection chamber with an active mass of 60 to 80 t capable of probing the remaining WIMP-nucleon interaction parameter space down to the so-called neutrino fog. In this work we show that, based on the performance of currently operating detectors using the same technology and a realistic reduction of radioactivity in detector materials, such an experiment will also be able to competitively search for neutrinoless double beta decay in $^{136}$Xe using a natural-abundance xenon target. XLZD can reach a 3$σ$ discovery potential half-life of 5.7$\times$10$^{27}$ yr (and a 90% CL exclusion of 1.3$\times$10$^{28}$ yr) with 10 years of data taking, corresponding to a Majorana mass range of 7.3-31.3 meV (4.8-20.5 meV). XLZD will thus exclude the inverted neutrino mass ordering parameter space and will start to probe the normal ordering region for most of the nuclear matrix elements commonly considered by the community. △ Less

Submitted 30 April, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

Comments: 29 pages, 7 figures

Journal ref: J. Phys. G: Nucl. Part. Phys. 52 (2025) 045102

arXiv:2410.17137 [pdf, other]

The XLZD Design Book: Towards the Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics

Authors: XLZD Collaboration, J. Aalbers, K. Abe, M. Adrover, S. Ahmed Maouloud, D. S. Akerib, A. K. Al Musalhi, F. Alder, L. Althueser, D. W. P. Amaral, C. S. Amarasinghe, A. Ames, B. Andrieu, N. Angelides, E. Angelino, B. Antunovic, E. Aprile, H. M. Araújo, J. E. Armstrong, M. Arthurs, M. Babicz, A. Baker, M. Balzer, J. Bang, E. Barberio , et al. (419 additional authors not shown)

Abstract: This report describes the experimental strategy and technologies for XLZD, the next-generation xenon observatory sensitive to dark matter and neutrino physics. In the baseline design, the detector will have an active liquid xenon target of 60 tonnes, which could be increased to 80 tonnes if the market conditions for xenon are favorable. It is based on the mature liquid xenon time projection chambe… ▽ More This report describes the experimental strategy and technologies for XLZD, the next-generation xenon observatory sensitive to dark matter and neutrino physics. In the baseline design, the detector will have an active liquid xenon target of 60 tonnes, which could be increased to 80 tonnes if the market conditions for xenon are favorable. It is based on the mature liquid xenon time projection chamber technology used in current-generation experiments, LZ and XENONnT. The report discusses the baseline design and opportunities for further optimization of the individual detector components. The experiment envisaged here has the capability to explore parameter space for Weakly Interacting Massive Particle (WIMP) dark matter down to the neutrino fog, with a 3$σ$ evidence potential for WIMP-nucleon cross sections as low as $3\times10^{-49}\rm\,cm^2$ (at 40 GeV/c$^2$ WIMP mass). The observatory will also have leading sensitivity to a wide range of alternative dark matter models. It is projected to have a 3$σ$ observation potential of neutrinoless double beta decay of $^{136}$Xe at a half-life of up to $5.7\times 10^{27}$ years. Additionally, it is sensitive to astrophysical neutrinos from the sun and galactic supernovae. △ Less

Submitted 14 April, 2025; v1 submitted 22 October, 2024; originally announced October 2024.

Comments: 33 pages, 14 figures

arXiv:2410.17036 [pdf, other]

Dark Matter Search Results from 4.2 Tonne-Years of Exposure of the LUX-ZEPLIN (LZ) Experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, D. Bauer, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger , et al. (193 additional authors not shown)

Abstract: We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from 280 live days of LZ operation, of which $3.3\pm0.1$ tonne-years and 220 live days are new. A technique to actively tag background electronic recoils… ▽ More We report results of a search for nuclear recoils induced by weakly interacting massive particle (WIMP) dark matter using the LUX-ZEPLIN (LZ) two-phase xenon time projection chamber. This analysis uses a total exposure of $4.2\pm0.1$ tonne-years from 280 live days of LZ operation, of which $3.3\pm0.1$ tonne-years and 220 live days are new. A technique to actively tag background electronic recoils from $^{214}$Pb $β$ decays is featured for the first time. Enhanced electron-ion recombination is observed in two-neutrino double electron capture decays of $^{124}$Xe, representing a noteworthy new background. After removal of artificial signal-like events injected into the data set to mitigate analyzer bias, we find no evidence for an excess over expected backgrounds. World-leading constraints are placed on spin-independent (SI) and spin-dependent WIMP-nucleon cross sections for masses $\geq$9 GeV/$c^2$. The strongest SI exclusion set is $2.1\times10^{-48}$ cm$^{2}$ at the 90% confidence level at a mass of 36 GeV/$c^2$, and the best SI median sensitivity achieved is $5.0\times10^{-48}$ cm$^{2}$ for a mass of 40 GeV/$c^2$. △ Less

Submitted 3 November, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

Comments: 9 pages, 7 figures. See https://www.hepdata.net/record/155182 for a data release related to this paper

arXiv:2408.17391 [pdf, other]

doi 10.1088/1361-6471/ad9039

Two-neutrino double electron capture of $^{124}$Xe in the first LUX-ZEPLIN exposure

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, E. E. Barillier, K. Beattie, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer, C. A. J. Brew , et al. (180 additional authors not shown)

Abstract: The broad physics reach of the LUX-ZEPLIN (LZ) experiment covers rare phenomena beyond the direct detection of dark matter. We report precise measurements of the extremely rare decay of $^{124}$Xe through the process of two-neutrino double electron capture (2$ν$2EC), utilizing a $1.39\,\mathrm{kg} \times \mathrm{yr}$ isotopic exposure from the first LZ science run. A half-life of… ▽ More The broad physics reach of the LUX-ZEPLIN (LZ) experiment covers rare phenomena beyond the direct detection of dark matter. We report precise measurements of the extremely rare decay of $^{124}$Xe through the process of two-neutrino double electron capture (2$ν$2EC), utilizing a $1.39\,\mathrm{kg} \times \mathrm{yr}$ isotopic exposure from the first LZ science run. A half-life of $T_{1/2}^{2\nu2\mathrm{EC}} = (1.09 \pm 0.14_{\text{stat}} \pm 0.05_{\text{sys}}) \times 10^{22}\,\mathrm{yr}$ is observed with a statistical significance of $8.3\,σ$, in agreement with literature. First empirical measurements of the KK capture fraction relative to other K-shell modes were conducted, and demonstrate consistency with respect to recent signal models at the $1.4\,σ$ level. △ Less

Submitted 7 December, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

Comments: 15 pages, 3 figures

Journal ref: J. Phys. G: Nucl. Part. Phys. 52 015103 (2025)

arXiv:2408.09398 [pdf, other]

Quantitative uniform exponential acceleration of averages along decaying waves

Authors: Zhicheng Tong, Yong Li

Abstract: In this study, utilizing a specific exponential weighting function, we investigate the uniform exponential convergence of weighted Birkhoff averages along decaying waves and delve into several related variants. A key distinction from traditional scenarios is evident here: despite reduced regularity in observables, our method still maintains exponential convergence. In particular, we develop new te… ▽ More In this study, utilizing a specific exponential weighting function, we investigate the uniform exponential convergence of weighted Birkhoff averages along decaying waves and delve into several related variants. A key distinction from traditional scenarios is evident here: despite reduced regularity in observables, our method still maintains exponential convergence. In particular, we develop new techniques that yield very precise rates of exponential convergence, as evidenced by numerical simulations. Furthermore, this innovative approach extends to quantitative analyses involving different weighting functions employed by others, surpassing the limitations inherent in prior research. It also enhances the exponential convergence rates of weighted Birkhoff averages along quasi-periodic orbits via analytic observables. To the best of our knowledge, this is the first result on the uniform exponential acceleration beyond averages along quasi-periodic or almost periodic orbits, particularly from a quantitative perspective. △ Less

Submitted 22 March, 2025; v1 submitted 18 August, 2024; originally announced August 2024.

Comments: 30 pages, 2 figures. Accepted in Izv. Math

MSC Class: 37A25; 37A30; 37A46

arXiv:2407.05718 [pdf, other]

A Factuality and Diversity Reconciled Decoding Method for Knowledge-Grounded Dialogue Generation

Authors: Chenxu Yang, Zheng Lin, Chong Tian, Liang Pang, Lanrui Wang, Zhengyang Tong, Qirong Ho, Yanan Cao, Weiping Wang

Abstract: Grounding external knowledge can enhance the factuality of responses in dialogue generation. However, excessive emphasis on it might result in the lack of engaging and diverse expressions. Through the introduction of randomness in sampling, current approaches can increase the diversity. Nevertheless, such sampling method could undermine the factuality in dialogue generation. In this study, to disc… ▽ More Grounding external knowledge can enhance the factuality of responses in dialogue generation. However, excessive emphasis on it might result in the lack of engaging and diverse expressions. Through the introduction of randomness in sampling, current approaches can increase the diversity. Nevertheless, such sampling method could undermine the factuality in dialogue generation. In this study, to discover a solution for advancing creativity without relying on questionable randomness and to subtly reconcile the factuality and diversity within the source-grounded paradigm, a novel method named DoGe is proposed. DoGe can dynamically alternate between the utilization of internal parameter knowledge and external source knowledge based on the model's factual confidence. Extensive experiments on three widely-used datasets show that DoGe can not only enhance response diversity but also maintain factuality, and it significantly surpasses other various decoding strategy baselines. △ Less

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.04842 [pdf, other]

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Authors: Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao

Abstract: While text-to-image models like DALLE-3 and Stable Diffusion are rapidly proliferating, they often encounter challenges such as hallucination, bias, and the production of unsafe, low-quality output. To effectively address these issues, it is crucial to align these models with desired behaviors based on feedback from a multimodal judge. Despite their significance, current multimodal judges frequent… ▽ More While text-to-image models like DALLE-3 and Stable Diffusion are rapidly proliferating, they often encounter challenges such as hallucination, bias, and the production of unsafe, low-quality output. To effectively address these issues, it is crucial to align these models with desired behaviors based on feedback from a multimodal judge. Despite their significance, current multimodal judges frequently undergo inadequate evaluation of their capabilities and limitations, potentially leading to misalignment and unsafe fine-tuning outcomes. To address this issue, we introduce MJ-Bench, a novel benchmark which incorporates a comprehensive preference dataset to evaluate multimodal judges in providing feedback for image generation models across four key perspectives: alignment, safety, image quality, and bias. Specifically, we evaluate a large variety of multimodal judges including smaller-sized CLIP-based scoring models, open-source VLMs (e.g. LLaVA family), and close-source VLMs (e.g. GPT-4o, Claude 3) on each decomposed subcategory of our preference dataset. Experiments reveal that close-source VLMs generally provide better feedback, with GPT-4o outperforming other judges in average. Compared with open-source VLMs, smaller-sized scoring models can provide better feedback regarding text-image alignment and image quality, while VLMs provide more accurate feedback regarding safety and generation bias due to their stronger reasoning capabilities. Further studies in feedback scale reveal that VLM judges can generally provide more accurate and stable feedback in natural language (Likert-scale) than numerical scales. Notably, human evaluations on end-to-end fine-tuned models using separate feedback from these multimodal judges provide similar conclusions, further confirming the effectiveness of MJ-Bench. All data, code, models are available at https://huggingface.co/MJ-Bench. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: 42 pages, 13 figures, 33 tables

arXiv:2406.16177 [pdf, other]

Flowy: Supporting UX Design Decisions Through AI-Driven Pattern Annotation in Multi-Screen User Flows

Authors: Yuwen Lu, Ziang Tong, Qinyi Zhao, Yewon Oh, Bryan Wang, Toby Jia-Jun Li

Abstract: Many recent AI-powered UX design tools focus on generating individual static UI screens from natural language. However, they overlook the crucial aspect of interactions and user experiences across multiple screens. Through formative studies with UX professionals, we identified limitations of these tools in supporting realistic UX design workflows. In response, we designed and developed Flowy, an a… ▽ More Many recent AI-powered UX design tools focus on generating individual static UI screens from natural language. However, they overlook the crucial aspect of interactions and user experiences across multiple screens. Through formative studies with UX professionals, we identified limitations of these tools in supporting realistic UX design workflows. In response, we designed and developed Flowy, an app that augments designers' information foraging process in ideation by supplementing specific user flow examples with distilled design pattern knowledge. Flowy utilizes large multimodal AI models and a high-quality user flow dataset to help designers identify and understand relevant abstract design patterns in the design space for multi-screen user flows. Our user study with professional UX designers demonstrates how Flowy supports realistic UX tasks. Our design considerations in Flowy, such as representations with appropriate levels of abstraction and assisted navigation through the solution space, are generalizable to other creative tasks and embody a human-centered, intelligence augmentation approach to using AI in UX design. △ Less

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.13078 [pdf]

A universal bioluminescence tomography system for pre-clinical image-guided radiotherapy research

Authors: Zhishen Tong, Zijian Deng, Xiangkun Xu, Ciara Newman, Xun Jia, Yuncheng Zhong, Merle Reinhart, Paul Tsouchlos, Tim Devling, Hamid Dehghani, Iulian Iordachita, Debabrata Saha, James Kim, John W. Wong, Ken Kang-Hsin Wang

Abstract: CBCT-guided small animal irradiators encounter challenges in localizing soft-tissue targets due to low imaging contrast. Bioluminescence tomography (BLT) offers a promising solution, but they have largely remained in laboratorial development, limiting accessibility for researchers. In this work, we develop a universal, commercial-graded BLT-guided system (MuriGlo) designed to seamlessly integrate… ▽ More CBCT-guided small animal irradiators encounter challenges in localizing soft-tissue targets due to low imaging contrast. Bioluminescence tomography (BLT) offers a promising solution, but they have largely remained in laboratorial development, limiting accessibility for researchers. In this work, we develop a universal, commercial-graded BLT-guided system (MuriGlo) designed to seamlessly integrate with commercial irradiators and empower researchers for translational studies. We demonstrate its capabilities in supporting in vitro and in vivo studies. The MuriGlo comprises detachable mouse bed, thermostatic control, mirrors, filters, and CCD, enabling multi-projection and multi-spectral imaging. We evaluate that the thermostatic control effectively sustains animal temperature at 37°C throughout imaging, and quantify that the system can detect as few as 61 GL261-AkaLuc cells in vitro. To illustrate how the MuriGlo can be utilized for in vivo image-guided research, we present 3 strategies, BLT-guided 5-arc, 2-field box, and BLI-guided single-beam, ranging from complicated high-conformal to simplest high-throughput plans. The high conformal BLT-guided 5-arc plan fully covers the gross tumor volume (GTV) at prescribed dose with minimal normal tissue exposure (3.9%), while the simplified, high-throughput BLT-guided 2-field box achieves 100% GTV coverage but results in higher normal tissue exposure (13.1%). Moreover, we demonstrate that the localization accuracy of MuriGlo for both widely-used SARRP and SmART irradiators is within1 mm, and the tumor coverage reaches over 97% with 0.75mm margin. The universal BLT-guided system offers seamless integration with commercial irradiators, achieving comparable localization accuracy, expected to supporting high-precision radiation research. △ Less

Submitted 26 March, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.12874 [pdf, other]

doi 10.1088/1748-0221/19/08/P08027

The Design, Implementation, and Performance of the LZ Calibration Systems

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer , et al. (179 additional authors not shown)

Abstract: LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low e… ▽ More LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low energy nuclear recoils. Surrounding the TPC, two veto detectors immersed in an ultra-pure water tank enable reducing background events to enhance the discovery potential. Intricate calibration systems are purposely designed to precisely understand the responses of these three detector volumes to various types of particle interactions and to demonstrate LZ's ability to discriminate between signals and backgrounds. In this paper, we present a comprehensive discussion of the key features, requirements, and performance of the LZ calibration systems, which play a crucial role in enabling LZ's WIMP-search and its broad science program. The thorough description of these calibration systems, with an emphasis on their novel aspects, is valuable for future calibration efforts in direct dark matter and other rare-event search experiments. △ Less

Submitted 5 September, 2024; v1 submitted 2 May, 2024; originally announced June 2024.

Journal ref: JINST 19 P08027 (2024)

arXiv:2406.12187 [pdf, other]

Diverse Responses in Lattice Thermal Conductivity of $n$-type/$p$-type Semiconductors Driven by Asymmetric Electron-Phonon Interactions

Authors: Jianshi Sun, Shouhang Li, Zhen Tong, Cheng Shao, Han Xie, Meng An, Chuang Zhang, Xiongfei Zhu, Chen Huang, Yucheng Xiong, Xiangjun Liu

Abstract: Accurately assessing the impact of electron-phonon interaction (EPI) on the lattice thermal conductivity of semiconductors is crucial for the thermal management of electronic devices and a unified physical understanding of this issue is highly desired. In this work, we predict the lattice thermal conductivities of typical direct and indirect bandgap semiconductors accounting for EPI based on mode-… ▽ More Accurately assessing the impact of electron-phonon interaction (EPI) on the lattice thermal conductivity of semiconductors is crucial for the thermal management of electronic devices and a unified physical understanding of this issue is highly desired. In this work, we predict the lattice thermal conductivities of typical direct and indirect bandgap semiconductors accounting for EPI based on mode-level first-principles calculations. It is found that EPI has a larger effect on the lattice thermal conductivity of $p$-type doping compared to $n$-type doping in the same semiconductor at high charge carrier concentrations. The stronger EPI in $p$-type doping is attributed to the relatively higher electron density of states caused by the relatively larger $p$-orbital component. Furthermore, EPI has a stronger influence on the lattice thermal conductivity of $n$-type indirect bandgap semiconductors than $n$-type direct bandgap semiconductors. This is attributed to the relatively lower electron density of states in direct bandgap semiconductors stemming from the $s$-orbital component. This work reveals that there exist diverse responses in lattice thermal conductivity of $n$-type/$p$-type semiconductors, which can be attributed to asymmetric EPIs. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 8 pages,5 figures

arXiv:2406.02874 [pdf, other]

Giant enhancement of hole mobility for 4H-silicon carbide through suppressing interband electron-phonon scattering

Authors: Jianshi Sun, Shouhang Li, Zhen Tong, Cheng Shao, Meng An, Xiongfei Zhu, Chuang Zhang, Xiangchuan Chen, Yucheng Xiong, Thomas Frauenheim, Xiangjun Liu

Abstract: 4H-Silicon Carbide (4H-SiC) possesses a high Baliga figure of merit, making it a promising material for power electronics. However, its applications are limited by its low hole mobility. Herein, we found that the hole mobility of 4H-SiC is mainly limited by the strong interband electron-phonon scattering using mode-level first-principles calculations. Our research indicates that applying compressi… ▽ More 4H-Silicon Carbide (4H-SiC) possesses a high Baliga figure of merit, making it a promising material for power electronics. However, its applications are limited by its low hole mobility. Herein, we found that the hole mobility of 4H-SiC is mainly limited by the strong interband electron-phonon scattering using mode-level first-principles calculations. Our research indicates that applying compressive strain can reverse the sign of crystal-field splitting and change the ordering of electron bands close to the valence band maximum. Therefore, the interband electron-phonon scattering is severely suppressed, and the out-of-plane hole mobility of 4H-SiC can be enhanced by 200% with 2% uniaxial compressive strain applied. This work provides new insights into the electron transport mechanisms in semiconductors and suggests a strategy to improve hole mobility that could be applied to other semiconductors with hexagonal crystalline geometries. △ Less

Submitted 20 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

Comments: 22 pages, 4 figures

arXiv:2406.02441 [pdf, other]

doi 10.1038/s42005-024-01774-8

Probing the Scalar WIMP-Pion Coupling with the first LUX-ZEPLIN data

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. J. Bishop, G. M. Blockinger, B. Boxer , et al. (178 additional authors not shown)

Abstract: Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5~tonne fiducial mass of liquid xenon, we repor… ▽ More Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5~tonne fiducial mass of liquid xenon, we report the results on a search for WIMP-pion interactions. We observe no significant excess and set an upper limit of $1.5\times10^{-46}$~cm$^2$ at a 90\% confidence level for a WIMP mass of 33~GeV/c$^2$ for this interaction. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Journal ref: Commun Phys 7, 292 (2024)

arXiv:2405.18910 [pdf, other]

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Authors: Huaiwu Zhang, Yutong Xia, Siru Zhong, Kun Wang, Zekun Tong, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

Abstract: The increasing number of vehicles highlights the need for efficient parking space management. Predicting real-time Parking Availability (PA) can help mitigate traffic congestion and the corresponding social problems, which is a pressing issue in densely populated cities like Singapore. In this study, we aim to collectively predict future PA across Singapore with complex factors from various domain… ▽ More The increasing number of vehicles highlights the need for efficient parking space management. Predicting real-time Parking Availability (PA) can help mitigate traffic congestion and the corresponding social problems, which is a pressing issue in densely populated cities like Singapore. In this study, we aim to collectively predict future PA across Singapore with complex factors from various domains. The contributions in this paper are listed as follows: (1) A New Dataset: We introduce the \texttt{SINPA} dataset, containing a year's worth of PA data from 1,687 parking lots in Singapore, enriched with various spatial and temporal factors. (2) A Data-Driven Approach: We present DeepPA, a novel deep-learning framework, to collectively and efficiently predict future PA across thousands of parking lots. (3) Extensive Experiments and Deployment: DeepPA demonstrates a 9.2% reduction in prediction error for up to 3-hour forecasts compared to existing advanced models. Furthermore, we implement DeepPA in a practical web-based platform to provide real-time PA predictions to aid drivers and inform urban planning for the governors in Singapore. We release the dataset and source code at https://github.com/yoshall/SINPA. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: Accepted by IJCAI 2024 (Multi-Year Track On AI And Social Good with ~20% acceptance rate)

arXiv:2405.14732 [pdf, other]

The Data Acquisition System of the LZ Dark Matter Detector: FADR

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer , et al. (191 additional authors not shown)

Abstract: The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals.… ▽ More The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals. This information is used to determine if the digitized waveforms should be preserved for offline analysis. The system is designed around the Kintex-7 FPGA. In addition to digitizing the PMT signals and providing basic event selection in real time, the flexibility provided by the use of FPGAs allows us to monitor the performance of the detector and the DAQ in parallel to normal data acquisition. The hardware and software/firmware of this FPGA-based Architecture for Data acquisition and Realtime monitoring (FADR) are discussed and performance measurements are described. △ Less

Submitted 16 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: 18 pages, 24 figures

arXiv:2405.02866 [pdf, other]

Universal exponential pointwise convergence for weighted multiple ergodic averages over $ \mathbb{T}^\infty $

Authors: Zhicheng Tong, Yong Li

Abstract: By employing an accelerated weighting method, we establish arbitrary polynomial and exponential pointwise convergence for multiple ergodic averages under general conditions in both discrete and continuous settings, involving quasi-periodic and almost periodic cases, which breaks the well known slow convergence rate observed in classical ergodic theory. We also present joint Diophantine rotations a… ▽ More By employing an accelerated weighting method, we establish arbitrary polynomial and exponential pointwise convergence for multiple ergodic averages under general conditions in both discrete and continuous settings, involving quasi-periodic and almost periodic cases, which breaks the well known slow convergence rate observed in classical ergodic theory. We also present joint Diophantine rotations as explicit applications. Especially, in the sense that excluding nearly rational rotations with zero measure, we demonstrate that the pointwise exponential convergence is universal via analytic observables, even when multiplicatively averaging over the infinite-dimensional torus $ \mathbb{T}^\infty $, utilizing a novel truncated approach. Moreover, by constructing counterexamples concerning with multiple ergodicity, we highlight the irremovability of the joint nonresonance and establish the optimality of our weighting method in preserving rapid convergence. We also provide numerical simulations with analysis to further illustrate our results. △ Less

Submitted 31 July, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

Comments: 36pages. Comments are welcome!

MSC Class: 37A25; 37A30; 37A46

arXiv:2405.01864

Full-dimensional KAM torus with frequency-preserving in infinite-dimensional Hamiltonian systems

Authors: Zhicheng Tong, Yong Li

Abstract: In this paper, we present two infinite-dimensional KAM theorems with frequency-preserving for a nonresonant frequency of Diophantine type or even weaker. To be more precise, under a nondegenerate condition for an infinite-dimensional Hamiltonian system, we prove the persistence of a full-dimensional KAM torus with the specified frequency independent of any spectral asymptotics, by advantage of the… ▽ More In this paper, we present two infinite-dimensional KAM theorems with frequency-preserving for a nonresonant frequency of Diophantine type or even weaker. To be more precise, under a nondegenerate condition for an infinite-dimensional Hamiltonian system, we prove the persistence of a full-dimensional KAM torus with the specified frequency independent of any spectral asymptotics, by advantage of the generating function method. This appears to be the first Kolmogorov type result in the infinite-dimensional context. As a direct application, we provide a positive answer to Bourgain's conjecture: full-dimensional invariant tori for 1D nonlinear Schrödinger equations do exist. △ Less

Submitted 6 October, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Comments: We found that there seemed to be some problems with the validation of the hypothesis, and we are trying to fix them.

MSC Class: 37K55; 35Q55

arXiv:2404.17666 [pdf, other]

doi 10.1103/PhysRevLett.133.221801

Constraints On Covariant WIMP-Nucleon Effective Field Theory Interactions from the First Science Run of the LUX-ZEPLIN Experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, E. E. Barillier, J. W. Bargemann, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. J. Bishop, G. M. Blockinger, B. Boxer , et al. (179 additional authors not shown)

Abstract: The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time project chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described from a non-relativistic effective field theory (NREFT). Using the same 5.5~t fiducial mass and 60 live days of exposure we re… ▽ More The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time project chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described from a non-relativistic effective field theory (NREFT). Using the same 5.5~t fiducial mass and 60 live days of exposure we report on the results of a relativistic extension to the NREFT. We present constraints on couplings from covariant interactions arising from the coupling of vector, axial currents, and electric dipole moments of the nucleon to the magnetic and electric dipole moments of the WIMP which cannot be described by recasting previous results described by an NREFT. Using a profile-likelihood ratio analysis, in an energy region between 0~keV$_\text{nr}$ to 270~keV$_\text{nr}$, we report 90% confidence level exclusion limits on the coupling strength of five interactions in both the isoscalar and isovector bases. △ Less

Submitted 26 April, 2024; originally announced April 2024.

Comments: 7 pages, 4 figures

Journal ref: Phys. Rev. Lett. 133, 221801 (2024)

arXiv:2404.14464 [pdf, other]

Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering

Authors: Li Jiapeng, Liu Runze, Li Yabo, Zhou Tong, Li Mingling, Chen Xiang

Abstract: Multi-hop question answering is a knowledge-intensive complex problem. Large Language Models (LLMs) use their Chain of Thoughts (CoT) capability to reason complex problems step by step, and retrieval-augmentation can effectively alleviate factual errors caused by outdated and unknown knowledge in LLMs. Recent works have introduced retrieval-augmentation in the CoT reasoning to solve multi-hop ques… ▽ More Multi-hop question answering is a knowledge-intensive complex problem. Large Language Models (LLMs) use their Chain of Thoughts (CoT) capability to reason complex problems step by step, and retrieval-augmentation can effectively alleviate factual errors caused by outdated and unknown knowledge in LLMs. Recent works have introduced retrieval-augmentation in the CoT reasoning to solve multi-hop question answering. However, these chain methods have the following problems: 1) Retrieved irrelevant paragraphs may mislead the reasoning; 2) An error in the chain structure may lead to a cascade of errors. In this paper, we propose a dynamic retrieval framework called Tree of Reviews (ToR), where the root node is the question, and the other nodes are paragraphs from retrieval, extending different reasoning paths from the root node to other nodes. Our framework dynamically decides to initiate a new search, reject, or accept based on the paragraphs on the reasoning paths. Compared to related work, we introduce a tree structure to handle each retrieved paragraph separately, alleviating the misleading effect of irrelevant paragraphs on the reasoning path; the diversity of reasoning path extension reduces the impact of a single reasoning error on the whole. We conducted experiments on three different multi-hop question answering datasets. The results show that compared to the baseline methods, ToR achieves state-of-the-art performance in both retrieval and response generation. In addition, we propose two tree-based search optimization strategies, pruning and effective expansion, to reduce time overhead and increase the diversity of path extension. We will release our code. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: Keywords: Muti-hop Question Answering; Retrieval-Augmented Generation; Tree of Thought; Reasoning TLDR: We proposed a tree-based dynamic, iterative retrieval framework for multi-hop question answering

arXiv:2403.12922 [pdf, other]

Contextual AD Narration with Interleaved Multimodal Sequence

Authors: Hanlin Wang, Zhan Tong, Kecheng Zheng, Yujun Shen, Limin Wang

Abstract: The Audio Description (AD) task aims to generate descriptions of visual elements for visually impaired individuals to help them access long-form video content, like movies. With video feature, text, character bank and context information as inputs, the generated ADs are able to correspond to the characters by name and provide reasonable, contextual descriptions to help audience understand the stor… ▽ More The Audio Description (AD) task aims to generate descriptions of visual elements for visually impaired individuals to help them access long-form video content, like movies. With video feature, text, character bank and context information as inputs, the generated ADs are able to correspond to the characters by name and provide reasonable, contextual descriptions to help audience understand the storyline of movie. To achieve this goal, we propose to leverage pre-trained foundation models through a simple and unified framework to generate ADs with interleaved multimodal sequence as input, termed as Uni-AD. To enhance the alignment of features across various modalities with finer granularity, we introduce a simple and lightweight module that maps video features into the textual feature space. Moreover, we also propose a character-refinement module to provide more precise information by identifying the main characters who play more significant roles in the video context. With these unique designs, we further incorporate contextual information and a contrastive loss into our architecture to generate smoother and more contextually appropriate ADs. Experiments on multiple AD datasets show that Uni-AD performs well on AD generation, which demonstrates the effectiveness of our approach. Our code is available at: https://github.com/ant-research/UniAD. △ Less

Submitted 15 April, 2025; v1 submitted 19 March, 2024; originally announced March 2024.

Comments: Accepted by CVPR25

arXiv:2402.08865 [pdf, other]

doi 10.1103/PhysRevD.109.112010

New constraints on ultraheavy dark matter from the LZ experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, A. Baxter, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer, C. A. J. Brew , et al. (174 additional authors not shown)

Abstract: Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal f… ▽ More Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$. △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: 9 pages, 7 figures

Journal ref: Phys. Rev. D 109, 112010 (2024)

arXiv:2401.02133 [pdf, other]

Weak effects of electron-phonon interactions on the lattice thermal conductivity of wurtzite GaN with high electron concentrations

Authors: Jianshi Sun, Shouhang Li, Zhen Tong, Cheng Shao, Xiangchuan Chen, Qianqian Liu, Yucheng Xiong, Meng An, Xiangjun Liu

Abstract: Wurtzite gallium nitride (GaN) has great potential for high-frequency and high-power applications due to its excellent electrical and thermal transport properties. However, enhancing the performance of GaN-based power electronics relies on heavy doping. Previous studies showed that electron-phonon interactions have strong effects on the lattice thermal conductivity of GaN due to the Fröhlich inter… ▽ More Wurtzite gallium nitride (GaN) has great potential for high-frequency and high-power applications due to its excellent electrical and thermal transport properties. However, enhancing the performance of GaN-based power electronics relies on heavy doping. Previous studies showed that electron-phonon interactions have strong effects on the lattice thermal conductivity of GaN due to the Fröhlich interaction. Surprisingly, our investigation reveals weak effects of electron-phonon interactions on the lattice thermal conductivity of n-type GaN at ultra-high electron concentrations and the impact of the Fröhlich interaction can be ignored. The small phonon-electron scattering rate is attributed to the limited scattering channels, quantified by the Fermi surface nesting function. In contrast, there is a significant reduction in the lattice thermal conductivity of p-type GaN at high hole concentrations due to the relatively larger Fermi surface nesting function. Meanwhile, as p-type GaN has relatively smaller electron-phonon matrix elements, the reduction in lattice thermal conductivity is still weaker than that observed in p-type silicon. Our work provides a deep understanding of thermal transport in doped GaN and the conclusions can be further extended to other wide-bandgap semiconductors, including $β$-Ga2O3, AlN, and ZnO. △ Less

Submitted 5 May, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

arXiv:2312.14149 [pdf, other]

TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification

Authors: Qinying Liu, Wei Wu, Kecheng Zheng, Zhan Tong, Jiawei Liu, Yu Liu, Wei Chen, Zilei Wang, Yujun Shen

Abstract: The crux of learning vision-language models is to extract semantically aligned information from visual and linguistic data. Existing attempts usually face the problem of coarse alignment, e.g., the vision encoder struggles in localizing an attribute-specified object. In this work, we propose an embarrassingly simple approach to better align image and text features with no need of additional data f… ▽ More The crux of learning vision-language models is to extract semantically aligned information from visual and linguistic data. Existing attempts usually face the problem of coarse alignment, e.g., the vision encoder struggles in localizing an attribute-specified object. In this work, we propose an embarrassingly simple approach to better align image and text features with no need of additional data formats other than image-text pairs. Concretely, given an image and its paired text, we manage to parse objects (e.g., cat) and attributes (e.g., black) from the description, which are highly likely to exist in the image. It is noteworthy that the parsing pipeline is fully automatic and thus enjoys good scalability. With these parsed semantics as supervision signals, we can complement the commonly used image-text contrastive loss with the multi-tag classification loss. Extensive experimental results on a broad suite of semantic segmentation datasets substantiate the average 5.2\% improvement of our framework over existing alternatives. Furthermore, the visualization results indicate that attribute supervision makes vision-language models accurately localize attribute-specified objects. Project page can be found at https://qinying-liu.github.io/Tag-Align. △ Less

Submitted 26 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.02030 [pdf, other]

doi 10.1103/PhysRevD.109.092003

First Constraints on WIMP-Nucleon Effective Field Theory Couplings in an Extended Energy Region From LUX-ZEPLIN

Authors: LZ Collaboration, J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, A. Baxter, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger , et al. (175 additional authors not shown)

Abstract: Following the first science results of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating from the Sanford Underground Research Facility in Lead, South Dakota, USA, we report the initial limits on a model-independent non-relativistic effective field theory describing the complete set of possible interactions of a weakly interacting massive particle (WIMP) with a n… ▽ More Following the first science results of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating from the Sanford Underground Research Facility in Lead, South Dakota, USA, we report the initial limits on a model-independent non-relativistic effective field theory describing the complete set of possible interactions of a weakly interacting massive particle (WIMP) with a nucleon. These results utilize the same 5.5 t fiducial mass and 60 live days of exposure collected for the LZ spin-independent and spin-dependent analyses while extending the upper limit of the energy region of interest by a factor of 7.5 to 270 keVnr. No significant excess in this high energy region is observed. Using a profile-likelihood ratio analysis, we report 90% confidence level exclusion limits on the coupling of each individual non-relativistic WIMP-nucleon operator for both elastic and inelastic interactions in the isoscalar and isovector bases. △ Less

Submitted 26 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: 17 pages 11 figures

Journal ref: Phys. Rev. D 109, 092003 (2024)

arXiv:2312.01987 [pdf, other]

Bootstrapping SparseFormers from Vision Foundation Models

Authors: Ziteng Gao, Zhan Tong, Kevin Qinghong Lin, Joya Chen, Mike Zheng Shou

Abstract: The recently proposed SparseFormer architecture provides an alternative approach to visual understanding by utilizing a significantly lower number of visual tokens via adjusting RoIs, greatly reducing computational costs while still achieving promising performance. However, training SparseFormers from scratch is still expensive, and scaling up the number of parameters can be challenging. In this p… ▽ More The recently proposed SparseFormer architecture provides an alternative approach to visual understanding by utilizing a significantly lower number of visual tokens via adjusting RoIs, greatly reducing computational costs while still achieving promising performance. However, training SparseFormers from scratch is still expensive, and scaling up the number of parameters can be challenging. In this paper, we propose to bootstrap SparseFormers from ViT-based vision foundation models in a simple and efficient way. Since the majority of SparseFormer blocks are the standard transformer ones, we can inherit weights from large-scale pre-trained vision transformers and freeze them as much as possible. Therefore, we only need to train the SparseFormer-specific lightweight focusing transformer to adjust token RoIs and fine-tune a few early pre-trained blocks to align the final token representation. In such a way, we can bootstrap SparseFormer architectures from various large-scale pre-trained models (e.g., IN-21K pre-trained AugRegs or CLIPs) using a rather smaller amount of training samples (e.g., IN-1K) and without labels or captions within just a few hours. As a result, the bootstrapped unimodal SparseFormer (from AugReg-ViT-L/16-384) can reach 84.9% accuracy on IN-1K with only 49 tokens, and the multimodal SparseFormer from CLIPs also demonstrates notable zero-shot performance with highly reduced computational cost without seeing any caption during the bootstrapping procedure. In addition, CLIP-bootstrapped SparseFormers, which align the output space with language without seeing a word, can serve as efficient vision encoders in multimodal large language models. Code and models are available at https://github.com/showlab/sparseformer △ Less

Submitted 4 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: CVPR 2024

arXiv:2311.15157 [pdf, other]

Advancing Vision Transformers with Group-Mix Attention

Authors: Chongjian Ge, Xiaohan Ding, Zhan Tong, Li Yuan, Jiangliu Wang, Yibing Song, Ping Luo

Abstract: Vision Transformers (ViTs) have been shown to enhance visual recognition through modeling long-range dependencies with multi-head self-attention (MHSA), which is typically formulated as Query-Key-Value computation. However, the attention map generated from the Query and Key captures only token-to-token correlations at one single granularity. In this paper, we argue that self-attention should have… ▽ More Vision Transformers (ViTs) have been shown to enhance visual recognition through modeling long-range dependencies with multi-head self-attention (MHSA), which is typically formulated as Query-Key-Value computation. However, the attention map generated from the Query and Key captures only token-to-token correlations at one single granularity. In this paper, we argue that self-attention should have a more comprehensive mechanism to capture correlations among tokens and groups (i.e., multiple adjacent tokens) for higher representational capacity. Thereby, we propose Group-Mix Attention (GMA) as an advanced replacement for traditional self-attention, which can simultaneously capture token-to-token, token-to-group, and group-to-group correlations with various group sizes. To this end, GMA splits the Query, Key, and Value into segments uniformly and performs different group aggregations to generate group proxies. The attention map is computed based on the mixtures of tokens and group proxies and used to re-combine the tokens and groups in Value. Based on GMA, we introduce a powerful backbone, namely GroupMixFormer, which achieves state-of-the-art performance in image classification, object detection, and semantic segmentation with fewer parameters than existing models. For instance, GroupMixFormer-L (with 70.3M parameters and 384^2 input) attains 86.2% Top-1 accuracy on ImageNet-1K without external data, while GroupMixFormer-B (with 45.8M parameters) attains 51.2% mIoU on ADE20K. △ Less

Submitted 25 November, 2023; originally announced November 2023.

arXiv:2310.15455 [pdf, other]

UI Layout Generation with LLMs Guided by UI Grammar

Authors: Yuwen Lu, Ziang Tong, Qinyi Zhao, Chengzhi Zhang, Toby Jia-Jun Li

Abstract: The recent advances in Large Language Models (LLMs) have stimulated interest among researchers and industry professionals, particularly in their application to tasks concerning mobile user interfaces (UIs). This position paper investigates the use of LLMs for UI layout generation. Central to our exploration is the introduction of UI grammar -- a novel approach we proposed to represent the hierarch… ▽ More The recent advances in Large Language Models (LLMs) have stimulated interest among researchers and industry professionals, particularly in their application to tasks concerning mobile user interfaces (UIs). This position paper investigates the use of LLMs for UI layout generation. Central to our exploration is the introduction of UI grammar -- a novel approach we proposed to represent the hierarchical structure inherent in UI screens. The aim of this approach is to guide the generative capacities of LLMs more effectively and improve the explainability and controllability of the process. Initial experiments conducted with GPT-4 showed the promising capability of LLMs to produce high-quality user interfaces via in-context learning. Furthermore, our preliminary comparative study suggested the potential of the grammar-based approach in improving the quality of generative results in specific aspects. △ Less

Submitted 23 October, 2023; originally announced October 2023.

Comments: ICML 2023 Workshop on AI and HCI

arXiv:2309.16260 [pdf, other]

doi 10.1002/adma.202309356

A gate-tunable quantum phase transition in a topological excitonic insulator

Authors: Yande Que, Yang-Hao Chan, Junxiang Jia, Anirban Das, Zhengjue Tong, Yu-Tzu Chang, Zhenhao Cui, Amit Kumar, Gagandeep Singh, Hsin Lin, Shantanu Mukherjee, Bent Weber

Abstract: Coulomb interactions among electrons and holes in two-dimensional (2D) semimetals with overlapping valence and conduction bands can give rise to a correlated insulating ground state via exciton formation and condensation. One candidate material in which such excitonic state uniquely combines with non-trivial band topology are atomic monolayers of tungsten ditelluride (WTe2), in which a 2D topologi… ▽ More Coulomb interactions among electrons and holes in two-dimensional (2D) semimetals with overlapping valence and conduction bands can give rise to a correlated insulating ground state via exciton formation and condensation. One candidate material in which such excitonic state uniquely combines with non-trivial band topology are atomic monolayers of tungsten ditelluride (WTe2), in which a 2D topological excitonic insulator (2D TEI) forms. However, the detailed mechanism of the 2D bulk gap formation in WTe2, in particular with regard to the role of Coulomb interactions, has remained a subject of ongoing debate. Here, we show that WTe2 is susceptible to a gate-tunable quantum phase transition, evident from an abrupt collapse of its 2D bulk energy gap upon ambipolar field-effect doping. Such gate tunability of a 2D TEI, into either n- and p-type semimetals, promises novel handles of control over non-trivial 2D superconductivity with excitonic pairing. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: 8 pages, 4 figures, under submission

arXiv:2309.13942 [pdf, other]

Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training

Authors: Jiangliu Wang, Jianbo Jiao, Yibing Song, Stephen James, Zhan Tong, Chongjian Ge, Pieter Abbeel, Yun-hui Liu

Abstract: This work aims to improve unsupervised audio-visual pre-training. Inspired by the efficacy of data augmentation in visual contrastive learning, we propose a novel speed co-augmentation method that randomly changes the playback speeds of both audio and video data. Despite its simplicity, the speed co-augmentation method possesses two compelling attributes: (1) it increases the diversity of audio-vi… ▽ More This work aims to improve unsupervised audio-visual pre-training. Inspired by the efficacy of data augmentation in visual contrastive learning, we propose a novel speed co-augmentation method that randomly changes the playback speeds of both audio and video data. Despite its simplicity, the speed co-augmentation method possesses two compelling attributes: (1) it increases the diversity of audio-visual pairs and doubles the size of negative pairs, resulting in a significant enhancement in the learned representations, and (2) it changes the strict correlation between audio-visual pairs but introduces a partial relationship between the augmented pairs, which is modeled by our proposed SoftInfoNCE loss to further boost the performance. Experimental results show that the proposed method significantly improves the learned representations when compared to vanilla audio-visual contrastive learning. △ Less

Submitted 25 September, 2023; originally announced September 2023.

Comments: Published at the CVPR 2023 Sight and Sound workshop

arXiv:2309.11797 [pdf, other]

A generic approach via relative singularity and controllability: Frequency-preserving with arbitrarily weak regularity in parameterized Hamiltonian systems

Authors: Zhicheng Tong, Yong Li

Abstract: In this paper, we introduce a novel and generic approach to prove the persistence of frequency-preserving invariant tori in parameterized Hamiltonian systems, addressing irregular continuity with respect to parameters. Unlike traditional methods that strongly rely on domain extraction techniques or uniform weak convexity of the frequency mapping, we propose the concepts of relative singularity and… ▽ More In this paper, we introduce a novel and generic approach to prove the persistence of frequency-preserving invariant tori in parameterized Hamiltonian systems, addressing irregular continuity with respect to parameters. Unlike traditional methods that strongly rely on domain extraction techniques or uniform weak convexity of the frequency mapping, we propose the concepts of relative singularity and controllability for the first time. These concepts enable us to deal with a wide range of explicit parameterized Hamiltonian systems with arbitrarily weak regularity, thereby overcoming a previously insurmountable challenge. We also construct several counterexamples to highlight the indispensability of our new conditions in the sense of frequency-preserving. Furthermore, we demonstrate the broad applicability of our results to various cases with explicit arbitrarily weak regularity, including the partial frequency-preserving case and the infinite-dimensional case without any spectral asymptotics. Overall, our approach, based on the concepts of relative singularity and controllability, illustrates its genericity in the frequency-preserving KAM theory. △ Less

Submitted 27 May, 2025; v1 submitted 21 September, 2023; originally announced September 2023.

Comments: 42 pages, 2 figures

MSC Class: 37J40; 70H08; 70K43; 37K55

arXiv:2308.00333 [pdf]

doi 10.1088/1361-6528/acebf7

Performance benchmarking of an ultra-low vibration laboratory to host a commercial millikelvin scanning tunnelling microscope

Authors: Yande Que, Amit Kumar, Michael S. Lodge, Zhengjue Tong, Marcus Lai Kar Fai, Wei Tao, Zhenhao Cui, Ranjith Shivajirao, Junxiang Jia, Siew Eang Lee, Bent Weber

Abstract: Ultra-low temperature scanning tunnelling microscopy and spectroscopy (STM/STS) achieved by dilution refrigeration can provide unrivalled insight into the local electronic structure of quantum materials and atomic-scale quantum systems. Effective isolation from mechanical vibration and acoustic noise is critical in order to achieve ultimate spatial and energy resolution. Here, we report on the des… ▽ More Ultra-low temperature scanning tunnelling microscopy and spectroscopy (STM/STS) achieved by dilution refrigeration can provide unrivalled insight into the local electronic structure of quantum materials and atomic-scale quantum systems. Effective isolation from mechanical vibration and acoustic noise is critical in order to achieve ultimate spatial and energy resolution. Here, we report on the design and performance of an ultra-low vibration (ULV) laboratory hosting a customized but otherwise commercially available 40mK STM. The design of the vibration isolation consists of a T-shaped concrete mass block (55t), suspended by actively controlled pneumatic springs, and placed on a foundation separated from the surrounding building in a "room-within-a-room" design. Vibration levels achieved are meeting the VC-M vibration standard at >3 Hz, reached only in a limited number of laboratories worldwide. Measurement of the STM's junction noise confirms effective vibration isolation on par with custom built STMs in ULV laboratories. In this tailored low-vibration environment, the STM achieves an energy resolution of 43ueV (144 mK), promising for the investigation and control of quantum matter at atomic length scales. △ Less

Submitted 1 August, 2023; originally announced August 2023.

arXiv:2307.15753 [pdf, other]

doi 10.1103/PhysRevD.108.072006

A search for new physics in low-energy electron recoils from the first LZ exposure

Authors: The LZ Collaboration, J. Aalbers, D. S. Akerib, A. K. Al Musalhi, F. Alder, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, A. Baxter, K. Beattie, P. Beltrame, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, G. M. Blockinger , et al. (178 additional authors not shown)

Abstract: The LUX-ZEPLIN (LZ) experiment is a dark matter detector centered on a dual-phase xenon time projection chamber. We report searches for new physics appearing through few-keV-scale electron recoils, using the experiment's first exposure of 60 live days and a fiducial mass of 5.5t. The data are found to be consistent with a background-only hypothesis, and limits are set on models for new physics inc… ▽ More The LUX-ZEPLIN (LZ) experiment is a dark matter detector centered on a dual-phase xenon time projection chamber. We report searches for new physics appearing through few-keV-scale electron recoils, using the experiment's first exposure of 60 live days and a fiducial mass of 5.5t. The data are found to be consistent with a background-only hypothesis, and limits are set on models for new physics including solar axion electron coupling, solar neutrino magnetic moment and millicharge, and electron couplings to galactic axion-like particles and hidden photons. Similar limits are set on weakly interacting massive particle (WIMP) dark matter producing signals through ionized atomic states from the Migdal effect. △ Less

Submitted 9 September, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: 13 pages, 10 figures. See https://tinyurl.com/LZDataReleaseRun1ER for a data release related to this paper

Journal ref: Phys. Rev. D 108, 072006 (2023)

arXiv:2306.08211 [pdf, ps, other]

Towards sharp regularity: Full dimensional tori in $ C^\infty $ vector fields over $ \mathbb{T}^\infty $

Authors: Zhicheng Tong, Yong Li

Abstract: We consider linearization of perturbed vector field $ ω+P $ over infinite dimensional torus $ \mathbb{T}^\infty $ and give sharp regularity requirement for perturbation $ P $ under which there is a nearly identical transformation conjugating the unperturbed one $ ω$ onto $ ω-\tildeω+P $ via a small modifying term $ \tildeω $. Besides discussing the Diophantine type introduced by Bourgain [11], we… ▽ More We consider linearization of perturbed vector field $ ω+P $ over infinite dimensional torus $ \mathbb{T}^\infty $ and give sharp regularity requirement for perturbation $ P $ under which there is a nearly identical transformation conjugating the unperturbed one $ ω$ onto $ ω-\tildeω+P $ via a small modifying term $ \tildeω $. Besides discussing the Diophantine type introduced by Bourgain [11], we also investigate the universal nonresonance and provide weakest regularity of perturbations known so far for which KAM applies. Lower than analyticity, our results allow Gevrey or even only $ C^\infty $, and the new KAM scheme with a balancing sequence to overcome non-polynomial nonresonance is shown to be non-Newtonian that differs from the usual ones. Thereby, except deriving sharp Gevrey exponent along Diophantine nonresonance, we answer the fundamental question of what is the minimum regularity required for KAM in infinite dimensional case. Additionally, our linearization could also be employed to deal with quasi periodic case over $ \mathbb{T}^n $. △ Less

Submitted 6 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

Comments: 28 pages

MSC Class: 37K20; 37K55

arXiv:2305.14895 [pdf, other]

doi 10.1088/1674-4527/acd593

The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. Jin, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan , et al. (101 additional authors not shown)

Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo… ▽ More The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (FoV) of 346 square degrees (18.6 degrees * 18.6 degrees) of the X-ray imager is realized. An optical assembly composed of 36 MPO chips is used to focus incident X-ray photons, and four large-format complementary metal-oxide semiconductor (CMOS) sensors, each of 6 cm * 6 cm, are used as the focal plane detectors. The instrument has an angular resolution of 4 - 8 arcmin (in FWHM) for the central focal spot of the point spread function, and an effective area of 2 - 3 cm2 at 1 keV in essentially all the directions within the field of view. The detection passband is 0.5 - 4 keV in the soft X-rays and the sensitivity is 2 - 3 * 10-11 erg s-1 cm-2 (about 1 mini-Crab) at 1,000 second observation. The total weight of LEIA is 56 kg and the power is 85 W. The satellite, with a design lifetime of 2 years, operates in a Sun-synchronous orbit of 500 km with an orbital period of 95 minutes. LEIA is paving the way for future missions by verifying in flight the technologies of both novel focusing imaging optics and CMOS sensors for X-ray observation, and by optimizing the working setups of the instrumental parameters. In addition, LEIA is able to carry out scientific observations to find new transients and to monitor known sources in the soft X-ray band, albeit limited useful observing time available. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted by RAA

arXiv:2305.14173 [pdf, other]

TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale

Authors: Ziyun Zeng, Yixiao Ge, Zhan Tong, Xihui Liu, Shu-Tao Xia, Ying Shan

Abstract: The ultimate goal for foundation models is realizing task-agnostic, i.e., supporting out-of-the-box usage without task-specific fine-tuning. Although breakthroughs have been made in natural language processing and image representation learning, it is still challenging for video models to reach it due to the increasing uncertainty of spatiotemporal signals. To ease training, existing works leverage… ▽ More The ultimate goal for foundation models is realizing task-agnostic, i.e., supporting out-of-the-box usage without task-specific fine-tuning. Although breakthroughs have been made in natural language processing and image representation learning, it is still challenging for video models to reach it due to the increasing uncertainty of spatiotemporal signals. To ease training, existing works leverage image foundation models' prior knowledge and equip them with efficient temporal modules. Despite the satisfactory fine-tuning performance, we empirically find they fall short of out-of-the-box usage, given the even degraded performance in zero-shot/linear protocols compared to their baseline counterparts. In this work, we analyze the factor that leads to degradation from the perspective of language supervision distortion. We argue that tuning a text encoder end-to-end, as done in previous work, is suboptimal since it may overfit in terms of styles, thereby losing its original generalization ability to capture the semantics of various language registers. The overfitted text encoder, in turn, provides a harmful supervision signal, degrading the video representation. To tackle this issue, we propose a degradation-free pre-training strategy to retain the generalization ability of the text encoder via freezing shallow layers while enabling the task-related semantics capturing in tunable deep layers. As for the training objective, we adopted the transcript sorting task in TVTS incorporated with masking techniques to enable scalable training. As a result, we produce a series of models, dubbed TVTSv2, with up to one billion parameters. We achieve new state-of-the-arts on various video benchmarks with a frozen backbone, surpassing the recent ImageBind, InternVideo, etc. Code is available at https://github.com/TencentARC/TVTS. △ Less

Submitted 23 May, 2023; originally announced May 2023.

Comments: Technical Report

arXiv:2305.07095 [pdf, other]

Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales

Authors: Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi, Xiang Ren

Abstract: Among the remarkable emergent capabilities of large language models (LMs) is free-text rationalization; beyond a certain scale, large LMs are capable of generating seemingly useful rationalizations, which in turn, can dramatically enhance their performances on leaderboards. This phenomenon raises a question: can machine generated rationales also be useful for humans, especially when lay humans try… ▽ More Among the remarkable emergent capabilities of large language models (LMs) is free-text rationalization; beyond a certain scale, large LMs are capable of generating seemingly useful rationalizations, which in turn, can dramatically enhance their performances on leaderboards. This phenomenon raises a question: can machine generated rationales also be useful for humans, especially when lay humans try to answer questions based on those machine rationales? We observe that human utility of existing rationales is far from satisfactory, and expensive to estimate with human studies. Existing metrics like task performance of the LM generating the rationales, or similarity between generated and gold rationales are not good indicators of their human utility. While we observe that certain properties of rationales like conciseness and novelty are correlated with their human utility, estimating them without human involvement is challenging. We show that, by estimating a rationale's helpfulness in answering similar unseen instances, we can measure its human utility to a better extent. We also translate this finding into an automated score, GEN-U, that we propose, which can help improve LMs' ability to generate rationales with better human utility, while maintaining most of its task performance. Lastly, we release all code and collected data with this project. △ Less

Submitted 11 May, 2023; originally announced May 2023.

Comments: Accepted at ACL 2023

arXiv:2304.13838 [pdf]

doi 10.1115/1.4062844

Theoretical Puncture Mechanics of Soft Compressible Solids

Authors: Stefano Fregonese, Zhiyuan Tong, Sibo Wang, Mattia Bacca

Abstract: Accurate prediction of the force required to puncture a soft material is critical in many fields like medical technology, food processing, and manufacturing. However, such a prediction strongly depends on our understanding of the complex nonlinear behavior of the material subject to deep indentation and complex failure mechanisms. Only recently we developed theories capable of correlating puncture… ▽ More Accurate prediction of the force required to puncture a soft material is critical in many fields like medical technology, food processing, and manufacturing. However, such a prediction strongly depends on our understanding of the complex nonlinear behavior of the material subject to deep indentation and complex failure mechanisms. Only recently we developed theories capable of correlating puncture force with material properties and needle geometry. However, such models are based on simplifications that seldom limit their applicability to real cases. One common assumption is the incompressibility of the cut material, albeit no material is truly incompressible. In this paper we propose a simple model that accounts for linearly elastic compressibility, and its interplay with toughness, stiffness, and elastic strain-stiffening. Confirming previous theories and experiments, materials having high-toughness and low-modulus exhibit the highest puncture resistance at a given needle radius. Surprisingly, in these conditions, we observe that incompressible materials exhibit the lowest puncture resistance, where volumetric compressibility can create an additional (strain) energy barrier to puncture. Our model provides a valuable tool to assess the puncture resistance of soft compressible materials and suggests new design strategies for sharp needles and puncture-resistant materials. △ Less

Submitted 26 April, 2023; originally announced April 2023.

Showing 1–50 of 128 results for author: Tong, Z