Search | arXiv e-print repository

Sensitivity of the CUPID experiment to $0νββ$ decay of $^{100}$Mo

Authors: K. Alfonso, A. Armatol, C. Augier, F. T. Avignone III, O. Azzolini, A. S. Barabash, G. Bari, A. Barresi, D. Baudin, F. Bellini, G. Benato, L. Benussi, V. Berest, M. Beretta, L. Bergé, M. Bettelli, M. Biassoni, J. Billard, F. Boffelli, V. Boldrini, E. D. Brandani, C. Brofferio, C. Bucci, M. Buchynska, J. Camilleri , et al. (167 additional authors not shown)

Abstract: CUPID is a next-generation bolometric experiment to search for neutrinoless double-beta decay ($0νββ$) of $^{100}$Mo using Li$_2$MoO$_4$ scintillating crystals. It will operate 1596 crystals at $\sim$10 mK in the CUORE cryostat at the Laboratori Nazionali del Gran Sasso in Italy. Each crystal will be facing two Ge-based bolometric light detectors for $α$ rejection. We compute the discovery and the… ▽ More CUPID is a next-generation bolometric experiment to search for neutrinoless double-beta decay ($0νββ$) of $^{100}$Mo using Li$_2$MoO$_4$ scintillating crystals. It will operate 1596 crystals at $\sim$10 mK in the CUORE cryostat at the Laboratori Nazionali del Gran Sasso in Italy. Each crystal will be facing two Ge-based bolometric light detectors for $α$ rejection. We compute the discovery and the exclusion sensitivity of CUPID to $0νββ$ in a Frequentist and a Bayesian framework. This computation is done numerically based on pseudo-experiments. For the CUPID baseline scenario, with a background and an energy resolution of $1.0 \times 10^{-4}$ counts/keV/kg/yr and 5 keV FWHM at the Q-value, respectively, this results in a Bayesian exclusion sensitivity (90% c.i.) of $\hat{T}_{1/2} > 1.6^{+0.6}_{-0.5} \times 10^{27} \ \mathrm{yr}$, corresponding to the effective Majorana neutrino mass of $\hat{m}_{ββ} < \ 9.6$ -- $16.3 \ \mathrm{meV}$. The Frequentist discovery sensitivity (3$σ$) is $\hat{T}_{1/2}= 1.0 \times 10^{27} \ \mathrm{yr}$, corresponding to $\hat{m}_{ββ}= \ 12.2$ -- $20.6 \ \mathrm{meV}$. △ Less

Submitted 19 April, 2025; originally announced April 2025.

arXiv:2504.13771 [pdf, other]

Search for $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

Abstract: Using data samples of $(10087\pm 44)\times10^{6}$ $J/ψ$ events and $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we search for the CP violating decays $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$. No significant signals are observed over the expected background yields. The upper limits on their branchin… ▽ More Using data samples of $(10087\pm 44)\times10^{6}$ $J/ψ$ events and $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we search for the CP violating decays $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$. No significant signals are observed over the expected background yields. The upper limits on their branching fractions are set as $\mathcal{B}(J/ψ\rightarrow K^{0}_{S}K^{0}_{S}) <4.7\times 10^{-9}$ and $\mathcal{B}(ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}) <1.1\times 10^{-8}$ at the 90% confidence level. These results improve the previous limits by a factor of three for $J/ψ\rightarrow K^{0}_{S} K^{0}_{S}$ and two orders of magnitude for $ψ(3686)\rightarrow K^{0}_{S} K^{0}_{S}$. △ Less

Submitted 18 April, 2025; originally announced April 2025.

arXiv:2504.13539 [pdf, other]

Search for $1^{-+}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrow γη^{(\prime)} η_{c}$ at center-of-mass energies between 4.258 and 4.681 GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

Abstract: Using $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 10.6 fb$^{-1}$ collected at center-of-mass energies between 4.258 and 4.681 GeV with the BESIII detector at the BEPCII collider, we search for the $1^{- +}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrowγηη_{c}$ and $e^{+}e^{-}\rightarrowγη^{\prime}η_{c}$ decays for the first time. No significant signal is observed a… ▽ More Using $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 10.6 fb$^{-1}$ collected at center-of-mass energies between 4.258 and 4.681 GeV with the BESIII detector at the BEPCII collider, we search for the $1^{- +}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrowγηη_{c}$ and $e^{+}e^{-}\rightarrowγη^{\prime}η_{c}$ decays for the first time. No significant signal is observed and the upper limits on the Born cross sections for both processes are set at the 90% confidence level. △ Less

Submitted 18 April, 2025; originally announced April 2025.

arXiv:2504.12526 [pdf, other]

MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models

Authors: Junyang Zhang, Tianyi Zhu, Cheng Luo, Anima Anandkumar

Abstract: Long-context language models exhibit impressive performance but remain challenging to deploy due to high GPU memory demands during inference. We propose Memory-efficient Offloaded Mini-sequence Inference (MOM), a method that partitions critical layers into smaller "mini-sequences" and integrates seamlessly with KV cache offloading. Experiments on various Llama, Qwen, and Mistral models demonstrate… ▽ More Long-context language models exhibit impressive performance but remain challenging to deploy due to high GPU memory demands during inference. We propose Memory-efficient Offloaded Mini-sequence Inference (MOM), a method that partitions critical layers into smaller "mini-sequences" and integrates seamlessly with KV cache offloading. Experiments on various Llama, Qwen, and Mistral models demonstrate that MOM reduces peak memory usage by over 50\% on average. On Meta-Llama-3.2-8B, MOM extends the maximum context length from 155k to 455k tokens on a single A100 80GB GPU, while keeping outputs identical and not compromising accuracy. MOM also maintains highly competitive throughput due to minimal computational overhead and efficient last-layer processing. Compared to traditional chunked prefill methods, MOM achieves a 35\% greater context length extension. More importantly, our method drastically reduces prefill memory consumption, eliminating it as the longstanding dominant memory bottleneck during inference. This breakthrough fundamentally changes research priorities, redirecting future efforts from prefill-stage optimizations to improving decode-stage residual KV cache efficiency. △ Less

Submitted 16 April, 2025; originally announced April 2025.

Comments: Submitted to COLM

arXiv:2504.10956 [pdf, other]

The effects of asymptotically flat $R^2$ spacetime on black hole image of Sagittarius A*

Authors: Jian-Ming Yan, Tao Zhu, Qiang Wu

Abstract: A new class of analytically expressible vacuum solutions has recently been discovered for pure ${R}^2$ gravity, building upon Buchdahl's seminal work from 1962. These solutions, inspired by Buchdahl's framework, offer a promising avenue for testing ${R}^2$ gravity against astrophysical observations. Within a subset of asymptotically flat Buchdahl-inspired vacuum spacetimes, we introduce a free par… ▽ More A new class of analytically expressible vacuum solutions has recently been discovered for pure ${R}^2$ gravity, building upon Buchdahl's seminal work from 1962. These solutions, inspired by Buchdahl's framework, offer a promising avenue for testing ${R}^2$ gravity against astrophysical observations. Within a subset of asymptotically flat Buchdahl-inspired vacuum spacetimes, we introduce a free parameter $ε$ to characterize deviations from the Schwarzschild metric, which is recovered in the limit $ε= 0$. In this study, we employ the publicly available code \textit{ipole} to simulate black hole images under the Buchdahl-inspired metric, with a focus on the black hole at the center of the Milky Way, Sagittarius A* (Sgr A*). Our simulations show that both the shadow size and photon ring diameter decrease monotonically with increasing $ε$. By exploring a range of observational inclination angles, we find that the photon ring diameter being a direct observable is only weakly sensitive to the inclination angle. We further constrain the parameter $ε$ by comparing our simulation results with the Event Horizon Telescope (EHT) observations of Sgr A*. The obtained bounds are consistent with those previously derived from the orbital motion of the S2 star, but provide tighter constraints. In addition, we analyze the influence of the Buchdahl-inspired spacetime on the polarization patterns near the black hole and find its impact to be minimal. In contrast, the observational inclination angle has a substantial effect on the observed polarization structure, highlighting the dominant role of viewing geometry in shaping polarization features. △ Less

Submitted 16 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

Comments: 11 pages, 5 figures

arXiv:2504.10867 [pdf, other]

Precise measurement of the form factors in $D^0\rightarrow K^*(892)^-μ^+ν_μ$ and test of lepton universality with $D^0\rightarrow K^*(892)^-\ell^+ν_{\ell}$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

Abstract: We report a study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-μ^+ν_μ$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of the decay is measured for the first time to be… ▽ More We report a study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-μ^+ν_μ$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of the decay is measured for the first time to be $\mathcal{B}(D^0\rightarrow \bar{K}^0π^-μ^+ν_μ) = (1.373 \pm 0.020_{\rm stat} \pm 0.023_{\rm syst})\%$, where the first uncertainty is statistical and the second is systematic. Based on the investigation of the decay dynamics, we find that the decay is dominated by the $K^{*}(892)^-$ resonance with the branching fraction measured to be $\mathcal{B}(D^0\rightarrow K^{*}(892)^-μ^+ν_μ) = (1.948 \pm 0.033_{\rm stat} \pm 0.036_{\rm syst})\%$. We also determine the hadronic form factors for the $D^0\rightarrow K^{*}(892)^-μ^+ν_μ$ decay to be $r_{V} = V(0)/A_1(0) = 1.46 \pm 0.11_{\rm stat} \pm 0.04_{\rm syst}$, $r_{2} = A_2(0)/A_1(0) = 0.71 \pm 0.08_{\rm stat} \pm 0.03_{\rm syst}$, and $A_1(0)=0.609 \pm 0.008_{\rm stat} \pm 0.008_{\rm syst}$, where $V(0)$ is the vector form factor and $A_{1,2}(0)$ are the axial form factors evaluated at $q^2=0$. The $A_1(0)$ is measured for the first time in $D^0\rightarrow K^{*}(892)^-μ^+ν_μ$ decay. Averaging the form-factor parameters that we reported previously in $D^0\rightarrow K^*(892)^-(\rightarrow \bar{K}^0π^-)e^+ν_{e}$ and $D^0\rightarrow K^*(892)^-(\rightarrow K^-π^0)μ^+ν_μ$ decays, we obtain $r_{V}=1.456\pm0.040_{\rm stat}\pm0.016_{\rm syst}$, $r_{2}=0.715\pm0.031_{\rm stat}\pm0.014_{\rm stat}$, and $A_1(0)=0.614\pm0.005_{\rm stat}\pm0.004_{\rm syst}$. This is the most precise determination of the form-factor parameters to date measured in $D\rightarrow K^*(892)$ transition, which provide the most stringent test on various theoretical models. △ Less

Submitted 15 April, 2025; originally announced April 2025.

Comments: 9 pages, 4 figures

arXiv:2504.07817 [pdf, other]

Search for the baryon and lepton number violating decay $J/ψ\to pe^-$ + c.c

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (664 additional authors not shown)

Abstract: Based on $(2712.4\pm 14.3) \times 10^{6} $ ${ψ(3686)}$ events collected by the BESIII detector operating at the BEPCII storage ring, we perform a search for the baryon- and lepton-number violating decay $J/ψ\to pe^{-}+c.c.$ via $ψ(3686) \to π^{+}π^{-}J/ψ$. No significant signal is found. An upper limit on the branching fraction of $\mathcal{B}(J/ψ\to p e^{-}+ c.c.) < 3.1 \times 10^{-8}$ at 90\% co… ▽ More Based on $(2712.4\pm 14.3) \times 10^{6} $ ${ψ(3686)}$ events collected by the BESIII detector operating at the BEPCII storage ring, we perform a search for the baryon- and lepton-number violating decay $J/ψ\to pe^{-}+c.c.$ via $ψ(3686) \to π^{+}π^{-}J/ψ$. No significant signal is found. An upper limit on the branching fraction of $\mathcal{B}(J/ψ\to p e^{-}+ c.c.) < 3.1 \times 10^{-8}$ at 90\% confidence level. △ Less

Submitted 10 April, 2025; originally announced April 2025.

Comments: 8 pages, 1 figure

arXiv:2504.07348 [pdf, other]

doi 10.1126/sciadv.adu5264

A millisecond integrated quantum memory for photonic qubits

Authors: Yu-Ping Liu, Zhong-Wen Ou, Tian-Xiang Zhu, Ming-Xu Su, Chao Liu, Yong-Jian Han, Zong-Quan Zhou, Chuan-Feng Li, Guang-Can Guo

Abstract: Quantum memories for light are essential building blocks for quantum repeaters and quantum networks. Integrated operations of quantum memories could enable scalable application with low-power consumption. However, the photonic quantum storage lifetime in integrated optical waveguide has so far been limited to tens of microseconds, falling short of the requirements for practical applications. Here,… ▽ More Quantum memories for light are essential building blocks for quantum repeaters and quantum networks. Integrated operations of quantum memories could enable scalable application with low-power consumption. However, the photonic quantum storage lifetime in integrated optical waveguide has so far been limited to tens of microseconds, falling short of the requirements for practical applications. Here, we demonstrate quantum storage of photonic qubits for 1.021 ms based on a laser-written optical waveguide fabricated in a 151Eu3+:Y2SiO5 crystal. Spin dephasing of 151Eu3+ is mitigated through dynamical decoupling applied via on-chip electric waveguides and we obtain a storage efficiency of 12.0(0.5)% at 1.021 ms, which is a demonstration of integrated quantum memories that outperforms the efficiency of a simple fiber delay line. Such long-lived waveguide-based quantum memory could support applications in quantum repeaters, and further combination with critical magnetic fields could enable potential application as transportable quantum memories. △ Less

Submitted 9 April, 2025; originally announced April 2025.

Journal ref: Science Advances 11.13.eadu5264 (2025)

arXiv:2504.05584 [pdf, other]

Observation of Transverse Polarization and Determination of Electromagnetic Form Factor of $Λ$ Hyperon at $\sqrt{s}= 3.773$ GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

Abstract: Using a 20.3 fb$^{-1}$ of $e^{+}e^{-}$ collision data sample collected by the BESIII detector at the BEPCII collider, we present an observation of transverse polarization and a complete determination of the electromagnetic form factor of the $Λ$ hyperon in $e^{+}e^{-}\toΛ\barΛ$ decay with the entangled $Λ-\barΛ$ pair at $\sqrt{s}=3.773$ GeV. The relative phase between the electric and magnetic for… ▽ More Using a 20.3 fb$^{-1}$ of $e^{+}e^{-}$ collision data sample collected by the BESIII detector at the BEPCII collider, we present an observation of transverse polarization and a complete determination of the electromagnetic form factor of the $Λ$ hyperon in $e^{+}e^{-}\toΛ\barΛ$ decay with the entangled $Λ-\barΛ$ pair at $\sqrt{s}=3.773$ GeV. The relative phase between the electric and magnetic form factors is determined to be $ΔΦ=(1.53\pm0.36\pm0.03)$ rad with a significance of 5.5$σ$ taking into account systematic uncertainty. This result indicates a non-zero phase between the transition amplitudes of the $Λ\barΛ$ helicity states. Additionally, we measure the angular distribution parameter and the modulus of the ratio between the electric and the magnetic form factor is found to be $η=0.86\pm0.05\pm0.03$ and $R(s)=|G_{E}(s)/G_{M}(s)|=0.47\pm0.08\pm0.05$, where the first uncertainty is statistical and the second systematic. △ Less

Submitted 7 April, 2025; originally announced April 2025.

Comments: 9 pages, 1 table, 5 figures

arXiv:2504.04420 [pdf, other]

Observation of $ψ(3686) \to Ξ^- K^0_S \barΩ^+ $+c.c

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

Abstract: Using a sample of $(2.712\pm0.014) \times 10^{9}$ $ψ(3686)$ events collected with the BESIII detector at the electron positron collider BEPCII, the decay $ψ(3686) \to Ξ^- K^0_S \barΩ^+ +c.c.$ is observed for the first time, which has a significance of 5.9 standard deviations. The branching fraction of this decay is measured to be $(2.91\pm0.47\pm0.33)\times 10^{-6}$, where the first and second unc… ▽ More Using a sample of $(2.712\pm0.014) \times 10^{9}$ $ψ(3686)$ events collected with the BESIII detector at the electron positron collider BEPCII, the decay $ψ(3686) \to Ξ^- K^0_S \barΩ^+ +c.c.$ is observed for the first time, which has a significance of 5.9 standard deviations. The branching fraction of this decay is measured to be $(2.91\pm0.47\pm0.33)\times 10^{-6}$, where the first and second uncertainties are statistical and systematic, respectively. The ratio between $\mathcal{B}_{ψ(3686) \to Ξ^- K^0_S \barΩ^+ +c.c.}$ and $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}$ is determined to be $1.05\pm0.23\pm0.14 $, which deviates with the isospin symmetry conservation predicted value of 0.5 by $2.1σ$. △ Less

Submitted 6 April, 2025; originally announced April 2025.

arXiv:2504.04096 [pdf, ps, other]

Observation of a Three-Resonance Structure in the Cross Section of $e^+e^-\toπ^+π^- h_c$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

Abstract: Using $e^+e^-$ collision data collected with the BESIII detector operating at the Beijing Electron Positron Collider, the cross section of $e^+e^-\to π^+π^- h_c$ is measured at 59 points with center-of-mass energy $\sqrt{s}$ ranging from $4.009$ to $4.950~\mathrm{GeV}$ with a total integrated luminosity of $22.2~\mathrm{fb}^{-1}$. The cross section between $4.3$ and $4.45~\mathrm{GeV}$ exhibits a… ▽ More Using $e^+e^-$ collision data collected with the BESIII detector operating at the Beijing Electron Positron Collider, the cross section of $e^+e^-\to π^+π^- h_c$ is measured at 59 points with center-of-mass energy $\sqrt{s}$ ranging from $4.009$ to $4.950~\mathrm{GeV}$ with a total integrated luminosity of $22.2~\mathrm{fb}^{-1}$. The cross section between $4.3$ and $4.45~\mathrm{GeV}$ exhibits a plateau-like shape and drops sharply around $4.5~\mathrm{GeV}$, which cannot be described by two resonances only. Three coherent Breit-Wigner functions are used to parameterize the $\sqrt{s}$-dependent cross section line shape. The masses and widths are determined to be $M_1=(4223.6_{-3.7-2.9}^{+3.6+2.6})~\mathrm{MeV}/c^2$, $Γ_1=(58.5_{-11.4-6.5}^{+10.8+6.7})~\mathrm{MeV}$, $M_2=(4327.4_{-18.8-9.3}^{+20.1+10.7})~\mathrm{MeV}/c^2$, $Γ_2=(244.1_{-27.1-18.0}^{+34.0+23.9})~\mathrm{MeV}$, and $M_3=(4467.4_{-5.4-2.7}^{+7.2+3.2})~\mathrm{MeV}/c^2$, $Γ_3=(62.8_{-14.4-6.6}^{+19.2+9.8})~\mathrm{MeV}$. The first uncertainties are statistical and the other two are systematic. The statistical significance of the three Breit-Wigner assumption over the two Breit-Wigner assumption is greater than $5σ$. △ Less

Submitted 5 April, 2025; originally announced April 2025.

arXiv:2504.03603 [pdf, other]

Towards deployment-centric multimodal AI beyond vision and language

Authors: Xianyuan Liu, Jiayang Zhang, Shuo Zhou, Thijs L. van der Plas, Avish Vijayaraghavan, Anastasiia Grishina, Mengdie Zhuang, Daniel Schofield, Christopher Tomlinson, Yuhan Wang, Ruizhe Li, Louisa van Zeeland, Sina Tabakhi, Cyndie Demeocq, Xiang Li, Arunav Das, Orlando Timmerman, Thomas Baldwin-McDonald, Jinge Wu, Peizhen Bai, Zahraa Al Sahili, Omnia Alwazzan, Thao N. Do, Mohammod N. I. Suvon, Angeline Wang , et al. (23 additional authors not shown)

Abstract: Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering. However, most multimodal AI advances focus on models for vision and language data, while their deployability remains a key challenge. We advocate a deployment-centric workflow that in… ▽ More Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering. However, most multimodal AI advances focus on models for vision and language data, while their deployability remains a key challenge. We advocate a deployment-centric workflow that incorporates deployment constraints early to reduce the likelihood of undeployable solutions, complementing data-centric and model-centric approaches. We also emphasise deeper integration across multiple levels of multimodality and multidisciplinary collaboration to significantly broaden the research scope beyond vision and language. To facilitate this approach, we identify common multimodal-AI-specific challenges shared across disciplines and examine three real-world use cases: pandemic response, self-driving car design, and climate change adaptation, drawing expertise from healthcare, social science, engineering, science, sustainability, and finance. By fostering multidisciplinary dialogue and open research practices, our community can accelerate deployment-centric development for broad societal impact. △ Less

Submitted 4 April, 2025; originally announced April 2025.

arXiv:2504.03161 [pdf, ps, other]

Modified Tests of Linear Hypotheses Under Heteroscedasticity for Multivariate Functional Data with Finite Sample Sizes

Authors: Tianming Zhu

Abstract: As big data continues to grow, statistical inference for multivariate functional data (MFD) has become crucial. Although recent advancements have been made in testing the equality of mean functions, research on testing linear hypotheses for mean functions remains limited. Current methods primarily consist of permutation-based tests or asymptotic tests. However, permutation-based tests are known to… ▽ More As big data continues to grow, statistical inference for multivariate functional data (MFD) has become crucial. Although recent advancements have been made in testing the equality of mean functions, research on testing linear hypotheses for mean functions remains limited. Current methods primarily consist of permutation-based tests or asymptotic tests. However, permutation-based tests are known to be time-consuming, while asymptotic tests typically require larger sample sizes to maintain an accurate Type I error rate. This paper introduces three finite-sample tests that modify traditional MANOVA methods to tackle the general linear hypothesis testing problem for MFD. The test statistics rely on two symmetric, nonnegative-definite matrices, approximated by Wishart distributions, with degrees of freedom estimated via a U-statistics-based method. The proposed tests are affine-invariant, computationally more efficient than permutation-based tests, and better at controlling significance levels in small samples compared to asymptotic tests. A real-data example further showcases their practical utility. △ Less

Submitted 4 April, 2025; originally announced April 2025.

arXiv:2504.01823 [pdf, other]

Evidence of doubly OZI-suppressed decay $η_{c} \to ωφ$ in the radiative decay $J/ψ\to γη_{c}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

Abstract: Using a sample of $(10087\pm44) \times 10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, the first evidence for the doubly OZI-suppressed decay $η_{c} \to ωφ$ is reported with a significance of 4.0$σ$. The branching fraction of $η_{c} \to ωφ$ is measured to be $\mathcal{B}(η_{c} \to ωφ) = (3.86 \pm 0.92 \pm 0.62) \times 10^{-5}$, where the first uncertainty is statist… ▽ More Using a sample of $(10087\pm44) \times 10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, the first evidence for the doubly OZI-suppressed decay $η_{c} \to ωφ$ is reported with a significance of 4.0$σ$. The branching fraction of $η_{c} \to ωφ$ is measured to be $\mathcal{B}(η_{c} \to ωφ) = (3.86 \pm 0.92 \pm 0.62) \times 10^{-5}$, where the first uncertainty is statistical and the second is systematic. This result provides valuable insights into the underlying mechanisms of charmonium decays, particularly for processes such as $η_{c} \to VV$ (where $V$ represents a vector meson). △ Less

Submitted 2 April, 2025; originally announced April 2025.

arXiv:2503.24137 [pdf, other]

Half-life and precision shape measurement of 2νββ decay of $^{130}$Te

Authors: D. Q. Adams, C. Alduino, K. Alfonso, F. T. Avignone III, O. Azzolini, G. Bari, F. Bellini, G. Benato, M. Beretta, M. Biassoni, A. Branca, C. Brofferio, C. Bucci, J. Camilleri, A. Caminata, A. Campani, J. Cao, C. Capelli, S. Capelli, L. Cappelli, L. Cardani, P. Carniti, N. Casali, E. Celi, D. Chiesa , et al. (97 additional authors not shown)

Abstract: We present a new measurement of the 2nbb half-life of 130Te (T1/2) using the first complete model of the CUORE data, based on 1038 kg yr of collected exposure. Thanks to optimized data selection, we achieve a factor of two improvement in precision, obtaining T1/2 = (9.32 +0.05 -0.04 (stat.) +0.07 -0.07 (syst.)) x10^20 yr. The signal-to-background ratio is increased by 70% compared to our previous… ▽ More We present a new measurement of the 2nbb half-life of 130Te (T1/2) using the first complete model of the CUORE data, based on 1038 kg yr of collected exposure. Thanks to optimized data selection, we achieve a factor of two improvement in precision, obtaining T1/2 = (9.32 +0.05 -0.04 (stat.) +0.07 -0.07 (syst.)) x10^20 yr. The signal-to-background ratio is increased by 70% compared to our previous results, enabling the first application of the improved 2nbb formalism to 130Te. Within this framework, we determine a credibility interval for the effective axial coupling in the nuclear medium as a function of nuclear matrix elements. We also extract values for the higher-order nuclear matrix element ratios: second-to-first and third-to-first. The second-to-first ratio agrees with nuclear model predictions, while the third-to-first ratio deviates from theoretical expectations. These findings provide essential tests of nuclear models and key inputs for future 0nbb searches. △ Less

Submitted 31 March, 2025; originally announced March 2025.

arXiv:2503.23322 [pdf]

High-Dimensional Evolutionary Algorithm Based Design of Semi-Adder

Authors: Xi Zhang, Huihui Liu, Junrui Xi, Menglu Chen, Tao Zhu

Abstract: Facing the physical limitations and energy consumption bottlenecks of traditional electronic devices, we propose an innovative design framework integrating evolutionary algorithms and metasurface technology, aiming to achieve intelligent inverse design of photonic devices. Based on a constructed high-dimensional evolutionary algorithm framework, a four-layer metasurface cascade regulation system w… ▽ More Facing the physical limitations and energy consumption bottlenecks of traditional electronic devices, we propose an innovative design framework integrating evolutionary algorithms and metasurface technology, aiming to achieve intelligent inverse design of photonic devices. Based on a constructed high-dimensional evolutionary algorithm framework, a four-layer metasurface cascade regulation system was developed to realize the full optical physical expression of half-adder logic functions. This algorithm enables global optimization of 10000 unit parameters and can be extended to the design of more complex functional devices,thereby promoting goal-oriented and functional customization development △ Less

Submitted 30 March, 2025; originally announced March 2025.

arXiv:2503.22257 [pdf, other]

DynaGraph: Interpretable Multi-Label Prediction from EHRs via Dynamic Graph Learning and Contrastive Augmentation

Authors: Munib Mesinovic, Soheila Molaei, Peter Watkinson, Tingting Zhu

Abstract: Learning from longitudinal electronic health records is limited if it does not capture the temporal trajectories of the patient's state in a clinical setting. Graph models allow us to capture the hidden dependencies of the multivariate time-series when the graphs are constructed in a similar dynamic manner. Previous dynamic graph models require a pre-defined and/or static graph structure, which is… ▽ More Learning from longitudinal electronic health records is limited if it does not capture the temporal trajectories of the patient's state in a clinical setting. Graph models allow us to capture the hidden dependencies of the multivariate time-series when the graphs are constructed in a similar dynamic manner. Previous dynamic graph models require a pre-defined and/or static graph structure, which is unknown in most cases, or they only capture the spatial relations between the features. Furthermore in healthcare, the interpretability of the model is an essential requirement to build trust with clinicians. In addition to previously proposed attention mechanisms, there has not been an interpretable dynamic graph framework for data from multivariate electronic health records (EHRs). Here, we propose DynaGraph, an end-to-end interpretable contrastive graph model that learns the dynamics of multivariate time-series EHRs as part of optimisation. We validate our model in four real-world clinical datasets, ranging from primary care to secondary care settings with broad demographics, in challenging settings where tasks are imbalanced and multi-labelled. Compared to state-of-the-art models, DynaGraph achieves significant improvements in balanced accuracy and sensitivity over the nearest complex competitors in time-series or dynamic graph modelling across three ICU and one primary care datasets. Through a pseudo-attention approach to graph construction, our model also indicates the importance of clinical covariates over time, providing means for clinical validation. △ Less

Submitted 28 March, 2025; originally announced March 2025.

arXiv:2503.22180 [pdf, other]

Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data

Authors: Juwei Guan, Xiaolin Fang, Donghyun Kim, Haotian Gong, Tongxin Zhu, Zhen Ling, Ming Yang

Abstract: Low-quality data often suffer from insufficient image details, introducing an extra implicit aspect of camouflage that complicates camouflaged object detection (COD). Existing COD methods focus primarily on high-quality data, overlooking the challenges posed by low-quality data, which leads to significant performance degradation. Therefore, we propose KRNet, the first framework explicitly designed… ▽ More Low-quality data often suffer from insufficient image details, introducing an extra implicit aspect of camouflage that complicates camouflaged object detection (COD). Existing COD methods focus primarily on high-quality data, overlooking the challenges posed by low-quality data, which leads to significant performance degradation. Therefore, we propose KRNet, the first framework explicitly designed for COD on low-quality data. KRNet presents a Leader-Follower framework where the Leader extracts dual gold-standard distributions: conditional and hybrid, from high-quality data to drive the Follower in rectifying knowledge learned from low-quality data. The framework further benefits from a cross-consistency strategy that improves the rectification of these distributions and a time-dependent conditional encoder that enriches the distribution diversity. Extensive experiments on benchmark datasets demonstrate that KRNet outperforms state-of-the-art COD methods and super-resolution-assisted COD approaches, proving its effectiveness in tackling the challenges of low-quality data in COD. △ Less

Submitted 28 March, 2025; originally announced March 2025.

arXiv:2503.22126 [pdf, other]

Updated model-independent measurement of the strong-phase differences between $D^0$ and $\bar{D}^0 \to K^{0}_{S/L}π^+π^-$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

Abstract: The strong-phase differences between $D^0\to K_{S/L}^0π^+π^-$ and $\bar{D}^0\to K_{S/L}^0π^+π^-$ decays are one of the most important inputs in measuring the $C\!P$ violating angle $γ$ via $B^- \to D K^-$ decays. They also play a key role in studies of charm mixing and indirect $C\!P$ violation. In this paper, the strong-phase differences are determined in a model-independent way with quantum-corr… ▽ More The strong-phase differences between $D^0\to K_{S/L}^0π^+π^-$ and $\bar{D}^0\to K_{S/L}^0π^+π^-$ decays are one of the most important inputs in measuring the $C\!P$ violating angle $γ$ via $B^- \to D K^-$ decays. They also play a key role in studies of charm mixing and indirect $C\!P$ violation. In this paper, the strong-phase differences are determined in a model-independent way with quantum-correlated $D^0$-$\bar{D}^0$ decays from 7.93 fb$^{-1}$ of $e^+e^-$ annihilation data at $\sqrt{s}$=3.773 GeV by the BESIII experiment. These results are the most precise to date and are expected to significantly reduce associated uncertainties in determining the $C\!P$ violating angle $γ$ and related charm mixing parameters. △ Less

Submitted 18 April, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

arXiv:2503.21413 [pdf, other]

First observation of $Λ_{c}(2595)^{+} \to Λ^{+}_{c}π^0π^0$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (657 additional authors not shown)

Abstract: By analysing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 368.48~pb$^{-1}$ collected at the centre-of-mass energies of $\sqrt{s} = 4.918$ and $4.951$~GeV with the BESIII detector, we report the first observation of $Λ_{c}(2595)^{+}$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$ with statistical significances of 7.9$σ$ and 11.8$σ$, respectively. The branching fractions of… ▽ More By analysing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 368.48~pb$^{-1}$ collected at the centre-of-mass energies of $\sqrt{s} = 4.918$ and $4.951$~GeV with the BESIII detector, we report the first observation of $Λ_{c}(2595)^{+}$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$ with statistical significances of 7.9$σ$ and 11.8$σ$, respectively. The branching fractions of $Λ_{c}(2595)^{+}$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$ are measured to be $(59.5 \pm 11.1_{\rm stat.} \pm 7.9_{\rm syst.}) \%$ and $(41.0 \pm 5.2_{\rm stat.} \pm 3.3_{\rm syst.}) \%$, respectively. The absolute branching fraction of $Λ_{c}(2595)^{+}$ is consistent with the expectation of the mechanism referred to as the threshold effect, proposed for the strong decays of $Λ_{c}(2595)^{+}$ within uncertainty. △ Less

Submitted 27 March, 2025; originally announced March 2025.

Comments: 20 pages, 4 figures

arXiv:2503.20638 [pdf, other]

Parity-violating corrections to the orbital precession of binary system

Authors: Jin Qiao, Qing-Guo Huang, Tao Zhu, Wen Zhao

Abstract: In this work, we test for gravitational parity violation in the PSR J1141-6545 system by analyzing the orbital plane inclination precession induced by the misalignment between the white dwarf's spin axis and the system's total angular momentum. Using the parity-violating metric of gravity that incorporates terms from both the exterior and boundary of the field source, we calculated corrections to… ▽ More In this work, we test for gravitational parity violation in the PSR J1141-6545 system by analyzing the orbital plane inclination precession induced by the misalignment between the white dwarf's spin axis and the system's total angular momentum. Using the parity-violating metric of gravity that incorporates terms from both the exterior and boundary of the field source, we calculated corrections to the relative acceleration and orbital inclination precession rates, which exhibit significant deviations from the GR prediction. The parity-violating contributions depend on the projection of the spin vector along the orbital angular momentum direction, contrasting with GR, where it depends on the projection within the orbital plane. The corrections are perpendicular to GR contribution, highlighting a fundamental distinction. The exterior field correction is linear in the theoretical parameter and coupled to eccentricity $e$, while the boundary term correction is quadratic. By comparing these corrections with GR and incorporating observational uncertainty, we derive constraint on the theoretical parameter, yielding $ \dot{f}_{\rm PV}\lesssim 10~ \rm m$. △ Less

Submitted 26 March, 2025; originally announced March 2025.

Comments: 12 pages, 1 figure, and 2 tables

arXiv:2503.19846 [pdf, other]

Attention IoU: Examining Biases in CelebA using Attention Maps

Authors: Aaron Serianni, Tyler Zhu, Olga Russakovsky, Vikram V. Ramaswamy

Abstract: Computer vision models have been shown to exhibit and amplify biases across a wide array of datasets and tasks. Existing methods for quantifying bias in classification models primarily focus on dataset distribution and model performance on subgroups, overlooking the internal workings of a model. We introduce the Attention-IoU (Attention Intersection over Union) metric and related scores, which use… ▽ More Computer vision models have been shown to exhibit and amplify biases across a wide array of datasets and tasks. Existing methods for quantifying bias in classification models primarily focus on dataset distribution and model performance on subgroups, overlooking the internal workings of a model. We introduce the Attention-IoU (Attention Intersection over Union) metric and related scores, which use attention maps to reveal biases within a model's internal representations and identify image features potentially causing the biases. First, we validate Attention-IoU on the synthetic Waterbirds dataset, showing that the metric accurately measures model bias. We then analyze the CelebA dataset, finding that Attention-IoU uncovers correlations beyond accuracy disparities. Through an investigation of individual attributes through the protected attribute of Male, we examine the distinct ways biases are represented in CelebA. Lastly, by subsampling the training set to change attribute correlations, we demonstrate that Attention-IoU reveals potential confounding variables not present in dataset labels. △ Less

Submitted 25 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

Comments: To appear in CVPR 2025. Code and data is available at https://github.com/aaronserianni/attention-iou . 15 pages, 14 figures, including appendix

arXiv:2503.19542 [pdf, other]

Measurement of the branching fractions of doubly Cabibbo-suppressed $D$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (648 additional authors not shown)

Abstract: By analyzing $e^+e^-$ collision data collected at the center-of-mass energy of 3.773~GeV with the BESIII detector, corresponding to an integrated luminosity of 20.3~fb$^{-1}$, we measure the branching fractions of the doubly Cabibbo-suppressed (DCS) decays $D^0\to K^+π^-$, $D^0\to K^+π^-π^-π^+$, $D^0\to K^+π^-π^0$, $D^0\to K^+π^-π^0π^0$, $D^+\to K^+π^+π^-$, and $D^+\to K^+K^+K^-$. We also perform… ▽ More By analyzing $e^+e^-$ collision data collected at the center-of-mass energy of 3.773~GeV with the BESIII detector, corresponding to an integrated luminosity of 20.3~fb$^{-1}$, we measure the branching fractions of the doubly Cabibbo-suppressed (DCS) decays $D^0\to K^+π^-$, $D^0\to K^+π^-π^-π^+$, $D^0\to K^+π^-π^0$, $D^0\to K^+π^-π^0π^0$, $D^+\to K^+π^+π^-$, and $D^+\to K^+K^+K^-$. We also perform the first searches for $D^0\to K^+π^-η$, $D^0\to K^+π^-π^0η$, $D^+\to K^+π^+π^-η$, $D^{+} \to K^{+} \left(π^{+} π^{-} η\right)_{{\rm non}-η^{\prime}}$, and $D^+\to K^+ηη$ and report the first observations and evidence for some of these final states. Combining the measurements with the world averages of the corresponding Cabibbo-favored (CF) decays, the ratios of the DCS/CF branching fractions are obtained. For the $D^{+} \to K^{+} \left(π^{+} π^{-} η\right)_{{\rm non}-η^{\prime}}$ decay, the ratio is significantly larger than the corresponding ratios of the other DCS decays. △ Less

Submitted 25 March, 2025; originally announced March 2025.

Comments: 16 pages, 5 figures

arXiv:2503.18620 [pdf, ps, other]

doi 10.1103/PhysRevD.111.092007

Observation of the decay $ψ(3686)\rightarrow Σ^{0}\barΣ^{0}ω$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (695 additional authors not shown)

Abstract: Using a dataset of $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of the decay $ψ(3686)\toΣ^{0}\barΣ^{0}ω$ with a statistical significance of 8.9$σ$. The measured branching fraction is $(1.24 \pm 0.16_{\textrm{stat}} \pm 0.11_{\textrm{sys}}) \times 10^{-5}$, where the first uncertainty i… ▽ More Using a dataset of $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of the decay $ψ(3686)\toΣ^{0}\barΣ^{0}ω$ with a statistical significance of 8.9$σ$. The measured branching fraction is $(1.24 \pm 0.16_{\textrm{stat}} \pm 0.11_{\textrm{sys}}) \times 10^{-5}$, where the first uncertainty is statistical and the second is systematic. Additionally, we investigate potential intermediate states in the invariant mass distributions of $Σ^{0}ω$, $\barΣ^{0}ω$ and $Σ^{0}\barΣ^{0}$. A hint of a resonance is observed in the invariant mass distribution of $M_{Σ^{0}(\barΣ^{0})ω}$, located around 2.06 GeV/$c^2$, with a significance of 2.5$σ$. △ Less

Submitted 24 March, 2025; originally announced March 2025.

arXiv:2503.17165 [pdf, other]

Stringent test of $CP$ symmetry in $Σ^+$ hyperon decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

Abstract: The non-leptonic two-body weak decays $Σ^{+} \to p π^{0}$ and $\barΣ^{-} \to \bar{p} π^{0}$ are investigated, utilizing $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events and $(2.7124\pm0.0143)\times10^{9}$ $ψ(3686)$ events collected by BESIII experiment. The precision of the weak-decay parameters for the decays $Σ^{+} \to p π^{0}$ ($α_{0}$) and $\barΣ^{-} \to \bar{p} π^{0}$ ($\barα_{0}$) is improved b… ▽ More The non-leptonic two-body weak decays $Σ^{+} \to p π^{0}$ and $\barΣ^{-} \to \bar{p} π^{0}$ are investigated, utilizing $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events and $(2.7124\pm0.0143)\times10^{9}$ $ψ(3686)$ events collected by BESIII experiment. The precision of the weak-decay parameters for the decays $Σ^{+} \to p π^{0}$ ($α_{0}$) and $\barΣ^{-} \to \bar{p} π^{0}$ ($\barα_{0}$) is improved by a factor of three compared to the previous world average. Furthermore, the quantum-entangled $Σ^{+}\barΣ^{-}$ system enables the most precise test of $CP$ symmetry for the decay $Σ^+\to pπ^0$, through the asymmetry observable $A_{CP}=(α_{0}+\barα_{0})/(α_{0}-\barα_{0})$ that is measured to be $-0.0118\pm0.0083_{\rm stat}\pm0.0028_{\rm syst}$. Assuming $CP$ conservation, the average decay parameter is determined to be ${\left< α_{\rm 0}\right>} = (α_0-\barα_0)/2=-0.9869\pm0.0011_{\rm stat}\pm0.0016_{\rm syst}$, which is the most precise measurement of the asymmetry decay parameters in baryon sectors. The angular dependence of the ratio of the polarization of the $Σ^+$ in both $J/ψ$ and $ψ(3686)$ decays is studied for the first time. △ Less

Submitted 21 March, 2025; originally announced March 2025.

arXiv:2503.16835 [pdf, other]

Safe and Reliable Diffusion Models via Subspace Projection

Authors: Huiqiang Chen, Tianqing Zhu, Linlin Wang, Xin Yu, Longxiang Gao, Wanlei Zhou

Abstract: Large-scale text-to-image (T2I) diffusion models have revolutionized image generation, enabling the synthesis of highly detailed visuals from textual descriptions. However, these models may inadvertently generate inappropriate content, such as copyrighted works or offensive images. While existing methods attempt to eliminate specific unwanted concepts, they often fail to ensure complete removal, a… ▽ More Large-scale text-to-image (T2I) diffusion models have revolutionized image generation, enabling the synthesis of highly detailed visuals from textual descriptions. However, these models may inadvertently generate inappropriate content, such as copyrighted works or offensive images. While existing methods attempt to eliminate specific unwanted concepts, they often fail to ensure complete removal, allowing the concept to reappear in subtle forms. For instance, a model may successfully avoid generating images in Van Gogh's style when explicitly prompted with 'Van Gogh', yet still reproduce his signature artwork when given the prompt 'Starry Night'. In this paper, we propose SAFER, a novel and efficient approach for thoroughly removing target concepts from diffusion models. At a high level, SAFER is inspired by the observed low-dimensional structure of the text embedding space. The method first identifies a concept-specific subspace $S_c$ associated with the target concept c. It then projects the prompt embeddings onto the complementary subspace of $S_c$, effectively erasing the concept from the generated images. Since concepts can be abstract and difficult to fully capture using natural language alone, we employ textual inversion to learn an optimized embedding of the target concept from a reference image. This enables more precise subspace estimation and enhances removal performance. Furthermore, we introduce a subspace expansion strategy to ensure comprehensive and robust concept erasure. Extensive experiments demonstrate that SAFER consistently and effectively erases unwanted concepts from diffusion models while preserving generation quality. △ Less

Submitted 21 March, 2025; originally announced March 2025.

arXiv:2503.16779 [pdf, other]

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

Authors: Mengsong Wu, Tong Zhu, Han Han, Xiang Zhang, Wenbiao Shao, Wenliang Chen

Abstract: Tool learning can further broaden the usage scenarios of large language models (LLMs). However most of the existing methods either need to finetune that the model can only use tools seen in the training data, or add tool demonstrations into the prompt with lower efficiency. In this paper, we present a new Tool Learning method Chain-of-Tools. It makes full use of the powerful semantic representatio… ▽ More Tool learning can further broaden the usage scenarios of large language models (LLMs). However most of the existing methods either need to finetune that the model can only use tools seen in the training data, or add tool demonstrations into the prompt with lower efficiency. In this paper, we present a new Tool Learning method Chain-of-Tools. It makes full use of the powerful semantic representation capability of frozen LLMs to finish tool calling in CoT reasoning with a huge and flexible tool pool which may contain unseen tools. Especially, to validate the effectiveness of our approach in the massive unseen tool scenario, we construct a new dataset SimpleToolQuestions. We conduct experiments on two numerical reasoning benchmarks (GSM8K-XL and FuncQA) and two knowledge-based question answering benchmarks (KAMEL and SimpleToolQuestions). Experimental results show that our approach performs better than the baseline. We also identify dimensions of the model output that are critical in tool selection, enhancing the model interpretability. Our code and data are available at: https://github.com/fairyshine/Chain-of-Tools . △ Less

Submitted 20 March, 2025; originally announced March 2025.

Comments: 11 pages, 10 figures

arXiv:2503.16070 [pdf, other]

Search for the radiative leptonic decay $D^+\toγe^+ν_e$ with Deep Learning

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

Abstract: Using 20.3$~\rm fb^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773$~\rm GeV$ with the BESIII detector, we report an improved search for the radiative leptonic decay $D^+\toγe^+ν_e$. An upper limit on its partial branching fraction for photon energies $E_γ>10~\rm MeV$ is determined to be $1.2\times10^{-5}$ at 90\% confidence level, which excludes most current theor… ▽ More Using 20.3$~\rm fb^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773$~\rm GeV$ with the BESIII detector, we report an improved search for the radiative leptonic decay $D^+\toγe^+ν_e$. An upper limit on its partial branching fraction for photon energies $E_γ>10~\rm MeV$ is determined to be $1.2\times10^{-5}$ at 90\% confidence level, which excludes most current theoretical predictions. A sophisticated deep learning approach with thorough validation, based on the Transformer architecture, is implemented to efficiently distinguish the signal from massive backgrounds. △ Less

Submitted 20 March, 2025; originally announced March 2025.

Comments: 15 pages, 6 figures

arXiv:2503.15450 [pdf, other]

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Authors: Tongyao Zhu, Qian Liu, Haonan Wang, Shiqi Chen, Xiangming Gu, Tianyu Pang, Min-Yen Kan

Abstract: Recent advancements in LLM pretraining have featured ever-expanding context windows to process longer sequences. However, our pilot study reveals that models pretrained with shorter context windows consistently outperform their long-context counterparts under a fixed token budget. This finding motivates us to explore an optimal context window scheduling strategy to better balance long-context capa… ▽ More Recent advancements in LLM pretraining have featured ever-expanding context windows to process longer sequences. However, our pilot study reveals that models pretrained with shorter context windows consistently outperform their long-context counterparts under a fixed token budget. This finding motivates us to explore an optimal context window scheduling strategy to better balance long-context capability with pretraining efficiency. To this end, we propose SkyLadder, a simple yet effective approach that implements a short-to-long context window transition. SkyLadder preserves strong standard benchmark performance, while matching or exceeding baseline results on long context tasks. Through extensive experiments, we pre-train 1B-parameter models (up to 32K context) and 3B-parameter models (8K context) on 100B tokens, demonstrating that SkyLadder yields consistent gains of up to 3.7% on common benchmarks, while achieving up to 22% faster training speeds compared to baselines. The code is at https://github.com/sail-sg/SkyLadder. △ Less

Submitted 19 March, 2025; originally announced March 2025.

Comments: 22 pages. Accepted to ICLR 2025 Workshop on Open Science for Foundation Models

arXiv:2503.14040 [pdf, other]

MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization

Authors: Binjie Liu, Lina Liu, Sanyi Zhang, Songen Gu, Yihao Zhi, Tianyi Zhu, Lei Yang, Long Ye

Abstract: This work focuses on full-body co-speech gesture generation. Existing methods typically employ an autoregressive model accompanied by vector-quantized tokens for gesture generation, which results in information loss and compromises the realism of the generated gestures. To address this, inspired by the natural continuity of real-world human motion, we propose MAG, a novel multi-modal aligned frame… ▽ More This work focuses on full-body co-speech gesture generation. Existing methods typically employ an autoregressive model accompanied by vector-quantized tokens for gesture generation, which results in information loss and compromises the realism of the generated gestures. To address this, inspired by the natural continuity of real-world human motion, we propose MAG, a novel multi-modal aligned framework for high-quality and diverse co-speech gesture synthesis without relying on discrete tokenization. Specifically, (1) we introduce a motion-text-audio-aligned variational autoencoder (MTA-VAE), which leverages pre-trained WavCaps' text and audio embeddings to enhance both semantic and rhythmic alignment with motion, ultimately producing more realistic gestures. (2) Building on this, we propose a multimodal masked autoregressive model (MMAG) that enables autoregressive modeling in continuous motion embeddings through diffusion without vector quantization. To further ensure multi-modal consistency, MMAG incorporates a hybrid granularity audio-text fusion block, which serves as conditioning for diffusion process. Extensive experiments on two benchmark datasets demonstrate that MAG achieves stateof-the-art performance both quantitatively and qualitatively, producing highly realistic and diverse co-speech gestures.The code will be released to facilitate future research. △ Less

Submitted 18 March, 2025; originally announced March 2025.

arXiv:2503.13542 [pdf, other]

HAR-DoReMi: Optimizing Data Mixture for Self-Supervised Human Activity Recognition Across Heterogeneous IMU Datasets

Authors: Lulu Ban, Tao Zhu, Xiangqing Lu, Qi Qiu, Wenyong Han, Shuangjian Li, Liming Chen, Kevin I-Kai Wang, Mingxing Nie, Yaping Wan

Abstract: Cross-dataset Human Activity Recognition (HAR) suffers from limited model generalization, hindering its practical deployment. To address this critical challenge, inspired by the success of DoReMi in Large Language Models (LLMs), we introduce a data mixture optimization strategy for pre-training HAR models, aiming to improve the recognition performance across heterogeneous datasets. However, direct… ▽ More Cross-dataset Human Activity Recognition (HAR) suffers from limited model generalization, hindering its practical deployment. To address this critical challenge, inspired by the success of DoReMi in Large Language Models (LLMs), we introduce a data mixture optimization strategy for pre-training HAR models, aiming to improve the recognition performance across heterogeneous datasets. However, directly applying DoReMi to the HAR field encounters new challenges due to the continuous, multi-channel and intrinsic heterogeneous characteristics of IMU sensor data. To overcome these limitations, we propose a novel framework HAR-DoReMi, which introduces a masked reconstruction task based on Mean Squared Error (MSE) loss. By raplacing the discrete language sequence prediction task, which relies on the Negative Log-Likelihood (NLL) loss, in the original DoReMi framework, the proposed framework is inherently more appropriate for handling the continuous and multi-channel characteristics of IMU data. In addition, HAR-DoReMi integrates the Mahony fusion algorithm into the self-supervised HAR pre-training, aiming to mitigate the heterogeneity of varying sensor orientation. This is achieved by estimating the sensor orientation within each dataset and facilitating alignment with a unified coordinate system, thereby improving the cross-dataset generalization ability of the HAR model. Experimental evaluation on multiple cross-dataset HAR transfer tasks demonstrates that HAR-DoReMi improves the accuracy by an average of 6.51%, compared to the current state-of-the-art method with only approximately 30% to 50% of the data usage. These results confirm the effectiveness of HAR-DoReMi in improving the generalization and data efficiency of pre-training HAR models, underscoring its significant potential to facilitate the practical deployment of HAR technology. △ Less

Submitted 16 March, 2025; originally announced March 2025.

arXiv:2503.13436 [pdf, other]

Unified Autoregressive Visual Generation and Understanding with Continuous Tokens

Authors: Lijie Fan, Luming Tang, Siyang Qin, Tianhong Li, Xuan Yang, Siyuan Qiao, Andreas Steiner, Chen Sun, Yuanzhen Li, Tao Zhu, Michael Rubinstein, Michalis Raptis, Deqing Sun, Radu Soricut

Abstract: We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating discrete tokens for text and continuous tokens for image. We find though there is an inherent trade-off between the image generation and understanding task, a careful… ▽ More We present UniFluid, a unified autoregressive framework for joint visual generation and understanding leveraging continuous visual tokens. Our unified autoregressive architecture processes multimodal image and text inputs, generating discrete tokens for text and continuous tokens for image. We find though there is an inherent trade-off between the image generation and understanding task, a carefully tuned training recipe enables them to improve each other. By selecting an appropriate loss balance weight, the unified model achieves results comparable to or exceeding those of single-task baselines on both tasks. Furthermore, we demonstrate that employing stronger pre-trained LLMs and random-order generation during training is important to achieve high-fidelity image generation within this unified framework. Built upon the Gemma model series, UniFluid exhibits competitive performance across both image generation and understanding, demonstrating strong transferability to various downstream tasks, including image editing for generation, as well as visual captioning and question answering for understanding. △ Less

Submitted 17 March, 2025; originally announced March 2025.

Comments: Tech report

arXiv:2503.12497 [pdf, other]

Defense Against Model Stealing Based on Account-Aware Distribution Discrepancy

Authors: Jian-Ping Mei, Weibin Zhang, Jie Chen, Xuyun Zhang, Tiantian Zhu

Abstract: Malicious users attempt to replicate commercial models functionally at low cost by training a clone model with query responses. It is challenging to timely prevent such model-stealing attacks to achieve strong protection and maintain utility. In this paper, we propose a novel non-parametric detector called Account-aware Distribution Discrepancy (ADD) to recognize queries from malicious users by le… ▽ More Malicious users attempt to replicate commercial models functionally at low cost by training a clone model with query responses. It is challenging to timely prevent such model-stealing attacks to achieve strong protection and maintain utility. In this paper, we propose a novel non-parametric detector called Account-aware Distribution Discrepancy (ADD) to recognize queries from malicious users by leveraging account-wise local dependency. We formulate each class as a Multivariate Normal distribution (MVN) in the feature space and measure the malicious score as the sum of weighted class-wise distribution discrepancy. The ADD detector is combined with random-based prediction poisoning to yield a plug-and-play defense module named D-ADD for image classification models. Results of extensive experimental studies show that D-ADD achieves strong defense against different types of attacks with little interference in serving benign users for both soft and hard-label settings. △ Less

Submitted 16 March, 2025; originally announced March 2025.

Comments: 11 pages, 7 figures, published in AAAI 2025

arXiv:2503.12362 [pdf, other]

On the exponential synchronization for the asymmetric second-order Kuramoto model

Authors: Tingting Zhu, Xiongtao Zhang

Abstract: In this paper, we study the synchronization problem of nonuniform second-order Kuramoto model with homogeneous dampings and frustration effects on an asymmetric network. More precisely, we focus on the second order model defined on an asymmetric graph with depth no greater than two and present theories on the complete frequency synchronization. Due to the absence of the gradient flow structure, we… ▽ More In this paper, we study the synchronization problem of nonuniform second-order Kuramoto model with homogeneous dampings and frustration effects on an asymmetric network. More precisely, we focus on the second order model defined on an asymmetric graph with depth no greater than two and present theories on the complete frequency synchronization. Due to the absence of the gradient flow structure, we develop novel energy functions to control the diameters of phase and frequency respectively, which allows us to construct first-order Gronwall-type inequalities. This eventually gives rise to the exponential convergence to the synchronized state in a regime in terms of large coupling strength, small inertia and frustration. △ Less

Submitted 16 March, 2025; originally announced March 2025.

MSC Class: 34D05; 34D06; 34C15; 92D25

arXiv:2503.11733 [pdf, other]

LLM Agents for Education: Advances and Applications

Authors: Zhendong Chu, Shen Wang, Jian Xie, Tinghui Zhu, Yibo Yan, Jinheng Ye, Aoxiao Zhong, Xuming Hu, Jing Liang, Philip S. Yu, Qingsong Wen

Abstract: Large Language Model (LLM) agents have demonstrated remarkable capabilities in automating tasks and driving innovation across diverse educational applications. In this survey, we provide a systematic review of state-of-the-art research on LLM agents in education, categorizing them into two broad classes: (1) \emph{Pedagogical Agents}, which focus on automating complex pedagogical tasks to support… ▽ More Large Language Model (LLM) agents have demonstrated remarkable capabilities in automating tasks and driving innovation across diverse educational applications. In this survey, we provide a systematic review of state-of-the-art research on LLM agents in education, categorizing them into two broad classes: (1) \emph{Pedagogical Agents}, which focus on automating complex pedagogical tasks to support both teachers and students; and (2) \emph{Domain-Specific Educational Agents}, which are tailored for specialized fields such as science education, language learning, and professional development. We comprehensively examine the technological advancements underlying these LLM agents, including key datasets, benchmarks, and algorithmic frameworks that drive their effectiveness. Furthermore, we discuss critical challenges such as privacy, bias and fairness concerns, hallucination mitigation, and integration with existing educational ecosystems. This survey aims to provide a comprehensive technological overview of LLM agents for education, fostering further research and collaboration to enhance their impact for the greater good of learners and educators alike. △ Less

Submitted 14 March, 2025; originally announced March 2025.

Comments: 17 pages

arXiv:2503.11526 [pdf, other]

Sum-of-Max Chain Partition of a Tree

Authors: Ruixi Luo, Taikun Zhu, Kai Jin

Abstract: Path partition problems on trees have found various applications. In this paper, we present an $O(n \log n)$ time algorithm for solving the following variant of path partition problem: given a rooted tree of $n$ nodes $1, \ldots, n$, where vertex $i$ is associated with a weight $w_i$ and a cost $s_i$, partition the tree into several disjoint chains $C_1,\ldots,C_k$, so that the weight of each chai… ▽ More Path partition problems on trees have found various applications. In this paper, we present an $O(n \log n)$ time algorithm for solving the following variant of path partition problem: given a rooted tree of $n$ nodes $1, \ldots, n$, where vertex $i$ is associated with a weight $w_i$ and a cost $s_i$, partition the tree into several disjoint chains $C_1,\ldots,C_k$, so that the weight of each chain is no more than a threshold $w_0$ and the sum of the largest $s_i$ in each chain is minimized. We also generalize the algorithm to the case where the cost of a chain is determined by the $s_i$ of the vertex with the highest rank in the chain, which can be determined by an arbitrary total order defined on all nodes instead of the value of $s_i$. △ Less

Submitted 14 March, 2025; originally announced March 2025.

arXiv:2503.11383 [pdf, other]

Study of $φ\to K\bar{K}$ and $K_{S}^{0}-K_{L}^{0}$ asymmetry in the amplitude analysis of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ decay

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (701 additional authors not shown)

Abstract: Using $e^+e^-$ annihilation data corresponding to a total integrated luminosity of 7.33 $\rm fb^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we provide the first amplitude analysis and absolute branching fraction measurement of the hadronic decay $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$. The branching fraction of… ▽ More Using $e^+e^-$ annihilation data corresponding to a total integrated luminosity of 7.33 $\rm fb^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we provide the first amplitude analysis and absolute branching fraction measurement of the hadronic decay $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$. The branching fraction of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ is determined to be $(1.86\pm0.06_{\rm stat}\pm0.03_{\rm syst})\%$. Combining the $\mathcal{B}(D_{s}^{+} \to φ(\to K_{S}^0K_{L}^0) π^+)$ obtained in this work and the world average of $\mathcal{B}(D_{s}^{+} \to φ(\to K^+K^-) π^+)$, we measure the relative branching fraction $\mathcal{B}(φ\to K_S^0K_L^0)/\mathcal{B}(φ\to K^+K^-)$=($0.597 \pm 0.023_{\rm stat} \pm 0.018_{\rm syst} \pm 0.016_{\rm PDG}$), which deviates from the PDG value by more than 3$σ$. Furthermore, the asymmetry of the branching fractions of $D^+_s\to K_{S}^0K^{*}(892)^{+}$ and $D^+_s\to K_{L}^0K^{*}(892)^{+}$, $\frac{\mathcal{B}(D_{s}^{+} \to K_{S}^0K^{*}(892)^{+})-\mathcal{B}(D_{s}^{+} \to K_{L}^0K^{*}(892)^{+})}{\mathcal{B}(D_{s}^{+} \to K_{S}^0K^{*}(892)^{+})+\mathcal{B}(D_{s}^{+} \to K_{L}^0K^{*}(892)^{+})}$, is determined to be $(-13.4\pm5.0_{\rm stat}\pm3.4_{\rm syst})\%$. △ Less

Submitted 23 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

Comments: 11 pages, 4 figures

arXiv:2503.11107 [pdf, other]

Discrete Effort Distribution via Regret-enabled Greedy Algorithm

Authors: Song Cao, Taikun Zhu, Kai Jin

Abstract: This paper addresses resource allocation problem with a separable objective function under a single linear constraint, formulated as maximizing $\sum_{j=1}^{n}R_j(x_j)$ subject to $\sum_{j=1}^{n}x_j=k$ and $x_j\in\{0,\dots,m\}$. While classical dynamic programming approach solves this problem in $O(n^2m^2)$ time, we propose a regret-enabled greedy algorithm that achieves $O(n\log n)$ time when… ▽ More This paper addresses resource allocation problem with a separable objective function under a single linear constraint, formulated as maximizing $\sum_{j=1}^{n}R_j(x_j)$ subject to $\sum_{j=1}^{n}x_j=k$ and $x_j\in\{0,\dots,m\}$. While classical dynamic programming approach solves this problem in $O(n^2m^2)$ time, we propose a regret-enabled greedy algorithm that achieves $O(n\log n)$ time when $m=O(1)$. The algorithm significantly outperforms traditional dynamic programming for small $m$. Our algorithm actually solves the problem for all $k~(0\leq k\leq nm)$ in the mentioned time. △ Less

Submitted 21 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

arXiv:2503.11015 [pdf, other]

Search for a $1^{-+}$ molecular state via $e^{+}e^{-} \to γD^{+}_{s} D_{s1}^{-}(2536) +c.c.$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (649 additional authors not shown)

Abstract: We search, for the first time, for an exotic molecular state with quantum numbers $J^{PC}=1^{-+}$, called $X$, via the process $e^{+}e^{-} \to γD^{+}_{s} D_{s1}^{-}(2536) +c.c.$ using data samples corresponding to a luminosity of $5.8~\mathrm{fb^{-1}}$ across center-of-mass energies from 4.612 to 4.951~GeV, collected with the BESIII detector operating at the BEPCII collider. No statistically signi… ▽ More We search, for the first time, for an exotic molecular state with quantum numbers $J^{PC}=1^{-+}$, called $X$, via the process $e^{+}e^{-} \to γD^{+}_{s} D_{s1}^{-}(2536) +c.c.$ using data samples corresponding to a luminosity of $5.8~\mathrm{fb^{-1}}$ across center-of-mass energies from 4.612 to 4.951~GeV, collected with the BESIII detector operating at the BEPCII collider. No statistically significant signal is observed. The upper limits on the product of cross-section and branching fraction $σ({e^{+}e^{-} \to γX}) \times \mathcal{B}(X \to D^{+}_{s} D_{s1}^{-}(2536) +c.c.)$ at 90\% confidence level are reported for each energy point, assuming the $X$ mass to be 4.503~GeV/$c^{2}$ and the width 25, 50, 75, and 100~MeV, respectively. △ Less

Submitted 13 March, 2025; originally announced March 2025.

Comments: 13 pages,5 figures

arXiv:2503.08468 [pdf, other]

Flow and thermal modelling of the argon volume in the DarkSide-20k TPC

Authors: DarkSide-20k Collaboration, :, F. Acerbi, P. Adhikari, P. Agnes, I. Ahmad, S. Albergo, I. F. Albuquerque, T. Alexander, A. K. Alton, P. Amaudruz, M. Angiolilli, E. Aprile, M. Atzori Corona, D. J. Auty, M. Ave, I. C. Avetisov, O. Azzolini, H. O. Back, Z. Balmforth, A. Barrado Olmedo, P. Barrillon, G. Batignani, P. Bhowmick, M. Bloem , et al. (279 additional authors not shown)

Abstract: The DarkSide-20k dark matter experiment, currently under construction at LNGS, features a dual-phase time projection chamber (TPC) with a ~50 t argon target from an underground well. At this scale, it is crucial to optimise the argon flow pattern for efficient target purification and for fast distribution of internal gaseous calibration sources with lifetimes of the order of hours. To this end, we… ▽ More The DarkSide-20k dark matter experiment, currently under construction at LNGS, features a dual-phase time projection chamber (TPC) with a ~50 t argon target from an underground well. At this scale, it is crucial to optimise the argon flow pattern for efficient target purification and for fast distribution of internal gaseous calibration sources with lifetimes of the order of hours. To this end, we have performed computational fluid dynamics simulations and heat transfer calculations. The residence time distribution shows that the detector is well-mixed on time-scales of the turnover time (~40 d). Notably, simulations show that despite a two-order-of-magnitude difference between the turnover time and the half-life of $^{83\text{m}}$Kr of 1.83 h, source atoms have the highest probability to reach the centre of the TPC 13 min after their injection, allowing for a homogeneous distribution before undergoing radioactive decay. We further analyse the thermal aspects of dual-phase operation and define the requirements for the formation of a stable gas pocket on top of the liquid. We find a best-estimate value for the heat transfer rate at the liquid-gas interface of 62 W with an upper limit of 144 W and a minimum gas pocket inlet temperature of 89 K to avoid condensation on the acrylic anode. This study also informs the placement of liquid inlets and outlets in the TPC. The presented techniques are widely applicable to other large-scale, noble-liquid detectors. △ Less

Submitted 11 March, 2025; originally announced March 2025.

Comments: 37 pages, 19 figures, 7 tables

arXiv:2503.08162 [pdf, other]

FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback

Authors: Kangan Qian, Ziang Luo, Sicong Jiang, Zilin Huang, Jinyu Miao, Zhikun Ma, Tianze Zhu, Jiayin Li, Yangfan He, Zheng Fu, Yining Shi, Boyue Wang, Hezhe Lin, Ziyu Chen, Jiangbo Yu, Xinyu Jiao, Mengmeng Yang, Kun Jiang, Diange Yang

Abstract: Ensuring safe, comfortable, and efficient planning is crucial for autonomous driving systems. While end-to-end models trained on large datasets perform well in standard driving scenarios, they struggle with complex low-frequency events. Recent Large Language Models (LLMs) and Vision Language Models (VLMs) advancements offer enhanced reasoning but suffer from computational inefficiency. Inspired by… ▽ More Ensuring safe, comfortable, and efficient planning is crucial for autonomous driving systems. While end-to-end models trained on large datasets perform well in standard driving scenarios, they struggle with complex low-frequency events. Recent Large Language Models (LLMs) and Vision Language Models (VLMs) advancements offer enhanced reasoning but suffer from computational inefficiency. Inspired by the dual-process cognitive model "Thinking, Fast and Slow", we propose $\textbf{FASIONAD}$ -- a novel dual-system framework that synergizes a fast end-to-end planner with a VLM-based reasoning module. The fast system leverages end-to-end learning to achieve real-time trajectory generation in common scenarios, while the slow system activates through uncertainty estimation to perform contextual analysis and complex scenario resolution. Our architecture introduces three key innovations: (1) A dynamic switching mechanism enabling slow system intervention based on real-time uncertainty assessment; (2) An information bottleneck with high-level plan feedback that optimizes the slow system's guidance capability; (3) A bidirectional knowledge exchange where visual prompts enhance the slow system's reasoning while its feedback refines the fast planner's decision-making. To strengthen VLM reasoning, we develop a question-answering mechanism coupled with reward-instruct training strategy. In open-loop experiments, FASIONAD achieves a $6.7\%$ reduction in average $L2$ trajectory error and $28.1\%$ lower collision rate. △ Less

Submitted 11 March, 2025; originally announced March 2025.

Comments: 8 pages, 4 figures

arXiv:2503.07153 [pdf, other]

PTMs-TSCIL Pre-Trained Models Based Class-Incremental Learning

Authors: Yuanlong Wu, Mingxing Nie, Tao Zhu, Liming Chen, Huansheng Ning, Yaping Wan

Abstract: Class-incremental learning (CIL) for time series data faces critical challenges in balancing stability against catastrophic forgetting and plasticity for new knowledge acquisition, particularly under real-world constraints where historical data access is restricted. While pre-trained models (PTMs) have shown promise in CIL for vision and NLP domains, their potential in time series class-incrementa… ▽ More Class-incremental learning (CIL) for time series data faces critical challenges in balancing stability against catastrophic forgetting and plasticity for new knowledge acquisition, particularly under real-world constraints where historical data access is restricted. While pre-trained models (PTMs) have shown promise in CIL for vision and NLP domains, their potential in time series class-incremental learning (TSCIL) remains underexplored due to the scarcity of large-scale time series pre-trained models. Prompted by the recent emergence of large-scale pre-trained models (PTMs) for time series data, we present the first exploration of PTM-based Time Series Class-Incremental Learning (TSCIL). Our approach leverages frozen PTM backbones coupled with incrementally tuning the shared adapter, preserving generalization capabilities while mitigating feature drift through knowledge distillation. Furthermore, we introduce a Feature Drift Compensation Network (DCN), designed with a novel two-stage training strategy to precisely model feature space transformations across incremental tasks. This allows for accurate projection of old class prototypes into the new feature space. By employing DCN-corrected prototypes, we effectively enhance the unified classifier retraining, mitigating model feature drift and alleviating catastrophic forgetting. Extensive experiments on five real-world datasets demonstrate state-of-the-art performance, with our method yielding final accuracy gains of 1.4%-6.1% across all datasets compared to existing PTM-based approaches. Our work establishes a new paradigm for TSCIL, providing insights into stability-plasticity optimization for continual learning systems. △ Less

Submitted 10 March, 2025; originally announced March 2025.

Comments: 13 pages,6 figures

arXiv:2503.07014 [pdf, other]

Vib2Mol: from vibrational spectra to molecular structures-a versatile deep learning model

Authors: Xinyu Lu, Hao Ma, Hui Li, Jia Li, Tong Zhu, Guokun Liu, Bin Ren

Abstract: There will be a paradigm shift in chemical and biological research, to be enabled by autonomous, closed-loop, real-time self-directed decision-making experimentation. Spectrum-to-structure correlation, which is to elucidate molecular structures with spectral information, is the core step in understanding the experimental results and to close the loop. However, current approaches usually divide the… ▽ More There will be a paradigm shift in chemical and biological research, to be enabled by autonomous, closed-loop, real-time self-directed decision-making experimentation. Spectrum-to-structure correlation, which is to elucidate molecular structures with spectral information, is the core step in understanding the experimental results and to close the loop. However, current approaches usually divide the task into either database-dependent retrieval and database-independent generation and neglect the inherent complementarity between them. In this study, we proposed Vib2Mol, a general deep learning model designed to flexibly handle diverse spectrum-to-structure tasks according to the available prior knowledge by bridging the retrieval and generation. It achieves state-of-the-art performance, even for the most demanding Raman spectra, over previous models in predicting reaction products and sequencing peptides as well as analyzing experimental spectra and integrating multi-modal spectral data. Vib2Mol enables vibrational spectroscopy a real-time guide for autonomous scientific discovery workflows. △ Less

Submitted 27 April, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

arXiv:2503.06849 [pdf, other]

First differential measurement of the single $\mathbfπ^+$ production cross section in neutrino neutral-current scattering

Authors: K. Abe, S. Abe, R. Akutsu, H. Alarakia-Charles, Y. I. Alj Hakim, S. Alonso Monsalve, L. Anthony, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, N. Babu, V. Baranov, G. J. Barker, G. Barr, D. Barrow, P. Bates, L. Bathe-Peters, M. Batkiewicz-Kwasniak, N. Baudis, V. Berardi, L. Berns, S. Bhattacharjee, A. Blanchet , et al. (338 additional authors not shown)

Abstract: Since its first observation in the 1970s, neutrino-induced neutral-current single positive pion production (NC1$π^+$) has remained an elusive and poorly understood interaction channel. This process is a significant background in neutrino oscillation experiments and studying it further is critical for the physics program of next-generation accelerator-based neutrino oscillation experiments. In this… ▽ More Since its first observation in the 1970s, neutrino-induced neutral-current single positive pion production (NC1$π^+$) has remained an elusive and poorly understood interaction channel. This process is a significant background in neutrino oscillation experiments and studying it further is critical for the physics program of next-generation accelerator-based neutrino oscillation experiments. In this Letter we present the first double-differential cross-section measurement of NC1$π^+$ interactions using data from the ND280 detector of the T2K experiment collected in $ν$-beam mode. We compare the results on a hydrocarbon target to the predictions of several neutrino interaction generators and final-state interaction models. While model predictions agree with the differential results, the data shows a weak preference for a cross-section normalization approximately 30\% higher than predicted by most models studied in this Letter. △ Less

Submitted 11 March, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

arXiv:2503.06843 [pdf, other]

Signal selection and model-independent extraction of the neutrino neutral-current single $π^+$ cross section with the T2K experiment

Authors: K. Abe, S. Abe, R. Akutsu, H. Alarakia-Charles, Y. I. Alj Hakim, S. Alonso Monsalve, L. Anthony, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, N. Babu, V. Baranov, G. J. Barker, G. Barr, D. Barrow, P. Bates, L. Bathe-Peters, M. Batkiewicz-Kwasniak, N. Baudis, V. Berardi, L. Berns, S. Bhattacharjee, A. Blanchet , et al. (338 additional authors not shown)

Abstract: This article presents a study of single $π^+$ production in neutrino neutral-current interactions (NC1$π^+$) using the ND280 detector of the T2K experiment. We report the largest sample of such events selected by any experiment, providing the first new data for this channel in over four decades and the first using a sub-GeV neutrino flux. The signal selection strategy and its performance are detai… ▽ More This article presents a study of single $π^+$ production in neutrino neutral-current interactions (NC1$π^+$) using the ND280 detector of the T2K experiment. We report the largest sample of such events selected by any experiment, providing the first new data for this channel in over four decades and the first using a sub-GeV neutrino flux. The signal selection strategy and its performance are detailed together with validations of a robust cross section extraction methodology. The measured flux-averaged integrated cross-section is $ σ= (6.07 \pm 1.22 )\times 10^{-41} \,\, \text{cm}^2/\text{nucleon}$, 1.3~$σ~$ above the NEUT v5.4.0 expectation. △ Less

Submitted 11 March, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

arXiv:2503.06750 [pdf, other]

Probing quantum corrected black hole through astrophysical tests with the orbit of S2 star and quasiperiodic oscillations

Authors: Tursunali Xamidov, Sanjar Shaymatov, Bobomurat Ahmedov, Tao Zhu

Abstract: In this study, we explore the influence of the quantum correction parameter $ξ$ on the motion of particles and the properties of quasiperiodic oscillations (QPOs) around a quantum-corrected black hole (QCBH). We first analyze the geodesics of a test particle and derive weak-field constraints on parameter $ξ$ from the perihelion precession of orbits, using observations from the Solar System and the… ▽ More In this study, we explore the influence of the quantum correction parameter $ξ$ on the motion of particles and the properties of quasiperiodic oscillations (QPOs) around a quantum-corrected black hole (QCBH). We first analyze the geodesics of a test particle and derive weak-field constraints on parameter $ξ$ from the perihelion precession of orbits, using observations from the Solar System and the S2 star's orbit around $\text{SgrA}^\star$ supermassive black hole in the center of our galaxy. We obtain $ξ\leq 0.01869$ and $ξ\leq 0.73528$ using the analysis of Solar System observations and the orbit of the S2 star around $\text{SgrA}^\star$, respectively. In the strong-field regime, we examine the dynamics of epicyclic motion around astrophysical black holes and, using observational data from four QPO sources and the Markov Chain Monte Carlo (MCMC) method, we determine the upper constraint $ξ\leq 2.086$. Our results provide new insights into the effects of quantum corrections on black hole spacetimes and highlight the potential of QPOs as a probe for testing quantum gravity in astrophysical environments. △ Less

Submitted 9 March, 2025; originally announced March 2025.

Comments: 11 pages, 4 captioned figures, 3 captioned tables

arXiv:2503.06150 [pdf, other]

doi 10.1109/TDSC.2025.3551157

Do Fairness Interventions Come at the Cost of Privacy: Evaluations for Binary Classifiers

Authors: Huan Tian, Guangsheng Zhang, Bo Liu, Tianqing Zhu, Ming Ding, Wanlei Zhou

Abstract: While in-processing fairness approaches show promise in mitigating biased predictions, their potential impact on privacy leakage remains under-explored. We aim to address this gap by assessing the privacy risks of fairness-enhanced binary classifiers via membership inference attacks (MIAs) and attribute inference attacks (AIAs). Surprisingly, our results reveal that enhancing fairness does not nec… ▽ More While in-processing fairness approaches show promise in mitigating biased predictions, their potential impact on privacy leakage remains under-explored. We aim to address this gap by assessing the privacy risks of fairness-enhanced binary classifiers via membership inference attacks (MIAs) and attribute inference attacks (AIAs). Surprisingly, our results reveal that enhancing fairness does not necessarily lead to privacy compromises. For example, these fairness interventions exhibit increased resilience against MIAs and AIAs. This is because fairness interventions tend to remove sensitive information among extracted features and reduce confidence scores for the majority of training data for fairer predictions. However, during the evaluations, we uncover a potential threat mechanism that exploits prediction discrepancies between fair and biased models, leading to advanced attack results for both MIAs and AIAs. This mechanism reveals potent vulnerabilities of fair models and poses significant privacy risks of current fairness methods. Extensive experiments across multiple datasets, attack methods, and representative fairness approaches confirm our findings and demonstrate the efficacy of the uncovered mechanism. Our study exposes the under-explored privacy threats in fairness studies, advocating for thorough evaluations of potential security vulnerabilities before model deployments. △ Less

Submitted 11 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

Comments: Accepted to IEEE Transactions on Dependable and Secure Computing (TDSC)

arXiv:2503.05447 [pdf, other]

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Authors: Weigao Sun, Disen Lan, Tong Zhu, Xiaoye Qu, Yu Cheng

Abstract: Linear Sequence Modeling (LSM) like linear attention, state space models and linear RNNs, and Mixture-of-Experts (MoE) have recently emerged as significant architectural improvements. In this paper, we introduce Linear-MoE, a production-level system for modeling and training large-scale models that integrate LSM with MoE. Linear-MoE leverages the advantages of both LSM modules for linear-complexit… ▽ More Linear Sequence Modeling (LSM) like linear attention, state space models and linear RNNs, and Mixture-of-Experts (MoE) have recently emerged as significant architectural improvements. In this paper, we introduce Linear-MoE, a production-level system for modeling and training large-scale models that integrate LSM with MoE. Linear-MoE leverages the advantages of both LSM modules for linear-complexity sequence modeling and MoE layers for sparsely activation, aiming to offer high performance with efficient training. The Linear-MoE system comprises: 1) Modeling subsystem, which provides a unified framework supporting all instances of LSM. and 2) Training subsystem, which facilitates efficient training by incorporating various advanced parallelism technologies, particularly Sequence Parallelism designed for Linear-MoE models. Additionally, we explore hybrid models that combine Linear-MoE layers with standard Transformer-MoE layers with its Sequence Parallelism to further enhance model flexibility and performance. Evaluations on two model series, A0.3B-2B and A1B-7B, demonstrate Linear-MoE achieves efficiency gains while maintaining competitive performance on various benchmarks, showcasing its potential as a next-generation foundational model architecture. Code: https://github.com/OpenSparseLLMs/Linear-MoE. △ Less

Submitted 15 April, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

Comments: Technical report, 17 pages

arXiv:2503.05382 [pdf, other]

Measurement of the branching fractions of $D^+ \to K^+K^-π^+π^+π^-$, $φπ^+π^+π^-$, $K^0_SK^+π^+π^-π^0$, $K^0_SK^+η$, and $K^0_SK^+ω$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (693 additional authors not shown)

Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector operating at the BEPCII collider, the branching fractions of three hadronic charm meson decays, $D^+\to φπ^+π^+π^-$, $D^+\to K^0_SK^+π^+π^-π^0$, and $D^+\to K^0_SK^+ω$, are measured for the first time to be $(0.54\pm0.19\pm0.02)\times 10^{-4}$,… ▽ More Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector operating at the BEPCII collider, the branching fractions of three hadronic charm meson decays, $D^+\to φπ^+π^+π^-$, $D^+\to K^0_SK^+π^+π^-π^0$, and $D^+\to K^0_SK^+ω$, are measured for the first time to be $(0.54\pm0.19\pm0.02)\times 10^{-4}$, $(2.51\pm0.34\pm0.14)\times 10^{-4}$, and $(2.02\pm0.35\pm0.10)\times 10^{-4}$, respectively. Futhermore, the branching fractions of $D^+\to K^+K^-π^+π^+π^-$ and $D^+\to K^0_SK^+η$ are measured with improved precision, yielding values of $(0.66\pm0.11\pm0.03)\times 10^{-4}$ and $(2.27\pm0.22\pm0.05)\times 10^{-4}$, respectively. △ Less

Submitted 7 March, 2025; originally announced March 2025.

Comments: 11 pages, 3 figures

Report number: BAM-00841

arXiv:2503.04481 [pdf, other]

Innovating Bolometers' Mounting: A Gravity-Based Approach

Authors: The CUPID Collaboration, K. Alfonso, A. Armatol, C. Augier, F. T. Avignone III, O. Azzolini, A. S. Barabash, G. Bari, A. Barresi, D. Baudin, F. Bellini, G. Benato, L. Benussi, V. Berest, M. Beretta, M. Bettelli, M. Biassoni, J. Billard, F. Boffelli, V. Boldrini, E. D. Brandani, C. Brofferio, C. Bucci, M. Buchynska, J. Camilleri , et al. (168 additional authors not shown)

Abstract: Cryogenic calorimeters, also known as bolometers, are among the leading technologies for searching for rare events. The CUPID experiment is exploiting this technology to deploy a tonne-scale detector to search for neutrinoless double-beta decay of $^{100}$Mo. The CUPID collaboration proposed an innovative approach to assembling bolometers in a stacked configuration, held in position solely by grav… ▽ More Cryogenic calorimeters, also known as bolometers, are among the leading technologies for searching for rare events. The CUPID experiment is exploiting this technology to deploy a tonne-scale detector to search for neutrinoless double-beta decay of $^{100}$Mo. The CUPID collaboration proposed an innovative approach to assembling bolometers in a stacked configuration, held in position solely by gravity. This gravity-based assembly method is unprecedented in the field of bolometers and offers several advantages, including relaxed mechanical tolerances and simplified construction. To assess and optimize its performance, we constructed a medium-scale prototype hosting 28 Li$_2$MoO$_4$ crystals and 30 Ge light detectors, both operated as cryogenic calorimeters at the Laboratori Nazionali del Gran Sasso (Italy). Despite an unexpected excess of noise in the light detectors, the results of this test proved (i) a thermal stability better than $\pm$0.5 mK at 10 mK, (ii) a good energy resolution of Li$_2$MoO$_4$ bolometers, (6.6 $\pm$ 2.2) keV FWHM at 2615 keV, and (iii) a Li$_2$MoO$_4$ light yield measured by the closest light detector of 0.36 keV/MeV, sufficient to guarantee the particle identification requested by CUPID. △ Less

Submitted 6 March, 2025; originally announced March 2025.

Showing 51–100 of 1,239 results for author: Zhu, T