-
OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Authors:
Run Luo,
Ting-En Lin,
Haonan Zhang,
Yuchuan Wu,
Xiong Liu,
Min Yang,
Yongbin Li,
Longze Chen,
Jiaming Li,
Lei Zhang,
Yangyi Chen,
Xiaobo Xia,
Hamid Alinejad-Rokny,
Fei Huang
Abstract:
Recent advancements in omnimodal learning have significantly improved understanding and generation across images, text, and speech, yet these developments remain predominantly confined to proprietary models. The lack of high-quality omnimodal datasets and the challenges of real-time emotional speech synthesis have notably hindered progress in open-source research. To address these limitations, we…
▽ More
Recent advancements in omnimodal learning have significantly improved understanding and generation across images, text, and speech, yet these developments remain predominantly confined to proprietary models. The lack of high-quality omnimodal datasets and the challenges of real-time emotional speech synthesis have notably hindered progress in open-source research. To address these limitations, we introduce \name, a two-stage training framework that integrates omnimodal alignment and speech generation to develop a state-of-the-art omnimodal large language model. In the alignment phase, a pre-trained speech model undergoes further training on text-image tasks, enabling (near) zero-shot generalization from vision to speech, outperforming models trained on tri-modal datasets. In the speech generation phase, a lightweight decoder is trained on speech tasks with direct preference optimization, enabling real-time emotional speech synthesis with high fidelity. Experiments show that \name surpasses state-of-the-art models across omnimodal, vision-language, and speech-language benchmarks. It achieves a 4-point absolute improvement on OmniBench over the leading open-source model VITA, despite using 5x fewer training samples and a smaller model size (7B vs. 7x8B). Additionally, \name achieves real-time speech generation with <1s latency at non-autoregressive mode, reducing inference time by 5x compared to autoregressive methods, and improves emotion classification accuracy by 7.7\%
△ Less
Submitted 24 May, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Observation of the $W$-annihilation process $D_s^+ \to ωρ^+$ and measurement of $D_s^+ \to φρ^+$ in $D^+_s\to π^+π^+π^-π^0π^0$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching f…
▽ More
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching fraction of $(0.99\pm0.08_{\rm stat}{\ ^{+0.05}_{-0.07}}_{\rm syst})\%$. %The absolute branching fraction is measured to be $(0.99\pm0.08_{\rm stat}\pm0.07_{\rm syst})\%$. In comparison to the low significance of the $\mathcal{D}$ wave in the decay $D_s^+ \to φρ^+$, the dominance of the $\mathcal{D}$ wave over the $\mathcal{S}$ and $\mathcal{P}$ waves, with a fraction of $(51.85\pm7.28_{\rm stat}{\ ^{+4.83}_{-7.90}}_{\rm syst})\%$ observed in the decay $D_s^+ \to ωρ^+$, provides crucial information for the``polarization puzzle", as well as for the understanding of charm meson decays. The branching fraction of $D^+_s\to π^+π^+π^-π^0π^0$ is measured to be ($4.41\pm0.15_{\rm stat}\pm0.13_{\rm syst}$)\%. Moreover, the branching fraction of $D_s^+ \to φρ^+$ is measured to be $(3.98\pm0.33_{\rm stat}{\ ^{+0.21}_{-0.19}}_{\rm syst})\%$, and the $R_φ= {\mathcal{B}(φ\toπ^+π^-π^0)}/{\mathcal{B}(φ\to K^+K^-)}$ is determined to be $(0.222\pm0.019_{\rm stat}{\ ^{+0.016}_{-0.016}}_{\rm syst}$), which is consistent with the previous measurement based on charm meson decays, but deviates from the results from $e^+e^-$ annihilation and $K$-$N$ scattering experiments by more than 3$σ$.
△ Less
Submitted 23 May, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Study of the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
We report the first measurement of the di-electron invariant mass dependent transition form factor in the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the BESIII detector. A clear $ρ-ω$ interference structure is observed, consistent with the pion form factor, which offers a novel approach to extract the hadronic vacuum polarization c…
▽ More
We report the first measurement of the di-electron invariant mass dependent transition form factor in the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the BESIII detector. A clear $ρ-ω$ interference structure is observed, consistent with the pion form factor, which offers a novel approach to extract the hadronic vacuum polarization contribution to the anomalous muon magnetic moment ($a_μ$) and refine the predictions of the Vector Meson Dominance (VMD) model and hadronic light-by-light contribution to $a_μ$. By taking into account the contribution of this $ρ-ω$ interference structure, the branching fraction of $J/ψ\to e^+e^- π^0$ in the full $e^+e^-$ invariant mass range is also measured for the first time to be $(8.06 \pm 0.31 (\rm{stat}) \pm 0.38 (\rm{syst}))\times 10^{-7}$, approximately twice the non-resonant VMD prediction.
△ Less
Submitted 3 July, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Cosmos World Foundation Model Platform for Physical AI
Authors:
NVIDIA,
:,
Niket Agarwal,
Arslan Ali,
Maciej Bala,
Yogesh Balaji,
Erik Barker,
Tiffany Cai,
Prithvijit Chattopadhyay,
Yongxin Chen,
Yin Cui,
Yifan Ding,
Daniel Dworakowski,
Jiaojiao Fan,
Michele Fenzi,
Francesco Ferroni,
Sanja Fidler,
Dieter Fox,
Songwei Ge,
Yunhao Ge,
Jinwei Gu,
Siddharth Gururani,
Ethan He,
Jiahui Huang,
Jacob Huffman
, et al. (54 additional authors not shown)
Abstract:
Physical AI needs to be trained digitally first. It needs a digital twin of itself, the policy model, and a digital twin of the world, the world model. In this paper, we present the Cosmos World Foundation Model Platform to help developers build customized world models for their Physical AI setups. We position a world foundation model as a general-purpose world model that can be fine-tuned into cu…
▽ More
Physical AI needs to be trained digitally first. It needs a digital twin of itself, the policy model, and a digital twin of the world, the world model. In this paper, we present the Cosmos World Foundation Model Platform to help developers build customized world models for their Physical AI setups. We position a world foundation model as a general-purpose world model that can be fine-tuned into customized world models for downstream applications. Our platform covers a video curation pipeline, pre-trained world foundation models, examples of post-training of pre-trained world foundation models, and video tokenizers. To help Physical AI builders solve the most critical problems of our society, we make Cosmos open-source and our models open-weight with permissive licenses available via https://github.com/nvidia-cosmos/cosmos-predict1.
△ Less
Submitted 18 March, 2025; v1 submitted 7 January, 2025;
originally announced January 2025.
-
ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models
Authors:
Wenxuan Li,
Pedro R. A. S. Bassi,
Tianyu Lin,
Yu-Cheng Chou,
Xinze Zhou,
Yucheng Tang,
Fabian Isensee,
Kang Wang,
Qi Chen,
Xiaowei Xu,
Xiaoxi Chen,
Lizhou Wu,
Qilong Wu,
Yannick Kirchhoff,
Maximilian Rokuss,
Saikat Roy,
Yuxuan Zhao,
Dexin Yu,
Kai Ding,
Constantin Ulrich,
Klaus Maier-Hein,
Yang Yang,
Alan L. Yuille,
Zongwei Zhou
Abstract:
Building trusted datasets is critical for transparent and responsible Medical AI (MAI) research, but creating even small, high-quality datasets can take years of effort from multidisciplinary teams. This process often delays AI benefits, as human-centric data creation and AI-centric model development are treated as separate, sequential steps. To overcome this, we propose ScaleMAI, an agent of AI-i…
▽ More
Building trusted datasets is critical for transparent and responsible Medical AI (MAI) research, but creating even small, high-quality datasets can take years of effort from multidisciplinary teams. This process often delays AI benefits, as human-centric data creation and AI-centric model development are treated as separate, sequential steps. To overcome this, we propose ScaleMAI, an agent of AI-integrated data curation and annotation, allowing data quality and AI performance to improve in a self-reinforcing cycle and reducing development time from years to months. We adopt pancreatic tumor detection as an example. First, ScaleMAI progressively creates a dataset of 25,362 CT scans, including per-voxel annotations for benign/malignant tumors and 24 anatomical structures. Second, through progressive human-in-the-loop iterations, ScaleMAI provides Flagship AI Model that can approach the proficiency of expert annotators (30-year experience) in detecting pancreatic tumors. Flagship Model significantly outperforms models developed from smaller, fixed-quality datasets, with substantial gains in tumor detection (+14%), segmentation (+5%), and classification (72%) on three prestigious benchmarks. In summary, ScaleMAI transforms the speed, scale, and reliability of medical dataset creation, paving the way for a variety of impactful, data-driven applications.
△ Less
Submitted 6 January, 2025;
originally announced January 2025.
-
Observation of $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where th…
▽ More
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where the first uncertainty is statistical and the second systematic.
△ Less
Submitted 5 January, 2025;
originally announced January 2025.
-
Search for $η_c(2S)\to p\bar{p}K^+K^-$ and measurement of $χ_{cJ}\to p\bar{p}K^+K^-$ in $ψ(3686)$ radiative decays
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (639 additional authors not shown)
Abstract:
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a signific…
▽ More
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a significance of $3.3σ$. The product branching fraction of $\mathcal{B}[ψ(3686)\toγη_c(2S)]\cdot\mathcal{B}[η_c(2S)\to p\bar{p}K^+K^-]$ is determined to be $(1.98\mkern 2mu\pm\mkern 2mu0.41_{\text{stat.}}\mkern 2mu\pm\mkern 2mu0.99_{\text{syst.}})\times 10^{-7}$. The product branching fractions of $\mathcal{B}[ψ(3686)\toγχ_{cJ}]\cdot\mathcal{B}[χ_{cJ}\to p\bar{p}K^+K^-]$ are measured to be $(2.49\mkern 2mu\pm\mkern 2mu 0.03_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, $(1.83\mkern 2mu \pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu \pm\mkern 2mu 0.11_{\text{syst.}})\times 10^{-5}$, and $(2.43\mkern 2mu\pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, for $J=0,\ 1$, and 2, respectively.
△ Less
Submitted 3 January, 2025;
originally announced January 2025.
-
Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
D. Agarwal,
M. Agathos,
M. Aghaei Abchouyeh,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
A. Al-Jodah,
C. Alléné
, et al. (1794 additional authors not shown)
Abstract:
Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent ana…
▽ More
Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent analysis methods considering the single-harmonic and the dual-harmonic emission models. We find no evidence of a CW signal in O4a data for both models and set upper limits on the signal amplitude and on the ellipticity, which quantifies the asymmetry in the neutron star mass distribution. For the single-harmonic emission model, 29 targets have the upper limit on the amplitude below the theoretical spin-down limit. The lowest upper limit on the amplitude is $6.4\!\times\!10^{-27}$ for the young energetic pulsar J0537-6910, while the lowest constraint on the ellipticity is $8.8\!\times\!10^{-9}$ for the bright nearby millisecond pulsar J0437-4715. Additionally, for a subset of 16 targets we performed a narrowband search that is more robust regarding the emission model, with no evidence of a signal. We also found no evidence of non-standard polarizations as predicted by the Brans-Dicke theory.
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
Deep UV Silicon Polaritonic Metasurfaces for Enhancing Biomolecule Autofluorescence and Two-Dimensional Material Double-Resonance Raman Scattering
Authors:
Bo-Ray Lee,
Mao Feng Chiang,
Pei Ying Ho,
Kuan-Heng Chen,
Jia-Hua Lee,
Po Hsiang Hsu,
Yu Chieh Peng,
Jun-Yi Hou,
Shih-Chieh Chen,
Qian-Yo Lee,
Chun-Hao Chang,
Bor-Ran Li,
Tzu-En Lin,
Chieh-Ting Lin,
Min-Hsiung Shih,
Der-Hsien Lien,
Yu-Chuan Lin,
Ray-Hua Horng,
Yuri Kivshar,
Ming Lun Tseng
Abstract:
High-performance DUV spectroscopy drives advancements in biomedical research, clinical diagnosis, and material science. Existing DUV resonant nanostructures face instability and photoluminescent noise challenges. We propose robust Si metasurfaces leveraging polaritonic resonances, a unique property driven by interband transitions, for enhanced nanophotonic sensing. Our polaritonic Kerker-type void…
▽ More
High-performance DUV spectroscopy drives advancements in biomedical research, clinical diagnosis, and material science. Existing DUV resonant nanostructures face instability and photoluminescent noise challenges. We propose robust Si metasurfaces leveraging polaritonic resonances, a unique property driven by interband transitions, for enhanced nanophotonic sensing. Our polaritonic Kerker-type void metasurface enables double-resonance Raman scattering to analyze 2D semiconductors, improves biomolecule autofluorescence, and offers superior stability. This scalable platform unlocks versatile applications in interdisciplinary DUV spectroscopy and emerging nanomaterials research.
△ Less
Submitted 1 January, 2025;
originally announced January 2025.
-
DPBridge: Latent Diffusion Bridge for Dense Prediction
Authors:
Haorui Ji,
Taojun Lin,
Hongdong Li
Abstract:
Diffusion models demonstrate remarkable capabilities in capturing complex data distributions and have achieved compelling results in many generative tasks. While they have recently been extended to dense prediction tasks such as depth estimation and surface normal prediction, their full potential in this area remains under-explored. In dense prediction settings, target signal maps and input images…
▽ More
Diffusion models demonstrate remarkable capabilities in capturing complex data distributions and have achieved compelling results in many generative tasks. While they have recently been extended to dense prediction tasks such as depth estimation and surface normal prediction, their full potential in this area remains under-explored. In dense prediction settings, target signal maps and input images are pixel-wise aligned. This makes conventional noise-to-data generation paradigm inefficient, as input images can serve as more informative prior compared to pure noise. Diffusion bridge models, which support data-to-data generation between two general data distributions, offer a promising alternative, but they typically fail to exploit the rich visual priors embedded in large pretrained foundation models. To address these limitations, we integrate diffusion bridge formulation with structured visual priors and introduce DPBridge, the first latent diffusion bridge framework for dense prediction tasks. Our method presents three key contributions: (1) a tractable reverse transition kernel for diffusion bridge process, enabling maximum likelihood training scheme for better compatibility with pretrained backbones; (2) a distribution-aligned normalization technique to mitigate the discrepancies between the bridge and standard diffusion processes; and (3) an auxiliary image consistency loss to preserve fine-grained details. Experiments across extensive benchmarks validate that our method consistently achieves superior performance, demonstrating its effectiveness and generalization capability under different scenarios.
△ Less
Submitted 19 May, 2025; v1 submitted 29 December, 2024;
originally announced December 2024.
-
JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling
Authors:
Haorui Ji,
Rong Wang,
Taojun Lin,
Hongdong Li
Abstract:
Generative modeling of 3D human bodies have been studied extensively in computer vision. The core is to design a compact latent representation that is both expressive and semantically interpretable, yet existing approaches struggle to achieve both requirements. In this work, we introduce JADE, a generative framework that learns the variations of human shapes with fined-grained control. Our key ins…
▽ More
Generative modeling of 3D human bodies have been studied extensively in computer vision. The core is to design a compact latent representation that is both expressive and semantically interpretable, yet existing approaches struggle to achieve both requirements. In this work, we introduce JADE, a generative framework that learns the variations of human shapes with fined-grained control. Our key insight is a joint-aware latent representation that decomposes human bodies into skeleton structures, modeled by joint positions, and local surface geometries, characterized by features attached to each joint. This disentangled latent space design enables geometric and semantic interpretation, facilitating users with flexible controllability. To generate coherent and plausible human shapes under our proposed decomposition, we also present a cascaded pipeline where two diffusions are employed to model the distribution of skeleton structures and local surface geometries respectively. Extensive experiments are conducted on public datasets, where we demonstrate the effectiveness of JADE framework in multiple tasks in terms of autoencoding reconstruction accuracy, editing controllability and generation quality compared with existing methods.
△ Less
Submitted 29 December, 2024;
originally announced December 2024.
-
Measurement of Born cross section of $e^+e^-\toΣ^0\barΣ^0$ at $\sqrt{s} = 3.50-4.95$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (649 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$,…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $ψ(4230)$, $ψ(4360)$, $ψ(4415)$, or $ψ(4660)$, decaying into the $Σ^0\barΣ^0$ final state is observed by fitting the $e^+e^- \to Σ^0\barΣ^0$ dressed cross section. The upper limits for the product of the branching fraction and the electronic partial width at the 90% confidence level are provided for each assumed charmonium(-like) state. In addition, the ratios of the Born cross section and the effective form factor between the $e^+e^-\toΣ^0\barΣ^0$ and the $e^+e^-\toΣ^+\barΣ^-$ reactions are provided, which can be used to validate the prediction of the vector meson dominance model.
△ Less
Submitted 14 March, 2025; v1 submitted 28 December, 2024;
originally announced December 2024.
-
Search for the double Dalitz decays $η/η' \to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions o…
▽ More
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions of $η' \to e^{+}e^{-}μ^{+}μ^{-}$ and $η' \to μ^{+}μ^{-}μ^{+}μ^{-}$ to be $ 1.75 \times {10^{-6}}$ and $5.28 \times {10^{-7}}$, respectively. In addition, we set an upper limit on the branching fraction of $η\to e^{+}e^{-}μ^{+}μ^{-}$ to be $6.88 \times {10^{-6}}$, which improves the previous result by about two orders of magnitude.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework
Authors:
Jiang Liu,
Bolin Li,
Haoyuan Li,
Tianwei Lin,
Wenqiao Zhang,
Tao Zhong,
Zhelun Yu,
Jinghao Wei,
Hao Cheng,
Wanggui He,
Fangxun Shu,
Hao Jiang,
Zheqi Lv,
Juncheng Li,
Siliang Tang,
Yueting Zhuang
Abstract:
Efficient multimodal large language models (EMLLMs), in contrast to multimodal large language models (MLLMs), reduce model size and computational costs and are often deployed on resource-constrained devices. However, due to data privacy concerns, existing open-source EMLLMs rarely have access to private domain-specific data during the pre-training process, making them difficult to directly apply i…
▽ More
Efficient multimodal large language models (EMLLMs), in contrast to multimodal large language models (MLLMs), reduce model size and computational costs and are often deployed on resource-constrained devices. However, due to data privacy concerns, existing open-source EMLLMs rarely have access to private domain-specific data during the pre-training process, making them difficult to directly apply in device-specific domains, such as certain business scenarios. To address this weakness, this paper focuses on the efficient adaptation of EMLLMs to private domains, specifically in two areas: 1) how to reduce data requirements, and 2) how to avoid parameter fine-tuning. Specifically, we propose a tun\textbf{\underline{I}}ng-free, a\textbf{\underline{D}}aptiv\textbf{\underline{E}}, univers\textbf{\underline{AL}} \textbf{\underline{Prompt}} Optimization Framework, abbreviated as \textit{\textbf{\ourmethod{}}} which consists of two stages: 1) Predefined Prompt, based on the reinforcement searching strategy, generate a prompt optimization strategy tree to acquire optimization priors; 2) Prompt Reflection initializes the prompt based on optimization priors, followed by self-reflection to further search and refine the prompt. By doing so, \ourmethod{} elegantly generates the ``ideal prompts'' for processing private domain-specific data. Note that our method requires no parameter fine-tuning and only a small amount of data to quickly adapt to the data distribution of private data. Extensive experiments across multiple tasks demonstrate that our proposed \ourmethod{} significantly improves both efficiency and performance compared to baselines.
△ Less
Submitted 17 February, 2025; v1 submitted 27 December, 2024;
originally announced December 2024.
-
MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes
Authors:
Asma Ben Abacha,
Wen-wai Yim,
Yujuan Fu,
Zhaoyi Sun,
Meliha Yetisgen,
Fei Xia,
Thomas Lin
Abstract:
Several studies showed that Large Language Models (LLMs) can answer medical questions correctly, even outperforming the average human score in some medical exams. However, to our knowledge, no study has been conducted to assess the ability of language models to validate existing or generated medical text for correctness and consistency. In this paper, we introduce MEDEC (https://github.com/abachaa…
▽ More
Several studies showed that Large Language Models (LLMs) can answer medical questions correctly, even outperforming the average human score in some medical exams. However, to our knowledge, no study has been conducted to assess the ability of language models to validate existing or generated medical text for correctness and consistency. In this paper, we introduce MEDEC (https://github.com/abachaa/MEDEC), the first publicly available benchmark for medical error detection and correction in clinical notes, covering five types of errors (Diagnosis, Management, Treatment, Pharmacotherapy, and Causal Organism). MEDEC consists of 3,848 clinical texts, including 488 clinical notes from three US hospital systems that were not previously seen by any LLM. The dataset has been used for the MEDIQA-CORR shared task to evaluate seventeen participating systems [Ben Abacha et al., 2024]. In this paper, we describe the data creation methods and we evaluate recent LLMs (e.g., o1-preview, GPT-4, Claude 3.5 Sonnet, and Gemini 2.0 Flash) for the tasks of detecting and correcting medical errors requiring both medical knowledge and reasoning capabilities. We also conducted a comparative study where two medical doctors performed the same task on the MEDEC test set. The results showed that MEDEC is a sufficiently challenging benchmark to assess the ability of models to validate existing or generated notes and to correct medical errors. We also found that although recent LLMs have a good performance in error detection and correction, they are still outperformed by medical doctors in these tasks. We discuss the potential factors behind this gap, the insights from our experiments, the limitations of current evaluation metrics, and share potential pointers for future research.
△ Less
Submitted 2 January, 2025; v1 submitted 26 December, 2024;
originally announced December 2024.
-
tubGEMM: Energy-Efficient and Sparsity-Effective Temporal-Unary-Binary Based Matrix Multiply Unit
Authors:
Prabhu Vellaisamy,
Harideep Nair,
Joseph Finn,
Manav Trivedi,
Albert Chen,
Anna Li,
Tsung-Han Lin,
Perry Wang,
Shawn Blanton,
John Paul Shen
Abstract:
General Matrix Multiplication (GEMM) is a ubiquitous compute kernel in deep learning (DL). To support energy-efficient edge-native processing, new GEMM hardware units have been proposed that operate on unary encoded bitstreams using much simpler hardware. Most unary approaches thus far focus on rate-based unary encoding of values and perform stochastic approximate computation. This work presents t…
▽ More
General Matrix Multiplication (GEMM) is a ubiquitous compute kernel in deep learning (DL). To support energy-efficient edge-native processing, new GEMM hardware units have been proposed that operate on unary encoded bitstreams using much simpler hardware. Most unary approaches thus far focus on rate-based unary encoding of values and perform stochastic approximate computation. This work presents tubGEMM, a novel matrix-multiply unit design that employs hybrid temporal-unary and binary (tub) encoding and performs exact (not approximate) GEMM. It intrinsically exploits dynamic value sparsity to improve energy efficiency. Compared to the current best unary design uGEMM, tubGEMM significantly reduces area, power, and energy by 89\%, 87\%, and 50\%, respectively. A tubGEMM design performing 128x128 matrix multiply on 8-bit integers, in commercial TSMC N5 (5nm) process node, consumes just 0.22 mm^2 die area, 417.72 mW power, and 8.86 uJ energy, assuming no sparsity. Typical sparsity in DL workloads (MobileNetv2, ResNet-50) reduces energy by more than 3x, and lowering precision to 4 and 2 bits further reduces it by 24x and 104x respectively.
△ Less
Submitted 23 December, 2024;
originally announced December 2024.
-
From Histopathology Images to Cell Clouds: Learning Slide Representations with Hierarchical Cell Transformer
Authors:
Zijiang Yang,
Zhongwei Qiu,
Tiancheng Lin,
Hanqing Chao,
Wanxing Chang,
Yelin Yang,
Yunshuo Zhang,
Wenpei Jiao,
Yixuan Shen,
Wenbin Liu,
Dongmei Fu,
Dakai Jin,
Ke Yan,
Le Lu,
Hui Jiang,
Yun Bian
Abstract:
It is clinically crucial and potentially very beneficial to be able to analyze and model directly the spatial distributions of cells in histopathology whole slide images (WSI). However, most existing WSI datasets lack cell-level annotations, owing to the extremely high cost over giga-pixel images. Thus, it remains an open question whether deep learning models can directly and effectively analyze W…
▽ More
It is clinically crucial and potentially very beneficial to be able to analyze and model directly the spatial distributions of cells in histopathology whole slide images (WSI). However, most existing WSI datasets lack cell-level annotations, owing to the extremely high cost over giga-pixel images. Thus, it remains an open question whether deep learning models can directly and effectively analyze WSIs from the semantic aspect of cell distributions. In this work, we construct a large-scale WSI dataset with more than 5 billion cell-level annotations, termed WSI-Cell5B, and a novel hierarchical Cell Cloud Transformer (CCFormer) to tackle these challenges. WSI-Cell5B is based on 6,998 WSIs of 11 cancers from The Cancer Genome Atlas Program, and all WSIs are annotated per cell by coordinates and types. To the best of our knowledge, WSI-Cell5B is the first WSI-level large-scale dataset integrating cell-level annotations. On the other hand, CCFormer formulates the collection of cells in each WSI as a cell cloud and models cell spatial distribution. Specifically, Neighboring Information Embedding (NIE) is proposed to characterize the distribution of cells within the neighborhood of each cell, and a novel Hierarchical Spatial Perception (HSP) module is proposed to learn the spatial relationship among cells in a bottom-up manner. The clinical analysis indicates that WSI-Cell5B can be used to design clinical evaluation metrics based on counting cells that effectively assess the survival risk of patients. Extensive experiments on survival prediction and cancer staging show that learning from cell spatial distribution alone can already achieve state-of-the-art (SOTA) performance, i.e., CCFormer strongly outperforms other competing methods.
△ Less
Submitted 21 December, 2024;
originally announced December 2024.
-
From Pixels to Gigapixels: Bridging Local Inductive Bias and Long-Range Dependencies with Pixel-Mamba
Authors:
Zhongwei Qiu,
Hanqing Chao,
Tiancheng Lin,
Wanxing Chang,
Zijiang Yang,
Wenpei Jiao,
Yixuan Shen,
Yunshuo Zhang,
Yelin Yang,
Wenbin Liu,
Hui Jiang,
Yun Bian,
Ke Yan,
Dakai Jin,
Le Lu
Abstract:
Histopathology plays a critical role in medical diagnostics, with whole slide images (WSIs) offering valuable insights that directly influence clinical decision-making. However, the large size and complexity of WSIs may pose significant challenges for deep learning models, in both computational efficiency and effective representation learning. In this work, we introduce Pixel-Mamba, a novel deep l…
▽ More
Histopathology plays a critical role in medical diagnostics, with whole slide images (WSIs) offering valuable insights that directly influence clinical decision-making. However, the large size and complexity of WSIs may pose significant challenges for deep learning models, in both computational efficiency and effective representation learning. In this work, we introduce Pixel-Mamba, a novel deep learning architecture designed to efficiently handle gigapixel WSIs. Pixel-Mamba leverages the Mamba module, a state-space model (SSM) with linear memory complexity, and incorporates local inductive biases through progressively expanding tokens, akin to convolutional neural networks. This enables Pixel-Mamba to hierarchically combine both local and global information while efficiently addressing computational challenges. Remarkably, Pixel-Mamba achieves or even surpasses the quantitative performance of state-of-the-art (SOTA) foundation models that were pretrained on millions of WSIs or WSI-text pairs, in a range of tumor staging and survival analysis tasks, {\bf even without requiring any pathology-specific pretraining}. Extensive experiments demonstrate the efficacy of Pixel-Mamba as a powerful and efficient framework for end-to-end WSI analysis.
△ Less
Submitted 21 December, 2024;
originally announced December 2024.
-
Measurement of $CP$ asymmetry in $B_s^0 \to D_s^{\mp} K^{\pm}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1116 additional authors not shown)
Abstract:
A measurement of the $CP$-violating parameters in $B_s^0 \to D_s^{\mp} K^{\pm}$ decays is reported, based on the analysis of proton-proton collision data collected by the LHCb experiment corresponding to an integrated luminosity of $6\,\mathrm{fb}^{-1}$ at a centre-of-mass energy of $13 \,\mathrm{TeV}$. The measured parameters are $C_f = 0.791 \pm 0.061 \pm 0.022$,…
▽ More
A measurement of the $CP$-violating parameters in $B_s^0 \to D_s^{\mp} K^{\pm}$ decays is reported, based on the analysis of proton-proton collision data collected by the LHCb experiment corresponding to an integrated luminosity of $6\,\mathrm{fb}^{-1}$ at a centre-of-mass energy of $13 \,\mathrm{TeV}$. The measured parameters are $C_f = 0.791 \pm 0.061 \pm 0.022$, $A_f^{ΔΓ} = -0.051 \pm 0.134 \pm 0.058$, $A_{\overline{f}}^{ΔΓ} = -0.303 \pm 0.125 \pm 0.055$, $S_f = -0.571 \pm 0.084 \pm 0.023$ and $S_{\overline{f}} = -0.503 \pm 0.084 \pm 0.025$, where the first uncertainty is statistical and the second systematic. Together with the value of the Bs mixing phase $-2β_s$, these parameters are used to obtain a measurement of the CKM angle $γ$ equal to $ (74\pm12)^\circ$ modulo $180^{\circ}$, where the uncertainty contains both statistical and systematic contributions. This result is combined with the previous LHCb measurement in this channel using $3\,\mathrm{fb}^{-1}$ resulting in a determination of $γ= (81^{+12}_{-11})^\circ$.
△ Less
Submitted 16 April, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Measurement of $CP$ asymmetries in $Λ_b^0\to ph^{-}$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1124 additional authors not shown)
Abstract:
A search for $CP$ violation in $Λ_b^0\rightarrow pK^-$ and $Λ_b^0\rightarrow pπ^-$ decays is presented using the full Run 1 and Run 2 data samples of $pp$ collisions collected with the LHCb detector, corresponding to an integrated luminosity of 9 $\mathrm{fb}^{-1}$ at center-of-mass energies of 7, 8, and 13 TeV. For the Run 2 data sample, the $CP$-violating asymmetries are measured to be…
▽ More
A search for $CP$ violation in $Λ_b^0\rightarrow pK^-$ and $Λ_b^0\rightarrow pπ^-$ decays is presented using the full Run 1 and Run 2 data samples of $pp$ collisions collected with the LHCb detector, corresponding to an integrated luminosity of 9 $\mathrm{fb}^{-1}$ at center-of-mass energies of 7, 8, and 13 TeV. For the Run 2 data sample, the $CP$-violating asymmetries are measured to be $A_{CP}^{pK^-} = (-1.4 \pm 0.7 \pm 0.4)\%$ and $A_{CP}^{pπ^-} = (0.4 \pm 0.9 \pm 0.4)\%$, where the first uncertainty is statistical and the second is systematic. Following significant improvements in the evaluation of systematic uncertainties compared to the previous LHCb measurement, the Run 1 dataset is reanalyzed to update the corresponding results. When combining the Run 2 and updated Run 1 measurements, the final results are found to be $A_{CP}^{pK^-} = (-1.1 \pm 0.7 \pm 0.4)\%$ and $A_{CP}^{pπ^-} = (0.2 \pm 0.8 \pm 0.4)\%$, constituting the most precise measurements of these asymmetries to date.
△ Less
Submitted 8 May, 2025; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Measurement of the Branching Fraction for the Decay $χ_{cJ}\to p\bar{p}ηπ^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$,…
▽ More
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$, ${\cal B}(χ_{c1}\to p\bar{p}ηπ^{0})=({1.95 \pm 0.05 \pm 0.12}) \times 10^{-4}$, and ${\cal B}(χ_{c2}\to p\bar{p}ηπ^{0})=({1.31 \pm 0.05 \pm 0.08}) \times 10^{-4}$, where the first uncertainties are statistical and the second systematic.
△ Less
Submitted 18 December, 2024; v1 submitted 18 December, 2024;
originally announced December 2024.
-
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Authors:
Haoyi Jiang,
Liu Liu,
Tianheng Cheng,
Xinjie Wang,
Tianwei Lin,
Zhizhong Su,
Wenyu Liu,
Xinggang Wang
Abstract:
3D Semantic Occupancy Prediction is fundamental for spatial understanding, yet existing approaches face challenges in scalability and generalization due to their reliance on extensive labeled data and computationally intensive voxel-wise representations. In this paper, we introduce GaussTR, a novel Gaussian-based Transformer framework that unifies sparse 3D modeling with foundation model alignment…
▽ More
3D Semantic Occupancy Prediction is fundamental for spatial understanding, yet existing approaches face challenges in scalability and generalization due to their reliance on extensive labeled data and computationally intensive voxel-wise representations. In this paper, we introduce GaussTR, a novel Gaussian-based Transformer framework that unifies sparse 3D modeling with foundation model alignment through Gaussian representations to advance 3D spatial understanding. GaussTR predicts sparse sets of Gaussians in a feed-forward manner to represent 3D scenes. By splatting the Gaussians into 2D views and aligning the rendered features with foundation models, GaussTR facilitates self-supervised 3D representation learning and enables open-vocabulary semantic occupancy prediction without requiring explicit annotations. Empirical experiments on the Occ3D-nuScenes dataset demonstrate GaussTR's state-of-the-art zero-shot performance of 12.27 mIoU, along with a 40% reduction in training time. These results highlight the efficacy of GaussTR for scalable and holistic 3D spatial understanding, with promising implications in autonomous driving and embodied agents. The code is available at https://github.com/hustvl/GaussTR.
△ Less
Submitted 24 March, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Observation of the charmonium decay $η_c\toγγ$ in $J/ψ\toγη_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (658 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation…
▽ More
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation $(5.34\pm0.16)\times10^{-6}$ from HPQCD in 2023. By using the world-average values of $\mathcal{B}(J/ψ\toγη_c)$ and the total decay width of $η_c$, the partial decay width $Γ(η_c\toγγ)$ is determined to be $(11.30\pm0.56_{\rm{stat.}}\pm0.66_{\rm{syst.}}\pm1.14_{\rm{ref.}})~\rm{keV}$, which deviates from the corresponding world-average value by $3.4σ$.
△ Less
Submitted 2 April, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Study of Satellite Plane Structure Characteristics Based on TNG50 Simulations: A Comparative Analysis from Plane to Non-Plane Structures
Authors:
Hu Caiyu,
Tang Lin
Abstract:
In recent years, multiple plane structures of satellite galaxies have been identified in the nearby universe, although their formation mechanisms remain unclear. In this work, we employ the TNG50-1 numerical simulation to classify satellite systems into plane and non-plane structures, based on their geometric and dynamical properties. We focus on comparing the characteristics of these plane and no…
▽ More
In recent years, multiple plane structures of satellite galaxies have been identified in the nearby universe, although their formation mechanisms remain unclear. In this work, we employ the TNG50-1 numerical simulation to classify satellite systems into plane and non-plane structures, based on their geometric and dynamical properties. We focus on comparing the characteristics of these plane and non-plane structures. The plane structures in TNG50-1 exhibit a mean height of 5.24 kpc, with most of them found in galaxy groups with intermediate halo virial masses within the narrow range of $10^{11.5}$ to $10^{12.5}$ $M_\odot$. Statistical analyses reveal that plane structures of satellite galaxies constitute approximately 11.30% in TNG50-1, with this proportion increasing to 27.11% in TNG100-1, aligning closely with previous observations. Additionally, central galaxies in clusters and groups hosting co-rotating plane structures are intermediate massive and slightly metal-poorer than those in non-plane structures. Significant difference are found between in-plane and out-of-plane satellite galaxies, suggesting that in-plane satellites exhibit slightly longer formation times, and more active interstellar matter cycles. The satellites within these plane structures in TNG50-1 exhibit similar radial distributions with observations, but are fainter and more massive than those in observational plane structures, due to the over- or under-estimation of galaxy properties in simulations. Our analysis also shows that the satellite plane structures might be effected by some low- or high-mass galaxies temporarily entered the plane structures due to the gravitational potential of the clusters and groups after the plane structures had formed.
△ Less
Submitted 16 December, 2024;
originally announced December 2024.
-
CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
Authors:
Yuxuan Sun,
Yixuan Si,
Chenglu Zhu,
Xuan Gong,
Kai Zhang,
Pingyi Chen,
Ye Zhang,
Zhongyi Shui,
Tao Lin,
Lin Yang
Abstract:
The emergence of large multimodal models (LMMs) has brought significant advancements to pathology. Previous research has primarily focused on separately training patch-level and whole-slide image (WSI)-level models, limiting the integration of learned knowledge across patches and WSIs, and resulting in redundant models. In this work, we introduce CPath-Omni, the first 15-billion-parameter LMM desi…
▽ More
The emergence of large multimodal models (LMMs) has brought significant advancements to pathology. Previous research has primarily focused on separately training patch-level and whole-slide image (WSI)-level models, limiting the integration of learned knowledge across patches and WSIs, and resulting in redundant models. In this work, we introduce CPath-Omni, the first 15-billion-parameter LMM designed to unify both patch and WSI level image analysis, consolidating a variety of tasks at both levels, including classification, visual question answering, captioning, and visual referring prompting. Extensive experiments demonstrate that CPath-Omni achieves state-of-the-art (SOTA) performance across seven diverse tasks on 39 out of 42 datasets, outperforming or matching task-specific models trained for individual tasks. Additionally, we develop a specialized pathology CLIP-based visual processor for CPath-Omni, CPath-CLIP, which, for the first time, integrates different vision models and incorporates a large language model as a text encoder to build a more powerful CLIP model, which achieves SOTA performance on nine zero-shot and four few-shot datasets. Our findings highlight CPath-Omni's ability to unify diverse pathology tasks, demonstrating its potential to streamline and advance the field of foundation model in pathology.
△ Less
Submitted 16 December, 2024;
originally announced December 2024.
-
Test of lepton flavour universality with $B^+ \to K^+π^+π^-\ell^+\ell^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1127 additional authors not shown)
Abstract:
The first test of lepton flavor universality between muons and electrons using $B^+ \to K^+π^+π^-\ell^+\ell^-$ ($\ell=e,μ$) decays is presented. The measurement is performed with data from proton-proton collisions collected by the LHCb experiment at center-of-mass energies of 7, 8, and 13 TeV, corresponding to an integrated luminosity of $9\mathrm{fb}^{-1}$. The ratio of branching fractions betwee…
▽ More
The first test of lepton flavor universality between muons and electrons using $B^+ \to K^+π^+π^-\ell^+\ell^-$ ($\ell=e,μ$) decays is presented. The measurement is performed with data from proton-proton collisions collected by the LHCb experiment at center-of-mass energies of 7, 8, and 13 TeV, corresponding to an integrated luminosity of $9\mathrm{fb}^{-1}$. The ratio of branching fractions between $B^+ \to K^+π^+π^-e^+e^-$ and $B^+ \to K^+π^+π^-μ^+μ^-$decays is measured in the dilepton invariant-mass-squared range $1.1 < q^2 < 7.0~\mathrm{GeV}^2/c^4$ and is found to be $R_{Kππ}^{-1} = 1.31^{+0.18}_{-0.17} \;(\mathrm{stat})\;^{+0.12}_{-0.09} \;(\mathrm{syst})$, in agreement with the standard model prediction. The first observation of the $B^+ \to K^+π^+π^-e^+e^-$ decay is also reported.
△ Less
Submitted 12 May, 2025; v1 submitted 16 December, 2024;
originally announced December 2024.
-
Amplitude analysis and branching fraction measurement of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (651 additional authors not shown)
Abstract:
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of…
▽ More
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of $(4.15\pm0.07_{\rm stat.}\pm0.17_{\rm syst.})\%$. With the detection efficiency derived from the amplitude analysis, the absolute branching fraction of $D^+ \to K^-π^+π^+π^0$ is measured to be $(6.06\pm0.04_{\rm stat.}\pm0.07_{\rm syst.})\%$.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.
-
Study of the semileptonic decay $D^0\rightarrow \bar{K}^0π^-e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (650 additional authors not shown)
Abstract:
We report an improved study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-e^+ν_{e}$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of this decay is measured to be…
▽ More
We report an improved study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-e^+ν_{e}$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of this decay is measured to be $\mathcal{B}(D^0\rightarrow \bar{K}^0π^-e^+ν_{e}) = (1.444 \pm 0.022_{\rm stat} \pm 0.024_{\rm syst})\%$, which is the most precise to date, where the first uncertainty is statistical and the second is systematic. Based on investigation of the decay dynamics, we find that the decay is dominated by the $K^{*}(892)^-$ component and present an improved measurement of its branching fraction to be $\mathcal{B}(D^0\rightarrow K^{*}(892)^-e^+ν_e) = (2.039 \pm 0.032_{\rm stat} \pm 0.034_{\rm syst})\%$. We also determine the ratios of the hadronic form factors for the $K^{*}(892)^-e^+ν_e$ decay to be $r_{V} = V(0)/A_1(0) = 1.48 \pm 0.05_{\rm stat} \pm 0.02_{\rm syst}$ and $r_{2} = A_2(0)/A_1(0) = 0.70 \pm 0.04_{\rm stat} \pm 0.02_{\rm syst}$, where $V(0)$ is the vector form factor and $A_{1,2}(0)$ are the axial form factors. In addition, the $\bar{K}^0π^-$ $\mathcal{S}$-wave component is found to account for $(5.87 \pm 0.32_{\rm stat} \pm 0.16_{\rm syst})\%$ of the total decay rate, corresponding to a branching fraction of $\mathcal{B}[D^0\rightarrow (\bar{K}^0π^-)_{S-{\rm wave}}e^+ν_e] = (0.085 \pm 0.005_{\rm stat} \pm 0.003_{\rm syst})\%$.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.
-
Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale
Authors:
Zekun Hao,
David W. Romero,
Tsung-Yi Lin,
Ming-Yu Liu
Abstract:
Meshes are fundamental representations of 3D surfaces. However, creating high-quality meshes is a labor-intensive task that requires significant time and expertise in 3D modeling. While a delicate object often requires over $10^4$ faces to be accurately modeled, recent attempts at generating artist-like meshes are limited to $1.6$K faces and heavy discretization of vertex coordinates. Hence, scali…
▽ More
Meshes are fundamental representations of 3D surfaces. However, creating high-quality meshes is a labor-intensive task that requires significant time and expertise in 3D modeling. While a delicate object often requires over $10^4$ faces to be accurately modeled, recent attempts at generating artist-like meshes are limited to $1.6$K faces and heavy discretization of vertex coordinates. Hence, scaling both the maximum face count and vertex coordinate resolution is crucial to producing high-quality meshes of realistic, complex 3D objects. We present Meshtron, a novel autoregressive mesh generation model able to generate meshes with up to 64K faces at 1024-level coordinate resolution --over an order of magnitude higher face count and $8{\times}$ higher coordinate resolution than current state-of-the-art methods. Meshtron's scalability is driven by four key components: (1) an hourglass neural architecture, (2) truncated sequence training, (3) sliding window inference, (4) a robust sampling strategy that enforces the order of mesh sequences. This results in over $50{\%}$ less training memory, $2.5{\times}$ faster throughput, and better consistency than existing works. Meshtron generates meshes of detailed, complex 3D objects at unprecedented levels of resolution and fidelity, closely resembling those created by professional artists, and opening the door to more realistic generation of detailed 3D assets for animation, gaming, and virtual environments.
△ Less
Submitted 12 December, 2024;
originally announced December 2024.
-
Search for $D^0$ meson decays to $π^+ π^- e^+ e^-$ and $K^+ K^- e^+ e^-$ final states
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1125 additional authors not shown)
Abstract:
A search for $D^0$ meson decays to the $π^+π^-e^+e^-$ and $K^+K^-e^+e^-$ final states is reported using a sample of proton-proton collisions collected by the LHCb experiment at a center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 6 fb$^{-1}$. The decay $D^0 \rightarrow π^+π^-e^+e^-$ is observed for the first time when requiring that the two electrons are consistent with…
▽ More
A search for $D^0$ meson decays to the $π^+π^-e^+e^-$ and $K^+K^-e^+e^-$ final states is reported using a sample of proton-proton collisions collected by the LHCb experiment at a center-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 6 fb$^{-1}$. The decay $D^0 \rightarrow π^+π^-e^+e^-$ is observed for the first time when requiring that the two electrons are consistent with coming from the decay of a $φ$ or $ρ^0/ω$ meson. The corresponding branching fractions are measured relative to the $D^0 \rightarrow K^-π^-[e^+e^-]_{ρ^0/ω}$ decay, where the two electrons are consistent with coming from the decay of a $ρ^0$ or $ω$ meson. No evidence is found for the $D^0 \rightarrow K^+K^-e^+e^-$ decay and world-best limits are set on its branching fraction. The results are compared to, and found to be consistent with, the branching fractions of the $D^0 \rightarrow π^+π^-μ^+μ^-$ and $D^0 \rightarrow K^+K^-μ^+μ^-$ decays recently measured by LHCb and confirm lepton universality at the current precision.
△ Less
Submitted 20 May, 2025; v1 submitted 12 December, 2024;
originally announced December 2024.
-
GMem: A Modular Approach for Ultra-Efficient Generative Models
Authors:
Yi Tang,
Peng Sun,
Zhenglin Cheng,
Tao Lin
Abstract:
Recent studies indicate that the denoising process in deep generative diffusion models implicitly learns and memorizes semantic information from the data distribution. These findings suggest that capturing more complex data distributions requires larger neural networks, leading to a substantial increase in computational demands, which in turn become the primary bottleneck in both training and infe…
▽ More
Recent studies indicate that the denoising process in deep generative diffusion models implicitly learns and memorizes semantic information from the data distribution. These findings suggest that capturing more complex data distributions requires larger neural networks, leading to a substantial increase in computational demands, which in turn become the primary bottleneck in both training and inference of diffusion models. To this end, we introduce GMem: A Modular Approach for Ultra-Efficient Generative Models. Our approach GMem decouples the memory capacity from model and implements it as a separate, immutable memory set that preserves the essential semantic information in the data. The results are significant: GMem enhances both training, sampling efficiency, and diversity generation. This design on one hand reduces the reliance on network for memorize complex data distribution and thus enhancing both training and sampling efficiency. On ImageNet at $256 \times 256$ resolution, GMem achieves a $50\times$ training speedup compared to SiT, reaching FID $=7.66$ in fewer than $28$ epochs ($\sim 4$ hours training time), while SiT requires $1400$ epochs. Without classifier-free guidance, GMem achieves state-of-the-art (SoTA) performance FID $=1.53$ in $160$ epochs with only $\sim 20$ hours of training, outperforming LightningDiT which requires $800$ epochs and $\sim 95$ hours to attain FID $=2.17$.
△ Less
Submitted 11 February, 2025; v1 submitted 11 December, 2024;
originally announced December 2024.
-
Learn How to Query from Unlabeled Data Streams in Federated Learning
Authors:
Yuchang Sun,
Xinran Li,
Tao Lin,
Jun Zhang
Abstract:
Federated learning (FL) enables collaborative learning among decentralized clients while safeguarding the privacy of their local data. Existing studies on FL typically assume offline labeled data available at each client when the training starts. Nevertheless, the training data in practice often arrive at clients in a streaming fashion without ground-truth labels. Given the expensive annotation co…
▽ More
Federated learning (FL) enables collaborative learning among decentralized clients while safeguarding the privacy of their local data. Existing studies on FL typically assume offline labeled data available at each client when the training starts. Nevertheless, the training data in practice often arrive at clients in a streaming fashion without ground-truth labels. Given the expensive annotation cost, it is critical to identify a subset of informative samples for labeling on clients. However, selecting samples locally while accommodating the global training objective presents a challenge unique to FL. In this work, we tackle this conundrum by framing the data querying process in FL as a collaborative decentralized decision-making problem and proposing an effective solution named LeaDQ, which leverages multi-agent reinforcement learning algorithms. In particular, under the implicit guidance from global information, LeaDQ effectively learns the local policies for distributed clients and steers them towards selecting samples that can enhance the global model's accuracy. Extensive simulations on image and text tasks show that LeaDQ advances the model performance in various FL scenarios, outperforming the benchmarking algorithms.
△ Less
Submitted 11 December, 2024; v1 submitted 11 December, 2024;
originally announced December 2024.
-
Vaccination dynamics of age-structured populations in higher-order social networks
Authors:
Yanyi Nie,
Tao Lin,
Yanbing Liu,
Wei Wang
Abstract:
Voluntary vaccination is essential to protect oneself from infection and suppress the spread of infectious diseases. Voluntary vaccination behavior is influenced by factors such as age and interaction patterns. Differences in health consciousness and risk perception based on age result in heterogeneity in vaccination behavior among different age groups. Higher-order interactions among individuals…
▽ More
Voluntary vaccination is essential to protect oneself from infection and suppress the spread of infectious diseases. Voluntary vaccination behavior is influenced by factors such as age and interaction patterns. Differences in health consciousness and risk perception based on age result in heterogeneity in vaccination behavior among different age groups. Higher-order interactions among individuals of various ages facilitate the dissemination of vaccine-related information, further influencing vaccination intentions. To investigate the impact of individual age and interaction patterns on vaccination behavior, we propose an epidemic-game coevolution model in which age structure and higher-order interactions are considered. Combining the theoretical analysis of epidemic-game coevolution, this work calculates the evolutionarily stable strategies and dynamic equilibrium based on imitation dynamics in the well-mixed population. Extensive numerical experiments show that infants and the elderly exhibit conservative attitudes towards vaccination, and the vaccination levels of these two groups have no significant impact on the vaccination behavior of other age groups. The vaccination behavior of children is highly active, while the vaccination behavior of adults depends on the relative cost of vaccination. The increase in vaccination levels among children and adults leads to a decrease in vaccination levels in other groups. Furthermore, the infants exhibit the lowest level of vaccination, while the children have the highest vaccination rate. Higher-order interactions significantly enhance vaccination levels among children and adults.
△ Less
Submitted 10 December, 2024;
originally announced December 2024.
-
Light-induced ultrafast glide-mirror symmetry breaking in black phosphorus
Authors:
Changhua Bao,
Fei Wang,
Haoyuan Zhong,
Shaohua Zhou,
Tianyun Lin,
Hongyun Zhang,
Xuanxi Cai,
Wenhui Duan,
Shuyun Zhou
Abstract:
Symmetry breaking plays an important role in fields of physics, ranging from particle physics to condensed matter physics. In solid-state materials, phase transitions are deeply linked to the underlying symmetry breakings, resulting in a rich variety of emergent phases. Such symmetry breakings are often induced by controlling the chemical composition and temperature or applying an electric field a…
▽ More
Symmetry breaking plays an important role in fields of physics, ranging from particle physics to condensed matter physics. In solid-state materials, phase transitions are deeply linked to the underlying symmetry breakings, resulting in a rich variety of emergent phases. Such symmetry breakings are often induced by controlling the chemical composition and temperature or applying an electric field and strain, etc. In this work, we demonstrate an ultrafast glide-mirror symmetry breaking in black phosphorus through Floquet engineering. Upon near-resonance pumping, a light-induced full gap opening is observed at the glide-mirror symmetry protected nodal ring, suggesting light-induced breaking of the glide-mirror symmetry. Moreover, the full gap is observed only in the presence of the light-field and disappears almost instantaneously ($\ll$100 fs) when the light-field is turned off, suggesting the ultrafast manipulation of the symmetry and its Floquet engineering origin. This work not only demonstrates light-matter interaction as an effective way to realize ultrafast symmetry breaking in solid-state materials, but also moves forward towards the long-sought Floquet topological phases.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Manipulating the symmetry of photon-dressed electronic states
Authors:
Changhua Bao,
Michael Schüler,
Teng Xiao,
Fei Wang,
Haoyuan Zhong,
Tianyun Lin,
Xuanxi Cai,
Tianshuang Sheng,
Xiao Tang,
Hongyun Zhang,
Pu Yu,
Zhiyuan Sun,
Wenhui Duan,
Shuyun Zhou
Abstract:
Strong light-matter interaction provides opportunities for tailoring the physical properties of quantum materials on the ultrafast timescale by forming photon-dressed electronic states, i.e., Floquet-Bloch states. While the light field can in principle imprint its symmetry properties onto the photon-dressed electronic states, so far, how to experimentally detect and further engineer the symmetry o…
▽ More
Strong light-matter interaction provides opportunities for tailoring the physical properties of quantum materials on the ultrafast timescale by forming photon-dressed electronic states, i.e., Floquet-Bloch states. While the light field can in principle imprint its symmetry properties onto the photon-dressed electronic states, so far, how to experimentally detect and further engineer the symmetry of photon-dressed electronic states remains elusive. Here by utilizing time- and angle-resolved photoemission spectroscopy (TrARPES) with polarization-dependent study, we directly visualize the parity symmetry of Floquet-Bloch states in black phosphorus. The photon-dressed sideband exhibits opposite photoemission intensity to the valence band at the $Γ$ point,suggesting a switch of the parity induced by the light field. Moreover, a "hot spot" with strong intensity confined near $Γ$ is observed, indicating a momentum-dependent modulation beyond the parity switch. Combining with theoretical calculations, we reveal the light-induced engineering of the wave function of the Floquet-Bloch states as a result of the hybridization between the conduction and valence bands with opposite parities, and show that the "hot spot" is intrinsically dictated by the symmetry properties of black phosphorus. Our work suggests TrARPES as a direct probe for the parity of the photon-dressed electronic states with energy- and momentum-resolved information, providing an example for engineering the wave function and symmetry of such photon-dressed electronic states via Floquet engineering.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Study of the decay ψ(3686) \to Σ^{0}\barΣ^{0}φ
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
Using $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay $ψ(3686)\toΣ^{0}\barΣ^{0}φ$ is observed for the first time with a statistical significance of 7.6$σ$. Its branching fraction is measured to be $(2.64 \pm 0.32_{\textrm{stat}} \pm 0.12_{\textrm{sys}}) \times 10^{-6}$, where the first uncertainty is statistical and the…
▽ More
Using $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decay $ψ(3686)\toΣ^{0}\barΣ^{0}φ$ is observed for the first time with a statistical significance of 7.6$σ$. Its branching fraction is measured to be $(2.64 \pm 0.32_{\textrm{stat}} \pm 0.12_{\textrm{sys}}) \times 10^{-6}$, where the first uncertainty is statistical and the second is systematic. In addition, we search for potential intermediate states in the $Σ^{0}φ$($\barΣ^{0}φ$) invariant mass distribution and a possible threshold enhancement in the $Σ^{0}\barΣ^{0}$ system, but no conclusive evidence of is observed.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Partial wave analyses of $ψ(3686)\to p\bar{p}π^0$ and $ψ(3686)\to p\bar{p}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
Using a sample of $(2712\pm14)\times10^6$ $ψ(3686)$ events collected with the BESIII detector, we perform partial wave analyses of the decays $ψ(3686)\to p\bar{p}π^0$ and $ψ(3686)\to p\bar{p}η$. The branching fractions of $ψ(3686)\to p\bar{p}π^0$ and $ψ(3686)\to p\bar{p}η$ are determined to be $(133.9\pm11.2\pm2.3)\times10^{-6}$ or $(183.7\pm13.7\pm3.2)\times10^{-6}$ and…
▽ More
Using a sample of $(2712\pm14)\times10^6$ $ψ(3686)$ events collected with the BESIII detector, we perform partial wave analyses of the decays $ψ(3686)\to p\bar{p}π^0$ and $ψ(3686)\to p\bar{p}η$. The branching fractions of $ψ(3686)\to p\bar{p}π^0$ and $ψ(3686)\to p\bar{p}η$ are determined to be $(133.9\pm11.2\pm2.3)\times10^{-6}$ or $(183.7\pm13.7\pm3.2)\times10^{-6}$ and $(61.5\pm6.5\pm1.1)\times10^{-6}$ or $(84.4\pm6.9\pm1.4)\times10^{-6}$, respectively, where the two solutions are caused by an ambiguous phase angle between resonant and continuum processes. Several well-established $N^*$ states are observed in the $pπ^0$ and $pη$ systems, and the corresponding branching fractions are measured. The ratio of decay widths $Γ_{N(1535)\to Nη}/Γ_{N(1535)\to Nπ}$ is determined to be $0.99\pm0.05\pm0.19$.
△ Less
Submitted 19 February, 2025; v1 submitted 9 December, 2024;
originally announced December 2024.
-
Dark Matter Annual Modulation Analysis with Combined Nuclear and Electron Recoil Channels
Authors:
TEXONO Collaboration,
H. B. Li,
M. K. Pandey,
C. H. Leung,
L. Singh,
H. T. Wong,
H. -C. Chi,
M. Deniz,
Greeshma C.,
J. -W. Chen,
H. C. Hsu,
S. Karadag,
S. Karmakar,
V. Kumar,
J. Li,
F. K. Lin,
S. T. Lin,
C. -P. Liu,
S. K. Liu,
H. Ma,
D. K. Mishra,
K. Saraswat,
V. Sharma,
M. K. Singh,
M. K. Singh
, et al. (7 additional authors not shown)
Abstract:
After decades of experimental efforts, the DAMA/LIBRA(DL) annual modulation (AM) analysis on the $χN$ (WIMP Dark Matter interactions on nucleus) channel remains the only one which can be interpreted as positive signatures. This has been refuted by numerous time-integrated (TI) and AM analysis. It has been shown that $χe$ (WIMP interactions with electrons) alone is not compatible with the DL AM dat…
▽ More
After decades of experimental efforts, the DAMA/LIBRA(DL) annual modulation (AM) analysis on the $χN$ (WIMP Dark Matter interactions on nucleus) channel remains the only one which can be interpreted as positive signatures. This has been refuted by numerous time-integrated (TI) and AM analysis. It has been shown that $χe$ (WIMP interactions with electrons) alone is not compatible with the DL AM data. We expand the investigations by performing an AM analysis with the addition of $χe$ long-range and short-range interactions to $χN$, derived using the Frozen Core Approximation method. Two scenarios are considered, where the $χN$ and $χe$ processes are due to a single $χ$ ($Γ^{1 χ}_{tot}$) or two different $χ$'s ($Γ^{2 χ}_{tot}$). The combined fits with $χN$ and $χe$ provide stronger significance to the DL AM data which are compatible with the presence of additional physical effects beyond $χN$ alone. This is the first analysis which explores how $χe$ AM can play a role in DL AM. The revised allowed regions as well as the exclusion contours from the other null AM experiments are presented. All DL AM allowed parameter spaces in $χN$ and $χe$ channels under both $Γ^{1 χ}_{tot}$ and $Γ^{2 χ}_{tot}$ are excluded at the 90\% confidence level by the combined null AM results. It can be projected that DL-allowed parameter spaces from generic models with interactions induced by two-WIMPs are ruled out.
△ Less
Submitted 22 April, 2025; v1 submitted 6 December, 2024;
originally announced December 2024.
-
Confined Magnetization at the Sublattice-Matched Ruthenium Oxide Heterointerface
Authors:
Yiyan Fan,
Qinghua Zhang,
Ting Lin,
He Bai,
Chuanrui Huo,
Qiao Jin,
Tielong Deng,
Songhee Choi,
Shengru Chen,
Haitao Hong,
Ting Cui,
Qianying Wang,
Dongke Rong,
Chen Liu,
Chen Ge,
Tao Zhu,
Lin Gu,
Kuijuan Jin,
Jun Chen,
Er-Jia Guo
Abstract:
Creating a heterostructure by combining two magnetically and structurally distinct ruthenium oxides is a crucial approach for investigating their emergent magnetic states and interactions. Previously, research has predominantly concentrated on the intrinsic properties of the ferromagnet SrRuO3 and recently discovered altermagnet RuO2 solely. Here, we engineered an ultrasharp sublattice-matched het…
▽ More
Creating a heterostructure by combining two magnetically and structurally distinct ruthenium oxides is a crucial approach for investigating their emergent magnetic states and interactions. Previously, research has predominantly concentrated on the intrinsic properties of the ferromagnet SrRuO3 and recently discovered altermagnet RuO2 solely. Here, we engineered an ultrasharp sublattice-matched heterointerface using pseudo-cubic SrRuO3 and rutile RuO2, conducting an in-depth analysis of their spin interactions. Structurally, to accommodate the lattice symmetry mismatch, the inverted RuO2 layer undergoes an in-plane rotation of 18 degrees during epitaxial growth on SrRuO3 layer, resulting in an interesting and rotational interface with perfect crystallinity and negligible chemical intermixing. Performance-wise, the interfacial layer of 6 nm in RuO2 adjacent to SrRuO3 exhibits a nonzero magnetic moment, contributing to an enhanced anomalous Hall effect (AHE) at low temperatures. Furthermore, our observations indicate that, in contrast to SrRuO3 single layers, the AHE of [(RuO2)15/(SrRuO3)n] heterostructures shows nonlinear behavior and reaches its maximum when the SrRuO3 thickness reaches tens of nm. These results suggest that the interfacial magnetic interaction surpasses that of all-perovskite oxides (~5-unit cells). This study underscores the significance and potential applications of magnetic interactions based on the crystallographic asymmetric interfaces in the design of spintronic devices.
△ Less
Submitted 4 December, 2024;
originally announced December 2024.
-
Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis
Authors:
Tao Jun Lin,
Wenqing Wang,
Yujiao Shi,
Akhil Perincherry,
Ankit Vora,
Hongdong Li
Abstract:
This paper presents a novel approach for cross-view synthesis aimed at generating plausible ground-level images from corresponding satellite imagery or vice versa. We refer to these tasks as satellite-to-ground (Sat2Grd) and ground-to-satellite (Grd2Sat) synthesis, respectively. Unlike previous works that typically focus on one-to-one generation, producing a single output image from a single input…
▽ More
This paper presents a novel approach for cross-view synthesis aimed at generating plausible ground-level images from corresponding satellite imagery or vice versa. We refer to these tasks as satellite-to-ground (Sat2Grd) and ground-to-satellite (Grd2Sat) synthesis, respectively. Unlike previous works that typically focus on one-to-one generation, producing a single output image from a single input image, our approach acknowledges the inherent one-to-many nature of the problem. This recognition stems from the challenges posed by differences in illumination, weather conditions, and occlusions between the two views. To effectively model this uncertainty, we leverage recent advancements in diffusion models. Specifically, we exploit random Gaussian noise to represent the diverse possibilities learnt from the target view data. We introduce a Geometry-guided Cross-view Condition (GCC) strategy to establish explicit geometric correspondences between satellite and street-view features. This enables us to resolve the geometry ambiguity introduced by camera pose between image pairs, boosting the performance of cross-view image synthesis. Through extensive quantitative and qualitative analyses on three benchmark cross-view datasets, we demonstrate the superiority of our proposed geometry-guided cross-view condition over baseline methods, including recent state-of-the-art approaches in cross-view image synthesis. Our method generates images of higher quality, fidelity, and diversity than other state-of-the-art approaches.
△ Less
Submitted 4 December, 2024;
originally announced December 2024.
-
Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion
Authors:
Liu Liu,
Xinjie Wang,
Jiaxiong Qiu,
Tianwei Lin,
Xiaolin Zhou,
Zhizhong Su
Abstract:
3D scene reconstruction is a foundational problem in computer vision. Despite recent advancements in Neural Implicit Representations (NIR), existing methods often lack editability and compositional flexibility, limiting their use in scenarios requiring high interactivity and object-level manipulation. In this paper, we introduce the Gaussian Object Carver (GOC), a novel, efficient, and scalable fr…
▽ More
3D scene reconstruction is a foundational problem in computer vision. Despite recent advancements in Neural Implicit Representations (NIR), existing methods often lack editability and compositional flexibility, limiting their use in scenarios requiring high interactivity and object-level manipulation. In this paper, we introduce the Gaussian Object Carver (GOC), a novel, efficient, and scalable framework for object-compositional 3D scene reconstruction. GOC leverages 3D Gaussian Splatting (GS), enriched with monocular geometry priors and multi-view geometry regularization, to achieve high-quality and flexible reconstruction. Furthermore, we propose a zero-shot Object Surface Completion (OSC) model, which uses 3D priors from 3d object data to reconstruct unobserved surfaces, ensuring object completeness even in occluded areas. Experimental results demonstrate that GOC improves reconstruction efficiency and geometric fidelity. It holds promise for advancing the practical application of digital twins in embodied AI, AR/VR, and interactive simulation environments.
△ Less
Submitted 2 December, 2024;
originally announced December 2024.
-
Sample-Efficient Estimation of Nonlinear Quantum State Functions
Authors:
Hongshun Yao,
Yingjian Liu,
Tengxiang Lin,
Xin Wang
Abstract:
Efficient estimation of nonlinear functions of quantum states is crucial for various key tasks in quantum computing, such as entanglement spectroscopy, fidelity estimation, and feature analysis of quantum data. Conventional methods using state tomography and estimating numerous terms of the series expansion are computationally expensive, while alternative approaches based on a purified query oracl…
▽ More
Efficient estimation of nonlinear functions of quantum states is crucial for various key tasks in quantum computing, such as entanglement spectroscopy, fidelity estimation, and feature analysis of quantum data. Conventional methods using state tomography and estimating numerous terms of the series expansion are computationally expensive, while alternative approaches based on a purified query oracle impose practical constraints. In this paper, we introduce the quantum state function (QSF) framework by extending the SWAP test via linear combination of unitaries and parameterized quantum circuits. Our framework enables the implementation of arbitrarily normalized degree-$n$ polynomial functions of quantum states with precision $\varepsilon$ using $\mathcal{O}(n/\varepsilon^2)$ copies. We further apply QSF for developing quantum algorithms for fundamental tasks, including entropy, fidelity, and eigenvalue estimations. Specifically, for estimating von Neumann entropy, quantum relative entropy, and quantum state fidelity, where $κ$ and $γ$ represent the minimal nonzero eigenvalue and normalized factor, respectively, we achieve a sample complexity of $\tilde{\mathcal{O}}(γ^2/(\varepsilon^2κ))$. Our work establishes a concise and unified paradigm for estimating and realizing nonlinear functions of quantum states, paving the way for the practical processing and analysis of quantum data.
△ Less
Submitted 20 June, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model
Authors:
Qianhan Feng,
Wenshuo Li,
Tong Lin,
Xinghao Chen
Abstract:
Vision-Language Models (VLMs) bring powerful understanding and reasoning capabilities to multimodal tasks. Meanwhile, the great need for capable aritificial intelligence on mobile devices also arises, such as the AI assistant software. Some efforts try to migrate VLMs to edge devices to expand their application scope. Simplifying the model structure is a common method, but as the model shrinks, th…
▽ More
Vision-Language Models (VLMs) bring powerful understanding and reasoning capabilities to multimodal tasks. Meanwhile, the great need for capable aritificial intelligence on mobile devices also arises, such as the AI assistant software. Some efforts try to migrate VLMs to edge devices to expand their application scope. Simplifying the model structure is a common method, but as the model shrinks, the trade-off between performance and size becomes more and more difficult. Knowledge distillation (KD) can help models improve comprehensive capabilities without increasing size or data volume. However, most of the existing large model distillation techniques only consider applications on single-modal LLMs, or only use teachers to create new data environments for students. None of these methods take into account the distillation of the most important cross-modal alignment knowledge in VLMs. We propose a method called Align-KD to guide the student model to learn the cross-modal matching that occurs at the shallow layer. The teacher also helps student learn the projection of vision token into text embedding space based on the focus of text. Under the guidance of Align-KD, the 1.7B MobileVLM V2 model can learn rich knowledge from the 7B teacher model with light design of training loss, and achieve an average score improvement of 2.0 across 6 benchmarks under two training subsets respectively. Code is available at: https://github.com/fqhank/Align-KD.
△ Less
Submitted 2 December, 2024;
originally announced December 2024.
-
Observation of the open-charm tetraquark candidate $T_{cs 0}^{*}(2870)^0$ in the $B^- \rightarrow D^- D^0 K_\mathrm{S}^0$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1128 additional authors not shown)
Abstract:
An amplitude analysis of $B^-\rightarrow D^- D^0 K_\mathrm{S}^0$ decays is performed using proton-proton collision data, corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$, collected with the LHCb detector at center-of-mass energies of 7, 8, and 13$\mathrm{\,Te\kern -0.1em V}$. A resonant structure of spin-parity $0^+$ is observed in the $D^0 K_\mathrm{S}^0$ invariant-mass spectrum w…
▽ More
An amplitude analysis of $B^-\rightarrow D^- D^0 K_\mathrm{S}^0$ decays is performed using proton-proton collision data, corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$, collected with the LHCb detector at center-of-mass energies of 7, 8, and 13$\mathrm{\,Te\kern -0.1em V}$. A resonant structure of spin-parity $0^+$ is observed in the $D^0 K_\mathrm{S}^0$ invariant-mass spectrum with a significance of $5.3\,σ$. The mass and width of the state, modeled with a Breit$-$Wigner lineshape, are determined to be $2883\pm11\pm8\mathrm{\,Me\kern -0.1em V\!/}c^2$ and $87_{-47}^{+22}\pm17\mathrm{\,Me\kern -0.1em V}$ respectively, where the first uncertainties are statistical and the second systematic. These properties and the quark content are consistent with those of the open-charm tetraquark candidate $T_{cs 0}^{*}(2870)^0$ observed previously in the $D^+ K^-$ final state of the $B^-\rightarrow D^- D^+ K^-$ decay. This result confirms the existence of the $T_{cs 0}^{*}(2870)^0$ state in a new decay mode. The $T_{cs1}^{*}(2900)^0$ state, reported in the $B^-\rightarrow D^- D^+ K^-$ decay, is also searched for in the $D^0 K_\mathrm{S}^0$ invariant-mass spectrum of the $B^- \rightarrow D^- D^0 K_\mathrm{S}^0$ decay, without finding evidence for it.
△ Less
Submitted 5 April, 2025; v1 submitted 29 November, 2024;
originally announced November 2024.
-
Measurement of the Inclusive Cross Sections of Prompt $J/ψ$ and $ψ(3686)$ Production in $e^{+}e^{-}$ Annihilation from $\sqrt{s}=3.808$ to $4.951$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
The inclusive cross sections of prompt $J/ψ$ and $ψ(3686)$ production are measured at center-of-mass energies from 3.808 to 4.951 GeV. The dataset used is 22 fb$^{-1}$ of $e^{+}e^{-}$ annihilation data collected with the BESIII detector operating at the BEPCII storage ring. The results obtained are in agreement with the previous BESIII measurements of exclusive $J/ψ$ and $ψ(3686)$ production. The…
▽ More
The inclusive cross sections of prompt $J/ψ$ and $ψ(3686)$ production are measured at center-of-mass energies from 3.808 to 4.951 GeV. The dataset used is 22 fb$^{-1}$ of $e^{+}e^{-}$ annihilation data collected with the BESIII detector operating at the BEPCII storage ring. The results obtained are in agreement with the previous BESIII measurements of exclusive $J/ψ$ and $ψ(3686)$ production. The average values obtained for the cross sections measured in the center-of-mass energy ranges from 4.527 to 4.951 GeV for $J/ψ$ and from 4.843 to 4.951 GeV for $ψ(3686)$, where the impact of known resonances is negligible, are $14.0\pm1.7\pm3.1$ pb and $15.3\pm3.0$ pb, respectively. For $J/ψ$, the first and the second uncertainties are statistical and systematic, respectively. For $ψ(3686)$, the uncertainty is total. These values are useful for testing charmonium production models.
△ Less
Submitted 19 February, 2025; v1 submitted 29 November, 2024;
originally announced November 2024.
-
New Limits on Coherent Neutrino Nucleus Elastic Scattering Cross Section at the Kuo-Sheng Reactor Neutrino Laboratory
Authors:
TEXONO Collaboration,
S. Karmakar,
M. K. Singh,
V. Sharma,
H. T. Wong,
Greeshma C.,
H. B. Li,
L. Singh,
M. Agartioglu,
J. H. Chen,
C. I. Chiang,
M. Deniz,
H. C. Hsu,
S. Karadag,
V. Kumar,
C. H. Leung,
J. Li,
F. K. Lin,
S. T. Lin,
S. K. Liu,
H. Ma,
K. Saraswat,
M. K. Singh,
V. Singh,
D. Tanabe
, et al. (4 additional authors not shown)
Abstract:
Neutrino nucleus elastic scattering (νAel) with reactor neutrinos is an interaction under full quantum-mechanical coherence. It has not yet been experimentally observed. We present new results on the studies of νAel cross section with an electro-cooled p-type point-contact germanium detector at the Kuo-Sheng Reactor Neutrino laboratory. A total of (242)357 kg-days of Reactor ON(OFF) data at a dete…
▽ More
Neutrino nucleus elastic scattering (νAel) with reactor neutrinos is an interaction under full quantum-mechanical coherence. It has not yet been experimentally observed. We present new results on the studies of νAel cross section with an electro-cooled p-type point-contact germanium detector at the Kuo-Sheng Reactor Neutrino laboratory. A total of (242)357 kg-days of Reactor ON(OFF) data at a detector threshold of 200 eVee in electron equivalent unit are analyzed. The Lindhard model parametrized by a single variable k which characterizes the quenching function was used. Limits at 90% confidence level are derived on the ratio ρ relative to standard model (SM) cross section of ρ<4.7 at the predicted value of k=0.162, while k<0.285 at the SM-value of ρ=1. Prospects on future positive measurements are discussed.
△ Less
Submitted 7 April, 2025; v1 submitted 27 November, 2024;
originally announced November 2024.
-
GLS: Geometry-aware 3D Language Gaussian Splatting
Authors:
Jiaxiong Qiu,
Liu Liu,
Xinjie Wang,
Tianwei Lin,
Wei Sui,
Zhizhong Su
Abstract:
Recently, 3D Gaussian Splatting (3DGS) has achieved impressive performance on indoor surface reconstruction and 3D open-vocabulary segmentation. This paper presents GLS, a unified framework of 3D surface reconstruction and open-vocabulary segmentation based on 3DGS. GLS extends two fields by improving their sharpness and smoothness. For indoor surface reconstruction, we introduce surface normal pr…
▽ More
Recently, 3D Gaussian Splatting (3DGS) has achieved impressive performance on indoor surface reconstruction and 3D open-vocabulary segmentation. This paper presents GLS, a unified framework of 3D surface reconstruction and open-vocabulary segmentation based on 3DGS. GLS extends two fields by improving their sharpness and smoothness. For indoor surface reconstruction, we introduce surface normal prior as a geometric cue to guide the rendered normal, and use the normal error to optimize the rendered depth. For 3D open-vocabulary segmentation, we employ 2D CLIP features to guide instance features and enhance the surface smoothness, then utilize DEVA masks to maintain their view consistency. Extensive experiments demonstrate the effectiveness of jointly optimizing surface reconstruction and 3D open-vocabulary segmentation, where GLS surpasses state-of-the-art approaches of each task on MuSHRoom, ScanNet++ and LERF-OVS datasets. Project webpage: https://jiaxiongq.github.io/GLS_ProjectPage.
△ Less
Submitted 29 June, 2025; v1 submitted 27 November, 2024;
originally announced November 2024.
-
Enhancing Imbalance Learning: A Novel Slack-Factor Fuzzy SVM Approach
Authors:
M. Tanveer,
Anushka Tiwari,
Mushir Akhtar,
C. T. Lin
Abstract:
In real-world applications, class-imbalanced datasets pose significant challenges for machine learning algorithms, such as support vector machines (SVMs), particularly in effectively managing imbalance, noise, and outliers. Fuzzy support vector machines (FSVMs) address class imbalance by assigning varying fuzzy memberships to samples; however, their sensitivity to imbalanced datasets can lead to i…
▽ More
In real-world applications, class-imbalanced datasets pose significant challenges for machine learning algorithms, such as support vector machines (SVMs), particularly in effectively managing imbalance, noise, and outliers. Fuzzy support vector machines (FSVMs) address class imbalance by assigning varying fuzzy memberships to samples; however, their sensitivity to imbalanced datasets can lead to inaccurate assessments. The recently developed slack-factor-based FSVM (SFFSVM) improves traditional FSVMs by using slack factors to adjust fuzzy memberships based on misclassification likelihood, thereby rectifying misclassifications induced by the hyperplane obtained via different error cost (DEC). Building on SFFSVM, we propose an improved slack-factor-based FSVM (ISFFSVM) that introduces a novel location parameter. This novel parameter significantly advances the model by constraining the DEC hyperplane's extension, thereby mitigating the risk of misclassifying minority class samples. It ensures that majority class samples with slack factor scores approaching the location threshold are assigned lower fuzzy memberships, which enhances the model's discrimination capability. Extensive experimentation on a diverse array of real-world KEEL datasets demonstrates that the proposed ISFFSVM consistently achieves higher F1-scores, Matthews correlation coefficients (MCC), and area under the precision-recall curve (AUC-PR) compared to baseline classifiers. Consequently, the introduction of the location parameter, coupled with the slack-factor-based fuzzy membership, enables ISFFSVM to outperform traditional approaches, particularly in scenarios characterized by severe class disparity. The code for the proposed model is available at \url{https://github.com/mtanveer1/ISFFSVM}.
△ Less
Submitted 26 November, 2024;
originally announced November 2024.
-
Measurement of cross sections of $e^+e^-\to K^0_S K^0_S ψ(3686)$ from $\sqrt{s}=$ 4.682 to 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
The process $e^+e^-\to K^0_S K^0_S ψ(3686)$ is studied by analyzing $e^+e^-$ collision data samples collected at eight center-of-mass energies ranging from 4.682 to 4.951 GeV with the BESIII detector operating at the BEPCII collider, corresponding to an integrated luminosity of $4.1~{\rm fb}^{-1}$. Observation of the $e^+e^-\to K^0_S K^0_S ψ(3686)$ process is found for the first time with a statis…
▽ More
The process $e^+e^-\to K^0_S K^0_S ψ(3686)$ is studied by analyzing $e^+e^-$ collision data samples collected at eight center-of-mass energies ranging from 4.682 to 4.951 GeV with the BESIII detector operating at the BEPCII collider, corresponding to an integrated luminosity of $4.1~{\rm fb}^{-1}$. Observation of the $e^+e^-\to K^0_S K^0_S ψ(3686)$ process is found for the first time with a statistical significance of $6.3σ$, and the cross sections at each center-of-mass energy are measured. The ratio of cross sections of $e^+e^-\to K_S^0 K_S^0 ψ(3686)$ relative to $e^+e^-\to K^+ K^- ψ(3686)$ is determined to be $\frac{σ(e^+e^-\to K_S^0 K_S^0 ψ(3686))}{σ(e^+e^-\to K^+ K^- ψ(3686))}=0.45 \pm 0.25$, which is consistent with the prediction based on isospin symmetry. The uncertainty includes both statistical and systematic contributions. Additionally, the $K_S^0ψ(3686)$ invariant mass distribution is found to be consistent with three-body phase space. The significance of a contribution beyond three-body phase space is only $0.8σ$.
△ Less
Submitted 3 March, 2025; v1 submitted 24 November, 2024;
originally announced November 2024.
-
Study of $\itΛ_{\it{b}}^\rm{0}$ and $\itΞ_{\it{b}}^\rm{0}$ decays to $\itΛ h^+h^{'-}$ and evidence for $CP$ violation in $\itΛ_{\it{b}}^\rm{0}\to\itΛ K^+K^-$ decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1129 additional authors not shown)
Abstract:
A study of $\itΛ_{\it{b}}^\rm{0}$ and $\itΞ_{\it{b}}^\rm{0}$ decays to $\itΛ h^{+} h^{\prime -}$ $(h^{(\prime)}=π, K)$ is performed using $pp$ collision data collected by the LHCb experiment during LHC Runs 1$-$2, corresponding to an integrated luminosity of $9~\rm{fb}^{-1}$. The branching fractions for these decays are measured using the $\itΛ_{\it{b}}^\rm{0}\to\itΛ_{\it{c}}^+(\to\itΛπ^+)π^-$ dec…
▽ More
A study of $\itΛ_{\it{b}}^\rm{0}$ and $\itΞ_{\it{b}}^\rm{0}$ decays to $\itΛ h^{+} h^{\prime -}$ $(h^{(\prime)}=π, K)$ is performed using $pp$ collision data collected by the LHCb experiment during LHC Runs 1$-$2, corresponding to an integrated luminosity of $9~\rm{fb}^{-1}$. The branching fractions for these decays are measured using the $\itΛ_{\it{b}}^\rm{0}\to\itΛ_{\it{c}}^+(\to\itΛπ^+)π^-$ decay as control channel. The decays $\itΛ_{\it{b}}^\rm{0}\to\itΛπ^+π^-$ and $\itΞ_{\it{b}}^\rm{0}\to\itΛK^-π^+$ are observed for the first time. For decay modes with sufficient signal yields, $CP$ asymmetries are measured in the full and localized regions of the final-state phase space. Evidence is found for $CP$ violation in the $\itΛ_{\it{b}}^\rm{0}\to\itΛK^+K^-$ decay, interpreted as originating primarily from an asymmetric $\itΛ_{\it{b}}^\rm{0} \to \it{N}^{*+} \it{K}^-$ decay amplitude. The measured $CP$ asymmetries for the other decays are compatible with zero.
△ Less
Submitted 2 June, 2025; v1 submitted 22 November, 2024;
originally announced November 2024.