-
Pickles on FIRE: The 3D Shape Evolution of Simulated Milky Way-Mass Galaxies
Authors:
Luke Y. Xia,
Courtney Klein,
James S. Bullock,
Michael Boylan-Kolchin,
Vincent Caudillo,
Jorge Moreno,
Francisco J. Mercado,
Robert Feldmann
Abstract:
JWST and HST observations have revealed numerous elongated, pickle-shaped galaxies at high to intermediate redshifts, with masses close to those expected for Milky Way progenitors. Here we use reduced-mass eigentensors to quantify the ellipsoidal shape evolution of thirteen Milky Way-mass galaxies simulated using FIRE-2 physics; all but one form disks at $z=0$. We find that all of our Milky Way pr…
▽ More
JWST and HST observations have revealed numerous elongated, pickle-shaped galaxies at high to intermediate redshifts, with masses close to those expected for Milky Way progenitors. Here we use reduced-mass eigentensors to quantify the ellipsoidal shape evolution of thirteen Milky Way-mass galaxies simulated using FIRE-2 physics; all but one form disks at $z=0$. We find that all of our Milky Way progenitors go through phases when they are elongated. They often oscillate between spheroidal and elongated shapes in the early Universe over billion-year timescales. This is true whether we measure shapes weighted on stellar mass or luminosity, though the luminosity shapes show more extreme elongation and variance. In contrast, the stellar populations of our $z=0$ Milky Way analogs are never elongated and always symmetric about their minor axes. The youngest stars at $z=0$ reside in thin disks, intermediate-age stars reside in thick disks, and the oldest stars reside in flattened spheroids that are symmetric about their minor axes. Despite their symmetric shapes at $z=0$, the old and intermediate age stellar populations were often arranged in the shape of elongated pickles or triaxial spheroids at the time they formed, meaning that these populations changed shape significantly over time. Our results suggest that observed elongated galaxies seen in the early Universe are not stable structures, but rather reflect transitory phases of galaxy evolution.
△ Less
Submitted 10 June, 2025;
originally announced June 2025.
-
ETT-CKGE: Efficient Task-driven Tokens for Continual Knowledge Graph Embedding
Authors:
Lijing Zhu,
Qizhen Lan,
Qing Tian,
Wenbo Sun,
Li Yang,
Lu Xia,
Yixin Xie,
Xi Xiao,
Tiehang Duan,
Cui Tao,
Shuteng Niu
Abstract:
Continual Knowledge Graph Embedding (CKGE) seeks to integrate new knowledge while preserving past information. However, existing methods struggle with efficiency and scalability due to two key limitations: (1) suboptimal knowledge preservation between snapshots caused by manually designed node/relation importance scores that ignore graph dependencies relevant to the downstream task, and (2) comput…
▽ More
Continual Knowledge Graph Embedding (CKGE) seeks to integrate new knowledge while preserving past information. However, existing methods struggle with efficiency and scalability due to two key limitations: (1) suboptimal knowledge preservation between snapshots caused by manually designed node/relation importance scores that ignore graph dependencies relevant to the downstream task, and (2) computationally expensive graph traversal for node/relation importance calculation, leading to slow training and high memory overhead. To address these limitations, we introduce ETT-CKGE (Efficient, Task-driven, Tokens for Continual Knowledge Graph Embedding), a novel task-guided CKGE method that leverages efficient task-driven tokens for efficient and effective knowledge transfer between snapshots. Our method introduces a set of learnable tokens that directly capture task-relevant signals, eliminating the need for explicit node scoring or traversal. These tokens serve as consistent and reusable guidance across snapshots, enabling efficient token-masked embedding alignment between snapshots. Importantly, knowledge transfer is achieved through simple matrix operations, significantly reducing training time and memory usage. Extensive experiments across six benchmark datasets demonstrate that ETT-CKGE consistently achieves superior or competitive predictive performance, while substantially improving training efficiency and scalability compared to state-of-the-art CKGE methods. The code is available at: https://github.com/lijingzhu1/ETT-CKGE/tree/main
△ Less
Submitted 9 June, 2025;
originally announced June 2025.
-
A novel measurement of the strong-phase difference between $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decays using $C$-even and $C$-odd quantum-correlated $D\bar{D}$ pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (707 additional authors not shown)
Abstract:
A novel measurement technique of strong-phase differences between the decay amplitudes of $D^0$ and $\bar{D}^0$ mesons is introduced which exploits quantum-correlated $D\bar{D}$ pairs produced by $e^+e^-$ collisions at energies above the $ψ(3770)$ production threshold, where $D\bar{D}$ pairs are produced in both even and odd eigenstates of the charge-conjugation symmetry. Employing this technique,…
▽ More
A novel measurement technique of strong-phase differences between the decay amplitudes of $D^0$ and $\bar{D}^0$ mesons is introduced which exploits quantum-correlated $D\bar{D}$ pairs produced by $e^+e^-$ collisions at energies above the $ψ(3770)$ production threshold, where $D\bar{D}$ pairs are produced in both even and odd eigenstates of the charge-conjugation symmetry. Employing this technique, the first determination of a $D^0$-$\bar{D^0}$ relative strong phase is reported with such data samples. The strong-phase difference between $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decays, $δ^{D}_{Kπ}$, is measured to be $δ^{D}_{Kπ}=\left(192.8^{+11.0 + 1.9}_{-12.4 -2.4}\right)^\circ$, using a dataset corresponding to an integrated luminosity of 7.13 $\text{fb}^{-1}$ collected at center-of-mass energies between $4.13-4.23 \text{ GeV}$ by the BESIII experiment.
△ Less
Submitted 10 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
First observation of quantum correlations in $e^+e^-\to XD\bar{D}$ and $C$-even constrained $D\bar{D}$ pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (707 additional authors not shown)
Abstract:
The study of meson pairs produced with quantum correlations gives direct access to parameters that are challenging to measure in other systems. In this Letter, the existence of quantum correlations due to charge-conjugation symmetry $C$ are demonstrated in $D\bar{D}$ pairs produced through the processes $e^+e^-\to D\bar{D}$, $e^+e^- \to D^{*}\bar{D}$, and $e^+e^- \to D^{*} \bar{D}^*$, where the la…
▽ More
The study of meson pairs produced with quantum correlations gives direct access to parameters that are challenging to measure in other systems. In this Letter, the existence of quantum correlations due to charge-conjugation symmetry $C$ are demonstrated in $D\bar{D}$ pairs produced through the processes $e^+e^-\to D\bar{D}$, $e^+e^- \to D^{*}\bar{D}$, and $e^+e^- \to D^{*} \bar{D}^*$, where the lack of charge superscripts refers to an admixture of neutral-charm-meson particle and antiparticle states, using $7.13 \text{ fb}^{-1}$ of $e^+e^-$ collision data collected by the BESIII experiment between center-of-mass energies of $4.13-4.23 \text{ GeV}$. Processes with either $C$-even or $C$-odd constraints are identified and separated. A procedure is presented that harnesses the entangled production process to enable measurements of $D^0$-meson hadronic parameters. This study provides the first confirmation of quantum correlations in $e^+e^-\to X D\bar{D}$ processes and the first observation of a $C$-even constrained $D\bar{D}$ system. The procedure is applied to measure $δ^{D}_{Kπ}$, the strong phase between the $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decay amplitudes, which results in the determination of $δ^{D}_{Kπ}=\left(192.8^{+11.0 + 1.9}_{-12.4 -2.4}\right)^\circ$. The potential for measurements of other hadronic decay parameters and charm mixing with these and future datasets is also discussed.
△ Less
Submitted 10 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
RecGPT: A Foundation Model for Sequential Recommendation
Authors:
Yangqin Jiang,
Xubin Ren,
Lianghao Xia,
Da Luo,
Kangyi Lin,
Chao Huang
Abstract:
This work addresses a fundamental barrier in recommender systems: the inability to generalize across domains without extensive retraining. Traditional ID-based approaches fail entirely in cold-start and cross-domain scenarios where new users or items lack sufficient interaction history. Inspired by foundation models' cross-domain success, we develop a foundation model for sequential recommendation…
▽ More
This work addresses a fundamental barrier in recommender systems: the inability to generalize across domains without extensive retraining. Traditional ID-based approaches fail entirely in cold-start and cross-domain scenarios where new users or items lack sufficient interaction history. Inspired by foundation models' cross-domain success, we develop a foundation model for sequential recommendation that achieves genuine zero-shot generalization capabilities. Our approach fundamentally departs from existing ID-based methods by deriving item representations exclusively from textual features. This enables immediate embedding of any new item without model retraining. We introduce unified item tokenization with Finite Scalar Quantization that transforms heterogeneous textual descriptions into standardized discrete tokens. This eliminates domain barriers that plague existing systems. Additionally, the framework features hybrid bidirectional-causal attention that captures both intra-item token coherence and inter-item sequential dependencies. An efficient catalog-aware beam search decoder enables real-time token-to-item mapping. Unlike conventional approaches confined to their training domains, RecGPT naturally bridges diverse recommendation contexts through its domain-invariant tokenization mechanism. Comprehensive evaluations across six datasets and industrial scenarios demonstrate consistent performance advantages.
△ Less
Submitted 12 June, 2025; v1 submitted 6 June, 2025;
originally announced June 2025.
-
Observation of $D^+\to K^0_Sπ^0μ^+ν_μ$, Test of Lepton Flavor Universality and First Angular Analysis of $D^+\to \bar{K}^\ast(892)^0\ell^+ν_\ell$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (696 additional authors not shown)
Abstract:
We report a study of the semileptonic decays $D^+\to K_S^0π^0\ell^+ν_\ell$ ($\ell = e, μ$) based on $20.3\,\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector.
The $D^+\to K_S^0π^0μ^+ν_μ$ decay is observed for the first time, with a branching fraction of $(0.896\pm0.017_{\rm stat}\pm0.008_{\rm syst})\%$, and the branching frac…
▽ More
We report a study of the semileptonic decays $D^+\to K_S^0π^0\ell^+ν_\ell$ ($\ell = e, μ$) based on $20.3\,\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector.
The $D^+\to K_S^0π^0μ^+ν_μ$ decay is observed for the first time, with a branching fraction of $(0.896\pm0.017_{\rm stat}\pm0.008_{\rm syst})\%$, and the branching fraction of $D^+\to K_S^0π^0e^+ν_e$ is determined with the improved precision as $(0.943\pm0.012_{\rm stat}\pm0.010_{\rm syst})\%$.
From the analysis of the dynamics, we observe that the dominant $\bar{K}^\ast(892)^0$ component is accompanied by an $S$-wave contribution, which accounts for $(7.10 \pm 0.68_{\rm stat} \pm 0.41_{\rm syst})\%$ of the total decay rate of the $μ^+$ channel and $(6.39 \pm 0.17_{\rm stat} \pm 0.14_{\rm syst})\%$ of the $e^+$ channel. Assuming a single-pole dominance parameterization, the hadronic form factor ratios are extracted to be $r_V=V(0)/A_1(0)=1.42 \pm\, 0.03_{\rm stat} \pm\, 0.02_{\rm syst}$ and $r_2=A_2(0)/A_1(0)=0.75 \pm\, 0.03_{\rm stat} \pm\, 0.01_{\rm syst}$.
Based on the first comprehensive angular and the decay-rate $CP$ asymmetry analysis, the full set of averaged angular and $CP$ asymmetry observables are measured as a function of the momentum-transfer squared; they are consistent with expectations from the Standard Model. No evidence for violation of $μ-e$ lepton-flavor universality is observed in either the full range or the five chosen bins of momentum-transfer squared.
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting
Authors:
Nan Wang,
Yuantao Chen,
Lixing Xiao,
Weiqing Xiao,
Bohan Li,
Zhaoxi Chen,
Chongjie Ye,
Shaocong Xu,
Saining Zhang,
Ziyang Yan,
Pierre Merriaux,
Lei Lei,
Tianfan Xue,
Hao Zhao
Abstract:
Neural rendering techniques, including NeRF and Gaussian Splatting (GS), rely on photometric consistency to produce high-quality reconstructions. However, in real-world scenarios, it is challenging to guarantee perfect photometric consistency in acquired images. Appearance codes have been widely used to address this issue, but their modeling capability is limited, as a single code is applied to th…
▽ More
Neural rendering techniques, including NeRF and Gaussian Splatting (GS), rely on photometric consistency to produce high-quality reconstructions. However, in real-world scenarios, it is challenging to guarantee perfect photometric consistency in acquired images. Appearance codes have been widely used to address this issue, but their modeling capability is limited, as a single code is applied to the entire image. Recently, the bilateral grid was introduced to perform pixel-wise color mapping, but it is difficult to optimize and constrain effectively. In this paper, we propose a novel multi-scale bilateral grid that unifies appearance codes and bilateral grids. We demonstrate that this approach significantly improves geometric accuracy in dynamic, decoupled autonomous driving scene reconstruction, outperforming both appearance codes and bilateral grids. This is crucial for autonomous driving, where accurate geometry is important for obstacle avoidance and control. Our method shows strong results across four datasets: Waymo, NuScenes, Argoverse, and PandaSet. We further demonstrate that the improvement in geometry is driven by the multi-scale bilateral grid, which effectively reduces floaters caused by photometric inconsistency.
△ Less
Submitted 4 August, 2025; v1 submitted 5 June, 2025;
originally announced June 2025.
-
Study of $f_1(1420)$ and $η(1405)$ in the decay $J/ψ\to γπ^{0}π^{0}π^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (650 additional authors not shown)
Abstract:
A partial-wave analysis is performed on the decay $J/ψ\toγπ^{0}π^{0}π^{0}$ within the $π^{0}π^{0}π^{0}$ invariant-mass region below 1.6 GeV$/c^{2}$, using $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector. Significant isospin-violating decays of $η(1405)$ and $f_1(1420)$ into $f_0(980)π^{0}$ are observed. For the first time, three axial-vectors, $f_1(1285)$,…
▽ More
A partial-wave analysis is performed on the decay $J/ψ\toγπ^{0}π^{0}π^{0}$ within the $π^{0}π^{0}π^{0}$ invariant-mass region below 1.6 GeV$/c^{2}$, using $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector. Significant isospin-violating decays of $η(1405)$ and $f_1(1420)$ into $f_0(980)π^{0}$ are observed. For the first time, three axial-vectors, $f_1(1285)$, $f_1(1420)$ and $f_1(1510)$, are observed to decay into $π^{0}π^{0}π^{0}$. The product branching fractions of these resonances are reported.
△ Less
Submitted 3 August, 2025; v1 submitted 5 June, 2025;
originally announced June 2025.
-
Sparse Phase Retrieval with Redundant Dictionary via $\ell_q (0<q\le 1)$-Analysis Model
Authors:
Haiye Huo,
Li Xiao
Abstract:
Sparse phase retrieval with redundant dictionary is to reconstruct the signals of interest that are (nearly) sparse in a redundant dictionary or frame from the phaseless measurements via the optimization models. Gao [7] presented conditions on the measurement matrix, called null space property (NSP) and strong dictionary restricted isometry property (S-DRIP), for exact and stable recovery of dicti…
▽ More
Sparse phase retrieval with redundant dictionary is to reconstruct the signals of interest that are (nearly) sparse in a redundant dictionary or frame from the phaseless measurements via the optimization models. Gao [7] presented conditions on the measurement matrix, called null space property (NSP) and strong dictionary restricted isometry property (S-DRIP), for exact and stable recovery of dictionary-$k$-sparse signals via the $\ell_1$-analysis model for sparse phase retrieval with redundant dictionary, respectively, where, in particularly, the S-DRIP of order $tk$ with $t>1$ was derived. In this paper, motivated by many advantages of the $\ell_q$ minimization with $0<q\leq1$, e.g., reduction of the number of measurements required, we generalize these two conditions to the $\ell_q$-analysis model. Specifically, we first present two NSP variants for exact recovery of dictionary-$k$-sparse signals via the $\ell_q$-analysis model in the noiseless scenario. Moreover, we investigate the S-DRIP of order $tk$ with $0<t<\frac{4}{3}$ for stable recovery of dictionary-$k$-sparse signals via the $\ell_q$-analysis model in the noisy scenario, which will complement the existing result of the S-DRIP of order $tk$ with $t\geq2$ obtained in [4].
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Stable recovery of complex dictionary-sparse signals from phaseless measurements
Authors:
Lianxing Xia,
Haiye Huo
Abstract:
Dictionary-sparse phase retrieval, which is also known as phase retrieval with redundant dictionary, aims to reconstruct an original dictionary-sparse signal from its measurements without phase information. It is proved that if the measurement matrix $A$ satisfies null space property (NSP)/strong dictionary restricted isometry property (S-DRIP), then the dictionary-sparse signal can be exactly/sta…
▽ More
Dictionary-sparse phase retrieval, which is also known as phase retrieval with redundant dictionary, aims to reconstruct an original dictionary-sparse signal from its measurements without phase information. It is proved that if the measurement matrix $A$ satisfies null space property (NSP)/strong dictionary restricted isometry property (S-DRIP), then the dictionary-sparse signal can be exactly/stably recovered from its magnitude-only measurements up to a global phase. However, the S-DRIP holds only for real signals. Hence, in this paper, we mainly study the stability of the $\ell_1$-analysis minimization and its generalized $\ell_q\;(0<q\leq1)$-analysis minimization for the recovery of complex dictionary-sparse signals from phaseless measurements. First, we introduce a new $l_1$-dictionary restricted isometry property ($\ell_1$-DRIP) for rank-one and dictionary-sparse matrices, and show that complex dictionary-sparse signals can be stably recovered by magnitude-only measurements via $\ell_1$-analysis minimization provided that the quadratic measurement map $\mathcal{A}$ satisfies $\ell_1$-DRIP. Then, we generalized the $\ell_1$-DRIP condition under the framework of $\ell_q\;(0<q\leq1)$-analysis minimization.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample
Authors:
Ze Feng,
Jiang-Jiang Liu,
Sen Yang,
Lingyu Xiao,
Xiaofan Li,
Wankou Yang,
Jingdong Wang
Abstract:
In this work, we study the Efficient Multimodal Large Language Model. Redundant vision tokens consume a significant amount of computational memory and resources. Therefore, many previous works compress them in the Vision Projector to reduce the number of vision tokens. However, simply compressing in the Vision Projector can lead to the loss of visual information, especially for tasks that rely on…
▽ More
In this work, we study the Efficient Multimodal Large Language Model. Redundant vision tokens consume a significant amount of computational memory and resources. Therefore, many previous works compress them in the Vision Projector to reduce the number of vision tokens. However, simply compressing in the Vision Projector can lead to the loss of visual information, especially for tasks that rely on fine-grained spatial relationships, such as OCR and Chart \& Table Understanding. To address this problem, we propose Vision Remember, which is inserted between the LLM decoder layers to allow vision tokens to re-memorize vision features. Specifically, we retain multi-level vision features and resample them with the vision tokens that have interacted with the text token. During the resampling process, each vision token only attends to a local region in vision features, which is referred to as saliency-enhancing local attention. Saliency-enhancing local attention not only improves computational efficiency but also captures more fine-grained contextual information and spatial relationships within the region. Comprehensive experiments on multiple visual understanding benchmarks validate the effectiveness of our method when combined with various Efficient Vision Projectors, showing performance gains without sacrificing efficiency. Based on Vision Remember, LLaVA-VR with only 2B parameters is also superior to previous representative MLLMs such as Tokenpacker-HD-7B and DeepSeek-VL-7B.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Simulation of MAPS and a MAPS-based Inner Tracker for the Super Tau-Charm Facility
Authors:
Ruiyang Zhang,
Dongwei Xuan,
Jiajun Qin,
Lei Zhao,
Le Xiao,
Xiangming Sun,
Lailin Xu,
Jianbei Liu
Abstract:
Monolithic Active Pixel Sensors (MAPS) are a promising detector candidate for the inner tracker of the Super Tau-Charm Facility (STCF). To evaluate the performance of MAPS and the MAPS-based inner tracker, a dedicated simulation workflow has been developed, offering essential insights for detector design and optimization.
The intrinsic characteristics of MAPS, designed using several fabrication…
▽ More
Monolithic Active Pixel Sensors (MAPS) are a promising detector candidate for the inner tracker of the Super Tau-Charm Facility (STCF). To evaluate the performance of MAPS and the MAPS-based inner tracker, a dedicated simulation workflow has been developed, offering essential insights for detector design and optimization.
The intrinsic characteristics of MAPS, designed using several fabrication processes and pixel geometries, were investigated through a combination of Technology Computer Aided Design (TCAD) and Monte Carlo simulations. Simulations were conducted with both minimum ionizing particles and $^{55}$Fe X-rays to assess critical parameters such as detection efficiency, cluster size, spatial resolution, and charge collection efficiency. Based on these evaluations, a MAPS sensor featuring a strip-like pixel and a high-resistivity epitaxial layer is selected as the baseline sensor design for the STCF inner tracker due to its excellent performance.
Using this optimized MAPS design, a three-layer MAPS-based inner tracker was modeled and simulated. The simulation demonstrated an average detection efficiency exceeding 99%, spatial resolutions of 44.8$\rm{μm}$ in the $z$ direction and 8.2$\rm{μm}$ in the $r-φ$ direction, and an intrinsic sensor time resolution of 5.9ns for 1GeV/c $μ^-$ particles originating from the interaction point. These promising results suggest that the MAPS-based inner tracker fulfills the performance requirements of the STCF experiment.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Measurement of the branching fractions of the Cabibbo-favored decays $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ and $Λ_{c}^{+}\toΞ^{0}K_{S}^{0}π^{+}$ and search for $Λ_{c}^{+}\toΣ^{0} K_{S}^{0}K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (660 additional authors not shown)
Abstract:
Based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of about 4.5 fb$^{-1}$ collected at center-of-mass energies between 4599.53 MeV and 4698.82 MeV with the BESIII detector, the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is measured to be $(3.12\pm0.46\pm0.15)\times10^{-3}$. Combined with a previous measurement from the BESIII…
▽ More
Based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of about 4.5 fb$^{-1}$ collected at center-of-mass energies between 4599.53 MeV and 4698.82 MeV with the BESIII detector, the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is measured to be $(3.12\pm0.46\pm0.15)\times10^{-3}$. Combined with a previous measurement from the BESIII Collaboration, the branching fraction of the decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is calculated to be $(3.07\pm0.26\pm0.13)\times10^{-3}$. The decay $Λ_{c}^{+}\toΞ^{0}K_{S}^{0}π^{+}$ is observed for the first time with a statistical significance of $6.6σ$, and its branching fraction is determined to be $(3.70\pm0.60\pm0.21)\times10^{-3}$. In addition, a search for the decay $Λ_{c}^{+}\toΣ^{0} K_{S}^{0}K^{+}$ is performed and its branching fraction is determined to be $(0.80^{+0.28}_{-0.24}\pm0.16)\times10^{-3}$, corresponding to an upper limit of $1.28\times10^{-3}$ at $90\%$ confidence level. These measurements provide new information that can be used to distinguish between theoretical models.
△ Less
Submitted 24 July, 2025; v1 submitted 3 June, 2025;
originally announced June 2025.
-
Improved Measurements of $D^+ \to ηe^+ν_e$ and $D^+ \to ημ^+ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (682 additional authors not shown)
Abstract:
Using 20.3 fb$^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we measure the branching fractions of $D^+\to ηe^+ν_e$ and $D^+\to ημ^+ν_μ$ to be $(9.75\pm0.29\pm0.28)\times10^{-4}$ and $(9.08\pm0.35\pm0.23)\times10^{-4}$, where the first and second uncertainties are statistical and systematic, respectively. From a simultaneous fit to t…
▽ More
Using 20.3 fb$^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we measure the branching fractions of $D^+\to ηe^+ν_e$ and $D^+\to ημ^+ν_μ$ to be $(9.75\pm0.29\pm0.28)\times10^{-4}$ and $(9.08\pm0.35\pm0.23)\times10^{-4}$, where the first and second uncertainties are statistical and systematic, respectively. From a simultaneous fit to their partial decay rates, we determine the product of the hadronic form factor $f^η_+(0)$ and the modulus of the $c\to d$ Cabibbo-Kobayashi-Maskawa matrix element $|V_{cd}|$ to be $f^η_+(0)|V_{cd}|=0.078\pm0.002\pm0.001$. Taking the $|V_{cd}|$ value from the Standard Model global fit as input, we obtain $f^η_+(0)=0.345\pm0.008\pm0.003$. The ratio between the measured branching fractions of $D^+\toη^+μ^+ν_μ$ and $D^+\toηe^+ν_e$, is determined to be $0.93\pm0.05_{\rm stat.}\pm0.02_{\rm syst.}$, indicating no violation of lepton flavor universality.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Sodium-Decorated P-C3N: A Porous 2D Framework for High-Capacity and Reversible Hydrogen Storage
Authors:
Jose A. S. Laranjeira,
Nicolas F. Martins,
Kleuton A. L. Lima,
Lingtao Xiao,
Xihao Chen,
Luiz A. Ribeiro Junior,
Julio R. Sambrano
Abstract:
The development of reversible hydrogen storage materials has become crucial for enabling carbon-neutral energy systems. Based on this, the present work investigates the hydrogen storage on the sodium-decorated P-C$_3$N (Na@P-C$_3$N), a porous carbon nitride monolayer recently proposed as a stable semiconductor. First-principles calculations reveal that Na atoms preferentially adsorb with an adsorp…
▽ More
The development of reversible hydrogen storage materials has become crucial for enabling carbon-neutral energy systems. Based on this, the present work investigates the hydrogen storage on the sodium-decorated P-C$_3$N (Na@P-C$_3$N), a porous carbon nitride monolayer recently proposed as a stable semiconductor. First-principles calculations reveal that Na atoms preferentially adsorb with an adsorption energy of -4.48~eV, effectively suppressing clusterization effects. Upon decoration, the system becomes metallic, while \textit{ab initio} molecular dynamics simulations confirm the thermal stability of Na@P-C$_3$N at 300~K. Hydrogen adsorption on Na@P-C$_3$N occurs through weak physisorption, with energies ranging from -0.18 to -0.28~eV, and desorption temperatures between 231 and 357~K. The system can stably absorb 16 H$_2$ molecules per unit cell, corresponding to a gravimetric storage capacity of 9.88~wt\%, surpassing the U.S. Department of Energy target. These results demonstrate that Na@P-C$_3$N is a promising candidate for lightweight, stable, and reversible hydrogen storage.
△ Less
Submitted 15 September, 2025; v1 submitted 2 June, 2025;
originally announced June 2025.
-
A Low Power Monolithic Active Pixel Sensor Prototype for the STCF Inner Tracker
Authors:
Dongwei Xuan,
Ruiyang Zhang,
Jiajun Qin,
Hao Han,
Xinyu Bin,
Zihan Xu,
Lei Zhao,
Jianbei Liu,
Liang Zhang,
Anqing Wang,
Aodong Song,
Xiangming Sun,
Le Xiao,
Lailin Xu
Abstract:
The Super Tau-Charm Facility (STCF) is a proposed $e^+e^-$ collider with a peak luminosity 100 times higher than that of the present tau-charm factory. The inner tracker (ITK) of STCF should feature a low material budget and high readout speed. Under these requirements, the monolithic active pixel sensor (MAPS) is considered as a promising candidate for the ITK. To minimize the power consumption o…
▽ More
The Super Tau-Charm Facility (STCF) is a proposed $e^+e^-$ collider with a peak luminosity 100 times higher than that of the present tau-charm factory. The inner tracker (ITK) of STCF should feature a low material budget and high readout speed. Under these requirements, the monolithic active pixel sensor (MAPS) is considered as a promising candidate for the ITK. To minimize the power consumption of MAPS (for low material budget), larger-size sensors are proposed to reduce the scale of the readout circuitry while preserving the required position resolution. Multiple sensors with varying dimensions and structures were designed and integrated in several prototype chips for performance comparison, fabricated in a 180~nm CIS process. The in-pixel readout circuit can also provide time of arrival (ToA) and time-over-threshold (ToT) of the hit signal, with a least significant bit (LSB) of 50 ns. The peripheral readout circuit performs operations including timestamp correction, data aggregation, caching, framing, 8b/10b encoding, and serialization. According to simulation, the power consumption for a full-scale chip is about 55.7 mW/cm2. Preliminary measurements have been conducted on the prototype chips.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
PolyBERT: Fine-Tuned Poly Encoder BERT-Based Model for Word Sense Disambiguation
Authors:
Linhan Xia,
Mingzhan Yang,
Guohui Yuan,
Shengnan Tao,
Yujing Qiu,
Guo Yu,
Kai Lei
Abstract:
Mainstream Word Sense Disambiguation (WSD) approaches have employed BERT to extract semantics from both context and definitions of senses to determine the most suitable sense of a target word, achieving notable performance. However, there are two limitations in these approaches. First, previous studies failed to balance the representation of token-level (local) and sequence-level (global) semantic…
▽ More
Mainstream Word Sense Disambiguation (WSD) approaches have employed BERT to extract semantics from both context and definitions of senses to determine the most suitable sense of a target word, achieving notable performance. However, there are two limitations in these approaches. First, previous studies failed to balance the representation of token-level (local) and sequence-level (global) semantics during feature extraction, leading to insufficient semantic representation and a performance bottleneck. Second, these approaches incorporated all possible senses of each target word during the training phase, leading to unnecessary computational costs. To overcome these limitations, this paper introduces a poly-encoder BERT-based model with batch contrastive learning for WSD, named PolyBERT. Compared with previous WSD methods, PolyBERT has two improvements: (1) A poly-encoder with a multi-head attention mechanism is utilized to fuse token-level (local) and sequence-level (global) semantics, rather than focusing on just one. This approach enriches semantic representation by balancing local and global semantics. (2) To avoid redundant training inputs, Batch Contrastive Learning (BCL) is introduced. BCL utilizes the correct senses of other target words in the same batch as negative samples for the current target word, which reduces training inputs and computational cost. The experimental results demonstrate that PolyBERT outperforms baseline WSD methods such as Huang's GlossBERT and Blevins's BEM by 2\% in F1-score. In addition, PolyBERT with BCL reduces GPU hours by 37.6\% compared with PolyBERT without BCL.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
Pre-training for Recommendation Unlearning
Authors:
Guoxuan Chen,
Lianghao Xia,
Chao Huang
Abstract:
Modern recommender systems powered by Graph Neural Networks (GNNs) excel at modeling complex user-item interactions, yet increasingly face scenarios requiring selective forgetting of training data. Beyond user requests to remove specific interactions due to privacy concerns or preference changes, regulatory frameworks mandate recommender systems' ability to eliminate the influence of certain user…
▽ More
Modern recommender systems powered by Graph Neural Networks (GNNs) excel at modeling complex user-item interactions, yet increasingly face scenarios requiring selective forgetting of training data. Beyond user requests to remove specific interactions due to privacy concerns or preference changes, regulatory frameworks mandate recommender systems' ability to eliminate the influence of certain user data from models. This recommendation unlearning challenge presents unique difficulties as removing connections within interaction graphs creates ripple effects throughout the model, potentially impacting recommendations for numerous users. Traditional approaches suffer from significant drawbacks: fragmentation methods damage graph structure and diminish performance, while influence function techniques make assumptions that may not hold in complex GNNs, particularly with self-supervised or random architectures. To address these limitations, we propose a novel model-agnostic pre-training paradigm UnlearnRec that prepares systems for efficient unlearning operations. Our Influence Encoder takes unlearning requests together with existing model parameters and directly produces updated parameters of unlearned model with little fine-tuning, avoiding complete retraining while preserving model performance characteristics. Extensive evaluation on public benchmarks demonstrates that our method delivers exceptional unlearning effectiveness while providing more than 10x speedup compared to retraining approaches. We release our method implementation at: https://github.com/HKUDS/UnlearnRec.
△ Less
Submitted 29 May, 2025; v1 submitted 28 May, 2025;
originally announced May 2025.
-
Search for a dark baryon in the $Ξ^-\rightarrowπ^-+{\rm invisible}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (697 additional authors not shown)
Abstract:
A search for a dark baryon is performed for the first time in the two-body decay $Ξ^-\rightarrowπ^-+{\rm invisible}$ using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097\,\mbox{GeV}$ with the BESIII detector at the BEPCII collider. No significant signal is observed, and the 90% (95%) confidence level upper limits on the branching fraction…
▽ More
A search for a dark baryon is performed for the first time in the two-body decay $Ξ^-\rightarrowπ^-+{\rm invisible}$ using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097\,\mbox{GeV}$ with the BESIII detector at the BEPCII collider. No significant signal is observed, and the 90% (95%) confidence level upper limits on the branching fraction $B(Ξ^-\rightarrowπ^-+{\rm invisible})$ are determined to be $4.2\times10^{-5}$ ($5.2\times10^{-5}$), $6.9\times10^{-5}$ ($8.4\times10^{-5}$), $6.5\times10^{-4}$ ($7.6\times10^{-4}$), $1.1\times10^{-4}$ ($1.3\times10^{-4}$) and $4.5\times10^{-5}$ ($5.5\times10^{-5}$), under the dark baryon mass hypotheses of 1.07$\,\mbox{GeV}/c^2$, 1.10$\,\mbox{GeV}/c^2$, $m_Λ$ (1.116$\,\mbox{GeV}/c^2$), 1.13$\,\mbox{GeV}/c^2$, and 1.16$\,\mbox{GeV}/c^2$, respectively. The constraints obtained on the Wilson coefficients $C_{u s, s}^L$ and $C_{u s, s}^R$ are more stringent than the previous limits derived from the LHC searches for the colored mediators.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
A candidate for True Type-2 AGN without hidden central BLRs Identified by central Tidal Disruption Event
Authors:
Gu Ying,
Zheng Qi,
Cheng Peizheng,
Li Xiao,
Xing-Qian Cheng,
Zhang XueGuang,
Liang EnWei
Abstract:
In this manuscript, through applications of TDE (tidal disruption event) expected variability properties, a potential candidate for True type-2 AGN without hidden central broad line regions (=TT2AGN) is reported in the SDSS J233454.07+145712.9 (=SDSS J2334). Through analyzing the 20-years optical light curves of SDSS J2334 from different Sky Survey projects, a TDE is preferred with a…
▽ More
In this manuscript, through applications of TDE (tidal disruption event) expected variability properties, a potential candidate for True type-2 AGN without hidden central broad line regions (=TT2AGN) is reported in the SDSS J233454.07+145712.9 (=SDSS J2334). Through analyzing the 20-years optical light curves of SDSS J2334 from different Sky Survey projects, a TDE is preferred with a $4.7{\rm M_\odot}$ main-sequence star tidally disrupted by the central BH with mass $11.7\times 10^6{\rm M_\odot}$, indicating that central region within distance about 20 light-days to central BH in SDSS J2334 is directly in the line-of-sight. Moreover, AGN activities in SDSS J2334 can be confirmed through applications of BPT diagrams. Meanwhile, comparing virial BH mass determined through assumed broad Balmer emission components and M-sigma expected BH mass by well measured stellar velocity dispersion through stellar absorption features, optical broad emission lines in SDSS J2334 are disfavored with confidence level higher than 6$σ$. Therefore, combining the unique properties of the TDE and the spectroscopic results with only narrow emission lines, SDSS J2334 can be well identified as a potential candidate for a TT2AGN. The results indicate the to detect TDE expected flares in normal Type-2 AGN classified by spectroscopic results should be a new practicable method for identifying
△ Less
Submitted 27 May, 2025;
originally announced May 2025.
-
First measurement of $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ cross-sections via $Σ^+$-nucleus scattering at an electron-positron collider
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the reactions $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ are studied, where the $Σ^{+}$ baryon is produced in the process $J/ψ\rightarrowΣ^{+}\barΣ^-$ and the neutron is a component of the $^9\rm{Be}$, $^{12}\rm{C}$ and $^{197}\rm{Au}$ nuclei in the beam pipe. Clear signals o…
▽ More
Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the reactions $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ are studied, where the $Σ^{+}$ baryon is produced in the process $J/ψ\rightarrowΣ^{+}\barΣ^-$ and the neutron is a component of the $^9\rm{Be}$, $^{12}\rm{C}$ and $^{197}\rm{Au}$ nuclei in the beam pipe. Clear signals of these two reactions are observed for the first time. Their cross-sections are measured to be $σ(Σ^{+}+{^9\rm{Be}}\rightarrowΛ+p+{^8\rm{Be}})=(45.2\pm12.1_{\rm{stat}}\pm7.2_{\rm{sys}})$ mb and $σ(Σ^{+}+{^9\rm{Be}}\rightarrowΣ^{0}+p+{^8\rm{Be}})=(29.8\pm9.7_{\rm{stat}}\pm6.9_{\rm{sys}})$ mb for a $Σ^{+}$ average momentum of $0.992$ GeV/$c$, within a range of $\pm0.015$ GeV/$c$. This is the first study of $Σ^{+}$-nucleon scattering at an electron-positron collider.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective
Authors:
Junnan Liu,
Hongwei Liu,
Linchen Xiao,
Shudong Liu,
Taolin Zhang,
Zihan Ma,
Songyang Zhang,
Kai Chen
Abstract:
We propose a novel framework for comprehending the reasoning capabilities of large language models (LLMs) through the perspective of meta-learning. By conceptualizing reasoning trajectories as pseudo-gradient descent updates to the LLM's parameters, we identify parallels between LLM reasoning and various meta-learning paradigms. We formalize the training process for reasoning tasks as a meta-learn…
▽ More
We propose a novel framework for comprehending the reasoning capabilities of large language models (LLMs) through the perspective of meta-learning. By conceptualizing reasoning trajectories as pseudo-gradient descent updates to the LLM's parameters, we identify parallels between LLM reasoning and various meta-learning paradigms. We formalize the training process for reasoning tasks as a meta-learning setup, with each question treated as an individual task, and reasoning trajectories serving as the inner loop optimization for adapting model parameters. Once trained on a diverse set of questions, the LLM develops fundamental reasoning capabilities that can generalize to previously unseen questions. Extensive empirical evaluations substantiate the strong connection between LLM reasoning and meta-learning, exploring several issues of significant interest from a meta-learning standpoint. Our work not only enhances the understanding of LLM reasoning but also provides practical insights for improving these models through established meta-learning techniques.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching
Authors:
Ziqian Wang,
Zikai Liu,
Xinfa Zhu,
Yike Zhu,
Mingshuai Liu,
Jun Chen,
Longshuai Xiao,
Chao Weng,
Lei Xie
Abstract:
Generative models have excelled in audio tasks using approaches such as language models, diffusion, and flow matching. However, existing generative approaches for speech enhancement (SE) face notable challenges: language model-based methods suffer from quantization loss, leading to compromised speaker similarity and intelligibility, while diffusion models require complex training and high inferenc…
▽ More
Generative models have excelled in audio tasks using approaches such as language models, diffusion, and flow matching. However, existing generative approaches for speech enhancement (SE) face notable challenges: language model-based methods suffer from quantization loss, leading to compromised speaker similarity and intelligibility, while diffusion models require complex training and high inference latency. To address these challenges, we propose FlowSE, a flow-matching-based model for SE. Flow matching learns a continuous transformation between noisy and clean speech distributions in a single pass, significantly reducing inference latency while maintaining high-quality reconstruction. Specifically, FlowSE trains on noisy mel spectrograms and optional character sequences, optimizing a conditional flow matching loss with ground-truth mel spectrograms as supervision. It implicitly learns speech's temporal-spectral structure and text-speech alignment. During inference, FlowSE can operate with or without textual information, achieving impressive results in both scenarios, with further improvements when transcripts are available. Extensive experiments demonstrate that FlowSE significantly outperforms state-of-the-art generative methods, establishing a new paradigm for generative-based SE and demonstrating the potential of flow matching to advance the field. Our code, pre-trained checkpoints, and audio samples are available.
△ Less
Submitted 27 May, 2025; v1 submitted 25 May, 2025;
originally announced May 2025.
-
Geometry-guided Online 3D Video Synthesis with Multi-View Temporal Consistency
Authors:
Hyunho Ha,
Lei Xiao,
Christian Richardt,
Thu Nguyen-Phuoc,
Changil Kim,
Min H. Kim,
Douglas Lanman,
Numair Khan
Abstract:
We introduce a novel geometry-guided online video view synthesis method with enhanced view and temporal consistency. Traditional approaches achieve high-quality synthesis from dense multi-view camera setups but require significant computational resources. In contrast, selective-input methods reduce this cost but often compromise quality, leading to multi-view and temporal inconsistencies such as f…
▽ More
We introduce a novel geometry-guided online video view synthesis method with enhanced view and temporal consistency. Traditional approaches achieve high-quality synthesis from dense multi-view camera setups but require significant computational resources. In contrast, selective-input methods reduce this cost but often compromise quality, leading to multi-view and temporal inconsistencies such as flickering artifacts. Our method addresses this challenge to deliver efficient, high-quality novel-view synthesis with view and temporal consistency. The key innovation of our approach lies in using global geometry to guide an image-based rendering pipeline. To accomplish this, we progressively refine depth maps using color difference masks across time. These depth maps are then accumulated through truncated signed distance fields in the synthesized view's image space. This depth representation is view and temporally consistent, and is used to guide a pre-trained blending network that fuses multiple forward-rendered input-view images. Thus, the network is encouraged to output geometrically consistent synthesis results across multiple views and time. Our approach achieves consistent, high-quality video synthesis, while running efficiently in an online manner.
△ Less
Submitted 24 May, 2025;
originally announced May 2025.
-
AI-Researcher: Autonomous Scientific Innovation
Authors:
Jiabin Tang,
Lianghao Xia,
Zhonghang Li,
Chao Huang
Abstract:
The powerful reasoning capabilities of Large Language Models (LLMs) in mathematics and coding, combined with their ability to automate complex tasks through agentic frameworks, present unprecedented opportunities for accelerating scientific innovation. In this paper, we introduce AI-Researcher, a fully autonomous research system that transforms how AI-driven scientific discovery is conducted and e…
▽ More
The powerful reasoning capabilities of Large Language Models (LLMs) in mathematics and coding, combined with their ability to automate complex tasks through agentic frameworks, present unprecedented opportunities for accelerating scientific innovation. In this paper, we introduce AI-Researcher, a fully autonomous research system that transforms how AI-driven scientific discovery is conducted and evaluated. Our framework seamlessly orchestrates the complete research pipeline--from literature review and hypothesis generation to algorithm implementation and publication-ready manuscript preparation--with minimal human intervention. To rigorously assess autonomous research capabilities, we develop Scientist-Bench, a comprehensive benchmark comprising state-of-the-art papers across diverse AI research domains, featuring both guided innovation and open-ended exploration tasks. Through extensive experiments, we demonstrate that AI-Researcher achieves remarkable implementation success rates and produces research papers that approach human-level quality. This work establishes new foundations for autonomous scientific innovation that can complement human researchers by systematically exploring solution spaces beyond cognitive limitations.
△ Less
Submitted 24 May, 2025;
originally announced May 2025.
-
ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation
Authors:
Zhen Li,
Duan Li,
Yukai Guo,
Xinyuan Guo,
Bowen Li,
Lanxi Xiao,
Shenyu Qiao,
Jiashu Chen,
Zijian Wu,
Hui Zhang,
Xinhuan Shu,
Shixia Liu
Abstract:
Infographic charts are a powerful medium for communicating abstract data by combining visual elements (e.g., charts, images) with textual information. However, their visual and structural richness poses challenges for large vision-language models (LVLMs), which are typically trained on plain charts. To bridge this gap, we introduce ChartGalaxy, a million-scale dataset designed to advance the under…
▽ More
Infographic charts are a powerful medium for communicating abstract data by combining visual elements (e.g., charts, images) with textual information. However, their visual and structural richness poses challenges for large vision-language models (LVLMs), which are typically trained on plain charts. To bridge this gap, we introduce ChartGalaxy, a million-scale dataset designed to advance the understanding and generation of infographic charts. The dataset is constructed through an inductive process that identifies 75 chart types, 440 chart variations, and 68 layout templates from real infographic charts and uses them to create synthetic ones programmatically. We showcase the utility of this dataset through: 1) improving infographic chart understanding via fine-tuning, 2) benchmarking code generation for infographic charts, and 3) enabling example-based infographic chart generation. By capturing the visual and structural complexity of real design, ChartGalaxy provides a useful resource for enhancing multimodal reasoning and generation in LVLMs.
△ Less
Submitted 26 September, 2025; v1 submitted 24 May, 2025;
originally announced May 2025.
-
Ligand-SOC enhanced $4f^5$ Kitaev antiferromagnet: Application to $\mathrm{SmI}_3$
Authors:
Li-Hao Xia,
Yi-Peng Gao,
Zhao-Yang Dong,
Jian-Xin Li
Abstract:
The search for Kitaev quantum spin liquids (Kitaev-QSLs) in real materials has mainly focused on $4d$- and $5d$-electron honeycomb systems. A recent experimental study on the $4f^5$ honeycomb iodide $\mathrm{SmI}_3$ reported the absence of long-range magnetic order down to $0.1\ \text{K}$, suggesting a possible Kitaev-QSL phase. Motivated by the interplay between the complex exchange processes inh…
▽ More
The search for Kitaev quantum spin liquids (Kitaev-QSLs) in real materials has mainly focused on $4d$- and $5d$-electron honeycomb systems. A recent experimental study on the $4f^5$ honeycomb iodide $\mathrm{SmI}_3$ reported the absence of long-range magnetic order down to $0.1\ \text{K}$, suggesting a possible Kitaev-QSL phase. Motivated by the interplay between the complex exchange processes inherent to the $4f^5$ multi-electron configuration and the strong spin-orbit coupling (SOC) of the iodine ligands, we systematically investigate the effective exchange interactions in $\mathrm{SmI}_3$ using the strong coupling expansion method. Our findings reveal that bond-dependent SOCs (bond-SOCs), extracted from relativistic density functional theory (DFT) calculations, significantly enhance the antiferromagnetic (AFM) Kitaev interaction, driving the system close to the AFM Kitaev point. A microscopic analysis based on the Slater-Koster approach further indicates that the strong SOC of the iodine ligands (ligand-SOC) is the origin of bond-SOCs and plays a pivotal role in mediating the superexchange processes. Additionally, we identify a spin-flop transition induced by the bond-SOCs, where the enhanced AFM Kitaev interactions shift the AFM order from the out-of-plane $[1, 1, 1]$-direction to an in-plane orientation, breaking the $C_3$ rotational symmetry. Linear spin-wave theory (LSWT) further predicts the emergence of gapless modes following the spin-flop transition, indicating enhanced fluctuations and increased instability near the AFM Kitaev point. Our results highlight the crucial role of strong ligand-SOC in stabilizing the dominant AFM Kitaev interactions in $\mathrm{SmI}_3$ and provide valuable insights for discovering new $f$-electron Kitaev-QSL candidates.
△ Less
Submitted 24 May, 2025;
originally announced May 2025.
-
Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s}$ between 4.600 and 4.699 GeV with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of…
▽ More
By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s}$ between 4.600 and 4.699 GeV with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of $Λ_{c}^+ \rightarrow Σ^+ η$ relative to $Λ_{c}^+ \rightarrow Σ^+ π^0$ is determined to be $0.305 \pm 0.046_{\rm stat.} \pm 0.007_{\rm syst.}$, and that of $Λ_{c}^+ \rightarrow Σ^+ η'$ relative to $Λ_{c}^+ \rightarrow Σ^+ ω$ is $0.336 \pm 0.094_{\rm stat.} \pm 0.037_{\rm syst.}$. The ratio of $\frac{\mathcal{B}\left(Λ_{c}^{+} \rightarrow Σ^{+} η'\right)}{\mathcal{B}\left(Λ_{c}^{+} \rightarrow Σ^{+} η\right)} $ is determined to be $1.73 \pm 0.22_{\rm stat.} \pm 0.16_{\rm syst.}$. These results enrich our knowledge of charmed baryon decays.
△ Less
Submitted 5 September, 2025; v1 submitted 23 May, 2025;
originally announced May 2025.
-
Towards Robust Evaluation of STEM Education: Leveraging MLLMs in Project-Based Learning
Authors:
Yanhao Jia,
Xinyi Wu,
Qinglin Zhang,
Yiran Qin,
Luwei Xiao,
Shuai Zhao
Abstract:
Project-Based Learning (PBL) involves a variety of highly correlated multimodal data, making it a vital educational approach within STEM disciplines. With the rapid development of multimodal large language models (MLLMs), researchers have begun exploring their potential to enhance tasks such as information retrieval, knowledge comprehension, and data generation in educational settings. However, ex…
▽ More
Project-Based Learning (PBL) involves a variety of highly correlated multimodal data, making it a vital educational approach within STEM disciplines. With the rapid development of multimodal large language models (MLLMs), researchers have begun exploring their potential to enhance tasks such as information retrieval, knowledge comprehension, and data generation in educational settings. However, existing benchmarks fall short in providing both a free-form output structure and a rigorous human expert validation process, limiting their effectiveness in evaluating real-world educational tasks. Additionally, few methods have developed automated pipelines to assist with the complex responsibilities of teachers leveraging MLLMs, largely due to model hallucination and instability, which lead to unreliable implementation. To address this gap, we introduce PBLBench, a novel benchmark designed to evaluate complex reasoning grounded in domain-specific knowledge and long-context understanding, thereby challenging models with tasks that closely resemble those handled by human experts. To establish reliable ground truth, we adopt the Analytic Hierarchy Process (AHP), utilizing expert-driven pairwise comparisons to derive structured and weighted evaluation criteria. We assess the performance of 15 leading MLLMs/LLMs using PBLBench and demonstrate that even the most advanced models achieve only 59% rank accuracy, underscoring the significant challenges presented by this benchmark. We believe PBLBench will serve as a catalyst for the development of more capable AI agents, ultimately aiming to alleviate teacher workload and enhance educational productivity.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Observation of $χ_{cJ}\to 3K_S^0K^\pmπ^\mp$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (678 additional authors not shown)
Abstract:
By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$…
▽ More
By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$ $\mathcal{B}(χ_{c1}\to 3K_S^0K^\pmπ^\mp)=(2.62\pm0.08\pm0.19)\times10^{-4},$ and $\mathcal{B}(χ_{c2}\to 3K_S^0K^\pmπ^\mp)=(1.72\pm0.07\pm0.15)\times10^{-4},$ where the first uncertainties are statistical and the second systematic.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
DeepCEE: Efficient Cross-Region Model Distributed Training System under Heterogeneous GPUs and Networks
Authors:
Jinquan Wang,
Xiaojian Liao,
Xuzhao Liu,
Jiashun Suo,
Zhisheng Huo,
Chenhao Zhang,
Xiangrong Xu,
Runnan Shen,
Xilong Xie,
Limin Xiao
Abstract:
Most existing training systems focus on a single region. In contrast, we envision that cross-region training offers more flexible GPU resource allocation and yields significant potential. However, the hierarchical cluster topology and unstable networks in the cloud-edge-end (CEE) environment, a typical cross-region scenario, pose substantial challenges to building an efficient and autonomous model…
▽ More
Most existing training systems focus on a single region. In contrast, we envision that cross-region training offers more flexible GPU resource allocation and yields significant potential. However, the hierarchical cluster topology and unstable networks in the cloud-edge-end (CEE) environment, a typical cross-region scenario, pose substantial challenges to building an efficient and autonomous model training system. We propose DeepCEE, a geo-distributed model training system tailored for heterogeneous GPUs and networks in CEE environments. DeepCEE adopts a communication-centric design philosophy to tackle challenges arising from slow and unstable inter-region networks. It begins with a heterogeneous device profiler that identifies and groups devices based on both network and compute characteristics. Leveraging device groups, DeepCEE implements compact, zero-bubble pipeline parallelism, automatically deriving optimal parallel strategies. To further adapt to runtime variability, DeepCEE integrates a dynamic environment adapter that reacts to network fluctuations. Extensive evaluations demonstrate that DeepCEE achieves 1.3-2.8x higher training throughput compared to widely used and SOTA training systems.
△ Less
Submitted 27 May, 2025; v1 submitted 21 May, 2025;
originally announced May 2025.
-
Test of local realism via entangled $Λ\barΛ$ system
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (597 additional authors not shown)
Abstract:
The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However…
▽ More
The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However, examples of Bell inequalities violation in high energy physics are scarce. In this study, we utilize $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BES-III detector at the BEPCII collider, performing non-local correlation tests using the entangled hyperon pairs. The massive-entangled $Λ\barΛ$ systems are formed and decay through strong and weak interactions, respectively. Through measurements of the angular distribution of $p\bar{p}$ in $J/ψ\to γη_c$ and subsequent $η_c\toΛ(pπ^-)\barΛ(\bar{p}π^{+})$ cascade decays, a significant violation of LHVT predictions is observed. The exclusion of LHVT is found to be statistically significant at a level exceeding $5.2σ$ in the testing of three Bell-like inequalities.
△ Less
Submitted 20 May, 2025;
originally announced May 2025.
-
Partial Wave Analysis of $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$ and Cross Section Measurement of $e^{+}e^{-} \rightarrow π^{\pm}Z_{c}(3900)^{\mp}$ from 4.1271 to 4.3583 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$,…
▽ More
Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$, $f_{0}(980)(\rightarrowπ^{+}π^{-})J/ψ$, and $(π^{+}π^{-})_{\rm{S\mbox{-}wave}} J/ψ$ are measured for the first time. The mass and width of the $Z_{c}(3900)^{\pm}$ are determined to be $3884.6\pm0.7\pm3.3$ MeV/$c^{2}$ and $37.2\pm1.3\pm6.6$ MeV, respectively. The first errors are statistical and the second systematic. The final state $(π^{+}π^{-})_{\rm{S\mbox{-}wave}} J/ψ$ dominates the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. By analyzing the cross sections of $π^{\pm}Z_{c}(3900)^{\mp}$ and $f_{0}(980)J/ψ$, $Y(4220)$ has been observed. Its mass and width are determined to be $4225.8\pm4.2\pm3.1$ MeV/$c^{2}$ and $55.3\pm9.5\pm11.1$ MeV, respectively.
△ Less
Submitted 19 May, 2025;
originally announced May 2025.
-
Observation of $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (678 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII storage ring, the decays $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$ are observed for the first time through the radiative transition $ψ(3686)\toγχ_{cJ}$. The statistical significances for $χ_{cJ}$ signals are all larger than 5$σ$. The branching fractions of $χ_{c0,1,2}\to p\bar{p} ηη$ are deter…
▽ More
Using $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII storage ring, the decays $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$ are observed for the first time through the radiative transition $ψ(3686)\toγχ_{cJ}$. The statistical significances for $χ_{cJ}$ signals are all larger than 5$σ$. The branching fractions of $χ_{c0,1,2}\to p\bar{p} ηη$ are determined to be $({5.75 \pm 0.59 \pm 0.42}) \times 10^{-5}$, $({1.40 \pm 0.33 \pm 0.17}) \times 10^{-5}$, and $({2.64 \pm 0.40 \pm 0.27}) \times 10^{-5}$, respectively, where the first uncertainties are statistical and the second systematic. No evident resonant structures are found in the $p\bar{p}$ and $pη/\bar{p}η$ systems.
△ Less
Submitted 16 September, 2025; v1 submitted 18 May, 2025;
originally announced May 2025.
-
Observation of an Altered $a_{0}(980)$ Line-shape in $D^{+} \rightarrow π^{+}ηη$ due to the Triangle Loop Rescattering Effect
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (705 additional authors not shown)
Abstract:
Using 20.3~${\rm fb}^{-1}$ of $e^{+}e^{-}$ collision data taken with the BESIII detector at the center-of-mass energy 3.773~GeV, we report the first amplitude analysis of the hadronic decay $D^{+} \rightarrow π^{+}ηη$. The intermediate process $D^{+} \to a_{0}(980)^{+}η, a_{0}(980)^{+} \to π^{+}η$ is observed and is found to be the only component and its branching fraction is measured to be…
▽ More
Using 20.3~${\rm fb}^{-1}$ of $e^{+}e^{-}$ collision data taken with the BESIII detector at the center-of-mass energy 3.773~GeV, we report the first amplitude analysis of the hadronic decay $D^{+} \rightarrow π^{+}ηη$. The intermediate process $D^{+} \to a_{0}(980)^{+}η, a_{0}(980)^{+} \to π^{+}η$ is observed and is found to be the only component and its branching fraction is measured to be $(3.67\pm0.12_{\mathrm{stat.}}\pm 0.06_{\mathrm{syst.}})\times 10^{-3}$. Unlike the $a_{0}(980)$ line-shape observed in the decays of charmed mesons to $a_{0}(980)π$ and in the decay $D^{0} \to a_{0}(980)^{-}e^{+}ν_{e}$, where the low-mass side of the $a_0(980)$ is wider than the high-mass side, the $a_{0}(980)$ line-shape in $D^{+} \to a_{0}(980)^{+}η$ is found to be significantly altered, with the high-mass side being wider than the low-mass side. We establish that the $a_0(980)$ line-shape arises from the triangle loop rescattering of $D^+ \to \bar{K}_0^*(1430)^0K^+ \to a_0(980)^+ η$ and $D^+ \to K_0^*(1430)^+\bar{K}^0 \to a_0(980)^+ η$ with a significance of 5.8$σ$. This is the first experimental confirmation of the triangle loop rescattering effect.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Multiplicative and mining property for stability numbers of graphs
Authors:
Metrose Metsidik,
Lixiao Xiao
Abstract:
$f$-vertex stability number $vs_f(G)=\min\{|X|: X\subseteq V(G) \enspace \text{and} \enspace f(G-X)\neq f(G)\}$, and $f$-edge stability number is defined similarly by setting $X\subseteq E(G)$. In this paper, for multiplicative and mining invariant $f$, we give some general bounds for $f$-vertex/edge stability numbers of graphs and some results about the relations between the $f$-vertex/edge stabi…
▽ More
$f$-vertex stability number $vs_f(G)=\min\{|X|: X\subseteq V(G) \enspace \text{and} \enspace f(G-X)\neq f(G)\}$, and $f$-edge stability number is defined similarly by setting $X\subseteq E(G)$. In this paper, for multiplicative and mining invariant $f$, we give some general bounds for $f$-vertex/edge stability numbers of graphs and some results about the relations between the $f$-vertex/edge stability numbers of graphs and their components.
△ Less
Submitted 16 May, 2025;
originally announced May 2025.
-
Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans
Authors:
Yansheng Qiu,
Li Xiao,
Zhaopan Xu,
Pengfei Zhou,
Zheng Wang,
Kaipeng Zhang
Abstract:
The goal of achieving Artificial General Intelligence (AGI) is to imitate humans and surpass them. Models such as OpenAI's o1, o3, and DeepSeek's R1 have demonstrated that large language models (LLMs) with human-like reasoning capabilities exhibit exceptional performance and are being gradually integrated into multimodal large language models (MLLMs). However, whether these models possess capabili…
▽ More
The goal of achieving Artificial General Intelligence (AGI) is to imitate humans and surpass them. Models such as OpenAI's o1, o3, and DeepSeek's R1 have demonstrated that large language models (LLMs) with human-like reasoning capabilities exhibit exceptional performance and are being gradually integrated into multimodal large language models (MLLMs). However, whether these models possess capabilities comparable to humans in handling reasoning tasks remains unclear at present. In this paper, we propose Human-Aligned Bench, a benchmark for fine-grained alignment of multimodal reasoning with human performance. Specifically, we collected 9,794 multimodal questions that solely rely on contextual reasoning, including bilingual (Chinese and English) multimodal questions and pure text-based questions, encompassing four question types: visual reasoning, definition judgment, analogical reasoning, and logical judgment. More importantly, each question is accompanied by human success rates and options that humans are prone to choosing incorrectly. Extensive experiments on the Human-Aligned Bench reveal notable differences between the performance of current MLLMs in multimodal reasoning and human performance. The findings on our benchmark provide insights into the development of the next-generation models.
△ Less
Submitted 23 May, 2025; v1 submitted 16 May, 2025;
originally announced May 2025.
-
Bridging Theory and Perception in Fair Division: A Study on Comparative and Fair Share Notions
Authors:
Hadi Hosseini,
Joshua Kavner,
Samarth Khanna,
Sujoy Sikdar,
Lirong Xia
Abstract:
The allocation of resources among multiple agents is a fundamental problem in both economics and computer science. In these settings, fairness plays a crucial role in ensuring social acceptability and practical implementation of resource allocation algorithms. Traditional fair division solutions have given rise to a variety of approximate fairness notions, often as a response to the challenges pos…
▽ More
The allocation of resources among multiple agents is a fundamental problem in both economics and computer science. In these settings, fairness plays a crucial role in ensuring social acceptability and practical implementation of resource allocation algorithms. Traditional fair division solutions have given rise to a variety of approximate fairness notions, often as a response to the challenges posed by non-existence or computational intractability of exact solutions. However, the inherent incompatibility among these notions raises a critical question: which concept of fairness is most suitable for practical applications? In this paper, we examine two broad frameworks -- threshold-based and comparison-based fairness notions -- and evaluate their perceived fairness through a comprehensive human subject study. Our findings uncover novel insights into the interplay between perception of fairness, theoretical guarantees, the role of externalities and subjective valuations, and underlying cognitive processes, shedding light on the theory and practice of fair division.
△ Less
Submitted 9 June, 2025; v1 submitted 15 May, 2025;
originally announced May 2025.
-
Aggregating Information and Preferences with Bounded-Size Deviations
Authors:
Qishen Han,
Grant Schoenebeck,
Biaoshuai Tao,
Lirong Xia
Abstract:
We investigate a voting scenario with two groups of agents whose preferences depend on a ground truth that cannot be directly observed. The majority's preferences align with the ground truth, while the minorities disagree. Focusing on strategic behavior, we analyze situations where agents can form coalitions up to a certain capacity and adopt the concept of ex-ante Bayesian $k$-strong equilibrium,…
▽ More
We investigate a voting scenario with two groups of agents whose preferences depend on a ground truth that cannot be directly observed. The majority's preferences align with the ground truth, while the minorities disagree. Focusing on strategic behavior, we analyze situations where agents can form coalitions up to a certain capacity and adopt the concept of ex-ante Bayesian $k$-strong equilibrium, in which no group of at most $k$ agents has an incentive to deviate. Our analysis provides a complete characterization of the region where equilibria exist and yield the majority-preferred outcome when the ground truth is common knowledge. This region is defined by two key parameters: the size of the majority group and the maximum coalition capacity. When agents cannot coordinate beyond a certain threshold determined by these parameters, a stable outcome supporting the informed majority emerges. The boundary of this region exhibits several distinct segments, notably including a surprising non-linear relationship between majority size and deviation capacity. Our results reveal the complexity of the strategic behaviors in this type of voting game, which in turn demonstrate the capability of the ex-ante Bayesian $k$-strong equilibrium to provide a more detailed analysis.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
The Art of Two-Round Voting
Authors:
Qishen Han,
Grant Schoenebeck,
Biaoshuai Tao,
Lirong Xia
Abstract:
We study the voting problem with two alternatives where voters' preferences depend on a not-directly-observable state variable. While equilibria in the one-round voting mechanisms lead to a good decision, they are usually hard to compute and follow. We consider the two-round voting mechanism where the first round serves as a polling stage and the winning alternative only depends on the outcome of…
▽ More
We study the voting problem with two alternatives where voters' preferences depend on a not-directly-observable state variable. While equilibria in the one-round voting mechanisms lead to a good decision, they are usually hard to compute and follow. We consider the two-round voting mechanism where the first round serves as a polling stage and the winning alternative only depends on the outcome of the second round. We show that the two-round voting mechanism is a powerful tool for making collective decisions. Firstly, every (approximated) equilibrium in the two-round voting mechanisms (asymptotically) leads to the decision preferred by the majority as if the state of the world were revealed to the voters. Moreover, there exist natural equilibria in the two-round game following intuitive behaviors such as informative voting, sincere voting [Austen-Smith and Banks, 1996], and the surprisingly popular strategy [Prelec et al., 2017]. This sharply contrasts with the one-round voting mechanisms in the previous literature, where no simple equilibrium is known. Finally, we show that every equilibrium in the standard one-round majority vote mechanism gives an equilibrium in the two-round mechanisms that is not more complicated than the one-round equilibrium. Therefore, the two-round voting mechanism provides a natural equilibrium in every instance, including those where one-round voting fails to have a natural solution, and it can reach an informed majority decision whenever one-round voting can. Our experiments on generative AI voters also imply that two-round voting leads to the correct outcome more often than one-round voting under some circumstances.
△ Less
Submitted 15 May, 2025;
originally announced May 2025.
-
CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs
Authors:
Tianhao Cai,
Liang Wang,
Limin Xiao,
Meng Han,
Zeyu Wang,
Lin Sun,
Xiaojian Liao
Abstract:
With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend. Although many methods are proposed in prior works to improve multi-tenant performance, the impact of shared cache is not well studied. This paper proposes CaMDN, an architecture-scheduling co-design to enhance cache efficiency for multi-tenant…
▽ More
With the rapid development of DNN applications, multi-tenant execution, where multiple DNNs are co-located on a single SoC, is becoming a prevailing trend. Although many methods are proposed in prior works to improve multi-tenant performance, the impact of shared cache is not well studied. This paper proposes CaMDN, an architecture-scheduling co-design to enhance cache efficiency for multi-tenant DNNs on integrated NPUs. Specifically, a lightweight architecture is proposed to support model-exclusive, NPU-controlled regions inside shared cache to eliminate unexpected cache contention. Moreover, a cache scheduling method is proposed to improve shared cache utilization. In particular, it includes a cache-aware mapping method for adaptability to the varying available cache capacity and a dynamic allocation algorithm to adjust the usage among co-located DNNs at runtime. Compared to prior works, CaMDN reduces the memory access by 33.4% on average and achieves a model speedup of up to 2.56$\times$ (1.88$\times$ on average).
△ Less
Submitted 10 May, 2025;
originally announced May 2025.
-
Measurement of the phase between strong and electromagnetic amplitudes in the decay $J/ψ\toφη$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (647 additional authors not shown)
Abstract:
The first direct measurement of the relative phase between the strong and electromagnetic amplitudes for a $J/ψ$ decaying into a vector-pseudoscalar final state is performed using 26 energy points of $e^+e^-$ annihilation data between $3.00\ \text{GeV}$ and \mbox{3.12 GeV}. The data sets were collected by the BESIII detector with a total integrated luminosity of 452 pb$^{-1}$. By investigating the…
▽ More
The first direct measurement of the relative phase between the strong and electromagnetic amplitudes for a $J/ψ$ decaying into a vector-pseudoscalar final state is performed using 26 energy points of $e^+e^-$ annihilation data between $3.00\ \text{GeV}$ and \mbox{3.12 GeV}. The data sets were collected by the BESIII detector with a total integrated luminosity of 452 pb$^{-1}$. By investigating the interference pattern in the cross section lineshape of $e^+e^-\toφη$, the relative phase between the strong and electromagnetic amplitudes of $J/ψ$ decay is determined to be within $[133^\circ,228^\circ]$ at 68\% confidence level.
△ Less
Submitted 30 July, 2025; v1 submitted 9 May, 2025;
originally announced May 2025.
-
Deep Learning to Improve the Sensitivity of Higgs Pair Searches in the $4b$ Channel at the LHC
Authors:
Yongcheng Wu,
Liang Xiao,
Yan Zhang
Abstract:
The Higgs self-coupling is crucial for understanding the structure of the scalar potential and the mechanism of electroweak symmetry breaking. In this work, utilizing deep neural network based on Particle Transformer that relies on attention mechanism, we present a comprehensive analysis of the measurement of the trilinear Higgs self-coupling through the Higgs pair production with subsequent decay…
▽ More
The Higgs self-coupling is crucial for understanding the structure of the scalar potential and the mechanism of electroweak symmetry breaking. In this work, utilizing deep neural network based on Particle Transformer that relies on attention mechanism, we present a comprehensive analysis of the measurement of the trilinear Higgs self-coupling through the Higgs pair production with subsequent decay into four $b$-quarks ($HH\to b\bar{b}b\bar{b}$) at the LHC. The model processes full event-level information as input, bypassing explicit jet pairing and can serves as an event classifier. At HL-LHC, our approach constrains the $κ_λ$ to $(-0.53,6.01)$ at 68\% CL achieving over 40\% improvement in precision over conventional cut-based analyses. Comparison against alternative machine learning architectures also shows the outstanding performance of the Transformer-based model, which is mainly due to its ability to capture the correlations in the high-dimensional collision data with the help of attention mechanism. The result highlights the potential of attention-based networks in collider phenomenology.
△ Less
Submitted 7 May, 2025;
originally announced May 2025.
-
Observation of resonant contribution to the $e^+e^-\to Ω^{-}\barΩ^{+}$ around 4.2 GeV and evidence of $ψ(3770)\to Ω^{-}\barΩ^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (625 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 22.7 fb$^{-1}$, collected at center-of-mass energies between 3.7 and 4.7 GeV with the BESIII detector, we present a measurement of energy-dependent cross sections and effective form factors for the process of $e^+e^-\to Ω^{-}\barΩ^+$. By conducting a fit to the cross sections of $e^+e^-\to Ω^{-}\barΩ^+$ considering the…
▽ More
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 22.7 fb$^{-1}$, collected at center-of-mass energies between 3.7 and 4.7 GeV with the BESIII detector, we present a measurement of energy-dependent cross sections and effective form factors for the process of $e^+e^-\to Ω^{-}\barΩ^+$. By conducting a fit to the cross sections of $e^+e^-\to Ω^{-}\barΩ^+$ considering the continuum and resonant contributions, a clear resonant structure in the spectrum around 4.2 GeV is observed for the first time with a statistical significance exceeding 10$σ$, and it can be well described with the line shape of the $Y(4230)$ and $Y(4320)$ observed in $e^+e^-\to π^{+}π^{-}J/ψ$. Evidence for the decay $ψ(3770) \to Ω^-\barΩ^{+}$ is observed with a statistical significance of 4.4$σ$ by analyzing the measured cross sections together with earlier BESIII results, and the branching fraction is firstly measured to be $(4.0\pm1.0\pm0.6)$ $\times$ $10^{-5}$, where the first uncertainty is statistical and the second is systematic.
△ Less
Submitted 30 July, 2025; v1 submitted 6 May, 2025;
originally announced May 2025.
-
Asymptotic representations for Spearman's footrule correlation coefficient
Authors:
Liqi Xia,
Li Guan,
Weimin Xu
Abstract:
In order to address the theoretical challenges arising from the dependence structure of ranks in Spearman's footrule correlation coefficient, we propose two asymptotic representations to approximate the distribution of this coefficient under the hypothesis of independence. The first representation simplifies the dependence structure by replacing empirical distribution functions with their populati…
▽ More
In order to address the theoretical challenges arising from the dependence structure of ranks in Spearman's footrule correlation coefficient, we propose two asymptotic representations to approximate the distribution of this coefficient under the hypothesis of independence. The first representation simplifies the dependence structure by replacing empirical distribution functions with their population counterparts. The second representation leverages the Hájek projection technique to decompose the initial form into a sum of independent components, thereby rigorously justifying asymptotic normality. Simulation studies demonstrate the appropriateness of two proposed asymptotic representations, as well as their excellent approximation to the limiting normal distribution.
△ Less
Submitted 20 July, 2025; v1 submitted 3 May, 2025;
originally announced May 2025.
-
Very Late-Time JWST and Keck Spectra of the Oxygen-Rich Supernova 1995N
Authors:
Geoffrey C. Clayton,
R. Wesson,
Ori D. Fox,
Melissa Shahbandeh,
Alexei V. Filippenko,
Bryony Nickson,
Michael Engesser,
Schuyler D. Van Dyk,
WeiKang Zheng,
Thomas G. Brink,
Yi Yang,
Tea Temim,
Nathan Smith,
Jennifer Andrews,
Chris Ashall,
Ilse De Looze,
James M. Derkacy,
Luc Dessart,
Michael Dulude,
Eli Dwek,
Ryan J. Foley,
Suvi Gezari,
Sebastian Gomez,
Shireen Gonzaga,
Siva Indukuri
, et al. (21 additional authors not shown)
Abstract:
We present new {\it JWST}/MIRI MRS and Keck spectra of SN 1995N obtained in 2022--2023, more than 10,000 days after the supernova (SN) explosion. These spectra are among the latest direct detections of a core-collapse SN, both through emission lines in the optical and thermal continuum from infrared dust emission. The new infrared data show that dust heating from radiation produced by the ejecta i…
▽ More
We present new {\it JWST}/MIRI MRS and Keck spectra of SN 1995N obtained in 2022--2023, more than 10,000 days after the supernova (SN) explosion. These spectra are among the latest direct detections of a core-collapse SN, both through emission lines in the optical and thermal continuum from infrared dust emission. The new infrared data show that dust heating from radiation produced by the ejecta interacting with circumstellar matter is still present, but greatly reduced from when SN 1995N was observed by the {\it Spitzer Space Telescope} and {\it WISE} in 2009/2010 and 2018, when the dust mass was estimated to be 0.4 M(Sun). New radiative-transfer modeling suggests that the dust mass and grain size may have increased between 2010 and 2023. The new data can alternatively be well fit with a dust mass of 0.4 M(Sun) and a much reduced heating source luminosity. The new late-time spectra show unusually strong oxygen forbidden lines, stronger than the H-alpha emission. This indicates that SN 1995N may have exploded as a stripped-envelope SN which then interacted with a massive H-rich circumstellar shell, changing it from intrinsically Type Ib/c to Type IIn. The late-time spectrum results when the reverse shock begins to excite the inner H-poor, O-rich ejecta. This change in the spectrum is rarely seen, but marks the start of the transition from SN to SN remnant.
△ Less
Submitted 2 May, 2025;
originally announced May 2025.
-
Stable self-charged perovskite quantum rods for liquid laser with near-zero threshold
Authors:
Jialu Li,
Xue Han,
Wenjie Wang,
Jinhui Wang,
Tingting Zhang,
Yuting Wu,
Guofeng Zhang,
Bin Li,
Changgang Yang,
Wenli Guo,
Mi Zhang,
Ruiyun Chen,
Chengbing Qin,
Jianyong Hu,
Zhichun Yang,
Shaoding Liu,
Yue Wang,
Yunan Gao,
Jie Ma,
Liantuan Xiao,
Suotang Jia
Abstract:
Colloidal quantum dots (QDs) are promising optical gain materials that require further threshold reduction to realize their full potential. While QD charging theoretically reduces the threshold to zero, its effectiveness has been limited by strong Auger recombination and unstable charging. Here we theoretically reveal the optimal combination of charging number and Auger recombination to minimize t…
▽ More
Colloidal quantum dots (QDs) are promising optical gain materials that require further threshold reduction to realize their full potential. While QD charging theoretically reduces the threshold to zero, its effectiveness has been limited by strong Auger recombination and unstable charging. Here we theoretically reveal the optimal combination of charging number and Auger recombination to minimize the lasing threshold. Experimentally, we develop stable self-charged perovskite quantum rods (QRs) as an alternative to QDs via state engineering and Mn-doping strategy. An unprecedented two-order-of-magnitude reduction in nonradiative Auger recombination enables QRs to support a sufficient charging number of up to 6. The QR liquid lasing is then achieved with a near-zero threshold of 0.098 using quasi-continuous pumping of nanosecond pulses, which is the lowest threshold among all reported QD lasers. These achievements demonstrate the potential of the specially engineered QRs as an excellent gain media and pave the way for their prospective applications.
△ Less
Submitted 1 May, 2025;
originally announced May 2025.
-
Search for the lepton number violation decay $ω\to π^+ π^+ e^-e^- +c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (698 additional authors not shown)
Abstract:
The lepton number violation decay $ω\to π^+ π^+ e^-e^- +c.c.$ is searched for via $J/ψ\to ωη$ using a data sample of $(1.0087 \pm 0.0044) \times 10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction of $ω\to π^+ π^+ e^-e^- +c.c.$ at the 90\% confidence level is determined for the first time to…
▽ More
The lepton number violation decay $ω\to π^+ π^+ e^-e^- +c.c.$ is searched for via $J/ψ\to ωη$ using a data sample of $(1.0087 \pm 0.0044) \times 10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction of $ω\to π^+ π^+ e^-e^- +c.c.$ at the 90\% confidence level is determined for the first time to be $2.8 \times 10^{-6}$.
△ Less
Submitted 30 April, 2025;
originally announced April 2025.
-
Enhancing Cosmological Constraints by Two-dimensional $β$-cosmic-web Weighted Angular Correlation Functions
Authors:
Fenfen Yin,
Liang Xiao,
Wenying Du,
Zhujun Jiang,
Zhiwei Min,
Jaime Forero-Romero,
Jiacheng Ding,
Le Zhang,
Xiao-Dong Li
Abstract:
In this study, we investigate the potential of mark-weighted angular correlation functions (MACFs), which integrate $β$-cosmic-web classification with angular correlation function analysis to improve cosmological constraints. Using SDSS DR12 CMASS-NGC galaxies and mock catalogs with $Ω_m$ varying from 0.25 to 0.40, we assess the discriminative power of different statistics via the average improvem…
▽ More
In this study, we investigate the potential of mark-weighted angular correlation functions (MACFs), which integrate $β$-cosmic-web classification with angular correlation function analysis to improve cosmological constraints. Using SDSS DR12 CMASS-NGC galaxies and mock catalogs with $Ω_m$ varying from 0.25 to 0.40, we assess the discriminative power of different statistics via the average improvement in chi-squared, $Δ\overline{χ^2}$, across six redshift bins. This metric quantifies how effectively each statistic distinguishes between different cosmological models. Incorporating cosmic-web weights leads to substantial improvements. Using statistics weighted by the mean neighbor distance ($\bar{D}_{\rm nei}$) increases $Δ\overline{χ^2}$ by approximately 40%-130%, while applying inverse mean neighbor distance weighting ($1/\bar{D}_{\rm nei}$) yields even larger gains, boosting $Δ\overline{χ^2}$ by a factor of 2-3 compared to traditional unweighted angular statistics. These enhancements are consistent with previous 3D clustering results, demonstrating the superior sensitivity of the $β$-weighted approaches. Our method, based on thin redshift slices, is particularly suited for slitless surveys (e.g., Euclid, CSST) where redshift uncertainties limit 3D analyses. This study also offers a framework for applying marked statistics to 2D angular clustering.
△ Less
Submitted 30 May, 2025; v1 submitted 30 April, 2025;
originally announced April 2025.
-
Tomographic Alcock-Paczynski Test with Marked Correlation Functions
Authors:
Liang Xiao,
Limin Lai,
Zhujun Jiang,
Xiao-Dong Li,
Le Zhang
Abstract:
The tomographic Alcock-Paczynski (AP) method, developed over the past decade, exploits redshift evolution for cosmological determination, aiming to mitigate contamination from redshift distortions and capture nonlinear scale information. Marked Correlation Functions (MCFs) extend information beyond the two-point correlation. For the first time, this study integrated the tomographic AP test with MC…
▽ More
The tomographic Alcock-Paczynski (AP) method, developed over the past decade, exploits redshift evolution for cosmological determination, aiming to mitigate contamination from redshift distortions and capture nonlinear scale information. Marked Correlation Functions (MCFs) extend information beyond the two-point correlation. For the first time, this study integrated the tomographic AP test with MCFs to constrain the flat $w$CDM cosmology model. Our findings show that multiple density weights in MCFs outperform the traditional two-point correlation function, reducing the uncertainties of the matter density parameter $Ω_m$ and dark energy equation of state $w$ by 48\% and 45\%, respectively. Furthermore, we introduce a novel principal component analysis (PCA) compression scheme that efficiently projects high-dimensional statistical measurements into a compact set of eigenmodes while preserving most of the cosmological information. This approach retains significantly more information than traditional coarse binning methods, which simply average adjacent bins in a lossy manner. Applying PCA compression also enables the effective use of marked correlation functions in 2D $(s,μ)$ space, yielding an additional $\sim 50\%$ reduction in error margins. To assess robustness, we incorporate realistic redshift errors expected in future spectroscopic surveys. While these errors modestly degrade cosmological constraints, our combined framework, which utiizes MCFs and PCA compression within tomographic AP tests, is less affected and always yield to tight cosmological constraints. This scheme remains highly promising for upcoming slitless spectroscopic surveys, such as the Chinese Space Station Telescope (CSST).
△ Less
Submitted 15 August, 2025; v1 submitted 29 April, 2025;
originally announced April 2025.