-
Three quantities arising from Bézout's identity and resultants of integer polynomials
Authors:
Zhiqian Liu,
Xiaoting Li,
Wenheng Liu,
Min Sha
Abstract:
In this paper, we study three quantities arising naturally from Bézout's identity, the resultant and the reduced resultant of two non-zero coprime integer polynomials. We establish several new divisibility relations among them. We also pose two conjectures by making computations.
Submitted 12 June, 2025;
originally announced June 2025.
-
Search for sub-GeV invisible particles in inclusive decays of $J/ψ$ to $φ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (704 additional authors not shown)
Abstract:
A search for an invisible particle, $X$, with a mass between 0 and 0.96 $\textrm{GeV}/\textit{c}^{2}$, is performed in the process $J/ψ\rightarrowφ+ X$ using $(8774.0\pm39.4)\times10^{6}$ $J/ψ$ events collected with the BESIII detector from 2017 to 2019. The $φ$ meson is fully reconstructed and an efficient veto of photons, neutral and charged hadrons up to twice the $K_L^0$ mass is applied to the rest of the events, and the recoil mass against the $φ$ is obtained precisely from the kinematic constraint in the event. No significant signal is observed in the investigated region and the upper limit on the inclusive branching fraction of $J/ψ\rightarrowφ+ X$ is determined to be $7.5\times10^{-8}$ at the 90% confidence level. Upper limits at the 90% confidence level are also given for this branching fraction as a function of the invisible particle mass, varying from $9\times10^{-9}$ to $4\times10^{-8}$ over the investigated mass range. Additionally, a 90% confidence level upper limit on the branching fraction of $η\rightarrow \rm{invisible}$ is determined to be $2.6\times10^{-5}$, which improves on the previous best results by more than a factor of four. The analysis technique in this work offers a clean window to search for sub-GeV invisible particles; it can be adapted for other $J/ψ$ decays and direct $e^+e^-$ annihilation experiments in future studies, and could improve the sensitivity by orders of magnitude.
Submitted 11 June, 2025;
originally announced June 2025.
-
Constructive interference at the edge of quantum ergodic dynamics
Authors:
Dmitry A. Abanin,
Rajeev Acharya,
Laleh Aghababaie-Beni,
Georg Aigeldinger,
Ashok Ajoy,
Ross Alcaraz,
Igor Aleiner,
Trond I. Andersen,
Markus Ansmann,
Frank Arute,
Kunal Arya,
Abraham Asfaw,
Nikita Astrakhantsev,
Juan Atalaya,
Ryan Babbush,
Dave Bacon,
Brian Ballard,
Joseph C. Bardin,
Christian Bengs,
Andreas Bengtsson,
Alexander Bilmes,
Sergio Boixo,
Gina Bortoli,
Alexandre Bourassa,
Jenna Bovaird
, et al. (240 additional authors not shown)
Abstract:
Quantum observables in the form of few-point correlators are the key to characterizing the dynamics of quantum many-body systems. In dynamics with fast entanglement generation, quantum observables generally become insensitive to the details of the underlying dynamics at long times due to the effects of scrambling. In experimental systems, repeated time-reversal protocols have been successfully implemented to restore sensitivities of quantum observables. Using a 103-qubit superconducting quantum processor, we characterize ergodic dynamics using the second-order out-of-time-order correlators, OTOC$^{(2)}$. In contrast to dynamics without time reversal, OTOC$^{(2)}$ are observed to remain sensitive to the underlying dynamics at long time scales. Furthermore, by inserting Pauli operators during quantum evolution and randomizing the phases of Pauli strings in the Heisenberg picture, we observe substantial changes in OTOC$^{(2)}$ values. This indicates that OTOC$^{(2)}$ is dominated by constructive interference between Pauli strings that form large loops in configuration space. The observed interference mechanism endows OTOC$^{(2)}$ with a high degree of classical simulation complexity, which culminates in a set of large-scale OTOC$^{(2)}$ measurements exceeding the simulation capacity of known classical algorithms. Further supported by an example of Hamiltonian learning through OTOC$^{(2)}$, our results indicate a viable path to practical quantum advantage.
Submitted 11 June, 2025;
originally announced June 2025.
-
Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling
Authors:
Tim Z. Xiao,
Johannes Zenn,
Zhen Liu,
Weiyang Liu,
Robert Bamler,
Bernhard Schölkopf
Abstract:
Large language models (LLMs) can often accurately describe probability distributions using natural language, yet they still struggle to generate faithful samples from them. This mismatch limits their use in tasks requiring reliable stochasticity, such as Monte Carlo methods, agent-based simulations, and randomized decision-making. We investigate this gap between knowledge and sampling in the context of Bernoulli distributions. We introduce Verbalized Rejection Sampling (VRS), a natural-language adaptation of classical rejection sampling that prompts the LLM to reason about and accept or reject proposed samples. Despite relying on the same Bernoulli mechanism internally, VRS substantially reduces sampling bias across models. We provide theoretical analysis showing that, under mild assumptions, VRS improves over direct sampling, with gains attributable to both the algorithm and prompt design. More broadly, our results show how classical probabilistic tools can be verbalized and embedded into LLM workflows to improve reliability, without requiring access to model internals or heavy prompt engineering.
Submitted 11 June, 2025;
originally announced June 2025.
-
Optimization and Control Technologies for Renewable-Dominated Hydrogen-Blended Integrated Gas-Electricity System: A Review
Authors:
Wenxin Liu,
Jiakun Fang,
Shichang Cui,
Iskandar Abdullaev,
Suyang Zhou,
Xiaomeng Ai,
Jinyu Wen
Abstract:
The growing coupling among electricity, gas, and hydrogen systems is driven by green hydrogen blending into existing natural gas pipelines, paving the way toward a renewable-dominated energy future. However, the integration poses significant challenges, particularly ensuring efficient and safe operation under varying hydrogen penetration and infrastructure adaptability. This paper reviews progress in optimization and control technologies for hydrogen-blended integrated gas-electricity systems. First, key technologies and international demonstration projects are introduced to provide an overview of current developments. Next, advances in gas-electricity system integration, including modeling, scheduling, planning and market design, are reviewed in turn. Then, the potential for cross-system fault propagation is highlighted, and practical methods for safety analysis and control are proposed. Finally, several possible research directions are introduced, aiming to ensure efficient renewable integration and reliable operation.
Submitted 11 June, 2025;
originally announced June 2025.
-
Search for the charmonium weak decays $J/ψ\to D_{s}^{-}ρ^{+}+c.c.$ and $J/ψ\to D_{s}^{-}π^{+}+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (705 additional authors not shown)
Abstract:
Based on $(10087\pm44)\times 10^6$ $J/ψ$ events recorded with the BESIII detector, we search for the rare charmonium weak decays $J/ψ\to D_{s}^{-}ρ^{+}+c.c.$ and $J/ψ\to D_{s}^{-}π^{+}+c.c.$ No signal is observed, and upper limits on the branching fractions at the $90\%$ confidence level are set as $\mathcal{B}(J/ψ\to D_{s}^{-}ρ^{+}+c.c.)<8.0\times10^{-7}$ and $\mathcal{B}(J/ψ\to D_{s}^{-}π^{+}+c.c.)<4.1\times10^{-7}$. Our results provide the most stringent experimental constraints on these decays.
Submitted 11 June, 2025;
originally announced June 2025.
-
Detection of Ultra-Trace Heavy metals in Aerosols with pg^m3 Sensitivity Using Filament-Induced Fluorescence Spectroscopy
Authors:
Yuezheng Wang,
Lu Sun,
Zhiwenqi An,
Jiayun Xue,
Zhixuan An,
Nan Zhang,
Lie Lin,
Weiwei Liu
Abstract:
Heavy metal pollution, particularly in the form of airborne aerosols such as lead (Pb), cadmium (Cd), mercury (Hg), and cobalt (Co), poses serious health and environmental risks, necessitating highly sensitive remote detection techniques. In this study, Filament-Induced Fluorescence Spectroscopy (FIFS) was employed to detect ultra-trace concentrations of heavy metal aerosols with high sensitivity and stability. By systematically optimizing the balance between filament length and detection distance, the optimal detection distance under the current experimental conditions was determined. With a detection distance of 10 m, this work achieved a minimum detectable concentration of 0.47 pg m^-3 for Pb and an extrapolated limit of detection (LOD) of 0.3 pg m^-3, with excellent signal stability (RSD < 7%) over a concentration range from 0.47 pg m^-3 to 0.47 g m^-3. Additionally, Cd, Hg, and Co aerosols were also successfully detected under the same conditions, with detection limits of 2 pg m^-3, 0.25 pg m^-3, and 3 pg m^-3, respectively, further confirming the versatility of FIFS in detecting diverse heavy metals. Theoretical predictions suggest that increasing laser power could further enhance the detection capability. These results highlight the ultra-sensitive remote detection capability of FIFS for heavy metal aerosol detection and provide valuable insights for optimizing system parameters to enhance its application performance in environmental monitoring.
Submitted 10 June, 2025;
originally announced June 2025.
-
LLaVA-c: Continual Improved Visual Instruction Tuning
Authors:
Wenzhuo Liu,
Fei Zhu,
Haiyang Guo,
Longhui Wei,
Cheng-Lin Liu
Abstract:
Multimodal models like LLaVA-1.5 achieve state-of-the-art visual understanding through visual instruction tuning on multitask datasets, enabling strong instruction-following and multimodal performance. However, multitask learning faces challenges such as task balancing, requiring careful adjustment of data proportions, and expansion costs, where new tasks risk catastrophic forgetting and need costly retraining. Continual learning provides a promising alternative to acquiring new knowledge incrementally while preserving existing capabilities. However, current methods prioritize task-specific performance, neglecting base model degradation from overfitting to specific instructions, which undermines general capabilities. In this work, we propose a simple but effective method with two modifications on LLaVA-1.5: spectral-aware consolidation for improved task balance and unsupervised inquiry regularization to prevent base model degradation. We evaluate both general and task-specific performance across continual pretraining and fine-tuning. Experiments demonstrate that LLaVA-c consistently enhances standard benchmark performance and preserves general capabilities. For the first time, we show that task-by-task continual learning can achieve results that match or surpass multitask joint learning. The code will be publicly released.
Submitted 10 June, 2025;
originally announced June 2025.
-
Measurement of $ψ(2S)$ to $J/ψ$ cross-section ratio as function of multiplicity in $p$Pb collisions at $\sqrt{s_{NN}} = 8.16$ TeV
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1137 additional authors not shown)
Abstract:
The production ratio of $ψ(2S)$ to $J/ψ$ charmonium states is presented as a function of multiplicity in proton-lead collisions at a centre-of-mass energy of $\sqrt{s_{NN}}=8.16$ TeV, for both prompt and nonprompt sources. The total luminosity recorded by the LHCb experiment corresponds to 13.6 $pb^{-1}$ for $p$Pb collisions and 20.8 $pb^{-1}$ for Pb$p$ collisions, where the first particle indicates the forward direction of the detector. Measurements are performed in the dimuon final state at forward (backward) centre-of-mass rapidity $1.5<y^*<4.0$ ($-5.0<y^*<-2.5$) for $p$Pb (Pb$p$) collisions. A multiplicity dependence of the prompt production ratio is observed in $p$Pb collisions, whereas no dependence is found in nonprompt production, nor in either prompt or nonprompt production in Pb$p$ collisions. These results suggest that in the Pb-going direction additional suppression mechanisms beyond comover effects may be present, possibly related to the formation of quark-gluon plasma. This highlights a transition from small to large collision systems and provides important insight into the suppression of charmonia in proton-nucleus collisions.
Submitted 12 June, 2025; v1 submitted 10 June, 2025;
originally announced June 2025.
-
Measurement of the $η$ transition form factor through $η' \rightarrow π^+π^-η$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
Based on a sample of $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected at BESIII, the transition form factor of the $η$ meson is extracted by analyzing $J/ψ\toγη',~η'\toπ^+π^-η,~η\toγl^+l^-$ ($l$=$e$, $μ$) events. The measured slope of the transition form factor is $Λ^{-2}=1.645\pm0.093_{\rm stat.}\pm {0.024_{\rm sys.}}$ (GeV/$c^2$)$^{-2}$ for the di-electron channel and $Λ^{-2}=1.645\pm0.343_{\rm stat.}\pm0.017_{\rm sys.}$ (GeV/$c^2$)$^{-2}$ for the di-muon channel. The branching fractions for $η\rightarrowγe^+e^-$ and $η\rightarrowγμ^+μ^-$ are measured to be $\mathcal{B}(η\toγe^+e^-)=(6.79\pm0.04_{\rm stat.}\pm0.36_{\rm sys.})\times 10^{-3}$ and $\mathcal{B}(η\toγμ^+μ^-)=(2.97\pm0.11_{\rm stat.}\pm0.07_{\rm sys.})\times 10^{-4}$. By combining with the results based on the $J/ψ\toγη,~η\toγe^+e^-$ events from the previous BESIII measurement, we determine $Λ^{-2}=1.707\pm0.076_{\rm stat.}\pm0.029_{\rm sys.}$ (GeV/$c^2$)$^{-2}$ and $\mathcal{B}(η\toγe^+e^-)=(6.93\pm0.28_{\rm tot.})\times 10^{-3}$. In addition, we search for the dark photon ($A'$) using the combined events. No significant signal is observed, and the upper limits on $\mathcal{B}(η\toγA',~A'\to e^+e^-)$ are set at 90\% confidence level for different $A'$ mass hypotheses.
Submitted 10 June, 2025;
originally announced June 2025.
-
Learning event-triggered controllers for linear parameter-varying systems from data
Authors:
Renjie Ma,
Su Zhang,
Wenjie Liu,
Zhijian Hu,
Peng Shi
Abstract:
Nonlinear dynamical behaviours in engineering applications can be approximated by linear parameter-varying (LPV) representations, but obtaining precise model knowledge to develop a control algorithm is difficult in practice. In this paper, we develop data-driven control strategies for event-triggered LPV systems with stability verification. First, we provide a theoretical analysis of $θ$-persistence of excitation for LPV systems, which leads to feasible data-based representations. Then, in terms of the available perturbed data, we derive stability certificates for event-triggered LPV systems with the aid of Petersen's lemma in the sense of robust control, resulting in computationally tractable semidefinite programs whose feasible solutions yield the optimal gain schedules. We also generalize the data-driven event-triggered LPV control methods to the scenario of reference trajectory tracking, and discuss the robust tracking stability accordingly. Finally, we verify the effectiveness of our theoretical derivations by numerical simulations.
Submitted 9 June, 2025;
originally announced June 2025.
-
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Authors:
Yongkang Li,
Kaixin Xiong,
Xiangyu Guo,
Fang Li,
Sixu Yan,
Gangwei Xu,
Lijun Zhou,
Long Chen,
Haiyang Sun,
Bing Wang,
Guang Chen,
Hangjun Ye,
Wenyu Liu,
Xinggang Wang
Abstract:
Although end-to-end autonomous driving has made remarkable progress, its performance degrades significantly in rare and long-tail scenarios. Recent approaches attempt to address this challenge by leveraging the rich world knowledge of Vision-Language Models (VLMs), but these methods suffer from several limitations: (1) a significant domain gap between the pre-training data of VLMs and real-world driving data, (2) a dimensionality mismatch between the discrete language space and the continuous action space, and (3) imitation learning tends to capture the average behavior present in the dataset, which may be suboptimal or even dangerous. In this paper, we propose ReCogDrive, an autonomous driving system that integrates VLMs with a diffusion planner and adopts a three-stage training paradigm. In the first stage, we use a large-scale driving question-answering dataset to train the VLMs, mitigating the domain discrepancy between generic content and real-world driving scenarios. In the second stage, we employ a diffusion-based planner to perform imitation learning, mapping representations from the latent language space to continuous driving actions. Finally, we fine-tune the diffusion planner using reinforcement learning with the NAVSIM non-reactive simulator, enabling the model to generate safer, more human-like driving trajectories. We evaluate our approach on the planning-oriented NAVSIM benchmark, achieving a PDMS of 89.6 and setting a new state-of-the-art that surpasses the previous vision-only SOTA by 5.6 PDMS.
Submitted 8 June, 2025;
originally announced June 2025.
-
Reparameterized LLM Training via Orthogonal Equivalence Transformation
Authors:
Zeju Qiu,
Simon Buchholz,
Tim Z. Xiao,
Maximilian Dax,
Bernhard Schölkopf,
Weiyang Liu
Abstract:
While large language models (LLMs) are driving the rapid advancement of artificial intelligence, effectively and reliably training these large models remains one of the field's most significant challenges. To address this challenge, we propose POET, a novel reParameterized training algorithm that uses Orthogonal Equivalence Transformation to optimize neurons. Specifically, POET reparameterizes each neuron with two learnable orthogonal matrices and a fixed random weight matrix. Because of its provable preservation of spectral properties of weight matrices, POET can stably optimize the objective function with improved generalization. We further develop efficient approximations that make POET flexible and scalable for training large-scale neural networks. Extensive experiments validate the effectiveness and scalability of POET in training LLMs.
Submitted 9 June, 2025;
originally announced June 2025.
-
A novel measurement of the strong-phase difference between $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decays using $C$-even and $C$-odd quantum-correlated $D\bar{D}$ pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (707 additional authors not shown)
Abstract:
A novel measurement technique of strong-phase differences between the decay amplitudes of $D^0$ and $\bar{D}^0$ mesons is introduced which exploits quantum-correlated $D\bar{D}$ pairs produced by $e^+e^-$ collisions at energies above the $ψ(3770)$ production threshold, where $D\bar{D}$ pairs are produced in both even and odd eigenstates of the charge-conjugation symmetry. Employing this technique, the first determination of a $D^0$-$\bar{D^0}$ relative strong phase is reported with such data samples. The strong-phase difference between $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decays, $δ^{D}_{Kπ}$, is measured to be $δ^{D}_{Kπ}=\left(192.8^{+11.0 + 1.9}_{-12.4 -2.4}\right)^\circ$, using a dataset corresponding to an integrated luminosity of 7.13 $\text{fb}^{-1}$ collected at center-of-mass energies between $4.13-4.23 \text{ GeV}$ by the BESIII experiment.
Submitted 10 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
First observation of quantum correlations in $e^+e^-\to XD\bar{D}$ and $C$-even constrained $D\bar{D}$ pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (707 additional authors not shown)
Abstract:
The study of meson pairs produced with quantum correlations gives direct access to parameters that are challenging to measure in other systems. In this Letter, the existence of quantum correlations due to charge-conjugation symmetry $C$ is demonstrated in $D\bar{D}$ pairs produced through the processes $e^+e^-\to D\bar{D}$, $e^+e^- \to D^{*}\bar{D}$, and $e^+e^- \to D^{*} \bar{D}^*$, where the lack of charge superscripts refers to an admixture of neutral-charm-meson particle and antiparticle states, using $7.13 \text{ fb}^{-1}$ of $e^+e^-$ collision data collected by the BESIII experiment between center-of-mass energies of $4.13-4.23 \text{ GeV}$. Processes with either $C$-even or $C$-odd constraints are identified and separated. A procedure is presented that harnesses the entangled production process to enable measurements of $D^0$-meson hadronic parameters. This study provides the first confirmation of quantum correlations in $e^+e^-\to X D\bar{D}$ processes and the first observation of a $C$-even constrained $D\bar{D}$ system. The procedure is applied to measure $δ^{D}_{Kπ}$, the strong phase between the $D^0\to K^-π^+$ and $\bar{D}^0\to K^-π^+$ decay amplitudes, which results in the determination of $δ^{D}_{Kπ}=\left(192.8^{+11.0 + 1.9}_{-12.4 -2.4}\right)^\circ$. The potential for measurements of other hadronic decay parameters and charm mixing with these and future datasets is also discussed.
Submitted 10 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
Intense THz s-SNOM for nonlinearity engineering in nanoscale
Authors:
Pengfei Qi,
Zeliang Zhang,
Wenqi Qian,
Zijie Dai,
Xingyou Li,
Lu Sun,
See Leang Chin,
Pierre Agostini,
Weiwei Liu
Abstract:
Terahertz (THz) nonlinear optics offers powerful tools to investigate and manipulate electronic dynamics in condensed matter. Confining high-peak-power THz pulses within the near field can effectively generate electromagnetic fields that are extremely localized in space and time, enabling precise exploration and control of transient carrier dynamics from the perspective of THz nonlinearity. However, combining high-peak-power THz pulses with near-field optical techniques remains challenging because of the incompatibility between low-repetition-rate THz pulses and typical near-field demodulation schemes. Here, we construct high-peak-power THz scattering scanning near-field microscopy (THz s-SNOM) by combining THz pulses emitted from two-color femtosecond laser filaments with tapping-mode atomic force microscopy (AFM), and explore efficient THz third-harmonic generation (THG) from a Cd3As2 film at the nanoscale. The power-law dependence of the THz harmonics and theoretical calculations reveal convincing third-harmonic generation, which is attributed to the nonequilibrium intraband dynamics driven by the strong THz pulses. In particular, nanoscopic near-field THz third-harmonic imaging of a 3D Dirac semimetal with a resolution of 200 nm (λ/3000) is demonstrated. The high-peak-power THz s-SNOM can provide a platform for exploring and manipulating the nonlinear physics, carrier dynamics and quantum coherent phenomena driven by the localized THz field with nanoscale resolution, thereby guiding the development of integrated high-performance nonlinear photonic devices.
Submitted 9 June, 2025;
originally announced June 2025.
-
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
Authors:
Xiangyu Guo,
Zhanqian Wu,
Kaixin Xiong,
Ziyang Xu,
Lijun Zhou,
Gangwei Xu,
Shaoqing Xu,
Haiyang Sun,
Bing Wang,
Guang Chen,
Hangjun Ye,
Wenyu Liu,
Xinggang Wang
Abstract:
We present Genesis, a unified framework for joint generation of multi-view driving videos and LiDAR sequences with spatio-temporal and cross-modal consistency. Genesis employs a two-stage architecture that integrates a DiT-based video diffusion model with 3D-VAE encoding, and a BEV-aware LiDAR generator with NeRF-based rendering and adaptive sampling. Both modalities are directly coupled through a shared latent space, enabling coherent evolution across visual and geometric domains. To guide the generation with structured semantics, we introduce DataCrafter, a captioning module built on vision-language models that provides scene-level and instance-level supervision. Extensive experiments on the nuScenes benchmark demonstrate that Genesis achieves state-of-the-art performance across video and LiDAR metrics (FVD 16.95, FID 4.24, Chamfer 0.611), and benefits downstream tasks including segmentation and 3D detection, validating the semantic fidelity and practical utility of the generated data.
Submitted 11 June, 2025; v1 submitted 9 June, 2025;
originally announced June 2025.
-
Scalable Neural Quantum State based Kernel Polynomial Method for Optical Properties from the First Principle
Authors:
Wei Liu,
Rui-Hao Bi,
Wenjie Dou
Abstract:
Variational optimization of neural-network quantum state representations has achieved FCI-level accuracy for ground state calculations, yet computing optical properties involving excited states remains challenging. In this work, we present a neural-network-based variational quantum Monte Carlo approach for ab-initio absorption spectra. We leverage parallel batch autoregressive sampling and GPU-supported local energy parallelism to efficiently compute ground states of complex systems. By integrating neural quantum ground states with the kernel polynomial method, our approach accurately calculates absorption spectra for large molecules with over 50 electrons, achieving FCI-level precision. The proposed algorithm demonstrates superior scalability and reduced runtime compared to FCI, marking a significant step forward in optical property calculations for large-scale quantum systems.
Submitted 9 June, 2025;
originally announced June 2025.
-
KnowCoder-V2: Deep Knowledge Analysis
Authors:
Zixuan Li,
Wenxuan Liu,
Long Bai,
Chunmao Zhang,
Wei Li,
Fenghui Zhang,
Quanxin Jin,
Ruoyun He,
Zhuo Chen,
Zhilei Hu,
Fei Wang,
Bingbing Xu,
Xuhui Jiang,
Xiaolong Jin,
Jiafeng Guo,
Xueqi Cheng
Abstract:
Deep knowledge analysis tasks always involve the systematic extraction and association of knowledge from large volumes of data, followed by logical reasoning to discover insights. However, to solve such complex tasks, existing deep research frameworks face three major challenges: 1) They lack systematic organization and management of knowledge; 2) They operate purely online, making them inefficient for tasks that rely on shared and large-scale knowledge; 3) They cannot perform complex knowledge computation, limiting their ability to produce insightful analytical results. Motivated by these challenges, in this paper we propose a Knowledgeable Deep Research (KDR) framework that empowers deep research with deep knowledge analysis capability. Specifically, it introduces an independent knowledge organization phase to preprocess large-scale, domain-relevant data into systematic knowledge offline. Based on this knowledge, it extends deep research with an additional kind of reasoning step that performs complex knowledge computation in an online manner. To enhance the ability of LLMs to solve knowledge analysis tasks in the above framework, we further introduce KnowCoder-V2, an LLM that bridges knowledge organization and reasoning via unified code generation. For knowledge organization, it generates instantiation code for predefined classes, transforming data into knowledge objects. For knowledge computation, it generates analysis code and executes it on the above knowledge objects to obtain deep analysis results. Experimental results on more than thirty datasets across six knowledge analysis tasks demonstrate the effectiveness of KnowCoder-V2. Moreover, when integrated into the KDR framework, KnowCoder-V2 can generate high-quality reports with insightful analytical results compared to the mainstream deep research framework.
Submitted 7 June, 2025;
originally announced June 2025.
-
The Identification of Two JWST/NIRCam-Dark Starburst Galaxies at $z=6.6$ with ALMA
Authors:
Fengwu Sun,
Jinyi Yang,
Feige Wang,
Daniel J. Eisenstein,
Roberto Decarli,
Xiaohui Fan,
George H. Rieke,
Eduardo Bañados,
Sarah E. I. Bosman,
Zheng Cai,
Jaclyn B. Champagne,
Luis Colina,
Francesco D'Eugenio,
Yoshinobu Fudamoto,
Mingyu Li,
Xiaojing Lin,
Weizhe Liu,
Jianwei Lyu,
Chiara Mazzucchelli,
Xiangyu Jin,
Hyunsung D. Jun,
Yunjing Wu,
Huanian Zhang
Abstract:
We analyze two dusty star-forming galaxies at $z=6.6$. These galaxies are selected from the ASPIRE survey, a JWST Cycle-1 medium and ALMA Cycle-9 large program targeting 25 quasars and their environments at $z\simeq6.5 - 6.8$. These galaxies are identified as companions to UV-luminous quasars and robustly detected in ALMA continuum and [C II] emission, yet they are extraordinarily faint at the NIRCam wavelengths (down to $>28.0$ AB mag in the F356W band). They are more obscured than galaxies like Arp220, and thus we refer to them as "NIRCam-dark" starburst galaxies (star formation rate $\simeq 80 - 250\,\mathrm{M}_{\odot}\,\mathrm{yr}^{-1}$). Such galaxies are typically missed by (sub)-millimeter blank-field surveys. From the star-formation history (SFH), we show that the NIRCam-dark galaxies are viable progenitors of massive quiescent galaxies at $z\gtrsim4$ and descendants of UV-luminous galaxies at $z>10$. Although it is hard to constrain their number density from a quasar survey, we conclude that NIRCam-dark galaxies can be as abundant as $n\sim10^{-5.5}$ Mpc$^{-3}$ assuming a light halo occupation model. If true, this would equal $\sim$30% of the number densities of both the quiescent galaxies at $z\gtrsim4$ and UV-luminous galaxies at $z>10$. We further predict that analogs at $z\sim8$ should exist according to the SFH of early massive quiescent galaxies. However, they may fall below the current detection limits of wide JWST and ALMA surveys, thus remaining "JWST-dark". To fully trace the evolution of massive galaxies and dust-obscured cosmic star formation at $z\gtrsim8$, wide-field JWST/NIRCam imaging and slitless spectroscopic surveys of early protoclusters are essential.
Submitted 6 June, 2025;
originally announced June 2025.
-
Coherent photoproduction of $ρ^0, ω$ and excited vector mesons in ultraperipheral PbPb collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1127 additional authors not shown)
Abstract:
The invariant-mass distribution for the coherent photoproduction of dipions in ultraperipheral PbPb collisions is measured using data, corresponding to an integrated luminosity of $ 224.6 \pm 9.6\ μ$b$^{-1}$, collected by the LHCb experiment in 2018 at a nucleon-nucleon centre-of-mass energy $\sqrt{s_{\rm NN}}=5.02$ TeV. The dominant contribution is due to the $ρ^0$ meson but a consistent description across the full invariant-mass range requires accounting for the $ω$ meson and introducing two resonances at masses of $1350\pm20$ MeV and $1790\pm20$ MeV with widths of about 300 MeV. The cross-section for each meson is measured differentially in twelve bins of rapidity from 2.05 to 4.90. Significant nuclear suppression is observed for the $ρ^0$ meson compared to expectations based on photoproduction on the proton.
Submitted 6 June, 2025;
originally announced June 2025.
-
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning
Authors:
Sheng Chen,
Peiyu He,
Jiaxin Hu,
Ziyang Liu,
Yansheng Wang,
Tao Xu,
Chi Zhang,
Chongchong Zhang,
Chao An,
Shiyu Cai,
Duo Cao,
Kangping Chen,
Shuai Chu,
Tianwei Chu,
Mingdi Dan,
Min Du,
Weiwei Fang,
Pengyou Fu,
Junkai Hu,
Xiaowei Jiang,
Zhaodi Jiang,
Fuxuan Li,
Jun Li,
Minghui Li,
Mingyao Li
, et al. (46 additional authors not shown)
Abstract:
Modern robot navigation systems encounter difficulties in diverse and complex indoor environments. Traditional approaches rely on multiple modules with small models or rule-based systems and thus lack adaptability to new environments. To address this, we developed Astra, a comprehensive dual-model architecture, Astra-Global and Astra-Local, for mobile robot navigation. Astra-Global, a multimodal LLM, processes vision and language inputs to perform self and goal localization using a hybrid topological-semantic graph as the global map, and outperforms traditional visual place recognition methods. Astra-Local, a multitask network, handles local path planning and odometry estimation. Its 4D spatial-temporal encoder, trained through self-supervised learning, generates robust 4D features for downstream tasks. The planning head utilizes flow matching and a novel masked ESDF loss to minimize collision risks when generating local trajectories, and the odometry head integrates multi-sensor inputs via a transformer encoder to predict the relative pose of the robot. Deployed on real in-house mobile robots, Astra achieves a high end-to-end mission success rate across diverse indoor environments.
Submitted 6 June, 2025;
originally announced June 2025.
-
Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU
Authors:
Wenhao Dai,
Haodong Deng,
Mengfei Rong,
Xinyu Yang,
Hongyu Liu,
Fangxin Liu,
Hailong Yang,
Weifeng Liu,
Qingxiao Sun
Abstract:
Large language models are popular around the world due to their powerful understanding capabilities. As the Transformer is the core component of LLMs, accelerating it through parallelization has gradually become a hot research topic. Mask layers introduce sparsity into the Transformer to reduce calculations. However, previous works rarely focus on the performance optimization of sparse Transformers. Moreover, rule-based mechanisms ignore the fusion opportunities of mixed-type operators and fail to adapt to various sequence lengths. To address the above problems, we propose STOF, a framework that incorporates optimizations for Sparse Transformer via flexible masking and operator fusion on GPU. We first unify the storage format and kernel implementation for the multi-head attention. Then, we map fusion schemes to compilation templates and determine the optimal parameter setting through a two-stage search engine. The experimental results show that compared to the state-of-the-art work, STOF achieves maximum speedups of 1.7x in MHA computation and 1.5x in end-to-end inference.
Submitted 6 June, 2025;
originally announced June 2025.
-
Observation of $D^+\to K^0_Sπ^0μ^+ν_μ$, Test of Lepton Flavor Universality and First Angular Analysis of $D^+\to \bar{K}^\ast(892)^0\ell^+ν_\ell$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (696 additional authors not shown)
Abstract:
We report a study of the semileptonic decays $D^+\to K_S^0π^0\ell^+ν_\ell$ ($\ell = e, μ$) based on $20.3\,\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector.
The $D^+\to K_S^0π^0μ^+ν_μ$ decay is observed for the first time, with a branching fraction of $(0.896\pm0.017_{\rm stat}\pm0.008_{\rm syst})\%$, and the branching fraction of $D^+\to K_S^0π^0e^+ν_e$ is determined with improved precision as $(0.943\pm0.012_{\rm stat}\pm0.010_{\rm syst})\%$.
From the analysis of the dynamics, we observe that the dominant $\bar{K}^\ast(892)^0$ component is accompanied by an $S$-wave contribution, which accounts for $(7.10 \pm 0.68_{\rm stat} \pm 0.41_{\rm syst})\%$ of the total decay rate of the $μ^+$ channel and $(6.39 \pm 0.17_{\rm stat} \pm 0.14_{\rm syst})\%$ of the $e^+$ channel. Assuming a single-pole dominance parameterization, the hadronic form factor ratios are extracted to be $r_V=V(0)/A_1(0)=1.42 \pm\, 0.03_{\rm stat} \pm\, 0.02_{\rm syst}$ and $r_2=A_2(0)/A_1(0)=0.75 \pm\, 0.03_{\rm stat} \pm\, 0.01_{\rm syst}$.
Based on the first comprehensive angular and decay-rate $CP$ asymmetry analysis, the full set of averaged angular and $CP$ asymmetry observables is measured as a function of the momentum-transfer squared; they are consistent with expectations from the Standard Model. No evidence for violation of $μ-e$ lepton-flavor universality is observed in either the full range or the five chosen bins of momentum-transfer squared.
Submitted 6 June, 2025;
originally announced June 2025.
-
Mitigating Degree Bias Adaptively with Hard-to-Learn Nodes in Graph Contrastive Learning
Authors:
Jingyu Hu,
Hongbo Bo,
Jun Hong,
Xiaowei Liu,
Weiru Liu
Abstract:
Graph Neural Networks (GNNs) often suffer from degree bias in node classification tasks, where prediction performance varies across nodes with different degrees. Several approaches, which adopt Graph Contrastive Learning (GCL), have been proposed to mitigate this bias. However, the limited number of positive pairs and the equal weighting of all positives and negatives in GCL still lead to low-degree nodes acquiring insufficient and noisy information. This paper proposes the Hardness Adaptive Reweighted (HAR) contrastive loss to mitigate degree bias. It adds more positive pairs by leveraging node labels and adaptively weights positive and negative pairs based on their learning hardness. In addition, we develop an experimental framework named SHARP to extend HAR to a broader range of scenarios. Both our theoretical analysis and experiments validate the effectiveness of SHARP. The experimental results across four datasets show that SHARP achieves better performance against baselines at both global and degree levels.
Submitted 5 June, 2025;
originally announced June 2025.
-
Study of $f_1(1420)$ and $η(1405)$ in the decay $J/ψ\to γπ^{0}π^{0}π^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (650 additional authors not shown)
Abstract:
A partial-wave analysis is performed on the decay $J/ψ\toγπ^{0}π^{0}π^{0}$ within the $π^{0}π^{0}π^{0}$ invariant-mass region below 1.6 GeV$/c^{2}$, using $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector. Significant isospin-violating decays of $η(1405)$ and $f_1(1420)$ into $f_0(980)π^{0}$ are observed. For the first time, three axial-vectors, $f_1(1285)$, $f_1(1420)$ and $f_1(1510)$, are observed to decay into $π^{0}π^{0}π^{0}$. The product branching fractions of these resonances are reported.
Submitted 7 June, 2025; v1 submitted 5 June, 2025;
originally announced June 2025.
-
Fine-Grained Interpretation of Political Opinions in Large Language Models
Authors:
Jingyu Hu,
Mengyue Yang,
Mengnan Du,
Weiru Liu
Abstract:
Studies of LLMs' political opinions mainly rely on evaluations of their open-ended responses. Recent work indicates that there is a misalignment between LLMs' responses and their internal intentions. This motivates us to probe LLMs' internal mechanisms and help uncover their internal political states. Additionally, we found that the analysis of LLMs' political opinions often relies on single-axis concepts, which can lead to concept confounds. In this work, we extend the single axis to multiple dimensions and apply interpretable representation engineering techniques for more transparent LLM political concept learning. Specifically, we designed a four-dimensional political learning framework and constructed a corresponding dataset for fine-grained political concept vector learning. These vectors can be used to detect and intervene in LLM internals. Experiments are conducted on eight open-source LLMs with three representation engineering techniques. Results show these vectors can disentangle political concept confounds. Detection tasks validate the semantic meaning of the vectors and show good generalization and robustness in OOD settings. Intervention experiments show these vectors can intervene in LLMs to generate responses with different political leanings.
Submitted 5 June, 2025;
originally announced June 2025.
-
BVLSM: Write-Efficient LSM-Tree Storage via WAL-Time Key-Value Separation
Authors:
Ming Li,
Wendi Cheng,
Jiahe Wei,
Xueqiang Shan,
Weikai Liu,
Xiaonan Zhao,
Xiao Zhang
Abstract:
Modern data-intensive applications increasingly store and process big-value items, such as multimedia objects and machine learning embeddings, which exacerbate storage inefficiencies in Log-Structured Merge-Tree (LSM)-based key-value stores. This paper presents BVLSM, a Write-Ahead Log (WAL)-time key-value separation mechanism designed to address three key challenges in LSM-Tree storage systems: write amplification, poor memory utilization, and I/O jitter under big-value workloads. Unlike state-of-the-art approaches that delay key-value separation until the flush stage, leading to redundant data in MemTables and repeated writes, BVLSM proactively decouples keys and values during the WAL phase. The MemTable stores only lightweight metadata, allowing multi-queue parallel storage of big values. The benchmark results show that BVLSM significantly outperforms both RocksDB and BlobDB under 64KB random write workloads. In asynchronous WAL mode, it achieves throughput improvements of 7.6x over RocksDB and 1.9x over BlobDB.
Submitted 8 June, 2025; v1 submitted 5 June, 2025;
originally announced June 2025.
-
Diffusion Transformer-based Universal Dose Denoising for Pencil Beam Scanning Proton Therapy
Authors:
Yuzhen Ding,
Jason Holmes,
Hongying Feng,
Martin Bues,
Lisa A. McGee,
Jean-Claude M. Rwigema,
Nathan Y. Yu,
Terence S. Sio,
Sameer R. Keole,
William W. Wong,
Steven E. Schild,
Jonathan B. Ashman,
Sujay A. Vora,
Daniel J. Ma,
Samir H. Patel,
Wei Liu
Abstract:
Purpose: Intensity-modulated proton therapy (IMPT) offers precise tumor coverage while sparing organs at risk (OARs) in head and neck (H&N) cancer. However, its sensitivity to anatomical changes requires frequent adaptation through online adaptive radiation therapy (oART), which depends on fast, accurate dose calculation via Monte Carlo (MC) simulations. Reducing particle count accelerates MC but degrades accuracy. To address this, denoising low-statistics MC dose maps is proposed to enable fast, high-quality dose generation.
Methods: We developed a diffusion transformer-based denoising framework. IMPT plans and 3D CT images from 80 H&N patients were used to generate noisy and high-statistics dose maps using MCsquare (1 min and 10 min per plan, respectively). Data were standardized into uniform chunks with zero-padding, normalized, and transformed into quasi-Gaussian distributions. Testing was done on 10 H&N, 10 lung, 10 breast, and 10 prostate cancer cases, preprocessed identically. The model was trained with noisy dose maps and CT images as input and high-statistics dose maps as ground truth, using a combined loss of mean square error (MSE), residual loss, and regional MAE (focusing on top/bottom 10% dose voxels). Performance was assessed via MAE, 3D Gamma passing rate, and DVH indices.
Results: The model achieved MAEs of 0.195 (H&N), 0.120 (lung), 0.172 (breast), and 0.376 Gy[RBE] (prostate). 3D Gamma passing rates exceeded 92% (3%/2mm) across all sites. DVH indices for clinical target volumes (CTVs) and OARs closely matched the ground truth.
Conclusion: A diffusion transformer-based denoising framework was developed and, though trained only on H&N data, generalizes well across multiple disease sites.
Submitted 4 June, 2025;
originally announced June 2025.
-
JWST Insights into Narrow-line Little Red Dots
Authors:
Zijian Zhang,
Linhua Jiang,
Weiyang Liu,
Luis C. Ho,
Kohei Inayoshi
Abstract:
The James Webb Space Telescope (JWST) has revealed a population of red and compact objects with a unique V-shape SED at z >= 4 known as Little Red Dots (LRDs). Most of the LRDs with existing spectral observations exhibit broad Balmer lines and are thus likely to host active galactic nuclei (AGNs). Here we present a study of LRDs with no broad H-alpha component. Our sample consists of five LRDs at z~5 with H-alpha line widths of about 250 km/s. They are selected from 32 LRDs that have NIRSpec high- or medium-resolution grating spectra covering H-alpha. During our construction of the sample, we find that approximately 20 percent of the LRD candidates previously selected do not show red continuum emission but resemble the V-shape spectra due to strong line emission. Compared to normal star-forming galaxies, narrow-line LRDs tend to have relatively higher H-alpha line widths and luminosities. If these LRDs are dominated by galaxies, our SED modeling suggests that they are dusty, compact star-forming galaxies with high stellar masses and star formation rates (SFRs). Alternatively, if their SEDs are produced by AGNs, the inferred central black hole masses (MBH) are in the range of 10^5 to 10^6 solar masses, placing them at the low-mass end of the AGN population. They may represent an early stage of super-Eddington growth, where the black holes have yet to accumulate significant masses. With large measurement uncertainties, these black holes appear slightly overmassive relative to the local MBH-Mstar relation, but consistent or undermassive with respect to the MBH-sigma and MBH-Mdyn relations. We further find that nearly half of the high-redshift broad-line AGNs exhibit V-shape SEDs. (abridged)
Submitted 4 June, 2025;
originally announced June 2025.
-
The optical Bloch equation for the finite-temperature fluctuations
Authors:
Weitao Liu
Abstract:
In this work, I analyze quantum fluctuations and thermal fluctuations in the framework of quantum mechanics. Being recognized as incoherent perturbations with different features, fluctuations of these two types lead to dissipative terms in the optical Bloch equations. The method allows one to use the optical Bloch equation to analyze time-dependent processes in the presence of finite-temperature fluctuations. The numerical results show that deexcitation is the zero-temperature limit of equilibration. The impact of the fluctuations on coherent excitations is also discussed.
Submitted 11 June, 2025; v1 submitted 4 June, 2025;
originally announced June 2025.
-
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning
Authors:
Muling Wu,
Qi Qian,
Wenhao Liu,
Xiaohua Wang,
Zisu Huang,
Di Liang,
LI Miao,
Shihan Dou,
Changze Lv,
Zhenghua Wang,
Zhibo Xu,
Lina Chen,
Tianlong Li,
Xiaoqing Zheng,
Xuanjing Huang
Abstract:
Large Language Models (LLMs) have achieved remarkable performance across various reasoning tasks, yet post-training is constrained by inefficient sample utilization and inflexible processing of difficult samples. To address these limitations, we propose Customized Curriculum Learning (CCL), a novel framework with two key innovations. First, we introduce a model-adaptive difficulty definition that customizes curriculum datasets based on each model's individual capabilities rather than using predefined difficulty metrics. Second, we develop "Guided Prompting," which dynamically reduces sample difficulty through strategic hints, enabling effective utilization of challenging samples that would otherwise degrade performance. Comprehensive experiments on supervised fine-tuning and reinforcement learning demonstrate that CCL significantly outperforms uniform training approaches across five mathematical reasoning benchmarks, confirming its effectiveness across both paradigms in enhancing sample utilization and model performance.
Submitted 4 June, 2025;
originally announced June 2025.
-
Act-as-Pet: Benchmarking the Abilities of Large Language Models as E-Pets in Social Network Services
Authors:
Hongcheng Guo,
Zheyong Xie,
Shaosheng Cao,
Boyang Wang,
Weiting Liu,
Zheyu Ye,
Zhoujun Li,
Zuozhu Liu
Abstract:
As interest in using Large Language Models (LLMs) for interactive and emotionally rich experiences grows, virtual pet companionship emerges as a novel yet underexplored application. Existing approaches focus on basic pet role-playing interactions without systematically benchmarking LLMs for comprehensive companionship. In this paper, we introduce Pet-Bench, a dedicated benchmark that evaluates LLMs across both self-interaction and human-interaction dimensions. Unlike prior work, Pet-Bench emphasizes self-evolution and developmental behaviors alongside interactive engagement, offering a more realistic reflection of pet companionship. It features diverse tasks such as intelligent scheduling, memory-based dialogues, and psychological conversations, with over 7,500 interaction instances designed to simulate complex pet behaviors. Evaluation of 28 LLMs reveals significant performance variations linked to model size and inherent capabilities, underscoring the need for specialized optimization in this domain. Pet-Bench serves as a foundational resource for benchmarking pet-related LLM abilities and advancing emotionally immersive human-pet interactions.
Submitted 4 June, 2025;
originally announced June 2025.
-
Quenching of single-particle strength inferred from nucleon-removal transfer reactions on $^{15}$C
Authors:
Y. C. Jiang,
J. Chen,
B. P. Kay,
C. R. Hoffman,
T. L. Tang,
I. A. Tolstukhin,
M. R. Xie,
J. G. Li,
N. Michel,
M. L. Avila,
Y. Ayyad,
D. Bazin,
S. Bennett,
J. A. Clark,
S. J. Freeman,
H. Jayatissa,
G. Li,
W. P. Liu,
J. L. Lou,
A. Munoz-Ramos,
C. Müller-Gatermann,
T. Nathan,
D. Santiago-Gonzalez,
D. K. Sharp,
Y. P. Shen
, et al. (2 additional authors not shown)
Abstract:
The difference in the proton and neutron separation energies ($ΔS$) of the weakly bound $^{15}$C ground state is -19.86 MeV, an extreme value. Data from intermediate-energy heavy-ion induced (HI-induced) knockout reactions on nuclei spanning $-20\lesssimΔS\lesssim+20$ MeV suggest that the degree to which single-particle strength is quenched, $R_s$, has a negative correlation with $ΔS$, decreasing from unity around $-20$~MeV to around 0.2 at $+20$~MeV. For the $^{15}$C ground state ($R_s=0.96(4)$ in HI-induced knockout), contrasting results have recently been obtained via the neutron-adding transfer reaction, which reveal a value of $R_s=0.64(15)$, similar to the value observed at modest $ΔS$ and more extreme values of $ΔS$ with reaction probes other than HI knockout. To explore any potential differences between $adding$ and $removing$ processes in transfer reactions at extreme $ΔS$, single-neutron removal transfer reactions on $^{15}$C were performed at 7.1 MeV/u in inverse kinematics. The removal of a valence neutron in the 2$s_{1/2}$ orbit using both ($p$,$d$) and ($d$,$t$) reactions shows consistent quenching factors and agrees with those from the neutron-adding reaction. The present results, which can be compared with neutron knockout reactions, suggest that correlations, represented by the quenching factor, show limited dependence on neutron-proton asymmetry under the most extreme asymmetry conditions so far achieved in transfer reactions.
Submitted 4 June, 2025;
originally announced June 2025.
-
MiMo-VL Technical Report
Authors:
Xiaomi LLM-Core Team,
Zihao Yue,
Zhenru Lin,
Yifan Song,
Weikun Wang,
Shuhuai Ren,
Shuhao Gu,
Shicheng Li,
Peidian Li,
Liang Zhao,
Lei Li,
Kainan Bao,
Hao Tian,
Hailin Zhang,
Gang Wang,
Dawei Zhu,
Cici,
Chenhong He,
Bowen Ye,
Bowen Shen,
Zihan Zhang,
Zihan Jiang,
Zhixian Zheng,
Zhichao Song
, et al. (50 additional authors not shown)
Abstract:
We open-source MiMo-VL-7B-SFT and MiMo-VL-7B-RL, two powerful vision-language models delivering state-of-the-art performance in both general visual understanding and multimodal reasoning. MiMo-VL-7B-RL outperforms Qwen2.5-VL-7B on 35 out of 40 evaluated tasks, and scores 59.4 on OlympiadBench, surpassing models with up to 78B parameters. For GUI grounding applications, it sets a new standard with 56.1 on OSWorld-G, even outperforming specialized models such as UI-TARS. Our training combines four-stage pre-training (2.4 trillion tokens) with Mixed On-policy Reinforcement Learning (MORL) integrating diverse reward signals. We identify the importance of incorporating high-quality reasoning data with long Chain-of-Thought into pre-training stages, and the benefits of mixed RL despite challenges in simultaneous multi-domain optimization. We also contribute a comprehensive evaluation suite covering 50+ tasks to promote reproducibility and advance the field. The model checkpoints and full evaluation suite are available at https://github.com/XiaomiMiMo/MiMo-VL.
Submitted 4 June, 2025;
originally announced June 2025.
-
Three-pion Bose-Einstein correlations measured in proton-proton collisions
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1125 additional authors not shown)
Abstract:
A study on the Bose-Einstein correlations for triplets of same-sign pions is presented. The analysis is performed using proton-proton collisions at a centre-of-mass energy of $\sqrt{s}$ = 7 TeV, recorded by the LHCb experiment, corresponding to an integrated luminosity of 1.0 fb$^{-1}$. For the first time, the results are interpreted in the core-halo model. The parameters of the model are determined in regions of charged-particle multiplicity. This measurement provides insight into the nature of hadronisation in terms of coherence, showing a coherent emission of pions.
Submitted 9 June, 2025; v1 submitted 3 June, 2025;
originally announced June 2025.
-
Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation
Authors:
Qining Zhang,
Tanner Fiez,
Yi Liu,
Wenyang Liu
Abstract:
Standard A/B tests in online experiments face statistical power challenges when testing multiple candidates simultaneously, while adaptive experimental designs (AED) alone fall short in inferring experiment statistics such as the average treatment effect, especially with many metrics (e.g., revenue, safety) and heterogeneous variances. This paper proposes a fixed-budget multi-metric AED framework with a two-phase structure: an adaptive exploration phase to identify the best treatment, and a validation phase with an A/B test to verify the treatment's quality and infer statistics. We propose SHRVar, which generalizes sequential halving (SH) (Karnin et al., 2013) with a novel relative-variance-based sampling and an elimination strategy built on reward z-values. It achieves a provable error probability that decreases exponentially, where the exponent generalizes the complexity measure for SH (Karnin et al., 2013) and SHVar (Lalitha et al., 2023) with homogeneous and heterogeneous variances, respectively. Numerical experiments verify our analysis and demonstrate the superior performance of this new framework.
Submitted 3 June, 2025;
originally announced June 2025.
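For readers unfamiliar with sequential halving (SH), which the abstract above generalizes, here is a minimal sketch of the plain fixed-budget SH procedure. It is not SHRVar: it uses equal per-arm allocation and elimination by empirical mean, with no relative-variance sampling and no z-value rule. The names `sequential_halving`, `pull`, and `budget` are illustrative assumptions, not taken from the paper.

```python
import math

def sequential_halving(pull, n_arms, budget):
    """Plain sequential halving (illustrative sketch, not SHRVar).

    pull(arm) returns one reward sample for that arm.
    Returns the index of the arm that survives all elimination rounds.
    """
    survivors = list(range(n_arms))
    rounds = max(1, math.ceil(math.log2(n_arms)))
    for _ in range(rounds):
        if len(survivors) == 1:
            break
        # Split the per-round budget equally among surviving arms.
        pulls_each = max(1, budget // (len(survivors) * rounds))
        means = {
            arm: sum(pull(arm) for _ in range(pulls_each)) / pulls_each
            for arm in survivors
        }
        # Keep the better half (rounded up) by empirical mean.
        survivors.sort(key=lambda arm: means[arm], reverse=True)
        survivors = survivors[: math.ceil(len(survivors) / 2)]
    return survivors[0]
```

In the two-phase design described in the abstract, the arm returned by such an exploration phase would then be passed to a separate A/B validation phase; that step is not shown here.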
-
Measurement of the branching fractions of the Cabibbo-favored decays $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ and $Λ_{c}^{+}\toΞ^{0}K_{S}^{0}π^{+}$ and search for $Λ_{c}^{+}\toΣ^{0} K_{S}^{0}K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (660 additional authors not shown)
Abstract:
Based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of about 4.5 fb$^{-1}$ collected at center-of-mass energies between 4599.53 MeV and 4698.82 MeV with the BESIII detector, the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is measured to be $(3.12\pm0.46\pm0.15)\times10^{-3}$. Combined with a previous measurement from the BESIII Collaboration, the branching fraction of the decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is calculated to be $(3.07\pm0.26\pm0.13)\times10^{-3}$. The decay $Λ_{c}^{+}\toΞ^{0}K_{S}^{0}π^{+}$ is observed for the first time with a statistical significance of $6.6σ$, and its branching fraction is determined to be $(3.70\pm0.60\pm0.21)\times10^{-3}$. In addition, a search for the decay $Λ_{c}^{+}\toΣ^{0} K_{S}^{0}K^{+}$ is performed and its branching fraction is determined to be $(0.80^{+0.28}_{-0.24}\pm0.16)\times10^{-3}$, corresponding to an upper limit of $1.28\times10^{-3}$ at $90\%$ confidence level. These measurements provide new information that can be used to distinguish between theoretical models.
Submitted 3 June, 2025;
originally announced June 2025.
-
Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems
Authors:
Guanzhong Chen,
Shaoxiong Yang,
Chao Li,
Wei Liu,
Jian Luan,
Zenglin Xu
Abstract:
Large Language Models (LLMs) have achieved remarkable success across diverse natural language processing tasks, yet their deployment in real-world applications is hindered by fixed knowledge cutoffs and difficulties in generating controllable, accurate outputs in a single inference. Multi-agent systems (MAS) built from specialized LLM agents offer a promising solution, enabling dynamic collaboration and iterative reasoning. However, optimizing these systems remains a challenge, as conventional methods such as prompt engineering and supervised fine-tuning entail high engineering overhead and limited adaptability. Reinforcement learning (RL), particularly multi-agent reinforcement learning (MARL), provides a scalable framework by refining agent policies based on system-level feedback. Nevertheless, existing MARL algorithms, such as Multi-Agent Proximal Policy Optimization (MAPPO), rely on Critic networks, which can cause training instability and increase computational burden. To address these limitations and target the prototypical Multi-Agent Search System (MASS), we propose Multi-Agent Heterogeneous Group Policy Optimization (MHGPO), a novel Critic-free algorithm that guides policy updates by estimating relative reward advantages across heterogeneous groups of rollouts. MHGPO eliminates the need for Critic networks, enhancing stability and reducing computational overhead. Additionally, we introduce three group rollout sampling strategies that trade off between efficiency and effectiveness. Experiments on a multi-agent LLM-based search system demonstrate that MHGPO consistently outperforms MAPPO in both task performance and computational efficiency, without requiring warm-up, underscoring its potential for stable and scalable optimization of complex LLM-based MAS.
Submitted 3 June, 2025;
originally announced June 2025.
-
Improved Measurements of $D^+ \to ηe^+ν_e$ and $D^+ \to ημ^+ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (682 additional authors not shown)
Abstract:
Using 20.3 fb$^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, we measure the branching fractions of $D^+\to ηe^+ν_e$ and $D^+\to ημ^+ν_μ$ to be $(9.75\pm0.29\pm0.28)\times10^{-4}$ and $(9.08\pm0.35\pm0.23)\times10^{-4}$, where the first and second uncertainties are statistical and systematic, respectively. From a simultaneous fit to their partial decay rates, we determine the product of the hadronic form factor $f^η_+(0)$ and the modulus of the $c\to d$ Cabibbo-Kobayashi-Maskawa matrix element $|V_{cd}|$ to be $f^η_+(0)|V_{cd}|=0.078\pm0.002\pm0.001$. Taking the $|V_{cd}|$ value from the Standard Model global fit as input, we obtain $f^η_+(0)=0.345\pm0.008\pm0.003$. The ratio between the measured branching fractions of $D^+\to ημ^+ν_μ$ and $D^+\to ηe^+ν_e$ is determined to be $0.93\pm0.05_{\rm stat.}\pm0.02_{\rm syst.}$, indicating no violation of lepton flavor universality.
Submitted 3 June, 2025;
originally announced June 2025.
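As a rough cross-check of the quoted ratio, the central value and statistical uncertainty can be reproduced from the two branching fractions in the abstract. This sketch assumes the two statistical uncertainties are independent, which is an assumption made here and not a statement from the paper. The systematic term is not reproduced, since correlated systematics partly cancel in the ratio and the abstract does not give the correlations.

```latex
R = \frac{\mathcal{B}(D^+\to\eta\mu^+\nu_\mu)}{\mathcal{B}(D^+\to\eta e^+\nu_e)}
  = \frac{9.08}{9.75} \approx 0.931,
\qquad
\frac{\sigma_R^{\rm stat}}{R}
  \approx \sqrt{\left(\frac{0.35}{9.08}\right)^2 + \left(\frac{0.29}{9.75}\right)^2}
  \approx 0.049,
\qquad
\sigma_R^{\rm stat} \approx 0.045 .
```

Rounded to two decimal places this gives $0.93\pm0.05_{\rm stat.}$, in line with the value stated in the abstract.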
-
ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
Authors:
Chaoyue He,
Xin Zhou,
Yi Wu,
Xinjia Yu,
Yan Zhang,
Lei Zhang,
Di Wang,
Shengfei Lyu,
Hong Xu,
Xiaoqiao Wang,
Wei Liu,
Chunyan Miao
Abstract:
We introduce ESGenius, a comprehensive benchmark for evaluating and enhancing the proficiency of Large Language Models (LLMs) in Environmental, Social and Governance (ESG) and sustainability-focused question answering. ESGenius comprises two key components: (i) ESGenius-QA, a collection of 1 136 multiple-choice questions generated by LLMs and rigorously validated by domain experts, covering a broad range of ESG pillars and sustainability topics. Each question is systematically linked to its corresponding source text, enabling transparent evaluation and supporting retrieval-augmented generation (RAG) methods; and (ii) ESGenius-Corpus, a meticulously curated repository of 231 foundational frameworks, standards, reports and recommendation documents from seven authoritative sources. Moreover, to fully assess the capabilities and adaptation potential of the model, we implement a rigorous two-stage evaluation protocol -- Zero-Shot and RAG. Extensive experiments across 50 LLMs (ranging from 0.5 B to 671 B parameters) demonstrate that state-of-the-art models achieve only moderate performance in zero-shot settings, with accuracies typically around 55--70\%, highlighting ESGenius's challenging nature for LLMs in interdisciplinary contexts. However, models employing RAG show significant performance improvements, particularly for smaller models. For example, "DeepSeek-R1-Distill-Qwen-14B" improves from 63.82\% (zero-shot) to 80.46\% with RAG. These results underscore the necessity of grounding responses in authoritative sources for enhanced ESG understanding. To the best of our knowledge, ESGenius is the first benchmark curated for LLMs and the relevant enhancement technologies that focuses on ESG and sustainability topics.
Submitted 2 June, 2025;
originally announced June 2025.
-
STORM-BORN: A Challenging Mathematical Derivations Dataset Curated via a Human-in-the-Loop Multi-Agent Framework
Authors:
Wenhao Liu,
Zhenyi Lu,
Xinyu Hu,
Jierui Zhang,
Dailin Li,
Jiacheng Cen,
Huilin Cao,
Haiteng Wang,
Yuhan Li,
Kun Xie,
Dandan Li,
Pei Zhang,
Chengbo Zhang,
Yuxiang Ren,
Xiaohong Huang,
Yan Ma
Abstract:
High-quality math datasets are crucial for advancing the reasoning abilities of large language models (LLMs). However, existing datasets often suffer from three key issues: outdated and insufficiently challenging content, neglect of human-like reasoning, and limited reliability due to single-LLM generation. To address these, we introduce STORM-BORN, an ultra-challenging dataset of mathematical derivations sourced from cutting-edge academic papers, which includes dense human-like approximations and heuristic cues. To ensure reliability and quality, we propose a novel human-in-the-loop, multi-agent data generation framework, integrating reasoning-dense filters, multi-agent collaboration, and human mathematicians' evaluations. We curated a set of 2,000 synthetic samples and deliberately selected the 100 most difficult problems. Even the most advanced models like GPT-o1 solved fewer than 5% of them. Fine-tuning on STORM-BORN boosts accuracy by 7.84% (LLaMA3-8B) and 9.12% (Qwen2.5-7B). As AI approaches mathematician-level reasoning, STORM-BORN provides both a high-difficulty benchmark and a human-like reasoning training resource. Our code and dataset are publicly available at https://github.com/lwhere/STORM-BORN.
Submitted 3 June, 2025; v1 submitted 2 June, 2025;
originally announced June 2025.
-
Quantitative Error Feedback for Quantization Noise Reduction of Filtering over Graphs
Authors:
Xue Xian Zheng,
Weihang Liu,
Xin Lou,
Stefan Vlaski,
Tareq Al-Naffouri
Abstract:
This paper introduces an innovative error feedback framework designed to mitigate quantization noise in distributed graph filtering, where communications are constrained to quantized messages. It draws on error spectrum shaping techniques from state-space digital filters, and therefore establishes connections between quantized filtering processes over different domains. In contrast to existing error compensation methods, our framework quantitatively feeds back the quantization noise for exact compensation. We examine the framework under three key scenarios: (i) deterministic graph filtering, (ii) graph filtering over random graphs, and (iii) graph filtering with random node-asynchronous updates. Rigorous theoretical analysis demonstrates that the proposed framework significantly reduces the effect of quantization noise, and we provide closed-form solutions for the optimal error feedback coefficients. Moreover, this quantitative error feedback mechanism can be seamlessly integrated into communication-efficient decentralized optimization frameworks, enabling lower error floors. Numerical experiments validate the theoretical results, consistently showing that our method outperforms conventional quantization strategies in terms of both accuracy and robustness.
Submitted 2 June, 2025;
originally announced June 2025.
-
STSA: Federated Class-Incremental Learning via Spatial-Temporal Statistics Aggregation
Authors:
Zenghao Guan,
Guojun Zhu,
Yucan Zhou,
Wu Liu,
Weiping Wang,
Jiebo Luo,
Xiaoyan Gu
Abstract:
Federated Class-Incremental Learning (FCIL) enables Class-Incremental Learning (CIL) from distributed data. Existing FCIL methods typically integrate old knowledge preservation into local client training. However, these methods cannot avoid spatial-temporal client drift caused by data heterogeneity and often incur significant computational and communication overhead, limiting practical deployment. To address these challenges simultaneously, we propose a novel approach, Spatial-Temporal Statistics Aggregation (STSA), which provides a unified framework to aggregate feature statistics both spatially (across clients) and temporally (across stages). The aggregated feature statistics are unaffected by data heterogeneity and can be used to update the classifier in closed form at each stage. Additionally, we introduce STSA-E, a communication-efficient variant with theoretical guarantees, achieving similar performance to STSA with much lower communication overhead. Extensive experiments on three widely used FCIL datasets, with varying degrees of data heterogeneity, show that our method outperforms state-of-the-art FCIL methods in terms of performance, flexibility, and both communication and computation efficiency.
Submitted 2 June, 2025;
originally announced June 2025.
-
Testing the Young FRB Progenitor Hypothesis: A Crossmatch of Catalog-1 CHIME Bursts with Historic Local Universe Supernovae
Authors:
Wanqing Liu,
Mohit Bhardwaj,
Ben Margalit
Abstract:
Fast radio bursts (FRBs) are among the most energetic and enigmatic transients in the radio sky, with mounting evidence suggesting newborn, highly magnetized neutron stars formed in core-collapse supernovae (CCSNe) as their sources. A definitive spatial association between an FRB and a historic CCSN would confirm this link and tightly constrain young neutron star source models. Here we report on the first systematic cross-matching of 886 spectroscopically classified CCSNe in the local Universe (z $\leq$ 0.043) against 241 CHIME/FRB Catalog 1 events, applying rigorous spatial, dispersion measure (DM), and scattering time ($τ$) criteria. We identify four positional overlaps, all consistent with chance alignment; however, one pair, FRB 20190412B-SN 2009gi, also satisfies independent host-DM and $τ$ constraints, making it a promising candidate for targeted follow-up. Next, we search for compact (persistent or transient) radio emission at all matched supernova sites using multi-epoch VLASS data and detect none. Treating every CCSN sight line as a non-detection, we derive Poisson upper limits on the FRB burst rate at these locations, which lie well below the rates observed for the most active repeaters unless their activity is heavily suppressed by beaming, intermittency, or residual free-free absorption. We then develop a galaxy-integrated FRB-rate model that incorporates an intrinsic spectral index, secular magnetar-activity decay, and frequency-dependent free-free opacity. Applying this formalism to existing FRB data shows that reproducing the observed CHIME/CRAFT all-sky rate ratio requires a steep decline in magnetar burst rates with age. Finally, our work underscores the necessity of sub-arcsecond localizations and multiwavelength follow-up to definitively test the young neutron star source hypothesis.
Submitted 1 June, 2025;
originally announced June 2025.
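The abstract above derives Poisson upper limits from non-detections. A minimal sketch of the standard zero-count case follows: if no events are seen, the upper limit on the Poisson mean at confidence level CL solves exp(-mu) = 1 - CL. This is the textbook relation only, not the authors' full calculation, and the function names and the `exposure` parameter are hypothetical.

```python
import math

def zero_count_upper_limit(confidence):
    """Upper limit on the Poisson mean when zero events are observed.

    Solves exp(-mu) = 1 - confidence for mu.
    """
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    return -math.log(1.0 - confidence)

def rate_upper_limit(confidence, exposure):
    """Upper limit on the event rate for a total exposure.

    `exposure` is a hypothetical total observing time; the result is in
    events per unit of that time.
    """
    if exposure <= 0:
        raise ValueError("exposure must be positive")
    return zero_count_upper_limit(confidence) / exposure
```

At 90% confidence the zero-count limit is about 2.30 expected events, so the rate limit scales inversely with the total exposure summed over sight lines.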
-
SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models
Authors:
Yule Zhu,
Ping Liu,
Zhedong Zheng,
Wei Liu
Abstract:
Diffusion models have recently enabled precise and photorealistic facial editing across a wide range of semantic attributes. Beyond single-step modifications, a growing class of applications now demands the ability to analyze and track sequences of progressive edits, such as stepwise changes to hair, makeup, or accessories. However, sequential editing introduces significant challenges in edit attribution and detection robustness, further complicated by the lack of large-scale, finely annotated benchmarks tailored explicitly for this task. We introduce SEED, a large-scale Sequentially Edited facE Dataset constructed via state-of-the-art diffusion models. SEED contains over 90,000 facial images with one to four sequential attribute modifications, generated using diverse diffusion-based editing pipelines (LEdits, SDXL, SD3). Each image is annotated with detailed edit sequences, attribute masks, and prompts, facilitating research on sequential edit tracking, visual provenance analysis, and manipulation robustness assessment. To benchmark this task, we propose FAITH, a frequency-aware transformer-based model that incorporates high-frequency cues to enhance sensitivity to subtle sequential changes. Comprehensive experiments, including systematic comparisons of multiple frequency-domain methods, demonstrate the effectiveness of FAITH and the unique challenges posed by SEED. SEED offers a challenging and flexible resource for studying progressive diffusion-based edits at scale. Dataset and code will be publicly released at: https://github.com/Zeus1037/SEED.
Submitted 31 May, 2025;
originally announced June 2025.
-
Manipulation of photonic spin Hall effect in the Rydberg atomic medium
Authors:
Wenzhang Liu,
Muqaddar Abbas,
Jiawei Lai,
Pei Zhang
Abstract:
We present a theoretical study demonstrating enhanced tunability of the photonic spin Hall effect (PSHE) using a strongly interacting Rydberg atomic medium under electromagnetically induced transparency (EIT) conditions. In contrast to conventional approaches that rely on static refractive-index profiles or metamaterials, here the PSHE is controlled via a nonlocal third-order nonlinear susceptibility arising from long-range Rydberg-Rydberg interactions. We show that this nonlocal nonlinearity enables dynamic modulation of spin-dependent light trajectories, amplifying the normally weak PSHE into a readily observable and adjustable effect. These results pave the way for new capabilities in photonic information processing and sensing. In particular, an adjustable PSHE may enable beam steering based on photon spin, improve the sensitivity of precision measurements, and support photonic devices whose functionality can be reconfigured in real time.
Submitted 5 June, 2025; v1 submitted 31 May, 2025;
originally announced June 2025.
-
MotionPersona: Characteristics-aware Locomotion Control
Authors:
Mingyi Shi,
Wei Liu,
Jidong Mei,
Wangpok Tse,
Rui Chen,
Xuelin Chen,
Taku Komura
Abstract:
We present MotionPersona, a novel real-time character controller that allows users to characterize a character by specifying attributes such as physical traits, mental states, and demographics, and projects these properties into the generated motions for animating the character. In contrast to existing deep learning-based controllers, which typically produce homogeneous animations tailored to a si…
▽ More
We present MotionPersona, a novel real-time character controller that allows users to characterize a character by specifying attributes such as physical traits, mental states, and demographics, and projects these properties into the generated motions for animating the character. In contrast to existing deep learning-based controllers, which typically produce homogeneous animations tailored to a single, predefined character, MotionPersona accounts for the impact of various traits on human motion as observed in the real world. To achieve this, we develop a block autoregressive motion diffusion model conditioned on SMPLX parameters, textual prompts, and user-defined locomotion control signals. We also curate a comprehensive dataset featuring a wide range of locomotion types and actor traits to enable the training of this characteristic-aware controller. Unlike prior work, MotionPersona is the first method capable of generating motion that faithfully reflects user-specified characteristics (e.g., an elderly person's shuffling gait) while responding in real time to dynamic control inputs. Additionally, we introduce a few-shot characterization technique as a complementary conditioning mechanism, enabling customization via short motion clips when language prompts fall short. Through extensive experiments, we demonstrate that MotionPersona outperforms existing methods in characteristics-aware locomotion control, achieving superior motion quality and diversity. Results, code, and demo can be found at: https://motionpersona25.github.io/.
△ Less
Submitted 30 May, 2025;
originally announced June 2025.
-
New Physics Search at the CEPC: a General Perspective
Authors:
Stefan Antusch,
Peter Athron,
Daniele Barducci,
Long Chen,
Mingshui Chen,
Xiang Chen,
Huajie Cheng,
Kingman Cheung,
Joao Guimaraes da Costa,
Arindam Das,
Frank F. Deppisch,
P. S. Bhupal Dev,
Xiaokang Du,
Yong Du,
Yaquan Fang,
Andrew Fowlie,
Yu Gao,
Bruce Mellado Garcia,
Shao-Feng Ge,
Jiayin Gu,
Yu-Chen Guo,
Jan Hajer,
Chengcheng Han,
Tao Han,
Sven Heinemeyer
, et al. (68 additional authors not shown)
Abstract:
The Circular Electron-Positron Collider (CEPC), a proposed next-generation Higgs factory, provides new opportunities to explore physics beyond the Standard Model (SM). With its clean electron-positron collision environment and the ability to collect large samples of Higgs, W, and Z bosons, the CEPC enables precision measurements and searches for new physics. This white paper outlines the CEPC's di…
▽ More
The Circular Electron-Positron Collider (CEPC), a proposed next-generation Higgs factory, provides new opportunities to explore physics beyond the Standard Model (SM). With its clean electron-positron collision environment and the ability to collect large samples of Higgs, W, and Z bosons, the CEPC enables precision measurements and searches for new physics. This white paper outlines the CEPC's discovery potential, including studies of exotic decays of the Higgs, Z, and top quarks, dark matter and dark sector phenomena, long-lived particles, supersymmetry, and neutrino-related signatures. Advanced detector technologies and reconstruction techniques, such as one-to-one correspondence reconstruction and jet origin identification, significantly improve sensitivity to rare and weakly interacting processes. The CEPC is particularly well suited to probe the electroweak phase transition and test models of electroweak baryogenesis and dark sector interactions. In addition, global fit analyses highlight the CEPC's complementary role in constraining a wide range of new physics scenarios. These features position the CEPC as a powerful tool for exploring the next frontier in fundamental particle physics in the post-Higgs discovery era.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
Coordinated Beamforming for RIS-Empowered ISAC Systems over Secure Low-Altitude Networks
Authors:
Chunjie Wang,
Xuhui Zhang,
Wenchao Liu,
Jinke Ren,
Huijun Xing,
Shuqiang Wang,
Yanyan Shen
Abstract:
Emerging as a cornerstone for next-generation wireless networks, integrated sensing and communication (ISAC) systems demand innovative solutions to balance spectral efficiency and sensing accuracy. In this paper, we propose a coordinated beamforming framework for a reconfigurable intelligent surface (RIS)-empowered ISAC system, where the active precoding at the dual-functional base station (DFBS)…
▽ More
Emerging as a cornerstone for next-generation wireless networks, integrated sensing and communication (ISAC) systems demand innovative solutions to balance spectral efficiency and sensing accuracy. In this paper, we propose a coordinated beamforming framework for a reconfigurable intelligent surface (RIS)-empowered ISAC system, where the active precoding at the dual-functional base station (DFBS) and the passive beamforming at the RIS are jointly optimized to provide communication services for legitimate unmanned aerial vehicles (UAVs) while sensing the unauthorized UAVs. The sum-rate of all legitimate UAVs are maximized, while satisfying the radar sensing signal-to-noise ratio requirements, the transmit power constraints, and the reflection coefficients of the RIS. To address the inherent non-convexity from coupled variables, we propose a low-complexity algorithm integrating fractional programming with alternating optimization, featuring convergence guarantees. Numerical results demonstrate that the proposed algorithm achieves higher data rate compared to disjoint optimization benchmarks. This underscores RIS's pivotal role in harmonizing communication and target sensing functionalities for low-altitude networks.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.