-
Coresets for Robust Clustering via Black-box Reductions to Vanilla Case
Authors:
Shaofeng H. -C. Jiang,
Jianing Lou
Abstract:
We devise $ε$-coresets for robust $(k,z)$-Clustering with $m$ outliers through black-box reductions to vanilla case. Given an $ε$-coreset construction for vanilla clustering with size $N$, we construct coresets of size $N\cdot \mathrm{poly}\log(kmε^{-1}) + O_z\left(\min\{kmε^{-1}, mε^{-2z}\log^z(kmε^{-1}) \}\right)$ for various metric spaces, where $O_z$ hides $2^{O(z\log z)}$ factors. This increa…
▽ More
We devise $ε$-coresets for robust $(k,z)$-Clustering with $m$ outliers through black-box reductions to vanilla case. Given an $ε$-coreset construction for vanilla clustering with size $N$, we construct coresets of size $N\cdot \mathrm{poly}\log(kmε^{-1}) + O_z\left(\min\{kmε^{-1}, mε^{-2z}\log^z(kmε^{-1}) \}\right)$ for various metric spaces, where $O_z$ hides $2^{O(z\log z)}$ factors. This increases the size of the vanilla coreset by a small multiplicative factor of $\mathrm{poly}\log(kmε^{-1})$, and the additive term is up to a $(ε^{-1}\log (km))^{O(z)}$ factor to the size of the optimal robust coreset. Plugging in vanilla coreset results of [Cohen-Addad et al., STOC'21], we obtain the first coresets for $(k,z)$-Clustering with $m$ outliers with size near-linear in $k$ while previous results have size at least $Ω(k^2)$ [Huang et al., ICLR'23; Huang et al., SODA'25].
Technically, we establish two conditions under which a vanilla coreset is as well a robust coreset. The first condition requires the dataset to satisfy special structures - it can be broken into "dense" parts with bounded diameter. We combine this with a new bounded-diameter decomposition that has only $O_z(km ε^{-1})$ non-dense points to obtain the $O_z(km ε^{-1})$ additive bound. Another condition requires the vanilla coreset to possess an extra size-preserving property. We further give a black-box reduction that turns a vanilla coreset to the one satisfying the said size-preserving property, leading to the alternative $O_z(mε^{-2z}\log^{z}(kmε^{-1}))$ additive bound.
We also implement our reductions in the dynamic streaming setting and obtain the first streaming algorithms for $k$-Median and $k$-Means with $m$ outliers, using space $\tilde{O}(k+m)\cdot\mathrm{poly}(dε^{-1}\logΔ)$ for inputs on the grid $[Δ]^d$.
△ Less
Submitted 11 February, 2025;
originally announced February 2025.
-
Search for $e^+e^-\to K_S^0 K_S^0 h_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data at 13 center-of-mass energies ranging from 4.600 to 4.950 GeV collected with the BESIII detector, we search for the unmeasured $e^+e^-\to K_S^0 K_S^0 h_c$ process . No significant signal is observed, and the upper limits of the Born cross sections at each center-of-mass energy are presented.
Using $e^+e^-$ collision data at 13 center-of-mass energies ranging from 4.600 to 4.950 GeV collected with the BESIII detector, we search for the unmeasured $e^+e^-\to K_S^0 K_S^0 h_c$ process . No significant signal is observed, and the upper limits of the Born cross sections at each center-of-mass energy are presented.
△ Less
Submitted 11 February, 2025;
originally announced February 2025.
-
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
Authors:
Xuefeng Liu,
Songhao Jiang,
Siyu Chen,
Zhuoran Yang,
Yuxin Chen,
Ian Foster,
Rick Stevens
Abstract:
Finetuning a Large Language Model (LLM) is crucial for generating results towards specific objectives. This research delves into the realm of drug optimization and introduce a novel reinforcement learning algorithm to finetune a drug optimization LLM-based generative model, enhancing the original drug across target objectives, while retains the beneficial chemical properties of the original drug.…
▽ More
Finetuning a Large Language Model (LLM) is crucial for generating results towards specific objectives. This research delves into the realm of drug optimization and introduce a novel reinforcement learning algorithm to finetune a drug optimization LLM-based generative model, enhancing the original drug across target objectives, while retains the beneficial chemical properties of the original drug. This work is comprised of two primary components: (1) DrugImprover: A framework tailored for improving robustness and efficiency in drug optimization. It includes a LLM designed for drug optimization and a novel Structured Policy Optimization (SPO) algorithm, which is theoretically grounded. This algorithm offers a unique perspective for fine-tuning the LLM-based generative model by aligning the improvement of the generated molecule with the input molecule under desired objectives. (2) A dataset of 1 million compounds, each with OEDOCK docking scores on 5 human proteins associated with cancer cells and 24 binding sites from SARS-CoV-2 virus. We conduct a comprehensive evaluation of SPO and demonstrate its effectiveness in improving the original drug across target properties. Our code and dataset will be publicly available at: https://github.com/xuefeng-cs/DrugImproverGPT.
△ Less
Submitted 10 February, 2025;
originally announced February 2025.
-
ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization
Authors:
Xuefeng Liu,
Songhao Jiang,
Ian Foster,
Jinbo Xu,
Rick Stevens
Abstract:
Drug optimization has become increasingly crucial in light of fast-mutating virus strains and drug-resistant cancer cells. Nevertheless, it remains challenging as it necessitates retaining the beneficial properties of the original drug while simultaneously enhancing desired attributes beyond its scope. In this work, we aim to tackle this challenge by introducing ScaffoldGPT, a novel Generative Pre…
▽ More
Drug optimization has become increasingly crucial in light of fast-mutating virus strains and drug-resistant cancer cells. Nevertheless, it remains challenging as it necessitates retaining the beneficial properties of the original drug while simultaneously enhancing desired attributes beyond its scope. In this work, we aim to tackle this challenge by introducing ScaffoldGPT, a novel Generative Pretrained Transformer (GPT) designed for drug optimization based on molecular scaffolds. Our work comprises three key components: (1) A three-stage drug optimization approach that integrates pretraining, finetuning, and decoding optimization. (2) A uniquely designed two-phase incremental training approach for pre-training the drug optimization GPT on molecule scaffold with enhanced performance. (3) A token-level decoding optimization strategy, TOP-N, that enabling controlled, reward-guided generation using pretrained/finetuned GPT. We demonstrate via a comprehensive evaluation on COVID and cancer benchmarks that ScaffoldGPT outperforms the competing baselines in drug optimization benchmarks, while excelling in preserving original functional scaffold and enhancing desired properties.
△ Less
Submitted 11 April, 2025; v1 submitted 9 February, 2025;
originally announced February 2025.
-
Faster Approximation Algorithms for k-Center via Data Reduction
Authors:
Arnold Filtser,
Shaofeng H. -C. Jiang,
Yi Li,
Anurag Murty Naredla,
Ioannis Psarros,
Qiaoyuan Yang,
Qin Zhang
Abstract:
We study efficient algorithms for the Euclidean $k$-Center problem, focusing on the regime of large $k$. We take the approach of data reduction by considering $α$-coreset, which is a small subset $S$ of the dataset $P$ such that any $β$-approximation on $S$ is an $(α+ β)$-approximation on $P$. We give efficient algorithms to construct coresets whose size is $k \cdot o(n)$, which immediately speeds…
▽ More
We study efficient algorithms for the Euclidean $k$-Center problem, focusing on the regime of large $k$. We take the approach of data reduction by considering $α$-coreset, which is a small subset $S$ of the dataset $P$ such that any $β$-approximation on $S$ is an $(α+ β)$-approximation on $P$. We give efficient algorithms to construct coresets whose size is $k \cdot o(n)$, which immediately speeds up existing approximation algorithms. Notably, we obtain a near-linear time $O(1)$-approximation when $k = n^c$ for any $0 < c < 1$. We validate the performance of our coresets on real-world datasets with large $k$, and we observe that the coreset speeds up the well-known Gonzalez algorithm by up to $4$ times, while still achieving similar clustering cost. Technically, one of our coreset results is based on a new efficient construction of consistent hashing with competitive parameters. This general tool may be of independent interest for algorithm design in high dimensional Euclidean spaces.
△ Less
Submitted 9 February, 2025;
originally announced February 2025.
-
HCMRM: A High-Consistency Multimodal Relevance Model for Search Ads
Authors:
Guobing Gan,
Kaiming Gao,
Li Wang,
Shen Jiang,
Peng Jiang
Abstract:
Search advertising is essential for merchants to reach the target users on short video platforms. Short video ads aligned with user search intents are displayed through relevance matching and bid ranking mechanisms. This paper focuses on improving query-to-video relevance matching to enhance the effectiveness of ranking in ad systems. Recent vision-language pre-training models have demonstrated pr…
▽ More
Search advertising is essential for merchants to reach the target users on short video platforms. Short video ads aligned with user search intents are displayed through relevance matching and bid ranking mechanisms. This paper focuses on improving query-to-video relevance matching to enhance the effectiveness of ranking in ad systems. Recent vision-language pre-training models have demonstrated promise in various multimodal tasks. However, their contribution to downstream query-video relevance tasks is limited, as the alignment between the pair of visual signals and text differs from the modeling of the triplet of the query, visual signals, and video text. In addition, our previous relevance model provides limited ranking capabilities, largely due to the discrepancy between the binary cross-entropy fine-tuning objective and the ranking objective. To address these limitations, we design a high-consistency multimodal relevance model (HCMRM). It utilizes a simple yet effective method to enhance the consistency between pre-training and relevance tasks. Specifically, during the pre-training phase, along with aligning visual signals and video text, several keywords are extracted from the video text as pseudo-queries to perform the triplet relevance modeling. For the fine-tuning phase, we introduce a hierarchical softmax loss, which enables the model to learn the order within labels while maximizing the distinction between positive and negative samples. This promotes the fusion ranking of relevance and bidding in the subsequent ranking stage. The proposed method has been deployed in the Kuaishou search advertising system for over a year, contributing to a 6.1% reduction in the proportion of irrelevant ads and a 1.4% increase in ad revenue.
△ Less
Submitted 9 February, 2025;
originally announced February 2025.
-
Observation of $D\to \bar{K}_{1}(1270)μ^+ν_μ$ and test of lepton flavor universality with $D\to \bar{K}_1(1270) \ell^{+} ν_{\ell}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
By analyzing 7.93 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector operated at the BEPCII collider, we report the observation of the semimuonic decays of $D^+\to \bar K_1(1270)^0μ^+ν_μ$ and $D^0\to K_1(1270)^-μ^+ν_μ$ with statistical significances of $12.5σ$ and $6.0σ$, respectively. Their decay branching fractions are determined…
▽ More
By analyzing 7.93 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector operated at the BEPCII collider, we report the observation of the semimuonic decays of $D^+\to \bar K_1(1270)^0μ^+ν_μ$ and $D^0\to K_1(1270)^-μ^+ν_μ$ with statistical significances of $12.5σ$ and $6.0σ$, respectively. Their decay branching fractions are determined to be ${\mathcal B}[D^{+}\to \bar{K}_1(1270)^0 μ^{+}ν_μ]=(2.36\pm0.20^{+0.18}_{-0.27}\pm 0.48)\times10^{-3}$ and ${\mathcal B}[D^{0}\to K_1(1270)^{-} μ^{+}ν_μ]=(0.78\pm0.11^{+0.05}_{-0.09}\pm 0.15)\times10^{-3}$, where the first and second uncertainties are statistical and systematic, respectively, and the third originates from the input branching fraction of $\bar K_{1}(1270)^0\to K^- π^+π^0$ or $K_1(1270)^-\to K^-π^+π^-$. Combining our branching fractions with the previous measurements of ${\mathcal B}[D^+\to \bar K_1(1270)^0e^+ν_{e}]$ and ${\mathcal B}[D^0\to K_1(1270)^-e^+ν_{e}]$, we determine the branching fraction ratios to be ${\mathcal B}[D^+\to \bar K_1(1270)^0μ^+ν_μ]/{\mathcal B}[D^+\to \bar K_1(1270)^0e^+ν_{e}]=1.03 \pm 0.14 \substack{+0.11\\-0.15}$ and ${\mathcal B}[D^0\to K_1(1270)^-μ^+ν_μ]/{\mathcal B}[D^0\to K_1(1270)^-e^+ν_{e}]=0.74\pm 0.13 \substack{+0.08\\-0.13}$. Using the branching fractions measured in this work and the world-average lifetimes of the $D^+$ and $D^0$ mesons, we determine the semimuonic partial decay width ratio to be $Γ[D^+\to \bar K_1(1270)^0 μ^+ν_μ]/Γ[D^0\to K_1(1270)^- μ^+ν_μ]=1.22\pm 0.10\substack{+0.06\\-0.09}$, which is consistent with unity as predicted by isospin conservation.
△ Less
Submitted 18 April, 2025; v1 submitted 6 February, 2025;
originally announced February 2025.
-
The Cost of Shuffling in Private Gradient Based Optimization
Authors:
Shuli Jiang,
Pranay Sharma,
Zhiwei Steven Wu,
Gauri Joshi
Abstract:
We consider the problem of differentially private (DP) convex empirical risk minimization (ERM). While the standard DP-SGD algorithm is theoretically well-established, practical implementations often rely on shuffled gradient methods that traverse the training data sequentially rather than sampling with replacement in each iteration. Despite their widespread use, the theoretical privacy-accuracy t…
▽ More
We consider the problem of differentially private (DP) convex empirical risk minimization (ERM). While the standard DP-SGD algorithm is theoretically well-established, practical implementations often rely on shuffled gradient methods that traverse the training data sequentially rather than sampling with replacement in each iteration. Despite their widespread use, the theoretical privacy-accuracy trade-offs of private shuffled gradient methods (\textit{DP-ShuffleG}) remain poorly understood, leading to a gap between theory and practice. In this work, we leverage privacy amplification by iteration (PABI) and a novel application of Stein's lemma to provide the first empirical excess risk bound of \textit{DP-ShuffleG}. Our result shows that data shuffling results in worse empirical excess risk for \textit{DP-ShuffleG} compared to DP-SGD. To address this limitation, we propose \textit{Interleaved-ShuffleG}, a hybrid approach that integrates public data samples in private optimization. By alternating optimization steps that use private and public samples, \textit{Interleaved-ShuffleG} effectively reduces empirical excess risk. Our analysis introduces a new optimization framework with surrogate objectives, adaptive noise injection, and a dissimilarity metric, which can be of independent interest. Our experiments on diverse datasets and tasks demonstrate the superiority of \textit{Interleaved-ShuffleG} over several baselines.
△ Less
Submitted 5 February, 2025;
originally announced February 2025.
-
Learning While Repositioning in On-Demand Vehicle Sharing Networks
Authors:
Hansheng Jiang,
Chunlin Sun,
Zuo-Jun Max Shen,
Shunan Jiang
Abstract:
We consider a network inventory problem motivated by one-way, on-demand vehicle sharing services. Due to uncertainties in both demand and returns, as well as a fixed number of rental units across an $n$-location network, the service provider must periodically reposition vehicles to match supply with demand spatially while minimizing costs. The optimal repositioning policy under a general $n$-locat…
▽ More
We consider a network inventory problem motivated by one-way, on-demand vehicle sharing services. Due to uncertainties in both demand and returns, as well as a fixed number of rental units across an $n$-location network, the service provider must periodically reposition vehicles to match supply with demand spatially while minimizing costs. The optimal repositioning policy under a general $n$-location network is intractable without knowing the optimal value function. We introduce the best base-stock repositioning policy as a generalization of the classical inventory control policy to $n$ dimensions, and establish its asymptotic optimality in two distinct limiting regimes under general network structures. We present reformulations to efficiently compute this best base-stock policy in an offline setting with pre-collected data.
In the online setting, we show that a natural Lipschitz-bandit approach achieves a regret guarantee of $\widetilde{O}(T^{\frac{n}{n+1}})$, which suffers from the exponential dependence on $n$. We illustrate the challenges of learning with censored data in networked systems through a regret lower bound analysis and by demonstrating the suboptimality of alternative algorithmic approaches. Motivated by these challenges, we propose an Online Gradient Repositioning algorithm that relies solely on censored demand. Under a mild cost-structure assumption, we prove that it attains an optimal regret of $O(n^{2.5} \sqrt{T})$, which matches the regret lower bound in $T$ and achieves only polynomial dependence on $n$. The key algorithmic innovation involves proposing surrogate costs to disentangle intertemporal dependencies and leveraging dual solutions to find the gradient of policy change. Numerical experiments demonstrate the effectiveness of our proposed methods.
△ Less
Submitted 31 January, 2025;
originally announced January 2025.
-
DeepFRC: An End-to-End Deep Learning Model for Functional Registration and Classification
Authors:
Siyuan Jiang,
Yihan Hu,
Wenjie Li,
Pengcheng Zeng
Abstract:
Functional data analysis (FDA) is essential for analyzing continuous, high-dimensional data, yet existing methods often decouple functional registration and classification, limiting their efficiency and performance. We present DeepFRC, an end-to-end deep learning framework that unifies these tasks within a single model. Our approach incorporates an alignment module that learns time warping functio…
▽ More
Functional data analysis (FDA) is essential for analyzing continuous, high-dimensional data, yet existing methods often decouple functional registration and classification, limiting their efficiency and performance. We present DeepFRC, an end-to-end deep learning framework that unifies these tasks within a single model. Our approach incorporates an alignment module that learns time warping functions via elastic function registration and a learnable basis representation module for dimensionality reduction on aligned data. This integration enhances both alignment accuracy and predictive performance. Theoretical analysis establishes that DeepFRC achieves low misalignment and generalization error, while simulations elucidate the progression of registration, reconstruction, and classification during training. Experiments on real-world datasets demonstrate that DeepFRC consistently outperforms state-of-the-art methods, particularly in addressing complex registration challenges. Code is available at: https://github.com/Drivergo-93589/DeepFRC.
△ Less
Submitted 29 January, 2025;
originally announced January 2025.
-
Equivariant localization in Batalin-Vilkovisky formalism
Authors:
Alberto S. Cattaneo,
Shuhan Jiang
Abstract:
We derive equivariant localization formulas of Atiyah--Bott and cohomological field theory types in the Batalin-Vilkovisky formalism and discuss their applications in Poisson geometry and quantum field theory.
We derive equivariant localization formulas of Atiyah--Bott and cohomological field theory types in the Batalin-Vilkovisky formalism and discuss their applications in Poisson geometry and quantum field theory.
△ Less
Submitted 30 January, 2025; v1 submitted 28 January, 2025;
originally announced January 2025.
-
Demystifying OS Kernel Fuzzing with a Novel Taxonomy
Authors:
Jiacheng Xu,
He Sun,
Shihao Jiang,
Qinying Wang,
Mingming Zhang,
Xiang Li,
Kaiwen Shen,
Peng Cheng,
Jiming Chen,
Charles Zhang,
Shouling Ji
Abstract:
The Operating System (OS) kernel is foundational in modern computing, especially with the proliferation of diverse computing devices. However, its development also comes with vulnerabilities that can lead to severe security breaches. Kernel fuzzing, a technique used to uncover these vulnerabilities, poses distinct challenges when compared to userspace fuzzing. These include the complexity of confi…
▽ More
The Operating System (OS) kernel is foundational in modern computing, especially with the proliferation of diverse computing devices. However, its development also comes with vulnerabilities that can lead to severe security breaches. Kernel fuzzing, a technique used to uncover these vulnerabilities, poses distinct challenges when compared to userspace fuzzing. These include the complexity of configuring the testing environment and addressing the statefulness inherent to both the kernel and the fuzzing process. Despite the significant interest from the security community, a comprehensive understanding of kernel fuzzing remains lacking, hindering further progress in the field. In this paper, we present the first systematic study dedicated to OS kernel fuzzing. It begins by summarizing the progress of 99 academic studies from top-tier venues between 2017 and 2024. Following this, we introduce a stage-based fuzzing model and a novel fuzzing taxonomy that highlights nine core functionalities unique to kernel fuzzing. These functionalities are examined alongside their corresponding methodological approaches based on qualitative evaluation criteria. Our systematization identifies challenges in meeting functionality requirements and proposes potential technical solutions. Finally, we outline promising and practical future directions to guide forthcoming research in kernel security, supported in part by insights derived from our case study.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
Mode Switching-Induced Instability of Multi-source Feed DC Microgrid
Authors:
Shanshan Jiang,
Zelin Sun,
Jiankun Zhang,
Hua Geng
Abstract:
In DC microgrids (DCMGs), DC-bus signaling based control strategy is extensively used for power management, where mode switching plays a crucial role in achieving multi-source coordination. However, few studies have noticed the impact of mode switching and switching strategies on system voltage stability. To fill this gap, this paper aims to provide a general analysis framework for mode switching-…
▽ More
In DC microgrids (DCMGs), DC-bus signaling based control strategy is extensively used for power management, where mode switching plays a crucial role in achieving multi-source coordination. However, few studies have noticed the impact of mode switching and switching strategies on system voltage stability. To fill this gap, this paper aims to provide a general analysis framework for mode switching-induced instability in multi-source DCMGs. First, manifold theory is employed to analyze the stability of the DCMG switched system. Subsequently, the instability mechanism and its physical interpretation are explored. The positive feedback activated by the decreasing DC bus voltage during the switching process leads to instability. Switching strategy may inadvertently contribute to this instability. To improve stability, a novel control method based on mode scheduling is proposed, by adjusting switching strategy and thereby correcting the system trajectory. Finally, both real-time simulations and experimental tests on a DCMG system verify the correctness and effectiveness of theoretical analysis results.
△ Less
Submitted 10 April, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
Observation of $h_{c}$ radiative decays to multiple light hadrons and the tensor state $f_2(1270)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (666 additional authors not shown)
Abstract:
Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furtherm…
▽ More
Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furthermore, intermediate states below 2.8 GeV/$c^{2}$ are investigated, leading to the first observation of the decay process of $h_c\rightarrowγf_{2}(1270)\rightarrowγπ^{+}π^{-}$ with a significance of $5.5\,σ$. This observation represents the first instance of $h_c$ radiative decay to a tensor state.
△ Less
Submitted 26 January, 2025;
originally announced January 2025.
-
Cross section measurement of $e^{+}e^{-} \to f_{1}(1285)π^{+}π^{-}$ at center-of-mass energies between $3.808$ and $4.951\rm GeV$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Using data samples collected by the \mbox{BESIII} detector located at the Beijing Electron Positron Collider, the cross sections of the process $e^+e^-\to f_{1}(1285)π^+π^-$ are measured at forty-five center-of-mass energies from $3.808$ to $4.951 {\rm GeV}$. An investigation on the cross section line shape is performed, and no significant structure is observed.
Using data samples collected by the \mbox{BESIII} detector located at the Beijing Electron Positron Collider, the cross sections of the process $e^+e^-\to f_{1}(1285)π^+π^-$ are measured at forty-five center-of-mass energies from $3.808$ to $4.951 {\rm GeV}$. An investigation on the cross section line shape is performed, and no significant structure is observed.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
MedS$^3$: Towards Medical Small Language Models with Self-Evolved Slow Thinking
Authors:
Shuyang Jiang,
Yusheng Liao,
Zhe Chen,
Ya Zhang,
Yanfeng Wang,
Yu Wang
Abstract:
Medical language models (MLMs) have become pivotal in advancing medical natural language processing. However, prior models that rely on pre-training or supervised fine-tuning often exhibit low data efficiency and limited practicality in real-world clinical applications. While OpenAI's o1 highlights test-time scaling in mathematics, attempts to replicate this approach in medicine typically distill…
▽ More
Medical language models (MLMs) have become pivotal in advancing medical natural language processing. However, prior models that rely on pre-training or supervised fine-tuning often exhibit low data efficiency and limited practicality in real-world clinical applications. While OpenAI's o1 highlights test-time scaling in mathematics, attempts to replicate this approach in medicine typically distill responses from GPT-series models to open-source models, focusing primarily on multiple-choice tasks. This strategy, though straightforward, neglects critical concerns like data privacy and realistic deployment in clinical settings. In this work, we present a deployable, small-scale medical reasoning system, MedS3, designed for long-chain reasoning in clinical tasks using a self-evolution paradigm. Starting with a seed dataset of around 8,000 instances spanning five domains and 16 datasets, we prompt a base policy model to perform Monte Carlo Tree Search (MCTS) to construct rule-verifiable reasoning chains. Each reasoning step is assigned an evolution rollout value, allowing verified trajectories to train the policy model and the process reward model (PRM). During inference, the policy model generates multiple responses, and the reward model selects the one with a newly proposed PRM-guided Vote-Sum (P-VS) strategy. Experiments on eleven evaluation datasets demonstrate that MedS3 outperforms not only the prior strongest medical model by 6.59, but also 32B-level general reasoning models by 8.71 points. Code and data are available at https://github.com/pixas/MedSSS.
△ Less
Submitted 17 February, 2025; v1 submitted 21 January, 2025;
originally announced January 2025.
-
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model
Authors:
Lipeng Ma,
Weidong Yang,
Yixuan Li,
Ben Fei,
Mingjie Zhou,
Shuhao Li,
Sihang Jiang,
Bo Xu,
Yanghua Xiao
Abstract:
Automated log analysis is crucial to ensure high availability and reliability of complex systems. The advent of LLMs in NLP has ushered in a new era of language model-driven automated log analysis, garnering significant interest. Within this field, two primary paradigms based on language models for log analysis have become prominent. Small Language Models (SLMs) follow the pre-train and fine-tune…
▽ More
Automated log analysis is crucial to ensure high availability and reliability of complex systems. The advent of LLMs in NLP has ushered in a new era of language model-driven automated log analysis, garnering significant interest. Within this field, two primary paradigms based on language models for log analysis have become prominent. Small Language Models (SLMs) follow the pre-train and fine-tune paradigm, focusing on the specific log analysis task through fine-tuning on supervised datasets. On the other hand, LLMs following the in-context learning paradigm, analyze logs by providing a few examples in prompt contexts without updating parameters. Despite their respective strengths, we notice that SLMs are more cost-effective but less powerful, whereas LLMs with large parameters are highly powerful but expensive and inefficient. To trade-off between the performance and inference costs of both models in automated log analysis, this paper introduces an adaptive log analysis framework known as AdaptiveLog, which effectively reduces the costs associated with LLM while ensuring superior results. This framework collaborates an LLM and a small language model, strategically allocating the LLM to tackle complex logs while delegating simpler logs to the SLM. Specifically, to efficiently query the LLM, we propose an adaptive selection strategy based on the uncertainty estimation of the SLM, where the LLM is invoked only when the SLM is uncertain. In addition, to enhance the reasoning ability of the LLM in log analysis tasks, we propose a novel prompt strategy by retrieving similar error-prone cases as the reference, enabling the model to leverage past error experiences and learn solutions from these cases. Extensive experiments demonstrate that AdaptiveLog achieves state-of-the-art results across different tasks, elevating the overall accuracy of log analysis while maintaining cost efficiency.
△ Less
Submitted 19 January, 2025;
originally announced January 2025.
-
Study of $η\rightarrowπ^+π^-l^+l^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (637 additional authors not shown)
Abstract:
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η\rightarrowπ^+π^-l^+l^-$ ($l=e$ or $μ$) via the process $J/ψ\rightarrowγη$. The branching fraction of $η\rightarrowπ^+π^-e^+e^-$ is measured to be $\mathcal{B}(η\rightarrowπ^+π^-e^+e^-)=(3.07\pm0.12_{\rm{stat.}}\pm0.19_{\rm{syst.}}) \times10^{-4}$. No signal events are observed f…
▽ More
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η\rightarrowπ^+π^-l^+l^-$ ($l=e$ or $μ$) via the process $J/ψ\rightarrowγη$. The branching fraction of $η\rightarrowπ^+π^-e^+e^-$ is measured to be $\mathcal{B}(η\rightarrowπ^+π^-e^+e^-)=(3.07\pm0.12_{\rm{stat.}}\pm0.19_{\rm{syst.}}) \times10^{-4}$. No signal events are observed for the $η\rightarrowπ^{+}π^{-}μ^{+}μ^{-}$ decay, leading to an upper limit on the branching fraction of $\mathcal{B}(η\rightarrowπ^{+}π^{-}μ^{+}μ^{-})<4.0\times10^{-7}$ at the 90\% confidence level. Furthermore, the $CP$-violation asymmetry parameter is found to be $\mathcal{A}_{CP}(η\rightarrowπ^{+}π^{-}e^{+}e^{-})=(-4.04\pm4.69_{\rm{stat.}}\pm0.14_{\rm{syst.}})\%$, showing no evidence of $CP$-violation with current statistics. Additionally, we extract the transition form factor from the decay amplitude of $η\rightarrowπ^+π^-e^+e^-$. Finally, axion-like particles are searched for via the decay $η\rightarrowπ^+π^-a, a\rightarrow e^+e^-$, and upper limits on this branching fraction relative to that of $η\rightarrowπ^+π^-e^+e^-$ are presented as a function of the axion-like particle mass in the range $5-200\ \mathrm{MeV}/c^{2}$.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Search for the FCNC charmonium decay $J/ψ\to D^0 μ^+ μ^- + \text{c.c.}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
Based on a data sample of $(10087 \pm 44) \times 10^6$ $J/ψ$ events taken with the BESIII detector, we search for the flavor-changing neutral current charmonium decay $J/ψ\to D^{0} μ^{+} μ^{-} + \text{c.c.}$. No significant signal above the background is observed, and the upper limit on its branching fraction is set to be $\mathcal{B}(J/ψ\to D^{0}μ^{+}μ^{-} + \text{c.c.} ) < 1.1 \times 10^{-7}$ at…
▽ More
Based on a data sample of $(10087 \pm 44) \times 10^6$ $J/ψ$ events taken with the BESIII detector, we search for the flavor-changing neutral current charmonium decay $J/ψ\to D^{0} μ^{+} μ^{-} + \text{c.c.}$. No significant signal above the background is observed, and the upper limit on its branching fraction is set to be $\mathcal{B}(J/ψ\to D^{0}μ^{+}μ^{-} + \text{c.c.} ) < 1.1 \times 10^{-7}$ at the 90% confidence level. This marks the first search for a flavor-changing neutral current charmonium decay involving muons in the final state.
△ Less
Submitted 14 February, 2025; v1 submitted 14 January, 2025;
originally announced January 2025.
-
Subtype-Aware Registration of Longitudinal Electronic Health Records
Authors:
Xin Gai,
Shiyi Jiang,
Anru R. Zhang
Abstract:
Electronic Health Records (EHRs) contain extensive patient information that can inform downstream clinical decisions, such as mortality prediction, disease phenotyping, and disease onset prediction. A key challenge in EHR data analysis is the temporal gap between when a condition is first recorded and its actual onset time. Such timeline misalignment can lead to artificially distinct biomarker tre…
▽ More
Electronic Health Records (EHRs) contain extensive patient information that can inform downstream clinical decisions, such as mortality prediction, disease phenotyping, and disease onset prediction. A key challenge in EHR data analysis is the temporal gap between when a condition is first recorded and its actual onset time. Such timeline misalignment can lead to artificially distinct biomarker trends among patients with similar disease progression, undermining the reliability of downstream analysis and complicating tasks like disease subtyping. To address this challenge, we provide a subtype-aware timeline registration method that leverages data projection and discrete optimization to simultaneously correct timeline misalignment and improve disease subtyping. Through simulation and real-world data analyses, we demonstrate that the proposed method effectively aligns distorted observed records with the true disease progression patterns, enhancing subtyping clarity and improving performance in downstream clinical analyses.
△ Less
Submitted 13 January, 2025;
originally announced January 2025.
-
Switching spin filling sequence in a bilayer graphene quantum dot through trigonal warping
Authors:
Guo-Quan Qin,
Fang-Ming Jing,
Tian-Yue Hao,
Shun-Li Jiang,
Zhuo-Zhi Zhang,
Gang Cao,
Xiang-Xiang Song,
Guo-Ping Guo
Abstract:
We demonstrate a switchable electron shell structure in a bilayer graphene quantum dot by manipulating the trigonal warping effect upon electrical gating. Under a small perpendicular electric field, the lowest s-shell is sequentially filled with two spin-up and two spin-down electrons of opposite valleys. When increasing the electric field, an additional three-fold minivalley degeneracy is generat…
▽ More
We demonstrate a switchable electron shell structure in a bilayer graphene quantum dot by manipulating the trigonal warping effect upon electrical gating. Under a small perpendicular electric field, the lowest s-shell is sequentially filled with two spin-up and two spin-down electrons of opposite valleys. When increasing the electric field, an additional three-fold minivalley degeneracy is generated so that the s-shell can be filled with 12 electrons with the first/last 6 electrons having the same spin polarization. The switched spin filling sequence demonstrates the possibility of using the trigonal warping effect to electrically access and manipulate the spin degree of freedom in bilayer graphene.
△ Less
Submitted 13 January, 2025;
originally announced January 2025.
-
Real-Time Neural-Enhancement for Online Cloud Gaming
Authors:
Shan Jiang,
Zhenhua Han,
Haisheng Tan,
Xinyang Jiang,
Yifan Yang,
Xiaoxi Zhang,
Hongqiu Ni,
Yuqing Yang,
Xiang-Yang Li
Abstract:
Online Cloud gaming demands real-time, high-quality video transmission across variable wide-area networks (WANs). Neural-enhanced video transmission algorithms employing super-resolution (SR) for video quality enhancement have effectively challenged WAN environments. However, these SR-based methods require intensive fine-tuning for the whole video, making it infeasible in diverse online cloud gami…
▽ More
Online Cloud gaming demands real-time, high-quality video transmission across variable wide-area networks (WANs). Neural-enhanced video transmission algorithms employing super-resolution (SR) for video quality enhancement have effectively challenged WAN environments. However, these SR-based methods require intensive fine-tuning for the whole video, making it infeasible in diverse online cloud gaming. To address this, we introduce River, a cloud gaming delivery framework designed based on the observation that video segment features in cloud gaming are typically repetitive and redundant. This permits a significant opportunity to reuse fine-tuned SR models, reducing the fine-tuning latency of minutes to query latency of milliseconds. To enable the idea, we design a practical system that addresses several challenges, such as model organization, online model scheduler, and transfer strategy. River first builds a content-aware encoder that fine-tunes SR models for diverse video segments and stores them in a lookup table. When delivering cloud gaming video streams online, River checks the video features and retrieves the most relevant SR models to enhance the frame quality. Meanwhile, if no existing SR model performs well enough for some video segments, River will further fine-tune new models and update the lookup table. Finally, to avoid the overhead of streaming model weight to the clients, River designs a prefetching strategy that predicts the models with the highest possibility of being retrieved. Our evaluation based on real video game streaming demonstrates River can reduce redundant training overhead by 44% and improve the Peak-Signal-to-Noise-Ratio by 1.81dB compared to the SOTA solutions. Practical deployment shows River meets real-time requirements, achieving approximately 720p 20fps on mobile devices.
△ Less
Submitted 12 January, 2025;
originally announced January 2025.
-
Search for $K^0_S$ invisible decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Based on $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring, we search for $K_{S}^{0}$ invisible decays via the $J/ψ\to φK_{S}^{0} K_{S}^{0}$ process. No significant signal is observed, and the upper limit of the branching fraction of these invisible decays is set at 8.4 $\times$ $10^{-4}$ at the 90\% confidence level. This is the f…
▽ More
Based on $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring, we search for $K_{S}^{0}$ invisible decays via the $J/ψ\to φK_{S}^{0} K_{S}^{0}$ process. No significant signal is observed, and the upper limit of the branching fraction of these invisible decays is set at 8.4 $\times$ $10^{-4}$ at the 90\% confidence level. This is the first experimental search for $K^0_S$ invisible decays.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
GRB 241030A: a prompt thermal X-ray emission component and diverse origin of the very early UVOT WHITE and U band emission
Authors:
Qiu-Li Wang,
Hao Zhou,
Yun Wang,
Jia Ren,
Samaporn Tinyanont,
Dong Xu,
Ning-Chen Sun,
Johan P. U. Fynbo,
Daniele B. Malesani,
Jie An,
Rungrit Anutarawiramku,
Pathompong Butpa,
Shao-Yu Fu,
Shuai-Qing Jiang,
Xing Liu,
Kritsada Palee,
Pakawat Prasit,
Zi-Pei Zhu,
Zhi-Ping Jin,
Da-Ming Wei
Abstract:
We present a detailed analysis of the long-duration GRB 241030A detected by {\it Swift}. Thanks to the rapid response of XRT and UVOT, the strongest part of the prompt emission of GRB 241030A has been well measured simultaneously from optical to hard X-ray band. The time-resolved WHITE band emission shows strong variability, largely tracing the activity of the prompt gamma-ray emission, may be pro…
▽ More
We present a detailed analysis of the long-duration GRB 241030A detected by {\it Swift}. Thanks to the rapid response of XRT and UVOT, the strongest part of the prompt emission of GRB 241030A has been well measured simultaneously from optical to hard X-ray band. The time-resolved WHITE band emission shows strong variability, largely tracing the activity of the prompt gamma-ray emission, may be produced by internal shocks too. The joint analysis of the XRT and BAT data reveals the presence of a thermal component with a temperature of a few keV, which can be interpreted as the photosphere radiation, and the upper limit of the Lorentz factor of this region is found to range between approximately 20 and 80. The time-resolved analysis of the initial U-band exposure data yields a very rapid rise ($ \sim t^{5.3}$) with a bright peak reaching 13.6 AB magnitude around 410 seconds, which is most likely attributed to the onset of the external shock emission. The richness and fineness of early observational data have made this burst a unique sample for studying the various radiation mechanisms of gamma-ray bursts.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Search for the leptonic decay $D^{+}\to e^{+}ν_{e}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
We search for the leptonic decay $D^+\to e^+ν_{e}$ using an $e^+e^-$ collision data sample with an integrated luminosity of 20.3~fb$^{-1}$ collected with the BESIII detector at the center-of-mass energy of 3.773~GeV. No significant signal is observed and an upper limit on the branching fraction of $D^+\to e^+ν_{e}$ is set as $9.7 \times 10^{-7}$, at the 90\% confidence level. Our upper limit is an…
▽ More
We search for the leptonic decay $D^+\to e^+ν_{e}$ using an $e^+e^-$ collision data sample with an integrated luminosity of 20.3~fb$^{-1}$ collected with the BESIII detector at the center-of-mass energy of 3.773~GeV. No significant signal is observed and an upper limit on the branching fraction of $D^+\to e^+ν_{e}$ is set as $9.7 \times 10^{-7}$, at the 90\% confidence level. Our upper limit is an order of magnitude smaller than the previous limit for this decay mode.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Observation of topological prethermal strong zero modes
Authors:
Feitong Jin,
Si Jiang,
Xuhao Zhu,
Zehang Bao,
Fanhao Shen,
Ke Wang,
Zitian Zhu,
Shibo Xu,
Zixuan Song,
Jiachen Chen,
Ziqi Tan,
Yaozu Wu,
Chuanyu Zhang,
Yu Gao,
Ning Wang,
Yiren Zou,
Aosai Zhang,
Tingting Li,
Jiarun Zhong,
Zhengyi Cui,
Yihang Han,
Yiyang He,
Han Wang,
Jianan Yang,
Yanzhe Wang
, et al. (20 additional authors not shown)
Abstract:
Symmetry-protected topological phases cannot be described by any local order parameter and are beyond the conventional symmetry-breaking paradigm for understanding quantum matter. They are characterized by topological boundary states robust against perturbations that respect the protecting symmetry. In a clean system without disorder, these edge modes typically only occur for the ground states of…
▽ More
Symmetry-protected topological phases cannot be described by any local order parameter and are beyond the conventional symmetry-breaking paradigm for understanding quantum matter. They are characterized by topological boundary states robust against perturbations that respect the protecting symmetry. In a clean system without disorder, these edge modes typically only occur for the ground states of systems with a bulk energy gap and would not survive at finite temperatures due to mobile thermal excitations. Here, we report the observation of a distinct type of topological edge modes, which are protected by emergent symmetries and persist even up to infinite temperature, with an array of 100 programmable superconducting qubits. In particular, through digital quantum simulation of the dynamics of a one-dimensional disorder-free "cluster" Hamiltonian, we observe robust long-lived topological edge modes over up to 30 cycles at a wide range of temperatures. By monitoring the propagation of thermal excitations, we show that despite the free mobility of these excitations, their interactions with the edge modes are substantially suppressed in the dimerized regime due to an emergent U(1)$\times$U(1) symmetry, resulting in an unusually prolonged lifetime of the topological edge modes even at infinite temperature. In addition, we exploit these topological edge modes as logical qubits and prepare a logical Bell state, which exhibits persistent coherence in the dimerized and off-resonant regime, despite the system being disorder-free and far from its ground state. Our results establish a viable digital simulation approach to experimentally exploring a variety of finite-temperature topological phases and demonstrate a potential route to construct long-lived robust boundary qubits that survive to infinite temperature in disorder-free systems.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Observation of the $W$-annihilation process $D_s^+ \to ωρ^+$ and measurement of $D_s^+ \to φρ^+$ in $D^+_s\to π^+π^+π^-π^0π^0$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching f…
▽ More
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching fraction of $(0.99\pm0.08_{\rm stat}\pm0.07_{\rm syst})\%$. In comparison to the low significance of the $\mathcal{D}$ wave in the decay $D_s^+ \to φρ^+$, the dominance of the $\mathcal{D}$ wave over the $\mathcal{S}$ and $\mathcal{P}$ waves, with a fraction of $(51.85\pm7.28_{\rm stat}\pm7.90_{\rm syst})\%$ observed in the decay, provides crucial information for the``polarization puzzle", as well as for the understanding of charm meson decays. The branching fraction of $D^+_s\to π^+π^+π^-π^0π^0$ is measured to be $(4.41\pm0.15_{\rm stat}\pm0.13_{\rm syst})\%$. Moreover, the branching fraction of $D_s^+ \to φρ^+$ is measured to be $(3.98\pm0.33_{\rm stat}\pm0.21_{\rm syst})\%$, and the $R_φ= {\mathcal{B}(φ\toπ^+π^-π^0)}/{\mathcal{B}(φ\to K^+K^-)}$ is determined to be $(0.222\pm0.019_{\rm stat}\pm0.016_{\rm syst}$), which is consistent with the previous measurement based on charm meson decays, but deviates from the results from $e^+e^-$ annihilation and $K$-$N$ scattering experiments by more than 3$σ$.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Study of the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
We study the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the \bes detector. The di-electron-invariant-mass dependent transition form factor of this decay is explored for the first time. A significant resonant structure corresponding to the $ρ/ω$ resonance is observed, which cannot be described by existing theoretical models, due to…
▽ More
We study the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the \bes detector. The di-electron-invariant-mass dependent transition form factor of this decay is explored for the first time. A significant resonant structure corresponding to the $ρ/ω$ resonance is observed, which cannot be described by existing theoretical models, due to contributions from the isospin-conserving $J/ψ\to ρπ^0$ and isospin-volating $J/ψ\to ωπ^0$ decays. The observed $ρ$--$ω$ interference is consistent with that of the pion form factor but features a relatively narrow $ρ$ peak. By taking into account the contribution of this resonant structure, the branching fraction of $J/ψ\to e^+e^- π^0$ in the full $e^+e^-$ invariant mass spectrum range is also measured for the first time to be $(8.06 \pm 0.31 (\rm{stat}) \pm 0.38 (\rm{syst}))\times 10^{-7}$, which is two times larger than the prediction of the Vector Meson Dominance model due to the observed resonant contribution of $ρ/ω$ resonances.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Observation of $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where th…
▽ More
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where the first uncertainty is statistical and the second systematic.
△ Less
Submitted 5 January, 2025;
originally announced January 2025.
-
Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications
Authors:
Zhe Chen,
Yusheng Liao,
Shuyang Jiang,
Pingjie Wang,
Yiqiu Guo,
Yanfeng Wang,
Yu Wang
Abstract:
Large language models hold promise for addressing medical challenges, such as medical diagnosis reasoning, research knowledge acquisition, clinical decision-making, and consumer health inquiry support. However, they often generate hallucinations due to limited medical knowledge. Incorporating external knowledge is therefore critical, which necessitates multi-source knowledge acquisition. We addres…
▽ More
Large language models hold promise for addressing medical challenges, such as medical diagnosis reasoning, research knowledge acquisition, clinical decision-making, and consumer health inquiry support. However, they often generate hallucinations due to limited medical knowledge. Incorporating external knowledge is therefore critical, which necessitates multi-source knowledge acquisition. We address this challenge by framing it as a source planning problem, which is to formulate context-appropriate queries tailored to the attributes of diverse sources. Existing approaches either overlook source planning or fail to achieve it effectively due to misalignment between the model's expectation of the sources and their actual content. To bridge this gap, we present MedOmniKB, a repository comprising multigenre and multi-structured medical knowledge sources. Leveraging these sources, we propose the Source Planning Optimisation method, which enhances multi-source utilisation. Our approach involves enabling an expert model to explore and evaluate potential plans while training a smaller model to learn source alignment. Experimental results demonstrate that our method substantially improves multi-source planning performance, enabling the optimised small model to achieve state-of-the-art results in leveraging diverse medical knowledge sources.
△ Less
Submitted 18 February, 2025; v1 submitted 5 January, 2025;
originally announced January 2025.
-
Search for $η_c(2S)\to p\bar{p}K^+K^-$ and measurement of $χ_{cJ}\to p\bar{p}K^+K^-$ in $ψ(3686)$ radiative decays
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (639 additional authors not shown)
Abstract:
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a signific…
▽ More
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a significance of $3.3σ$. The product branching fraction of $\mathcal{B}[ψ(3686)\toγη_c(2S)]\cdot\mathcal{B}[η_c(2S)\to p\bar{p}K^+K^-]$ is determined to be $(1.98\mkern 2mu\pm\mkern 2mu0.41_{\text{stat.}}\mkern 2mu\pm\mkern 2mu0.99_{\text{syst.}})\times 10^{-7}$. The product branching fractions of $\mathcal{B}[ψ(3686)\toγχ_{cJ}]\cdot\mathcal{B}[χ_{cJ}\to p\bar{p}K^+K^-]$ are measured to be $(2.49\mkern 2mu\pm\mkern 2mu 0.03_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, $(1.83\mkern 2mu \pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu \pm\mkern 2mu 0.11_{\text{syst.}})\times 10^{-5}$, and $(2.43\mkern 2mu\pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, for $J=0,\ 1$, and 2, respectively.
△ Less
Submitted 3 January, 2025;
originally announced January 2025.
-
ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer
Authors:
Xuyin Qi,
Zeyu Zhang,
Aaron Berliano Handoko,
Huazhan Zheng,
Mingxi Chen,
Ta Duc Huy,
Vu Minh Hieu Phan,
Lei Zhang,
Linqi Cheng,
Shiyu Jiang,
Zhiwei Zhang,
Zhibin Liao,
Yang Zhao,
Minh-Son To
Abstract:
Prostate cancer, a growing global health concern, necessitates precise diagnostic tools, with Magnetic Resonance Imaging (MRI) offering high-resolution soft tissue imaging that significantly enhances diagnostic accuracy. Recent advancements in explainable AI and representation learning have significantly improved prostate cancer diagnosis by enabling automated and precise lesion classification. Ho…
▽ More
Prostate cancer, a growing global health concern, necessitates precise diagnostic tools, with Magnetic Resonance Imaging (MRI) offering high-resolution soft tissue imaging that significantly enhances diagnostic accuracy. Recent advancements in explainable AI and representation learning have significantly improved prostate cancer diagnosis by enabling automated and precise lesion classification. However, existing explainable AI methods, particularly those based on frameworks like generative adversarial networks (GANs), are predominantly developed for natural image generation, and their application to medical imaging often leads to suboptimal performance due to the unique characteristics and complexity of medical image. To address these challenges, our paper introduces three key contributions. First, we propose ProjectedEx, a generative framework that provides interpretable, multi-attribute explanations, effectively linking medical image features to classifier decisions. Second, we enhance the encoder module by incorporating feature pyramids, which enables multiscale feedback to refine the latent space and improves the quality of generated explanations. Additionally, we conduct comprehensive experiments on both the generator and classifier, demonstrating the clinical relevance and effectiveness of ProjectedEx in enhancing interpretability and supporting the adoption of AI in medical settings. Code will be released at https://github.com/Richardqiyi/ProjectedEx
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
Atomic-scale observation of $d$-$π$-$d$ spin coupling in coordination structures
Authors:
Xue Zhang,
Xin Li,
Jie Li,
Haoyang Pan,
Minghui Yu,
Yajie Zhang,
Gui-Lin Zhu,
Zhen Xu,
Ziyong Shen,
Shimin Hou,
Yaping Zang,
Bingwu Wang,
Kai Wu,
Shang-Da Jiang,
Ivano E. Castelli,
Lianmao Peng,
Per Hedegård,
Song Gao,
Jing-Tao Lü,
Yongfeng Wang
Abstract:
Spin coupling between magnetic metal atoms and organic radicals plays a pivotal role in high-performance magnetic materials. The complex interaction involving multi-spin centers in bulk materials makes it challenging to study spin coupling at the atomic scale. Here, we investigate the $d$-$π$-$d$ spin interaction in well-defined metal-organic coordinated structures composed of two iron (Fe) atoms…
▽ More
Spin coupling between magnetic metal atoms and organic radicals plays a pivotal role in high-performance magnetic materials. The complex interaction involving multi-spin centers in bulk materials makes it challenging to study spin coupling at the atomic scale. Here, we investigate the $d$-$π$-$d$ spin interaction in well-defined metal-organic coordinated structures composed of two iron (Fe) atoms and four all-trans retinoic acid (ReA) molecules, using low-temperature scanning tunneling microscopy and atomic force microscopy. The ReA molecule is turned into a spin-$1/2$ radical state by dehydrogenation, facilitating strong magnetic coupling with the coordinated Fe atoms. Comprehensive theoretical analysis, based on density functional theory and valence bond theory, further elucidates the intrinsic mechanism of ferrimagnetic spin coupling in the coordination structure. Specifically, simultaneous antiferromagnetic coupling of Fe dimer to ReA radicals parallelizes the dimer spin orientation. This work contributes to the fundamental understanding of spin interaction in metal-organic coordination structures and provides microscopic insights for designing advanced magnetic materials.
△ Less
Submitted 2 January, 2025;
originally announced January 2025.
-
Measurement of Born cross section of $e^+e^-\toΣ^0\barΣ^0$ at $\sqrt{s} = 3.50-4.95$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (649 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$,…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $ψ(4230)$, $ψ(4360)$, $ψ(4415)$, or $ψ(4660)$, decaying into the $Σ^0\barΣ^0$ final state is observed by fitting the $e^+e^- \to Σ^0\barΣ^0$ dressed cross section. The upper limits for the product of the branching fraction and the electronic partial width at the 90% confidence level are provided for each assumed charmonium(-like) state. In addition, the ratios of the Born cross section and the effective form factor between the $e^+e^-\toΣ^0\barΣ^0$ and the $e^+e^-\toΣ^+\barΣ^-$ reactions are provided, which can be used to validate the prediction of the vector meson dominance model.
△ Less
Submitted 14 March, 2025; v1 submitted 28 December, 2024;
originally announced December 2024.
-
Distinguishability-guided Test Program Generation for WebAssembly Runtime Performance Testing
Authors:
Shuyao Jiang,
Ruiying Zeng,
Yangfan Zhou,
Michael R. Lyu
Abstract:
WebAssembly (Wasm) is a binary instruction format designed as a portable compilation target, which has been widely used on both the web and server sides in recent years. As high performance is a critical design goal of Wasm, it is essential to conduct performance testing for Wasm runtimes. However, existing research on Wasm runtime performance testing still suffers from insufficient high-quality t…
▽ More
WebAssembly (Wasm) is a binary instruction format designed as a portable compilation target, which has been widely used on both the web and server sides in recent years. As high performance is a critical design goal of Wasm, it is essential to conduct performance testing for Wasm runtimes. However, existing research on Wasm runtime performance testing still suffers from insufficient high-quality test programs. To solve this problem, we propose a novel test program generation approach WarpGen. It first extracts code snippets from historical issue-triggering test programs as initial operators, then inserts an operator into a seed program to synthesize a new test program. To verify the quality of generated programs, we propose an indicator called distinguishability, which refers to the ability of a test program to distinguish abnormal performance of specific Wasm runtimes. We apply WarpGen for performance testing on four Wasm runtimes and verify its effectiveness compared with baseline approaches. In particular, WarpGen has identified seven new performance issues in three Wasm runtimes.
△ Less
Submitted 28 December, 2024;
originally announced December 2024.
-
LINKs: Large Language Model Integrated Management for 6G Empowered Digital Twin NetworKs
Authors:
Shufan Jiang,
Bangyan Lin,
Yue Wu,
Yuan Gao
Abstract:
In the rapidly evolving landscape of digital twins (DT) and 6G networks, the integration of large language models (LLMs) presents a novel approach to network management. This paper explores the application of LLMs in managing 6G-empowered DT networks, with a focus on optimizing data retrieval and communication efficiency in smart city scenarios. The proposed framework leverages LLMs for intelligen…
▽ More
In the rapidly evolving landscape of digital twins (DT) and 6G networks, the integration of large language models (LLMs) presents a novel approach to network management. This paper explores the application of LLMs in managing 6G-empowered DT networks, with a focus on optimizing data retrieval and communication efficiency in smart city scenarios. The proposed framework leverages LLMs for intelligent DT problem analysis and radio resource management (RRM) in fully autonomous way without any manual intervention. Our proposed framework -- LINKs, builds up a lazy loading strategy which can minimize transmission delay by selectively retrieving the relevant data. Based on the data retrieval plan, LLMs transform the retrieval task into an numerical optimization problem and utilizing solvers to build an optimal RRM, ensuring efficient communication across the network. Simulation results demonstrate the performance improvements in data planning and network management, highlighting the potential of LLMs to enhance the integration of DT and 6G technologies.
△ Less
Submitted 9 December, 2024;
originally announced December 2024.
-
Search for the double Dalitz decays $η/η' \to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions o…
▽ More
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions of $η' \to e^{+}e^{-}μ^{+}μ^{-}$ and $η' \to μ^{+}μ^{-}μ^{+}μ^{-}$ to be $ 1.75 \times {10^{-6}}$ and $5.28 \times {10^{-7}}$, respectively. In addition, we set an upper limit on the branching fraction of $η\to e^{+}e^{-}μ^{+}μ^{-}$ to be $6.88 \times {10^{-6}}$, which improves the previous result by about two orders of magnitude.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
Study on Toponium: Spectrum and Associated Processes
Authors:
Sheng-Juan Jiang,
Bai-Qing Li,
Guang-Zhi Xu,
Kui-Yong Liu
Abstract:
In this paper, we calculate the toponium spectrum in the potential model with the screened effects. Coulombic part is dominant for toponium, and the coefficient of the Coulomb potential is chosen from lattice QCD calculations at an infinite quark mass in accord with the ultrasoft scale choice. We also derive the relative E1 transitions, decays, and production.
In this paper, we calculate the toponium spectrum in the potential model with the screened effects. Coulombic part is dominant for toponium, and the coefficient of the Coulomb potential is chosen from lattice QCD calculations at an infinite quark mass in accord with the ultrasoft scale choice. We also derive the relative E1 transitions, decays, and production.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
Detection of an Orphan X-ray Flare from a Blazar Candidate EP240709a with Einstein Probe
Authors:
Mingjun Liu,
Yijia Zhang,
Yun Wang,
Rui Xue,
David Buckley,
D. Andrew Howell,
Chichuan Jin,
Wenxiong Li,
Itumeleng Monageng,
Haiwu Pan,
Ning-Chen Sun,
Samaporn Tinyanont,
Lingzhi Wang,
Weimin Yuan,
Jie An,
Moira Andrews,
Rungrit Anutarawiramkul,
Pathompong Butpan,
Huaqing Cheng,
Cui-Yuan Dai,
Lixin Dai,
Joseph Farah,
Hua Feng,
Shaoyu Fu,
Zhen Guo
, et al. (27 additional authors not shown)
Abstract:
Blazars are often observed to flare across multiple wavelengths. Orphan flares from blazars have been only detected a few times, providing an opportunity to understand the structure of the jet in the accreting system. We report a remarkable orphan X-ray flare from a blazar candidate EP240709a, detected by Einstein Probe (EP) in July 2024. The multi-band spectral properties and variability support…
▽ More
Blazars are often observed to flare across multiple wavelengths. Orphan flares from blazars have been only detected a few times, providing an opportunity to understand the structure of the jet in the accreting system. We report a remarkable orphan X-ray flare from a blazar candidate EP240709a, detected by Einstein Probe (EP) in July 2024. The multi-band spectral properties and variability support EP240709a as a high-energy peaked BL Lacertae-type object. The flux in 0.5-10 keV increases by at least 28 times to the value of low state in 2020, with non-detection of remarkable flaring in other bands during the same period. EP240709a exhibits the harder-when-brighter tendency in the X-ray band during the orphan flare, while its infrared-optical spectra are featureless. We employ one-zone and two-zone leptonic synchrotron self-Compton models to perform the spectral energy distribution fitting. Detecting this rare orphan flare shows the potential of EP in discovering peculiar activities from AGN in high-cadence X-ray sky surveys.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
Query Quantized Neural SLAM
Authors:
Sijia Jiang,
Jing Hua,
Zhizhong Han
Abstract:
Neural implicit representations have shown remarkable abilities in jointly modeling geometry, color, and camera poses in simultaneous localization and mapping (SLAM). Current methods use coordinates, positional encodings, or other geometry features as input to query neural implicit functions for signed distances and color which produce rendering errors to drive the optimization in overfitting imag…
▽ More
Neural implicit representations have shown remarkable abilities in jointly modeling geometry, color, and camera poses in simultaneous localization and mapping (SLAM). Current methods use coordinates, positional encodings, or other geometry features as input to query neural implicit functions for signed distances and color which produce rendering errors to drive the optimization in overfitting image observations. However, due to the run time efficiency requirement in SLAM systems, we are merely allowed to conduct optimization on each frame in few iterations, which is far from enough for neural networks to overfit these queries. The underfitting usually results in severe drifts in camera tracking and artifacts in reconstruction. To resolve this issue, we propose query quantized neural SLAM which uses quantized queries to reduce variations of input for much easier and faster overfitting a frame. To this end, we quantize a query into a discrete representation with a set of codes, and only allow neural networks to observe a finite number of variations. This allows neural networks to become increasingly familiar with these codes after overfitting more and more previous frames. Moreover, we also introduce novel initialization, losses, and argumentation to stabilize the optimization with significant uncertainty in the early optimization stage, constrain the optimization space, and estimate camera poses more accurately. We justify the effectiveness of each design and report visual and numerical comparisons on widely used benchmarks to show our superiority over the latest methods in both reconstruction and camera tracking.
△ Less
Submitted 20 December, 2024;
originally announced December 2024.
-
Sensing Surface Patches in Volume Rendering for Inferring Signed Distance Functions
Authors:
Sijia Jiang,
Tong Wu,
Jing Hua,
Zhizhong Han
Abstract:
It is vital to recover 3D geometry from multi-view RGB images in many 3D computer vision tasks. The latest methods infer the geometry represented as a signed distance field by minimizing the rendering error on the field through volume rendering. However, it is still challenging to explicitly impose constraints on surfaces for inferring more geometry details due to the limited ability of sensing su…
▽ More
It is vital to recover 3D geometry from multi-view RGB images in many 3D computer vision tasks. The latest methods infer the geometry represented as a signed distance field by minimizing the rendering error on the field through volume rendering. However, it is still challenging to explicitly impose constraints on surfaces for inferring more geometry details due to the limited ability of sensing surfaces in volume rendering. To resolve this problem, we introduce a method to infer signed distance functions (SDFs) with a better sense of surfaces through volume rendering. Using the gradients and signed distances, we establish a small surface patch centered at the estimated intersection along a ray by pulling points randomly sampled nearby. Hence, we are able to explicitly impose surface constraints on the sensed surface patch, such as multi-view photo consistency and supervision from depth or normal priors, through volume rendering. We evaluate our method by numerical and visual comparisons on scene benchmarks. Our superiority over the latest methods justifies our effectiveness.
△ Less
Submitted 20 December, 2024;
originally announced December 2024.
-
Measurement of the Branching Fraction for the Decay $χ_{cJ}\to p\bar{p}ηπ^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$,…
▽ More
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$, ${\cal B}(χ_{c1}\to p\bar{p}ηπ^{0})=({1.95 \pm 0.05 \pm 0.12}) \times 10^{-4}$, and ${\cal B}(χ_{c2}\to p\bar{p}ηπ^{0})=({1.31 \pm 0.05 \pm 0.08}) \times 10^{-4}$, where the first uncertainties are statistical and the second systematic.
△ Less
Submitted 18 December, 2024; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Observation of the charmonium decay $η_c\toγγ$ in $J/ψ\toγη_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (658 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation…
▽ More
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation $(5.34\pm0.16)\times10^{-6}$ from HPQCD in 2023. By using the world-average values of $\mathcal{B}(J/ψ\toγη_c)$ and the total decay width of $η_c$, the partial decay width $Γ(η_c\toγγ)$ is determined to be $(11.30\pm0.56_{\rm{stat.}}\pm0.66_{\rm{syst.}}\pm1.14_{\rm{ref.}})~\rm{keV}$, which deviates from the corresponding world-average value by $3.4σ$.
△ Less
Submitted 2 April, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Distributed satellite information networks: Architecture, enabling technologies, and trends
Authors:
Qinyu Zhang,
Liang Xu,
Jianhao Huang,
Tao Yang,
Jian Jiao,
Ye Wang,
Yao Shi,
Chiya Zhang,
Xingjian Zhang,
Ke Zhang,
Yupeng Gong,
Na Deng,
Nan Zhao,
Zhen Gao,
Shujun Han,
Xiaodong Xu,
Li You,
Dongming Wang,
Shan Jiang,
Dixian Zhao,
Nan Zhang,
Liujun Hu,
Xiongwen He,
Yonghui Li,
Xiqi Gao
, et al. (1 additional authors not shown)
Abstract:
Driven by the vision of ubiquitous connectivity and wireless intelligence, the evolution of ultra-dense constellation-based satellite-integrated Internet is underway, now taking preliminary shape. Nevertheless, the entrenched institutional silos and limited, nonrenewable heterogeneous network resources leave current satellite systems struggling to accommodate the escalating demands of next-generat…
▽ More
Driven by the vision of ubiquitous connectivity and wireless intelligence, the evolution of ultra-dense constellation-based satellite-integrated Internet is underway, now taking preliminary shape. Nevertheless, the entrenched institutional silos and limited, nonrenewable heterogeneous network resources leave current satellite systems struggling to accommodate the escalating demands of next-generation intelligent applications. In this context, the distributed satellite information networks (DSIN), exemplified by the cohesive clustered satellites system, have emerged as an innovative architecture, bridging information gaps across diverse satellite systems, such as communication, navigation, and remote sensing, and establishing a unified, open information network paradigm to support resilient space information services. This survey first provides a profound discussion about innovative network architectures of DSIN, encompassing distributed regenerative satellite network architecture, distributed satellite computing network architecture, and reconfigurable satellite formation flying, to enable flexible and scalable communication, computing and control. The DSIN faces challenges from network heterogeneity, unpredictable channel dynamics, sparse resources, and decentralized collaboration frameworks. To address these issues, a series of enabling technologies is identified, including channel modeling and estimation, cloud-native distributed MIMO cooperation, grant-free massive access, network routing, and the proper combination of all these diversity techniques. Furthermore, to heighten the overall resource efficiency, the cross-layer optimization techniques are further developed to meet upper-layer deterministic, adaptive and secure information services requirements. In addition, emerging research directions and new opportunities are highlighted on the way to achieving the DSIN vision.
△ Less
Submitted 17 December, 2024;
originally announced December 2024.
-
From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities
Authors:
Shixin Jiang,
Jiafeng Liang,
Jiyuan Wang,
Xuan Dong,
Heng Chang,
Weijiang Yu,
Jinhua Du,
Ming Liu,
Bing Qin
Abstract:
To tackle complex tasks in real-world scenarios, more researchers are focusing on Omni-MLLMs, which aim to achieve omni-modal understanding and generation. Beyond the constraints of any specific non-linguistic modality, Omni-MLLMs map various non-linguistic modalities into the embedding space of LLMs and enable the interaction and understanding of arbitrary combinations of modalities within a sing…
▽ More
To tackle complex tasks in real-world scenarios, more researchers are focusing on Omni-MLLMs, which aim to achieve omni-modal understanding and generation. Beyond the constraints of any specific non-linguistic modality, Omni-MLLMs map various non-linguistic modalities into the embedding space of LLMs and enable the interaction and understanding of arbitrary combinations of modalities within a single model. In this paper, we systematically investigate relevant research and provide a comprehensive survey of Omni-MLLMs. Specifically, we first explain the four core components of Omni-MLLMs for unified multi-modal modeling with a meticulous taxonomy that offers novel perspectives. Then, we introduce the effective integration achieved through two-stage training and discuss the corresponding datasets as well as evaluation. Furthermore, we summarize the main challenges of current Omni-MLLMs and outline future directions. We hope this paper serves as an introduction for beginners and promotes the advancement of related research. Resources have been made publicly available at https://github.com/threegold116/Awesome-Omni-MLLMs.
△ Less
Submitted 3 March, 2025; v1 submitted 16 December, 2024;
originally announced December 2024.
-
Amplitude analysis and branching fraction measurement of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (651 additional authors not shown)
Abstract:
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of…
▽ More
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of $(4.15\pm0.07_{\rm stat.}\pm0.17_{\rm syst.})\%$. With the detection efficiency derived from the amplitude analysis, the absolute branching fraction of $D^+ \to K^-π^+π^+π^0$ is measured to be $(6.06\pm0.04_{\rm stat.}\pm0.07_{\rm syst.})\%$.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.
-
Study of the semileptonic decay $D^0\rightarrow \bar{K}^0π^-e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (650 additional authors not shown)
Abstract:
We report an improved study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-e^+ν_{e}$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of this decay is measured to be…
▽ More
We report an improved study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-e^+ν_{e}$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of this decay is measured to be $\mathcal{B}(D^0\rightarrow \bar{K}^0π^-e^+ν_{e}) = (1.444 \pm 0.022_{\rm stat} \pm 0.024_{\rm syst})\%$, which is the most precise to date, where the first uncertainty is statistical and the second is systematic. Based on investigation of the decay dynamics, we find that the decay is dominated by the $K^{*}(892)^-$ component and present an improved measurement of its branching fraction to be $\mathcal{B}(D^0\rightarrow K^{*}(892)^-e^+ν_e) = (2.039 \pm 0.032_{\rm stat} \pm 0.034_{\rm syst})\%$. We also determine the ratios of the hadronic form factors for the $K^{*}(892)^-e^+ν_e$ decay to be $r_{V} = V(0)/A_1(0) = 1.48 \pm 0.05_{\rm stat} \pm 0.02_{\rm syst}$ and $r_{2} = A_2(0)/A_1(0) = 0.70 \pm 0.04_{\rm stat} \pm 0.02_{\rm syst}$, where $V(0)$ is the vector form factor and $A_{1,2}(0)$ are the axial form factors. In addition, the $\bar{K}^0π^-$ $\mathcal{S}$-wave component is found to account for $(5.87 \pm 0.32_{\rm stat} \pm 0.16_{\rm syst})\%$ of the total decay rate, corresponding to a branching fraction of $\mathcal{B}[D^0\rightarrow (\bar{K}^0π^-)_{S-{\rm wave}}e^+ν_e] = (0.085 \pm 0.005_{\rm stat} \pm 0.003_{\rm syst})\%$.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.
-
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
Authors:
Jingyu Zhang,
Yilei Wang,
Lang Qian,
Peng Sun,
Zengwen Li,
Sudong Jiang,
Maolin Liu,
Liang Song
Abstract:
As a potential application of Vehicle-to-Everything (V2X) communication, multi-agent collaborative perception has achieved significant success in 3D object detection. While these methods have demonstrated impressive results on standard benchmarks, the robustness of such approaches in the face of complex real-world environments requires additional verification. To bridge this gap, we introduce the…
▽ More
As a potential application of Vehicle-to-Everything (V2X) communication, multi-agent collaborative perception has achieved significant success in 3D object detection. While these methods have demonstrated impressive results on standard benchmarks, the robustness of such approaches in the face of complex real-world environments requires additional verification. To bridge this gap, we introduce the first comprehensive benchmark designed to evaluate the robustness of collaborative perception methods in the presence of natural corruptions typical of real-world environments. Furthermore, we propose DSRC, a robustness-enhanced collaborative perception method aiming to learn Density-insensitive and Semantic-aware collaborative Representation against Corruptions. DSRC consists of two key designs: i) a semantic-guided sparse-to-dense distillation framework, which constructs multi-view dense objects painted by ground truth bounding boxes to effectively learn density-insensitive and semantic-aware collaborative representation; ii) a feature-to-point cloud reconstruction approach to better fuse critical collaborative representation across agents. To thoroughly evaluate DSRC, we conduct extensive experiments on real-world and simulated datasets. The results demonstrate that our method outperforms SOTA collaborative perception methods in both clean and corrupted conditions. Code is available at https://github.com/Terry9a/DSRC.
△ Less
Submitted 27 February, 2025; v1 submitted 14 December, 2024;
originally announced December 2024.
-
Adversarial Vulnerabilities in Large Language Models for Time Series Forecasting
Authors:
Fuqiang Liu,
Sicong Jiang,
Luis Miranda-Moreno,
Seongjin Choi,
Lijun Sun
Abstract:
Large Language Models (LLMs) have recently demonstrated significant potential in time series forecasting, offering impressive capabilities in handling complex temporal data. However, their robustness and reliability in real-world applications remain under-explored, particularly concerning their susceptibility to adversarial attacks. In this paper, we introduce a targeted adversarial attack framewo…
▽ More
Large Language Models (LLMs) have recently demonstrated significant potential in time series forecasting, offering impressive capabilities in handling complex temporal data. However, their robustness and reliability in real-world applications remain under-explored, particularly concerning their susceptibility to adversarial attacks. In this paper, we introduce a targeted adversarial attack framework for LLM-based time series forecasting. By employing both gradient-free and black-box optimization methods, we generate minimal yet highly effective perturbations that significantly degrade the forecasting accuracy across multiple datasets and LLM architectures. Our experiments, which include models like LLMTime with GPT-3.5, GPT-4, LLaMa, and Mistral, TimeGPT, and TimeLLM show that adversarial attacks lead to much more severe performance degradation than random noise, and demonstrate the broad effectiveness of our attacks across different LLMs. The results underscore the critical vulnerabilities of LLMs in time series forecasting, highlighting the need for robust defense mechanisms to ensure their reliable deployment in practical applications. The code repository can be found at https://github.com/JohnsonJiang1996/AdvAttack_LLM4TS.
△ Less
Submitted 12 March, 2025; v1 submitted 10 December, 2024;
originally announced December 2024.
-
XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational Encoder-Decoder
Authors:
Shenghao Zhu,
Yifei Chen,
Shuo Jiang,
Weihong Chen,
Chang Liu,
Yuanhan Wang,
Xu Chen,
Yifan Ke,
Feiwei Qin,
Changmiao Wang,
Zhu Zhu
Abstract:
Neurogliomas are among the most aggressive forms of cancer, presenting considerable challenges in both treatment and monitoring due to their unpredictable biological behavior. Magnetic resonance imaging (MRI) is currently the preferred method for diagnosing and monitoring gliomas. However, the lack of specific imaging techniques often compromises the accuracy of tumor segmentation during the imagi…
▽ More
Neurogliomas are among the most aggressive forms of cancer, presenting considerable challenges in both treatment and monitoring due to their unpredictable biological behavior. Magnetic resonance imaging (MRI) is currently the preferred method for diagnosing and monitoring gliomas. However, the lack of specific imaging techniques often compromises the accuracy of tumor segmentation during the imaging process. To address this issue, we introduce the XLSTM-HVED model. This model integrates a hetero-modal encoder-decoder framework with the Vision XLSTM module to reconstruct missing MRI modalities. By deeply fusing spatial and temporal features, it enhances tumor segmentation performance. The key innovation of our approach is the Self-Attention Variational Encoder (SAVE) module, which improves the integration of modal features. Additionally, it optimizes the interaction of features between segmentation and reconstruction tasks through the Squeeze-Fusion-Excitation Cross Awareness (SFECA) module. Our experiments using the BraTS 2024 dataset demonstrate that our model significantly outperforms existing advanced methods in handling cases where modalities are missing. Our source code is available at https://github.com/Quanato607/XLSTM-HVED.
△ Less
Submitted 5 March, 2025; v1 submitted 9 December, 2024;
originally announced December 2024.