-
Blockchain Address Poisoning
Authors:
Taro Tsuchiya,
Jin-Dong Dong,
Kyle Soska,
Nicolas Christin
Abstract:
In many blockchains, e.g., Ethereum, Binance Smart Chain (BSC), the primary representation used for wallet addresses is a hardly memorable 40-digit hexadecimal string. As a result, users often select addresses from their recent transaction history, which enables blockchain address poisoning. The adversary first generates lookalike addresses similar to one with which the victim has previously inter…
▽ More
In many blockchains, e.g., Ethereum, Binance Smart Chain (BSC), the primary representation used for wallet addresses is a hardly memorable 40-digit hexadecimal string. As a result, users often select addresses from their recent transaction history, which enables blockchain address poisoning. The adversary first generates lookalike addresses similar to one with which the victim has previously interacted, and then engages with the victim to ``poison'' their transaction history. The goal is to have the victim mistakenly send tokens to the lookalike address, as opposed to the intended recipient. Compared to contemporary studies, this paper provides four notable contributions. First, we develop a detection system and perform measurements over two years on both Ethereum and BSC. We identify 13~times more attack attempts than reported previously -- totaling 270M on-chain attacks targeting 17M victims. 6,633 incidents have caused at least 83.8M USD in losses, which makes blockchain address poisoning one of the largest cryptocurrency phishing schemes observed in the wild. Second, we analyze a few large attack entities using improved clustering techniques, and model attacker profitability and competition. Third, we reveal attack strategies -- targeted populations, success conditions (address similarity, timing), and cross-chain attacks. Fourth, we mathematically define and simulate the lookalike address generation process across various software- and hardware-based implementations, and identify a large-scale attacker group that appears to use GPUs. We also discuss defensive countermeasures.
△ Less
Submitted 2 July, 2025; v1 submitted 27 January, 2025;
originally announced January 2025.
-
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Authors:
Jinwei Dong,
Xinsheng Wang,
Qirong Mao
Abstract:
Generative models have attracted considerable attention for speech separation tasks, and among these, diffusion-based methods are being explored. Despite the notable success of diffusion techniques in generation tasks, their adaptation to speech separation has encountered challenges, notably slow convergence and suboptimal separation outcomes. To address these issues and enhance the efficacy of di…
▽ More
Generative models have attracted considerable attention for speech separation tasks, and among these, diffusion-based methods are being explored. Despite the notable success of diffusion techniques in generation tasks, their adaptation to speech separation has encountered challenges, notably slow convergence and suboptimal separation outcomes. To address these issues and enhance the efficacy of diffusion-based speech separation, we introduce EDSep, a novel single-channel method grounded in score matching via stochastic differential equation (SDE). This method enhances generative modeling for speech source separation by optimizing training and sampling efficiency. Specifically, a novel denoiser function is proposed to approximate data distributions, which obtains ideal denoiser outputs. Additionally, a stochastic sampler is carefully designed to resolve the reverse SDE during the sampling process, gradually separating speech from mixtures. Extensive experiments on databases such as WSJ0-2mix, LRS2-2mix, and VoxCeleb2-2mix demonstrate our proposed method's superior performance over existing diffusion and discriminative models, validating its efficacy.
△ Less
Submitted 27 January, 2025;
originally announced January 2025.
-
GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting
Authors:
Jiajun Dong,
Chengkun Wang,
Wenzhao Zheng,
Lei Chen,
Jiwen Lu,
Yansong Tang
Abstract:
Effective image tokenization is crucial for both multi-modal understanding and generation tasks due to the necessity of the alignment with discrete text data. To this end, existing approaches utilize vector quantization (VQ) to project pixels onto a discrete codebook and reconstruct images from the discrete representation. However, compared with the continuous latent space, the limited discrete co…
▽ More
Effective image tokenization is crucial for both multi-modal understanding and generation tasks due to the necessity of the alignment with discrete text data. To this end, existing approaches utilize vector quantization (VQ) to project pixels onto a discrete codebook and reconstruct images from the discrete representation. However, compared with the continuous latent space, the limited discrete codebook space significantly restrict the representational ability of these image tokenizers. In this paper, we propose GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting as a solution. We first represent the encoded samples as multiple flexible featured 2D Gaussians characterized by positions, rotation angles, scaling factors, and feature coefficients. We adopt the standard quantization for the Gaussian features and then concatenate the quantization results with the other intrinsic Gaussian parameters before the corresponding splatting operation and the subsequent decoding module. In general, GaussianToken integrates the local influence of 2D Gaussian distribution into the discrete space and thus enhances the representation capability of the image tokenizer. Competitive reconstruction performances on CIFAR, Mini-ImageNet, and ImageNet-1K demonstrate the effectiveness of our framework. Our code is available at: https://github.com/ChrisDong-THU/GaussianToken.
△ Less
Submitted 26 January, 2025;
originally announced January 2025.
-
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Authors:
Jiahang Tu,
Qian Feng,
Chufan Chen,
Jiahua Dong,
Hanbin Zhao,
Chao Zhang,
Hui Qian
Abstract:
Large-scale text-to-image (T2I) diffusion models have achieved remarkable generative performance about various concepts. With the limitation of privacy and safety in practice, the generative capability concerning NSFW (Not Safe For Work) concepts is undesirable, e.g., producing sexually explicit photos, and licensed images. The concept erasure task for T2I diffusion models has attracted considerab…
▽ More
Large-scale text-to-image (T2I) diffusion models have achieved remarkable generative performance about various concepts. With the limitation of privacy and safety in practice, the generative capability concerning NSFW (Not Safe For Work) concepts is undesirable, e.g., producing sexually explicit photos, and licensed images. The concept erasure task for T2I diffusion models has attracted considerable attention and requires an effective and efficient method. To achieve this goal, we propose a CE-SDWV framework, which removes the target concepts (e.g., NSFW concepts) of T2I diffusion models in the text semantic space by only adjusting the text condition tokens and does not need to re-train the original T2I diffusion model's weights. Specifically, our framework first builds a target concept-related word vocabulary to enhance the representation of the target concepts within the text semantic space, and then utilizes an adaptive semantic component suppression strategy to ablate the target concept-related semantic information in the text condition tokens. To further adapt the above text condition tokens to the original image semantic space, we propose an end-to-end gradient-orthogonal token optimization strategy. Extensive experiments on I2P and UnlearnCanvas benchmarks demonstrate the effectiveness and efficiency of our method.
△ Less
Submitted 26 January, 2025;
originally announced January 2025.
-
Observation of $h_{c}$ radiative decays to multiple light hadrons and the tensor state $f_2(1270)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (666 additional authors not shown)
Abstract:
Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furtherm…
▽ More
Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furthermore, intermediate states below 2.8 GeV/$c^{2}$ are investigated, leading to the first observation of the decay process of $h_c\rightarrowγf_{2}(1270)\rightarrowγπ^{+}π^{-}$ with a significance of $5.5\,σ$. This observation represents the first instance of $h_c$ radiative decay to a tensor state.
△ Less
Submitted 26 January, 2025;
originally announced January 2025.
-
Charge density wave modulated third-order nonlinear Hall effect in 1$T$-VSe$_2$ nanosheets
Authors:
Zhao-Hui Chen,
Xin Liao,
Jing-Wei Dong,
Xing-Yu Liu,
Tong-Yang Zhao,
Dong Li,
An-Qi Wang,
Zhi-Min Liao
Abstract:
We report the observation of a pronounced third-order nonlinear Hall effect (NLHE) in 1$T$-phase VSe$_2$ nanosheets, synthesized using chemical vapor deposition (CVD). The nanosheets exhibit a charge density wave (CDW) transition at $\sim$77 K. Detailed angle-resolved and temperature-dependent measurements reveal a strong cubic relationship between the third-harmonic Hall voltage $V_{3ω}^\perp$ an…
▽ More
We report the observation of a pronounced third-order nonlinear Hall effect (NLHE) in 1$T$-phase VSe$_2$ nanosheets, synthesized using chemical vapor deposition (CVD). The nanosheets exhibit a charge density wave (CDW) transition at $\sim$77 K. Detailed angle-resolved and temperature-dependent measurements reveal a strong cubic relationship between the third-harmonic Hall voltage $V_{3ω}^\perp$ and the bias current $I_ω$, persisting up to room temperature. Notably, the third-order NLHE demonstrates a twofold angular dependence and significant enhancement below the CDW transition temperature, indicative of threefold symmetry breaking in the CDW phase. Scaling analysis suggests that the intrinsic contribution from the Berry connection polarizability tensor is substantially increased in the CDW phase, while extrinsic effects dominate at higher temperatures. Our findings highlight the critical role of CDW-induced symmetry breaking in modulating quantum geometric properties and nonlinear transport phenomena in VSe$_2$, paving the way for future explorations in low-dimensional quantum materials.
△ Less
Submitted 25 January, 2025;
originally announced January 2025.
-
Cross section measurement of $e^{+}e^{-} \to f_{1}(1285)π^{+}π^{-}$ at center-of-mass energies between $3.808$ and $4.951\rm GeV$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Using data samples collected by the \mbox{BESIII} detector located at the Beijing Electron Positron Collider, the cross sections of the process $e^+e^-\to f_{1}(1285)π^+π^-$ are measured at forty-five center-of-mass energies from $3.808$ to $4.951 {\rm GeV}$. An investigation on the cross section line shape is performed, and no significant structure is observed.
Using data samples collected by the \mbox{BESIII} detector located at the Beijing Electron Positron Collider, the cross sections of the process $e^+e^-\to f_{1}(1285)π^+π^-$ are measured at forty-five center-of-mass energies from $3.808$ to $4.951 {\rm GeV}$. An investigation on the cross section line shape is performed, and no significant structure is observed.
△ Less
Submitted 23 January, 2025;
originally announced January 2025.
-
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models
Authors:
Qinggang Zhang,
Shengyuan Chen,
Yuanchen Bei,
Zheng Yuan,
Huachi Zhou,
Zijin Hong,
Junnan Dong,
Hao Chen,
Yi Chang,
Xiao Huang
Abstract:
Large language models (LLMs) have demonstrated remarkable capabilities in a wide range of tasks, yet their application to specialized domains remains challenging due to the need for deep expertise. Retrieval-augmented generation (RAG) has emerged as a promising solution to customize LLMs for professional fields by seamlessly integrating external knowledge bases, enabling real-time access to domain…
▽ More
Large language models (LLMs) have demonstrated remarkable capabilities in a wide range of tasks, yet their application to specialized domains remains challenging due to the need for deep expertise. Retrieval-augmented generation (RAG) has emerged as a promising solution to customize LLMs for professional fields by seamlessly integrating external knowledge bases, enabling real-time access to domain-specific expertise during inference. Despite its potential, traditional RAG systems, based on flat text retrieval, face three critical challenges: (i) complex query understanding in professional contexts, (ii) difficulties in knowledge integration across distributed sources, and (iii) system efficiency bottlenecks at scale. This survey presents a systematic analysis of Graph-based Retrieval-Augmented Generation (GraphRAG), a new paradigm that revolutionizes domain-specific LLM applications. GraphRAG addresses traditional RAG limitations through three key innovations: (i) graph-structured knowledge representation that explicitly captures entity relationships and domain hierarchies, (ii) efficient graph-based retrieval techniques that enable context-preserving knowledge retrieval with multihop reasoning ability, and (iii) structure-aware knowledge integration algorithms that leverage retrieved knowledge for accurate and logical coherent generation of LLMs. In this survey, we systematically analyze the technical foundations of GraphRAG and examine current implementations across various professional domains, identifying key technical challenges and promising research directions. All the related resources of GraphRAG, including research papers, open-source data, and projects, are collected for the community in \textcolor{blue}{\url{https://github.com/DEEP-PolyU/Awesome-GraphRAG}}.
△ Less
Submitted 21 January, 2025;
originally announced January 2025.
-
Benchmarking Large Language Models via Random Variables
Authors:
Zijin Hong,
Hao Wu,
Su Dong,
Junnan Dong,
Yilin Xiao,
Yujing Zhang,
Zhu Wang,
Feiran Huang,
Linyi Li,
Hongxia Yang,
Xiao Huang
Abstract:
Recent studies have raised concerns about the reliability of current mathematical benchmarks, highlighting issues such as simplistic design and potential data contamination. Therefore, creating a reliable benchmark that effectively evaluates the genuine capabilities of large language models (LLMs) in mathematical reasoning remains a significant challenge. To address this, we propose RV-Bench, a fr…
▽ More
Recent studies have raised concerns about the reliability of current mathematical benchmarks, highlighting issues such as simplistic design and potential data contamination. Therefore, creating a reliable benchmark that effectively evaluates the genuine capabilities of large language models (LLMs) in mathematical reasoning remains a significant challenge. To address this, we propose RV-Bench, a framework for Benchmarking LLMs via Random Variables in mathematical reasoning. Specifically, the background content of a random variable question (RV question) mirrors the original problem in existing benchmarks, but the variable combinations are randomized, making it "unseen" by the LLMs. Models must completely understand the question pattern of the original problem to correctly answer RV questions with various variable values. As a result, the LLM's genuine capability in mathematical reasoning is reflected by its accuracy and robustness on RV-Bench. We conducted extensive experiments on over 30 representative LLMs across more than 1000 RV questions. Our findings suggest that LLMs exhibit an imbalance in proficiency between encountered and "unseen" data domains. Proficiency generalization across similar mathematical reasoning tasks is verified to be limited by accuracy and robustness, but it can still be enhanced through test-time scaling.
△ Less
Submitted 15 March, 2025; v1 submitted 20 January, 2025;
originally announced January 2025.
-
Multiclass Queue Scheduling Under Slowdown: An Approximate Dynamic Programming Approach
Authors:
Jing Dong,
Berk Görgülü,
Vahid Sarhangian
Abstract:
In many service systems, especially those in healthcare, customer waiting times can result in increased service requirements. Such service slowdowns can significantly impact system performance. Therefore, it is important to properly account for their impact when designing scheduling policies. Scheduling under wait-dependent service times is challenging, especially when multiple customer classes ar…
▽ More
In many service systems, especially those in healthcare, customer waiting times can result in increased service requirements. Such service slowdowns can significantly impact system performance. Therefore, it is important to properly account for their impact when designing scheduling policies. Scheduling under wait-dependent service times is challenging, especially when multiple customer classes are heterogeneously affected by waiting. In this work, we study scheduling policies in multiclass, multiserver queues with wait-dependent service slowdowns. We propose a simulation-based Approximate Dynamic Programming (ADP) algorithm to find close-to-optimal scheduling policies. The ADP algorithm (i) represents the policy using classifiers based on the index policy structure, (ii) leverages a coupling method to estimate the differences of the relative value functions directly, and (iii) uses adaptive sampling for efficient state-space exploration. Through extensive numerical experiments, we illustrate that the ADP algorithm generates close-to-optimal policies that outperform well-known benchmarks. We also provide insights into the structure of the optimal policy, which reveals an important trade-off between instantaneous cost reduction and preventing the system from reaching high-cost equilibria. Lastly, we conduct a case study on scheduling admissions into rehabilitation care to illustrate the effectiveness of the ADP algorithm in practice.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Study of $η\rightarrowπ^+π^-l^+l^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (637 additional authors not shown)
Abstract:
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η\rightarrowπ^+π^-l^+l^-$ ($l=e$ or $μ$) via the process $J/ψ\rightarrowγη$. The branching fraction of $η\rightarrowπ^+π^-e^+e^-$ is measured to be $\mathcal{B}(η\rightarrowπ^+π^-e^+e^-)=(3.07\pm0.12_{\rm{stat.}}\pm0.19_{\rm{syst.}}) \times10^{-4}$. No signal events are observed f…
▽ More
Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η\rightarrowπ^+π^-l^+l^-$ ($l=e$ or $μ$) via the process $J/ψ\rightarrowγη$. The branching fraction of $η\rightarrowπ^+π^-e^+e^-$ is measured to be $\mathcal{B}(η\rightarrowπ^+π^-e^+e^-)=(3.07\pm0.12_{\rm{stat.}}\pm0.19_{\rm{syst.}}) \times10^{-4}$. No signal events are observed for the $η\rightarrowπ^{+}π^{-}μ^{+}μ^{-}$ decay, leading to an upper limit on the branching fraction of $\mathcal{B}(η\rightarrowπ^{+}π^{-}μ^{+}μ^{-})<4.0\times10^{-7}$ at the 90\% confidence level. Furthermore, the $CP$-violation asymmetry parameter is found to be $\mathcal{A}_{CP}(η\rightarrowπ^{+}π^{-}e^{+}e^{-})=(-4.04\pm4.69_{\rm{stat.}}\pm0.14_{\rm{syst.}})\%$, showing no evidence of $CP$-violation with current statistics. Additionally, we extract the transition form factor from the decay amplitude of $η\rightarrowπ^+π^-e^+e^-$. Finally, axion-like particles are searched for via the decay $η\rightarrowπ^+π^-a, a\rightarrow e^+e^-$, and upper limits on this branching fraction relative to that of $η\rightarrowπ^+π^-e^+e^-$ are presented as a function of the axion-like particle mass in the range $5-200\ \mathrm{MeV}/c^{2}$.
△ Less
Submitted 17 January, 2025;
originally announced January 2025.
-
Resource-Constrained Federated Continual Learning: What Does Matter?
Authors:
Yichen Li,
Yuying Wang,
Jiahua Dong,
Haozhao Wang,
Yining Qi,
Rui Zhang,
Ruixuan Li
Abstract:
Federated Continual Learning (FCL) aims to enable sequentially privacy-preserving model training on streams of incoming data that vary in edge devices by preserving previous knowledge while adapting to new data. Current FCL literature focuses on restricted data privacy and access to previously seen data while imposing no constraints on the training overhead. This is unreasonable for FCL applicatio…
▽ More
Federated Continual Learning (FCL) aims to enable sequentially privacy-preserving model training on streams of incoming data that vary in edge devices by preserving previous knowledge while adapting to new data. Current FCL literature focuses on restricted data privacy and access to previously seen data while imposing no constraints on the training overhead. This is unreasonable for FCL applications in real-world scenarios, where edge devices are primarily constrained by resources such as storage, computational budget, and label rate. We revisit this problem with a large-scale benchmark and analyze the performance of state-of-the-art FCL approaches under different resource-constrained settings. Various typical FCL techniques and six datasets in two incremental learning scenarios (Class-IL and Domain-IL) are involved in our experiments. Through extensive experiments amounting to a total of over 1,000+ GPU hours, we find that, under limited resource-constrained settings, existing FCL approaches, with no exception, fail to achieve the expected performance. Our conclusions are consistent in the sensitivity analysis. This suggests that most existing FCL methods are particularly too resource-dependent for real-world deployment. Moreover, we study the performance of typical FCL techniques with resource constraints and shed light on future research directions in FCL.
△ Less
Submitted 15 January, 2025;
originally announced January 2025.
-
Search for the FCNC charmonium decay $J/ψ\to D^0 μ^+ μ^- + \text{c.c.}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
Based on a data sample of $(10087 \pm 44) \times 10^6$ $J/ψ$ events taken with the BESIII detector, we search for the flavor-changing neutral current charmonium decay $J/ψ\to D^{0} μ^{+} μ^{-} + \text{c.c.}$. No significant signal above the background is observed, and the upper limit on its branching fraction is set to be $\mathcal{B}(J/ψ\to D^{0}μ^{+}μ^{-} + \text{c.c.} ) < 1.1 \times 10^{-7}$ at…
▽ More
Based on a data sample of $(10087 \pm 44) \times 10^6$ $J/ψ$ events taken with the BESIII detector, we search for the flavor-changing neutral current charmonium decay $J/ψ\to D^{0} μ^{+} μ^{-} + \text{c.c.}$. No significant signal above the background is observed, and the upper limit on its branching fraction is set to be $\mathcal{B}(J/ψ\to D^{0}μ^{+}μ^{-} + \text{c.c.} ) < 1.1 \times 10^{-7}$ at the 90% confidence level. This marks the first search for a flavor-changing neutral current charmonium decay involving muons in the final state.
△ Less
Submitted 14 February, 2025; v1 submitted 14 January, 2025;
originally announced January 2025.
-
Perturbative Fourier Ptychographic Microscopy for Fast Quantitative Phase Imaging
Authors:
Martin Zach,
Kuan-Chen Shen,
Ruiming Cao,
Michael Unser,
Laura Waller,
Jonathan Dong
Abstract:
In computational phase imaging with a microscope equipped with an array of light emitting diodes as illumination unit, conventional Fourier ptychographic microscopy achieves high resolution and wide-field reconstructions but is constrained by a lengthy acquisition time. Conversely, differential phase contrast (DPC) offers fast imaging but is limited in resolution. Here, we introduce perturbative F…
▽ More
In computational phase imaging with a microscope equipped with an array of light emitting diodes as illumination unit, conventional Fourier ptychographic microscopy achieves high resolution and wide-field reconstructions but is constrained by a lengthy acquisition time. Conversely, differential phase contrast (DPC) offers fast imaging but is limited in resolution. Here, we introduce perturbative Fourier ptychographic microscopy (pFPM). pFPM is an extension of DPC that incorporates dark-field illumination to enable fast, high-resolution, wide-field quantitative phase imaging with few measurements. We interpret DPC as the initial iteration of a Gauss-Newton algorithm with quadratic regularization and generalize it to multiple iterations and more sophisticated regularizers. This broader framework is not restricted to bright-field measurements and allows us to overcome resolution limitations of DPC. We develop tailored annular dark-field illumination patterns that align with the perturbative interpretation and lead to an improvement in the quality of reconstruction with respect to other common illumination schemes. Consequently, our methodology combines an enhanced phase reconstruction algorithm with a specialized illumination strategy and offers significant advantages in both imaging speed and resolution.
△ Less
Submitted 10 March, 2025; v1 submitted 13 January, 2025;
originally announced January 2025.
-
Interplay of Electrostatic Interaction and Steric Repulsion between Bacteria and Gold Surface Influences Raman Enhancement
Authors:
Jia Dong,
Jeong Hee Kim,
Isaac Pincus,
Sujan Manna,
Jennifer M. Podgorski,
Yanmin Zhu,
Loza F. Tadesse
Abstract:
Plasmonic nanostructures have wide applications in photonics including pathogen detection and diagnosis via Surface-Enhanced Raman Spectroscopy (SERS). Despite major role plasmonics play in signal enhancement, electrostatics in SERS is yet to be fully understood and harnessed. Here, we perform a systematic study of electrostatic interactions between 785 nm resonant gold nanorods designed to harbor…
▽ More
Plasmonic nanostructures have wide applications in photonics including pathogen detection and diagnosis via Surface-Enhanced Raman Spectroscopy (SERS). Despite major role plasmonics play in signal enhancement, electrostatics in SERS is yet to be fully understood and harnessed. Here, we perform a systematic study of electrostatic interactions between 785 nm resonant gold nanorods designed to harbor zeta potentials of +29, +16, 0 and -9 mV spanning positive neutral and negative domains. SERS activity is tested on representative Gram-negative Escherichia coli and Gram-positive Staphylococcus epidermidis bacteria with zeta potentials of -30 and -23 mV respectively in water. Raman spectroscopy and Cryo-Electron microscopy reveal that +29, +16, 0 and -9 mV nanorods give SERS enhancement of 7.2X, 3.6X, 4.2X, 1.3X to Staphylococcus epidermidis and 3.9X, 2.8X, 2.9X, 1.1X to Escherichia coli. Theoretical results show that electrostatics play the major role among all interaction forces in determining cell-nanorod proximity and signal enhancement. We identify steric repulsion due to cell protrusions to be the critical opposing force. Finally, a design principle is proposed to estimate the electrostatic strength in SERS. Our work provides new insights into the principle of bacteria-nanorod interactions, enabling reproducible and precise biomolecular readouts, critical for next-generation point-of-care diagnostics and smart healthcare applications.
△ Less
Submitted 12 January, 2025;
originally announced January 2025.
-
Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis
Authors:
Dongdong Li,
Jiuxiang Dong
Abstract:
In this paper, two model-free optimal output tracking frameworks based on policy iteration for discrete-time multi-agent systems are proposed. First, we establish a framework of stabilizing policy iteration that can start from any initial feedback control policy, relaxing the dependence of traditional policy iteration on the initial stabilizing control policy. Then, another efficient and equivalen…
▽ More
In this paper, two model-free optimal output tracking frameworks based on policy iteration for discrete-time multi-agent systems are proposed. First, we establish a framework of stabilizing policy iteration that can start from any initial feedback control policy, relaxing the dependence of traditional policy iteration on the initial stabilizing control policy. Then, another efficient and equivalent $Q$-learning policy iteration framework is developed, which is shown to require only less system data to get the same results as the stabilizing policy iteration. Both frameworks obtain stabilizing control policy by iterating the stabilizing virtual closed-loop system step-by-step to the actual closed-loop system. Multiple explicit schemes for the iteration step-size/coefficient are designed and their stability during the above iterations is analyzed. By using the generated closed-loop stabilizing control policy and two frameworks, the optimal feedback control gain is obtained. The approximate solution of the regulator equations is found by model-free iteration, which leads to the optimal feedforward gain. Finally, the cooperative optimal output tracking is realized by a distributed feedforward-feedback controller. The proposed algorithms are validated by simulation.
△ Less
Submitted 11 January, 2025;
originally announced January 2025.
-
Probing electric field tunable multiband superconductivity in alternating twisted quadralayer graphene
Authors:
Le Liu,
Yu Hong,
Chengping Zhang,
Jundong Zhu,
Jingwei Dong,
Kenji Watanabe,
Takashi Taniguchi,
Luojun Du,
Dongxia Shi,
Kam Tuen Law,
Wei Yang,
Guangyu Zhang
Abstract:
Alternating twisted multilayer graphene presents a compelling multiband system for exploring superconductivity. Here we investigate robust superconductivity in alternating twisted quadralayer graphene, elucidating carrier contributions from both flat and dispersive bands. The superconductivity is robust, with a strong electrical field tunability, a maximum BKT transition temperature of 1.6 K, and…
▽ More
Alternating twisted multilayer graphene presents a compelling multiband system for exploring superconductivity. Here we investigate robust superconductivity in alternating twisted quadralayer graphene, elucidating carrier contributions from both flat and dispersive bands. The superconductivity is robust, with a strong electrical field tunability, a maximum BKT transition temperature of 1.6 K, and high critical magnetic fields beyond the Pauli limit. We disentangle the carrier density of Dirac bands and flat bands from the Landau fan diagram. Moreover, we could estimate the flatband Fermi velocity from the obtained high critical current near half filling when superconductivity is killed at finite magnetic fields, and further quantify the superfluid stiffness from the low critical current in the superconducting regime. Our results exhibit the electric field tunable coupling strength within the superconducting phase, revealing unconventional properties with vanishing Fermi velocity and large superfluid stiffness. These phenomena, attributed to substantial quantum metric contributions, offer new insights into the mechanisms underlying unconventional superconductivity in moire systems.
△ Less
Submitted 11 January, 2025;
originally announced January 2025.
-
Search for $K^0_S$ invisible decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Based on $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring, we search for $K_{S}^{0}$ invisible decays via the $J/ψ\to φK_{S}^{0} K_{S}^{0}$ process. No significant signal is observed, and the upper limit of the branching fraction of these invisible decays is set at 8.4 $\times$ $10^{-4}$ at the 90\% confidence level. This is the f…
▽ More
Based on $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring, we search for $K_{S}^{0}$ invisible decays via the $J/ψ\to φK_{S}^{0} K_{S}^{0}$ process. No significant signal is observed, and the upper limit of the branching fraction of these invisible decays is set at 8.4 $\times$ $10^{-4}$ at the 90\% confidence level. This is the first experimental search for $K^0_S$ invisible decays.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
Enhancing, Refining, and Fusing: Towards Robust Multi-Scale and Dense Ship Detection
Authors:
Congxia Zhao,
Xiongjun Fu,
Jian Dong,
Shen Cao,
Chunyan Zhang
Abstract:
Synthetic aperture radar (SAR) imaging, celebrated for its high resolution, all-weather capability, and day-night operability, is indispensable for maritime applications. However, ship detection in SAR imagery faces significant challenges, including complex backgrounds, densely arranged targets, and large scale variations. To address these issues, we propose a novel framework, Center-Aware SAR Shi…
▽ More
Synthetic aperture radar (SAR) imaging, celebrated for its high resolution, all-weather capability, and day-night operability, is indispensable for maritime applications. However, ship detection in SAR imagery faces significant challenges, including complex backgrounds, densely arranged targets, and large scale variations. To address these issues, we propose a novel framework, Center-Aware SAR Ship Detector (CASS-Det), designed for robust multi-scale and densely packed ship detection. CASS-Det integrates three key innovations: (1) a center enhancement module (CEM) that employs rotational convolution to emphasize ship centers, improving localization while suppressing background interference; (2) a neighbor attention module (NAM) that leverages cross-layer dependencies to refine ship boundaries in densely populated scenes; and (3) a cross-connected feature pyramid network (CC-FPN) that enhances multi-scale feature fusion by integrating shallow and deep features. Extensive experiments on the SSDD, HRSID, and LS-SSDD-v1.0 datasets demonstrate the state-of-the-art performance of CASS-Det, excelling at detecting multi-scale and densely arranged ships.
△ Less
Submitted 10 January, 2025;
originally announced January 2025.
-
Arbitrary control of the flow of light using pseudomagnetic fields in photonic crystals at telecommunication wavelengths
Authors:
Pan Hu,
Lu Sun,
Ce Chen,
Jingchi Li,
Xiong Ni,
Xintao He,
Jianwen Dong,
Yikai Su
Abstract:
In photonics, the idea of controlling light in a similar way that magnetic fields control electrons has always been attractive. It can be realized by synthesizing pseudomagnetic fields (PMFs) in photonic crystals (PhCs). Previous works mainly focus on the Landau levels and the robust transport of the chiral states. More versatile control over light using complex nonuniform PMFs such as the flexibl…
▽ More
In photonics, the idea of controlling light in a similar way that magnetic fields control electrons has always been attractive. It can be realized by synthesizing pseudomagnetic fields (PMFs) in photonic crystals (PhCs). Previous works mainly focus on the Landau levels and the robust transport of the chiral states. More versatile control over light using complex nonuniform PMFs such as the flexible splitting and routing of light has been elusive, which hinders their application in practical photonic integrated circuits. Here we propose an universal and systematic methodology to design nonuniform PMFs and arbitrarily control the flow of light in silicon PhCs at telecommunication wavelengths. As proofs of concept, a low-loss S-bend and a highly efficient 50:50 power splitter based on PMFs are experimentally demonstrated. A high-speed data transmission experiment is performed on these devices to prove their applicability in real communication systems. The proposed method offers a new paradigm for the exploration of fundamental physics and the development of novel nanophotonic devices.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Search for the leptonic decay $D^{+}\to e^{+}ν_{e}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
We search for the leptonic decay $D^+\to e^+ν_{e}$ using an $e^+e^-$ collision data sample with an integrated luminosity of 20.3~fb$^{-1}$ collected with the BESIII detector at the center-of-mass energy of 3.773~GeV. No significant signal is observed and an upper limit on the branching fraction of $D^+\to e^+ν_{e}$ is set as $9.7 \times 10^{-7}$, at the 90\% confidence level. Our upper limit is an…
▽ More
We search for the leptonic decay $D^+\to e^+ν_{e}$ using an $e^+e^-$ collision data sample with an integrated luminosity of 20.3~fb$^{-1}$ collected with the BESIII detector at the center-of-mass energy of 3.773~GeV. No significant signal is observed and an upper limit on the branching fraction of $D^+\to e^+ν_{e}$ is set as $9.7 \times 10^{-7}$, at the 90\% confidence level. Our upper limit is an order of magnitude smaller than the previous limit for this decay mode.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
Observation of the $W$-annihilation process $D_s^+ \to ωρ^+$ and measurement of $D_s^+ \to φρ^+$ in $D^+_s\to π^+π^+π^-π^0π^0$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching f…
▽ More
We present the first amplitude analysis and branching fraction measurement of the decay $D^+_s\to π^+π^+π^-π^0π^0$, using $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV corresponding to an integrated luminosity of 7.33 fb$^{-1}$, and report the first observation of the pure $W$-annihilation decay $D_s^+ \to ωρ^+$ with a branching fraction of $(0.99\pm0.08_{\rm stat}{\ ^{+0.05}_{-0.07}}_{\rm syst})\%$. %The absolute branching fraction is measured to be $(0.99\pm0.08_{\rm stat}\pm0.07_{\rm syst})\%$. In comparison to the low significance of the $\mathcal{D}$ wave in the decay $D_s^+ \to φρ^+$, the dominance of the $\mathcal{D}$ wave over the $\mathcal{S}$ and $\mathcal{P}$ waves, with a fraction of $(51.85\pm7.28_{\rm stat}{\ ^{+4.83}_{-7.90}}_{\rm syst})\%$ observed in the decay $D_s^+ \to ωρ^+$, provides crucial information for the``polarization puzzle", as well as for the understanding of charm meson decays. The branching fraction of $D^+_s\to π^+π^+π^-π^0π^0$ is measured to be ($4.41\pm0.15_{\rm stat}\pm0.13_{\rm syst}$)\%. Moreover, the branching fraction of $D_s^+ \to φρ^+$ is measured to be $(3.98\pm0.33_{\rm stat}{\ ^{+0.21}_{-0.19}}_{\rm syst})\%$, and the $R_φ= {\mathcal{B}(φ\toπ^+π^-π^0)}/{\mathcal{B}(φ\to K^+K^-)}$ is determined to be $(0.222\pm0.019_{\rm stat}{\ ^{+0.016}_{-0.016}}_{\rm syst}$), which is consistent with the previous measurement based on charm meson decays, but deviates from the results from $e^+e^-$ annihilation and $K$-$N$ scattering experiments by more than 3$σ$.
△ Less
Submitted 23 May, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Study of the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
We report the first measurement of the di-electron invariant mass dependent transition form factor in the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the BESIII detector. A clear $ρ-ω$ interference structure is observed, consistent with the pion form factor, which offers a novel approach to extract the hadronic vacuum polarization c…
▽ More
We report the first measurement of the di-electron invariant mass dependent transition form factor in the electromagnetic Dalitz decay $J/ψ\to e^+e^- π^0$ using $(10087 \pm 44) \times 10^6$ $J/ψ$ events collected by the BESIII detector. A clear $ρ-ω$ interference structure is observed, consistent with the pion form factor, which offers a novel approach to extract the hadronic vacuum polarization contribution to the anomalous muon magnetic moment ($a_μ$) and refine the predictions of the Vector Meson Dominance (VMD) model and hadronic light-by-light contribution to $a_μ$. By taking into account the contribution of this $ρ-ω$ interference structure, the branching fraction of $J/ψ\to e^+e^- π^0$ in the full $e^+e^-$ invariant mass range is also measured for the first time to be $(8.06 \pm 0.31 (\rm{stat}) \pm 0.38 (\rm{syst}))\times 10^{-7}$, approximately twice the non-resonant VMD prediction.
△ Less
Submitted 3 July, 2025; v1 submitted 8 January, 2025;
originally announced January 2025.
-
Critical properties in the non-Hermitian Aubry-Andre-Stark model
Authors:
Ji-Long Dong,
En-Wen Liang,
Shi-Yang Liu,
Guo-Qing Zhang,
Ling-Zhi Tang,
Dan-Wei Zhang
Abstract:
We explore the critical properties of the localization transition in the non-Hermitian Aubry-Andre-Stark (AAS) model with quasiperiodic and Stark potentials, where the non-Hermiticity comes from the nonreciprocal hopping. The localization length, the inverse participation ratio and the energy gap are adopted as the characteristic quantities. We perform the scaling analysis to derive the scaling fu…
▽ More
We explore the critical properties of the localization transition in the non-Hermitian Aubry-Andre-Stark (AAS) model with quasiperiodic and Stark potentials, where the non-Hermiticity comes from the nonreciprocal hopping. The localization length, the inverse participation ratio and the energy gap are adopted as the characteristic quantities. We perform the scaling analysis to derive the scaling functions of the three quantities with critical exponents in several critical regions, with respect to the quasiperiodic and Stark potentials and the nonreciprocal strength. We numerically verify the finite-size scaling forms and extract the critical exponents in different situations. Two groups of new critical exponents for the non-Hermitian AAS model and its pure Stark limit are obtained, which are distinct to those for the non-Hermitian Aubry-Andre model and their Hermitian counterparts. Our results indicate that the Hermitian and non-Hermitian AAS, Aubry-Andre, and Stark models belong to different universality classes. We demonstrate that these critical exponents are independent of the nonreciprocal strength, and remain the same in different critical regions and boundary conditions. Furthermore, we establish a hybrid scaling function with a hybrid exponent in the overlap region between the critical regions for the non-Hermitian AAS and Stark models.
△ Less
Submitted 13 May, 2025; v1 submitted 7 January, 2025;
originally announced January 2025.
-
Nonrelativistic spin-splitting multiferroic antiferromagnet and compensated ferrimagnet with zero net magnetization
Authors:
Jianting Dong,
Kun Wu,
Meng Zhu,
Fanxing Zheng,
Xinlu Li,
Jia Zhang
Abstract:
Spin-splitting antiferromagnets with spin-polarized band structures in momentum space have garnered intensive research attention due to their zero net magnetic moments, ultras fast spin dynamics as conventional antiferromagnets, and spin-polarized transport properties akin to ferromagnets, making them promising candidates for antiferromagnetic spintronics. However, unlike spin-torque switching of…
▽ More
Spin-splitting antiferromagnets with spin-polarized band structures in momentum space have garnered intensive research attention due to their zero net magnetic moments, ultras fast spin dynamics as conventional antiferromagnets, and spin-polarized transport properties akin to ferromagnets, making them promising candidates for antiferromagnetic spintronics. However, unlike spin-torque switching of ferromagnets by electric current, efficient electric control of spin-splitting antiferromagnetic order remains challenges. In this work, we identify prototypes of multiferroic spin-splitting antiferromagnets, including BiFeO3, Fe2Mo3O8 and compensated ferrimagnet GaFeO3 with ferroelectric polarization as well as spin-polarized electronic structures. We establish design principles for the spin-splitting multiferroic antiferromagnets and compensated ferrimagnets, elucidating the band symmetry features in Brillouin zone. We demonstrate that the spin polarization in spin-splitting magnets, despite of zero net magnetic moment, can be switched by ferroelectric polarization, providing an efficient means of controlling the antiferromagnetic order. Our work may inspire future development of novel multiferroic functional magnets with zero magnetic moments and pave the way for their applications in magnetoelectric spintronic devices.
△ Less
Submitted 6 January, 2025;
originally announced January 2025.
-
ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction
Authors:
Binyu Zhang,
Zhu Meng,
Junhao Dong,
Fei Su,
Zhicheng Zhao
Abstract:
Survival prediction is a crucial task in the medical field and is essential for optimizing treatment options and resource allocation. However, current methods often rely on limited data modalities, resulting in suboptimal performance. In this paper, we propose an Integrated Cross-modal Fusion Network (ICFNet) that integrates histopathology whole slide images, genomic expression profiles, patient d…
▽ More
Survival prediction is a crucial task in the medical field and is essential for optimizing treatment options and resource allocation. However, current methods often rely on limited data modalities, resulting in suboptimal performance. In this paper, we propose an Integrated Cross-modal Fusion Network (ICFNet) that integrates histopathology whole slide images, genomic expression profiles, patient demographics, and treatment protocols. Specifically, three types of encoders, a residual orthogonal decomposition module and a unification fusion module are employed to merge multi-modal features to enhance prediction accuracy. Additionally, a balanced negative log-likelihood loss function is designed to ensure fair training across different patients. Extensive experiments demonstrate that our ICFNet outperforms state-of-the-art algorithms on five public TCGA datasets, including BLCA, BRCA, GBMLGG, LUAD, and UCEC, and shows its potential to support clinical decision-making and advance precision medicine. The codes are available at: https://github.com/binging512/ICFNet.
△ Less
Submitted 6 January, 2025;
originally announced January 2025.
-
Observation of $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where th…
▽ More
Based on $(2712.4 \pm 14.3)\times 10^6$ $ψ(3686)$ events collected at the BESIII detector operating at the BEPCII collider, we present the first observation of the decay $ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.$. The product branching fraction ${\cal B}[ψ(3686) \to K^{-}Λ(1520)\barΞ^{+} + c.c.] \times {\cal B}[Λ(1520) \to pK^{-}]$ is measured to be $(9.5 \pm 0.8 \pm 1.1) \times 10^{-7}$, where the first uncertainty is statistical and the second systematic.
△ Less
Submitted 5 January, 2025;
originally announced January 2025.
-
Search for $η_c(2S)\to p\bar{p}K^+K^-$ and measurement of $χ_{cJ}\to p\bar{p}K^+K^-$ in $ψ(3686)$ radiative decays
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (639 additional authors not shown)
Abstract:
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a signific…
▽ More
A search for $η_c(2S)\to p\bar{p}K^+K^-$, together with measurement of branching fractions of $χ_{cJ(J=0,1,2)}\to p\bar{p}K^+K^-$ in the $ψ(3686) \to γη_c(2S)$ and the $ψ(3686) \to γχ_{cJ}$ radiative decays, is performed with $(2712.4\pm14.3)\times 10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider. An evidence for $η_c(2S)\to p\bar{p}K^+K^-$ is found, with a significance of $3.3σ$. The product branching fraction of $\mathcal{B}[ψ(3686)\toγη_c(2S)]\cdot\mathcal{B}[η_c(2S)\to p\bar{p}K^+K^-]$ is determined to be $(1.98\mkern 2mu\pm\mkern 2mu0.41_{\text{stat.}}\mkern 2mu\pm\mkern 2mu0.99_{\text{syst.}})\times 10^{-7}$. The product branching fractions of $\mathcal{B}[ψ(3686)\toγχ_{cJ}]\cdot\mathcal{B}[χ_{cJ}\to p\bar{p}K^+K^-]$ are measured to be $(2.49\mkern 2mu\pm\mkern 2mu 0.03_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, $(1.83\mkern 2mu \pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu \pm\mkern 2mu 0.11_{\text{syst.}})\times 10^{-5}$, and $(2.43\mkern 2mu\pm\mkern 2mu 0.02_{\text{stat.}}\mkern 2mu\pm\mkern 2mu 0.15_{\text{syst.}})\times 10^{-5}$, for $J=0,\ 1$, and 2, respectively.
△ Less
Submitted 3 January, 2025;
originally announced January 2025.
-
Realization of chiral whispering gallery mode cavities enabled by photonic Chern insulators
Authors:
Hao-Chang Mo,
Zi-Xuan Gao,
Xiao-Dong Chen,
Jian-Wen Dong
Abstract:
Recently, whispering gallery modes (WGMs) have attracted considerable attention due to their extensive applications in the development of on-chip microcavities, high-sensitivity sensors, and high-performance lasers. Conventional WGMs are achiral under the time-reversal symmetry, and show high sensitivity to defects in optical devices. Here, we introduce topological physics into photonic cavities a…
▽ More
Recently, whispering gallery modes (WGMs) have attracted considerable attention due to their extensive applications in the development of on-chip microcavities, high-sensitivity sensors, and high-performance lasers. Conventional WGMs are achiral under the time-reversal symmetry, and show high sensitivity to defects in optical devices. Here, we introduce topological physics into photonic cavities and demonstrate the realization of chiral WGMs enabled by photonic Chern insulators. Through comprehensive numerical simulations and experimental measurements, we reveal the critical differences between chiral and achiral WGMs, highlighting the robustness of chiral WGMs even in the presence of defects within the cavities. Our research provides valuable insights into the design of robust WGM cavities and offers a novel platform for exploring light-matter interaction phenomena.
△ Less
Submitted 1 January, 2025;
originally announced January 2025.
-
Spin Hall effect in 3d ferromagnetic metals for field-free switching of perpendicular magnetization: A first-principles investigation
Authors:
Fanxing Zheng,
Jianting Dong,
Yizhuo Song,
Meng Zhu,
Xinlu Li,
Jia Zhang
Abstract:
Ferromagnetic metals, with the potential to generate spin current with unconventional spin polarization via the spin Hall effect, offer promising opportunities for field-free switching of perpendicular magnetization and for the spin-orbit torque devices. In this study, we investigate two distinct spin Hall mechanisms in 3d ferromagnetic metals including spin-orbit coupling driven spin Hall effect…
▽ More
Ferromagnetic metals, with the potential to generate spin current with unconventional spin polarization via the spin Hall effect, offer promising opportunities for field-free switching of perpendicular magnetization and for the spin-orbit torque devices. In this study, we investigate two distinct spin Hall mechanisms in 3d ferromagnetic metals including spin-orbit coupling driven spin Hall effect in Fe, Co, Ni and their alloys, and non-relativistic spin Hall effect arising from anisotropic spin-polarized transport by taking L10-MnAl as an example. By employing first-principles calculations, we examine the temperature and alloy composition dependence of spin Hall conductivity in Fe, Co, Ni and their alloys. Our results reveal that the spin Hall conductivities with out-of-plane spin polarization in 3d ferromagnetic metals are at the order of 1000 \frac{\hbar}{2e} \left( Ω\, \text{cm} \right)^{-1} at 300 K, but with a relatively low spin Hall angles around 0.01~0.02 due to the large longitudinal conductivity. For L10-MnAl(101), the non-relativistic spin Hall conductivity can reach up to 10000\frac{\hbar}{2e} \left( Ω\, \text{cm} \right)^{-1}, with a giant spin Hall angle around 0.25 at room temperature. By analyzing the magnetization switching process, we demonstrate deterministic switching of perpendicular magnetization without an external magnetic field by using 3d ferromagnetic metals as spin current sources. Our work may provide an unambiguous understanding on spin Hall effect in ferromagnetic metals and pave the way for their potential applications in related spintronic devices.
△ Less
Submitted 1 January, 2025;
originally announced January 2025.
-
Theory of Transient Heat Conduction
Authors:
David E. Crawford,
Yi Zeng,
Judith Vidal,
Jianjun Dong
Abstract:
Ultrafast and nanoscale heat conduction demands a unified theoretical framework that rigorously bridges macroscopic transport equations with microscopic material properties derived from statistical physics.Existing empirical generalizations of Fourier's law often lack a solid microscopic foundation, failing to connect observed non-Fourier behavior with underlying atomic scale mechanisms. In this w…
▽ More
Ultrafast and nanoscale heat conduction demands a unified theoretical framework that rigorously bridges macroscopic transport equations with microscopic material properties derived from statistical physics.Existing empirical generalizations of Fourier's law often lack a solid microscopic foundation, failing to connect observed non-Fourier behavior with underlying atomic scale mechanisms. In this work, we present a time-domain theory of transient heat conduction rooted in Zwanzig's statistical theory of irreversible processes. Central to this framework is the time-domain transport function, Z(t), defined through equilibrium time-correlation functions of heat fluxes. This function generalizes the conventional concept of steady-state thermal conductivity, governing the transition of conduction dynamics from onset second sound type wave propagation at finite speeds to diffusion-dominated behavior across broad temporal and spatial scales. Unlike phonon hydrodynamic models that rely on mesoscopic constructs such as phonon drift velocity, our approach provides a quantitative and microscopic description of intrinsic memory effects in transient heat fluxes and applies universally to bulk materials at any temperature or length scale. By integrating atomistic-scale first-principles calculations with continuum-level macroscopic equations, this framework offers a robust foundation for numerical simulations of transient temperature fields. Furthermore, it facilitates the interpretation and design of transient thermal grating experiments using nanometer-scale heat sources and ultrafast laser systems in the extreme ultraviolet and x-ray wavelength ranges, advancing our understanding of heat dissipation dynamics.
△ Less
Submitted 31 December, 2024;
originally announced January 2025.
-
Score-Based Metropolis-Hastings Algorithms
Authors:
Ahmed Aloui,
Ali Hasan,
Juncheng Dong,
Zihao Wu,
Vahid Tarokh
Abstract:
In this paper, we introduce a new approach for integrating score-based models with the Metropolis-Hastings algorithm. While traditional score-based diffusion models excel in accurately learning the score function from data points, they lack an energy function, making the Metropolis-Hastings adjustment step inaccessible. Consequently, the unadjusted Langevin algorithm is often used for sampling usi…
▽ More
In this paper, we introduce a new approach for integrating score-based models with the Metropolis-Hastings algorithm. While traditional score-based diffusion models excel in accurately learning the score function from data points, they lack an energy function, making the Metropolis-Hastings adjustment step inaccessible. Consequently, the unadjusted Langevin algorithm is often used for sampling using estimated score functions. The lack of an energy function then prevents the application of the Metropolis-adjusted Langevin algorithm and other Metropolis-Hastings methods, limiting the wealth of other algorithms developed that use acceptance functions. We address this limitation by introducing a new loss function based on the \emph{detailed balance condition}, allowing the estimation of the Metropolis-Hastings acceptance probabilities given a learned score function. We demonstrate the effectiveness of the proposed method for various scenarios, including sampling from heavy-tail distributions.
△ Less
Submitted 31 March, 2025; v1 submitted 31 December, 2024;
originally announced January 2025.
-
Data-Based Efficient Off-Policy Stabilizing Optimal Control Algorithms for Discrete-Time Linear Systems via Damping Coefficients
Authors:
Dongdong Li,
Jiuxiang Dong
Abstract:
Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement and achieve model-free optimal control, in this paper, two different reinforcement learning algorithms based on policy iteration and variable damping coefficie…
▽ More
Policy iteration is one of the classical frameworks of reinforcement learning, which requires a known initial stabilizing control. However, finding the initial stabilizing control depends on the known system model. To relax this requirement and achieve model-free optimal control, in this paper, two different reinforcement learning algorithms based on policy iteration and variable damping coefficients are designed for unknown discrete-time linear systems. First, a stable artificial system is designed, and this system is gradually iterated to the original system by varying the damping coefficients. This allows the initial stabilizing control to be obtained in a finite number of iteration steps. Then, an off-policy iteration algorithm and an off-policy $\mathcal{Q}$-learning algorithm are designed to select the appropriate damping coefficients and realize data-driven. In these two algorithms, the current estimates of optimal control gain are not applied to the system to re-collect data. Moreover, they are characterized by the fast convergence of the traditional policy iteration. Finally, the proposed algorithms are validated by simulation.
△ Less
Submitted 19 March, 2025; v1 submitted 30 December, 2024;
originally announced December 2024.
-
Measurement of Born cross section of $e^+e^-\toΣ^0\barΣ^0$ at $\sqrt{s} = 3.50-4.95$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (649 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$,…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at thirty-two center-of-mass energies from 3.50 to 4.95 GeV, corresponding to an integrated luminosity of 25 $\rm{fb^{-1}}$, we measure the Born cross section of the $e^+e^-\toΣ^0\barΣ^0$ reaction and the effective form factor. No significant charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $ψ(4230)$, $ψ(4360)$, $ψ(4415)$, or $ψ(4660)$, decaying into the $Σ^0\barΣ^0$ final state is observed by fitting the $e^+e^- \to Σ^0\barΣ^0$ dressed cross section. The upper limits for the product of the branching fraction and the electronic partial width at the 90% confidence level are provided for each assumed charmonium(-like) state. In addition, the ratios of the Born cross section and the effective form factor between the $e^+e^-\toΣ^0\barΣ^0$ and the $e^+e^-\toΣ^+\barΣ^-$ reactions are provided, which can be used to validate the prediction of the vector meson dominance model.
△ Less
Submitted 14 March, 2025; v1 submitted 28 December, 2024;
originally announced December 2024.
-
Search for the double Dalitz decays $η/η' \to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions o…
▽ More
Using a data sample of $(10087 \pm 44) \times {10^{6}}$ $J/ψ$ events collected with the BESIII detector, we search for the decays $η/η'\to e^+e^-μ^+μ^-$ and $η' \to μ^+μ^-μ^+μ^-$ via the radiative decays $J/ψ\toγη$/$γη'$. No excess of events over expected background is observed for any of the decays of interest. At 90% confidence level, we report the first upper limits on the branching fractions of $η' \to e^{+}e^{-}μ^{+}μ^{-}$ and $η' \to μ^{+}μ^{-}μ^{+}μ^{-}$ to be $ 1.75 \times {10^{-6}}$ and $5.28 \times {10^{-7}}$, respectively. In addition, we set an upper limit on the branching fraction of $η\to e^{+}e^{-}μ^{+}μ^{-}$ to be $6.88 \times {10^{-6}}$, which improves the previous result by about two orders of magnitude.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
On one-loop amplitudes in gauge theories
Authors:
Qu Cao,
Jin Dong,
Song He,
Fan Zhu
Abstract:
We propose a new ``universal expansion" for one-loop amplitudes with arbitrary number of gluons in $D$ dimensions, which holds for general gauge theories with gluons/fermions/scalars in the loop, including pure and supersymmetric Yang-Mills theories. It expresses the $n$-gluon amplitudes as a linear combination of universal scalar-loop amplitudes with $n{-}m$ gluons and $m$ scalars, multiplied by…
▽ More
We propose a new ``universal expansion" for one-loop amplitudes with arbitrary number of gluons in $D$ dimensions, which holds for general gauge theories with gluons/fermions/scalars in the loop, including pure and supersymmetric Yang-Mills theories. It expresses the $n$-gluon amplitudes as a linear combination of universal scalar-loop amplitudes with $n{-}m$ gluons and $m$ scalars, multiplied by gauge-invariant building blocks (defined for general gauge theories); the integrands of these scalar-loop amplitudes are given in terms of tree-level objects attached to the scalar loop, or by differential operators acting on the most important part which is proportional to $D$ (with $m=0$). We present closed-formula for these one-loop integrands and prove them by showing that the single cuts are correctly reproduced by the gluing of an additional pair of gluons (fermions/scalars) in the forward limit, plus $n$ gluons in a tree amplitude.
△ Less
Submitted 27 December, 2024;
originally announced December 2024.
-
An Engorgio Prompt Makes Large Language Model Babble on
Authors:
Jianshuo Dong,
Ziyuan Zhang,
Qingjie Zhang,
Tianwei Zhang,
Hao Wang,
Hewu Li,
Qi Li,
Chao Zhang,
Ke Xu,
Han Qiu
Abstract:
Auto-regressive large language models (LLMs) have yielded impressive performance in many real-world tasks. However, the new paradigm of these LLMs also exposes novel threats. In this paper, we explore their vulnerability to inference cost attacks, where a malicious user crafts Engorgio prompts to intentionally increase the computation cost and latency of the inference process. We design Engorgio,…
▽ More
Auto-regressive large language models (LLMs) have yielded impressive performance in many real-world tasks. However, the new paradigm of these LLMs also exposes novel threats. In this paper, we explore their vulnerability to inference cost attacks, where a malicious user crafts Engorgio prompts to intentionally increase the computation cost and latency of the inference process. We design Engorgio, a novel methodology, to efficiently generate adversarial Engorgio prompts to affect the target LLM's service availability. Engorgio has the following two technical contributions. (1) We employ a parameterized distribution to track LLMs' prediction trajectory. (2) Targeting the auto-regressive nature of LLMs' inference process, we propose novel loss functions to stably suppress the appearance of the <EOS> token, whose occurrence will interrupt the LLM's generation process. We conduct extensive experiments on 13 open-sourced LLMs with parameters ranging from 125M to 30B. The results show that Engorgio prompts can successfully induce LLMs to generate abnormally long outputs (i.e., roughly 2-13$\times$ longer to reach 90%+ of the output length limit) in a white-box scenario and our real-world experiment demonstrates Engergio's threat to LLM service with limited computing resources. The code is released at: https://github.com/jianshuod/Engorgio-prompt.
△ Less
Submitted 12 February, 2025; v1 submitted 26 December, 2024;
originally announced December 2024.
-
High-Accuracy Schottky Diagnostics for Low-SNR Betatron Tune Measurement in Ramping Synchrotrons
Authors:
Peihan Sun,
Manzhou Zhang,
Renxian Yuan,
Deming Li,
Jian Dong,
Ying Shi
Abstract:
This study introduces a novel real-time betatron tune measurement algorithm, utilizing Schottky signals and an FPGA-based backend architecture, specifically designed for rapidly ramping synchrotrons, with particular application to the Shanghai Advanced Proton Therapy (SAPT) facility. The developed algorithm demonstrates improved measurement accuracy under challenging operational conditions, especi…
▽ More
This study introduces a novel real-time betatron tune measurement algorithm, utilizing Schottky signals and an FPGA-based backend architecture, specifically designed for rapidly ramping synchrotrons, with particular application to the Shanghai Advanced Proton Therapy (SAPT) facility. The developed algorithm demonstrates improved measurement accuracy under challenging operational conditions, especially in scenarios with limited sampling time and signal-to-noise ratios (SNR) as low as \(-20\) dB. By applying Short-Time Fourier Transform (STFT) analysis, the algorithm effectively accommodates the rapid increase in revolution frequency from 4 MHz to 7.5 MHz over 0.35 seconds, along with tune shifts. A macro-particle simulation methodology is employed to generate Schottky signals, which are then combined with real noise collected from an analog-to-digital converter (ADC) to simulate practical conditions. The proposed betatron tune measurement algorithm integrates advanced spectral processing techniques and an enhanced peak detection algorithm specifically tailored for low SNR conditions. Experimental validation confirms the superior performance of the proposed algorithm over conventional approaches in terms of measurement accuracy, stability, and system robustness, while meeting the stringent operational requirements of proton therapy applications. This innovative approach effectively addresses critical limitations associated with Schottky diagnostics for betatron tune measurement in rapidly ramping synchrotrons operating under low SNR conditions, laying a robust foundation and providing a viable solution for advanced applications in proton therapy and related accelerator physics fields.
△ Less
Submitted 9 June, 2025; v1 submitted 26 December, 2024;
originally announced December 2024.
-
TPAoI: Ensuring Fresh Service Status at the Network Edge in Compute-First Networking
Authors:
Haosheng He,
Jianpeng Qi,
Chao Liu,
Junyu Dong,
Yanwei Yu
Abstract:
In compute-first networking, maintaining fresh and accurate status information at the network edge is crucial for effective access to remote services. This process typically involves three phases: Status updating, user accessing, and user requesting. However, current studies on status effectiveness, such as Age of Information at Query (QAoI), do not comprehensively cover all these phases. Therefor…
▽ More
In compute-first networking, maintaining fresh and accurate status information at the network edge is crucial for effective access to remote services. This process typically involves three phases: Status updating, user accessing, and user requesting. However, current studies on status effectiveness, such as Age of Information at Query (QAoI), do not comprehensively cover all these phases. Therefore, this paper introduces a novel metric, TPAoI, aimed at optimizing update decisions by measuring the freshness of service status. The stochastic nature of edge environments, characterized by unpredictable communication delays in updating, requesting, and user access times, poses a significant challenge when modeling. To address this, we model the problem as a Markov Decision Process (MDP) and employ a Dueling Double Deep Q-Network (D3QN) algorithm for optimization. Extensive experiments demonstrate that the proposed TPAoI metric effectively minimizes AoI, ensuring timely and reliable service updates in dynamic edge environments. Results indicate that TPAoI reduces AoI by an average of 47\% compared to QAoI metrics and decreases update frequency by an average of 48\% relative to conventional AoI metrics, showing significant improvement.
△ Less
Submitted 24 December, 2024;
originally announced December 2024.
-
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Authors:
Lixian Jing,
Jianpeng Qi,
Junyu Dong,
Yanwei Yu
Abstract:
As deep neural networks (DNNs) are increasingly deployed on edge devices, optimizing models for constrained computational resources is critical. Existing auto-pruning methods face challenges due to the diversity of DNN models, various operators (e.g., filters), and the difficulty in balancing pruning granularity with model accuracy. To address these limitations, we introduce AutoSculpt, a pattern-…
▽ More
As deep neural networks (DNNs) are increasingly deployed on edge devices, optimizing models for constrained computational resources is critical. Existing auto-pruning methods face challenges due to the diversity of DNN models, various operators (e.g., filters), and the difficulty in balancing pruning granularity with model accuracy. To address these limitations, we introduce AutoSculpt, a pattern-based automated pruning framework designed to enhance efficiency and accuracy by leveraging graph learning and deep reinforcement learning (DRL). AutoSculpt automatically identifies and prunes regular patterns within DNN architectures that can be recognized by existing inference engines, enabling runtime acceleration. Three key steps in AutoSculpt include: (1) Constructing DNNs as graphs to encode their topology and parameter dependencies, (2) embedding computationally efficient pruning patterns, and (3) utilizing DRL to iteratively refine auto-pruning strategies until the optimal balance between compression and accuracy is achieved. Experimental results demonstrate the effectiveness of AutoSculpt across various architectures, including ResNet, MobileNet, VGG, and Vision Transformer, achieving pruning rates of up to 90% and nearly 18% improvement in FLOPs reduction, outperforming all baselines. The codes can be available at https://anonymous.4open.science/r/AutoSculpt-DDA0
△ Less
Submitted 19 June, 2025; v1 submitted 23 December, 2024;
originally announced December 2024.
-
Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network
Authors:
Xiang Fang,
Wanlong Fang,
Changshuo Wang,
Daizong Liu,
Keke Tang,
Jianfeng Dong,
Pan Zhou,
Beibei Li
Abstract:
Given some video-query pairs with untrimmed videos and sentence queries, temporal sentence grounding (TSG) aims to locate query-relevant segments in these videos. Although previous respectable TSG methods have achieved remarkable success, they train each video-query pair separately and ignore the relationship between different pairs. We observe that the similar video/query content not only helps t…
▽ More
Given some video-query pairs with untrimmed videos and sentence queries, temporal sentence grounding (TSG) aims to locate query-relevant segments in these videos. Although previous respectable TSG methods have achieved remarkable success, they train each video-query pair separately and ignore the relationship between different pairs. We observe that the similar video/query content not only helps the TSG model better understand and generalize the cross-modal representation but also assists the model in locating some complex video-query pairs. Previous methods follow a single-thread framework that cannot co-train different pairs and usually spends much time re-obtaining redundant knowledge, limiting their real-world applications. To this end, in this paper, we pose a brand-new setting: Multi-Pair TSG, which aims to co-train these pairs. In particular, we propose a novel video-query co-training approach, Multi-Thread Knowledge Transfer Network, to locate a variety of video-query pairs effectively and efficiently. Firstly, we mine the spatial and temporal semantics across different queries to cooperate with each other. To learn intra- and inter-modal representations simultaneously, we design a cross-modal contrast module to explore the semantic consistency by a self-supervised strategy. To fully align visual and textual representations between different pairs, we design a prototype alignment strategy to 1) match object prototypes and phrase prototypes for spatial alignment, and 2) align activity prototypes and sentence prototypes for temporal alignment. Finally, we develop an adaptive negative selection module to adaptively generate a threshold for cross-modal matching. Extensive experiments show the effectiveness and efficiency of our proposed method.
△ Less
Submitted 3 April, 2025; v1 submitted 20 December, 2024;
originally announced December 2024.
-
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium
Authors:
Xinzhe Li,
Jiahui Zhan,
Shengfeng He,
Yangyang Xu,
Junyu Dong,
Huaidong Zhang,
Yong Du
Abstract:
Personalized image generation has made significant strides in adapting content to novel concepts. However, a persistent challenge remains: balancing the accurate reconstruction of unseen concepts with the need for editability according to the prompt, especially when dealing with the complex nuances of facial features. In this study, we delve into the temporal dynamics of the text-to-image conditio…
▽ More
Personalized image generation has made significant strides in adapting content to novel concepts. However, a persistent challenge remains: balancing the accurate reconstruction of unseen concepts with the need for editability according to the prompt, especially when dealing with the complex nuances of facial features. In this study, we delve into the temporal dynamics of the text-to-image conditioning process, emphasizing the crucial role of stage partitioning in introducing new concepts. We present PersonaMagic, a stage-regulated generative technique designed for high-fidelity face customization. Using a simple MLP network, our method learns a series of embeddings within a specific timestep interval to capture face concepts. Additionally, we develop a Tandem Equilibrium mechanism that adjusts self-attention responses in the text encoder, balancing text description and identity preservation, improving both areas. Extensive experiments confirm the superiority of PersonaMagic over state-of-the-art methods in both qualitative and quantitative evaluations. Moreover, its robustness and flexibility are validated in non-facial domains, and it can also serve as a valuable plug-in for enhancing the performance of pretrained personalization models.
△ Less
Submitted 20 December, 2024;
originally announced December 2024.
-
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
Authors:
Shunlin Lu,
Jingbo Wang,
Zeyu Lu,
Ling-Hao Chen,
Wenxun Dai,
Junting Dong,
Zhiyang Dou,
Bo Dai,
Ruimao Zhang
Abstract:
The scaling law has been validated in various domains, such as natural language processing (NLP) and massive computer vision tasks; however, its application to motion generation remains largely unexplored. In this paper, we introduce a scalable motion generation framework that includes the motion tokenizer Motion FSQ-VAE and a text-prefix autoregressive transformer. Through comprehensive experimen…
▽ More
The scaling law has been validated in various domains, such as natural language processing (NLP) and massive computer vision tasks; however, its application to motion generation remains largely unexplored. In this paper, we introduce a scalable motion generation framework that includes the motion tokenizer Motion FSQ-VAE and a text-prefix autoregressive transformer. Through comprehensive experiments, we observe the scaling behavior of this system. For the first time, we confirm the existence of scaling laws within the context of motion generation. Specifically, our results demonstrate that the normalized test loss of our prefix autoregressive models adheres to a logarithmic law in relation to compute budgets. Furthermore, we also confirm the power law between Non-Vocabulary Parameters, Vocabulary Parameters, and Data Tokens with respect to compute budgets respectively. Leveraging the scaling law, we predict the optimal transformer size, vocabulary size, and data requirements for a compute budget of $1e18$. The test loss of the system, when trained with the optimal model size, vocabulary size, and required data, aligns precisely with the predicted test loss, thereby validating the scaling law.
△ Less
Submitted 19 December, 2024;
originally announced December 2024.
-
Measurement of the Branching Fraction for the Decay $χ_{cJ}\to p\bar{p}ηπ^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$,…
▽ More
Using $(2712.4\pm 14.3)\times10^6 ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we present the first observations of the decays $χ_{cJ}(J=0,1,2)\to p\bar{p}ηπ^{0}$. Their decay branching fractions are determined to be ${\cal B}(χ_{c0}\to p\bar{p}ηπ^{0})=({2.41 \pm 0.07 \pm 0.19}) \times 10^{-4}$, ${\cal B}(χ_{c1}\to p\bar{p}ηπ^{0})=({1.95 \pm 0.05 \pm 0.12}) \times 10^{-4}$, and ${\cal B}(χ_{c2}\to p\bar{p}ηπ^{0})=({1.31 \pm 0.05 \pm 0.08}) \times 10^{-4}$, where the first uncertainties are statistical and the second systematic.
△ Less
Submitted 18 December, 2024; v1 submitted 18 December, 2024;
originally announced December 2024.
-
Dynamic Adapter with Semantics Disentangling for Cross-lingual Cross-modal Retrieval
Authors:
Rui Cai,
Zhiyu Dong,
Jianfeng Dong,
Xun Wang
Abstract:
Existing cross-modal retrieval methods typically rely on large-scale vision-language pair data. This makes it challenging to efficiently develop a cross-modal retrieval model for under-resourced languages of interest. Therefore, Cross-lingual Cross-modal Retrieval (CCR), which aims to align vision and the low-resource language (the target language) without using any human-labeled target-language d…
▽ More
Existing cross-modal retrieval methods typically rely on large-scale vision-language pair data. This makes it challenging to efficiently develop a cross-modal retrieval model for under-resourced languages of interest. Therefore, Cross-lingual Cross-modal Retrieval (CCR), which aims to align vision and the low-resource language (the target language) without using any human-labeled target-language data, has gained increasing attention. As a general parameter-efficient way, a common solution is to utilize adapter modules to transfer the vision-language alignment ability of Vision-Language Pretraining (VLP) models from a source language to a target language. However, these adapters are usually static once learned, making it difficult to adapt to target-language captions with varied expressions. To alleviate it, we propose Dynamic Adapter with Semantics Disentangling (DASD), whose parameters are dynamically generated conditioned on the characteristics of the input captions. Considering that the semantics and expression styles of the input caption largely influence how to encode it, we propose a semantic disentangling module to extract the semantic-related and semantic-agnostic features from the input, ensuring that generated adapters are well-suited to the characteristics of input caption. Extensive experiments on two image-text datasets and one video-text dataset demonstrate the effectiveness of our model for cross-lingual cross-modal retrieval, as well as its good compatibility with various VLP models.
△ Less
Submitted 18 December, 2024;
originally announced December 2024.
-
Observation of the charmonium decay $η_c\toγγ$ in $J/ψ\toγη_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (658 additional authors not shown)
Abstract:
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation…
▽ More
Using $(2712.4\pm14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the decay $η_c\toγγ$ in $J/ψ\toγη_c$ is observed. We determine the product branching fraction $\mathcal{B}(J/ψ\toγη_c)\times\mathcal{B}(η_c\toγγ)=(5.23\pm0.26_{\rm{stat.}}\pm0.30_{\rm{syst.}})\times10^{-6}$. This result is consistent with the LQCD calculation $(5.34\pm0.16)\times10^{-6}$ from HPQCD in 2023. By using the world-average values of $\mathcal{B}(J/ψ\toγη_c)$ and the total decay width of $η_c$, the partial decay width $Γ(η_c\toγγ)$ is determined to be $(11.30\pm0.56_{\rm{stat.}}\pm0.66_{\rm{syst.}}\pm1.14_{\rm{ref.}})~\rm{keV}$, which deviates from the corresponding world-average value by $3.4σ$.
△ Less
Submitted 2 April, 2025; v1 submitted 17 December, 2024;
originally announced December 2024.
-
Defending LVLMs Against Vision Attacks through Partial-Perception Supervision
Authors:
Qi Zhou,
Tianlin Li,
Qing Guo,
Dongxia Wang,
Yun Lin,
Yang Liu,
Jin Song Dong
Abstract:
Recent studies have raised significant concerns regarding the vulnerability of Large Vision Language Models (LVLMs) to maliciously injected or perturbed input images, which can mislead their responses. Existing defense methods show that such vision attacks are sensitive to image modifications especially cropping, using majority voting across responses of modified images as corrected responses. How…
▽ More
Recent studies have raised significant concerns regarding the vulnerability of Large Vision Language Models (LVLMs) to maliciously injected or perturbed input images, which can mislead their responses. Existing defense methods show that such vision attacks are sensitive to image modifications especially cropping, using majority voting across responses of modified images as corrected responses. However, these modifications often result in partial images and distort the semantics, which reduces response quality on clean images after voting. Instead of directly using responses from partial images for voting, we investigate using them to supervise the LVLM's responses to the original images. We propose a black-box, training-free method called DPS (Defense through Partial-Perception Supervision). In this approach, the model is prompted using the responses generated by a model that perceives only a partial image. With DPS, the model can adjust its response based on partial image understanding when under attack, while confidently maintaining its original response for clean input. Our findings show that the weak model can supervise the strong model: when faced with an attacked input, the strong model becomes less confident and adjusts its response based on the weak model's partial understanding, effectively defending against the attack. With clean input, it confidently maintains its original response. Empirical experiments show our method outperforms the baseline, cutting the average attack success rate by 76.3% across six datasets on three popular models.
△ Less
Submitted 17 December, 2024;
originally announced December 2024.
-
Near-half-metallic state in the half Heusler PtMnSb film on a III-V substrate
Authors:
Shinichi Nishihaya,
Malcolm J. A. Jardine,
Hadass S. Inbar,
Aranya Goswami,
Jason T. Dong,
Aaron N. Engel,
Yu-Hao Chang,
Connor P. Dempsey,
Makoto Hashimoto,
Donghui Lu,
Noa Marom,
Chris J. Palmstrøm
Abstract:
The interplay between half-metallic ferromagnetism and spin-orbit coupling within the inversion symmetry-broken structure of half Heuslers provides an ideal platform for various spintronics functionalities. Taking advantage of good lattice matching, it is highly desired to epitaxially integrate promising Heuslers into III-V semiconductor-based devices. PtMnSb is one of the first half Heuslers pred…
▽ More
The interplay between half-metallic ferromagnetism and spin-orbit coupling within the inversion symmetry-broken structure of half Heuslers provides an ideal platform for various spintronics functionalities. Taking advantage of good lattice matching, it is highly desired to epitaxially integrate promising Heuslers into III-V semiconductor-based devices. PtMnSb is one of the first half Heuslers predicted to be an above-room-temperature half-metal with large spin orbit coupling, however, its half-metallicity and potential as a spintronics material has remained elusive due to lack of high quality samples. Here we demonstrate epitaxial growth of single crystal PtMnSb(001) film on GaSb(001) substrates using molecular beam epitaxy. Direct observation of the band structure via angle-resolved photoemission spectroscopy and many-body perturbation theory within the quasiparticle self-consistent GW approximation (QPGW) reveal that PtMnSb hosts rather a near-halfmetallic state with both spin bands crossing the Fermi level and with high spin polarization over 90%. Temperature dependence of magnetization also shows an anomalous enhancement below 60 K, which can be associated with the development of such a near-half-metallic state at low temperatures. Epitaxial growth of high crystalline PtMnSb on a III-V paves the way for systematic clarification of its spin transport properties with fine-tuning of strain in heterostructure devices.
△ Less
Submitted 16 December, 2024;
originally announced December 2024.
-
Image Gradient-Aided Photometric Stereo Network
Authors:
Kaixuan Wang,
Lin Qi,
Shiyu Qin,
Kai Luo,
Yakun Ju,
Xia Li,
Junyu Dong
Abstract:
Photometric stereo (PS) endeavors to ascertain surface normals using shading clues from photometric images under various illuminations. Recent deep learning-based PS methods often overlook the complexity of object surfaces. These neural network models, which exclusively rely on photometric images for training, often produce blurred results in high-frequency regions characterized by local discontin…
▽ More
Photometric stereo (PS) endeavors to ascertain surface normals using shading clues from photometric images under various illuminations. Recent deep learning-based PS methods often overlook the complexity of object surfaces. These neural network models, which exclusively rely on photometric images for training, often produce blurred results in high-frequency regions characterized by local discontinuities, such as wrinkles and edges with significant gradient changes. To address this, we propose the Image Gradient-Aided Photometric Stereo Network (IGA-PSN), a dual-branch framework extracting features from both photometric images and their gradients. Furthermore, we incorporate an hourglass regression network along with supervision to regularize normal regression. Experiments on DiLiGenT benchmarks show that IGA-PSN outperforms previous methods in surface normal estimation, achieving a mean angular error of 6.46 while preserving textures and geometric shapes in complex regions.
△ Less
Submitted 16 December, 2024;
originally announced December 2024.
-
Amplitude analysis and branching fraction measurement of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (651 additional authors not shown)
Abstract:
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of…
▽ More
An amplitude analysis of the Cabibbo-favored decay $D^+ \to K^-π^+π^+π^0$ is performed, using 7.93 $\rm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV. The branching fractions of the intermediate processes are measured, with the dominant contribution $D^+ \to \bar{K}^{*}(892)^0ρ(770)^+$ observed to have a branching fraction of $(4.15\pm0.07_{\rm stat.}\pm0.17_{\rm syst.})\%$. With the detection efficiency derived from the amplitude analysis, the absolute branching fraction of $D^+ \to K^-π^+π^+π^0$ is measured to be $(6.06\pm0.04_{\rm stat.}\pm0.07_{\rm syst.})\%$.
△ Less
Submitted 14 December, 2024;
originally announced December 2024.