
Repeated singular values of a random symmetric matrix and decoupled singular value estimates

Authors: Yi Han

Abstract: Let $A_n$ be a random symmetric matrix with Bernoulli $\{\pm 1\}$ entries. For any $κ>0$ and two real numbers $λ_1,λ_2$ with a separation $|λ_1-λ_2|\geq κn^{1/2}$ and both lying in the bulk $[-(2-κ)n^{1/2},(2-κ)n^{1/2}]$, we prove a joint singular value estimate $$ \mathbb{P}(σ_{min}(A_n-λ_i I_n)\leqεn^{-1/2};i=1,2)\leq Cε^2+2e^{-cn}. $$ For general subgaussian distribution and a mesoscopic separation $|λ_1-λ_2|\geq κn^{-1/2+σ},σ>0$ we prove the same estimate with $e^{-cn}$ replaced by an exponential type error. This means that extreme behaviors of the least singular value at two locations can essentially be decoupled all the way down to the exponential scale when the two locations are separated. As a corollary, we prove that all the singular values of $A_n$ in $[κn^{1/2},(2-κ)n^{1/2}]$ are distinct with probability $1-e^{-cn}$, and with high probability the minimal gap between these singular values has order at least $n^{-3/2}$. This justifies, in a strong quantitative form, a conjecture of Vu up to $(1-κ)$-fraction of the spectrum for any $κ>0$.

Submitted 22 April, 2025; originally announced April 2025.

Comments: 76 pages. This paper replaces 2405.04999 with strengthened results and several corrections

arXiv:2504.15881 [pdf, other]

Measurement of the time-integrated $CP$ asymmetry in $D^0 \to K^0_{\rm S} K^0_{\rm S}$ decays using opposite-side flavor tagging at Belle and Belle II

Authors: Belle and Belle II Collaborations: I. Adachi, Y. Ahn, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee, et al. (356 additional authors not shown)

Abstract: We measure the time-integrated $CP$ asymmetry in $D^0 \to K^0_{\rm S} K^0_{\rm S}$ decays reconstructed in $e^+e^-\to c{\overline c}$ events collected by the Belle and Belle II experiments. The corresponding data samples have integrated luminosities of 980 and 428 fb${}^{-1}$, respectively. To infer the flavor of the $D^0$ meson, we exploit the correlation between the flavor of the reconstructed decay and the electric charges of particles reconstructed in the rest of the $e^+e^-\to c{\overline c}$ event. This results in a sample which is independent from any other previously used at Belle or Belle II. The result, $A_{CP}(D^0 \to K^0_{\rm S} K^0_{\rm S}) = (1.3 \pm 2.0 \pm 0.2)\%$, where the first uncertainty is statistical and the second systematic, is consistent with previous determinations and with $CP$ symmetry.

Submitted 22 April, 2025; originally announced April 2025.

Report number: Belle II Preprint 2025-006, KEK Preprint 2025-4

arXiv:2504.15745 [pdf, other]

Search for lepton-flavor-violating $τ^- \to \ell^- K_s^0$ decays at Belle and Belle II

Authors: Belle and Belle II Collaborations: I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, N. K. Baghel, S. Bahinipati, et al. (397 additional authors not shown)

Abstract: We present the results of a search for charged-lepton-flavor violating decays $τ^{-} \rightarrow \ell^{-}K_{S}^{0}$, where $\ell^{-}$ is either an electron or a muon. We combine $e^+e^-$ data samples recorded by the Belle II experiment at the SuperKEKB collider (428 fb$^{-1}$) with samples recorded by the Belle experiment at the KEKB collider (980 fb$^{-1}$) to obtain a sample of 1.3 billion $e^+e^-\toτ^+τ^-$ events. We observe 0 and 1 events and set $90\%$ confidence level upper limits of $0.8 \times 10^{-8}$ and $1.2 \times 10^{-8}$ on the branching fractions of the decay modes $τ^{-} \rightarrow e^{-}K_{S}^{0}$ and $τ^{-} \rightarrow μ^{-}K_{S}^{0}$, respectively. These are the most stringent upper limits to date.

Submitted 22 April, 2025; originally announced April 2025.

arXiv:2504.14254 [pdf, other]

Visual Consensus Prompting for Co-Salient Object Detection

Authors: Jie Wang, Nana Yu, Zihao Zhang, Yahong Han

Abstract: Existing co-salient object detection (CoSOD) methods generally employ a three-stage architecture (i.e., encoding, consensus extraction & dispersion, and prediction) along with a typical full fine-tuning paradigm. Although they yield certain benefits, they exhibit two notable limitations: 1) This architecture relies on encoded features to facilitate consensus extraction, but the meticulously extracted consensus does not provide timely guidance to the encoding stage. 2) This paradigm involves globally updating all parameters of the model, which is parameter-inefficient and hinders the effective representation of knowledge within the foundation model for this task. Therefore, in this paper, we propose an interaction-effective and parameter-efficient concise architecture for the CoSOD task that addresses these two limitations. It introduces, for the first time, a parameter-efficient prompt tuning paradigm and seamlessly embeds consensus into the prompts to formulate task-specific Visual Consensus Prompts (VCP). Our VCP aims to induce the frozen foundation model to perform better on CoSOD tasks by formulating task-specific visual consensus prompts with minimized tunable parameters. Concretely, the primary insight of the purposeful Consensus Prompt Generator (CPG) is to enforce limited tunable parameters to focus on co-salient representations and generate consensus prompts. The formulated Consensus Prompt Disperser (CPD) leverages consensus prompts to form task-specific visual consensus prompts, thereby arousing the powerful potential of pre-trained models in addressing CoSOD tasks. Extensive experiments demonstrate that our concise VCP outperforms 13 cutting-edge full fine-tuning models, achieving the new state of the art (with 6.8% improvement in F_m metrics on the most challenging CoCA dataset). Source code is available at https://github.com/WJ-CV/VCP.

Submitted 19 April, 2025; originally announced April 2025.

Comments: CVPR 2025

arXiv:2504.12074 [pdf, ps, other]

Learning from the Past: Adaptive Parallelism Tuning for Stream Processing Systems

Authors: Yuxing Han, Lixiang Chen, Haoyu Wang, Zhanghao Chen, Yifan Zhang, Chengcheng Yang, Kongzhang Hao, Zhengyi Yang

Abstract: Distributed stream processing systems rely on the dataflow model to define and execute streaming jobs, organizing computations as Directed Acyclic Graphs (DAGs) of operators. Adjusting the parallelism of these operators is crucial to handling fluctuating workloads efficiently while balancing resource usage and processing performance. However, existing methods often fail to effectively utilize execution histories or fully exploit DAG structures, limiting their ability to identify bottlenecks and determine the optimal parallelism. In this paper, we propose StreamTune, a novel approach for adaptive parallelism tuning in stream processing systems. StreamTune incorporates a pre-training and fine-tuning framework that leverages global knowledge from historical execution data for job-specific parallelism tuning. In the pre-training phase, StreamTune clusters the historical data with Graph Edit Distance and pre-trains a Graph Neural Network-based encoder per cluster to capture the correlation between the operator parallelism, DAG structures, and the identified operator-level bottlenecks. In the online tuning phase, StreamTune iteratively refines operator parallelism recommendations using an operator-level bottleneck prediction model enforced with a monotonic constraint, which aligns with the observed system performance behavior. Evaluation results demonstrate that StreamTune reduces reconfigurations by up to 29.6% and parallelism degrees by up to 30.8% in Apache Flink under a synthetic workload. In Timely Dataflow, StreamTune achieves up to an 83.3% reduction in parallelism degrees while maintaining comparable processing performance under the Nexmark benchmark, when compared to the state-of-the-art methods.

Submitted 7 July, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

arXiv:2504.11949 [pdf, other]

Flow Intelligence: Robust Feature Matching via Temporal Signature Correlation

Authors: Jie Wang, Chen Ye Gan, Caoqi Wei, Jiangtao Wen, Yuxing Han

Abstract: Feature matching across video streams remains a cornerstone challenge in computer vision. Increasingly, robust multimodal matching has garnered interest in robotics, surveillance, remote sensing, and medical imaging. While traditional methods rely on detecting and matching spatial features, they break down when faced with noisy, misaligned, or cross-modal data. Recent deep learning methods have improved robustness through learned representations, but remain constrained by their dependence on extensive training data and computational demands. We present Flow Intelligence, a paradigm-shifting approach that moves beyond spatial features by focusing on temporal motion patterns exclusively. Instead of detecting traditional keypoints, our method extracts motion signatures from pixel blocks across consecutive frames and extracts temporal motion signatures between videos. These motion-based descriptors achieve natural invariance to translation, rotation, and scale variations while remaining robust across different imaging modalities. This novel approach also requires no pretraining data, eliminates the need for spatial feature detection, enables cross-modal matching using only temporal motion, and outperforms existing methods in challenging scenarios where traditional approaches fail. By leveraging motion rather than appearance, Flow Intelligence enables robust, real-time video feature matching in diverse environments.

Submitted 16 April, 2025; originally announced April 2025.

arXiv:2504.11756 [pdf, ps, other]

AQETuner: Reliable Query-level Configuration Tuning for Analytical Query Engines

Authors: Lixiang Chen, Yuxing Han, Yu Chen, Xing Chen, Chengcheng Yang, Weining Qian

Abstract: Modern analytical query engines (AQEs) are essential for large-scale data analysis and processing. These systems usually provide numerous query-level tunable knobs that significantly affect individual query performance. While several studies have explored automatic DBMS configuration tuning, they have several limitations in handling query-level tuning. Firstly, they fail to capture how knobs influence query plans, which directly affect query performance. Secondly, they overlook query failures during the tuning process, resulting in low tuning efficiency. Thirdly, they struggle with cold-start problems for new queries, leading to prolonged tuning time. To address these challenges, we propose AQETuner, a novel Bayesian Optimization-based system tailored for reliable query-level knob tuning in AQEs. AQETuner first applies attention mechanisms to jointly encode the knobs and the query plan, effectively identifying the impact of knobs on plan nodes. Then, AQETuner employs a dual-task Neural Process to predict both query performance and failures, leveraging their interactions to guide the tuning process. Furthermore, AQETuner utilizes Particle Swarm Optimization to efficiently generate high-quality samples in parallel during the initial tuning stage for new queries. Experimental results show that AQETuner significantly outperforms existing methods, reducing query latency by up to 23.7% and query failures by up to 51.2%.

Submitted 20 June, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

arXiv:2504.11372 [pdf, other]

A Review of Stop-and-Go Traffic Wave Suppression Strategies: Variable Speed Limit vs. Jam-Absorption Driving

Authors: Zhengbing He, Jorge Laval, Yu Han, Andreas Hegyi, Ryosuke Nishi, Cathy Wu

Abstract: The main form of freeway traffic congestion is the familiar stop-and-go wave, characterized by wide moving jams that propagate indefinitely upstream given sufficient traffic demand. They cause severe, long-lasting adverse effects, such as reduced traffic efficiency, increased driving risks, and higher vehicle emissions. This underscores the crucial importance of artificial intervention in the propagation of stop-and-go waves. Over the past two decades, two prominent strategies for stop-and-go wave suppression have emerged: variable speed limit (VSL) and jam-absorption driving (JAD). Although they share similar research motivations, objectives, and theoretical foundations, the development of these strategies has remained relatively disconnected. To synthesize fragmented advances and drive the field forward, this paper first provides a comprehensive review of the achievements in stop-and-go wave suppression-oriented VSL and JAD, respectively. It then focuses on bridging the two areas and identifying research opportunities from the following perspectives: fundamental diagrams, secondary waves, generalizability, traffic state estimation and prediction, robustness to randomness, scenarios for strategy validation, and field tests and practical deployment. We expect that through this review, one area can effectively address its limitations by identifying and leveraging the strengths of the other, thus promoting the overall research goal of freeway stop-and-go wave suppression.

Submitted 20 May, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

arXiv:2504.11220 [pdf, other]

Test of lepton flavor universality with measurements of $R(D^{+})$ and $R(D^{*+})$ using semileptonic $B$ tagging at the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, et al. (428 additional authors not shown)

Abstract: We report measurements of the ratios of branching fractions $\mathcal{R}(D^{(*)+}) = \mathcal{B}(\overline{B}{}^0 \to D^{(*)+} \,τ^- \, \overlineν_τ) / \mathcal{B}(\overline{B}{}^0 \to D^{(*)+} \, \ell^- \, \overlineν_\ell)$, where $\ell$ denotes either an electron or a muon. These ratios test the universality of the charged-current weak interaction. The results are based on a $365\, \mathrm{fb}^{-1}$ data sample collected with the Belle II detector at the SuperKEKB $e^+e^-$ collider, which operates at a center-of-mass energy corresponding to the $Υ(4S)$ resonance, just above the threshold for $B\overline{B}{}$ production. Signal candidates are reconstructed by selecting events in which the companion $B$ meson from the $Υ(4S) \to B\overline{B}{}$ decay is identified in semileptonic modes. The $τ$ lepton is reconstructed via its leptonic decays. We obtain $\mathcal{R}(D^+) = 0.418 \pm 0.074 ~({\mathrm{stat}}) \pm 0.051 ~({\mathrm{syst}})$ and $\mathcal{R}(D^{*+}) = 0.306 \pm 0.034 ~({\mathrm{stat}}) \pm 0.018 ~({\mathrm{syst}})$, which are consistent with world average values. Accounting for the correlation between them, these values differ from the Standard Model expectation by a collective significance of $1.7$ standard deviations.

Submitted 15 April, 2025; originally announced April 2025.

Report number: Belle II Preprint 2025-011, KEK Preprint 2025-9

arXiv:2504.10970 [pdf, ps, other]

Mountain pass solution to the Brézis-Nirenberg problem with logarithmic perturbation

Authors: Q. Zhang, Y. Z. Han

Abstract: In this paper we give a positive answer to the conjecture raised by Hajaiej et al. (J. Geom. Anal., 2024, 34(6): No. 182, 44 pp) on the existence of a mountain pass solution at positive energy level to the Brézis-Nirenberg problem with logarithmic perturbation. To be a little more precise, by taking full advantage of the local minimum solution and some very delicate estimates on the logarithmic term and the critical term, we prove that the following problem \begin{eqnarray*} \begin{cases} -Δu= λu+μ|u|^2u+θu\log u^2, &x\inΩ,\\ u=0, &x\in\partialΩ\end{cases} \end{eqnarray*} possesses a positive mountain pass solution at positive energy level, where $Ω\subset \mathbb{R}^4$ is a bounded domain with smooth boundary $\partialΩ$, $λ\in \mathbb{R}$, $μ>0$ and $θ<0$. A key step in the proof is to control the mountain pass level around the local minimum solution from above by a proper constant to ensure the local compactness. Moreover, this result is also extended to three-dimensional and five-dimensional cases.

Submitted 15 April, 2025; originally announced April 2025.

arXiv:2504.10042 [pdf, other]

Search for $B^0 \to K^{\ast 0} τ^+ τ^-$ decays at the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, M. Alhakami, A. Aloisio, N. Althubiti, M. Angelsmark, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, et al. (424 additional authors not shown)

Abstract: We present a search for the rare flavor-changing neutral-current decay $B^0 \to K^{\ast 0} τ^+ τ^-$ with data collected by the Belle II experiment at the SuperKEKB electron-positron collider. The analysis uses a 365 fb$^{-1}$ data sample recorded at the center-of-mass energy of the $Υ(4S)$ resonance. One of the $B$ mesons produced in the $Υ(4S)\to B^0 \bar{B}^0$ process is fully reconstructed in a hadronic decay mode, while its companion $B$ meson is required to decay into a $K^{\ast 0}$ and two $τ$ leptons of opposite charge. The $τ$ leptons are reconstructed in final states with a single electron, muon, charged pion or charged $ρ$ meson, and additional neutrinos. We set an upper limit on the branching ratio of $BR(B^0 \to K^{\ast 0} τ^+ τ^-) < 1.8 \times 10^{-3}$ at the 90% confidence level, which is the most stringent constraint reported to date.

Submitted 14 April, 2025; originally announced April 2025.

Report number: Belle II Preprint 2025-010; KEK Preprint 2025-8

arXiv:2504.09868 [pdf, other]

NeRF-Based Transparent Object Grasping Enhanced by Shape Priors

Authors: Yi Han, Zixin Lin, Dongjie Li, Lvping Chen, Yongliang Shi, Gan Ma

Abstract: Transparent object grasping remains a persistent challenge in robotics, largely due to the difficulty of acquiring precise 3D information. Conventional optical 3D sensors struggle to capture transparent objects, and machine learning methods are often hindered by their reliance on high-quality datasets. Leveraging NeRF's capability for continuous spatial opacity modeling, our proposed architecture integrates a NeRF-based approach for reconstructing the 3D information of transparent objects. Despite this, certain portions of the reconstructed 3D information may remain incomplete. To address these deficiencies, we introduce a shape-prior-driven completion mechanism, further refined by a geometric pose estimation method we have developed. This allows us to obtain complete and reliable 3D information of transparent objects. Utilizing this refined data, we perform scene-level grasp prediction and deploy the results in real-world robotic systems. Experimental validation demonstrates the efficacy of our architecture, showcasing its capability to reliably capture 3D information of various transparent objects in cluttered scenes and, correspondingly, to achieve high-quality, stable, and executable grasp predictions.

Submitted 14 April, 2025; originally announced April 2025.

arXiv:2504.09531 [pdf, ps, other]

Voltage and power-frequency electric field measurements with Rydberg-atom interferometry

Authors: Yingying Han, Changfa He, Zhenxiong Weng, Peng Xu, Yanting Zhao, Tao Wang

Abstract: We present a Rydberg-atom interferometry-based technique for voltage measurement between electrodes embedded in an atomic vapor cell, enabling the detection of weak voltages ($<0.1$V) and unambiguous discrimination between positive and negative polarities. This makes up for the shortcomings of measurements based on the Stark effect, which suffer from quadratic field dependence (limiting sensitivity in weak-field regimes) and are incapable of distinguishing the electric field direction. Furthermore, this method extends naturally to power-frequency (PF) electric field measurements by exploiting the quasi-static approximation, which is valid because the PF field's characteristic timescale ($\sim10^{-2}$s) vastly exceeds the interferometric measurement duration ($\sim10^{-6}$s). Crucially, our protocol provides instantaneous PF field reconstruction, yielding comprehensive information including amplitude, frequency and phase. These advancements have direct implications for traceable voltage measurements and non-invasive characterization of PF fields near high-voltage infrastructure.

Submitted 13 April, 2025; originally announced April 2025.

Comments: 5 pages, 4 figures

arXiv:2504.08740 [pdf]

Recommendation System in Advertising and Streaming Media: Unsupervised Data Enhancement Sequence Suggestions

Authors: Kowei Shih, Yi Han, Li Tan

Abstract: Sequential recommendation is an extensively explored approach to capturing users' evolving preferences based on past interactions, aimed at predicting their next likely choice. Despite significant advancements in this domain, including methods based on RNNs and self-attention, challenges like limited supervised signals and noisy data caused by unintentional clicks persist. To address these challenges, some studies have incorporated unsupervised learning by leveraging local item contexts within individual sequences. However, these methods often overlook the intricate associations between items across multiple sequences and are susceptible to noise in item co-occurrence patterns. In this context, we introduce a novel framework, Global Unsupervised Data-Augmentation (UDA4SR), which adopts a graph contrastive learning perspective to generate more robust item embeddings for sequential recommendation. Our approach begins by integrating Generative Adversarial Networks (GANs) for data augmentation, which serves as the first step to enhance the diversity and richness of the training data. Then, we build a Global Item Relationship Graph (GIG) based on all user interaction sequences. Subsequently, we employ graph contrastive learning on the refined graph to enhance item embeddings by capturing complex global associations. To model users' dynamic and diverse interests more effectively, we enhance the CapsNet module with a novel target-attention mechanism. Extensive experiments show that UDA4SR significantly outperforms state-of-the-art approaches.

Submitted 23 March, 2025; originally announced April 2025.

arXiv:2504.08558 [pdf]

Localized plasmonic meron-antimeron pairs in doubly degenerate orbitals

Authors: Jie Yang, Xinmin Fu, Jiafu Wang, Yifan Li, Jingxian Zhang, Fangyuan Qi, Yajuan Han, Yuxiang Jia, Guy A E Vandenbosch, Tie Jun Cui, Xuezhi Zheng

Abstract: Topological defects are pivotal in elucidating kaleidoscopic topological phenomena in different physical systems. Meron-antimeron pairs are a type of topological defect first found as soliton solutions to SU(2) Yang-Mills equations in gauge theory, and then identified in condensed matter physics as a type of magnetic quasiparticle created in the context of topological charge conservation. Here, we show that isolated meron-antimeron pairs constitute a new form of optical topological quasiparticles that naturally emerge in doubly degenerate orbitals of plasmonic systems, including fundamental and higher-order ones, and their target-type counterparts. We demonstrate that their topological charges are strictly imposed by orbital indices from the doubly degenerate irreducible representations (irreps) of groups consisting of rotational symmetries, and thus are upper-bounded by the orbital indices imposed by group theory. In addition, we find that there exist highly-localized isolated (anti)merons in plasmonic spin textures, which were previously observed mostly in the form of lattices or clusters. We further demonstrate a locking effect between the chirality of the (anti)merons and the parity of the irreps. Then, the topological origins of the revealed topological quasiparticles, i.e., phase, V-point and L-line singularities in plasmonic fields, are investigated. Finally, a complete symmetry classification of the topological quasiparticles is provided. Generalizing the meron-antimeron pairs to photonic systems provides various possibilities for applications in optical vectorial imaging, deep-subwavelength sensing and metrology.

Submitted 11 April, 2025; originally announced April 2025.

arXiv:2504.07348 [pdf, other]

doi 10.1126/sciadv.adu5264

A millisecond integrated quantum memory for photonic qubits

Authors: Yu-Ping Liu, Zhong-Wen Ou, Tian-Xiang Zhu, Ming-Xu Su, Chao Liu, Yong-Jian Han, Zong-Quan Zhou, Chuan-Feng Li, Guang-Can Guo

Abstract: Quantum memories for light are essential building blocks for quantum repeaters and quantum networks. Integrated operations of quantum memories could enable scalable application with low-power consumption. However, the photonic quantum storage lifetime in integrated optical waveguide has so far been limited to tens of microseconds, falling short of the requirements for practical applications. Here, we demonstrate quantum storage of photonic qubits for 1.021 ms based on a laser-written optical waveguide fabricated in a 151Eu3+:Y2SiO5 crystal. Spin dephasing of 151Eu3+ is mitigated through dynamical decoupling applied via on-chip electric waveguides and we obtain a storage efficiency of 12.0(0.5)% at 1.021 ms, which is a demonstration of integrated quantum memories that outperforms the efficiency of a simple fiber delay line. Such long-lived waveguide-based quantum memory could support applications in quantum repeaters, and further combination with critical magnetic fields could enable potential application as transportable quantum memories.

Submitted 9 April, 2025; originally announced April 2025.

Journal ref: Science Advances 11.13.eadu5264 (2025)

arXiv:2504.06803 [pdf, other]

DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation

Authors: Wangbo Zhao, Yizeng Han, Jiasheng Tang, Kai Wang, Hao Luo, Yibing Song, Gao Huang, Fan Wang, Yang You

Abstract: Diffusion Transformer (DiT), an emerging diffusion model for visual generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs primarily stem from the \emph{static} inference paradigm, which inevitably introduces redundant computation in certain \emph{diffusion timesteps} and \emph{spatial regions}. To overcome this inefficiency, we propose \textbf{Dy}namic \textbf{Di}ffusion \textbf{T}ransformer (DyDiT), an architecture that \emph{dynamically} adjusts its computation along both \emph{timestep} and \emph{spatial} dimensions. Specifically, we introduce a \emph{Timestep-wise Dynamic Width} (TDW) approach that adapts model width conditioned on the generation timesteps. In addition, we design a \emph{Spatial-wise Dynamic Token} (SDT) strategy to avoid redundant computation at unnecessary spatial locations. TDW and SDT can be seamlessly integrated into DiT and significantly accelerate the generation process. Building on these designs, we further enhance DyDiT in three key aspects. First, DyDiT is integrated seamlessly with flow matching-based generation, enhancing its versatility. Furthermore, we enhance DyDiT to tackle more complex visual generation tasks, including video generation and text-to-image generation, thereby broadening its real-world applications. Finally, to address the high cost of full fine-tuning and democratize technology access, we investigate the feasibility of training DyDiT in a parameter-efficient manner and introduce timestep-based dynamic LoRA (TD-LoRA). Extensive experiments on diverse visual generation models, including DiT, SiT, Latte, and FLUX, demonstrate the effectiveness of DyDiT.

Submitted 16 April, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

Comments: Extended journal version for ICLR. arXiv admin note: substantial text overlap with arXiv:2410.03456

arXiv:2504.03281 [pdf, other]

Convergence and consensus analysis of a class of best-response opinion dynamics

Authors: Yuchen Xu, Yi Han, Chuanzhe Zhang, Miao Wang, Wenjun Mei

Abstract: Opinion dynamics aims to understand how individuals' opinions evolve through local interactions. Recently, opinion dynamics have been modeled as network games, where individuals update their opinions in order to minimize the social pressure caused by disagreeing with others. In this paper, we study a class of best response opinion dynamics introduced by Mei et al., where a parameter $α> 0$ controls the marginal cost of opinion differences, bridging well-known mechanisms such as the DeGroot model ($α= 2$) and the weighted-median model ($α= 1$). We conduct theoretical analysis on how different values of $α$ affect the system's convergence and consensus behavior. For the case when $α> 1$, corresponding to increasing marginal costs, we establish the convergence of the dynamics and derive graph-theoretic conditions for consensus formation, which are proved to be similar to those in the DeGroot model. When $α< 1$, we show via a counterexample that convergence is not always guaranteed, and we provide sufficient conditions for convergence and consensus. Additionally, numerical simulations on small-world networks reveal how network structure and $α$ together affect opinion diversity.

Submitted 4 April, 2025; originally announced April 2025.

arXiv:2504.02955 [pdf, other]

Azimuthal anisotropy of direct photons in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, M. Alfred, S. Antsupov, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, E. Bannikov, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov, et al. (301 additional authors not shown)

Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider measured the second Fourier component $v_2$ of the direct-photon azimuthal anisotropy at midrapidity in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. The results are presented in 10\% wide bins of collision centrality and cover the transverse-momentum range of $1<p_T<20$ GeV/$c$, and are in quantitative agreement with findings published earlier, but provide better granularity and higher $p_T$ reach. Above a $p_T$ of 8--10 GeV/$c$, where hard scattering dominates the direct-photon production, $v_2$ is consistent with zero. Below that in each centrality bin $v_2$ as a function of $p_T$ is comparable to the $π^0$ anisotropy albeit with a tendency of being somewhat smaller. The results are compared to recent theory calculations that include, in addition to thermal radiation from the quark-gluon plasma and hadron gas, sources of photons from pre-equilibrium, strong magnetic fields, or radiative hadronization. While the newer theoretical calculations describe the data better than previous models, none of them alone can fully explain the results, particularly in the region of $p_T=4$--8 GeV/$c$.

Submitted 3 April, 2025; originally announced April 2025.

Comments: 325 authors from 71 institutions, 12 pages, 9 figures, 2 tables. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

arXiv:2504.02137 [pdf, other]

Enhancing Embedding Representation Stability in Recommendation Systems with Semantic ID

Authors: Carolina Zheng, Minhui Huang, Dmitrii Pedchenko, Kaushik Rangadurai, Siyu Wang, Gaby Nahum, Jie Lei, Yang Yang, Tao Liu, Zutian Luo, Xiaohan Wei, Dinesh Ramasamy, Jiyan Yang, Yiping Han, Lin Yang, Hangjun Xu, Rong Jin, Shuang Yang

Abstract: The exponential growth of online content has posed significant challenges to ID-based models in industrial recommendation systems, ranging from extremely high cardinality and dynamically growing ID space, to highly skewed engagement distributions, to prediction instability as a result of natural ID life cycles (e.g., the birth of new IDs and retirement of old IDs). To address these issues, many systems rely on random hashing to handle the ID space and control the corresponding model parameters (i.e., the embedding table). However, this approach introduces data pollution from multiple IDs sharing the same embedding, leading to degraded model performance and embedding representation instability. This paper examines these challenges and introduces Semantic ID prefix ngram, a novel token parameterization technique that significantly improves the performance of the original Semantic ID. Semantic ID prefix ngram creates semantically meaningful collisions by hierarchically clustering items based on their content embeddings, as opposed to random assignments. Through extensive experimentation, we demonstrate that Semantic ID prefix ngram not only addresses embedding instability but also significantly improves tail ID modeling, reduces overfitting, and mitigates representation shifts. We further highlight the advantages of Semantic ID prefix ngram in attention-based models that contextualize user histories, showing substantial performance improvements. We also report our experience of integrating Semantic ID into Meta production Ads Ranking system, leading to notable performance gains and enhanced prediction stability in live deployments.

Submitted 2 April, 2025; originally announced April 2025.

arXiv:2504.00759 [pdf, other]

MSSFC-Net: Enhancing Building Interpretation with Multi-Scale Spatial-Spectral Feature Collaboration

Authors: Dehua Huo, Weida Zhan, Jinxin Guo, Depeng Zhu, Yu Chen, YiChun Jiang, Yueyi Han, Deng Han, Jin Li

Abstract: Building interpretation from remote sensing imagery primarily involves two fundamental tasks: building extraction and change detection. However, most existing methods address these tasks independently, overlooking their inherent correlation and failing to exploit shared feature representations for mutual enhancement. Furthermore, the diverse spectral, spatial, and scale characteristics of buildings pose additional challenges in jointly modeling spatial-spectral multi-scale features and effectively balancing precision and recall. The limited synergy between spatial and spectral representations often results in reduced detection accuracy and incomplete change localization. To address these challenges, we propose a Multi-Scale Spatial-Spectral Feature Cooperative Dual-Task Network (MSSFC-Net) for joint building extraction and change detection in remote sensing images. The framework integrates both tasks within a unified architecture, leveraging their complementary nature to simultaneously extract building and change features. Specifically, a Dual-branch Multi-scale Feature Extraction module (DMFE) with Spatial-Spectral Feature Collaboration (SSFC) is designed to enhance multi-scale representation learning, effectively capturing shallow texture details and deep semantic information, thus improving building extraction performance. For temporal feature aggregation, we introduce a Multi-scale Differential Fusion Module (MDFM) that explicitly models the interaction between differential and dual-temporal features. This module refines the network's capability to detect large-area changes and subtle structural variations in buildings. Extensive experiments conducted on three benchmark datasets demonstrate that MSSFC-Net achieves superior performance in both building extraction and change detection tasks, effectively improving detection accuracy while maintaining completeness.

Submitted 1 April, 2025; originally announced April 2025.

arXiv:2503.24073 [pdf, other]

doi 10.1103/PhysRevB.111.165106

Krylov complexity in quantum many-body scars of spin-1 models

Authors: Qingmin Hu, Wen-Yi Zhang, Yunguang Han, Wen-Long You

Abstract: Weak ergodicity breaking, particularly through quantum many-body scars (QMBS), has become a significant focus in many-body physics. Krylov state complexity quantifies the spread of quantum states within the Krylov basis and serves as a powerful diagnostic for analyzing nonergodic dynamics. In this work, we study spin-one XXZ magnets and reveal nonergodic behavior tied to QMBS. For the XY model, the nematic Néel state exhibits periodic revivals in Krylov complexity. In the generic XXZ model, we identify spin helix states as weakly ergodicity-breaking states, characterized by low entanglement and nonthermal dynamics. Across different scenarios, the Lanczos coefficients for scarred states display an elliptical pattern, reflecting a hidden SU(2) algebra that enables analytical results for Krylov complexity and fidelity. These findings, which exemplify the rare capability to characterize QMBS analytically, are feasible with current experimental techniques and offer deep insights into the nonergodic dynamics of interacting quantum systems.

Submitted 31 March, 2025; originally announced March 2025.

Comments: 9 pages, 6 figures

Journal ref: Phys. Rev. B 111, 165106 (2025)

arXiv:2503.23724 [pdf, other]

Colossal enhancement of spin transmission through magnon confinement in an antiferromagnet

Authors: Sajid Husain, Maya Ramesh, Xinyan Li, Sergei Prokhorenko, Shashank Kumar Ojha, Aiden Ross, Koushik Das, Boyang Zhao, Hyeon Woo Park, Peter Meisenheimer, Yousra Nahas, Lucas Caretta, Lane W. Martin, Se Kwon Kim, Zhi Yao, Haidan Wen, Sayeef Salahuddin, Long-Qing Chen, Yimo Han, Rogerio de Sousa, Laurent Bellaiche, Manuel Bibes, Darrell G. Schlom, Ramamoorthy Ramesh

Abstract: Since Felix Bloch's introduction of the concept of spin waves in 1930, magnons (the quanta of spin waves) have been extensively studied in a range of materials for spintronics, particularly for non-volatile logic-in-memory devices. Controlling magnons in conventional antiferromagnets and harnessing them in practical applications, however, remains a challenge. In this letter, we demonstrate highly efficient magnon transport in an LaFeO$_3$/BiFeO$_3$/LaFeO$_3$ all-antiferromagnetic system which can be controlled electrically, making it highly desirable for energy-efficient computation. Leveraging spin-orbit-driven spin-charge transduction, we demonstrate that this material architecture permits magnon confinement in ultrathin antiferromagnets, enhancing the output voltage generated by magnon transport by several orders of magnitude, which provides a pathway to enable magnetoelectric memory and logic functionalities. Additionally, its non-volatility enables ultralow-power logic-in-memory processing, where magnonic devices can be efficiently reconfigured via electrically controlled magnon spin currents within magnetoelectric channels.

Submitted 31 March, 2025; originally announced March 2025.

Comments: 12 pages, 4 figures

arXiv:2503.23024 [pdf, other]

Empowering Large Language Models with 3D Situation Awareness

Authors: Zhihao Yuan, Yibo Peng, Jinke Ren, Yinghong Liao, Yatong Han, Chun-Mei Feng, Hengshuang Zhao, Guanbin Li, Shuguang Cui, Zhen Li

Abstract: Driven by the great success of Large Language Models (LLMs) in the 2D image domain, their applications in 3D scene understanding have emerged as a new trend. A key difference between 3D and 2D is that the situation of an egocentric observer in 3D scenes can change, resulting in different descriptions (e.g., "left" or "right"). However, current LLM-based methods overlook the egocentric perspective and simply use datasets from a global viewpoint. To address this issue, we propose a novel approach to automatically generate a situation-aware dataset by leveraging the scanning trajectory during data collection and utilizing Vision-Language Models (VLMs) to produce high-quality captions and question-answer pairs. Furthermore, we introduce a situation grounding module to explicitly predict the position and orientation of the observer's viewpoint, thereby enabling LLMs to ground situation descriptions in 3D scenes. We evaluate our approach on several benchmarks, demonstrating that our method effectively enhances the 3D situational awareness of LLMs while significantly expanding existing datasets and reducing manual effort.

Submitted 29 March, 2025; originally announced March 2025.

Comments: Accepted by CVPR 2025

arXiv:2503.22789 [pdf, other]

Entropic Order

Authors: Yiqiu Han, Xiaoyang Huang, Zohar Komargodski, Andrew Lucas, Fedor K. Popov

Abstract: Ordered phases of matter, such as solids, ferromagnets, superfluids, or quantum topological order, typically only exist at low temperatures. Despite this conventional wisdom, we present explicit local models in which all such phases persist to arbitrarily high temperature. This is possible since order in one degree of freedom can enable other degrees of freedom to strongly fluctuate, leading to "entropic order", whereby typical high energy states are ordered. Our construction, which utilizes interacting bosons, avoids existing no-go theorems on long-range order or entanglement at high temperature. We propose a simple model for high-temperature superconductivity using these general principles.

Submitted 28 March, 2025; originally announced March 2025.

Comments: 5+16 pages; 1+1 figures

arXiv:2503.22486 [pdf, other]

Movable Antenna Enhanced Downlink Multi-User Integrated Sensing and Communication System

Authors: Yanze Han, Min Li, Xingyu Zhao, Ming-Min Zhao, Min-Jian Zhao

Abstract: This work investigates the potential of exploiting movable antennas (MAs) to enhance the performance of a multi-user downlink integrated sensing and communication (ISAC) system. Specifically, we formulate an optimization problem to maximize the transmit beampattern gain for sensing while simultaneously meeting each user's communication requirement by jointly optimizing antenna positions and beamforming design. The problem formulated is highly non-convex and involves multivariate-coupled constraints. To address these challenges, we introduce a series of auxiliary random variables and transform the original problem into an augmented Lagrangian problem. A double-loop algorithm based on a penalty dual decomposition framework is then developed to solve the problem. Numerical results validate the effectiveness of the proposed design, demonstrating its superiority over MA designs based on successive convex approximation optimization and other baseline approaches in ISAC systems. The results also highlight the advantages of MAs in achieving better sensing performance and improved beam control, especially for sparse arrays with large apertures.

Submitted 28 March, 2025; originally announced March 2025.

Comments: accepted and to appear in IEEE VTC2025-Spring

arXiv:2503.21476 [pdf, ps, other]

Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time

Authors: Zhaojun Nan, Yunchu Han, Sheng Zhou, Zhisheng Niu

Abstract: In edge intelligence systems, deep neural network (DNN) partitioning and data offloading can provide real-time task inference for resource-constrained mobile devices. However, the inference time of DNNs is typically uncertain and cannot be precisely determined in advance, presenting significant challenges in ensuring timely task processing within deadlines. To address the uncertain inference time, we propose a robust optimization scheme to minimize the total energy consumption of mobile devices while meeting task probabilistic deadlines. The scheme only requires the mean and variance information of the inference time, without any prediction methods or distribution functions. The problem is formulated as a mixed-integer nonlinear programming (MINLP) problem that involves jointly optimizing the DNN model partitioning and the allocation of local CPU/GPU frequencies and uplink bandwidth. To tackle the problem, we first decompose the original problem into two subproblems: resource allocation and DNN model partitioning. Subsequently, the two subproblems with probability constraints are equivalently transformed into deterministic optimization problems using the chance-constrained programming (CCP) method. Finally, the convex optimization technique and the penalty convex-concave procedure (PCCP) technique are employed to obtain the optimal solution of the resource allocation subproblem and a stationary point of the DNN model partitioning subproblem, respectively. The proposed algorithm leverages real-world data from popular hardware platforms and is evaluated on widely used DNN models. Extensive simulations show that our proposed algorithm effectively addresses the inference time uncertainty with probabilistic deadline guarantees while minimizing the energy consumption of mobile devices.

Submitted 27 March, 2025; originally announced March 2025.

arXiv:2503.20749 [pdf, ps, other]

Prompting is Not All You Need! Evaluating LLM Agent Simulation Methodologies with Real-World Online Customer Behavior Data

Authors: Yuxuan Lu, Jing Huang, Yan Han, Bingsheng Yao, Sisong Bei, Jiri Gesi, Yaochen Xie, Zheshen Wang, Qi He, Dakuo Wang

Abstract: Recent research shows that LLMs can simulate ``believable'' human behaviors to power LLM agents via prompt-only methods. In this work, we focus on evaluating LLM's objective ``accuracy'' rather than the subjective ``believability'' in simulating human behavior, leveraging a large-scale, real-world dataset collected from customers' online shopping actions. We present the first comprehensive evaluation of state-of-the-art LLMs (e.g., DeepSeek-R1, Llama, and Claude) on the task of web shopping action generation. Our results show that out-of-the-box LLM-generated actions are often misaligned with actual human behavior, whereas fine-tuning LLMs on real-world behavioral data substantially improves their ability to generate accurate actions compared to prompt-only methods. Furthermore, incorporating synthesized reasonings into model training leads to additional performance gains, demonstrating the value of explicit rationale in behavior modeling. This work evaluates state-of-the-art LLMs in behavior simulation and provides actionable insights into how real-world action data can enhance the fidelity of LLM agents.

Submitted 5 June, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

arXiv:2503.20355 [pdf, other]

CNN+Transformer Based Anomaly Traffic Detection in UAV Networks for Emergency Rescue

Authors: Yulu Han, Ziye Jia, Sijie He, Yu Zhang, Qihui Wu

Abstract: The unmanned aerial vehicle (UAV) network has gained significant attention in recent years due to its various applications. However, traffic security has become a key issue threatening public safety in emergency rescue systems, due to the increasing vulnerability of UAVs to cyber attacks in highly heterogeneous environments. Hence, in this paper, we propose a novel anomaly traffic detection architecture for UAV networks based on the software-defined networking (SDN) framework and blockchain technology. Specifically, SDN separates the control and data planes to enhance network manageability and security. Meanwhile, the blockchain provides decentralized identity authentication and data security records. Besides, a complete security architecture requires an effective mechanism to detect time-series-based abnormal traffic. Thus, an integrated algorithm combining convolutional neural networks (CNNs) and Transformer (CNN+Transformer) for anomaly traffic detection is developed, which is called CTranATD. Finally, the simulation results show that the proposed CTranATD algorithm is effective and outperforms the individual CNN, Transformer, and LSTM algorithms for detecting anomaly traffic.

Submitted 26 March, 2025; originally announced March 2025.

arXiv:2503.18413 [pdf, other]

Exploring the Finite-Temperature Behavior of Rydberg Atom Arrays: A Tensor Network Approach

Authors: Yuzhou Han, Hao Zhang, Lixin He

Abstract: Rydberg atom arrays have emerged as a powerful platform for experimental research and a challenging subject for theoretical investigation in quantum science. In this study, we investigate the finite-temperature properties of two-dimensional square-lattice Rydberg atom arrays using the projected entangled pair states (PEPS) method. By analyzing the thermal behavior of systems in the checkerboard and striated phases, we extract critical exponents and identify phase transition characteristics. Our results confirm that the checkerboard phase transition belongs to the 2D Ising universality class, while the striated phase exhibits critical exponents that deviate from known universality classes, possibly due to finite-size effects. These findings provide theoretical insights into the thermal stability of quantum phases in Rydberg atom arrays and offer valuable guidance for future experimental efforts.

Submitted 24 March, 2025; originally announced March 2025.

arXiv:2503.18127 [pdf, other]

doi 10.1093/mnras/staf494

A PR drag origin for the Fomalhaut disk's pervasive inner dust: constraints on collisional strengths, icy composition, and embedded planets

Authors: Max Sommer, Mark Wyatt, Yinuo Han

Abstract: Recent JWST observations of the Fomalhaut debris disk have revealed a significant abundance of dust interior to the outer planetesimal belt, raising questions about its origin and maintenance. In this study, we apply an analytical model to the Fomalhaut system, that simulates the dust distribution interior to a planetesimal belt, as collisional fragments across a range of sizes are dragged inward under Poynting-Robertson (PR) drag. We generate spectral energy distributions and synthetic JWST/MIRI images of the model disks, and perform an extensive grid search for particle parameters -- pertaining to composition and collisional strength -- that best match the observations. We find that a sound fit can be found for particle properties that involve a substantial water ice component, around 50%--80% by total volume, and a catastrophic disruption threshold, $Q_D^\star$, at a particle size of $D\!\approx\!30\,$um of 2--4$\,\times\,10^6\,$erg/g. Based on the expected dynamical depletion of migrating dust by an intervening planet we discount planets with masses $>1\,M_\mathrm{Saturn}$ beyond $\sim50\,$au in the extended disk, though a planet shepherding the inner edge of the outer belt of up to $\sim2\,M_\mathrm{Saturn}$ is reconcilable with the PR-drag-maintained disk scenario, contingent upon higher collisional strengths. These results indicate that PR drag transport from the outer belt alone can account for the high interior dust contents seen in the Fomalhaut system, which may thus constitute a common phenomenon in other belt-bearing systems. This establishes a framework for interpreting mid-planetary system dust around other stars, with our results for Fomalhaut providing a valuable calibration of the models.

Submitted 15 April, 2025; v1 submitted 23 March, 2025; originally announced March 2025.

Journal ref: Monthly Notices of the Royal Astronomical Society, v. 539-1 (2025), pp. 439-456

arXiv:2503.18082 [pdf, other]

Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms

Authors: Nachuan Ma, Zhengfei Song, Qiang Hu, Chuang-Wei Liu, Yu Han, Yanting Zhang, Rui Fan, Lihua Xie

Abstract: In the emerging field of urban digital twins (UDTs), advancing intelligent road inspection (IRI) vehicles with automatic road crack detection systems is essential for maintaining civil infrastructure. Over the past decade, deep learning-based road crack detection methods have been developed to detect cracks more efficiently, accurately, and objectively, with the goal of replacing manual visual inspection. Nonetheless, there is a lack of systematic reviews on state-of-the-art (SoTA) deep learning techniques, especially data-fusion and label-efficient algorithms for this task. This paper thoroughly reviews the SoTA deep learning-based algorithms, including (1) supervised, (2) unsupervised, (3) semi-supervised, and (4) weakly-supervised methods developed for road crack detection. Also, we create a dataset called UDTIRI-Crack, comprising $2,500$ high-quality images from seven public annotated sources, as the first extensive online benchmark in this field. Comprehensive experiments are conducted to compare the detection performance, computational efficiency, and generalizability of public SoTA deep learning-based algorithms for road crack detection. In addition, the feasibility of foundation models and large language models (LLMs) for road crack detection is explored. Afterwards, the existing challenges and future development trends of deep learning-based road crack detection algorithms are discussed. We believe this review can serve as practical guidance for developing intelligent road detection vehicles with the next-generation road condition assessment systems. The released benchmark UDTIRI-Crack is available at https://udtiri.com/submission/.

Submitted 23 March, 2025; originally announced March 2025.

arXiv:2503.17643 [pdf, other]

Measurements of the branching fractions of $Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}$, $Ξ_{c}^{+}\to Ξ^{0}π^{+}$, and $Ξ_{c}^{+}\to Ξ^{0}K^{+}$ at Belle and Belle II

Authors: Belle, Belle II Collaborations, I. Adachi, J. K. Ahn, Y. Ahn, N. Akopov, S. Alghamdi, M. Alhakami, N. Althubiti, K. Amos, N. Anh Ky, C. Antonioli, D. M. Asner, M. Aversano, R. Ayad, V. Babu, N. K. Baghel, P. Bambade, Sw. Banerjee, M. Barrett, M. Bartl, J. Baudot, A. Beaubien, F. Becherer, et al. (335 additional authors not shown)

Abstract: Using 983.0 $\rm{fb}^{-1}$ and 427.9 $\rm{fb}^{-1}$ data samples collected with the Belle and Belle II detectors at the KEKB and SuperKEKB asymmetric energy $e^+e^-$ colliders, respectively, we present studies of the Cabibbo-favored $Ξ_c^+$ decays ${Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}}$ and $Ξ_{c}^{+}\to Ξ^{0}π^{+}$, and the singly Cabibbo-suppressed decay $Ξ_{c}^{+}\to Ξ^{0}K^{+}$. The ratios of branching fractions of ${Ξ_{c}^{+}\to Σ^{+}K_{S}^{0}}$ and $Ξ_{c}^{+}\to Ξ^{0}K^{+}$ relative to that of $Ξ_{c}^{+}\toΞ^{-}π^{+}π^{+}$ are measured for the first time, while the ratio ${\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+})/{\cal B}(Ξ_{c}^{+}\toΞ^{-}π^{+}π^{+})$ is also determined and improved by an order of magnitude in precision. The measured branching fraction ratios are $\frac{\cal{B}(Ξ_{c}^{+} \to Σ^{+}K_{S}^{0})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)}= 0.067 \pm 0.007 \pm 0.003$, $\frac{\cal{B}(Ξ_c^{+} \to Ξ^{0}π^{+})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)} = 0.248 \pm 0.005 \pm 0.009$, $\frac{\cal{B}(Ξ_c^{+} \to Ξ^{0}K^{+})}{\cal{B}(Ξ_{c}^{+}\to Ξ^{-}π^{+}π^+)} = 0.017 \pm 0.003 \pm 0.001$. Additionally, the ratio ${\cal B}(Ξ_{c}^{+}\toΞ^{0}K^{+})/{\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+})$ is measured to be $0.068 \pm 0.010 \pm 0.004$. Here, the first and second uncertainties are statistical and systematic, respectively. Multiplying the ratios by the branching fraction of the normalization mode, ${\mathcal B}(Ξ_{c}^{+}\toΞ^{-}π^{+}π^+)= (2.9\pm 1.3)\%$, we obtain the following absolute branching fractions ${\cal B}(Ξ_{c}^{+}\toΣ^{+}K^{0}_{S}) = (0.194 \pm 0.021 \pm 0.009 \pm 0.087)\%$, ${\cal B}(Ξ_{c}^{+}\toΞ^{0}π^{+}) = (0.719 \pm 0.014 \pm 0.024 \pm 0.322)\%$, ${\cal B}(Ξ_{c}^{+}\toΞ^{0}K^{+}) = (0.049 \pm 0.007 \pm 0.002 \pm 0.022)\%$.
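The scaling step described in this abstract can be checked with a few lines of arithmetic. The sketch below is an illustration only, not the collaboration's analysis code: the names `RATIOS`, `scale_to_absolute`, and `ABSOLUTE` are hypothetical, and it assumes that the last quoted uncertainty on each absolute branching fraction is the one inherited from the normalization mode (ratio times 1.3%). It reproduces the central values and that last uncertainty. The statistical and systematic terms are deliberately left out, because scaling the rounded ratios quoted above does not recover every published digit.

```python
# Central values of the ratios to the normalization mode Xi_c+ -> Xi- pi+ pi+,
# copied from the abstract.
RATIOS = {
    "Sigma+ K0S": 0.067,
    "Xi0 pi+": 0.248,
    "Xi0 K+": 0.017,
}
NORM_BF = 2.9      # normalization-mode branching fraction, in percent
NORM_BF_ERR = 1.3  # its uncertainty, in percent


def scale_to_absolute(ratio, norm=NORM_BF, norm_err=NORM_BF_ERR):
    """Return (central value, uncertainty from the normalization), in percent."""
    return ratio * norm, ratio * norm_err


ABSOLUTE = {mode: scale_to_absolute(r) for mode, r in RATIOS.items()}

for mode, (central, from_norm) in ABSOLUTE.items():
    # Statistical and systematic terms are omitted here on purpose.
    print(f"{mode}: central {central:.3f}%, normalization term {from_norm:.3f}%")
```

In every mode the normalization term is the largest uncertainty, since the normalization branching fraction itself is only known to about 45% relative precision.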

Submitted 22 March, 2025; originally announced March 2025.

Comments: 20 pages, 4 figures, 3 Tables

Report number: Belle II Preprint 2025-005; KEK Preprint 2025-2

arXiv:2503.15781 [pdf, other]

UAS Visual Navigation in Large and Unseen Environments via a Meta Agent

Authors: Yuci Han, Charles Toth, Alper Yilmaz

Abstract: The aim of this work is to develop an approach that enables Unmanned Aerial Systems (UAS) to efficiently learn to navigate in large-scale urban environments and transfer their acquired expertise to novel environments. To achieve this, we propose a meta-curriculum training scheme. First, meta-training allows the agent to learn a master policy to generalize across tasks. The resulting model is then fine-tuned on the downstream tasks. We organize the training curriculum in a hierarchical manner such that the agent is guided from coarse to fine towards the target task. In addition, we introduce Incremental Self-Adaptive Reinforcement learning (ISAR), an algorithm that combines the ideas of incremental learning and meta-reinforcement learning (MRL). In contrast to traditional reinforcement learning (RL), which focuses on acquiring a policy for a specific task, MRL aims to learn a policy with fast transfer ability to novel tasks. However, the MRL training process is time consuming, whereas our proposed ISAR algorithm achieves faster convergence than the conventional MRL algorithm. We evaluate the proposed methodologies in simulated environments and demonstrate that using this training philosophy in conjunction with the ISAR algorithm significantly improves the convergence speed for navigation in large-scale cities and the adaptation proficiency in novel environments.

Submitted 19 March, 2025; originally announced March 2025.

arXiv:2503.11841 [pdf, other]

Trust Under Siege: Label Spoofing Attacks against Machine Learning for Android Malware Detection

Authors: Tianwei Lan, Luca Demetrio, Farid Nait-Abdesselam, Yufei Han, Simone Aonzo

Abstract: Machine learning (ML) malware detectors rely heavily on crowd-sourced AntiVirus (AV) labels, with platforms like VirusTotal serving as a trusted source of malware annotations. But what if attackers could manipulate these labels to classify benign software as malicious? We introduce label spoofing attacks, a new threat that contaminates crowd-sourced datasets by embedding minimal and undetectable malicious patterns into benign samples. These patterns coerce AV engines into misclassifying legitimate files as harmful, enabling poisoning attacks against ML-based malware classifiers trained on those data. We demonstrate this scenario by developing AndroVenom, a methodology for polluting realistic data sources, causing consequent poisoning attacks against ML malware detectors. Experiments show that not only are state-of-the-art feature extractors unable to filter such injection, but various ML models also experience Denial of Service with as little as 1% poisoned samples. Additionally, attackers can flip decisions of specific unaltered benign samples by modifying only 0.015% of the training data, threatening their reputation and market share, and anomaly detectors on training data are unable to stop this. We conclude our manuscript by raising the alarm on the trustworthiness of the training process based on AV annotations, requiring further investigation on how to produce proper labels for ML malware detectors.

Submitted 14 March, 2025; originally announced March 2025.

arXiv:2503.11446 [pdf]

Extending Ambient Pressure X-ray Photoelectron Spectroscopy to Plasma Studies: A novel and flexible plasma gun approach

Authors: Yang Gu, Zhehao Qiu, Shui Lin, Yong Han, Hui Zhang, Zhi Liu, Jun Cai

Abstract: The characterization of the electronic structure and chemical states of gases, solids, and liquids can be effectively performed using ambient pressure X-ray photoelectron spectroscopy (AP-XPS). However, the acquisition of electronic and chemical information under plasma conditions poses significant challenges. In this study, we have developed an advanced experimental system capable of garnering electronic information amidst plasma environments, alongside providing detailed surface chemical states of samples subjected to plasma conditions. By designing a customized plasma generation apparatus, we successfully integrated it with a traditional AP-XPS system. This novel plasma-AP-XPS system confined plasma proximal to the sample area, with adjustable intensity parameters controlled by either modifying the distance between the plasma source and the sample surface or adjusting the voltage applied. This configuration permitted the direct detection of electrons in the plasma via the XPS electron detector. To substantiate the efficacy and versatility of this setup, it was applied to two distinct studies: the plasma etching of graphene and plasma oxidation of platinum (Pt). The investigations confirmed that argon (Ar) plasma facilitates the etching of graphene, a phenomenon clearly evidenced by the XPS spectra. Similarly, the exposure of the Pt surface to oxygen plasma was found to induce effective oxidation. This developed system significantly extends the utility of AP-XPS, enhancing its application for in-depth studies of plasma-enhanced reactions under operando conditions, thereby holding promise for the advancement in material science and chemical engineering fields.

Submitted 14 March, 2025; originally announced March 2025.

Comments: 16 pages, 8 figures

arXiv:2503.09968 [pdf, other]

Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection

Authors: Zihao Zhang, Aming Wu, Yahong Han

Abstract: Recently, a task of Single-Domain Generalized Object Detection (Single-DGOD) is proposed, aiming to generalize a detector to multiple unknown domains never seen before during training. Due to the unavailability of target-domain data, some methods leverage the multimodal capabilities of vision-language models, using textual prompts to estimate cross-domain information, enhancing the model's generalization capability. These methods typically use a single textual prompt, often referred to as the one-step prompt method. However, when dealing with complex styles such as the combination of rain and night, we observe that the performance of the one-step prompt method tends to be relatively weak. The reason may be that many scenes incorporate not just a single style but a combination of multiple styles. The one-step prompt method may not effectively synthesize combined information involving various styles. To address this limitation, we propose a new method, i.e., Style Evolving along Chain-of-Thought, which aims to progressively integrate and expand style information along the chain of thought, enabling the continual evolution of styles. Specifically, by progressively refining style descriptions and guiding the diverse evolution of styles, this approach enables more accurate simulation of various style characteristics and helps the model gradually learn and adapt to subtle differences between styles. Additionally, it exposes the model to a broader range of style features with different data distributions, thereby enhancing its generalization capability in unseen domains. The significant performance gains over five adverse-weather scenarios and the Real to Art benchmark demonstrate the superiorities of our method.

Submitted 12 March, 2025; originally announced March 2025.

arXiv:2503.07396 [pdf, other]

Brain Inspired Adaptive Memory Dual-Net for Few-Shot Image Classification

Authors: Kexin Di, Xiuxing Li, Yuyang Han, Ziyu Li, Qing Li, Xia Wu

Abstract: Few-shot image classification has become a popular research topic for its wide application in real-world scenarios; however, the problem of supervision collapse induced by single image-level annotation remains a major challenge. Existing methods aim to tackle this problem by locating and aligning relevant local features. However, the high intra-class variability in real-world images poses significant challenges in locating semantically relevant local regions under few-shot settings. Drawing inspiration from the human's complementary learning system, which excels at rapidly capturing and integrating semantic features from limited examples, we propose the generalization-optimized Systems Consolidation Adaptive Memory Dual-Network, SCAM-Net. This approach simulates the systems consolidation of complementary learning system with an adaptive memory module, which successfully addresses the difficulty of identifying meaningful features in few-shot scenarios. Specifically, we construct a Hippocampus-Neocortex dual-network that consolidates structured representation of each category; the structured representation is then stored and adaptively regulated following the generalization optimization principle in a long-term memory inside Neocortex. Extensive experiments on benchmark datasets show that the proposed model has achieved state-of-the-art performance.

Submitted 10 March, 2025; originally announced March 2025.

arXiv:2503.07093

Anomalous Meets Topological Hall Effect in Cr2Ge2Te6 Heterostructures

Authors: Xiaofan Cai, Yaqing Han, Jiawei Jiang, Renjun Du, Di Zhang, Jiabei Huang, Siqi Jiang, Jingkuan Xiao, Zihao Wang, Qian Guo, Wanting Xu, Fuzhuo Lian, Siqing Wang, Bingxian Ou, Yongqiang Yang, Kenji Watanabe, Takashi Taniguchi, Alexander S. Mayorov, Konstantin S. Novoselov, Baigeng Wang, Kai Chang, Hongxin Yang, Lei Wang, Geliang Yu

Abstract: Introducing topologically protected skyrmions in graphene holds significant importance for developing high-speed, low-energy spintronic devices. Here, we present a centrosymmetric ferromagnetic graphene/trilayer Cr2Ge2Te6/graphene heterostructure, demonstrating the anomalous and topological Hall effect due to the magnetic proximity effect. Through gate voltage control, we effectively tune the emergence and size of skyrmions. Micromagnetic simulations reveal the formation of skyrmions and antiskyrmions, which respond differently to external magnetic fields, leading to oscillations in the topological Hall signal. Our findings provide a novel pathway for the formation and manipulation of skyrmions in centrosymmetric two-dimensional magnetic systems, offering significant insights for developing topological spintronics.

Submitted 11 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

Comments: There is a dispute over the ownership of the intellectual property involved. Differences exist with collaborators, research institutions, or other relevant parties regarding the ownership and use of the research results

arXiv:2503.06926 [pdf, ps, other]

Effect of Selection Format on LLM Performance

Authors: Yuchen Han, Yucheng Wu, Jeffrey Willard

Abstract: This paper investigates a critical aspect of large language model (LLM) performance: the optimal formatting of classification task options in prompts. Through an extensive experimental study, we compared two selection formats -- bullet points and plain English -- to determine their impact on model performance. Our findings suggest that presenting options via bullet points generally yields better results, although there are some exceptions. Furthermore, our research highlights the need for continued exploration of option formatting to drive further improvements in model performance.

Submitted 17 June, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

arXiv:2503.05027 [pdf, other]

Entanglement Transitions in Noisy Quantum Circuits on Trees

Authors: Vikram Ravindranath, Yiqiu Han, Xiao Chen

Abstract: Decoherence is ubiquitous, and poses a significant impediment to the observation of quantum phenomena, such as the measurement-induced entanglement phase transition (MIPT). In this work, we study entanglement transitions in quantum circuits on trees, subject to both noise and measurements. We uncover a rich phase diagram that describes the ability of a tree quantum circuit to retain quantum or classical information in the presence of decoherence. By developing a mapping between the dynamics of information on the tree to a classical Markov process -- also defined on the tree -- we obtain exact solutions to the entanglement transitions displayed by the circuit under various noise and measurement strengths. Moreover, we find a host of phenomena, including the MIPT, which are \textit{robust} to decoherence. The analytical tractability facilitated by the method developed in this paper showcases the first example of an exactly solvable noise-robust MIPT, and holds promise for studies on broader, tree-like circuits.

Submitted 6 March, 2025; originally announced March 2025.

Comments: Main: 13 pages, 15 figures; Appendix: 6 pages, 6 figures

arXiv:2503.04371 [pdf, other]

Measurement of the Branching Fraction of $Λ_c^+ \to p K_S^0 π^0$ at Belle

Authors: The Belle, Belle II Collaborations, I. Adachi, L. Aggarwal, H. Ahmed, J. K. Ahn, H. Aihara, N. Akopov, M. Alhakami, A. Aloisio, N. Althubiti, M. Angelsmark, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, et al. (404 additional authors not shown)

Abstract: We report a precise measurement of the ratio of branching fractions $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)$ using 980 fb$^{-1}$ of $e^+e^-$ data from the Belle experiment. We obtain a value of $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)/\mathcal{B}(Λ_c^+\to p K^- π^+)=0.339\pm 0.002\pm 0.009$, where the first and second uncertainties are statistical and systematic, respectively. This Belle result is consistent with the previous measurement from the CLEO experiment but has a fivefold improvement in precision. By combining our result with the world average $\mathcal{B}(Λ_c^+\to p K^- π^+)$, we obtain the absolute branching fraction $\mathcal{B}(Λ_c^+\to p K_S^0 π^0)=(2.12\pm 0.01\pm 0.05 \pm 0.10)\%$, where the uncertainties are statistical, systematic, and the uncertainty in the absolute branching fraction scale $\mathcal{B}(Λ_c^+\to p K^- π^+)$, respectively. This measurement can shed light on hadronic decay mechanisms in charmed baryon decays.

Submitted 18 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

Comments: 20 pages, 7 figures

Report number: Belle II Preprint: 2024-022, KEK preprint: 2024-20

arXiv:2503.04161 [pdf, other]

Square lattice model with staggered magnetic fluxes: zero Chern number topological states and topological flat bands

Authors: Li-Xiang Chen, Dong-Hao Guan, Lu Qi, Xiuyun Zhang, Ying Han, Ai-Lei He

Abstract: Staggered magnetic fluxes (SMF) play a crucial role in achieving Chern insulators (CIs), by which a series of CI models have been established on various lattices. In addition, SMF induced higher-order topological insulator (HOTI) in a lattice model has been reported. In this work, we propose a square lattice model with SMF. We find intracellular SMF can induce zero-Chern-number topological insulator (ZCNTI) at quarter filling which hosts topologically protected edge states characterized by the quantized polarization, in analogy to the topological state in two-dimensional Su-Schrieffer-Heeger model. When lattice dimerization and intracellular SMF are introduced, there exists HOTI state at half filling. Furthermore, this model hosts topological flat band (TFB) by considering the next-nearest-neighbor hoppings. Several fractional Chern insulator states are investigated when hard-core bosons are filled into this TFB model.

Submitted 6 March, 2025; originally announced March 2025.

Comments: 10 pages, 9 figures, Accepted by Physical Review B

arXiv:2503.04083 [pdf]

Spectral signature of periodic modulation and sliding of pseudogap state in moire system

Authors: Yingzhuo Han, Yingbo Wang, Yucheng Xue, Jiefei Shi, Xiaomeng Wang, Kenji Watanabe, Takashi Taniguchi, Jian Kang, Yuhang Jiang, Jinhai Mao

Abstract: The nature of the pseudogap state is widely believed to be a key to understanding the pairing mechanism underlying unconventional superconductivity. Over the past two decades, significant efforts have been devoted to searching for spontaneous symmetry breaking or potential order parameters associated with these pseudogap states, aiming to better characterize their properties. Recently, pseudogap states have also been realized in moire systems with extensive gate tunability, yet their local electronic structure remains largely unexplored. In this study, we report the observation of gate-tunable spontaneous symmetry breaking and sliding behavior of the pseudogap state in magic-angle twisted bilayer graphene (MAtBG) using spectroscopic imaging scanning tunneling microscopy. Our spectroscopy reveals a distinct pseudogap at 4.4 K within the doping range -3 < v < -2. Spectroscopic imaging highlights a gap size modulation at moire scale that is sensitive to the filling, indicative of a wave-like fluctuating pseudogap feature. Specifically, the positions of gap size minima (GSM) coincide with regions of the highest local density of states (LDOS) at the filling v = -2.63, but a unidirectional sliding behavior of GSM is observed for other fillings. In addition, the pseudogap size distribution at certain doping levels also causes a clear nematic order, or an anisotropic gap distribution. Our results have shed light on the complex nature of this pseudogap state, revealing critical insights into the phase diagram of correlated electron systems.

Submitted 5 March, 2025; originally announced March 2025.

Comments: The file contains four Figures

arXiv:2503.03042 [pdf, other]

Learning from Noisy Labels with Contrastive Co-Transformer

Authors: Yan Han, Soumava Kumar Roy, Mehrtash Harandi, Lars Petersson

Abstract: Deep learning with noisy labels is an interesting challenge in weakly supervised learning. Despite their significant learning capacity, CNNs have a tendency to overfit in the presence of samples with noisy labels. Alleviating this issue, the well known Co-Training framework is used as a fundamental basis for our work. In this paper, we introduce a Contrastive Co-Transformer framework, which is simple and fast, yet able to improve the performance by a large margin compared to the state-of-the-art approaches. We argue the robustness of transformers when dealing with label noise. Our Contrastive Co-Transformer approach is able to utilize all samples in the dataset, irrespective of whether they are clean or noisy. Transformers are trained by a combination of contrastive loss and classification loss. Extensive experimental results on corrupted data from six standard benchmark datasets including Clothing1M, demonstrate that our Contrastive Co-Transformer is superior to existing state-of-the-art methods.

Submitted 4 March, 2025; originally announced March 2025.

arXiv:2503.00968 [pdf, other]

Simulation of the Background from $^{13}$C$(α, n)^{16}$O Reaction in the JUNO Scintillator

Authors: JUNO Collaboration, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Costas Andreopoulos, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Beretta, Antonio Bergnoli, Nikita Bessonov, Daniel Bick, Lukas Bieger, Svetlana Biktemerova, et al. (608 additional authors not shown)

Abstract: Large-scale organic liquid scintillator detectors are highly efficient in the detection of MeV-scale electron antineutrinos. These signal events can be detected through inverse beta decay on protons, which produce a positron accompanied by a neutron. A noteworthy background for antineutrinos coming from nuclear power reactors and from the depths of the Earth (geoneutrinos) is generated by ($α, n$) reactions. In organic liquid scintillator detectors, $α$ particles emitted from intrinsic contaminants such as $^{238}$U, $^{232}$Th, and $^{210}$Pb/$^{210}$Po, can be captured on $^{13}$C nuclei, followed by the emission of a MeV-scale neutron. Three distinct interaction mechanisms can produce prompt energy depositions preceding the delayed neutron capture, leading to a pair of events correlated in space and time within the detector. Thus, ($α, n$) reactions represent an indistinguishable background in liquid scintillator-based antineutrino detectors, where their expected rate and energy spectrum are typically evaluated via Monte Carlo simulations. This work presents results from the open-source SaG4n software, used to calculate the expected energy depositions from the neutron and any associated de-excitation products. Also simulated is a detailed detector response to these interactions, using a dedicated Geant4-based simulation software from the JUNO experiment. An expected measurable $^{13}$C$(α, n)^{16}$O event rate and reconstructed prompt energy spectrum, with associated uncertainties, are presented in the context of JUNO; however, the methods and results are applicable and relevant to other organic liquid scintillator neutrino detectors.

Submitted 2 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

Comments: 25 pages, 14 figures, 4 tables

arXiv:2503.00729 [pdf, other]

CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments

Authors: Mingcong Lei, Ge Wang, Yiming Zhao, Zhixin Mai, Qing Zhao, Yao Guo, Zhen Li, Shuguang Cui, Yatong Han, Jinke Ren

Abstract: Large Language Models (LLMs) exhibit remarkable capabilities in the hierarchical decomposition of complex tasks through semantic reasoning. However, their application in embodied systems faces challenges in ensuring reliable execution of subtask sequences and achieving one-shot success in long-term task completion. To address these limitations in dynamic environments, we propose Closed-Loop Embodied Agent (CLEA) -- a novel architecture incorporating four specialized open-source LLMs with functional decoupling for closed-loop task management. The framework features two core innovations: (1) Interactive task planner that dynamically generates executable subtasks based on the environmental memory, and (2) Multimodal execution critic employing an evaluation framework to conduct a probabilistic assessment of action feasibility, triggering hierarchical re-planning mechanisms when environmental perturbations exceed preset thresholds. To validate CLEA's effectiveness, we conduct experiments in a real environment with manipulable objects, using two heterogeneous robots for object search, manipulation, and search-manipulation integration tasks. Across 12 task trials, CLEA outperforms the baseline model, achieving a 67.3% improvement in success rate and a 52.8% increase in task completion rate. These results demonstrate that CLEA significantly enhances the robustness of task planning and execution in dynamic environments.

Submitted 1 March, 2025; originally announced March 2025.

arXiv:2503.00273 [pdf, ps, other]

Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits

Authors: Yuzhou Gu, Yanjun Han, Jian Qian

Abstract: We study the evolution of information in interactive decision making through the lens of a stochastic multi-armed bandit problem. Focusing on a fundamental example where a unique optimal arm outperforms the rest by a fixed margin, we characterize the optimal success probability and mutual information over time. Our findings reveal distinct growth phases in mutual information -- initially linear, transitioning to quadratic, and finally returning to linear -- highlighting curious behavioral differences between interactive and non-interactive environments. In particular, we show that optimal success probability and mutual information can be decoupled, where achieving optimal learning does not necessarily require maximizing information gain. These findings shed new light on the intricate interplay between information and learning in interactive decision making.

Submitted 28 February, 2025; originally announced March 2025.

arXiv:2503.00032 [pdf, ps, other]

KatFishNet: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis

Authors: Shinwoo Park, Shubin Kim, Do-Kyung Kim, Yo-Sub Han

Abstract: The rapid advancement of large language models (LLMs) increases the difficulty of distinguishing between human-written and LLM-generated text. Detecting LLM-generated text is crucial for upholding academic integrity, preventing plagiarism, protecting copyrights, and ensuring ethical research practices. Most prior studies on detecting LLM-generated text focus primarily on English text. However, languages with distinct morphological and syntactic characteristics require specialized detection approaches. Their unique structures and usage patterns can hinder the direct application of methods primarily designed for English. Among such languages, we focus on Korean, which has relatively flexible spacing rules, a rich morphological system, and less frequent comma usage compared to English. We introduce KatFish, the first benchmark dataset for detecting LLM-generated Korean text. The dataset consists of text written by humans and generated by four LLMs across three genres. By examining spacing patterns, part-of-speech diversity, and comma usage, we illuminate the linguistic differences between human-written and LLM-generated Korean text. Building on these observations, we propose KatFishNet, a detection method specifically designed for the Korean language. KatFishNet achieves an average of 19.78% higher AUROC compared to the best-performing existing detection method. Our code and data are available at https://github.com/Shinwoo-Park/detecting_llm_generated_korean_text_through_linguistic_analysis.

Submitted 1 July, 2025; v1 submitted 24 February, 2025; originally announced March 2025.

Comments: Accepted to ACL 2025 main conference

arXiv:2502.20675 [pdf]

Polar Vortex Superstructure and Its Coupling with Correlated Electrons in Quasiperiodic Moire Crystal

Authors: Si-yu Li, Zhongrui Wang, Yingzhuo Han, Shaoqing Xu, Zhiyue Xu, Yingbo Wang, Zhengwen Wang, Yucheng Xue, Aisheng Song, Kenji Watanabe, Takashi Taniguchi, Xueyun Wang, Tian-Bao Ma, Jiawang Hong, Hong-Jun Gao, Yuhang Jiang, Jinhai Mao

Abstract: Nanoscale polar structures are significant for understanding polarization processes in low-dimensional systems and hold potential for developing high-performance electronics. Here, we demonstrate a polar vortex superstructure arising from the reconstructed moiré patterns in twisted bilayer graphene aligned with hexagonal boron nitride. Scanning tunneling microscopy reveals spatially modulated charge polarization, while theoretical simulations indicate that the in-plane polarization field forms an array of polar vortices. Notably, this polar field is gate-tunable, exhibiting an unconventional gate-tunable polar sliding and screening process. Moreover, its interaction with electron correlations in twisted bilayer graphene leads to modulated correlated states. Our findings establish moiré pattern reconstruction as a powerful strategy for engineering nanoscale polar structures and emergent quantum phases in van der Waals materials.

Submitted 27 February, 2025; originally announced February 2025.

Comments: 4 Figures

Showing 101–150 of 2,397 results for author: Han, Y