Search | arXiv e-print repository

Offline Reinforcement Learning for Microgrid Voltage Regulation

Abstract: This paper presents a study on using different offline reinforcement learning algorithms for microgrid voltage regulation with solar power penetration. When environment interaction is unviable due to technical or safety reasons, the proposed approach can still obtain an applicable model through offline-style training on a previously collected dataset, lowering the negative impact of lacking online… ▽ More This paper presents a study on using different offline reinforcement learning algorithms for microgrid voltage regulation with solar power penetration. When environment interaction is unviable due to technical or safety reasons, the proposed approach can still obtain an applicable model through offline-style training on a previously collected dataset, lowering the negative impact of lacking online environment interactions. Experiment results on the IEEE 33-bus system demonstrate the feasibility and effectiveness of the proposed approach on different offline datasets, including the one with merely low-quality experience. △ Less

Submitted 14 May, 2025; originally announced May 2025.

Comments: This paper has been accepted and presented at ICLR 2025 in Singapore, Apr. 28, 2025

arXiv:2505.09705 [pdf, other]

Search for a dark Higgs boson produced in association with inelastic dark matter at the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal , et al. (415 additional authors not shown)

Abstract: Inelastic dark matter models that have two dark matter particles and a massive dark photon can reproduce the observed relic dark matter density without violating cosmological limits. The mass splitting between the two dark matter particles $χ_{1}$ and $χ_{2}$, with $m(χ_{2}) > m(χ_{1})$, is induced by a dark Higgs field and a corresponding dark Higgs boson $h^{\prime}$. We present a search for dar… ▽ More Inelastic dark matter models that have two dark matter particles and a massive dark photon can reproduce the observed relic dark matter density without violating cosmological limits. The mass splitting between the two dark matter particles $χ_{1}$ and $χ_{2}$, with $m(χ_{2}) > m(χ_{1})$, is induced by a dark Higgs field and a corresponding dark Higgs boson $h^{\prime}$. We present a search for dark matter in events with two vertices, at least one of which must be displaced from the interaction region, and missing energy. Using a $365\,\mbox{fb}^{-1}$ data sample collected at Belle II, which operates at the SuperKEKB $e^+e^-$ collider, we observe no evidence for a signal. We set upper limits on the product of the production cross section $σ\left(e^+e^- \to h^\prime χ_1 χ_2\right)$, and the product of branching fractions $\mathcal{B}\left(χ_2\toχ_1 e^+ e^-\right)\times\mathcal{B}\left(h^\prime\to x^+x^-\right)$, where $x^+x^-$ indicates $μ^+μ^-, π^+π^-$, or $K^+K^-$, as functions of $h^{\prime}$ mass and lifetime at the level of $10^{-1}\,\mbox{fb}$. We set model-dependent upper limits on the dark Higgs mixing angle at the level of $10^{-5}$ and on the dark photon kinetic mixing parameter at the level of $10^{-3}$. This is the first search for dark Higgs bosons in association with inelastic dark matter. △ Less

Submitted 14 May, 2025; originally announced May 2025.

Comments: Submitted for publication with Physical Review Letters

Report number: Belle II Preprint 2025-015, KEK Preprint 2025-14

arXiv:2505.09123 [pdf, ps, other]

Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance

Authors: Guoying Liang, Su Yang

Abstract: Big model has emerged as a new research paradigm that can be applied to various down-stream tasks with only minor effort for domain adaption. Correspondingly, this study tackles Camouflaged Object Detection (COD) leveraging the Segment Anything Model (SAM). The previous studies declared that SAM is not workable for COD but this study reveals that SAM works if promoted properly, for which we devise… ▽ More Big model has emerged as a new research paradigm that can be applied to various down-stream tasks with only minor effort for domain adaption. Correspondingly, this study tackles Camouflaged Object Detection (COD) leveraging the Segment Anything Model (SAM). The previous studies declared that SAM is not workable for COD but this study reveals that SAM works if promoted properly, for which we devise a new framework to render point promotions: First, we develop the Promotion Point Targeting Network (PPT-net) to leverage multi-scale features in predicting the probabilities of camouflaged objects' presences at given candidate points over the image. Then, we develop a key point selection (KPS) algorithm to deploy both positive and negative point promotions contrastively to SAM to guide the segmentation. It is the first work to facilitate big model for COD and achieves plausible results experimentally over the existing methods on 3 data sets under 6 metrics. This study demonstrates an off-the-shelf methodology for COD by leveraging SAM, which gains advantage over designing professional models from scratch, not only in performance, but also in turning the problem to a less challenging task, that is, seeking informative but not exactly precise promotions. △ Less

Submitted 14 May, 2025; originally announced May 2025.

arXiv:2505.09093 [pdf]

doi 10.1051/0004-6361/202554496

Observational study of the formation of homologous confined circular-ribbon flares

Authors: Shuhong Yang, Ruisheng Zheng, Yijun Hou, Yuandeng Shen, Yin Li, Xiaoshuai Zhu, Ting Li, Guiping Zhou

Abstract: When several solar flares with comparable classes occur successively at the same location and exhibit similar morphological features, they are called homologous flares. During 2012 May 8-10, five M-class homologous circular-ribbon flares associated with no coronal mass ejection occurred in active region (AR) 11476. The formation process of these homologous confined flares, particularly the homolog… ▽ More When several solar flares with comparable classes occur successively at the same location and exhibit similar morphological features, they are called homologous flares. During 2012 May 8-10, five M-class homologous circular-ribbon flares associated with no coronal mass ejection occurred in active region (AR) 11476. The formation process of these homologous confined flares, particularly the homologous aspect, is unclear and inconclusive. This paper is dedicated to studying how the energy for this series of flares was accumulated and whether there existed null points responsible for the flare energy release. Before and during the five flares, the sunspots with opposite polarities sheared against each other and also rotated individually. Before each flare, the magnetic fields at the polarity inversion line were highly sheared and there existed a magnetic flux rope overlain by arch-shaped loops. For the first four flares, we find magnetic null points in the fan-spine topology, situated at about 3.8 Mm, 5.7 Mm, 3.4 Mm, and 2.6 Mm above the photosphere, respectively. For the fifth flare, no null point is detected. However, in the (extreme-)ultraviolet images, the evolution behaviors of all the flares were almost identical. Therefore, we speculate that a null point responsible for the occurrence of the fifth flare may have existed. These results reveal that, for these homologous flares in AR 11476, the sunspot rotation and shearing motion play important roles in energy accumulation, the null point of the fan-spine topology is crucial for energy release through magnetic reconnection therein, and large-scale magnetic loops prevent the erupting material from escaping the Sun, thus forming the observed homologous confined major circular-ribbon flares. This study provides clear evidence for the drivers of successive, homologous flares as well as the nature of confined events. △ Less

Submitted 13 May, 2025; originally announced May 2025.

Comments: 10 pages, 8 figures. Accepted for publication in A&A

Journal ref: A&A 698, A185 (2025)

arXiv:2505.09089 [pdf, ps, other]

Generating time-consistent dynamics with discriminator-guided image diffusion models

Authors: Philipp Hess, Maximilian Gelbrecht, Christof Schötz, Michael Aich, Yu Huang, Shangshang Yang, Niklas Boers

Abstract: Realistic temporal dynamics are crucial for many video generation, processing and modelling applications, e.g. in computational fluid dynamics, weather prediction, or long-term climate simulations. Video diffusion models (VDMs) are the current state-of-the-art method for generating highly realistic dynamics. However, training VDMs from scratch can be challenging and requires large computational re… ▽ More Realistic temporal dynamics are crucial for many video generation, processing and modelling applications, e.g. in computational fluid dynamics, weather prediction, or long-term climate simulations. Video diffusion models (VDMs) are the current state-of-the-art method for generating highly realistic dynamics. However, training VDMs from scratch can be challenging and requires large computational resources, limiting their wider application. Here, we propose a time-consistency discriminator that enables pretrained image diffusion models to generate realistic spatiotemporal dynamics. The discriminator guides the sampling inference process and does not require extensions or finetuning of the image diffusion model. We compare our approach against a VDM trained from scratch on an idealized turbulence simulation and a real-world global precipitation dataset. Our approach performs equally well in terms of temporal consistency, shows improved uncertainty calibration and lower biases compared to the VDM, and achieves stable centennial-scale climate simulations at daily time steps. △ Less

Submitted 14 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

arXiv:2505.08875 [pdf, other]

Real-time Capable Learning-based Visual Tool Pose Correction via Differentiable Simulation

Authors: Shuyuan Yang, Zonghe Chua

Abstract: Autonomy in Minimally Invasive Robotic Surgery (MIRS) has the potential to reduce surgeon cognitive and task load, thereby increasing procedural efficiency. However, implementing accurate autonomous control can be difficult due to poor end-effector proprioception, a limitation of their cable-driven mechanisms. Although the robot may have joint encoders for the end-effector pose calculation, variou… ▽ More Autonomy in Minimally Invasive Robotic Surgery (MIRS) has the potential to reduce surgeon cognitive and task load, thereby increasing procedural efficiency. However, implementing accurate autonomous control can be difficult due to poor end-effector proprioception, a limitation of their cable-driven mechanisms. Although the robot may have joint encoders for the end-effector pose calculation, various non-idealities make the entire kinematics chain inaccurate. Modern vision-based pose estimation methods lack real-time capability or can be hard to train and generalize. In this work, we demonstrate a real-time capable, vision transformer-based pose estimation approach that is trained using end-to-end differentiable kinematics and rendering in simulation. We demonstrate the potential of this method to correct for noisy pose estimates in simulation, with the longer term goal of verifying the sim-to-real transferability of our approach. △ Less

Submitted 13 May, 2025; originally announced May 2025.

arXiv:2505.08734 [pdf, ps, other]

NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context

Authors: Ben Yao, Qiuchi Li, Yazhou Zhang, Siyu Yang, Bohan Zhang, Prayag Tiwari, Jing Qin

Abstract: This work introduces the first benchmark for nursing value alignment, consisting of five core value dimensions distilled from international nursing codes: Altruism, Human Dignity, Integrity, Justice, and Professionalism. The benchmark comprises 1,100 real-world nursing behavior instances collected through a five-month longitudinal field study across three hospitals of varying tiers. These instance… ▽ More This work introduces the first benchmark for nursing value alignment, consisting of five core value dimensions distilled from international nursing codes: Altruism, Human Dignity, Integrity, Justice, and Professionalism. The benchmark comprises 1,100 real-world nursing behavior instances collected through a five-month longitudinal field study across three hospitals of varying tiers. These instances are annotated by five clinical nurses and then augmented with LLM-generated counterfactuals with reversed ethic polarity. Each original case is paired with a value-aligned and a value-violating version, resulting in 2,200 labeled instances that constitute the Easy-Level dataset. To increase adversarial complexity, each instance is further transformed into a dialogue-based format that embeds contextual cues and subtle misleading signals, yielding a Hard-Level dataset. We evaluate 23 state-of-the-art (SoTA) LLMs on their alignment with nursing values. Our findings reveal three key insights: (1) DeepSeek-V3 achieves the highest performance on the Easy-Level dataset (94.55), where Claude 3.5 Sonnet outperforms other models on the Hard-Level dataset (89.43), significantly surpassing the medical LLMs; (2) Justice is consistently the most difficult nursing value dimension to evaluate; and (3) in-context learning significantly improves alignment. This work aims to provide a foundation for value-sensitive LLMs development in clinical settings. The dataset and the code are available at https://huggingface.co/datasets/Ben012345/NurValues. △ Less

Submitted 13 May, 2025; originally announced May 2025.

Comments: 25 pages, 10 figures, 16 tables

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2505.08418 [pdf, ps, other]

Search for lepton flavor-violating decay modes $B^0 \to K^{\ast 0}τ^\pm\ell^\mp$ ($\ell = e,μ$) with hadronic B-tagging at Belle and Belle II

Authors: Belle, Belle II Collaborations, :, I. Adachi, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee , et al. (353 additional authors not shown)

Abstract: We present the results of a search for the charged-lepton-flavor violating decays $B^0 \rightarrow K^{*0}τ^\pm \ell^{\mp}$, where $\ell^{\mp}$ is either an electron or a muon. The results are based on 365 fb$^{-1}$ and 711 fb$^{-1}$ datasets collected with the Belle II and Belle detectors, respectively. We use an exclusive hadronic $B$-tagging technique, and search for a signal decay in the system… ▽ More We present the results of a search for the charged-lepton-flavor violating decays $B^0 \rightarrow K^{*0}τ^\pm \ell^{\mp}$, where $\ell^{\mp}$ is either an electron or a muon. The results are based on 365 fb$^{-1}$ and 711 fb$^{-1}$ datasets collected with the Belle II and Belle detectors, respectively. We use an exclusive hadronic $B$-tagging technique, and search for a signal decay in the system recoiling against a fully reconstructed $B$ meson. We find no evidence for $B^0 \rightarrow K^{*0}τ^\pm \ell^{\mp}$ decays and set upper limits on the branching fractions in the range of $(2.9-6.4)\times10^{-5}$ at 90% confidence level. △ Less

Submitted 13 May, 2025; originally announced May 2025.

Comments: 19 pages, 4 figures

Report number: Belle II preprint: 2025-014, KEK preprint: 2025-13

arXiv:2505.08409 [pdf, ps, other]

Observational constraints on the Kerr and its several single-parameter modified spacetimes using quasi-periodic oscillation data

Authors: Shining Yang, Jianbo Lu, Wenmei Li, Mou Xu, Jingyang Xu

Abstract: This paper investigates the dynamical effects of particles moving in the Kerr spacetime and its nine single-parameter modified spacetimes, including Bardeen, Ayon-Beato and Garcia (ABG), Hayward, Kerr-Newman (KN), Kerr-Taub-NUT (KTN), Braneworld Kerr (BK), Kerr-MOG, Kerr-Sen, and Perfect Fluid Dark Matter (PFDM) black holes. Using quasi-periodic oscillation (QPO) observational data, we constrain t… ▽ More This paper investigates the dynamical effects of particles moving in the Kerr spacetime and its nine single-parameter modified spacetimes, including Bardeen, Ayon-Beato and Garcia (ABG), Hayward, Kerr-Newman (KN), Kerr-Taub-NUT (KTN), Braneworld Kerr (BK), Kerr-MOG, Kerr-Sen, and Perfect Fluid Dark Matter (PFDM) black holes. Using quasi-periodic oscillation (QPO) observational data, we constrain the free parameters of the ten spacetimes through $χ^2$ analysis under the relativistic precession model of QPO. We constrain the modification parameters for the nine single-parameter modified spacetimes and provide the spin and mass ranges of three microquasars within the ten spacetime models (including Kerr) at the $68\%$ confidence level (CL). The results demonstrate that, at the $68 \%$ CL, the QPO data impose stringent constraints on the free parameters, as evidenced by the narrow confidence intervals. Among them, only the KN spacetime yields a modification parameter constraint spanning both negative and positive values (encompassing the Kerr case at zero). In contrast, all other tested geometries mandate positive-definite parameters at $68 \%$ CL, demonstrating statistical deviation of the Kerr solution. This highlights the significance of exploring modifications to the Kerr spacetime. Finally, we evaluate the spacetime models using the Bayes factor and the Akaike Information Criterion (AIC). Based on the current QPO observational data, the Bayesian factor analysis indicates that the ABG, Hayward, KN, BK, and Kerr-MOG spacetime have a slight advantage over the Kerr solution, while the Bardeen, KTN, Kerr-Sen, and PFDM spacetime are somewhat inferior to the Kerr model. In contrast, the AIC analysis shows that the Kerr spacetime remains the optimal model under the current QPO data. △ Less

Submitted 13 May, 2025; originally announced May 2025.

arXiv:2505.08194 [pdf, other]

CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding

Authors: Wenxuan Ma, Xiaoge Cao, Yixiang Zhang, Chaofan Zhang, Shaobo Yang, Peng Hao, Bin Fang, Yinghao Cai, Shaowei Cui, Shuo Wang

Abstract: Recent advancements in integrating tactile sensing with vision-language models (VLMs) have demonstrated remarkable potential for robotic multimodal perception. However, existing tactile descriptions remain limited to superficial attributes like texture, neglecting critical contact states essential for robotic manipulation. To bridge this gap, we propose CLTP, an intuitive and effective language ta… ▽ More Recent advancements in integrating tactile sensing with vision-language models (VLMs) have demonstrated remarkable potential for robotic multimodal perception. However, existing tactile descriptions remain limited to superficial attributes like texture, neglecting critical contact states essential for robotic manipulation. To bridge this gap, we propose CLTP, an intuitive and effective language tactile pretraining framework that aligns tactile 3D point clouds with natural language in various contact scenarios, thus enabling contact-state-aware tactile language understanding for contact-rich manipulation tasks. We first collect a novel dataset of 50k+ tactile 3D point cloud-language pairs, where descriptions explicitly capture multidimensional contact states (e.g., contact location, shape, and force) from the tactile sensor's perspective. CLTP leverages a pre-aligned and frozen vision-language feature space to bridge holistic textual and tactile modalities. Experiments validate its superiority in three downstream tasks: zero-shot 3D classification, contact state classification, and tactile 3D large language model (LLM) interaction. To the best of our knowledge, this is the first study to align tactile and language representations from the contact state perspective for manipulation tasks, providing great potential for tactile-language-action model learning. Code and datasets are open-sourced at https://sites.google.com/view/cltp/. △ Less

Submitted 12 May, 2025; originally announced May 2025.

Comments: 16 pages

arXiv:2505.08092 [pdf, ps, other]

Doubly Robust Fusion of Many Treatments for Policy Learning

Authors: Ke Zhu, Jianing Chu, Ilya Lipkovich, Wenyu Ye, Shu Yang

Abstract: Individualized treatment rules/recommendations (ITRs) aim to improve patient outcomes by tailoring treatments to the characteristics of each individual. However, when there are many treatment groups, existing methods face significant challenges due to data sparsity within treatment groups and highly unbalanced covariate distributions across groups. To address these challenges, we propose a novel c… ▽ More Individualized treatment rules/recommendations (ITRs) aim to improve patient outcomes by tailoring treatments to the characteristics of each individual. However, when there are many treatment groups, existing methods face significant challenges due to data sparsity within treatment groups and highly unbalanced covariate distributions across groups. To address these challenges, we propose a novel calibration-weighted treatment fusion procedure that robustly balances covariates across treatment groups and fuses similar treatments using a penalized working model. The fusion procedure ensures the recovery of latent treatment group structures when either the calibration model or the outcome model is correctly specified. In the fused treatment space, practitioners can seamlessly apply state-of-the-art ITR learning methods with the flexibility to utilize a subset of covariates, thereby achieving robustness while addressing practical concerns such as fairness. We establish theoretical guarantees, including consistency, the oracle property of treatment fusion, and regret bounds when integrated with multi-armed ITR learning methods such as policy trees. Simulation studies show superior group recovery and policy value compared to existing approaches. We illustrate the practical utility of our method using a nationwide electronic health record-derived de-identified database containing data from patients with Chronic Lymphocytic Leukemia and Small Lymphocytic Lymphoma. △ Less

Submitted 23 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

Comments: Accepted by ICML 2025

arXiv:2505.07782 [pdf, ps, other]

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Authors: Rushi Qiang, Yuchen Zhuang, Yinghao Li, Dingu Sagar V K, Rongzhi Zhang, Changhao Li, Ian Shu-Hei Wong, Sherry Yang, Percy Liang, Chao Zhang, Bo Dai

Abstract: We introduce MLE-Dojo, a Gym-style framework for systematically reinforcement learning, evaluating, and improving autonomous large language model (LLM) agents in iterative machine learning engineering (MLE) workflows. Unlike existing benchmarks that primarily rely on static datasets or single-attempt evaluations, MLE-Dojo provides an interactive environment enabling agents to iteratively experimen… ▽ More We introduce MLE-Dojo, a Gym-style framework for systematically reinforcement learning, evaluating, and improving autonomous large language model (LLM) agents in iterative machine learning engineering (MLE) workflows. Unlike existing benchmarks that primarily rely on static datasets or single-attempt evaluations, MLE-Dojo provides an interactive environment enabling agents to iteratively experiment, debug, and refine solutions through structured feedback loops. Built upon 200+ real-world Kaggle challenges, MLE-Dojo covers diverse, open-ended MLE tasks carefully curated to reflect realistic engineering scenarios such as data processing, architecture search, hyperparameter tuning, and code debugging. Its fully executable environment supports comprehensive agent training via both supervised fine-tuning and reinforcement learning, facilitating iterative experimentation, realistic data sampling, and real-time outcome verification. Extensive evaluations of eight frontier LLMs reveal that while current models achieve meaningful iterative improvements, they still exhibit significant limitations in autonomously generating long-horizon solutions and efficiently resolving complex errors. Furthermore, MLE-Dojo's flexible and extensible architecture seamlessly integrates diverse data sources, tools, and evaluation protocols, uniquely enabling model-based agent tuning and promoting interoperability, scalability, and reproducibility. We open-source our framework and benchmarks to foster community-driven innovation towards next-generation MLE agents. △ Less

Submitted 12 May, 2025; originally announced May 2025.

arXiv:2505.06973 [pdf, ps, other]

A $p_T$-ratio observable for studies of intrinsic transverse momentum of partons from Drell-Yan $p_T$ spectra

Authors: Wenxiao Zhan, Siqi Yang, Minghui Liu, Liang Han, Francesco Hautmann

Abstract: The determination of the intrinsic transverse momentum distribution of partons is central both for applications of parton shower Monte Carlo generators and for QCD studies of transverse momentum dependent (TMD) parton densities. Valuable information on this distribution is provided by experimental measurements of Drell-Yan transverse momentum $p_T$, in the region of low transverse momenta, with fi… ▽ More The determination of the intrinsic transverse momentum distribution of partons is central both for applications of parton shower Monte Carlo generators and for QCD studies of transverse momentum dependent (TMD) parton densities. Valuable information on this distribution is provided by experimental measurements of Drell-Yan transverse momentum $p_T$, in the region of low transverse momenta, with fine binning in $p_T$. However, such fine-binning measurements are challenging, as they require an extremely delicate control of systematic uncertainties. We suggest a $p_T$ observable based on measuring ratios between cross sections of suitably defined low-$p_T$ and high-$p_T$ regions. This observable does not rely on any dedicated partition of bins and has lower systematic uncertainties, and is shown to provide a good sensitivity to the intrinsic transverse momentum. △ Less

Submitted 11 May, 2025; originally announced May 2025.

Comments: Contribution to the 2025 QCD session of the 59th Rencontres de Moriond. Based on the work in arXiv:2412.19060, published in Phys. Rev. D 111, 036018

arXiv:2505.06947 [pdf, ps, other]

The Wisdom of Agent Crowds: A Human-AI Interaction Innovation Ignition Framework

Authors: Senhao Yang, Qiwen Cheng, Ruiqi Ma, Liangzhe Zhao, Zhenying Wu, Guangqiang Yu

Abstract: With the widespread application of large AI models in various fields, the automation level of multi-agent systems has been continuously improved. However, in high-risk decision-making scenarios such as healthcare and finance, human participation and the alignment of intelligent systems with human intentions remain crucial. This paper focuses on the financial scenario and constructs a multi-agent b… ▽ More With the widespread application of large AI models in various fields, the automation level of multi-agent systems has been continuously improved. However, in high-risk decision-making scenarios such as healthcare and finance, human participation and the alignment of intelligent systems with human intentions remain crucial. This paper focuses on the financial scenario and constructs a multi-agent brainstorming framework based on the BDI theory. A human-computer collaborative multi-agent financial analysis process is built using Streamlit. The system plans tasks according to user intentions, reduces users' cognitive load through real-time updated structured text summaries and the interactive Cothinker module, and reasonably integrates general and reasoning large models to enhance the ability to handle complex problems. By designing a quantitative analysis algorithm for the sentiment tendency of interview content based on LLMs and a method for evaluating the diversity of ideas generated by LLMs in brainstorming based on k-means clustering and information entropy, the system is comprehensively evaluated. The results of human factors testing show that the system performs well in terms of usability and user experience. Although there is still room for improvement, it can effectively support users in completing complex financial tasks. The research shows that the system significantly improves the efficiency of human-computer interaction and the quality of decision-making in financial decision-making scenarios, providing a new direction for the development of related fields. △ Less

Submitted 11 May, 2025; originally announced May 2025.

ACM Class: I.2.7; J.4

arXiv:2505.06816 [pdf, ps, other]

doi 10.1109/TVT.2025.3561218

Cross-Link Interference Mitigation With Over-the-Air Pilot Forwarding for Dynamic TDD

Authors: Jia-Hui Bi, Shaoshi Yang, Xiao-Yang Wang, Yu-Song Luo, Sheng Chen

Abstract: Dynamic time-division duplex (D-TDD) aided mobile communication systems bear the potential to achieve significantly higher spectral efficiency than traditional static TDD based systems. However, strong cross-link interference (CLI) may be caused by different transmission directions between adjacent cells in D-TDD systems, thus degrading the performance. Most existing CLI mitigation schemes require… ▽ More Dynamic time-division duplex (D-TDD) aided mobile communication systems bear the potential to achieve significantly higher spectral efficiency than traditional static TDD based systems. However, strong cross-link interference (CLI) may be caused by different transmission directions between adjacent cells in D-TDD systems, thus degrading the performance. Most existing CLI mitigation schemes require sharing certain information among base stations (BSs) via backhaul links. This strategy is usually expensive and suffers high latency. Alternatively, we propose a pilot information sharing scheme based on over-the-air forwarding of the downlink pilot of the interfering BS to the interfered BS via a wireless terminal, along with a dedicated CLI channel estimation method. Simulation results demonstrate that thanks to the proposed pilot information sharing scheme the classic interference rejection combining (IRC) receiver achieves a signal detection performance highly comparable to that of the IRC detector with perfect pilot information, necessitating no information sharing among BSs via backhaul links. Furthermore, the proposed CLI channel estimation scheme reduces the impact of errors introduced by pilot forwarding, thereby improving the performance of both CLI channel estimation and signal detection. △ Less

Submitted 10 May, 2025; originally announced May 2025.

Comments: 6 pages, 7 figures, 1 table, to be published in IEEE Transactions on Vehicular Technology

arXiv:2505.06710 [pdf, other]

SimMIL: A Universal Weakly Supervised Pre-Training Framework for Multi-Instance Learning in Whole Slide Pathology Images

Authors: Yicheng Song, Tiancheng Lin, Die Peng, Su Yang, Yi Xu

Abstract: Various multi-instance learning (MIL) based approaches have been developed and successfully applied to whole-slide pathological images (WSI). Existing MIL methods emphasize the importance of feature aggregators, but largely neglect the instance-level representation learning. They assume that the availability of a pre-trained feature extractor can be directly utilized or fine-tuned, which is not al… ▽ More Various multi-instance learning (MIL) based approaches have been developed and successfully applied to whole-slide pathological images (WSI). Existing MIL methods emphasize the importance of feature aggregators, but largely neglect the instance-level representation learning. They assume that the availability of a pre-trained feature extractor can be directly utilized or fine-tuned, which is not always the case. This paper proposes to pre-train feature extractor for MIL via a weakly-supervised scheme, i.e., propagating the weak bag-level labels to the corresponding instances for supervised learning. To learn effective features for MIL, we further delve into several key components, including strong data augmentation, a non-linear prediction head and the robust loss function. We conduct experiments on common large-scale WSI datasets and find it achieves better performance than other pre-training schemes (e.g., ImageNet pre-training and self-supervised learning) in different downstream tasks. We further show the compatibility and scalability of the proposed scheme by deploying it in fine-tuning the pathological-specific models and pre-training on merged multiple datasets. To our knowledge, this is the first work focusing on the representation learning for MIL. △ Less

Submitted 10 May, 2025; originally announced May 2025.

arXiv:2505.06708 [pdf, ps, other]

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Authors: Zihan Qiu, Zekun Wang, Bo Zheng, Zeyu Huang, Kaiyue Wen, Songlin Yang, Rui Men, Le Yu, Fei Huang, Suozhi Huang, Dayiheng Liu, Jingren Zhou, Junyang Lin

Abstract: Gating mechanisms have been widely utilized, from early models like LSTMs and Highway Networks to recent state space models, linear attention, and also softmax attention. Yet, existing literature rarely examines the specific effects of gating. In this work, we conduct comprehensive experiments to systematically investigate gating-augmented softmax attention variants. Specifically, we perform a com… ▽ More Gating mechanisms have been widely utilized, from early models like LSTMs and Highway Networks to recent state space models, linear attention, and also softmax attention. Yet, existing literature rarely examines the specific effects of gating. In this work, we conduct comprehensive experiments to systematically investigate gating-augmented softmax attention variants. Specifically, we perform a comprehensive comparison over 30 variants of 15B Mixture-of-Experts (MoE) models and 1.7B dense models trained on a 3.5 trillion token dataset. Our central finding is that a simple modification-applying a head-specific sigmoid gate after the Scaled Dot-Product Attention (SDPA)-consistently improves performance. This modification also enhances training stability, tolerates larger learning rates, and improves scaling properties. By comparing various gating positions and computational variants, we attribute this effectiveness to two key factors: (1) introducing non-linearity upon the low-rank mapping in the softmax attention, and (2) applying query-dependent sparse gating scores to modulate the SDPA output. Notably, we find this sparse gating mechanism mitigates 'attention sink' and enhances long-context extrapolation performance, and we also release related $\href{https://github.com/qiuzh20/gated_attention}{codes}$ and $\href{https://huggingface.co/QwQZh/gated_attention}{models}$ to facilitate future research. △ Less

Submitted 10 May, 2025; originally announced May 2025.

arXiv:2505.06663 [pdf, ps, other]

METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection

Authors: Yongqi Wang, Xinxiao Wu, Shuo Yang

Abstract: Open-vocabulary video visual relationship detection aims to detect objects and their relationships in videos without being restricted by predefined object or relationship categories. Existing methods leverage the rich semantic knowledge of pre-trained vision-language models such as CLIP to identify novel categories. They typically adopt a cascaded pipeline to first detect objects and then classify… ▽ More Open-vocabulary video visual relationship detection aims to detect objects and their relationships in videos without being restricted by predefined object or relationship categories. Existing methods leverage the rich semantic knowledge of pre-trained vision-language models such as CLIP to identify novel categories. They typically adopt a cascaded pipeline to first detect objects and then classify relationships based on the detected objects, which may lead to error propagation and thus suboptimal performance. In this paper, we propose Mutual EnhancemenT of Objects and Relationships (METOR), a query-based unified framework to jointly model and mutually enhance object detection and relationship classification in open-vocabulary scenarios. Under this framework, we first design a CLIP-based contextual refinement encoding module that extracts visual contexts of objects and relationships to refine the encoding of text features and object queries, thus improving the generalization of encoding to novel categories. Then we propose an iterative enhancement module to alternatively enhance the representations of objects and relationships by fully exploiting their interdependence to improve recognition performance. Extensive experiments on two public datasets, VidVRD and VidOR, demonstrate that our framework achieves state-of-the-art performance. △ Less

Submitted 10 May, 2025; originally announced May 2025.

Comments: IJCAI2025

arXiv:2505.06556 [pdf, other]

TierBase: A Workload-Driven Cost-Optimized Key-Value Store

Authors: Zhitao Shen, Shiyu Yang, Weibo Chen, Kunming Wang, Yue Li, Jiabao Jin, Wei Jia, Junwei Chen, Yuan Su, Xiaoxia Duan, Wei Chen, Lei Wang, Jie Song, Ruoyi Ruan, Xuemin Lin

Abstract: In the current era of data-intensive applications, the demand for high-performance, cost-effective storage solutions is paramount. This paper introduces a Space-Performance Cost Model for key-value store, designed to guide cost-effective storage configuration decisions. The model quantifies the trade-offs between performance and storage costs, providing a framework for optimizing resource allocati… ▽ More In the current era of data-intensive applications, the demand for high-performance, cost-effective storage solutions is paramount. This paper introduces a Space-Performance Cost Model for key-value store, designed to guide cost-effective storage configuration decisions. The model quantifies the trade-offs between performance and storage costs, providing a framework for optimizing resource allocation in large-scale data serving environments. Guided by this cost model, we present TierBase, a distributed key-value store developed by Ant Group that optimizes total cost by strategically synchronizing data between cache and storage tiers, maximizing resource utilization and effectively handling skewed workloads. To enhance cost-efficiency, TierBase incorporates several optimization techniques, including pre-trained data compression, elastic threading mechanisms, and the utilization of persistent memory. We detail TierBase's architecture, key components, and the implementation of cost optimization strategies. Extensive evaluations using both synthetic benchmarks and real-world workloads demonstrate TierBase's superior cost-effectiveness compared to existing solutions. Furthermore, case studies from Ant Group's production environments showcase TierBase's ability to achieve up to 62% cost reduction in primary scenarios, highlighting its practical impact in large-scale online data serving. △ Less

Submitted 10 May, 2025; originally announced May 2025.

Comments: Accepted by ICDE 2025

arXiv:2505.06192 [pdf, other]

GECAM Discovery of Peculiar Oscillating Particle Precipitation Events

Authors: Chenwei Wang, Shaolin Xiong, Yi Zhao, Wei Xu, Gaopeng Lu, Xuzhi Zhou, Xiaocheng Guo, Wenya Li, Xiaochao Yang, Qinghe Zhang, Xinqiao Li, Zhenxia Zhang, Zhenghua An, Ce Cai, Peiyi Feng, Yue Huang, Min Gao, Ke Gong, Dongya Guo, Haoxuan Guo, Bing Li, Xiaobo Li, Yaqing Liu, Jiacong Liu, Xiaojing Liu , et al. (30 additional authors not shown)

Abstract: Charged particle precipitation typically manifests as a gradual increase and decrease of flux observed by space detectors. Cases with rapidly flux variation are very rare. Periodic events are even more extraordinary. These oscillating particle precipitation (OPP) events are usually attributed to the bounce motion of electrons, which are induced by lightning. Owing to the observation limitations, t… ▽ More Charged particle precipitation typically manifests as a gradual increase and decrease of flux observed by space detectors. Cases with rapidly flux variation are very rare. Periodic events are even more extraordinary. These oscillating particle precipitation (OPP) events are usually attributed to the bounce motion of electrons, which are induced by lightning. Owing to the observation limitations, there has been debate regarding whether these oscillations originate from temporal flux evolution or spatial structure evolution. Here we report three peculiar charged particle precipitation events detected by GECAM during a geomagnetic storm on March 21, 2024, with two exhibiting significant periodicity. These events were observed around the same region during three consecutive orbits. Through comprehensive temporal and spectral analyses, we revealed that one of the OPP events exhibited a transition in spectral lag of mini-pulses, shifting from "softer-earlier" to "softer-later" while showing no significant time evolution in overall frequency characteristics. And there is no association found between these two OPP events and lightning activity. Several possible scenarios are discussed to explain these charged particles with a life time of more than 3.5 hours, but the nature of these three events remains an enigma. We suggest that these GECAM-detected OPP events may represent a new type of particle precipitation event or a peculiar Lightning-induced Electron Precipitations (LEPs). △ Less

Submitted 9 May, 2025; originally announced May 2025.

arXiv:2505.06167 [pdf, other]

Pitch Angle Measurement Method based on Detector Counts Distribution. -I. Basic conception

Authors: Chenwei Wang, Shaolin Xiong, Hongbo Xue, Yiteng Zhang, Shanzhi Ye, Wei Xu, Jinpeng Zhang, Zhenghua An, Ce Cai, Peiyi Feng, Ke Gong, Haoxuan Guo, Yue Huang, Xinqiao Li, Jiacong Liu, Xiaojing Liu, Xiang Ma, Liming Song, Wenjun Tan, Jin Wang, Ping Wang, Yue Wang, Xiangyang Wen, Shuo Xiao, Shenlun Xie , et al. (14 additional authors not shown)

Abstract: As an X-ray and gamma-ray all-sky monitor aiming for high energy astrophysical transients, Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) has also made a series of observational discoveries on burst events of gamma-rays and particles in the low Earth orbit. Pitch angle is one of the key parameters of charged particles traveling around geomagnetic field. However,… ▽ More As an X-ray and gamma-ray all-sky monitor aiming for high energy astrophysical transients, Gravitational-wave high-energy Electromagnetic Counterpart All-sky Monitor (GECAM) has also made a series of observational discoveries on burst events of gamma-rays and particles in the low Earth orbit. Pitch angle is one of the key parameters of charged particles traveling around geomagnetic field. However, the usage of the GECAM-style instruments to measure the pitch angle of charged particles is still lacking. Here we propose a novel method for GECAM and similar instruments to measure the pitch angle of charged particles based on detector counts distribution. The basic conception of this method and simulation studies are described. With this method, the pitch angle of a peculiar electron precipitation event detected by GECAM-C is derived to be about 90$^\circ$, demonstrating the feasibility of our method. We note that the application of this method on GECAM-style instruments may open a new window for studying space particle events, such as Terrestrial Electron Beams (TEBs) and Lightning-induced Electron Precipitations (LEPs). △ Less

Submitted 9 May, 2025; originally announced May 2025.

arXiv:2505.05968 [pdf, other]

Offline Multi-agent Reinforcement Learning via Score Decomposition

Authors: Dan Qiao, Wenhao Li, Shanchao Yang, Hongyuan Zha, Baoxiang Wang

Abstract: Offline cooperative multi-agent reinforcement learning (MARL) faces unique challenges due to distributional shifts, particularly stemming from the high dimensionality of joint action spaces and the presence of out-of-distribution joint action selections. In this work, we highlight that a fundamental challenge in offline MARL arises from the multi-equilibrium nature of cooperative tasks, which indu… ▽ More Offline cooperative multi-agent reinforcement learning (MARL) faces unique challenges due to distributional shifts, particularly stemming from the high dimensionality of joint action spaces and the presence of out-of-distribution joint action selections. In this work, we highlight that a fundamental challenge in offline MARL arises from the multi-equilibrium nature of cooperative tasks, which induces a highly multimodal joint behavior policy space coupled with heterogeneous-quality behavior data. This makes it difficult for individual policy regularization to align with a consistent coordination pattern, leading to the policy distribution shift problems. To tackle this challenge, we design a sequential score function decomposition method that distills per-agent regularization signals from the joint behavior policy, which induces coordinated modality selection under decentralized execution constraints. Then we leverage a flexible diffusion-based generative model to learn these score functions from multimodal offline data, and integrate them into joint-action critics to guide policy updates toward high-reward, in-distribution regions under a shared team reward. Our approach achieves state-of-the-art performance across multiple particle environments and Multi-agent MuJoCo benchmarks consistently. To the best of our knowledge, this is the first work to explicitly address the distributional gap between offline and online MARL, paving the way for more generalizable offline policy-based MARL methods. △ Less

Submitted 5 June, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

Comments: Working papers

arXiv:2505.05784 [pdf, ps, other]

FlowHFT: Imitation Learning via Flow Matching Policy for Optimal High-Frequency Trading under Diverse Market Conditions

Authors: Yang Li, Zhi Chen, Steve Yang

Abstract: High-frequency trading (HFT) is an investing strategy that continuously monitors market states and places bid and ask orders at millisecond speeds. Traditional HFT approaches fit models with historical data and assume that future market states follow similar patterns. This limits the effectiveness of any single model to the specific conditions it was trained for. Additionally, these models achieve… ▽ More High-frequency trading (HFT) is an investing strategy that continuously monitors market states and places bid and ask orders at millisecond speeds. Traditional HFT approaches fit models with historical data and assume that future market states follow similar patterns. This limits the effectiveness of any single model to the specific conditions it was trained for. Additionally, these models achieve optimal solutions only under specific market conditions, such as assumptions about stock price's stochastic process, stable order flow, and the absence of sudden volatility. Real-world markets, however, are dynamic, diverse, and frequently volatile. To address these challenges, we propose the FlowHFT, a novel imitation learning framework based on flow matching policy. FlowHFT simultaneously learns strategies from numerous expert models, each proficient in particular market scenarios. As a result, our framework can adaptively adjust investment decisions according to the prevailing market state. Furthermore, FlowHFT incorporates a grid-search fine-tuning mechanism. This allows it to refine strategies and achieve superior performance even in complex or extreme market scenarios where expert strategies may be suboptimal. We test FlowHFT in multiple market environments. We first show that flow matching policy is applicable in stochastic market environments, thus enabling FlowHFT to learn trading strategies under different market conditions. Notably, our single framework consistently achieves performance superior to the best expert for each market condition. △ Less

Submitted 22 May, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

Comments: 16 pages, 6 figures, 7 tables, 2 algorithms

arXiv:2505.05444 [pdf, ps, other]

The soft X-ray transient EP241021a: a cosmic explosion with a complex off-axis jet and cocoon from a massive progenitor

Authors: Giulia Gianfagna, Luigi Piro, Gabriele Bruni, Aishwarya Linesh Thakur, Hendrik Van Eerten, Alberto Castro-Tirado, Yong Chen, Ye-hao Cheng, Han He, Shumei Jia, Zhixing Ling, Elisabetta Maiorano, Rosita Paladino, Roberta Tripodi, Andrea Rossi, Shuaikang Yang, Jianghui Yuan, Weimin Yuan, Chen Zhang

Abstract: X-Ray Flashes (XRFs) are fast X-ray transients discovered by the BeppoSAX satellite, showing an isotropic sky distribution and a prompt emission duration between 10-1000 seconds. The observed prompt X-ray spectrum is similar to Gamma Ray Bursts (GRBs), but with a softer peak energy of the spectrum. Several pieces of evidence indicate that XRFs are connected to GRBs and likely represent their softe… ▽ More X-Ray Flashes (XRFs) are fast X-ray transients discovered by the BeppoSAX satellite, showing an isotropic sky distribution and a prompt emission duration between 10-1000 seconds. The observed prompt X-ray spectrum is similar to Gamma Ray Bursts (GRBs), but with a softer peak energy of the spectrum. Several pieces of evidence indicate that XRFs are connected to GRBs and likely represent their softer analogues, but their origin is still unclear. Several models have been proposed to explain the observed properties of XRF, mostly in the context of collapsar scenario, similar to GRBs but with different geometrical or physical conditions of the progenitor. These include off-axis GRBs and baryon-loaded explosions, that either produce a low Lorentz factor jet or a spherical mildly (or non-) relativistic ejecta, known as cocoons. In this paper, we present multi-wavelength observations of the afterglow of EP241021a, a soft X-ray transient detected by EP, consistent with being an XRF. We present the results of our multiwavelength campaign from radio (uGMRT, ATCA, e-MERLIN, ALMA), optical (LBT, GTC, CAHA) and X-rays (EP-FXT). EP241021a afterglow is characterized by multiple components. They represent the imprints of the interaction of a jet with the complex environment of the pre-existing progenitor, that is likely shaping its structure from a highly relativistic narrow cone to much broader and mildly relativistic cocoon components. △ Less

Submitted 8 May, 2025; originally announced May 2025.

Comments: 19 pages, 12 figures. Submitted to Astronomy & Astrophysics

arXiv:2505.05279 [pdf, other]

MTL-UE: Learning to Learn Nothing for Multi-Task Learning

Authors: Yi Yu, Song Xia, Siyuan Yang, Chenqi Kong, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot

Abstract: Most existing unlearnable strategies focus on preventing unauthorized users from training single-task learning (STL) models with personal data. Nevertheless, the paradigm has recently shifted towards multi-task data and multi-task learning (MTL), targeting generalist and foundation models that can handle multiple tasks simultaneously. Despite their growing importance, MTL data and models have been… ▽ More Most existing unlearnable strategies focus on preventing unauthorized users from training single-task learning (STL) models with personal data. Nevertheless, the paradigm has recently shifted towards multi-task data and multi-task learning (MTL), targeting generalist and foundation models that can handle multiple tasks simultaneously. Despite their growing importance, MTL data and models have been largely neglected while pursuing unlearnable strategies. This paper presents MTL-UE, the first unified framework for generating unlearnable examples for multi-task data and MTL models. Instead of optimizing perturbations for each sample, we design a generator-based structure that introduces label priors and class-wise feature embeddings which leads to much better attacking performance. In addition, MTL-UE incorporates intra-task and inter-task embedding regularization to increase inter-class separation and suppress intra-class variance which enhances the attack robustness greatly. Furthermore, MTL-UE is versatile with good supports for dense prediction tasks in MTL. It is also plug-and-play allowing integrating existing surrogate-dependent unlearnable methods with little adaptation. Extensive experiments show that MTL-UE achieves superior attacking performance consistently across 4 MTL datasets, 3 base UE methods, 5 model backbones, and 5 MTL task-weighting strategies. △ Less

Submitted 8 May, 2025; originally announced May 2025.

Comments: Accepted by ICML 2025

arXiv:2505.05133 [pdf, other]

Probing the Collision Geometry via Two-Photon Processes in Heavy-Ion Collisions

Authors: Jiaxuan Luo, Xinbai Li, Zebo Tang, Xin Wu, Shuai Yang, Wangmei Zha, Zhan Zhang

Abstract: The initial collision geometry, including the reaction plane, is crucial for interpreting collective phenomena in relativistic heavy-ion collisions, yet it remains experimentally inaccessible through conventional measurements. Recent studies propose utilizing photon-induced processes as a direct probe, leveraging the complete linear polarization of emitted photons whose orientation strongly correl… ▽ More The initial collision geometry, including the reaction plane, is crucial for interpreting collective phenomena in relativistic heavy-ion collisions, yet it remains experimentally inaccessible through conventional measurements. Recent studies propose utilizing photon-induced processes as a direct probe, leveraging the complete linear polarization of emitted photons whose orientation strongly correlates with the collision geometry. In this work, we employ a QED-based approach to systematically investigate dilepton production via two-photon processes in heavy-ion collisions at RHIC and LHC energies and detector acceptances. Our calculations reveal that dilepton emission exhibits significant sensitivity to the initial collision geometry through both the azimuthal angles of their emission (defined by the relative momentum vector of the two leptons) and the overall momentum orientation of the dilepton pairs. These findings highlight the potential of two-photon-generated dileptons as a novel, polarization-driven probe to quantify the initial collision geometry and reduce uncertainties in characterizing quark-gluon plasma properties. △ Less

Submitted 8 May, 2025; originally announced May 2025.

Comments: 7 pages, 5 figures

arXiv:2505.04469 [pdf, other]

Measurement of reactor antineutrino oscillation at SNO+

Authors: SNO+ Collaboration, :, M. Abreu, V. Albanese, A. Allega, R. Alves, M. R. Anderson, S. Andringa, L. Anselmo, J. Antunes, E. Arushanova, S. Asahi, M. Askins, D. M. Asner, D. J. Auty, A. R. Back, S. Back, A. Bacon, T. Baltazar, F. Barão, Z. Barnard, A. Barr, N. Barros, D. Bartlett, R. Bayes , et al. (276 additional authors not shown)

Abstract: The SNO+ collaboration reports its second spectral analysis of reactor antineutrino oscillation using 286 tonne-years of new data. The measured energies of reactor antineutrino candidates were fitted to obtain the second-most precise determination of the neutrino mass-squared difference $Δm^2_{21}$ = ($7.96^{+0.48}_{-0.42}$) $\times$ 10$^{-5}$ eV$^2$. Constraining $Δm^2_{21}$ and $\sin^2θ_{12}$ wi… ▽ More The SNO+ collaboration reports its second spectral analysis of reactor antineutrino oscillation using 286 tonne-years of new data. The measured energies of reactor antineutrino candidates were fitted to obtain the second-most precise determination of the neutrino mass-squared difference $Δm^2_{21}$ = ($7.96^{+0.48}_{-0.42}$) $\times$ 10$^{-5}$ eV$^2$. Constraining $Δm^2_{21}$ and $\sin^2θ_{12}$ with measurements from long-baseline reactor antineutrino and solar neutrino experiments yields $Δm^2_{21}$ = ($7.58^{+0.18}_{-0.17}$) $\times$ 10$^{-5}$ eV$^2$ and $\sin^2θ_{12} = 0.308 \pm 0.013$. This fit also yields a first measurement of the flux of geoneutrinos in the Western Hemisphere, with $73^{+47}_{-43}$ TNU at SNO+. △ Less

Submitted 7 May, 2025; originally announced May 2025.

arXiv:2505.04409 [pdf, ps, other]

Measurement of neutron production in atmospheric neutrino interactions at Super-Kamiokande

Authors: Super-Kamiokande collaboration, :, S. Han, K. Abe, S. Abe, Y. Asaoka, C. Bronner, M. Harada, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, M. Nakahata, S. Nakayama, Y. Noguchi , et al. (260 additional authors not shown)

Abstract: We present measurements of total neutron production from atmospheric neutrino interactions in water, analyzed as a function of electron-equivalent visible energy over a range of 30 MeV to 10 GeV. These results are based on 4,270 days of data collected by Super-Kamiokande, including 564 days with 0.011 wt\% gadolinium added to enhance neutron detection. Neutron signal selection is based on a neural… ▽ More We present measurements of total neutron production from atmospheric neutrino interactions in water, analyzed as a function of electron-equivalent visible energy over a range of 30 MeV to 10 GeV. These results are based on 4,270 days of data collected by Super-Kamiokande, including 564 days with 0.011 wt\% gadolinium added to enhance neutron detection. Neutron signal selection is based on a neural network trained on simulation, with its performance validated using an Am/Be neutron point source. The measurements are compared to predictions from neutrino event generators combined with various hadron-nucleus interaction models, which include an intranuclear cascade model and a nuclear de-excitation model. We observe significant variations in the predictions depending on the choice of hadron-nucleus interaction model. We discuss key factors that contribute to describing our data, such as in-medium effects in the intranuclear cascade and the accuracy of statistical evaporation modeling. △ Less

Submitted 20 June, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

Comments: 24 pages, 25 figures

arXiv:2505.04396 [pdf, ps, other]

Supporting renewable energy planning and operation with data-driven high-resolution ensemble weather forecast

Authors: Jingnan Wang, Jie Chao, Shangshang Yang, Kaijun Ren, Kefeng Deng, Xi Chen, Yaxin Liu, Hanqiuzi Wen, Ziniu Xiao, Lifeng Zhang, Xiaodong Wang, Jiping Guan, Baoxiang Pan

Abstract: The planning and operation of renewable energy, especially wind power, depend crucially on accurate, timely, and high-resolution weather information. Coarse-grid global numerical weather forecasts are typically downscaled to meet these requirements, introducing challenges of scale inconsistency, process representation error, computation cost, and entanglement of distinct uncertainty sources from c… ▽ More The planning and operation of renewable energy, especially wind power, depend crucially on accurate, timely, and high-resolution weather information. Coarse-grid global numerical weather forecasts are typically downscaled to meet these requirements, introducing challenges of scale inconsistency, process representation error, computation cost, and entanglement of distinct uncertainty sources from chaoticity, model bias, and large-scale forcing. We address these challenges by learning the climatological distribution of a target wind farm using its high-resolution numerical weather simulations. An optimal combination of this learned high-resolution climatological prior with coarse-grid large scale forecasts yields highly accurate, fine-grained, full-variable, large ensemble of weather pattern forecasts. Using observed meteorological records and wind turbine power outputs as references, the proposed methodology verifies advantageously compared to existing numerical/statistical forecasting-downscaling pipelines, regarding either deterministic/probabilistic skills or economic gains. Moreover, a 100-member, 10-day forecast with spatial resolution of 1 km and output frequency of 15 min takes < 1 hour on a moderate-end GPU, as contrast to $\mathcal{O}(10^3)$ CPU hours for conventional numerical simulation. By drastically reducing computational costs while maintaining accuracy, our method paves the way for more efficient and reliable renewable energy planning and operation. △ Less

Submitted 27 June, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

arXiv:2505.04021 [pdf, other]

Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving

Authors: Shan Yu, Jiarong Xing, Yifan Qiao, Mingyuan Ma, Yangmin Li, Yang Wang, Shuo Yang, Zhiqiang Xie, Shiyi Cao, Ke Bao, Ion Stoica, Harry Xu, Ying Sheng

Abstract: Serving large language models (LLMs) is expensive, especially for providers hosting many models, making cost reduction essential. The unique workload patterns of serving multiple LLMs (i.e., multi-LLM serving) create new opportunities and challenges for this task. The long-tail popularity of models and their long idle periods present opportunities to improve utilization through GPU sharing. Howeve… ▽ More Serving large language models (LLMs) is expensive, especially for providers hosting many models, making cost reduction essential. The unique workload patterns of serving multiple LLMs (i.e., multi-LLM serving) create new opportunities and challenges for this task. The long-tail popularity of models and their long idle periods present opportunities to improve utilization through GPU sharing. However, existing GPU sharing systems lack the ability to adjust their resource allocation and sharing policies at runtime, making them ineffective at meeting latency service-level objectives (SLOs) under rapidly fluctuating workloads. This paper presents Prism, a multi-LLM serving system that unleashes the full potential of GPU sharing to achieve both cost efficiency and SLO attainment. At its core, Prism tackles a key limitation of existing systems$\unicode{x2014}$the lack of $\textit{cross-model memory coordination}$, which is essential for flexibly sharing GPU memory across models under dynamic workloads. Prism achieves this with two key designs. First, it supports on-demand memory allocation by dynamically mapping physical to virtual memory pages, allowing flexible memory redistribution among models that space- and time-share a GPU. Second, it improves memory efficiency through a two-level scheduling policy that dynamically adjusts sharing strategies based on models' runtime demands. Evaluations on real-world traces show that Prism achieves more than $2\times$ cost savings and $3.3\times$ SLO attainment compared to state-of-the-art systems. △ Less

Submitted 12 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

arXiv:2505.03894 [pdf]

doi 10.1039/D4NR04812A

Spectroscopic evidence of intra-unit-cell charge redistribution in charge-neutral magnetic topological insulator Sb-doped MnBi6Te10

Authors: Khanh Duy Nguyen, Gabriele Berruto, Seng Huat Lee, Yunhe Bai, Haoran Lin, Qiang Gao, Zhiqiang Mao, Shuolong Yang

Abstract: The magnetic topological insulator MnBi$_{6}$Te$_{10}$ has emerged as a promising candidate for realizing the quantum anomalous Hall effect (QAHE), owing to its ability to retain ferromagnetism through precise control of anti-site defects. The next important task for realizing the QAHE is to tune the chemical potential into the energy gap formed by the broken time-reversal symmetry. Here we reveal… ▽ More The magnetic topological insulator MnBi$_{6}$Te$_{10}$ has emerged as a promising candidate for realizing the quantum anomalous Hall effect (QAHE), owing to its ability to retain ferromagnetism through precise control of anti-site defects. The next important task for realizing the QAHE is to tune the chemical potential into the energy gap formed by the broken time-reversal symmetry. Here we reveal an intra-unit-cell charge redistribution even when the overall doping suggests a near-charge-neutral condition. By performing time- and angle-resolved photoemission spectroscopy (trARPES) on the optimally 18% Sb-doped MnBi$_{6}$Te$_{10}$, we observe transient surface photovoltage (SPV) effects on both the MnBi$_{2}$Te$_{4}$ and single-Bi$_{2}$Te$_{3}$ terminations. Furthermore, we observe a time-dependent splitting of the band structure indicating multiple SPV shifts with different magnitudes. This observation suggests that adjacent plateaus with nominally the same terminating layer exhibit a strong intra-unit-cell charge redistribution, resulting in spontaneous electrical polarization. This is consistent with static micro-ARPES measurements revealing significant doping deviations from the charge-neutral configuration. Our findings underscore the challenges of engineering the family of Mn-Bi-Te materials to realize QAHE purely through chemical doping. Achieving the desired topological quantum phase requires both a uniform carrier doping and a ferromagnetic ground state. Furthermore, the light-induced polarization within each unit cell of ferromagnetic Mn(Bi$_{0.82}$Sb$_{0.18}$)$_{6}$Te$_{10}$ may open new possibilities for optoelectronic and spintronics. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: 11 pages, 4 figures

Journal ref: Nanoscale 17, 10663-10669 (2025)

arXiv:2505.03809 [pdf, ps, other]

When Dynamic Data Selection Meets Data Augmentation

Authors: Suorong Yang, Peng Ye, Furao Shen, Dongzhan Zhou

Abstract: Dynamic data selection aims to accelerate training with lossless performance. However, reducing training data inherently limits data diversity, potentially hindering generalization. While data augmentation is widely used to enhance diversity, it is typically not optimized in conjunction with selection. As a result, directly combining these techniques fails to fully exploit their synergies. To tack… ▽ More Dynamic data selection aims to accelerate training with lossless performance. However, reducing training data inherently limits data diversity, potentially hindering generalization. While data augmentation is widely used to enhance diversity, it is typically not optimized in conjunction with selection. As a result, directly combining these techniques fails to fully exploit their synergies. To tackle the challenge, we propose a novel online data training framework that, for the first time, unifies dynamic data selection and augmentation, achieving both training efficiency and enhanced performance. Our method estimates each sample's joint distribution of local density and multimodal semantic consistency, allowing for the targeted selection of augmentation-suitable samples while suppressing the inclusion of noisy or ambiguous data. This enables a more significant reduction in dataset size without sacrificing model generalization. Experimental results demonstrate that our method outperforms existing state-of-the-art approaches on various benchmark datasets and architectures, e.g., reducing 50\% training costs on ImageNet-1k with lossless performance. Furthermore, our approach enhances noise resistance and improves model robustness, reinforcing its practical utility in real-world scenarios. △ Less

Submitted 2 May, 2025; originally announced May 2025.

Journal ref: ICML 2025

arXiv:2505.03738 [pdf, ps, other]

AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control

Authors: Jialong Li, Xuxin Cheng, Tianshu Huang, Shiqi Yang, Ri-Zhao Qiu, Xiaolong Wang

Abstract: Humanoid robots derive much of their dexterity from hyper-dexterous whole-body movements, enabling tasks that require a large operational workspace: such as picking objects off the ground. However, achieving these capabilities on real humanoids remains challenging due to their high degrees of freedom (DoF) and nonlinear dynamics. We propose Adaptive Motion Optimization (AMO), a framework that inte… ▽ More Humanoid robots derive much of their dexterity from hyper-dexterous whole-body movements, enabling tasks that require a large operational workspace: such as picking objects off the ground. However, achieving these capabilities on real humanoids remains challenging due to their high degrees of freedom (DoF) and nonlinear dynamics. We propose Adaptive Motion Optimization (AMO), a framework that integrates sim-to-real reinforcement learning (RL) with trajectory optimization for real-time, adaptive whole-body control. To mitigate distribution bias in motion imitation RL, we construct a hybrid AMO dataset and train a network capable of robust, on-demand adaptation to potentially O.O.D. commands. We validate AMO in simulation and on a 29-DoF Unitree G1 humanoid robot, demonstrating superior stability and an expanded workspace compared to strong baselines. Finally, we show that AMO's consistent performance supports autonomous task execution via imitation learning, underscoring the system's versatility and robustness. △ Less

Submitted 6 May, 2025; originally announced May 2025.

Comments: website: https://amo-humanoid.github.io

arXiv:2505.03483 [pdf, ps, other]

Measurement of the branching fraction ratio $R_K$ at large dilepton invariant mass

Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, L. An , et al. (1134 additional authors not shown)

Abstract: A test of lepton universality between muons and electrons is performed using $B^+\to K^+\ell^+\ell^-$ decays (where $\ell$ = $e$, $μ$), in the dilepton invariant-mass-squared region above 14.3 GeV$^2/c^4$. The data used for the measurement consists of beauty meson decays produced in proton-proton collisions, corresponding to an integrated luminosity of 9 $\text{fb}^{-1}$, collected by the LHCb exp… ▽ More A test of lepton universality between muons and electrons is performed using $B^+\to K^+\ell^+\ell^-$ decays (where $\ell$ = $e$, $μ$), in the dilepton invariant-mass-squared region above 14.3 GeV$^2/c^4$. The data used for the measurement consists of beauty meson decays produced in proton-proton collisions, corresponding to an integrated luminosity of 9 $\text{fb}^{-1}$, collected by the LHCb experiment between 2011 and 2018. The ratio of branching fractions for $B^+\to K^+μ^+μ^-$ and $B^+\to K^+e^+e^-$ decays is measured to be $R_K = 1.08^{+0.11}_{-0.09}\;(\text{stat})\;^{+0.04}_{-0.04}\;(\text{syst})$, which is consistent with the Standard Model prediction of unity. This constitutes the most precise test of lepton flavour universality using $B^+\to K^+\ell^+\ell^-$ decays with dilepton invariant-mass-squared above the $ψ(2S)$ mass, whilst being the first of its kind at a hadron collider. △ Less

Submitted 25 June, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3164/ (LHCb public pages)

Report number: LHCb-PAPER-2024-056, CERN-EP-2025-069

arXiv:2505.02912 [pdf, ps, other]

doi 10.1103/vg9c-xvdc

Measurement of the time-integrated $CP$ asymmetry in $D^0\toπ^0π^0$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, Y. Ahn, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, M. Bartl , et al. (350 additional authors not shown)

Abstract: We measure the time-integrated $CP$ asymmetry, $A_{CP}$, in $D^0\toπ^0π^0$ decays reconstructed in $e^+e^-\to c\bar{c}$ events collected by Belle II during 2019--2022. The data corresponds to an integrated luminosity of 428$\mathrm{fb}^{-1}$. The $D^0$ decays are required to originate from the flavor-conserving $D^{*+} \to D^0 π^+$ decay to determine the charm flavor at production time. Control sa… ▽ More We measure the time-integrated $CP$ asymmetry, $A_{CP}$, in $D^0\toπ^0π^0$ decays reconstructed in $e^+e^-\to c\bar{c}$ events collected by Belle II during 2019--2022. The data corresponds to an integrated luminosity of 428$\mathrm{fb}^{-1}$. The $D^0$ decays are required to originate from the flavor-conserving $D^{*+} \to D^0 π^+$ decay to determine the charm flavor at production time. Control samples of $D^0\to K^- π^+$ decays, with or without an associated pion from a $D^{*+}$ decay, are used to correct for detection asymmetries. The result, $A_{CP}(D^0\toπ^0π^0) = (0.30\pm 0.72\pm 0.20)\%$, where the first uncertainty is statistical and the second systematic, is consistent with $CP$ symmetry. △ Less

Submitted 9 July, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

Report number: Belle II Preprint 2025-009, KEK Preprint 2025-7

Journal ref: Phys. Rev. D 112, 012006 (2025)

arXiv:2505.02675 [pdf, other]

Attractor-Based Coevolving Dot Product Random Graph Model

Authors: Shiwen Yang, Daniel L. Sussman

Abstract: We introduce the attractor-based coevolving dot product random graph model (ABCDPRGM) to analyze time-series network data manifesting polarizing or flocking behavior. Graphs are generated based on latent positions under the random dot product graph regime. We assign group membership to each node. When evolving through time, the latent position of each node will change based on its current position… ▽ More We introduce the attractor-based coevolving dot product random graph model (ABCDPRGM) to analyze time-series network data manifesting polarizing or flocking behavior. Graphs are generated based on latent positions under the random dot product graph regime. We assign group membership to each node. When evolving through time, the latent position of each node will change based on its current position and two attractors, which are defined to be the centers of the latent positions of all of its neighbors who share its group membership or who have different group membership than it. Parameters are assigned to the attractors to quantify the amount of influence that the attractors have on the trajectory of the latent position of each node. We developed estimators for the parameters, demonstrated their consistency, and established convergence rates under specific assumptions. Through the ABCDPRGM, we provided a novel framework for quantifying and understanding the underlying forces influencing the polarizing or flocking behaviors in dynamic network data. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2505.02126 [pdf, other]

GarmentGS: Point-Cloud Guided Gaussian Splatting for High-Fidelity Non-Watertight 3D Garment Reconstruction

Authors: Zhihao Tang, Shenghao Yang, Hongtao Zhang, Mingbo Zhao

Abstract: Traditional 3D garment creation requires extensive manual operations, resulting in time and labor costs. Recently, 3D Gaussian Splatting has achieved breakthrough progress in 3D scene reconstruction and rendering, attracting widespread attention and opening new pathways for 3D garment reconstruction. However, due to the unstructured and irregular nature of Gaussian primitives, it is difficult to r… ▽ More Traditional 3D garment creation requires extensive manual operations, resulting in time and labor costs. Recently, 3D Gaussian Splatting has achieved breakthrough progress in 3D scene reconstruction and rendering, attracting widespread attention and opening new pathways for 3D garment reconstruction. However, due to the unstructured and irregular nature of Gaussian primitives, it is difficult to reconstruct high-fidelity, non-watertight 3D garments. In this paper, we present GarmentGS, a dense point cloud-guided method that can reconstruct high-fidelity garment surfaces with high geometric accuracy and generate non-watertight, single-layer meshes. Our method introduces a fast dense point cloud reconstruction module that can complete garment point cloud reconstruction in 10 minutes, compared to traditional methods that require several hours. Furthermore, we use dense point clouds to guide the movement, flattening, and rotation of Gaussian primitives, enabling better distribution on the garment surface to achieve superior rendering effects and geometric accuracy. Through numerical and visual comparisons, our method achieves fast training and real-time rendering while maintaining competitive quality. △ Less

Submitted 14 May, 2025; v1 submitted 4 May, 2025; originally announced May 2025.

Comments: Accepted by ICMR 2025

arXiv:2505.01993 [pdf, ps, other]

Supermassive Black Holes with High Accretion Rates in Active Galactic Nuclei. XIV. Long-Duration High-Cadence Reverberation Mapping Results for 11 PG Quasars

Authors: Chen Hu, Zhu-Heng Yao, Yong-Jie Chen, Yu-Yang Songsheng, Yi-Lin Wang, Sen Yang, Hao Zhang, Wei-Jian Guo, Pu Du, Yan-Rong Li, Ming Xiao, Jun-Rong Liu, Hua-Rui Bai, Feng-Na Fang, Yi-Xin Fu, Yue-Chang Peng, Shuo Zhai, Jin-Ming Bai, Luis C. Ho, Michael S. Brotherton, Jesús Aceituno, Hartmut Winkler, Jian-Min Wang

Abstract: We report the results of a long-duration high-cadence reverberation mapping campaign of a second batch of 11 PG quasars using the 2.2m telescope at the Calar Alto Observatory. This follows a similar earlier study of another sample of 15 objects reported by Hu et al. (2021). Among the 11 PG quasars, 8 objects have the H$β$ time lags measured for the first time, while the other 3 objects were observ… ▽ More We report the results of a long-duration high-cadence reverberation mapping campaign of a second batch of 11 PG quasars using the 2.2m telescope at the Calar Alto Observatory. This follows a similar earlier study of another sample of 15 objects reported by Hu et al. (2021). Among the 11 PG quasars, 8 objects have the H$β$ time lags measured for the first time, while the other 3 objects were observed in previous campaigns, but only had highly uncertain H$β$-lag measurements. Long-term light curves are presented of photometric $V$-band, spectroscopic 5100 Å continuum, and the H$β$ emission line, lasting for $\sim$3--6 years with a cadence of $\sim$6--14 days. Accurate H$β$ time lags ranging from $\sim$20 to 150 days in the rest frame are obtained. The estimated virial masses of the central supermassive black holes range from $\sim$(3--300)$\times10^7 M_\odot$. Combining these results with those reported in Hu et al. (2021), we now have 26 PG quasars, with representative properties, having reliable H$β$ time-lag measurements from our long-duration high-cadence campaign. A tentative fit to the relation between the H$β$ time lag and the continuum luminosity for these 26 objects gives a slope of 0.53. △ Less

Submitted 4 May, 2025; originally announced May 2025.

Comments: 20 pages, 17 figures, accepted for publication in ApJS

arXiv:2505.01992 [pdf, ps, other]

Supermassive Black Holes with High Accretion Rates in Active Galactic Nuclei. XII. Reverberation Mapping Results for 15 PG Quasars from a Long-Duration High-Cadence Campaign

Authors: Chen Hu, Sha-Sha Li, Sen Yang, Zi-Xu Yang, Wei-Jian Guo, Dong-Wei Bao, Bo-Wei Jiang, Pu Du, Yan-Rong Li, Ming Xiao, Yu-Yang Songsheng, Zhe Yu, Jin-Ming Bai, Luis C. Ho, Michael S. Brotherton, Jesús Aceituno, Hartmut Winkler, Jian-Min Wang

Abstract: We present the first results from long-term high-cadence spectroscopic monitoring of 15 PG quasars with relatively strong Fe II emission as a part of a broader reverberation mapping campaign performed with the Calar Alto Observatory 2.2m telescope. The $V$-band, 5100 Å continuum, and H$β$ broad emission line light curves were measured for a set of quasars for between dozens to more than a hundred… ▽ More We present the first results from long-term high-cadence spectroscopic monitoring of 15 PG quasars with relatively strong Fe II emission as a part of a broader reverberation mapping campaign performed with the Calar Alto Observatory 2.2m telescope. The $V$-band, 5100 Å continuum, and H$β$ broad emission line light curves were measured for a set of quasars for between dozens to more than a hundred epochs from May 2017 to July 2020. Accurate time lags between the variations of the H$β$ broad line fluxes and the optical continuum strength are obtained for all 15 quasars, ranging from $17.0_{-3.2}^{+2.5}$ to $95.9_{-23.9}^{+7.1}$ days in the rest frame. The virial masses of the central supermassive black holes are derived for all 15 quasars, ranging between $0.50_{-0.19}^{+0.18}$ and $19.17_{-2.73}^{+2.98}$ in units of $10^7 M_\odot$. For 11 of the objects in our sample, this is the first reverberation analysis published. Of the rest, two objects have been the subject of previous reverberation studies, but we determine time lags for these that are only half as long as found in the earlier investigations, which had only been able to sample much more sparsely. The remaining two objects have previously been monitored with high sampling rates. Our results here are consistent with the earlier findings in the sense that the time lag and the line width vary inversely consistent with virialization. △ Less

Submitted 4 May, 2025; originally announced May 2025.

Comments: 21 pages, 20 figures, published in ApJS, March 2021

Journal ref: 2021, ApJS, 253, 20

arXiv:2505.01979 [pdf, other]

D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection

Authors: Chenran Zhao, Dianxi Shi, Mengzhu Wang, Jianqiang Xia, Huanhuan Yang, Songchang Jin, Shaowu Yang, Chunping Qiu

Abstract: Current Hierarchical Reinforcement Learning (HRL) algorithms excel in long-horizon sequential decision-making tasks but still face two challenges: delay effects and spurious correlations. To address them, we propose a causal HRL approach called D3HRL. First, D3HRL models delayed effects as causal relationships across different time spans and employs distributed causal discovery to learn these rela… ▽ More Current Hierarchical Reinforcement Learning (HRL) algorithms excel in long-horizon sequential decision-making tasks but still face two challenges: delay effects and spurious correlations. To address them, we propose a causal HRL approach called D3HRL. First, D3HRL models delayed effects as causal relationships across different time spans and employs distributed causal discovery to learn these relationships. Second, it employs conditional independence testing to eliminate spurious correlations. Finally, D3HRL constructs and trains hierarchical policies based on the identified true causal relationships. These three steps are iteratively executed, gradually exploring the complete causal chain of the task. Experiments conducted in 2D-MineCraft and MiniGrid show that D3HRL demonstrates superior sensitivity to delay effects and accurately identifies causal relationships, leading to reliable decision-making in complex environments. △ Less

Submitted 3 May, 2025; originally announced May 2025.

arXiv:2505.01292 [pdf, other]

Fine-grained Manipulation Attacks to Local Differential Privacy Protocols for Data Streams

Authors: Xinyu Li, Xuebin Ren, Shusen Yang, Liang Shi, Chia-Mu Yu

Abstract: Local Differential Privacy (LDP) enables massive data collection and analysis while protecting end users' privacy against untrusted aggregators. It has been applied to various data types (e.g., categorical, numerical, and graph data) and application settings (e.g., static and streaming). Recent findings indicate that LDP protocols can be easily disrupted by poisoning or manipulation attacks, which… ▽ More Local Differential Privacy (LDP) enables massive data collection and analysis while protecting end users' privacy against untrusted aggregators. It has been applied to various data types (e.g., categorical, numerical, and graph data) and application settings (e.g., static and streaming). Recent findings indicate that LDP protocols can be easily disrupted by poisoning or manipulation attacks, which leverage injected/corrupted fake users to send crafted data conforming to the LDP reports. However, current attacks primarily target static protocols, neglecting the security of LDP protocols in the streaming settings. Our research fills the gap by developing novel fine-grained manipulation attacks to LDP protocols for data streams. By reviewing the attack surfaces in existing algorithms, We introduce a unified attack framework with composable modules, which can manipulate the LDP estimated stream toward a target stream. Our attack framework can adapt to state-of-the-art streaming LDP algorithms with different analytic tasks (e.g., frequency and mean) and LDP models (event-level, user-level, w-event level). We validate our attacks theoretically and through extensive experiments on real-world datasets, and finally explore a possible defense mechanism for mitigating these attacks. △ Less

Submitted 2 May, 2025; originally announced May 2025.

arXiv:2505.01047 [pdf, other]

Transforming physics-informed machine learning to convex optimization

Authors: Letian Yi, Siyuan Yang, Ying Cui, Zhilu Lai

Abstract: Physics-Informed Machine Learning (PIML) offers a powerful paradigm of integrating data with physical laws to address important scientific problems, such as parameter estimation, inferring hidden physics, equation discovery, and state prediction, etc. However, PIML still faces many serious optimization challenges that significantly restrict its applications. In this study, we propose a comprehensi… ▽ More Physics-Informed Machine Learning (PIML) offers a powerful paradigm of integrating data with physical laws to address important scientific problems, such as parameter estimation, inferring hidden physics, equation discovery, and state prediction, etc. However, PIML still faces many serious optimization challenges that significantly restrict its applications. In this study, we propose a comprehensive framework that transforms PIML to convex optimization to overcome all these limitations, referred to as Convex-PIML. The linear combination of B-splines is utilized to approximate the data, promoting the convexity of the loss function. By replacing the non-convex components of the loss function with convex approximations, the problem is further converted into a sequence of successively refined approximated convex optimization problems. This conversion allows the use of well-established convex optimization algorithms, obtaining solutions effectively and efficiently. Furthermore, an adaptive knot optimization method based on error estimate is introduced to mitigate the spectral bias issue of PIML, further improving the performance. The proposed theoretically guaranteed framework is tested in scenarios with distinct types of physical prior. The results indicate that optimization problems are effectively solved in these scenarios, highlighting the potential of the framework for broad applications. △ Less

Submitted 15 May, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

Comments: 33 pages,14 figures

arXiv:2505.00409 [pdf]

Perceptual Implications of Automatic Anonymization in Pathological Speech

Authors: Soroosh Tayebi Arasteh, Saba Afza, Tri-Thien Nguyen, Lukas Buess, Maryam Parvin, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Hiu Ching Hung, Mahshad Lotfinia, Thomas Gorges, Elmar Noeth, Maria Schuster, Seung Hee Yang, Andreas Maier

Abstract: Automatic anonymization techniques are essential for ethical sharing of pathological speech data, yet their perceptual consequences remain understudied. This study presents the first comprehensive human-centered analysis of anonymized pathological speech, using a structured perceptual protocol involving ten native and non-native German listeners with diverse linguistic, clinical, and technical bac… ▽ More Automatic anonymization techniques are essential for ethical sharing of pathological speech data, yet their perceptual consequences remain understudied. This study presents the first comprehensive human-centered analysis of anonymized pathological speech, using a structured perceptual protocol involving ten native and non-native German listeners with diverse linguistic, clinical, and technical backgrounds. Listeners evaluated anonymized-original utterance pairs from 180 speakers spanning Cleft Lip and Palate, Dysarthria, Dysglossia, Dysphonia, and age-matched healthy controls. Speech was anonymized using state-of-the-art automatic methods (equal error rates in the range of 30-40%). Listeners completed Turing-style discrimination and quality rating tasks under zero-shot (single-exposure) and few-shot (repeated-exposure) conditions. Discrimination accuracy was high overall (91% zero-shot; 93% few-shot), but varied by disorder (repeated-measures ANOVA: p=0.007), ranging from 96% (Dysarthria) to 86% (Dysphonia). Anonymization consistently reduced perceived quality (from 83% to 59%, p<0.001), with pathology-specific degradation patterns (one-way ANOVA: p=0.005). Native listeners rated original speech slightly higher than non-native listeners (Delta=4%, p=0.199), but this difference nearly disappeared after anonymization (Delta=1%, p=0.724). No significant gender-based bias was observed. Critically, human perceptual outcomes did not correlate with automatic privacy or clinical utility metrics. These results underscore the need for listener-informed, disorder- and context-specific anonymization strategies that preserve privacy while maintaining interpretability, communicative functions, and diagnostic utility, especially for vulnerable populations such as children. △ Less

Submitted 1 May, 2025; originally announced May 2025.

arXiv:2505.00217 [pdf, other]

Robust Estimation and Inference in Hybrid Controlled Trials for Binary Outcomes: A Case Study on Non-Small Cell Lung Cancer

Authors: Jiajun Liu, Ke Zhu, Shu Yang, Xiaofei Wang

Abstract: Hybrid controlled trials (HCTs), which augment randomized controlled trials (RCTs) with external controls (ECs), are increasingly receiving attention as a way to address limited power, slow accrual, and ethical concerns in clinical research. However, borrowing from ECs raises critical statistical challenges in estimation and inference, especially for binary outcomes where hidden bias is harder to… ▽ More Hybrid controlled trials (HCTs), which augment randomized controlled trials (RCTs) with external controls (ECs), are increasingly receiving attention as a way to address limited power, slow accrual, and ethical concerns in clinical research. However, borrowing from ECs raises critical statistical challenges in estimation and inference, especially for binary outcomes where hidden bias is harder to detect and estimands such as risk difference, risk ratio, and odds ratio are of primary interest. We propose a novel framework that combines doubly robust estimators for various estimands under covariate shift of ECs with conformal selective borrowing (CSB) to address outcome incomparability. CSB uses conformal inference with nearest-neighbor-based conformal scores and their label-conditional extensions to perform finite-sample exact individual-level EC selection, addressing the limited information in binary outcomes. To ensure strict type I error rate control for testing treatment effects while gaining power, we use a Fisher randomization test with the CSB estimator as the test statistic. Extensive simulations demonstrate the robust performance of our methods. We apply our method to data from CALGB 9633 and the National Cancer Database to evaluate chemotherapy effects in Stage IB non-small-cell lung cancer patients and show that the proposed method effectively mitigates hidden bias introduced by full-borrowing approaches, strictly controls the type I error rate, and improves the power over RCT-only analysis. △ Less

Submitted 30 April, 2025; originally announced May 2025.

arXiv:2504.21838 [pdf, ps, other]

Learning Universal User Representations Leveraging Cross-domain User Intent at Snapchat

Authors: Clark Mingxuan Ju, Leonardo Neves, Bhuvesh Kumar, Liam Collins, Tong Zhao, Yuwei Qiu, Qing Dou, Yang Zhou, Sohail Nizam, Rengim Ozturk, Yvette Liu, Sen Yang, Manish Malik, Neil Shah

Abstract: The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be sh… ▽ More The development of powerful user representations is a key factor in the success of recommender systems (RecSys). Online platforms employ a range of RecSys techniques to personalize user experience across diverse in-app surfaces. User representations are often learned individually through user's historical interactions within each surface and user representations across different surfaces can be shared post-hoc as auxiliary features or additional retrieval sources. While effective, such schemes cannot directly encode collaborative filtering signals across different surfaces, hindering its capacity to discover complex relationships between user behaviors and preferences across the whole platform. To bridge this gap at Snapchat, we seek to conduct universal user modeling (UUM) across different in-app surfaces, learning general-purpose user representations which encode behaviors across surfaces. Instead of replacing domain-specific representations, UUM representations capture cross-domain trends, enriching existing representations with complementary information. This work discusses our efforts in developing initial UUM versions, practical challenges, technical choices and modeling and research directions with promising offline performance. Following successful A/B testing, UUM representations have been launched in production, powering multiple use cases and demonstrating their value. UUM embedding has been incorporated into (i) Long-form Video embedding-based retrieval, leading to 2.78% increase in Long-form Video Open Rate, (ii) Long-form Video L2 ranking, with 19.2% increase in Long-form Video View Time sum, (iii) Lens L2 ranking, leading to 1.76% increase in Lens play time, and (iv) Notification L2 ranking, with 0.87% increase in Notification Open Rate. △ Less

Submitted 9 June, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

Comments: Accepted to the industrial track of SIGIR'25

arXiv:2504.21477 [pdf, other]

A Comprehensive Survey of Electrical Stimulation Haptic Feedback in Human-Computer Interaction

Authors: Simin Yang, Xian Wang, Yang Li, Lik-Hang Lee, Tristan Camille Braud, Pan Hui

Abstract: Haptic perception and feedback play a pivotal role in interactive experiences, forming an essential component of human-computer interaction (HCI). In recent years, the field of haptic interaction has witnessed significant advancements, particularly in the area of electrical haptic feedback, driving innovation across various domains. To gain a comprehensive understanding of the current state of res… ▽ More Haptic perception and feedback play a pivotal role in interactive experiences, forming an essential component of human-computer interaction (HCI). In recent years, the field of haptic interaction has witnessed significant advancements, particularly in the area of electrical haptic feedback, driving innovation across various domains. To gain a comprehensive understanding of the current state of research and the latest developments in electrical haptic interaction, this study systematically reviews the literature in this area. Our investigation covers key aspects including haptic devices, haptic perception mechanisms, the comparison and integration of electrical haptic feedback with other feedback modalities, and their diverse applications. Specifically, we conduct a systematic analysis of 110 research papers to explore the forefront of electrical haptic feedback, providing insights into its latest trends, challenges, and future directions. △ Less

Submitted 7 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

Comments: 23 pages, 7 figures

arXiv:2504.21357 [pdf, other]

Mining and Intervention of Social Networks Information Cocoon Based on Multi-Layer Network Community Detection

Authors: Suwen Yang, Lei Shi

Abstract: With the rapid development of information technology and the widespread utilization of recommendation algorithms, users are able to access information more conveniently, while the content they receive tends to be homogeneous. Homogeneous viewpoints and preferences tend to cluster users into sub-networks, leading to group polarization and increasing the likelihood of forming information cocoons. Th… ▽ More With the rapid development of information technology and the widespread utilization of recommendation algorithms, users are able to access information more conveniently, while the content they receive tends to be homogeneous. Homogeneous viewpoints and preferences tend to cluster users into sub-networks, leading to group polarization and increasing the likelihood of forming information cocoons. This paper aims to handle information cocoon phenomena in debates on social media. In order to investigate potential user connections, we construct a double-layer network that incorporates two dimensions: relational ties and feature-based similarity between users. Based on the structure of the multi-layer network, we promote two graph auto-encoder (GAE) based community detection algorithms, which can be applied to the partition and determination of information cocoons. This paper tests these two algorithms on Cora, Citeseer, and synthetic datasets, comparing them with existing multi-layer network unsupervised community detection algorithms. Numerical experiments illustrate that the algorithms proposed in this paper significantly improve prediction accuracy indicator NMI (normalized mutual information) and network topology indicator Q. Additionally, an influence-based intervention measure on which algorithms can operate is proposed. Through the Markov states transition model, we simulate the intervention effects, which illustrate that our community detection algorithms play a vital role in partitioning and determining information cocoons. Simultaneously, our intervention strategy alleviates the polarization of viewpoints and the formation of information cocoons with minimal intervention effort. △ Less

Submitted 6 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

Comments: 43 pages, 23 figures

MSC Class: 62-11 ACM Class: F.2.2

arXiv:2504.21096 [pdf, other]

Search for the Optical Counterpart of Einstein Probe Discovered Fast X-ray Transients from Lulin Observatory

Authors: Amar Aryan, Ting-Wan Chen, Sheng Yang, James H. Gillanders, Albert K. H. Kong, S. J. Smartt, Heloise F. Stevance, Yi-Jung Yang, Aysha Aamer, Rahul Gupta, Lele Fan, Wei-Jie Hou, Hsiang-Yao Hsiao, Amit Kumar, Cheng-Han Lai, Meng-Han Lee, Yu-Hsing Lee, Hung-Chin Lin, Chi-Sheng Lin, Chow-Choong Ngeow, Matt Nicholl, Yen-Chen Pan, Shashi Bhushan Pandey, Aiswarya Sankar. K, Shubham Srivastav , et al. (2 additional authors not shown)

Abstract: The launch of the Einstein Probe (EP) mission has revolutionized the detection and follow-up observations of fast X-ray transients (FXTs) by providing prompt and timely access to their precise localizations. In the first year of its operation, the EP mission reports the discovery of 72 high signal-to-noise FXTs. Subjected to the visibility in the sky and weather conditions, we search for the optic… ▽ More The launch of the Einstein Probe (EP) mission has revolutionized the detection and follow-up observations of fast X-ray transients (FXTs) by providing prompt and timely access to their precise localizations. In the first year of its operation, the EP mission reports the discovery of 72 high signal-to-noise FXTs. Subjected to the visibility in the sky and weather conditions, we search for the optical counterparts of 42 EP-discovered FXTs from the Lulin observatory. We successfully detect the optical counterparts of 12 FXTs, and five of those are first discovered by us from the Lulin observatory. We find that the optical counterparts are generally faint (r>20 mag) and decline rapidly (>0.5 mag per day). We also find that 11 out of 42 FXTs had shown direct evidence of their association with Gamma-Ray Bursts (GRBs) through significant temporal and spatial overlapping. Furthermore, the luminosities and redshifts of FXTs with confirm optical counterparts in our observations are fully consistent with the faintest end of the GRB population. However, the non-detection of any associated optical counterpart with a significant fraction of FXTs suggests that EP FXTs are likely a subset of so-called `Dark' FXTs, similar to `Dark' GRBs. Additionally, the luminosities of two FXTs were also consistent with jetted tidal disruption events (TDEs). However, their luminosities differ significantly from those of typical supernova shock breakout or kilonova emissions. Thus, we conclude that a significant fraction of EP-discovered FXTs are associated with events having relativistic jets; either a GRB or a jetted TDE. △ Less

Submitted 29 April, 2025; originally announced April 2025.

Comments: The manuscript has 14 Figures, 7 Tables and a total of 49 pages (including Appendix). Submitted to ApJS

arXiv:2504.20530 [pdf, other]

Beyond the Horizon: Decoupling UAVs Multi-View Action Recognition via Partial Order Transfer

Authors: Wenxuan Liu, Xian Zhong, Zhuo Zhou, Siyuan Yang, Chia-Wen Lin, Alex Chichung Kot

Abstract: Action recognition in unmanned aerial vehicles (UAVs) poses unique challenges due to significant view variations along the vertical spatial axis. Unlike traditional ground-based settings, UAVs capture actions from a wide range of altitudes, resulting in considerable appearance discrepancies. We introduce a multi-view formulation tailored to varying UAV altitudes and empirically observe a partial o… ▽ More Action recognition in unmanned aerial vehicles (UAVs) poses unique challenges due to significant view variations along the vertical spatial axis. Unlike traditional ground-based settings, UAVs capture actions from a wide range of altitudes, resulting in considerable appearance discrepancies. We introduce a multi-view formulation tailored to varying UAV altitudes and empirically observe a partial order among views, where recognition accuracy consistently decreases as the altitude increases. This motivates a novel approach that explicitly models the hierarchical structure of UAV views to improve recognition performance across altitudes. To this end, we propose the Partial Order Guided Multi-View Network (POG-MVNet), designed to address drastic view variations by effectively leveraging view-dependent information across different altitude levels. The framework comprises three key components: a View Partition (VP) module, which uses the head-to-body ratio to group views by altitude; an Order-aware Feature Decoupling (OFD) module, which disentangles action-relevant and view-specific features under partial order guidance; and an Action Partial Order Guide (APOG), which leverages the partial order to transfer informative knowledge from easier views to support learning in more challenging ones. We conduct experiments on Drone-Action, MOD20, and UAV datasets, demonstrating that POG-MVNet significantly outperforms competing methods. For example, POG-MVNet achieves a 4.7% improvement on Drone-Action dataset and a 3.5% improvement on UAV dataset compared to state-of-the-art methods ASAT and FAR. The code for POG-MVNet will be made available soon. △ Less

Submitted 29 April, 2025; originally announced April 2025.

Comments: 11 pages

arXiv:2504.20156 [pdf, other]

Terahertz Landau level spectroscopy of Dirac fermions in millimeter-scale twisted bilayer graphene

Authors: Benjamin F. Mead, Spenser Talkington, An-Hsi Chen, Debarghya Mallick, Zhaodong Chu, Xingyue Han, Seong-Jun Yang, Cheol-Joo Kim, Matthew Brahlek, Eugene J. Mele, Liang Wu

Abstract: Exotic electronic physics including correlated insulating states and fractional Chern insulators have been observed in twisted bilayer graphene in a magnetic field when the Fermi velocity vanishes, however a question remains as to the stability of these states which is controlled by the gap to the first excited state. Free-space terahertz magneto-optics can directly probe the gap to charge excitat… ▽ More Exotic electronic physics including correlated insulating states and fractional Chern insulators have been observed in twisted bilayer graphene in a magnetic field when the Fermi velocity vanishes, however a question remains as to the stability of these states which is controlled by the gap to the first excited state. Free-space terahertz magneto-optics can directly probe the gap to charge excitations which bounds the stability of electronic states, but this measurement has thus-far been inaccessible due to the micron size of twisted bilayer graphene samples, while the wavelength of terahertz light is up to a millimeter. Here we leverage advances in fabrication to create twisted bilayer graphene samples over 5 mm x 5 mm in size with a uniform twist angle and study the magnetic field dependence of the cyclotron resonance by a complex Faraday rotation experiment in p-doped large angle twisted bilayer graphene. These measurements directly probe charge excitations in inter-Landau level transitions and determine the Fermi velocity as a function of twist angle. △ Less

Submitted 28 April, 2025; originally announced April 2025.

Comments: 7+2 pages, 4+2 figures

Showing 201–250 of 5,434 results for author: Yang, S