Showing 201–250 of 6,315 results for author: Sun, Y

  1. arXiv:2503.19267  [pdf, other]

    cs.LG cs.AI

    NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios

    Authors: Songyi Gao, Zuolin Tu, Rong-Jun Qin, Yi-Hao Sun, Xiong-Hui Chen, Yang Yu

    Abstract: Offline reinforcement learning (RL) aims to learn from historical data without requiring (costly) access to the environment. To facilitate offline RL research, we previously introduced NeoRL, which highlighted that datasets from real-world tasks are often conservative and limited. With years of experience applying offline RL to various domains, we have identified additional real-world challenges.…

    Submitted 24 March, 2025; originally announced March 2025.

  2. arXiv:2503.18874  [pdf, other]

    cs.LG cs.CV

    A semantic communication-based workload-adjustable transceiver for wireless AI-generated content (AIGC) delivery

    Authors: Runze Cheng, Yao Sun, Lan Zhang, Lei Feng, Lei Zhang, Muhammad Ali Imran

    Abstract: With the significant advances in generative AI (GAI) and the proliferation of mobile devices, providing high-quality AI-generated content (AIGC) services via wireless networks is becoming the future direction. However, the primary challenges of AIGC service delivery in wireless networks lie in unstable channels, limited bandwidth resources, and unevenly distributed computational resources. In this…

    Submitted 24 March, 2025; originally announced March 2025.

  3. arXiv:2503.18794  [pdf, other]

    cs.CV

    NexusGS: Sparse View Synthesis with Epipolar Depth Priors in 3D Gaussian Splatting

    Authors: Yulong Zheng, Zicheng Jiang, Shengfeng He, Yandu Sun, Junyu Dong, Huaidong Zhang, Yong Du

    Abstract: Neural Radiance Field (NeRF) and 3D Gaussian Splatting (3DGS) have noticeably advanced photo-realistic novel view synthesis using images from densely spaced camera viewpoints. However, these methods struggle in few-shot scenarios due to limited supervision. In this paper, we present NexusGS, a 3DGS-based approach that enhances novel view synthesis from sparse-view images by directly embedding dept…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: This paper is accepted by CVPR 2025

  4. Observation of the decay $ψ(3686)\rightarrow Σ^{0}\barΣ^{0}ω$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (695 additional authors not shown)

    Abstract: Using a dataset of $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of the decay $ψ(3686)\toΣ^{0}\barΣ^{0}ω$ with a statistical significance of 8.9$σ$. The measured branching fraction is $(1.24 \pm 0.16_{\textrm{stat}} \pm 0.11_{\textrm{sys}}) \times 10^{-5}$, where the first uncertainty i…

    Submitted 24 March, 2025; originally announced March 2025.

  5. arXiv:2503.18034  [pdf, other]

    cs.CV cs.CL

    Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models

    Authors: Qiao Liang, Yanjiang Liu, Ben He, Yaojie Lu, Hongyu Lin, Jia Zheng, Xianpei Han, Le Sun, Yingfei Sun

    Abstract: Does the prior knowledge of the vision encoder constrain the capability boundary of Multi-modal Large Language Models (MLLMs)? While most existing research treats MLLMs as unified systems optimized through end-to-end training, the impact of vision encoder's prior knowledge is seldom investigated. In this work, we introduce a novel metric, $Rank_e$, to quantify the effect of the vision encoder's pr…

    Submitted 23 March, 2025; originally announced March 2025.

  6. Cache-Aware Cooperative Multicast Beamforming in Dynamic Satellite-Terrestrial Networks

    Authors: Shuo Yuan, Yaohua Sun, Mugen Peng

    Abstract: With the burgeoning demand for data-intensive services, satellite-terrestrial networks (STNs) face increasing backhaul link congestion, deteriorating user quality of service (QoS), and escalating power consumption. Cache-aided STNs are acknowledged as a promising paradigm for accelerating content delivery to users and alleviating the load of backhaul links. However, the dynamic nature of low earth…

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted by IEEE Transactions on Vehicular Technology

  7. Satellite-Terrestrial Integrated Fog Networks: Architecture, Technologies, and Challenges

    Authors: Shuo Yuan, Mugen Peng, Yaohua Sun

    Abstract: In the evolution of sixth-generation (6G) mobile communication networks, satellite-terrestrial integrated networks emerge as a promising paradigm, characterized by their wide coverage and reliable transmission capabilities. By integrating with cloud-based terrestrial mobile communication networks, the limitations of low Earth orbit (LEO) satellites, such as insufficient onboard computing capabilit…

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted by IEEE Wireless Communications

  8. arXiv:2503.17901  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Strain tuning of charge density wave and Mott-insulating states in monolayer VTe2

    Authors: Wenqian Tu, Run Lv, Dingfu Shao, Yuping Sun, Wenjian Lu

    Abstract: Monolayer vanadium ditelluride (VTe2) exhibits a $2\sqrt{3}\times2\sqrt{3}$ charge density wave (CDW) order intertwined with a Mott-insulating state. However, the physical mechanisms driving the emergence of CDW order and Mott-insulating state are still not well understood. In this study, we systematically investigate the electronic band structure, phonon dispersion, and electron-phonon coupling (EPC) of…

    Submitted 6 April, 2025; v1 submitted 22 March, 2025; originally announced March 2025.

  9. arXiv:2503.17777  [pdf, ps, other]

    eess.IV cs.CV

    Hierarchy-Aware and Channel-Adaptive Semantic Communication for Bandwidth-Limited Data Fusion

    Authors: Lei Guo, Wei Chen, Yuxuan Sun, Bo Ai, Nikolaos Pappas, Tony Quek

    Abstract: Obtaining high-resolution hyperspectral images (HR-HSI) is costly and data-intensive, making it necessary to fuse low-resolution hyperspectral images (LR-HSI) with high-resolution RGB images (HR-RGB) for practical applications. However, traditional fusion techniques, which integrate detailed information into the reconstruction, significantly increase bandwidth consumption compared to directly tran…

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted by the WCL

  10. arXiv:2503.17709  [pdf, other]

    cs.CV cs.AI

    GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration

    Authors: Yuchen Sun, Shanhui Zhao, Tao Yu, Hao Wen, Samith Va, Mengwei Xu, Yuanchun Li, Chongyang Zhang

    Abstract: GUI agents hold significant potential to enhance the experience and efficiency of human-device interaction. However, current methods face challenges in generalizing across applications (apps) and tasks, primarily due to two fundamental limitations in existing datasets. First, these datasets overlook developer-induced structural variations among apps, limiting the transferability of knowledge acros…

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: CVPR 2025

  11. arXiv:2503.17551  [pdf, other]

    cs.MM cs.AI cs.CV cs.SD eess.AS

    Audio-Enhanced Vision-Language Modeling with Latent Space Broadening for High Quality Data Expansion

    Authors: Yu Sun, Yin Li, Ruixiao Sun, Chunhui Liu, Fangming Zhou, Ze Jin, Linjie Wang, Xiang Shen, Zhuolin Hao, Hongyu Xiong

    Abstract: Transformer-based multimodal models are widely used in industrial-scale recommendation, search, and advertising systems for content understanding and relevance ranking. Enhancing labeled training data quality and cross-modal fusion significantly improves model performance, influencing key metrics such as quality view rates and ad revenue. High-quality annotations are crucial for advancing content…

    Submitted 21 March, 2025; originally announced March 2025.

  12. arXiv:2503.17371  [pdf]

    physics.soc-ph

    A Review of Urban Resilience Frameworks: Transferring Knowledge to Enhance Pandemic Resilience

    Authors: Yue Sun, Ryan Weightman, Timur Dogan, Samitha Samaranayake

    Abstract: Urbanization is rapidly increasing, with urban populations expected to grow significantly by 2050, particularly in developing regions. This expansion brings challenges related to chronic stresses and acute shocks, such as the COVID-19 pandemic, which has underscored the critical role of urban form in a city's capacity to manage public health crises. Despite the heightened interest in urban resilie…

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: Urban resilience, urban form, pandemic resilience, COVID-19, urban planning, analysis frameworks

  13. arXiv:2503.17165  [pdf, other]

    hep-ex

    Stringent test of $CP$ symmetry in $Σ^+$ hyperon decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (680 additional authors not shown)

    Abstract: The non-leptonic two-body weak decays $Σ^{+} \to p π^{0}$ and $\barΣ^{-} \to \bar{p} π^{0}$ are investigated, utilizing $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events and $(2.7124\pm0.0143)\times10^{9}$ $ψ(3686)$ events collected by BESIII experiment. The precision of the weak-decay parameters for the decays $Σ^{+} \to p π^{0}$ ($α_{0}$) and $\barΣ^{-} \to \bar{p} π^{0}$ ($\barα_{0}$) is improved b…

    Submitted 21 March, 2025; originally announced March 2025.

  14. arXiv:2503.17126  [pdf, other]

    cs.CL cs.LG

    Modifying Large Language Model Post-Training for Diverse Creative Writing

    Authors: John Joon Young Chung, Vishakh Padmakumar, Melissa Roemmele, Yuqian Sun, Max Kreminski

    Abstract: As creative writing tasks do not have singular correct answers, large language models (LLMs) trained to perform these tasks should be able to generate diverse valid outputs. However, LLM post-training often focuses on improving generation quality but neglects to facilitate output diversity. Hence, in creative writing generation, we investigate post-training approaches to promote both output divers…

    Submitted 21 March, 2025; originally announced March 2025.

  15. arXiv:2503.17005  [pdf]

    cs.RO eess.SY

    Autonomous Exploration-Based Precise Mapping for Mobile Robots through Stepwise and Consistent Motions

    Authors: Muhua Zhang, Lei Ma, Ying Wu, Kai Shen, Yongkui Sun, Henry Leung

    Abstract: This paper presents an autonomous exploration framework. It is designed for indoor ground mobile robots that utilize laser Simultaneous Localization and Mapping (SLAM), ensuring process completeness and precise mapping results. For frontier search, the local-global sampling architecture based on multiple Rapidly Exploring Random Trees (RRTs) is employed. Traversability checks during RRT expansion…

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 8 pages, 11 figures. This work has been submitted to the IEEE for possible publication

  16. arXiv:2503.16910  [pdf, other]

    cs.CV

    Salient Object Detection in Traffic Scene through the TSOD10K Dataset

    Authors: Yu Qiu, Yuhang Sun, Jie Mei, Lin Xiao, Jing Xu

    Abstract: Traffic Salient Object Detection (TSOD) aims to segment the objects critical to driving safety by combining semantic (e.g., collision risks) and visual saliency. Unlike SOD in natural scene images (NSI-SOD), which prioritizes visually distinctive regions, TSOD emphasizes the objects that demand immediate driver attention due to their semantic impact, even with low visual contrast. This dual criter…

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 12 pages, 12 figures

  17. arXiv:2503.16867  [pdf, other]

    cs.CV

    ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

    Authors: Kaisi Guan, Zhengfeng Lai, Yuchong Sun, Peng Zhang, Wei Liu, Kieran Liu, Meng Cao, Ruihua Song

    Abstract: Precisely evaluating semantic alignment between text prompts and generated videos remains a challenge in Text-to-Video (T2V) Generation. Existing text-to-video alignment metrics like CLIPScore only generate coarse-grained scores without fine-grained alignment details, failing to align with human preference. To address this limitation, we propose ETVA, a novel Evaluation method of Text-to-Video Ali…

    Submitted 21 March, 2025; originally announced March 2025.

  18. arXiv:2503.16815  [pdf, other]

    cs.DC

    DeFT: Mitigating Data Dependencies for Flexible Communication Scheduling in Distributed Training

    Authors: Lin Meng, Yuzhong Sun

    Abstract: Communication scheduling aims to reduce communication bottlenecks in data parallel training (DP) by maximizing the overlap between computation and communication. However, existing schemes fall short due to three main issues: (1) hard data dependencies break some overlapping between communication and computation; (2) high coverage rates impair further improvement on performance; (3) imbalanced comm…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 14 pages, 16 figures

  19. arXiv:2503.16755  [pdf, other]

    cs.DS cs.LG

    Fast online node labeling with graph subsampling

    Authors: Yushen Huang, Ertai Luo, Reza Babenezhad, Yifan Sun

    Abstract: Large data applications rely on storing data in massive, sparse graphs with millions to trillions of nodes. Graph-based methods, such as node prediction, aim for computational efficiency regardless of graph size. Techniques like localized approximate personalized page rank (APPR) solve sparse linear systems with complexity independent of graph size, but is in terms of the maximum node degree, whic…

    Submitted 20 March, 2025; originally announced March 2025.

  20. arXiv:2503.16737  [pdf, other]

    stat.ML cs.LG math.PR math.ST

    Optimal Nonlinear Online Learning under Sequential Price Competition via s-Concavity

    Authors: Daniele Bracale, Moulinath Banerjee, Cong Shi, Yuekai Sun

    Abstract: We consider price competition among multiple sellers over a selling horizon of $T$ periods. In each period, sellers simultaneously offer their prices and subsequently observe their respective demand that is unobservable to competitors. The demand function for each seller depends on all sellers' prices through a private, unknown, and nonlinear relationship. To address this challenge, we propose a s…

    Submitted 20 March, 2025; originally announced March 2025.

  21. arXiv:2503.16550  [pdf, other]

    cs.CL

    Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization

    Authors: Yudao Sun, Juan Yin, Juan Zhao, Fan Zhang, Yongheng Liu, Hongji Chen

    Abstract: Neural network language models (LMs) are confronted with significant challenges in generalization and robustness. Currently, many studies focus on improving either generalization or robustness in isolation, without methods addressing both aspects simultaneously, which presents a significant challenge in developing LMs that are both robust and generalized. In this paper, we propose a bi-stage optim…

    Submitted 19 March, 2025; originally announced March 2025.

  22. arXiv:2503.16544  [pdf, other]

    cs.CL cs.AI cs.HC

    Causal Discovery and Counterfactual Reasoning to Optimize Persuasive Dialogue Policies

    Authors: Donghuo Zeng, Roberto Legaspi, Yuewen Sun, Xinshuai Dong, Kazushi Ikeda, Peter Spirtes, Kun Zhang

    Abstract: Tailoring persuasive conversations to users leads to more effective persuasion. However, existing dialogue systems often struggle to adapt to dynamically evolving user states. This paper presents a novel method that leverages causal discovery and counterfactual reasoning for optimizing system persuasion capability and outcomes. We employ the Greedy Relaxation of the Sparsest Permutation (GRaSP) al…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 21 pages, 8 figures

  23. arXiv:2503.16402  [pdf, other]

    cs.AI cs.CL cs.LG

    The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

    Authors: Yifan Sun, Han Wang, Dongbai Li, Gang Wang, Huan Zhang

    Abstract: Benchmark Data Contamination (BDC)-the inclusion of benchmark testing samples in the training set-has raised increasing concerns in Large Language Model (LLM) evaluation, leading to falsely inflated performance estimates and undermining evaluation reliability. To address this, researchers have proposed various mitigation strategies to update existing benchmarks, including modifying original questi…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 23 pages

  24. arXiv:2503.16224  [pdf, other]

    cond-mat.supr-con

    Upper critical fields in high-$ T_{\rm{c}} $ superconductors

    Authors: Wei Wei, Yuling Xiang, Qiang Hou, Yue Sun, Zhixiang Shi

    Abstract: Since the discovery of high-temperature superconductivity in cuprates, understanding the unconventional pairing mechanism has remained one of the most significant challenges. The upper critical field ($H_{\rm{c2}}$) is an essential parameter for obtaining information on the pair-breaking mechanism, coherence length $ξ$, and pairing symmetry, all of which are crucial for understanding unconventiona…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: invited review; 18 pages, 14 figures

    Journal ref: J. Phys.: Condens. Matter 37 143003 (2025)

  25. arXiv:2503.16081  [pdf, other]

    cs.LG cs.IR

    OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning

    Authors: Zhiyuan Liu, Yuting Zhang, Feng Liu, Changwang Zhang, Ying Sun, Jun Wang

    Abstract: Multimodal Large Language Models (MLLMs) have gained significant traction for their ability to process diverse input data types and generate coherent, contextually relevant outputs across various applications. While supervised fine-tuning (SFT) has been the predominant approach to enhance MLLM capabilities in task-specific optimization, it often falls short in fostering crucial generalized reasoni…

    Submitted 28 March, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

  26. arXiv:2503.16070  [pdf, other]

    hep-ex hep-ph

    Search for the radiative leptonic decay $D^+\toγe^+ν_e$ with Deep Learning

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (680 additional authors not shown)

    Abstract: Using 20.3$~\rm fb^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773$~\rm GeV$ with the BESIII detector, we report an improved search for the radiative leptonic decay $D^+\toγe^+ν_e$. An upper limit on its partial branching fraction for photon energies $E_γ>10~\rm MeV$ is determined to be $1.2\times10^{-5}$ at 90\% confidence level, which excludes most current theor…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 15 pages, 6 figures

  27. arXiv:2503.15590  [pdf, other]

    astro-ph.GA

    JADES and SAPPHIRES: Galaxy Metamorphosis Amidst a Huge, Luminous Emission-line Region

    Authors: Francesco D'Eugenio, Jakob M. Helton, Kevin Hainline, Fengwu Sun, Roberto Maiolino, Pablo G. Pérez-González, Ignas Juodžbalis, Santiago Arribas, Andrew J. Bunker, Stefano Carniani, Emma Curtis-Lake, Eiichi Egami, Daniel J. Eisenstein, Benjamin D. Johnson, Brant Robertson, Sandro Tacchella, Christopher N. A. Willmer, Chris Willott, William M. Baker, A. Lola Danhaive, Qiao Duan, Yoshinobu Fudamoto, Gareth C. Jones, Xiaojing Lin, Weizhe Liu, et al. (10 additional authors not shown)

    Abstract: We report the discovery of a remarkably large and luminous line-emitting nebula extending on either side of the Balmer-break galaxy JADES-GS-518794 at z=5.89, detected with JADES JWST/NIRCam imaging in [O III]$λλ$4959,5007 and H$α$ and spectroscopically confirmed with NIRCam/WFSS thanks to the pure-parallel SAPPHIRES programme. The end-to-end velocity offset is $Δv=830\pm130$ km s$^{-1}$. Nebulae…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 21 pages, 12 figures, submitted to MNRAS

  28. arXiv:2503.15426  [pdf, other]

    cs.CV cs.AI

    Visual Position Prompt for MLLM based Visual Grounding

    Authors: Wei Tang, Yanpeng Sun, Qinying Gu, Zechao Li

    Abstract: Although Multimodal Large Language Models (MLLMs) excel at various image-related tasks, they encounter challenges in precisely aligning coordinates with spatial information within images, particularly in position-aware tasks such as visual grounding. This limitation arises from two key factors. First, MLLMs lack explicit spatial references, making it difficult to associate textual descriptions wit…

    Submitted 24 March, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

  29. arXiv:2503.15266  [pdf, other]

    cond-mat.supr-con cond-mat.str-el

    Strong correlation between $H$-linear magnetoresistance and strange metal in FeSe superconductor

    Authors: Xinyue Wang, Yue Sun, Wei Wei, Qiang Hou, Nan Zhou, Yufeng Zhang, Zhixiang Shi

    Abstract: In strange metals, a strong and anomalous scattering effect exists and increases linearly with temperature. In FeSe, we observed that the temperature dependence of resistivity exhibits non-Fermi liquid behavior in two regions below and above a critical pressure, $p_\text{c}$$\sim$2 GPa. As pressure increases, a transition from quadratic to nonsaturating magnetoresistance is observed, with a distin…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 9 pages, 8 figures

    Journal ref: Phys. Rev. B 111, L100501 (2025)

  30. arXiv:2503.15197  [pdf, other]

    cs.CV

    Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

    Authors: Feifei Li, Mi Zhang, Yiming Sun, Min Yang

    Abstract: Text-to-image diffusion models have achieved state-of-the-art results in synthesis tasks; however, there is a growing concern about their potential misuse in creating harmful content. To mitigate these risks, post-hoc model intervention techniques, such as concept unlearning and safety guidance, have been developed. However, fine-tuning model weights or adapting the hidden states of the diffusion…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: CVPR25

  31. arXiv:2503.14919  [pdf, other]

    cs.CV

    GenM$^3$: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation

    Authors: Junyu Shi, Lijiang Liu, Yong Sun, Zhiyuan Zhang, Jinni Zhou, Qiang Nie

    Abstract: Scaling up motion datasets is crucial to enhance motion generation capabilities. However, training on large-scale multi-source datasets introduces data heterogeneity challenges due to variations in motion content. To address this, we propose Generative Pretrained Multi-path Motion Model (GenM$^3$), a comprehensive framework designed to learn unified motion representations. GenM$^3$ comprises two c…

    Submitted 19 March, 2025; originally announced March 2025.

  32. arXiv:2503.14824  [pdf, other]

    cs.CV

    Prototype Perturbation for Relaxing Alignment Constraints in Backward-Compatible Learning

    Authors: Zikun Zhou, Yushuai Sun, Wenjie Pei, Xin Li, Yaowei Wang

    Abstract: The traditional paradigm to update retrieval models requires re-computing the embeddings of the gallery data, a time-consuming and computationally intensive process known as backfilling. To circumvent backfilling, Backward-Compatible Learning (BCL) has been widely explored, which aims to train a new model compatible with the old one. Many previous works focus on effectively aligning the embeddings…

    Submitted 18 March, 2025; originally announced March 2025.

  33. arXiv:2503.14355  [pdf, other]

    cs.CV

    MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts

    Authors: Runqi Meng, Sifan Song, Pengfei Jin, Yujin Oh, Lin Teng, Yulin Wang, Yiqun Sun, Ling Chen, Xiang Li, Quanzheng Li, Ning Guo, Dinggang Shen

    Abstract: Accurate tumor segmentation is crucial for cancer diagnosis and treatment. While foundation models have advanced general-purpose segmentation, existing methods still struggle with: (1) limited incorporation of medical priors, (2) imbalance between generic and tumor-specific features, and (3) high computational costs for clinical adaptation. To address these challenges, we propose MAST-Pro (Mixture…

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 10 pages, 2 figures

  34. arXiv:2503.13960  [pdf]

    cond-mat.soft cond-mat.mtrl-sci

    Dynamical Classification of Supercooled Liquids: Critical Cooling Rates and Entropic Signatures

    Authors: B Zhang, M. Zhang, D. Y. Sun, X. G. Gong

    Abstract: Using molecular dynamics simulations, we systematically investigate supercooled liquids formed at cooling rates below and above the critical cooling rate (CCR). By analyzing the distribution of short-time averaged potential energies (DoPE) and crystallization behaviors, we identify two distinct dynamical regimes in supercooled liquids: the glass-forming regime (GFR) and the crystal-forming regime…

    Submitted 6 April, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  35. arXiv:2503.13898  [pdf, other]

    quant-ph

    A metropolitan-scale trapped-ion quantum network node with hybrid multiplexing enhancements

    Authors: Z. -B. Cui, Z. -Q. Wang, P. -C. Lai, Y. Wang, J. -X. Shi, P. -Y. Liu, Y. -D. Sun, Z. -C. Tian, Y. -B. Liang, B. -X. Qi, Y. -Y. Huang, Z. -C. Zhou, Y. -K. Wu, Y. Xu, L. -M. Duan, Y. -F. Pu

    Abstract: Quantum network and quantum repeater are promising ways to scale up a quantum information system to enable various applications with unprecedented performance. As a current bottleneck of building a long-distance quantum network, the distribution rate of heralded entanglement between remote network nodes is typically much lower than the decoherence rate of each local node, which obstructs the imple…

    Submitted 18 March, 2025; originally announced March 2025.

    Comments: 15 pages, 13 figures

  36. arXiv:2503.13866  [pdf, other]

    eess.SP

    Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

    Authors: Sirin Chakraborty, Yin Sun

    Abstract: In this paper, we study the optimal timing for pilot and data transmissions to maximize effective throughput, also known as goodput, over a wireless fading channel. The receiver utilizes the received pilot signal and its Age of Information (AoI), termed the Age of Channel State Information (AoCSI), to estimate the channel state. Based on this estimation, the transmitter selects an appropriate modu…

    Submitted 17 March, 2025; originally announced March 2025.

  37. arXiv:2503.13139  [pdf, other]

    cs.CV cs.AI cs.CL eess.IV

    Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding

    Authors: Weiyu Guo, Ziyang Chen, Shaoguang Wang, Jianxiang He, Yijie Xu, Jinhui Ye, Ying Sun, Hui Xiong

    Abstract: Understanding long video content is a complex endeavor that often relies on densely sampled frame captions or end-to-end feature selectors, yet these techniques commonly overlook the logical relationships between textual queries and visual elements. In practice, computational constraints necessitate coarse frame subsampling, a challenge analogous to ``finding a needle in a haystack.'' To address t…

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 18 pages, under review

  38. arXiv:2503.12978  [pdf, other]

    cs.LG

    Enhancing Job Salary Prediction with Disentangled Composition Effect Modeling: A Neural Prototyping Approach

    Authors: Yang Ji, Ying Sun, Hengshu Zhu

    Abstract: In the era of the knowledge economy, understanding how job skills influence salary is crucial for promoting recruitment with competitive salary systems and aligned salary expectations. Despite efforts on salary prediction based on job positions and talent demographics, there still lacks methods to effectively discern the set-structured skills' intricate composition effect on job salary. While rece…

    Submitted 8 April, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

  39. arXiv:2503.12887  [pdf]

    cond-mat.mtrl-sci

    Weyl Fermion Manipulation through Magnetic Transitions in the Ferromagnetic Non-Centrosymmetric Weyl semimetal PrAlSi

    Authors: K. P. Wang, W. J. Shi, W. Z. Cao, X. T. Yang, Z. Y. Lv, C. Peng, C. Chen, D. F. Liu, H. F. Yang, L. X. Yang, M. Lyu, P. J. Sun, E. K. Liu, M. Ye, Y. L. Chen, Y. Sun, Y. P. Qi, Z. K. Liu

    Abstract: PrAlSi, a non-centrosymmetric ferromagnetic Weyl semimetal candidate with a Curie temperature of 17.8K, offers a unique platform for exploring the interplay of symmetry breaking and topological electronic structures. Up to now, the Weyl fermion distribution as well as their evolution across the ferromagnetic to paramagnetic phase transition in PrAlSi has not been explored. Here, we uncover the pre…

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: 21 pages, 4 figures

    Journal ref: Advanced Electronic Materials (2025)

  40. arXiv:2503.12791  [pdf, other]

    hep-th

    Topological invariant for holographic Weyl-$\mathrm Z_2$ semimetal

    Authors: Xiantong Chen, Xuanting Ji, Ya-Wen Sun

    Abstract: The occurrence of a topological phase transition can be demonstrated by a direct observation of a change in the topological invariant. For holographic topological semimetals, a topological Hamiltonian method needs to be employed to calculate the topological invariants due to the strong coupling nature of the system. We calculate the topological invariants for the holographic Weyl semimetal and the…

    Submitted 20 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: 46 pages, 7 figures. Typos corrected, references added

  41. arXiv:2503.12782  [pdf, other]

    cs.RO

    DART: Dual-level Autonomous Robotic Topology for Efficient Exploration in Unknown Environments

    Authors: Qiming Wang, Yulong Gao, Yang Wang, Xiongwei Zhao, Yijiao Sun, Xiangyan Kong

    Abstract: Conventional algorithms in autonomous exploration face challenges due to their inability to accurately and efficiently identify the spatial distribution of convex regions in the real-time map. These methods often prioritize navigation toward the nearest or information-rich frontiers -- the boundaries between known and unknown areas -- resulting in incomplete convex region exploration and requiring…

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 11 pages, 9 figures, Journal

  42. arXiv:2503.12600  [pdf, other]

    cs.LG

    GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation

    Authors: Tao Feng, Yihang Sun, Jiaxuan You

    Abstract: The powerful capabilities of Large Language Models (LLMs) have led to their growing use in evaluating human-generated content, particularly in evaluating research ideas within academic settings. Existing solutions primarily rely on prompt-based LLM methods or fine-tuned lightweight language models for idea evaluation. However, these methods are often unstable and struggle to comprehend the complex…

    Submitted 16 March, 2025; originally announced March 2025.

  43. arXiv:2503.12551  [pdf, other]

    quant-ph cond-mat.dis-nn cond-mat.quant-gas math.OC

    qReduMIS: A Quantum-Informed Reduction Algorithm for the Maximum Independent Set Problem

    Authors: Martin J. A. Schuetz, Romina Yalovetzky, Ruben S. Andrist, Grant Salton, Yue Sun, Rudy Raymond, Shouvanik Chakrabarti, Atithi Acharya, Ruslan Shaydulin, Marco Pistoia, Helmut G. Katzgraber

    Abstract: We propose and implement a quantum-informed reduction algorithm for the maximum independent set problem that integrates classical kernelization techniques with information extracted from quantum devices. Our larger framework consists of dedicated application, algorithm, and hardware layers, and easily generalizes to the maximum weight independent set problem. In this hybrid quantum-classical frame…

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: Manuscript: 6 pages, 3 figures, 1 table. Appendix: 3 pages, 3 figures, 1 table

  44. arXiv:2503.12547  [pdf, other]

    cs.IR

    LLMSeR: Enhancing Sequential Recommendation via LLM-based Data Augmentation

    Authors: Yuqi Sun, Qidong Liu, Haiping Zhu, Feng Tian

    Abstract: Sequential Recommender Systems (SRS) have become a cornerstone of online platforms, leveraging users' historical interaction data to forecast their next potential engagement. Despite their widespread adoption, SRS often grapple with the long-tail user dilemma, resulting in less effective recommendations for individuals with limited interaction records. The advent of Large Language Models (LLMs), w…

    Submitted 21 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

  45. arXiv:2503.12449  [pdf, other]

    astro-ph.IM astro-ph.GA astro-ph.SR

    Calibration of Complementary Metal-oxide-semiconductor Sensor-based Photometry to a Few-millimagnitude Precision: The Case of the Mini-SiTian Array

    Authors: Kai Xiao, Yang Huang, Haibo Yuan, Zhirui Li, Yongkang Sun, Timothy C. Beers, Min He, Jifeng Liu, Hong Wu, Yongna Mao, Bowen Huang, Mingyang Ma, Chuanjie Zheng, Hongrui Gu, Beichuan Wang, Lin Yang, Shuai Xu

    Abstract: We present a pioneering achievement in the high-precision photometric calibration of CMOS-based photometry, by application of the Gaia BP/RP (XP) spectra-based synthetic photometry (XPSP) method to the mini-SiTian array (MST) photometry. Through 79 repeated observations of the $\texttt{f02}$ field on the night, we find good internal consistency in the calibrated MST $G_{\rm MST}$-band magnitudes f…

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 8 pages, 6 figures, ApJL accepted, see main results in Figure 4

  46. Gas Transfer Between the Inner 3-kpc Disk and the Galactic Central Molecular Zone

    Authors: Yang Su, Shiyu Zhang, Yan Sun, Ji Yang, Fujun Du, Min Fang, Qing-Zeng Yan, Shaobo Zhang, Zhiwei Chen, Xuepeng Chen, Xin Zhou, Lixia Yuan, Yuehui Ma

    Abstract: We uncovered a more tilted molecular gas structure with highly negative velocities located near the dust lane. Our observations also show that the approaching gas flows from the overshoot process are captured by the bar's gravity and then flow towards the Galactic central molecular zone (CMZ) through the bar channel. The recycling gas from the overshoot effect, in conjunction with freshly acc…

    Submitted 4 May, 2025; v1 submitted 15 March, 2025; originally announced March 2025.

    Comments: Published in the ApJ, 984, 109 (2025)

    Journal ref: ApJ, 984, 109 (2025)

  47. arXiv:2503.11780  [pdf, other]

    cs.CV

    Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning

    Authors: Tianyi Zhao, Boyang Liu, Yanglei Gao, Yiming Sun, Maoxun Yuan, Xingxing Wei

    Abstract: Multi-Modal Object Detection (MMOD), due to its stronger adaptability to various complex environments, has been widely adopted in many applications. Extensive research is dedicated to RGB-IR object detection, primarily focusing on how to integrate complementary features from RGB-IR modalities. However, these studies neglect the mono-modality insufficient learning problem that the decreased feature e…

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 10 pages, 6 figures

  48. arXiv:2503.11383  [pdf, other]

    hep-ex

    Study of $φ\to K\bar{K}$ and $K_{S}^{0}-K_{L}^{0}$ asymmetry in the amplitude analysis of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (701 additional authors not shown)

    Abstract: Using $e^+e^-$ annihilation data corresponding to a total integrated luminosity of 7.33 $\rm fb^{-1}$ collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we provide the first amplitude analysis and absolute branching fraction measurement of the hadronic decay $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$. The branching fraction of…

    Submitted 23 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 11 pages, 4 figures

  49. arXiv:2503.11375  [pdf, ps, other]

    econ.EM

    Difference-in-Differences Meets Synthetic Control: Doubly Robust Identification and Estimation

    Authors: Yixiao Sun, Haitian Xie, Yuhang Zhang

    Abstract: Difference-in-Differences (DiD) and Synthetic Control (SC) are widely used methods for causal inference in panel data, each with its own strengths and limitations. In this paper, we propose a novel methodology that integrates the advantages of both DiD and SC approaches. Our integrated approach provides a doubly robust identification strategy for causal effects in panel data with a group structure…

    Submitted 14 March, 2025; originally announced March 2025.

  50. arXiv:2503.11347  [pdf, other]

    q-bio.QM cs.LG physics.bio-ph

    Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data Analysis

    Authors: Zhenyi Zhang, Yuhao Sun, Qiangwei Peng, Tiejun Li, Peijie Zhou

    Abstract: Understanding the dynamic nature of biological systems is fundamental to deciphering cellular behavior, developmental processes, and disease progression. Single-cell RNA sequencing (scRNA-seq) has provided static snapshots of gene expression, offering valuable insights into cellular states at a single time point. Recent advancements in temporally resolved scRNA-seq, spatial transcriptomics (ST), a…

    Submitted 30 April, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Journal ref: Entropy-2025