Showing 1–50 of 315 results for author: Yuan, D

  1. arXiv:2506.12901  [pdf, ps, other]

    math.OC

    Distributed Composite Optimization with Sub-Weibull Noises

    Authors: Zhan Yu, Zhongjie Shi, Deming Yuan

    Abstract: With the rapid development of multi-agent distributed optimization (MA-DO) theory over the past decade, the distributed stochastic gradient method (DSGM) occupies an important position. Although the theory of different DSGMs has been widely established, the main-stream results of existing work are still derived under the condition of light-tailed stochastic gradient noises. Increasing recent examp…

    Submitted 15 June, 2025; originally announced June 2025.

  2. arXiv:2506.06526  [pdf, ps, other]

    eess.SP

    Prompting Wireless Networks: Reinforced In-Context Learning for Power Control

    Authors: Hao Zhou, Chengming Hu, Dun Yuan, Ye Yuan, Di Wu, Xue Liu, Jianzhong Zhang

    Abstract: To manage and optimize constantly evolving wireless networks, existing machine learning (ML)-based studies operate as black-box models, leading to increased computational costs during training and a lack of transparency in decision-making, which limits their practical applicability in wireless networks. Motivated by recent advancements in large language model (LLM)-enabled wireless networks, this…

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2408.00214

  3. arXiv:2505.18254  [pdf, other]

    quant-ph

    Time independence does not limit information flow. II. The case with ancillas

    Authors: T. C. Mooney, Dong Yuan, Adam Ehrenberg, Christopher L. Baldwin, Alexey V. Gorshkov, Andrew M. Childs

    Abstract: While the impact of locality restrictions on quantum dynamics and algorithmic complexity has been well studied in the general case of time-dependent Hamiltonians, the capabilities of time-independent protocols are less well understood. Using clock constructions, we show that the light cone for time-independent Hamiltonians, as captured by Lieb-Robinson bounds, is the same as that for time-dependen…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 28 pages, 2 figures

  4. arXiv:2505.18249  [pdf, ps, other]

    quant-ph cond-mat.other

    Time Independence Does Not Limit Information Flow. I. The Free-Particle Case

    Authors: Dong Yuan, Chao Yin, T. C. Mooney, Christopher L. Baldwin, Andrew M. Childs, Alexey V. Gorshkov

    Abstract: The speed of information propagation in long-range interacting quantum systems is limited by Lieb-Robinson-type bounds, whose tightness can be established by finding specific quantum state-transfer protocols. Previous works have given quantum state-transfer protocols that saturate the corresponding Lieb-Robinson bounds using time-dependent Hamiltonians. Are speed limits for quantum information pro…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 7+15 pages, 2+4 figures

  5. arXiv:2505.17683  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    Dual Attention Residual U-Net for Accurate Brain Ultrasound Segmentation in IVH Detection

    Authors: Dan Yuan, Yi Feng, Ziyun Tang

    Abstract: Intraventricular hemorrhage (IVH) is a severe neurological complication among premature infants, necessitating early and accurate detection from brain ultrasound (US) images to improve clinical outcomes. While recent deep learning methods offer promise for computer-aided diagnosis, challenges remain in capturing both local spatial details and global contextual dependencies critical for segmenting…

    Submitted 10 June, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: 10 pages, 6 figures and 3 tables

  6. arXiv:2505.12284  [pdf, other]

    cs.AI cs.CL

    Efficient RL Training for Reasoning Models via Length-Aware Optimization

    Authors: Danlong Yuan, Tian Xie, Shaohan Huang, Zhuocheng Gong, Huishuai Zhang, Chong Luo, Furu Wei, Dongyan Zhao

    Abstract: Large reasoning models, such as OpenAI o1 or DeepSeek R1, have demonstrated remarkable performance on reasoning tasks but often incur a long reasoning path with significant memory and time costs. Existing methods primarily aim to shorten reasoning paths by introducing additional training data and stages. In this paper, we propose three critical reward designs integrated directly into the reinforce…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: Under review

  7. arXiv:2505.03612  [pdf, other]

    eess.SY

    Backstepping Reach-avoid Controller Synthesis for Multi-input Multi-output Systems with Mixed Relative Degrees

    Authors: Jianqiang Ding, Dingran Yuan, Shankar A. Deka

    Abstract: Designing controllers with provable formal guarantees has become an urgent requirement for cyber-physical systems in safety-critical scenarios. Beyond addressing scalability in high-dimensional implementations, controller synthesis methodologies separating safety and reachability objectives may risk optimization infeasibility due to conflicting constraints, thereby significantly undermining their…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  8. arXiv:2504.19417  [pdf, ps, other]

    cs.CV

    A Real-Time Event-Based Normal Flow Estimator

    Authors: Dehao Yuan, Cornelia Fermüller

    Abstract: This paper presents a real-time, asynchronous, event-based normal flow estimator. It follows the same algorithm as Learning Normal Flow Directly From Event Neighborhoods, but with a more optimized implementation. The original method treats event slices as 3D point clouds, encodes each event's local geometry into a fixed-length vector, and uses a multi-layer perceptron to predict normal flow. It co…

    Submitted 27 April, 2025; originally announced April 2025.

  9. arXiv:2504.09416  [pdf, other]

    cs.LG cs.CY

    Spatially Directional Dual-Attention GAT for Spatial Fluoride Health Risk Modeling

    Authors: Da Yuan

    Abstract: Environmental exposure to fluoride is a major public health concern, particularly in regions with naturally elevated fluoride concentrations. Accurate modeling of fluoride-related health risks, such as dental fluorosis, requires spatially aware learning frameworks capable of capturing both geographic and semantic heterogeneity. In this work, we propose Spatially Directional Dual-Attention Graph At…

    Submitted 12 April, 2025; originally announced April 2025.

  10. Exploring the origin of multi-periodic pulsations during a white-light flare

    Authors: Dong Li, Ding Yuan, Jingye Yan, Xinhua Zhao, Zhao Wu, Jincheng Wang, Zhenyong Hou, Chuan Li, Haisheng Zhao, Libo Fu, Lin Wu, Li Deng

    Abstract: We explored the quasi-periodic pulsations (QPPs) at multiple periods during an X4.0 flare on 2024 May 10 (SOL2024-05-10T06:27), which occurred in the complex active region of NOAA 13664. The flare radiation reveals five prominent periods in multiple wavelengths. An 8-min QPP is simultaneously detected in wavelengths of HXR, radio, UV/EUV, Lya, and white light, which may be associated with nontherma…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 32 pages, 11 figures, accepted to Journal of Geophysical Research-Space Physics

  11. arXiv:2504.06129  [pdf, other]

    cs.IR

    Knowledge Graph Completion with Relation-Aware Anchor Enhancement

    Authors: Duanyang Yuan, Sihang Zhou, Xiaoshu Chen, Dong Wang, Ke Liang, Xinwang Liu, Jian Huang

    Abstract: Text-based knowledge graph completion methods take advantage of pre-trained language models (PLM) to enhance intrinsic semantic connections of raw triplets with detailed text descriptions. Typical methods in this branch map an input query (textual descriptions associated with an entity and a relation) and its candidate entities into feature vectors, respectively, and then maximize the probability…

    Submitted 30 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

  12. arXiv:2503.24245  [pdf, other]

    cs.CL

    Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation

    Authors: Dun Yuan, Hao Zhou, Di Wu, Xue Liu, Hao Chen, Yan Xin, Jianzhong Zhang

    Abstract: Large language models (LLMs) have made significant progress in general-purpose natural language processing tasks. However, LLMs are still facing challenges when applied to domain-specific areas like telecommunications, which demands specialized expertise and adaptability to evolving standards. This paper presents a novel framework that combines knowledge graph (KG) and retrieval-augmented generati…

    Submitted 21 May, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: This work has been accepted to ICC 2025 IEEE International Conference on Communications. copyright 2025 IEEE

  13. arXiv:2503.22070  [pdf, ps, other]

    math.AP

    Quantum Quasi-neutral Limits and Isothermal Euler Equations

    Authors: Immanuel Ben Porat, Gui-Qiang G. Chen, Difan Yuan

    Abstract: We provide a rigorous justification of the semiclassical quasi-neutral and the quantum many-body limits to the isothermal Euler equations. We consider the nonlinear Schrödinger-Poisson-Boltzmann system under a quasi-neutral scaling and establish the convergence of its solutions to the isothermal Euler equations. Different from the previous results that dealt with the linear Poisson equations, the…

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: 50 pages

  14. arXiv:2503.21506  [pdf]

    cond-mat.supr-con

    Yin-Yang vortex on UTe2 (011) surface

    Authors: Ruotong Yin, Yuanji Li, Zengyi Du, Dengpeng Yuan, Shiyuan Wang, Jiashuo Gong, Mingzhe Li, Ziyuan Chen, Jiakang Zhang, Yuguang Wang, Ziwei Xue, Xinchun Lai, Shiyong Tan, Da Wang, Qiang-Hua Wang, Dong-Lai Feng, Ya-Jun Yan

    Abstract: UTe2 is a promising candidate for spin-triplet superconductor, yet its exact superconducting order parameter remains highly debated. Here, via scanning tunneling microscopy/spectroscopy, we observe a novel type of magnetic vortex with distinct dark-bright contrast in local density of states on UTe2 (011) surface under a perpendicular magnetic field, resembling the conjugate structure of Yin-Yang d…

    Submitted 21 May, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: 13 pages, 4 figures

  15. arXiv:2503.16758  [pdf, other]

    math.AP

    Nonlinear stability of compressible vortex sheets in three-dimensional elastodynamics

    Authors: Robin Ming Chen, Feimin Huang, Dehua Wang, Difan Yuan

    Abstract: We investigate the nonlinear stability of compressible vortex sheet solutions for three-dimensional (3D) isentropic elastic flows. Building upon previous results on the weakly linear stability of elastic vortex sheets [19], we perform a detailed study of the roots of the Lopatinskii determinant and identify a geometric stability condition associated with the deformation gradient. We employ an uppe…

    Submitted 20 March, 2025; originally announced March 2025.

    MSC Class: 35Q51; 35Q35; 74F10; 76E17; 76N99

  16. arXiv:2503.16300  [pdf, other]

    astro-ph.SR

    Localized Heating and Dynamics of the Solar Corona due to a Symbiosis of Waves and Reconnection

    Authors: A. K. Srivastava, Sripan Mondal, Eric R. Priest, Sudheer K. Mishra, David I. Pontin, R. Y. Kwon, Ding Yuan, K. Murawski, Ayumi Asai

    Abstract: The Sun's outer atmosphere, the corona, is maintained at mega-Kelvin temperatures and fills the heliosphere with a supersonic outflowing wind. The dissipation of magnetic waves and direct electric currents are likely to be the most significant processes for heating the corona, but a lively debate exists on their relative roles. Here, we suggest that the two are often intrinsically linked, since ma…

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 13 pages, 6 figures; Accepted for publication in ApJ

  17. arXiv:2503.07669  [pdf, other]

    cs.LG

    WECAR: An End-Edge Collaborative Inference and Training Framework for WiFi-Based Continuous Human Activity Recognition

    Authors: Rong Li, Tao Deng, Siwei Feng, He Huang, Juncheng Jia, Di Yuan, Keqin Li

    Abstract: WiFi-based human activity recognition (HAR) holds significant promise for ubiquitous sensing in smart environments. A critical challenge lies in enabling systems to dynamically adapt to evolving scenarios, learning new activities without catastrophic forgetting of prior knowledge, while adhering to the stringent computational constraints of edge devices. Current approaches struggle to reconcile th…

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: text overlap with arXiv:2502.17483

  18. arXiv:2503.06468  [pdf, other]

    cs.NI

    Mobility-Aware Multi-Task Decentralized Federated Learning for Vehicular Networks: Modeling, Analysis, and Optimization

    Authors: Dongyu Chen, Tao Deng, He Huang, Juncheng Jia, Mianxiong Dong, Di Yuan, Keqin Li

    Abstract: Federated learning (FL) is a promising paradigm that can enable collaborative model training between vehicles while protecting data privacy, thereby significantly improving the performance of intelligent transportation systems (ITSs). In vehicular networks, due to mobility, resource constraints, and the concurrent execution of multiple training tasks, how to allocate limited resources effectively…

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: Submitted to IEEE for possible publication

  19. Mobility-Aware Decentralized Federated Learning with Joint Optimization of Local Iteration and Leader Selection for Vehicular Networks

    Authors: Dongyu Chen, Tao Deng, Juncheng Jia, Siwei Feng, Di Yuan

    Abstract: Federated learning (FL) emerges as a promising approach to empower vehicular networks, composed of intelligent connected vehicles equipped with advanced sensing, computing, and communication capabilities. While previous studies have explored the application of FL in vehicular networks, they have largely overlooked the intricate challenges arising from the mobility of vehicles and resource constrai…

    Submitted 11 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: Preprint submitted to Computer Networks; Corrected a missing space in arXiv abstract to ensure proper formatting

  20. arXiv:2503.01116  [pdf, other]

    eess.SP cs.LG

    Large AI Model for Delay-Doppler Domain Channel Prediction in 6G OTFS-Based Vehicular Networks

    Authors: Jianzhe Xue, Dongcheng Yuan, Zhanxi Ma, Tiankai Jiang, Yu Sun, Haibo Zhou, Xuemin Shen

    Abstract: Channel prediction is crucial for high-mobility vehicular networks, as it enables the anticipation of future channel conditions and the proactive adjustment of communication strategies. However, achieving accurate vehicular channel prediction is challenging due to significant Doppler effects and rapid channel variations resulting from high-speed vehicle movement and complex propagation environment…

    Submitted 8 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: This manuscript has been accepted by SCIENCE CHINA Information Sciences

  21. arXiv:2502.13094  [pdf, other]

    math.AP math-ph math.FA physics.bio-ph

    Global Existence and Nonlinear Stability of Finite-Energy Solutions of the Compressible Euler-Riesz Equations with Large Initial Data of Spherical Symmetry

    Authors: José A. Carrillo, Samuel R. Charles, Gui-Qiang G. Chen, Difan Yuan

    Abstract: The compressible Euler-Riesz equations are fundamental with wide applications in astrophysics, plasma physics, and mathematical biology. In this paper, we are concerned with the global existence and nonlinear stability of finite-energy solutions of the multidimensional Euler-Riesz equations with large initial data of spherical symmetry. We consider both attractive and repulsive interactions for a…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 68 pages, 1 figure

    MSC Class: 35Q35; 35Q31; 35B25; 35B44; 35L65; 35L67; 76N10; 35R09; 35R35; 35D30; 76X05; 76N17

  22. arXiv:2501.13794  [pdf, ps, other]

    cs.LG

    Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction

    Authors: Zhi Sheng, Daisy Yuan, Jingtao Ding, Yong Li

    Abstract: Accurate prediction of mobile traffic, i.e., network traffic from cellular base stations, is crucial for optimizing network performance and supporting urban development. However, the non-stationary nature of mobile traffic, driven by human activity and environmental changes, leads to both regular patterns and abrupt variations. Diffusion models excel in capturing such complex temporal dynamics due…

    Submitted 26 June, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  23. arXiv:2501.07879  [pdf, ps, other]

    cs.LG cs.IT math.ST

    Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal

    Authors: Deheng Yuan, Tao Guo, Zhongyi Huang

    Abstract: Consider the communication-constrained problem of nonparametric function estimation, in which each distributed terminal holds multiple i.i.d. samples. Under certain regularity assumptions, we characterize the minimax optimal rates for all regimes, and identify phase transitions of the optimal rates as the samples per terminal vary from sparse to dense. This fully solves the problem left open by pr…

    Submitted 14 January, 2025; originally announced January 2025.

  24. arXiv:2501.06255  [pdf, other]

    cs.LG cs.AI

    Progressive Supervision via Label Decomposition: An Long-Term and Large-Scale Wireless Traffic Forecasting Method

    Authors: Daojun Liang, Haixia Zhang, Dongfeng Yuan

    Abstract: Long-term and Large-scale Wireless Traffic Forecasting (LL-WTF) is pivotal for strategic network management and comprehensive planning on a macro scale. However, LL-WTF poses greater challenges than short-term ones due to the pronounced non-stationarity of extended wireless traffic and the vast number of nodes distributed at the city scale. To cope with this, we propose a Progressive Supervision m…

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Published at Knowledge-Based Systems. arXiv admin note: substantial text overlap with arXiv:2412.00108

  25. arXiv:2501.04688  [pdf, other]

    quant-ph cond-mat.stat-mech

    Observation of topological prethermal strong zero modes

    Authors: Feitong Jin, Si Jiang, Xuhao Zhu, Zehang Bao, Fanhao Shen, Ke Wang, Zitian Zhu, Shibo Xu, Zixuan Song, Jiachen Chen, Ziqi Tan, Yaozu Wu, Chuanyu Zhang, Yu Gao, Ning Wang, Yiren Zou, Aosai Zhang, Tingting Li, Jiarun Zhong, Zhengyi Cui, Yihang Han, Yiyang He, Han Wang, Jianan Yang, Yanzhe Wang, et al. (20 additional authors not shown)

    Abstract: Symmetry-protected topological phases cannot be described by any local order parameter and are beyond the conventional symmetry-breaking paradigm for understanding quantum matter. They are characterized by topological boundary states robust against perturbations that respect the protecting symmetry. In a clean system without disorder, these edge modes typically only occur for the ground states of…

    Submitted 8 January, 2025; originally announced January 2025.

  26. arXiv:2412.19906  [pdf, other]

    cs.CL cs.AI

    Evaluate Summarization in Fine-Granularity: Auto Evaluation with LLM

    Authors: Dong Yuan, Eti Rastogi, Fen Zhao, Sagar Goyal, Gautam Naik, Sree Prasanna Rajagopal

    Abstract: Due to the exponential growth of information and the need for efficient information consumption, the task of summarization has gained paramount importance. Evaluating summarization accurately and objectively presents significant challenges, particularly when dealing with long and unstructured texts rich in content. Existing methods, such as ROUGE (Lin, 2004) and embedding similarities, often yield…

    Submitted 27 December, 2024; originally announced December 2024.

  27. arXiv:2412.19191  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models

    Authors: Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye

    Abstract: Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-rela…

    Submitted 26 December, 2024; originally announced December 2024.

  28. arXiv:2412.11540  [pdf, other]

    cs.CV cs.AI

    SP$^2$T: Sparse Proxy Attention for Dual-stream Point Transformer

    Authors: Jiaxu Wan, Hong Zhang, Ziqi He, Qishu Wang, Ding Yuan, Yifan Yang

    Abstract: In 3D understanding, point transformers have yielded significant advances in broadening the receptive field. However, further enhancement of the receptive field is hindered by the constraints of grouping attention. The proxy-based model, as a hot topic in image and language feature extraction, uses global or local proxies to expand the model's receptive field. But global proxy-based methods fail t…

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 13 pages, 14 figures, 14 tables

  29. arXiv:2412.11284  [pdf, other]

    cs.CV

    Learning Normal Flow Directly From Event Neighborhoods

    Authors: Dehao Yuan, Levi Burner, Jiayi Wu, Minghui Liu, Jingxi Chen, Yiannis Aloimonos, Cornelia Fermüller

    Abstract: Event-based motion field estimation is an important task. However, current optical flow methods face challenges: learning-based approaches, often frame-based and relying on CNNs, lack cross-domain transferability, while model-based methods, though more robust, are less accurate. To address the limitations of optical flow estimation, recent works have focused on normal flow, which can be more relia…

    Submitted 15 December, 2024; originally announced December 2024.

  30. arXiv:2412.10347  [pdf, other]

    q-bio.BM cs.AI cs.LG

    COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models

    Authors: Yuchen Ren, Wenwei Han, Qianyuan Zhang, Yining Tang, Weiqiang Bai, Yuchen Cai, Lifeng Qiao, Hao Jiang, Dong Yuan, Tao Chen, Siqi Sun, Pan Tan, Wanli Ouyang, Nanqing Dong, Xinzhu Ma, Peng Ye

    Abstract: As key elements within the central dogma, DNA, RNA, and proteins play crucial roles in maintaining life by guaranteeing accurate genetic expression and implementation. Although research on these molecules has profoundly impacted fields like medicine, agriculture, and industry, the diversity of machine learning approaches-from traditional statistical methods to deep learning models and large langua…

    Submitted 13 December, 2024; originally announced December 2024.

  31. Multi-Head Encoding for Extreme Label Classification

    Authors: Daojun Liang, Haixia Zhang, Dongfeng Yuan, Minggao Zhang

    Abstract: The number of categories of instances in the real world is normally huge, and each instance may contain multiple labels. To distinguish these massive labels utilizing machine learning, eXtreme Label Classification (XLC) has been established. However, as the number of categories increases, the number of parameters and nonlinear operations in the classifier also rises. This results in a Classifier C…

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 20 pages, 12 figs, Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024

  32. arXiv:2412.07761  [pdf, other]

    cs.CV

    Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation

    Authors: Jingxi Chen, Brandon Y. Feng, Haoming Cai, Tianfu Wang, Levi Burner, Dehao Yuan, Cornelia Fermuller, Christopher A. Metzler, Yiannis Aloimonos

    Abstract: Video Frame Interpolation aims to recover realistic missing frames between observed frames, generating a high-frame-rate video from a low-frame-rate video. However, without additional guidance, the large motion between frames makes this problem ill-posed. Event-based Video Frame Interpolation (EVFI) addresses this challenge by using sparse, high-temporal-resolution event measurements as motion gui…

    Submitted 25 March, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted to CVPR 2025

  33. arXiv:2412.00108  [pdf, other]

    cs.LG

    Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming Data

    Authors: Daojun Liang, Haixia Zhang, Jing Wang, Dongfeng Yuan, Minggao Zhang

    Abstract: In this paper, we find that existing online forecasting methods have the following issues: 1) They do not consider the update frequency of streaming data and directly use labels (future signals) to update the model, leading to information leakage. 2) Eliminating information leakage can exacerbate concept drift and online parameter updates can damage prediction accuracy. 3) Leaving out a validation…

    Submitted 27 November, 2024; originally announced December 2024.

    Comments: 12 pages, 8 figures

  34. arXiv:2411.08043  [pdf, other]

    physics.space-ph physics.geo-ph

    Graph-GIC: A Smart and Parallelized Geomagnetically Induced Current Modelling Algorithm Based on Graph Theory for Space Weather Applications

    Authors: Wen Chen, Ding Yuan, Xueshang Feng, Stefaan Poedts, Zhengyang Zou, Song Feng, Yuxuan Zhu, Tong Yin

    Abstract: Geomagnetically Induced Current (GIC) refers to the electromagnetic response of the Earth and its conductive modern infrastructures to space weather and would pose a significant threat to high-voltage power grids designed for alternating current operation. To assess the impact of space weather on the power grid, one needs to calculate the GIC on a national or continental scale. In this study,…

    Submitted 29 October, 2024; originally announced November 2024.

    Comments: 19 pages, 10 figures

  35. arXiv:2411.04136  [pdf, other]

    cs.NI

    Large Language Models for Wireless Networks: An Overview from the Prompt Engineering Perspective

    Authors: Hao Zhou, Chengming Hu, Dun Yuan, Ye Yuan, Di Wu, Xi Chen, Hina Tabassum, Xue Liu

    Abstract: Recently, large language models (LLMs) have been successfully applied to many fields, showing outstanding comprehension and reasoning capabilities. Despite their great potential, LLMs usually require dedicated pre-training and fine-tuning for domain-specific applications such as wireless networks. These adaptations can be extremely demanding for computational resources and datasets, while most net…

    Submitted 27 December, 2024; v1 submitted 26 October, 2024; originally announced November 2024.

  36. arXiv:2411.02180  [pdf, other]

    astro-ph.SR physics.plasm-ph physics.space-ph

    Generation of fast magnetoacoustic waves in the corona by impulsive bursty reconnection

    Authors: Sripan Mondal, A. K. Srivastava, David I. Pontin, Eric R. Priest, R. Kwon, Ding Yuan

    Abstract: Fast-mode magnetohydrodynamic (MHD) waves in the solar corona are often known to be produced by solar flares and eruptive prominences. We here simulate the effect of the interaction of an external perturbation on a magnetic null in the solar corona which results in the formation of a current sheet (CS). Once the CS undergoes a sufficient extension in its length and squeezing of its width, it may g…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 24 pages, 13 figures, Accepted for publication in The Astrophysical Journal

  37. arXiv:2411.01915  [pdf, other]

    cs.RO

    RoboCrowd: Scaling Robot Data Collection through Crowdsourcing

    Authors: Suvir Mirchandani, David D. Yuan, Kaylee Burns, Md Sazzad Islam, Tony Z. Zhao, Chelsea Finn, Dorsa Sadigh

    Abstract: In recent years, imitation learning from large-scale human demonstrations has emerged as a promising paradigm for training robot policies. However, the burden of collecting large quantities of human demonstrations is significant in terms of collection time and the need for access to expert operators. We introduce a new data collection paradigm, RoboCrowd, which distributes the workload by utilizin…

    Submitted 21 May, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: 21 pages, 25 figures. International Conference on Robotics and Automation (ICRA) 2025

  38. arXiv:2410.23752  [pdf, other]

    eess.SP

    A Peaceman-Rachford Splitting Approach with Deep Equilibrium Network for Channel Estimation

    Authors: Dingli Yuan, Shitong Wu, Haoran Tang, Lu Yang, Chenghui Peng

    Abstract: Multiple-input multiple-output (MIMO) is pivotal for wireless systems, yet its high-dimensional, stochastic channel poses significant challenges for accurate estimation, highlighting the critical need for robust estimation techniques. In this paper, we introduce a novel channel estimation method for the MIMO system. The main idea is to construct a fixed-point equation for channel estimation, which…

    Submitted 7 January, 2025; v1 submitted 31 October, 2024; originally announced October 2024.

  39. arXiv:2410.16947  [pdf, ps, other]

    cs.CV cs.LG

    ISImed: A Framework for Self-Supervised Learning using Intrinsic Spatial Information in Medical Images

    Authors: Nabil Jabareen, Dongsheng Yuan, Sören Lukassen

    Abstract: This paper demonstrates that spatial information can be used to learn interpretable representations in medical images using Self-Supervised Learning (SSL). Our proposed method, ISImed, is based on the observation that medical images exhibit a much lower variability among different images compared to classic data vision benchmarks. By leveraging this resemblance of human body structures across mult…

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 11 pages, 4 figures

  40. arXiv:2410.15455  [pdf, other]

    quant-ph cond-mat.quant-gas physics.atom-ph

    Observation of quantum information collapse-and-revival in a strongly-interacting Rydberg atom array

    Authors: De-Sheng Xiang, Yao-Wen Zhang, Hao-Xiang Liu, Peng Zhou, Dong Yuan, Kuan Zhang, Shun-Yao Zhang, Biao Xu, Lu Liu, Yitong Li, Lin Li

    Abstract: Interactions of isolated quantum many-body systems typically scramble local information into the entire system and make it unrecoverable. Ergodicity-breaking systems possess the potential to exhibit fundamentally different information scrambling dynamics beyond this paradigm. For many-body localized systems with strong ergodicity breaking, local transport vanishes and information scrambles logarit…

    Submitted 20 October, 2024; originally announced October 2024.

    Comments: 12 pages, 6 figures + Supplementary Information 37 pages, 24 figures

  41. arXiv:2410.14741  [pdf, other]

    cs.LG stat.ML

    CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence

    Authors: Zao Zhang, Huaming Chen, Pei Ning, Nan Yang, Dong Yuan

    Abstract: In knowledge distillation, a primary focus has been on transforming and balancing multiple distillation components. In this work, we emphasize the importance of thoroughly examining each distillation component, as we observe that not all elements are equally crucial. From this perspective, we decouple the Kullback-Leibler (KL) divergence into three unique elements: Binary Classification Divergence…

    Submitted 17 October, 2024; originally announced October 2024.

    Report number: DM741

    Journal ref: IEEE International Conference on Data Mining 2024

  42. arXiv:2410.10366  [pdf, other]

    cs.CV cs.AI

    Affinity-Graph-Guided Contrastive Learning for Pretext-Free Medical Image Segmentation with Minimal Annotation

    Authors: Zehua Cheng, Di Yuan, Thomas Lukasiewicz

    Abstract: The combination of semi-supervised learning (SemiSL) and contrastive learning (CL) has been successful in medical image segmentation with limited annotations. However, these works often rely on pretext tasks that lack the specificity required for pixel-level segmentation, and still face overfitting issues due to insufficient supervision signals resulting from too few annotations. Therefore, this p…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: BIBM 2024

  43. arXiv:2410.08799  [pdf, ps, other]

    cs.NI eess.SP

    Online Learning for Intelligent Thermal Management of Interference-coupled and Passively Cooled Base Stations

    Authors: Zhanwei Yu, Yi Zhao, Xiaoli Chu, Di Yuan

    Abstract: Passively cooled base stations (PCBSs) have emerged to deliver better cost and energy efficiency. However, passive cooling necessitates intelligent thermal control via traffic management, i.e., the instantaneous data traffic or throughput of a PCBS directly impacts its thermal performance. This is particularly challenging for outdoor deployment of PCBSs because the heat dissipation efficiency is u…

    Submitted 11 October, 2024; originally announced October 2024.

  44. arXiv:2410.06884  [pdf, ps, other]

    cs.LG cs.IT math.ST

    Adaptive Refinement Protocols for Distributed Distribution Estimation under $\ell^p$-Losses

    Authors: Deheng Yuan, Tao Guo, Zhongyi Huang

    Abstract: Consider the communication-constrained estimation of discrete distributions under $\ell^p$ losses, where each distributed terminal holds multiple independent samples and uses a limited number of bits to describe the samples. We obtain the minimax optimal rates of the problem in most parameter regimes. An elbow effect of the optimal rates at $p=2$ is clearly identified. To show the optimal rates, we…

    Submitted 8 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  45. arXiv:2409.15505  [pdf, other]

    cs.RO

    Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs

    Authors: Angelos Mavrogiannis, Dehao Yuan, Yiannis Aloimonos

    Abstract: There has been a lot of interest in grounding natural language to physical entities through visual context. While Vision Language Models (VLMs) can ground linguistic instructions to visual sensory information, they struggle with grounding non-visual attributes, like the weight of an object. Our key insight is that non-visual attribute detection can be effectively achieved by active perception guid…

    Submitted 6 March, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: ICRA 2025

  46. arXiv:2409.00002  [pdf, ps, other]

    eess.SY

    Distributed Optimization by Network Flows with Spatio-Temporal Compression

    Authors: Zihao Ren, Lei Wang, Xinlei Yi, Xi Wang, Deming Yuan, Tao Yang, Zhengguang Wu, Guodong Shi

    Abstract: Several data compressors have been proposed in distributed optimization frameworks of network systems to reduce communication overhead in large-scale applications. In this paper, we demonstrate that effective information compression may occur over time or space during sequences of node communications in distributed algorithms, leading to the concept of spatio-temporal compressors. This abstraction…

    Submitted 5 March, 2025; v1 submitted 14 August, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2408.02332

  47. arXiv:2408.15569  [pdf, other]

    cs.CV

    Temporal Attention for Cross-View Sequential Image Localization

    Authors: Dong Yuan, Frederic Maire, Feras Dayoub

    Abstract: This paper introduces a novel approach to enhancing cross-view localization, focusing on the fine-grained, sequential localization of street-view images within a single known satellite image patch, a significant departure from traditional one-to-one image retrieval methods. By expanding to sequential image fine-grained localization, our model, equipped with a novel Temporal Attention Module (TAM),…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Accepted to IROS 2024

  48. arXiv:2408.15496  [pdf, other]

    cs.CL

    ReMamba: Equip Mamba with Effective Long-Sequence Modeling

    Authors: Danlong Yuan, Jiahao Liu, Bei Li, Huishuai Zhang, Jingang Wang, Xunliang Cai, Dongyan Zhao

    Abstract: While the Mamba architecture demonstrates superior inference efficiency and competitive performance on short-context natural language processing (NLP) tasks, empirical evidence suggests its capacity to comprehend long contexts is limited compared to transformer-based models. In this study, we investigate the long-context efficiency issues of the Mamba models and propose ReMamba, which enhances Mam…

    Submitted 1 January, 2025; v1 submitted 27 August, 2024; originally announced August 2024.

  49. arXiv:2408.12086  [pdf, other]

    cs.CV cs.AI

    Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy

    Authors: Hong Zhang, Yixuan Lyu, Qian Yu, Hanyang Liu, Huimin Ma, Ding Yuan, Yifan Yang

    Abstract: In the domain of Camouflaged Object Segmentation (COS), despite continuous improvements in segmentation performance, the underlying mechanisms of effective camouflage remain poorly understood, akin to a black box. To address this gap, we present the first comprehensive study to examine the impact of camouflage attributes on the effectiveness of camouflage patterns, offering a quantitative framewor…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV 2024

  50. arXiv:2408.02549  [pdf, other]

    eess.SY

    Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning

    Authors: Hao Zhou, Chengming Hu, Dun Yuan, Ye Yuan, Di Wu, Xue Liu, Zhu Han, Charlie Zhang

    Abstract: Generative artificial intelligence (GAI) is a promising technique towards 6G networks, and generative foundation models such as large language models (LLMs) have attracted considerable interest from academia and the telecom industry. This work considers a novel edge-cloud deployment of foundation models in 6G networks. Specifically, it aims to minimize the service delay of foundation models by radio r…

    Submitted 21 March, 2025; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by IEEE Wireless Communications Letters