Skip to main content

Showing 1–50 of 181 results for author: Yuan, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.17683  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Dual Attention Residual U-Net for Accurate Brain Ultrasound Segmentation in IVH Detection

    Authors: Dan Yuan, Yi Feng, Ziyun Tang

    Abstract: Intraventricular hemorrhage (IVH) is a severe neurological complication among premature infants, necessitating early and accurate detection from brain ultrasound (US) images to improve clinical outcomes. While recent deep learning methods offer promise for computer-aided diagnosis, challenges remain in capturing both local spatial details and global contextual dependencies critical for segmenting… ▽ More

    Submitted 10 June, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: 10 pages,6 figures and 3 tables

  2. arXiv:2505.12284  [pdf, other

    cs.AI cs.CL

    Efficient RL Training for Reasoning Models via Length-Aware Optimization

    Authors: Danlong Yuan, Tian Xie, Shaohan Huang, Zhuocheng Gong, Huishuai Zhang, Chong Luo, Furu Wei, Dongyan Zhao

    Abstract: Large reasoning models, such as OpenAI o1 or DeepSeek R1, have demonstrated remarkable performance on reasoning tasks but often incur a long reasoning path with significant memory and time costs. Existing methods primarily aim to shorten reasoning paths by introducing additional training data and stages. In this paper, we propose three critical reward designs integrated directly into the reinforce… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: Under review

  3. arXiv:2504.19417  [pdf, ps, other

    cs.CV

    A Real-Time Event-Based Normal Flow Estimator

    Authors: Dehao Yuan, Cornelia Fermüller

    Abstract: This paper presents a real-time, asynchronous, event-based normal flow estimator. It follows the same algorithm as Learning Normal Flow Directly From Event Neighborhoods, but with a more optimized implementation. The original method treats event slices as 3D point clouds, encodes each event's local geometry into a fixed-length vector, and uses a multi-layer perceptron to predict normal flow. It co… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  4. arXiv:2504.09416  [pdf, other

    cs.LG cs.CY

    Spatially Directional Dual-Attention GAT for Spatial Fluoride Health Risk Modeling

    Authors: Da Yuan

    Abstract: Environmental exposure to fluoride is a major public health concern, particularly in regions with naturally elevated fluoride concentrations. Accurate modeling of fluoride-related health risks, such as dental fluorosis, requires spatially aware learning frameworks capable of capturing both geographic and semantic heterogeneity. In this work, we propose Spatially Directional Dual-Attention Graph At… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  5. arXiv:2504.06129  [pdf, other

    cs.IR

    Knowledge Graph Completion with Relation-Aware Anchor Enhancement

    Authors: Duanyang Yuan, Sihang Zhou, Xiaoshu Chen, Dong Wang, Ke Liang, Xinwang Liu, Jian Huang

    Abstract: Text-based knowledge graph completion methods take advantage of pre-trained language models (PLM) to enhance intrinsic semantic connections of raw triplets with detailed text descriptions. Typical methods in this branch map an input query (textual descriptions associated with an entity and a relation) and its candidate entities into feature vectors, respectively, and then maximize the probability… ▽ More

    Submitted 30 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

  6. arXiv:2503.24245  [pdf, other

    cs.CL

    Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation

    Authors: Dun Yuan, Hao Zhou, Di Wu, Xue Liu, Hao Chen, Yan Xin, Jianzhong, Zhang

    Abstract: Large language models (LLMs) have made significant progress in general-purpose natural language processing tasks. However, LLMs are still facing challenges when applied to domain-specific areas like telecommunications, which demands specialized expertise and adaptability to evolving standards. This paper presents a novel framework that combines knowledge graph (KG) and retrieval-augmented generati… ▽ More

    Submitted 21 May, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: This work has been accepted to ICC 2025 IEEE International Conference on Communications. copyright 2025 IEEE

  7. arXiv:2503.07669  [pdf, other

    cs.LG

    WECAR: An End-Edge Collaborative Inference and Training Framework for WiFi-Based Continuous Human Activity Recognition

    Authors: Rong Li, Tao Deng, Siwei Feng, He Huang, Juncheng Jia, Di Yuan, Keqin Li

    Abstract: WiFi-based human activity recognition (HAR) holds significant promise for ubiquitous sensing in smart environments. A critical challenge lies in enabling systems to dynamically adapt to evolving scenarios, learning new activities without catastrophic forgetting of prior knowledge, while adhering to the stringent computational constraints of edge devices. Current approaches struggle to reconcile th… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: text overlap with arXiv:2502.17483

  8. arXiv:2503.06468  [pdf, other

    cs.NI

    Mobility-Aware Multi-Task Decentralized Federated Learning for Vehicular Networks: Modeling, Analysis, and Optimization

    Authors: Dongyu Chen, Tao Deng, He Huang, Juncheng Jia, Mianxiong Dong, Di Yuan, Keqin Li

    Abstract: Federated learning (FL) is a promising paradigm that can enable collaborative model training between vehicles while protecting data privacy, thereby significantly improving the performance of intelligent transportation systems (ITSs). In vehicular networks, due to mobility, resource constraints, and the concurrent execution of multiple training tasks, how to allocate limited resources effectively… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

    Comments: Submitted to IEEE for possible publication

  9. Mobility-Aware Decentralized Federated Learning with Joint Optimization of Local Iteration and Leader Selection for Vehicular Networks

    Authors: Dongyu Chen, Tao Deng, Juncheng Jia, Siwei Feng, Di Yuan

    Abstract: Federated learning (FL) emerges as a promising approach to empower vehicular networks, composed by intelligent connected vehicles equipped with advanced sensing, computing, and communication capabilities. While previous studies have explored the application of FL in vehicular networks, they have largely overlooked the intricate challenges arising from the mobility of vehicles and resource constrai… ▽ More

    Submitted 11 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: Preprint submitted to Computer Networks; Corrected a missing space in arXiv abstract to ensure proper formatting

  10. arXiv:2503.01116  [pdf, other

    eess.SP cs.LG

    Large AI Model for Delay-Doppler Domain Channel Prediction in 6G OTFS-Based Vehicular Networks

    Authors: Jianzhe Xue, Dongcheng Yuan, Zhanxi Ma, Tiankai Jiang, Yu Sun, Haibo Zhou, Xuemin Shen

    Abstract: Channel prediction is crucial for high-mobility vehicular networks, as it enables the anticipation of future channel conditions and the proactive adjustment of communication strategies. However, achieving accurate vehicular channel prediction is challenging due to significant Doppler effects and rapid channel variations resulting from high-speed vehicle movement and complex propagation environment… ▽ More

    Submitted 8 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: This manuscript has been accepted by SCIENCE CHINA Information Sciences

  11. arXiv:2501.13794  [pdf, ps, other

    cs.LG

    Unveiling the Power of Noise Priors: Enhancing Diffusion Models for Mobile Traffic Prediction

    Authors: Zhi Sheng, Daisy Yuan, Jingtao Ding, Yong Li

    Abstract: Accurate prediction of mobile traffic, i.e., network traffic from cellular base stations, is crucial for optimizing network performance and supporting urban development. However, the non-stationary nature of mobile traffic, driven by human activity and environmental changes, leads to both regular patterns and abrupt variations. Diffusion models excel in capturing such complex temporal dynamics due… ▽ More

    Submitted 26 June, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  12. arXiv:2501.07879  [pdf, ps, other

    cs.LG cs.IT math.ST

    Distributed Nonparametric Estimation: from Sparse to Dense Samples per Terminal

    Authors: Deheng Yuan, Tao Guo, Zhongyi Huang

    Abstract: Consider the communication-constrained problem of nonparametric function estimation, in which each distributed terminal holds multiple i.i.d. samples. Under certain regularity assumptions, we characterize the minimax optimal rates for all regimes, and identify phase transitions of the optimal rates as the samples per terminal vary from sparse to dense. This fully solves the problem left open by pr… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

  13. arXiv:2501.06255  [pdf, other

    cs.LG cs.AI

    Progressive Supervision via Label Decomposition: An Long-Term and Large-Scale Wireless Traffic Forecasting Method

    Authors: Daojun Liang, Haixia Zhang, Dongfeng Yuan

    Abstract: Long-term and Large-scale Wireless Traffic Forecasting (LL-WTF) is pivotal for strategic network management and comprehensive planning on a macro scale. However, LL-WTF poses greater challenges than short-term ones due to the pronounced non-stationarity of extended wireless traffic and the vast number of nodes distributed at the city scale. To cope with this, we propose a Progressive Supervision m… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Published at Knowledge-Based Systems. arXiv admin note: substantial text overlap with arXiv:2412.00108

  14. arXiv:2412.19906  [pdf, other

    cs.CL cs.AI

    Evaluate Summarization in Fine-Granularity: Auto Evaluation with LLM

    Authors: Dong Yuan, Eti Rastogi, Fen Zhao, Sagar Goyal, Gautam Naik, Sree Prasanna Rajagopal

    Abstract: Due to the exponential growth of information and the need for efficient information consumption the task of summarization has gained paramount importance. Evaluating summarization accurately and objectively presents significant challenges, particularly when dealing with long and unstructured texts rich in content. Existing methods, such as ROUGE (Lin, 2004) and embedding similarities, often yield… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

  15. arXiv:2412.19191  [pdf, other

    q-bio.BM cs.AI cs.LG

    Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models

    Authors: Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye

    Abstract: Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-rela… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  16. arXiv:2412.11540  [pdf, other

    cs.CV cs.AI

    SP$^2$T: Sparse Proxy Attention for Dual-stream Point Transformer

    Authors: Jiaxu Wan, Hong Zhang, Ziqi He, Qishu Wang, Ding Yuan, Yifan Yang

    Abstract: In 3D understanding, point transformers have yielded significant advances in broadening the receptive field. However, further enhancement of the receptive field is hindered by the constraints of grouping attention. The proxy-based model, as a hot topic in image and language feature extraction, uses global or local proxies to expand the model's receptive field. But global proxy-based methods fail t… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 13 pages, 14 figures, 14 tables

  17. arXiv:2412.11284  [pdf, other

    cs.CV

    Learning Normal Flow Directly From Event Neighborhoods

    Authors: Dehao Yuan, Levi Burner, Jiayi Wu, Minghui Liu, Jingxi Chen, Yiannis Aloimonos, Cornelia Fermüller

    Abstract: Event-based motion field estimation is an important task. However, current optical flow methods face challenges: learning-based approaches, often frame-based and relying on CNNs, lack cross-domain transferability, while model-based methods, though more robust, are less accurate. To address the limitations of optical flow estimation, recent works have focused on normal flow, which can be more relia… ▽ More

    Submitted 15 December, 2024; originally announced December 2024.

  18. arXiv:2412.10347  [pdf, other

    q-bio.BM cs.AI cs.LG

    COMET: Benchmark for Comprehensive Biological Multi-omics Evaluation Tasks and Language Models

    Authors: Yuchen Ren, Wenwei Han, Qianyuan Zhang, Yining Tang, Weiqiang Bai, Yuchen Cai, Lifeng Qiao, Hao Jiang, Dong Yuan, Tao Chen, Siqi Sun, Pan Tan, Wanli Ouyang, Nanqing Dong, Xinzhu Ma, Peng Ye

    Abstract: As key elements within the central dogma, DNA, RNA, and proteins play crucial roles in maintaining life by guaranteeing accurate genetic expression and implementation. Although research on these molecules has profoundly impacted fields like medicine, agriculture, and industry, the diversity of machine learning approaches-from traditional statistical methods to deep learning models and large langua… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

  19. Multi-Head Encoding for Extreme Label Classification

    Authors: Daojun Liang, Haixia Zhang, Dongfeng Yuan, Minggao Zhang

    Abstract: The number of categories of instances in the real world is normally huge, and each instance may contain multiple labels. To distinguish these massive labels utilizing machine learning, eXtreme Label Classification (XLC) has been established. However, as the number of categories increases, the number of parameters and nonlinear operations in the classifier also rises. This results in a Classifier C… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 20 pages, 12 figs, Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2024

  20. arXiv:2412.07761  [pdf, other

    cs.CV

    Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation

    Authors: Jingxi Chen, Brandon Y. Feng, Haoming Cai, Tianfu Wang, Levi Burner, Dehao Yuan, Cornelia Fermuller, Christopher A. Metzler, Yiannis Aloimonos

    Abstract: Video Frame Interpolation aims to recover realistic missing frames between observed frames, generating a high-frame-rate video from a low-frame-rate video. However, without additional guidance, the large motion between frames makes this problem ill-posed. Event-based Video Frame Interpolation (EVFI) addresses this challenge by using sparse, high-temporal-resolution event measurements as motion gui… ▽ More

    Submitted 25 March, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted to CVPR 2025

  21. arXiv:2412.00108  [pdf, other

    cs.LG

    Act Now: A Novel Online Forecasting Framework for Large-Scale Streaming Data

    Authors: Daojun Liang, Haixia Zhang, Jing Wang, Dongfeng Yuan, Minggao Zhang

    Abstract: In this paper, we find that existing online forecasting methods have the following issues: 1) They do not consider the update frequency of streaming data and directly use labels (future signals) to update the model, leading to information leakage. 2) Eliminating information leakage can exacerbate concept drift and online parameter updates can damage prediction accuracy. 3) Leaving out a validation… ▽ More

    Submitted 27 November, 2024; originally announced December 2024.

    Comments: 12 pages, 8 figures

  22. arXiv:2411.04136  [pdf, other

    cs.NI

    Large Language Models for Wireless Networks: An Overview from the Prompt Engineering Perspective

    Authors: Hao Zhou, Chengming Hu, Dun Yuan, Ye Yuan, Di Wu, Xi Chen, Hina Tabassum, Xue Liu

    Abstract: Recently, large language models (LLMs) have been successfully applied to many fields, showing outstanding comprehension and reasoning capabilities. Despite their great potential, LLMs usually require dedicated pre-training and fine-tuning for domain-specific applications such as wireless networks. These adaptations can be extremely demanding for computational resources and datasets, while most net… ▽ More

    Submitted 27 December, 2024; v1 submitted 26 October, 2024; originally announced November 2024.

  23. arXiv:2411.01915  [pdf, other

    cs.RO

    RoboCrowd: Scaling Robot Data Collection through Crowdsourcing

    Authors: Suvir Mirchandani, David D. Yuan, Kaylee Burns, Md Sazzad Islam, Tony Z. Zhao, Chelsea Finn, Dorsa Sadigh

    Abstract: In recent years, imitation learning from large-scale human demonstrations has emerged as a promising paradigm for training robot policies. However, the burden of collecting large quantities of human demonstrations is significant in terms of collection time and the need for access to expert operators. We introduce a new data collection paradigm, RoboCrowd, which distributes the workload by utilizin… ▽ More

    Submitted 21 May, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: 21 pages, 25 figures. International Conference on Robotics and Automation (ICRA) 2025

  24. arXiv:2410.16947  [pdf, ps, other

    cs.CV cs.LG

    ISImed: A Framework for Self-Supervised Learning using Intrinsic Spatial Information in Medical Images

    Authors: Nabil Jabareen, Dongsheng Yuan, Sören Lukassen

    Abstract: This paper demonstrates that spatial information can be used to learn interpretable representations in medical images using Self-Supervised Learning (SSL). Our proposed method, ISImed, is based on the observation that medical images exhibit a much lower variability among different images compared to classic data vision benchmarks. By leveraging this resemblance of human body structures across mult… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 11 pages, 4 figures

  25. arXiv:2410.14741  [pdf, other

    cs.LG stat.ML

    CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence

    Authors: Zao Zhang, Huaming Chen, Pei Ning, Nan Yang, Dong Yuan

    Abstract: In knowledge distillation, a primary focus has been on transforming and balancing multiple distillation components. In this work, we emphasize the importance of thoroughly examining each distillation component, as we observe that not all elements are equally crucial. From this perspective,we decouple the Kullback-Leibler (KL) divergence into three unique elements: Binary Classification Divergence… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Report number: DM741

    Journal ref: IEEE International Conference on Data Mining 2024

  26. arXiv:2410.10366  [pdf, other

    cs.CV cs.AI

    Affinity-Graph-Guided Contractive Learning for Pretext-Free Medical Image Segmentation with Minimal Annotation

    Authors: Zehua Cheng, Di Yuan, Thomas Lukasiewicz

    Abstract: The combination of semi-supervised learning (SemiSL) and contrastive learning (CL) has been successful in medical image segmentation with limited annotations. However, these works often rely on pretext tasks that lack the specificity required for pixel-level segmentation, and still face overfitting issues due to insufficient supervision signals resulting from too few annotations. Therefore, this p… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: BIBM 2024

  27. arXiv:2410.08799  [pdf, ps, other

    cs.NI eess.SP

    Online Learning for Intelligent Thermal Management of Interference-coupled and Passively Cooled Base Stations

    Authors: Zhanwei Yu, Yi Zhao, Xiaoli Chu, Di Yuan

    Abstract: Passively cooled base stations (PCBSs) have emerged to deliver better cost and energy efficiency. However, passive cooling necessitates intelligent thermal control via traffic management, i.e., the instantaneous data traffic or throughput of a PCBS directly impacts its thermal performance. This is particularly challenging for outdoor deployment of PCBSs because the heat dissipation efficiency is u… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  28. arXiv:2410.06884  [pdf, ps, other

    cs.LG cs.IT math.ST

    Adaptive Refinement Protocols for Distributed Distribution Estimation under $\ell^p$-Losses

    Authors: Deheng Yuan, Tao Guo, Zhongyi Huang

    Abstract: Consider the communication-constrained estimation of discrete distributions under $\ell^p$ losses, where each distributed terminal holds multiple independent samples and uses limited number of bits to describe the samples. We obtain the minimax optimal rates of the problem in most parameter regimes. An elbow effect of the optimal rates at $p=2$ is clearly identified. To show the optimal rates, we… ▽ More

    Submitted 8 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  29. arXiv:2409.15505  [pdf, other

    cs.RO

    Discovering Object Attributes by Prompting Large Language Models with Perception-Action APIs

    Authors: Angelos Mavrogiannis, Dehao Yuan, Yiannis Aloimonos

    Abstract: There has been a lot of interest in grounding natural language to physical entities through visual context. While Vision Language Models (VLMs) can ground linguistic instructions to visual sensory information, they struggle with grounding non-visual attributes, like the weight of an object. Our key insight is that non-visual attribute detection can be effectively achieved by active perception guid… ▽ More

    Submitted 6 March, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: ICRA 2025

  30. arXiv:2408.15569  [pdf, other

    cs.CV

    Temporal Attention for Cross-View Sequential Image Localization

    Authors: Dong Yuan, Frederic Maire, Feras Dayoub

    Abstract: This paper introduces a novel approach to enhancing cross-view localization, focusing on the fine-grained, sequential localization of street-view images within a single known satellite image patch, a significant departure from traditional one-to-one image retrieval methods. By expanding to sequential image fine-grained localization, our model, equipped with a novel Temporal Attention Module (TAM),… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: Accepted to IROS 2024

  31. arXiv:2408.15496  [pdf, other

    cs.CL

    ReMamba: Equip Mamba with Effective Long-Sequence Modeling

    Authors: Danlong Yuan, Jiahao Liu, Bei Li, Huishuai Zhang, Jingang Wang, Xunliang Cai, Dongyan Zhao

    Abstract: While the Mamba architecture demonstrates superior inference efficiency and competitive performance on short-context natural language processing (NLP) tasks, empirical evidence suggests its capacity to comprehend long contexts is limited compared to transformer-based models. In this study, we investigate the long-context efficiency issues of the Mamba models and propose ReMamba, which enhances Mam… ▽ More

    Submitted 1 January, 2025; v1 submitted 27 August, 2024; originally announced August 2024.

  32. arXiv:2408.12086  [pdf, other

    cs.CV cs.AI

    Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and VisualAnalysis Strategy

    Authors: Hong Zhang, Yixuan Lyu, Qian Yu, Hanyang Liu, Huimin Ma, Ding Yuan, Yifan Yang

    Abstract: In the domain of Camouflaged Object Segmentation (COS), despite continuous improvements in segmentation performance, the underlying mechanisms of effective camouflage remain poorly understood, akin to a black box. To address this gap, we present the first comprehensive study to examine the impact of camouflage attributes on the effectiveness of camouflage patterns, offering a quantitative framewor… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV 2024

  33. arXiv:2407.19845  [pdf, other

    cs.LG cs.CR

    BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning

    Authors: Baoyuan Wu, Hongrui Chen, Mingda Zhang, Zihao Zhu, Shaokui Wei, Danni Yuan, Mingli Zhu, Ruotong Wang, Li Liu, Chao Shen

    Abstract: As an emerging approach to explore the vulnerability of deep neural networks (DNNs), backdoor learning has attracted increasing interest in recent years, and many seminal backdoor attack and defense algorithms are being developed successively or concurrently, in the status of a rapid arms race. However, mainly due to the diverse settings, and the difficulties of implementation and reproducibility… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Substantial extensions based on our previous conference version "Backdoorbench: A comprehensive benchmark of backdoor learning" published at NeurIPS D&B Track 2022. 20 backdoor attack algorithms, 32 backdoor defense algorithms, 11000+ pairs of attack-against-defense evaluations, 10 analyses, 18 analysis tools

  34. arXiv:2407.19430  [pdf, other

    cs.CV

    Progressive Domain Adaptation for Thermal Infrared Object Tracking

    Authors: Qiao Li, Kanlun Tan, Qiao Liu, Di Yuan, Xin Li, Yunpeng Liu

    Abstract: Due to the lack of large-scale labeled Thermal InfraRed (TIR) training datasets, most existing TIR trackers are trained directly on RGB datasets. However, tracking methods trained on RGB datasets suffer a significant drop-off in TIR data due to the domain shift issue. To this end, in this work, we propose a Progressive Domain Adaptation framework for TIR Tracking (PDAT), which transfers useful kno… ▽ More

    Submitted 3 September, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

    Comments: 10 pages, 8 figures

  35. Radio Frequency Signal based Human Silhouette Segmentation: A Sequential Diffusion Approach

    Authors: Penghui Wen, Kun Hu, Dong Yuan, Zhiyuan Ning, Changyang Li, Zhiyong Wang

    Abstract: Radio frequency (RF) signals have been proved to be flexible for human silhouette segmentation (HSS) under complex environments. Existing studies are mainly based on a one-shot approach, which lacks a coherent projection ability from the RF domain. Additionally, the spatio-temporal patterns have not been fully explored for human motion dynamics in HSS. Therefore, we propose a two-stage Sequential… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

  36. arXiv:2407.08558  [pdf, other

    cs.AI

    ST-Mamba: Spatial-Temporal Mamba for Traffic Flow Estimation Recovery using Limited Data

    Authors: Doncheng Yuan, Jianzhe Xue, Jinshan Su, Wenchao Xu, Haibo Zhou

    Abstract: Traffic flow estimation (TFE) is crucial for urban intelligent traffic systems. While traditional on-road detectors are hindered by limited coverage and high costs, cloud computing and data mining of vehicular network data, such as driving speeds and GPS coordinates, present a promising and cost-effective alternative. Furthermore, minimizing data collection can significantly reduce overhead. Howev… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 IEEE/CIC International Conference on Communications in China (ICCC)

  37. arXiv:2407.08047   

    cs.LG cs.AI

    Spatial-Temporal Attention Model for Traffic State Estimation with Sparse Internet of Vehicles

    Authors: Jianzhe Xue, Dongcheng Yuan, Yu Sun, Tianqi Zhang, Wenchao Xu, Haibo Zhou, Xuemin, Shen

    Abstract: The growing number of connected vehicles offers an opportunity to leverage internet of vehicles (IoV) data for traffic state estimation (TSE) which plays a crucial role in intelligent transportation systems (ITS). By utilizing only a portion of IoV data instead of the entire dataset, the significant overheads associated with collecting and processing large amounts of data can be avoided. In this p… ▽ More

    Submitted 14 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: need further improvement

  38. arXiv:2407.08034  [pdf, other

    cs.AI

    Spatial-Temporal Generative AI for Traffic Flow Estimation with Sparse Data of Connected Vehicles

    Authors: Jianzhe Xue, Yunting Xu, Dongcheng Yuan, Caoyi Zha, Hongyang Du, Haibo Zhou, Dusit Niyato

    Abstract: Traffic flow estimation (TFE) is crucial for intelligent transportation systems. Traditional TFE methods rely on extensive road sensor networks and typically incur significant costs. Sparse mobile crowdsensing enables a cost-effective alternative by utilizing sparsely distributed probe vehicle data (PVD) provided by connected vehicles. However, as pointed out by the central limit theorem, the spar… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  39. Threats and Defenses in Federated Learning Life Cycle: A Comprehensive Survey and Challenges

    Authors: Yanli Li, Zhongliang Guo, Nan Yang, Huaming Chen, Dong Yuan, Weiping Ding

    Abstract: Federated Learning (FL) offers innovative solutions for privacy-preserving collaborative machine learning (ML). Despite its promising potential, FL is vulnerable to various attacks due to its distributed nature, affecting the entire life cycle of FL services. These threats can harm the model's utility or compromise participants' privacy, either directly or indirectly. In response, numerous defense… ▽ More

    Submitted 11 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2025, Page(s): 1 - 21

  40. arXiv:2407.01595  [pdf, other

    cs.LG cs.CY cs.SE

    Fairpriori: Improving Biased Subgroup Discovery for Deep Neural Network Fairness

    Authors: Kacy Zhou, Jiawen Wen, Nan Yang, Dong Yuan, Qinghua Lu, Huaming Chen

    Abstract: While deep learning has become a core functional module of most software systems, concerns regarding the fairness of ML predictions have emerged as a significant issue that affects prediction results due to discrimination. Intersectional bias, which disproportionately affects members of subgroups, is a prime example of this. For instance, a machine learning model might exhibit bias against darker-… ▽ More

    Submitted 24 June, 2024; originally announced July 2024.

    Comments: 11 pages

  41. arXiv:2406.11397  [pdf, other

    cs.LG cs.AI stat.ML

    DistPred: A Distribution-Free Probabilistic Inference Method for Regression and Forecasting

    Authors: Daojun Liang, Haixia Zhang, Dongfeng Yuan

    Abstract: Traditional regression and prediction tasks often only provide deterministic point estimates. To estimate the distribution or uncertainty of the response variable, traditional methods either assume that the posterior distribution of samples follows a Gaussian process or require thousands of forward passes for sample generation. We propose a novel approach called DistPred for regression and forecas… ▽ More

    Submitted 6 January, 2025; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Published at KDD 2025

    Journal ref: KDD 2025

  42. arXiv:2406.10655  [pdf, ps, other

    cs.CR

    E-SAGE: Explainability-based Defense Against Backdoor Attacks on Graph Neural Networks

    Authors: Dingqiang Yuan, Xiaohua Xu, Lei Yu, Tongchang Han, Rongchang Li, Meng Han

    Abstract: Graph Neural Networks (GNNs) have recently been widely adopted in multiple domains. Yet, they are notably vulnerable to adversarial and backdoor attacks. In particular, backdoor attacks based on subgraph insertion have been shown to be effective in graph classification tasks while being stealthy, successfully circumventing various existing defense methods. In this paper, we propose E-SAGE, a novel… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  43. arXiv:2406.10391  [pdf, other

    q-bio.QM cs.LG

    BEACON: Benchmark for Comprehensive RNA Tasks and Language Models

    Authors: Yuchen Ren, Zhiyuan Chen, Lifeng Qiao, Hongtai Jing, Yuchen Cai, Sheng Xu, Peng Ye, Xinzhu Ma, Siqi Sun, Hongliang Yan, Dong Yuan, Wanli Ouyang, Xihui Liu

    Abstract: RNA plays a pivotal role in translating genetic instructions into functional outcomes, underscoring its importance in biological processes and disease mechanisms. Despite the emergence of numerous deep learning approaches for RNA, particularly universal RNA language models, there remains a significant lack of standardized benchmarks to assess the effectiveness of these methods. In this study, we i… ▽ More

    Submitted 12 December, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by NeurIPS 2024 Dataset and Benchmark Track

  44. arXiv:2406.08688  [pdf, other

    cs.SE cs.AI

    On Security Weaknesses and Vulnerabilities in Deep Learning Systems

    Authors: Zhongzheng Lai, Huaming Chen, Ruoxi Sun, Yu Zhang, Minhui Xue, Dong Yuan

    Abstract: The security guarantee of AI-enabled software systems (particularly using deep learning techniques as a functional core) is pivotal against the adversarial attacks exploiting software vulnerabilities. However, little attention has been paid to a systematic investigation of vulnerabilities in such systems. A common situation learned from the open source software community is that deep learning engi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  45. arXiv:2405.14251   

    cs.RO eess.SY

    Efficient Navigation of a Robotic Fish Swimming Across the Vortical Flow Field

    Authors: Haodong Feng, Dehan Yuan, Jiale Miao, Jie You, Yue Wang, Yi Zhu, Dixia Fan

    Abstract: Navigating efficiently across vortical flow fields presents a significant challenge in various robotic applications. The dynamic and unsteady nature of vortical flows often disturbs the control of underwater robots, complicating their operation in hydrodynamic environments. Conventional control methods, which depend on accurate modeling, fail in these settings due to the complexity of fluid-struct… ▽ More

    Submitted 27 September, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: We would like to request the withdrawal of our submission due to some misunderstandings among the co-authors concerning the submission process. It appears that the current version was submitted before we reached a consensus among all authors. We are actively working to address these matters and plan to resubmit a revised version once we achieve agreement

  46. arXiv:2405.12530  [pdf, other

    cs.NI

    Multi-hop Multi-RIS Wireless Communication Systems: Multi-reflection Path Scheduling and Beamforming

    Authors: Xiaoyan Ma, Haixia Zhang, Xianhao Chen, Yuguang Fangmand Dongfeng Yuan

    Abstract: Reconfigurable intelligent surface (RIS) provides a promising way to proactively augment propagation environments for better transmission performance in wireless communications. Existing multi-RIS works mainly focus on link-level optimization with predetermined transmission paths, which cannot be directly extended to system-level management, since they neither consider the interference caused by u… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE Transactions on Wireless Communication

  47. arXiv:2405.10825  [pdf, other

    eess.SY cs.LG

    Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities

    Authors: Hao Zhou, Chengming Hu, Ye Yuan, Yufei Cui, Yili Jin, Can Chen, Haolun Wu, Dun Yuan, Li Jiang, Di Wu, Xue Liu, Charlie Zhang, Xianbin Wang, Jiangchuan Liu

    Abstract: Large language models (LLMs) have received considerable attention recently due to their outstanding comprehension and reasoning capabilities, leading to great progress in many fields. The advancement of LLM techniques also offers promising opportunities to automate many tasks in the telecommunication (telecom) field. After pre-training and fine-tuning, LLMs can perform diverse downstream tasks bas… ▽ More

    Submitted 16 September, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  48. arXiv:2405.07444  [pdf, other

    cs.CV

    Motion Keyframe Interpolation for Any Human Skeleton via Temporally Consistent Point Cloud Sampling and Reconstruction

    Authors: Clinton Mo, Kun Hu, Chengjiang Long, Dong Yuan, Zhiyong Wang

    Abstract: In the character animation field, modern supervised keyframe interpolation models have demonstrated exceptional performance in constructing natural human motions from sparse pose definitions. As supervised models, large motion datasets are necessary to facilitate the learning process; however, since motion is represented with fixed hierarchical skeletons, such datasets are incompatible for skeleto… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 17 pages, 7 figures

  49. arXiv:2405.02360  [pdf, other

    cs.LG cs.DC

    Holistic Evaluation Metrics: Use Case Sensitive Evaluation Metrics for Federated Learning

    Authors: Yanli Li, Jehad Ibrahim, Huaming Chen, Dong Yuan, Kim-Kwang Raymond Choo

    Abstract: A large number of federated learning (FL) algorithms have been proposed for different applications and from varying perspectives. However, the evaluation of such approaches often relies on a single metric (e.g., accuracy). Such a practice fails to account for the unique demands and diverse requirements of different use cases. Thus, how to comprehensively evaluate an FL algorithm and determine the… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  50. arXiv:2404.19666  [pdf, other

    cs.CV eess.IV

    Beyond MOS: Subjective Image Quality Score Preprocessing Method Based on Perceptual Similarity

    Authors: Lei Wang, Desen Yuan

    Abstract: Image quality assessment often relies on raw opinion scores provided by subjects in subjective experiments, which can be noisy and unreliable. To address this issue, postprocessing procedures such as ITU-R BT.500, ITU-T P.910, and ITU-T P.913 have been standardized to clean up the original opinion scores. These methods use annotator-based statistical priors, but they do not take into account exten… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.