Skip to main content

Showing 1–50 of 161 results for author: Pan, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.13137  [pdf, ps, other

    cs.IT eess.SP

    On secure UAV-aided ISCC systems

    Authors: Hongjiang Lei, Congke Jiang, Ki-Hong Park, Mohamed A. Aboulhassan, Sen Zhou, Gaofeng Pan

    Abstract: Integrated communication and sensing, which can make full use of the limited spectrum resources to perform communication and sensing tasks simultaneously, is an up-and-coming technology in wireless communication networks. In this work, we investigate the secrecy performance of an uncrewed aerial vehicle (UAV)-assisted secure integrated communication, sensing, and computing system, where the UAV se… ▽ More

    Submitted 27 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: 11 pages, 7 figures, submitted to IEEE Journal for review

  2. arXiv:2506.03622  [pdf, ps, other

    cs.IT eess.SP

    Beamforming for Secure RSMA-Aided ISAC Systems

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan

    Abstract: This work investigates the physical layer security of rate-splitting multiple access (RSMA)-aided integrated communication and sensing (ISAC) systems. The ISAC base station (BS) transmits signals to communicate with users in an eavesdropped scenario and to estimate the parameters of the sensed targets. The research considers different sensing signals under RSMA technology and the Cram{é}r-Rao boun… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 15 pages, 6 figures, submitted to IEEE journal for review

  3. arXiv:2506.01968  [pdf, ps, other

    cs.LG cs.AI cs.NE

    Efficient ANN-SNN Conversion with Error Compensation Learning

    Authors: Chang Liu, Jiangrong Shen, Xuming Ran, Mingkun Xu, Qi Xu, Yi Xu, Gang Pan

    Abstract: Artificial neural networks (ANNs) have demonstrated outstanding performance in numerous tasks, but deployment in resource-constrained environments remains a challenge due to their high computational and memory requirements. Spiking neural networks (SNNs) operate through discrete spike events and offer superior energy efficiency, providing a bio-inspired alternative. However, current ANN-to-SNN con… ▽ More

    Submitted 12 May, 2025; originally announced June 2025.

  4. arXiv:2505.14535  [pdf, ps, other

    cs.LG cs.HC

    Spiking Neural Networks with Temporal Attention-Guided Adaptive Fusion for imbalanced Multi-modal Learning

    Authors: Jiangrong Shen, Yulin Xie, Qi Xu, Gang Pan, Huajin Tang, Badong Chen

    Abstract: Multimodal spiking neural networks (SNNs) hold significant potential for energy-efficient sensory processing but face critical challenges in modality imbalance and temporal misalignment. Current approaches suffer from uncoordinated convergence speeds across modalities and static fusion mechanisms that ignore time-varying cross-modal interactions. We propose the temporal attention-guided adaptive f… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  5. arXiv:2505.12221  [pdf, ps, other

    cs.NE

    Bridging Quantized Artificial Neural Networks and Neuromorphic Hardware

    Authors: Zhenhui Chen, Haoran Xu, Yangfan Hu, Xiaofei Jin, Xinyu Li, Ziyang Kang, Gang Pan, De Ma

    Abstract: Neuromorphic hardware aims to leverage distributed computing and event-driven circuit design to achieve an energy-efficient AI system. The name "neuromorphic" is derived from its spiking and local computing nature, which mimics the fundamental activity of an animal's nervous system. In neuromorphic hardware, neurons, i.e., computing cores use single-bit, event-driven data (called spikes) for inter… ▽ More

    Submitted 22 June, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

  6. arXiv:2505.12089  [pdf, ps, other

    eess.IV cs.AI cs.CV

    NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

    Authors: Sangmin Lee, Eunpil Park, Angel Canelo, Hyunhee Park, Youngjo Kim, Hyung-Ju Chun, Xin Jin, Chongyi Li, Chun-Le Guo, Radu Timofte, Qi Wu, Tianheng Qiu, Yuchun Dong, Shenglin Ding, Guanghua Pan, Weiyu Zhou, Tao Hu, Yixu Feng, Duwei Dai, Yu Cao, Peng Wu, Wei Dong, Yanning Zhang, Qingsen Yan, Simon J. Larsen , et al. (11 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Efficient Burst HDR and Restoration Challenge, which aims to advance efficient multi-frame high dynamic range (HDR) and restoration techniques. The challenge is based on a novel RAW multi-frame fusion dataset, comprising nine noisy and misaligned RAW frames with various exposure levels per scene. Participants were tasked with developing solutions capable of effect… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  7. arXiv:2505.11100  [pdf, other

    cs.LG cs.AI

    Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors

    Authors: Lang Feng, Jiahao Lin, Dong Xing, Li Zhang, De Ma, Gang Pan

    Abstract: Population-population generalization is a challenging problem in multi-agent reinforcement learning (MARL), particularly when agents encounter unseen co-players. However, existing self-play-based methods are constrained by the limitation of inside-space generalization. In this study, we propose Bidirectional Distillation (BiDist), a novel mixed-play framework, to overcome this limitation in MARL.… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  8. arXiv:2505.10134  [pdf, other

    eess.SP cs.AI cs.LG

    Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks

    Authors: Guangjin Pan, Kaixuan Huang, Hui Chen, Shunqing Zhang, Christian Häger, Henk Wymeersch

    Abstract: Accurate and robust localization is a critical enabler for emerging 5G and 6G applications, including autonomous driving, extended reality (XR), and smart manufacturing. While data-driven approaches have shown promise, most existing models require large amounts of labeled data and struggle to generalize across deployment scenarios and wireless configurations. To address these limitations, we propo… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 13 pages,16 figures.This work has been submitted to the IEEE for possible publication

  9. arXiv:2505.09085  [pdf

    cs.LG cs.AI

    Human-like Cognitive Generalization for Large Models via Brain-in-the-loop Supervision

    Authors: Jiaxuan Chen, Yu Qi, Yueming Wang, Gang Pan

    Abstract: Recent advancements in deep neural networks (DNNs), particularly large-scale language models, have demonstrated remarkable capabilities in image and natural language understanding. Although scaling up model parameters with increasing volume of training data has progressively improved DNN capabilities, achieving complex cognitive abilities - such as understanding abstract concepts, reasoning, and a… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  10. arXiv:2505.08523  [pdf, ps, other

    cs.IT eess.SP

    Dual-UAV-Enabled Secure Communication and Sensing for A2G-ISAC Systems with Maneuverable Jamming

    Authors: Libiao Lou, Yuan Liu, Fotis Foukalas, Hongjiang Lei, Gaofeng Pan, Theodoros A. Tsiftsis, Hongwu Liu

    Abstract: In this paper, we propose a dual-unmanned aerial vehicle (UAV)-enabled secure communication and sensing (SCS) scheme for an air-to-ground integrated sensing and communication (ISAC) system, in which a dual-functional source UAV and jamming UAV collaborate to enhance both the secure communication and target sensing performance. From a perspective of hybrid monostatitc-bistatic radar, the jamming UA… ▽ More

    Submitted 18 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 13 pages, submitted to IEEE Journal

  11. arXiv:2505.07715  [pdf, ps, other

    cs.CV cs.AI

    Hybrid Spiking Vision Transformer for Object Detection with Event Cameras

    Authors: Qi Xu, Jie Deng, Jiangrong Shen, Biwu Chen, Huajin Tang, Gang Pan

    Abstract: Event-based object detection has gained increasing attention due to its advantages such as high temporal resolution, wide dynamic range, and asynchronous address-event representation. Leveraging these advantages, Spiking Neural Networks (SNNs) have emerged as a promising approach, offering low energy consumption and rich spatiotemporal dynamics. To further enhance the performance of event-based ob… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  12. arXiv:2505.04165  [pdf, other

    cs.NE cs.AI

    TS-SNN: Temporal Shift Module for Spiking Neural Networks

    Authors: Kairong Yu, Tianqing Zhang, Qi Xu, Gang Pan, Hongwei Wang

    Abstract: Spiking Neural Networks (SNNs) are increasingly recognized for their biological plausibility and energy efficiency, positioning them as strong alternatives to Artificial Neural Networks (ANNs) in neuromorphic computing applications. SNNs inherently process temporal information by leveraging the precise timing of spikes, but balancing temporal feature utilization with low energy consumption remains… ▽ More

    Submitted 16 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML2025

  13. arXiv:2505.01780  [pdf, other

    eess.SP cs.AI cs.NI eess.SY

    Rate-Limited Closed-Loop Distributed ISAC Systems: An Autoencoder Approach

    Authors: Guangjin Pan, Zhixing Li, Ayça Özçelikkale, Christian Häger, Musa Furkan Keskin, Henk Wymeersch

    Abstract: In closed-loop distributed multi-sensor integrated sensing and communication (ISAC) systems, performance often hinges on transmitting high-dimensional sensor observations over rate-limited networks. In this paper, we first present a general framework for rate-limited closed-loop distributed ISAC systems, and then propose an autoencoder-based observation compression method to overcome the constrain… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: 6 pages, 15 figures. This work has been submitted to the IEEE for possible publication

  14. arXiv:2504.21444  [pdf, other

    cs.NI

    A Unified QoS-Aware Multiplexing Framework for Next Generation Immersive Communication with Legacy Wireless Applications

    Authors: Jihong Li, Shunqing Zhang, Tao Yu, Guangjin Pan, Kaixuan Huang, Xiaojing Chen, Yanzan Sun, Junyu Liu, Jiandong Li, Derrick Wing Kwan Ng

    Abstract: Immersive communication, including emerging augmented reality, virtual reality, and holographic telepresence, has been identified as a key service for enabling next-generation wireless applications. To align with legacy wireless applications, such as enhanced mobile broadband or ultra-reliable low-latency communication, network slicing has been widely adopted. However, attempting to statistically… ▽ More

    Submitted 2 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

  15. arXiv:2504.17676  [pdf, other

    eess.SP cs.IT

    UNILoc: Unified Localization Combining Model-Based Geometry and Unsupervised Learning

    Authors: Yuhao Zhang, Guangjin Pan, Musa Furkan Keskin, Ossi Kaltiokallio, Mikko Valkama, Henk Wymeersch

    Abstract: Accurate mobile device localization is critical for emerging 5G/6G applications such as autonomous vehicles and augmented reality. In this paper, we propose a unified localization method that integrates model-based and machine learning (ML)-based methods to reap their respective advantages by exploiting available map information. In order to avoid supervised learning, we generate training labels a… ▽ More

    Submitted 27 April, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

    Comments: 6 pages, submitted to IEEE conference

  16. arXiv:2504.10240  [pdf, other

    cs.AR cs.LG

    GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction

    Authors: Guanyuan Pan, Tiansheng Zhou, Bingtao Ma, Yaqi Wang, Jianxiang Zhao, Zhi Li, Yugui Lin, Pietro Lio, Shuai Wang

    Abstract: Circuit link prediction identifying missing component connections from incomplete netlists is crucial in automating analog circuit design. However, existing methods face three main challenges: 1) Insufficient use of topological patterns in circuit graphs reduces prediction accuracy; 2) Data scarcity due to the complexity of annotations hinders model generalization; 3) Limited adaptability to vario… ▽ More

    Submitted 18 May, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: Data will be made available on request. V2 Update: Optimized figures; Optimized and added experiments; Added references

  17. arXiv:2504.03643  [pdf, ps, other

    eess.SP cs.AI cs.HC

    Potential Indicator for Continuous Emotion Arousal by Dynamic Neural Synchrony

    Authors: Guandong Pan, Zhaobang Wu, Yaqian Yang, Xin Wang, Longzhao Liu, Zhiming Zheng, Shaoting Tang

    Abstract: The need for automatic and high-quality emotion annotation is paramount in applications such as continuous emotion recognition and video highlight detection, yet achieving this through manual human annotations is challenging. Inspired by inter-subject correlation (ISC) utilized in neuroscience, this study introduces a novel Electroencephalography (EEG) based ISC methodology that leverages a single… ▽ More

    Submitted 23 January, 2025; originally announced April 2025.

  18. arXiv:2503.18382  [pdf, other

    cs.CV cs.AI

    PP-FormulaNet: Bridging Accuracy and Efficiency in Advanced Formula Recognition

    Authors: Hongen Liu, Cheng Cui, Yuning Du, Yi Liu, Gang Pan

    Abstract: Formula recognition is an important task in document intelligence. It involves converting mathematical expressions from document images into structured symbolic formats that computers can easily work with. LaTeX is the most common format used for this purpose. In this work, we present PP-FormulaNet, a state-of-the-art formula recognition model that excels in both accuracy and efficiency. To meet t… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  19. arXiv:2503.18130  [pdf, other

    cs.LG cs.AI

    Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization

    Authors: Juntao Dai, Taiye Chen, Yaodong Yang, Qian Zheng, Gang Pan

    Abstract: Reinforcement learning from human feedback (RLHF) is an effective method for aligning large language models (LLMs) with human values. However, reward over-optimization remains an open challenge leading to discrepancies between the performance of LLMs under the reward model and the true human objectives. A primary contributor to reward over-optimization is the extrapolation error that arises when t… ▽ More

    Submitted 23 March, 2025; originally announced March 2025.

    Comments: Published as a conference paper at ICLR 2025

  20. arXiv:2503.09318  [pdf, other

    cs.DC cs.AR

    FpgaHub: Fpga-centric Hyper-heterogeneous Computing Platform for Big Data Analytics

    Authors: Zeke Wang, Jie Zhang, Hongjing Huang, Yingtao Li, Xueying Zhu, Mo Sun, Zihan Yang, De Ma, Huajing Tang, Gang Pan, Fei Wu, Bingsheng He, Gustavo Alonso

    Abstract: Modern data analytics requires a huge amount of computing power and processes a massive amount of data. At the same time, the underlying computing platform is becoming much more heterogeneous on both hardware and software. Even though specialized hardware, e.g., FPGA- or GPU- or TPU-based systems, often achieves better performance than a CPU-only system due to the slowing of Moore's law, such syst… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  21. arXiv:2502.13572  [pdf, other

    cs.HC

    Improving the Sparse Structure Learning of Spiking Neural Networks from the View of Compression Efficiency

    Authors: Jiangrong Shen, Qi Xu, Gang Pan, Badong Chen

    Abstract: The human brain utilizes spikes for information transmission and dynamically reorganizes its network structure to boost energy efficiency and cognitive capabilities throughout its lifespan. Drawing inspiration from this spike-based computation, Spiking Neural Networks (SNNs) have been developed to construct event-driven models that emulate this efficiency. Despite these advances, deep SNNs continu… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  22. arXiv:2502.09449  [pdf, other

    cs.NE

    Spiking Neural Networks for Temporal Processing: Status Quo and Future Prospects

    Authors: Chenxiang Ma, Xinyi Chen, Yanchen Li, Qu Yang, Yujie Wu, Guoqi Li, Gang Pan, Huajin Tang, Kay Chen Tan, Jibin Wu

    Abstract: Temporal processing is fundamental for both biological and artificial intelligence systems, as it enables the comprehension of dynamic environments and facilitates timely responses. Spiking Neural Networks (SNNs) excel in handling such data with high efficiency, owing to their rich neuronal dynamics and sparse activity patterns. Given the recent surge in the development of SNNs, there is an urgent… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  23. arXiv:2502.00983  [pdf, other

    cs.LG stat.ML

    CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

    Authors: Zhengzhe Zhang, Wenjia Meng, Haoliang Sun, Gang Pan

    Abstract: Context-based offline meta-reinforcement learning (OMRL) methods have achieved appealing success by leveraging pre-collected offline datasets to develop task representations that guide policy learning. However, current context-based OMRL methods often introduce spurious correlations, where task components are incorrectly correlated due to confounders. These correlations can degrade policy performa… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  24. arXiv:2502.00345  [pdf, other

    cs.LG cs.AI cs.MA

    The Composite Task Challenge for Cooperative Multi-Agent Reinforcement Learning

    Authors: Yurui Li, Yuxuan Chen, Li Zhang, Shijian Li, Gang Pan

    Abstract: The significant role of division of labor (DOL) in promoting cooperation is widely recognized in real-world applications.Many cooperative multi-agent reinforcement learning (MARL) methods have incorporated the concept of DOL to improve cooperation among agents.However, the tasks used in existing testbeds typically correspond to tasks where DOL is often not a necessary feature for achieving optimal… ▽ More

    Submitted 1 February, 2025; originally announced February 2025.

  25. arXiv:2501.14970  [pdf, other

    eess.SP cs.AI cs.LG

    AI-driven Wireless Positioning: Fundamentals, Standards, State-of-the-art, and Challenges

    Authors: Guangjin Pan, Yuan Gao, Yilin Gao, Zhiyong Zhong, Xiaoyu Yang, Xinyu Guo, Shugong Xu

    Abstract: Wireless positioning technologies hold significant value for applications in autonomous driving, extended reality (XR), unmanned aerial vehicles (UAVs), and more. With the advancement of artificial intelligence (AI), leveraging AI to enhance positioning accuracy and robustness has emerged as a field full of potential. Driven by the requirements and functionalities defined in the 3rd Generation Par… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 32 pages. This work has been submitted to the IEEE for possible publication

  26. arXiv:2501.02572  [pdf, other

    cs.NI cs.AI eess.SY

    Energy Optimization of Multi-task DNN Inference in MEC-assisted XR Devices: A Lyapunov-Guided Reinforcement Learning Approach

    Authors: Yanzan Sun, Jiacheng Qiu, Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaoyun Wang, Shuangfeng Han

    Abstract: Extended reality (XR), blending virtual and real worlds, is a key application of future networks. While AI advancements enhance XR capabilities, they also impose significant computational and energy challenges on lightweight XR devices. In this paper, we developed a distributed queue model for multi-task DNN inference, addressing issues of resource competition and queue coupling. In response to th… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: 13 pages, 7 figures. This work has been submitted to the IEEE for possible publication

  27. arXiv:2501.00824  [pdf, ps, other

    cs.CR cs.IT

    How Breakable Is Privacy: Probing and Resisting Model Inversion Attacks in Collaborative Inference

    Authors: Rongke Liu, Youwen Zhu, Dong Wang, Gaoning Pan, Xingyu He, Weizhi Meng

    Abstract: Collaborative inference (CI) improves computational efficiency for edge devices by transmitting intermediate features to cloud models. However, this process inevitably exposes feature representations to model inversion attacks (MIAs), enabling unauthorized data reconstruction. Despite extensive research, there is no established criterion for assessing the difficulty of MIA implementation, leaving… ▽ More

    Submitted 20 June, 2025; v1 submitted 1 January, 2025; originally announced January 2025.

    Comments: 15 pages, 5 figures, 6 tables. The experimental data have been corrected, and some explanations have been supplemented

  28. arXiv:2412.19055  [pdf, other

    cs.CV cs.LG

    SpectralKD: A Unified Framework for Interpreting and Distilling Vision Transformers via Spectral Analysis

    Authors: Huiyuan Tian, Bonan Xu, Shijian Li, Gang Pan

    Abstract: Knowledge Distillation (KD) has achieved widespread success in compressing large Vision Transformers (ViTs), but a unified theoretical framework for both ViTs and KD is still lacking. In this paper, we propose SpectralKD, a novel unified analytical framework that offers deeper insights into ViTs and optimizes KD via spectral analysis. Our model-wise analysis reveals that CaiT concentrates informat… ▽ More

    Submitted 30 January, 2025; v1 submitted 25 December, 2024; originally announced December 2024.

  29. arXiv:2412.15634  [pdf, other

    cs.SE

    Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model

    Authors: Xin Du, Shifan Ye, Qian Zheng, Yangfan Hu, Rui Yan, Shunyu Qi, Shuyang Chen, Huajin Tang, Gang Pan, Shuiguang Deng

    Abstract: Large language models (LLMs) have been widely applied in various practical applications, typically comprising billions of parameters, with inference processes requiring substantial energy and computational resources. In contrast, the human brain, employing bio-plausible spiking mechanisms, can accomplish the same tasks while significantly reducing energy consumption, even with a similar number of… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  30. arXiv:2412.12159  [pdf, other

    cs.LG cs.AI

    Personalized Sleep Staging Leveraging Source-free Unsupervised Domain Adaptation

    Authors: Yangxuan Zhou, Sha Zhao, Jiquan Wang, Haiteng Jiang, hijian Li, Benyan Luo, Tao Li, Gang Pan

    Abstract: Sleep staging is crucial for assessing sleep quality and diagnosing related disorders. Recent deep learning models for automatic sleep staging using polysomnography often suffer from poor generalization to new subjects because they are trained and tested on the same labeled datasets, overlooking individual differences. To tackle this issue, we propose a novel Source-Free Unsupervised Individual Do… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 9 pages, 6 figures

  31. arXiv:2412.11812  [pdf, other

    cs.CV

    CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector

    Authors: Tianheng Qiu, Ka Lung Law, Guanghua Pan, Jufei Wang, Xin Gao, Xuan Huang, Hu Wei

    Abstract: Unsupervised domain adaptive (UDA) algorithms can markedly enhance the performance of object detectors under conditions of domain shifts, thereby reducing the necessity for extensive labeling and retraining. Current domain adaptive object detection algorithms primarily cater to two-stage detectors, which tend to offer minimal improvements when directly applied to single-stage detectors such as YOL… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  32. arXiv:2412.11138  [pdf, other

    cs.LG cs.AI

    Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

    Authors: Juntao Dai, Yaodong Yang, Qian Zheng, Gang Pan

    Abstract: A key aspect of Safe Reinforcement Learning (Safe RL) involves estimating the constraint condition for the next policy, which is crucial for guiding the optimization of safe policy updates. However, the existing Advantage-based Estimation (ABE) method relies on the infinite-horizon discounted advantage function. This dependence leads to catastrophic errors in finite-horizon scenarios with non-disc… ▽ More

    Submitted 15 December, 2024; originally announced December 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:9872-9903, 2024

  33. arXiv:2412.09849  [pdf, ps, other

    eess.SP cs.AI

    Deep Learning for Spectrum Prediction in Cognitive Radio Networks: State-of-the-Art, New Opportunities, and Challenges

    Authors: Guangliang Pan, David K. Y. Yau, Bo Zhou, Qihui Wu

    Abstract: Spectrum prediction is considered to be a promising technology that enhances spectrum efficiency by assisting dynamic spectrum access (DSA) in cognitive radio networks (CRN). Nonetheless, the highly nonlinear nature of spectrum data across time, frequency, and space domains, coupled with the intricate spectrum usage patterns, poses challenges for accurate spectrum prediction. Deep learning (DL), r… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

  34. arXiv:2412.07236  [pdf, other

    eess.SP cs.AI cs.LG q-bio.NC

    CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding

    Authors: Jiquan Wang, Sha Zhao, Zhiling Luo, Yangxuan Zhou, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

    Abstract: Electroencephalography (EEG) is a non-invasive technique to measure and record brain electrical activity, widely used in various BCI and healthcare applications. Early EEG decoding methods rely on supervised learning, limited by specific tasks and datasets, hindering model performance and generalizability. With the success of large language models, there is a growing body of studies focusing on EE… ▽ More

    Submitted 13 April, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted by The Thirteenth International Conference on Learning Representations (ICLR 2025)

  35. arXiv:2412.06720  [pdf, other

    cs.CV cs.CL

    VP-MEL: Visual Prompts Guided Multimodal Entity Linking

    Authors: Hongze Mi, Jinyuan Li, Xuying Zhang, Haoran Cheng, Jiahao Wang, Di Sun, Gang Pan

    Abstract: Multimodal entity linking (MEL), a task aimed at linking mentions within multimodal contexts to their corresponding entities in a knowledge base (KB), has attracted much attention due to its wide applications in recent years. However, existing MEL methods often rely on mention words as retrieval cues, which limits their ability to effectively utilize information from both images and text. This rel… ▽ More

    Submitted 15 February, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

  36. arXiv:2412.03957  [pdf, other

    cs.CV cs.AI

    A Framework For Image Synthesis Using Supervised Contrastive Learning

    Authors: Yibin Liu, Jianyu Zhang, Li Zhang, Shijian Li, Gang Pan

    Abstract: Text-to-image (T2I) generation aims at producing realistic images corresponding to text descriptions. Generative Adversarial Network (GAN) has proven to be successful in this task. Typical T2I GANs are 2 phase methods that first pretrain an inter-modal representation from aligned image-text pairs and then use GAN to train image generator on that basis. However, such representation ignores the inne… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

  37. arXiv:2411.17045  [pdf, other

    cs.SE

    Redefining Crowdsourced Test Report Prioritization: An Innovative Approach with Large Language Model

    Authors: Yuchen Ling, Shengcheng Yu, Chunrong Fang, Guobin Pan, Jun Wang, Jia Liu

    Abstract: Context: Crowdsourced testing has gained popularity in software testing, especially for mobile app testing, due to its ability to bring diversity and tackle fragmentation issues. However, the openness of crowdsourced testing presents challenges, particularly in the manual review of numerous test reports, which is time-consuming and labor-intensive. Objective: The primary goal of this research is t… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: Accepted by Information and Software Technology in Nov 2024

  38. arXiv:2410.15885  [pdf, other

    cs.AI

    How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?

    Authors: Zuojin Tang, Bin Hu, Chenyang Zhao, De Ma, Gang Pan, Bin Liu

    Abstract: Existing large pre-trained models typically map text input to text output in an end-to-end manner, such as ChatGPT, or map a segment of text input to a hierarchy of action decisions, such as OpenVLA. However, humans can simultaneously generate text and actions when receiving specific input signals. For example, a driver can make precise driving decisions while conversing with a friend in the passe… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  39. arXiv:2410.15689  [pdf, other

    cs.CV cs.LG cs.NE

    Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model

    Authors: Shibo Zhou, Bo Yang, Mengwen Yuan, Runhao Jiang, Rui Yan, Gang Pan, Huajin Tang

    Abstract: Spiking Neural Networks (SNNs), renowned for their low power consumption, brain-inspired architecture, and spatio-temporal representation capabilities, have garnered considerable attention in recent years. Similar to Artificial Neural Networks (ANNs), high-quality benchmark datasets are of great importance to the advances of SNNs. However, our analysis indicates that many prevalent neuromorphic da… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  40. arXiv:2410.07266  [pdf, other

    cs.CV

    Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian Splatting

    Authors: Weixing Zhang, Zongrui Li, De Ma, Huajin Tang, Xudong Jiang, Qian Zheng, Gang Pan

    Abstract: 3D Gaussian Splatting is capable of reconstructing 3D scenes in minutes. Despite recent advances in improving surface reconstruction accuracy, the reconstructed results still exhibit bias and suffer from inefficiency in storage and training. This paper provides a different observation on the cause of the inefficiency and the reconstruction bias, which is attributed to the integration of the low-op… ▽ More

    Submitted 3 December, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  41. arXiv:2410.05684  [pdf, other

    cs.HC cs.AI cs.CL

    Copiloting Diagnosis of Autism in Real Clinical Scenarios via LLMs

    Authors: Yi Jiang, Qingyang Shen, Shuzhong Lai, Shunyu Qi, Qian Zheng, Lin Yao, Yueming Wang, Gang Pan

    Abstract: Autism spectrum disorder(ASD) is a pervasive developmental disorder that significantly impacts the daily functioning and social participation of individuals. Despite the abundance of research focused on supporting the clinical diagnosis of ASD, there is still a lack of systematic and comprehensive exploration in the field of methods based on Large Language Models (LLMs), particularly regarding the… ▽ More

    Submitted 9 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  42. arXiv:2409.08579  [pdf, ps, other

    cs.IT

    Secure Offloading in NOMA-Aided Aerial MEC Systems Based on Deep Reinforcement Learning

    Authors: Hongjiang Lei, Mingxu Yang, Ki-Hong Park, Gaofeng Pan

    Abstract: Mobile edge computing (MEC) technology can reduce user latency and energy consumption by offloading computationally intensive tasks to the edge servers. Unmanned aerial vehicles (UAVs) and non-orthogonal multiple access (NOMA) technology enable the MEC networks to provide offloaded computing services for massively accessed terrestrial users conveniently. However, the broadcast nature of signal pro… ▽ More

    Submitted 11 October, 2024; v1 submitted 13 September, 2024; originally announced September 2024.

    Comments: 12 pages, 7 figures, accepted by IEEE Journal on Miniaturization for Air and Space Systems

  43. arXiv:2409.02111  [pdf, other

    cs.LG

    Toward Large-scale Spiking Neural Networks: A Comprehensive Survey and Future Directions

    Authors: Yangfan Hu, Qian Zheng, Guoqi Li, Huajin Tang, Gang Pan

    Abstract: Deep learning has revolutionized artificial intelligence (AI), achieving remarkable progress in fields such as computer vision, speech recognition, and natural language processing. Moreover, the recent success of large language models (LLMs) has fueled a surge in research on large-scale neural networks. However, the escalating demand for computing resources and energy consumption has prompted the… ▽ More

    Submitted 19 August, 2024; originally announced September 2024.

  44. arXiv:2408.16564  [pdf, other

    cs.MM cs.SD eess.AS

    Human-Inspired Audio-Visual Speech Recognition: Spike Activity, Cueing Interaction and Causal Processing

    Authors: Qianhui Liu, Jiadong Wang, Yang Wang, Xin Yang, Gang Pan, Haizhou Li

    Abstract: Humans naturally perform audiovisual speech recognition (AVSR), enhancing the accuracy and robustness by integrating auditory and visual information. Spiking neural networks (SNNs), which mimic the brain's information-processing mechanisms, are well-suited for emulating the human capability of AVSR. Despite their potential, research on SNNs for AVSR is scarce, with most existing audio-visual multi… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  45. arXiv:2407.20947  [pdf, other

    cs.NE

    An Asynchronous Multi-core Accelerator for SNN inference

    Authors: Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, Gang Pan

    Abstract: Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propo… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  46. arXiv:2407.20852  [pdf, other

    cs.NI cs.MM eess.SY

    Optimizing 5G-Advanced Networks for Time-critical Applications: The Role of L4S

    Authors: Guangjin Pan, Shugong Xu, Pin Jiang

    Abstract: As 5G networks strive to support advanced time-critical applications, such as immersive Extended Reality (XR), cloud gaming, and autonomous driving, the demand for Real-time Broadband Communication (RTBC) grows. In this article, we present the main mechanisms of Low Latency, Low Loss, and Scalable Throughput (L4S). Subsequently, we investigate the support and challenges of L4S technology in the la… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures. This work has been submitted to the IEEE for possible publication

  47. arXiv:2407.19271  [pdf, other

    cs.CV eess.IV

    Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network

    Authors: Gang Pan, Chen Wang, Zhijie Sui, Shuai Guo, Yaozhi Lv, Honglie Li, Di Sun, Zixia Xia

    Abstract: The Quick-view (QV) technique serves as a primary method for detecting defects within sewerage systems. However, the effectiveness of QV is impeded by the limited visual range of its hardware, resulting in suboptimal image quality for distant portions of the sewer network. Image super-resolution is an effective way to improve image quality and has been applied in a variety of scenes. However, rese… ▽ More

    Submitted 25 February, 2025; v1 submitted 27 July, 2024; originally announced July 2024.

  48. Semi-Supervised Pipe Video Temporal Defect Interval Localization

    Authors: Zhu Huang, Gang Pan, Chao Kang, YaoZhi Lv

    Abstract: In sewer pipe Closed-Circuit Television (CCTV) inspection, accurate temporal defect localization is essential for effective defect classification, detection, segmentation and quantification. Industry standards typically do not require time-interval annotations, even though they are more informative than time-point annotations for defect localization, resulting in additional annotation costs when f… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 13 pages, 3 figures

  49. arXiv:2407.07314  [pdf, ps, other

    cs.IT

    Proactive Eavesdropping in Relay Systems via Trajectory and Power Optimization

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Weijia Lei, Gaofeng Pan

    Abstract: Wireless relays can effectively extend the transmission range of information. However, if relay technology is utilized unlawfully, it can amplify potential harm. Effectively surveilling illegitimate relay links poses a challenging problem. Unmanned aerial vehicles (UAVs) can proactively surveil wireless relay systems due to their flexible mobility. This work focuses on maximizing the eavesdropping… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages, 8 figures, submitted to IEEE Journal for review

  50. arXiv:2407.06521  [pdf, ps, other

    cs.IT eess.SP

    Beamforming Design for Joint Target Sensing and Proactive Eavesdropping

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: This work studies the beamforming design in the joint target sensing and proactive eavesdropping (JTSAPE) system. The JTSAPE base station (BS) receives the information transmitted by the illegal transmitter and transmits the waveform for target sensing. The shared waveform also serves as artificial noise to interfere with the illegal receiver, thereby achieving proactive eavesdropping. We firstly… ▽ More

    Submitted 2 May, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 7 figures, submitted to IEEE Journal for review