Skip to main content

Showing 1–50 of 187 results for author: Ai, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.04621  [pdf, ps, other

    cs.LG cs.AI cs.NI

    Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences

    Authors: Yusong Zhang, Yuxuan Sun, Lei Guo, Wei Chen, Bo Ai, Deniz Gunduz

    Abstract: 6G networks promise revolutionary immersive communication experiences including augmented reality (AR), virtual reality (VR), and holographic communications. These applications demand high-dimensional multimodal data transmission and intelligent data processing in real-time, which is extremely challenging over resource-limited wireless communication systems. Moreover, a joint understanding of the… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  2. arXiv:2507.01876  [pdf, ps, other

    cs.IT eess.SP

    Joint Power Control and Precoding for Cell-Free Massive MIMO Systems With Sparse Multi-Dimensional Graph Neural Networks

    Authors: Yukun Ma, Jiayi Zhang, Ziheng Liu, Guowei Shi, Bo Ai

    Abstract: Cell-free massive multiple-input multiple-output (CF mMIMO) has emerged as a prominent candidate for future networks due to its ability to significantly enhance spectral efficiency by eliminating inter-cell interference. However, its practical deployment faces considerable challenges, such as high computational complexity and the optimization of its complex processing. To address these challenges,… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 5 pages, 5 figures

  3. arXiv:2506.21876  [pdf, ps, other

    cs.CL cs.AI cs.CV

    Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

    Authors: Qiyue Gao, Xinyu Pi, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianmin Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu

    Abstract: Internal world models (WMs) enable agents to understand the world's state and predict transitions, serving as the basis for advanced deliberative reasoning. Recent large Vision-Language Models (VLMs), such as OpenAI o3, GPT-4o and Gemini, exhibit potential as general-purpose WMs. While the latest studies have evaluated and shown limitations in specific capabilities such as visual understanding, a… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: ACL 2025 (Findings)

  4. arXiv:2506.07599  [pdf, ps, other

    cs.IT eess.SP

    Flexible MIMO for Future Wireless Communications: Which Flexibilities are Possible?

    Authors: Zhe Wang, Jiayi Zhang, Bokai Xu, Wenhui Yi, Emil Björnson, Bo Ai

    Abstract: To enable next-generation wireless communication networks with modest spectrum availability, multiple-input multiple-output (MIMO) technology needs to undergo further evolution. In this paper, we introduce a promising next-generation wireless communication concept: flexible MIMO technology. This technology represents a MIMO technology with flexible physical configurations and integrated applicatio… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 9 pages, 5 figures, 1 table

  5. arXiv:2506.02353  [pdf, ps, other

    cs.RO

    SAVOR: Skill Affordance Learning from Visuo-Haptic Perception for Robot-Assisted Bite Acquisition

    Authors: Zhanxin Wu, Bo Ai, Tom Silver, Tapomayukh Bhattacharjee

    Abstract: Robot-assisted feeding requires reliable bite acquisition, a challenging task due to the complex interactions between utensils and food with diverse physical properties. These interactions are further complicated by the temporal variability of food properties-for example, steak becomes firm as it cools even during a meal. To address this, we propose SAVOR, a novel approach for learning skill affor… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  6. arXiv:2505.24307  [pdf, ps, other

    cs.IT eess.SP

    Multi-Waveguide Pinching Antennas for ISAC

    Authors: Weihao Mao, Yang Lu, Yanqing Xu, Bo Ai, Octavia A. Dobre, Dusit Niyato

    Abstract: Recently, a novel flexible-antenna technology, called pinching antennas, has attracted growing academic interest. By inserting discrete dielectric materials, pinching antennas can be activated at arbitrary points along waveguides, allowing for flexible customization of large-scale path loss. This paper investigates a multi-waveguide pinching-antenna integrated sensing and communications (ISAC) sys… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  7. arXiv:2505.16350  [pdf, ps, other

    cs.IT

    Sensing-Enhanced Handover Criterion for Low-Altitude Wireless Networks (LAWNs)

    Authors: Jingli Li, Yiyan Ma, Bo Ai, Qingqing Cheng, Guoyu Ma, Mi Yang, Yunlong Lu, Wenwei Yue, Zhangdui Zhong

    Abstract: With the rapid growth of the low-altitude economy, the demand for cellular-enabled low-altitude wireless networks (LAWNs) is rising significantly. The three-dimensional mobility of unmanned aerial vehicles (UAVs) will lead to frequent handovers (HOs) in cellular networks, while traditional reference signal received power (RSRP)-based criteria may fail to capture the dynamic environment, causing re… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 5 pages, 7 figures, submitted to IEEE TVT Correspondence

  8. arXiv:2505.05753  [pdf, other

    cs.RO cs.AI cs.LG

    Towards Embodiment Scaling Laws in Robot Locomotion

    Authors: Bo Ai, Liu Dai, Nico Bohlinger, Dichen Li, Tongzhou Mu, Zhanxin Wu, K. Fay, Henrik I. Christensen, Jan Peters, Hao Su

    Abstract: Developing generalist agents that can operate across diverse tasks, environments, and physical embodiments is a grand challenge in robotics and artificial intelligence. In this work, we focus on the axis of embodiment and investigate embodiment scaling laws$\unicode{x2013}$the hypothesis that increasing the number of training embodiments improves generalization to unseen ones. Using robot locomoti… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: 32 pages. Project website: https://embodiment-scaling-laws.github.io/

  9. arXiv:2504.15737  [pdf, ps, other

    cs.IT eess.SP

    Energy-Efficient SIM-assisted Communications: How Many Layers Do We Need?

    Authors: Enyu Shi, Jiayi Zhang, Jiancheng An, Marco Di Renzo, Bo Ai, Chau Yuen

    Abstract: The stacked intelligent metasurface (SIM), comprising multiple layers of reconfigurable transmissive metasurfaces, is becoming an increasingly viable solution for future wireless communication systems. In this paper, we explore the integration of SIM in a multi-antenna base station for application to downlink multi-user communications, and a realistic power consumption model for SIM-assisted syste… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 14 pages, 10 figures

  10. arXiv:2504.10836  [pdf, other

    eess.SP cs.AI

    Uplink Assisted Joint Channel Estimation and CSI Feedback: An Approach Based on Deep Joint Source-Channel Coding

    Authors: Yiran Guo, Wei Chen, Bo Ai

    Abstract: In frequency division duplex (FDD) multiple-input multiple-output (MIMO) wireless communication systems, the acquisition of downlink channel state information (CSI) is essential for maximizing spatial resource utilization and improving system spectral efficiency. The separate design of modules in AI-based CSI feedback architectures under traditional modular communication frameworks, including chan… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  11. arXiv:2504.09138  [pdf, other

    cs.IT

    White-Box AI Model: Next Frontier of Wireless Communications

    Authors: Jiayao Yang, Jiayi Zhang, Bokai Xu, Jiakang Zheng, Zhilong Liu, Ziheng Liu, Dusit Niyato, Mérouane Debbah, Zhu Han, Bo Ai

    Abstract: White-box AI (WAI), or explainable AI (XAI) model, a novel tool to achieve the reasoning behind decisions and predictions made by the AI algorithms, makes it more understandable and transparent. It offers a new approach to address key challenges of interpretability and mathematical validation in traditional black-box models. In this paper, WAI-aided wireless communication systems are proposed and… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  12. arXiv:2503.20314  [pdf, other

    cs.CV

    Wan: Open and Advanced Large-Scale Video Generative Models

    Authors: Team Wan, Ang Wang, Baole Ai, Bin Wen, Chaojie Mao, Chen-Wei Xie, Di Chen, Feiwu Yu, Haiming Zhao, Jianxiao Yang, Jianyuan Zeng, Jiayu Wang, Jingfeng Zhang, Jingren Zhou, Jinkai Wang, Jixuan Chen, Kai Zhu, Kang Zhao, Keyu Yan, Lianghua Huang, Mengyang Feng, Ningyi Zhang, Pandeng Li, Pingyu Wu, Ruihang Chu , et al. (37 additional authors not shown)

    Abstract: This report presents Wan, a comprehensive and open suite of video foundation models designed to push the boundaries of video generation. Built upon the mainstream diffusion transformer paradigm, Wan achieves significant advancements in generative capabilities through a series of innovations, including our novel VAE, scalable pre-training strategies, large-scale data curation, and automated evaluat… ▽ More

    Submitted 18 April, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: 60 pages, 33 figures

  13. arXiv:2503.20208  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Adaptive Dexterous Grasping from Single Demonstrations

    Authors: Liangzhi Shi, Yulin Liu, Lingqi Zeng, Bo Ai, Zhengdong Hong, Hao Su

    Abstract: How can robots learn dexterous grasping skills efficiently and apply them adaptively based on user instructions? This work tackles two key challenges: efficient skill acquisition from limited human demonstrations and context-driven skill selection. We introduce AdaDexGrasp, a framework that learns a library of grasping skills from a single human demonstration per skill and selects the most suitabl… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

  14. arXiv:2503.17777  [pdf, ps, other

    eess.IV cs.CV

    Hierarchy-Aware and Channel-Adaptive Semantic Communication for Bandwidth-Limited Data Fusion

    Authors: Lei Guo, Wei Chen, Yuxuan Sun, Bo Ai, Nikolaos Pappas, Tony Quek

    Abstract: Obtaining high-resolution hyperspectral images (HR-HSI) is costly and data-intensive, making it necessary to fuse low-resolution hyperspectral images (LR-HSI) with high-resolution RGB images (HR-RGB) for practical applications. However, traditional fusion techniques, which integrate detailed information into the reconstruction, significantly increase bandwidth consumption compared to directly tran… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: Accepted by the WCL

  15. arXiv:2503.13468  [pdf, other

    eess.SP cs.LG

    A CGAN-LSTM-Based Framework for Time-Varying Non-Stationary Channel Modeling

    Authors: Keying Guo, Ruisi He, Mi Yang, Yuxin Zhang, Bo Ai, Haoxiang Zhang, Jiahui Han, Ruifeng Chen

    Abstract: Time-varying non-stationary channels, with complex dynamic variations and temporal evolution characteristics, have significant challenges in channel modeling and communication system performance evaluation. Most existing methods of time-varying channel modeling focus on predicting channel state at a given moment or simulating short-term channel fluctuations, which are unable to capture the long-te… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: 11 pages,7 figures

  16. arXiv:2503.11999  [pdf, other

    cs.RO cs.CV eess.SY

    Diffusion Dynamics Models with Generative State Estimation for Cloth Manipulation

    Authors: Tongxuan Tian, Haoyang Li, Bo Ai, Xiaodi Yuan, Zhiao Huang, Hao Su

    Abstract: Manipulating deformable objects like cloth is challenging due to their complex dynamics, near-infinite degrees of freedom, and frequent self-occlusions, which complicate state estimation and dynamics modeling. Prior work has struggled with robust cloth state estimation, while dynamics models, primarily based on Graph Neural Networks (GNNs), are limited by their locality. Inspired by recent advance… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

  17. arXiv:2503.08985  [pdf, ps, other

    cs.IT eess.SP

    Channel Estimation for Rydberg Atomic Receivers

    Authors: Bokai Xu, Jiayi Zhang, Zhongtao Chen, Bingyang Cheng, Ziheng Liu, Yik-Chung Wu, Bo Ai

    Abstract: The rapid development of the quantum technology presents huge opportunities for 6G communications. Leveraging the quantum properties of highly excited Rydberg atoms, Rydberg atom-based antennas present distinct advantages, such as high sensitivity, broad frequency range, and compact size, over traditional antennas. To realize efficient precoding, accurate channel state information is essential. Ho… ▽ More

    Submitted 9 June, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  18. arXiv:2503.07189  [pdf, ps, other

    cs.IT eess.SP

    Beamforming Design for Beyond Diagonal RIS-Aided Cell-Free Massive MIMO Systems

    Authors: Yizhuo Li, Jiakang Zheng, Bokai Xu, Yiyang Zhu, Jiayi Zhang, Bo Ai

    Abstract: Reconfigurable intelligent surface (RIS)-aided cell-free (CF) massive multiple-input multiple-output (mMIMO) is a promising architecture for further improving spectral efficiency (SE) with low cost and power consumption. However, conventional RIS has inevitable limitations due to its capability of only reflecting signals. In contrast, beyond-diagonal RIS (BD-RIS), with its ability to both reflect… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  19. Optimal Bilinear Equalizer Beamforming Design for Cell-Free Massive MIMO Networks with Arbitrary Channel Estimators

    Authors: Zhe Wang, Jiayi Zhang, Hao Lei, Dusit Niyato, Bo Ai

    Abstract: This paper studies the distributed optimal bilinear equalizer (OBE) beamforming design for both the uplink and downlink cell-free massive multiple-input multiple-output networks. We consider arbitrary statistics-based channel estimators over spatially correlated Rician fading channels. In the uplink, we derive the achievable spectral efficiency (SE) performance and OBE combining schemes with arbit… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: 6 pages, 3 figures. This paper has been accepted by IEEE Transactions on Vehicular Technology

  20. arXiv:2502.19675  [pdf, other

    cs.IT eess.SP

    Joint Power Allocation and Phase Shift Design for Stacked Intelligent Metasurfaces-aided Cell-Free Massive MIMO Systems with MARL

    Authors: Yiyang Zhu, Jiayi Zhang, Enyu Shi, Ziheng Liu, Chau Yuen, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (mMIMO) systems offer high spectral efficiency (SE) through multiple distributed access points (APs). However, the large number of antennas increases power consumption. We propose incorporating stacked intelligent metasurfaces (SIM) into CF mMIMO systems as a cost-effective, energy-efficient solution. This paper focuses on optimizing the joint… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  21. arXiv:2502.05812  [pdf, other

    cs.IT eess.SY

    Multi-Agent Reinforcement Learning in Wireless Distributed Networks for 6G

    Authors: Jiayi Zhang, Ziheng Liu, Yiyang Zhu, Enyu Shi, Bokai Xu, Chau Yuen, Dusit Niyato, Mérouane Debbah, Shi Jin, Bo Ai, Xuemin, Shen

    Abstract: The introduction of intelligent interconnectivity between the physical and human worlds has attracted great attention for future sixth-generation (6G) networks, emphasizing massive capacity, ultra-low latency, and unparalleled reliability. Wireless distributed networks and multi-agent reinforcement learning (MARL), both of which have evolved from centralized paradigms, are two promising solutions… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

  22. Vision Aided Channel Prediction for Vehicular Communications: A Case Study of Received Power Prediction Using RGB Images

    Authors: Xuejian Zhang, Ruisi He, Mi Yang, Zhengyu Zhang, Ziyi Qi, Bo Ai

    Abstract: The communication scenarios and channel characteristics of 6G will be more complex and difficult to characterize. Conventional methods for channel prediction face challenges in achieving an optimal balance between accuracy, practicality, and generalizability. Additionally, they often fail to effectively leverage environmental features. Within the framework of integration communication and artifici… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

    Comments: 12 pages, 11 figures, submitted to IEEE Transactions on Vehicular Technology

  23. arXiv:2501.17303  [pdf

    eess.SP cs.IT physics.ins-det

    Measurement-Based Modeling and Analysis of UAV Air-Ground Channels at 1 and 4 GHz

    Authors: Zhuangzhuang Cui, Cesar Briso-Rodriguez, Ke Guan, Cesar Calvo-Ramirez, Bo Ai, Zhangdui Zhong

    Abstract: In the design of unmanned aerial vehicle (UAV) wireless communications, a better understanding of propagation characteristics and an accurate channel model are required. Measurements and comprehensive analysis for the UAV-based air-ground (AG) propagation channel in the vertical dimension are presented in this letter. Based on the measurement data at 1 and 4 GHz, the large-scale and small-scale ch… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  24. Measurement-Based Non-Stationary Markov Tapped Delay Line Channel Model for 5G-Railways

    Authors: Xuejian Zhang, Ruisi He, Mi Yang, Jianwen Ding, Ruifeng Chen, Shuaiqi Gao, Ziyi Qi, Zhengyu Zhang, Bo Ai, Zhangdui Zhong

    Abstract: 5G for Railways (5G-R) is globally recognized as a promising next-generation railway communication system designed to meet increasing demands. Channel modeling serves as foundation for communication system design, with tapped delay line (TDL) models widely utilized in system simulations due to their simplicity and practicality and serves as a crucial component of various standards like 3GPP. Howev… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

    Comments: 5 pages, 4 figures, submitted to IEEE Antennas and Wireless Propagation Letters

  25. arXiv:2501.15726  [pdf, other

    cs.IT eess.SP

    Vision-Aided Channel Prediction Based on Image Segmentation at Street Intersection Scenarios

    Authors: Xuejian Zhang, Ruisi He, Mi Yang, Ziyi Qi, Zhengyu Zhang, Bo Ai, Zhangdui Zhong

    Abstract: Intelligent vehicular communication with vehicle road collaboration capability is a key technology enabled by 6G, and the integration of various visual sensors on vehicles and infrastructures plays a crucial role. Moreover, accurate channel prediction is foundational to realizing intelligent vehicular communication. Traditional methods are still limited by the inability to balance accuracy and ope… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

    Comments: 12 pages, 9 figures, submitted to IEEE Transactions on Cognitive Communications and Networking

  26. arXiv:2501.15091  [pdf, other

    cs.IT eess.SP

    Deep Reinforcement Learning for Energy Efficiency Maximization in RSMA-IRS-Assisted ISAC System

    Authors: Zhangfeng Ma, Ruichen Zhang, Bo Ai, Zhuxian Lian, Linzhou Zeng, Dusit Niyato

    Abstract: This paper proposes a three-dimensional (3D) geometry-based channel model to accurately represent intelligent reflecting surfaces (IRS)-enhanced integrated sensing and communication (ISAC) networks using rate-splitting multiple access (RSMA) in practical urban environments. Based on this model, we formulate an energy efficiency (EE) maximization problem that incorporates transceiver beamforming co… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

    Comments: 5 pages, 4 figures

  27. arXiv:2501.13403  [pdf, ps, other

    eess.SP cs.IT

    ROMA: ROtary and Movable Antenna

    Authors: Jiayi Zhang, Wenhui Yi, Bokai Xu, Zhe Wang, Huahua Xiao, Bo Ai

    Abstract: The rotary and movable antenna (ROMA) architecture represents a next-generation multi-antenna technology that enables flexible adjustment of antenna position and array rotation angles of the transceiver. In this letter, we propose a ROMA-aided multi-user MIMO communication system to fully enhance the efficiency and reliability of system transmissions. By deploying ROMA panels at both the transmitt… ▽ More

    Submitted 23 April, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: Rotary and movable antennas, multi-user MIMO, spectral efficiency, alternating optimization

  28. arXiv:2412.20943  [pdf, other

    cs.IT

    Cluster-Based Time-Variant Channel Characterization and Modeling for 5G-Railways

    Authors: Xuejian Zhang, Ruisi He, Bo Ai, Mi Yang, Jianwen Ding, Shuaiqi Gao, Ziyi Qi, Zhengyu Zhang, Zhangdui Zhong

    Abstract: With the development of high-speed railways, 5G for Railways (5G-R) is gradually replacing Global System for the Mobile Communications for Railway (GSM-R) worldwide to meet increasing demands. The large bandwidth, array antennas, and non-stationarity caused by high mobility has made 5G-R channel characterization more complex. Therefore, it is essential to develop an accurate channel model for 5G-R… ▽ More

    Submitted 30 December, 2024; originally announced December 2024.

    Comments: 13 pages, 13 figures, submitted to IEEE Transactions on Wireless Communications

  29. arXiv:2412.06178  [pdf, other

    cs.IT eess.SP

    Deep Unfolding Beamforming and Power Control Designs for Multi-Port Matching Networks

    Authors: Bokai Xu, Jiayi Zhang, Qingfeng Lin, Huahua Xiao, Yik-Chung Wu, Bo Ai

    Abstract: The key technologies of sixth generation (6G), such as ultra-massive multiple-input multiple-output (MIMO), enable intricate interactions between antennas and wireless propagation environments. As a result, it becomes necessary to develop joint models that encompass both antennas and wireless propagation channels. To achieve this, we utilize the multi-port communication theory, which considers imp… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

  30. arXiv:2412.03940  [pdf, other

    eess.SP cs.IT

    Performance Analysis of XL-MIMO with Rotary and Movable Antennas for High-speed Railway

    Authors: Wenhui Yi, Jiayi Zhang, Zhe Wang, Huahua Xiao, Bo Ai

    Abstract: The rotary and movable antennas (ROMA) technology is efficient in enhancing wireless network capacity by adjusting both the antenna spacing and three-dimensional (3D) rotation of antenna surfaces, based on the spatial distribution of users and channel statistics. Applying ROMA to high-speed rail (HSR) wireless communications can significantly improve system performance in terms of array gain and s… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: XL-MIMO, high-speed railway, ROMA, spatial correlation, capacity

  31. arXiv:2412.02581  [pdf, other

    cs.IT eess.SP

    Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework

    Authors: Ziheng Liu, Jiayi Zhang, Yiyang Zhu, Enyu Shi, Bo Ai

    Abstract: Cell-free massive multiple-input multiple-output (mMIMO) offers significant advantages in mobility scenarios, mainly due to the elimination of cell boundaries and strong macro diversity. In this paper, we examine the downlink performance of cell-free mMIMO systems equipped with mobile-APs utilizing the concept of unmanned aerial vehicles, where mobility and power control are jointly considered to… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  32. arXiv:2412.01029  [pdf, ps, other

    eess.SP cs.IT

    Deep Learning Based Near-Field User Localization with Beam Squint in Wideband XL-MIMO Systems

    Authors: Hao Lei, Jiayi Zhang, Huahua Xiao, Derrick Wing Kwan Ng, Bo Ai

    Abstract: Extremely large-scale multiple-input multiple-output (XL-MIMO) is gaining attention as a prominent technology for enabling the sixth-generation (6G) wireless networks. However, the vast antenna array and the huge bandwidth introduce a non-negligible beam squint effect, causing beams of different frequencies to focus at different locations. One approach to cope with this is to employ true-time-dela… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

  33. arXiv:2411.11798  [pdf

    cs.IT cs.AI eess.SP

    COST CA20120 INTERACT Framework of Artificial Intelligence Based Channel Modeling

    Authors: Ruisi He, Nicola D. Cicco, Bo Ai, Mi Yang, Yang Miao, Mate Boban

    Abstract: Accurate channel models are the prerequisite for communication-theoretic investigations as well as system design. Channel modeling generally relies on statistical and deterministic approaches. However, there are still significant limits for the traditional modeling methods in terms of accuracy, generalization ability, and computational complexity. The fundamental reason is that establishing a quan… ▽ More

    Submitted 31 October, 2024; originally announced November 2024.

    Comments: to appear in IEEE Wireless Communications Magazine

  34. arXiv:2411.11070  [pdf, ps, other

    cs.IT eess.SP

    Joint Precoding and AP Selection for Energy Efficient RIS-aided Cell-Free Massive MIMO Using Multi-agent Reinforcement Learning

    Authors: Enyu Shi, Jiayi Zhang, Ziheng Liu, Yiyang Zhu, Chau Yuen, Derrick Wing Kwan Ng, Marco Di Renzo, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (mMIMO) and reconfigurable intelligent surface (RIS) are two advanced transceiver technologies for realizing future sixth-generation (6G) networks. In this paper, we investigate the joint precoding and access point (AP) selection for energy efficient RIS-aided CF mMIMO system. To address the associated computational complexity and communication… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

  35. arXiv:2410.12246  [pdf, other

    cs.IT

    Transmission Scheduling of Millimeter Wave Communication for High-Speed Railway in Space-Air-Ground Integrated Network

    Authors: Lei Liu, Bo Ai, Yong Niu, Zhu Han, Ning Wang, Lei Xiong, Ruisi He

    Abstract: The space-air-ground integrated network (SAGIN) greatly improves coverage and reliability for millimeter-wave (mmWave) communication in high-speed railway (HSR) scenarios. However, a significant challenge arises in the transmission scheduling due to the rapid changes in channel state, link selection for train mobile relays (MRs), and order of the flow scheduling. To tackle this challenge, we intro… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 16 pages, 15 figures, IEEE Transactions on Vehicular Technology

  36. arXiv:2410.06506  [pdf, other

    cs.IT eess.SP

    Cooperative Multi-Target Positioning for Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning

    Authors: Ziheng Liu, Jiayi Zhang, Enyu Shi, Yiyang Zhu, Derrick Wing Kwan Ng, Bo Ai

    Abstract: Cell-free massive multiple-input multiple-output (mMIMO) is a promising technology to empower next-generation mobile communication networks. In this paper, to address the computational complexity associated with conventional fingerprint positioning, we consider a novel cooperative positioning architecture that involves certain relevant access points (APs) to establish positioning similarity coeffi… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  37. arXiv:2410.04871  [pdf, other

    cs.IT eess.SP

    Distributed Collaborative User Positioning for Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning

    Authors: Ziheng Liu, Jiayi Zhang, Enyu Shi, Yiyang Zhu, Derrick Wing Kwan Ng, Bo Ai

    Abstract: In this paper, we investigate a cell-free massive multiple-input multiple-output system, which exhibits great potential in enhancing the capabilities of next-generation mobile communication networks. We first study the distributed positioning problem to lay the groundwork for solving resource allocation and interference management issues. Instead of relying on computationally and spatially complex… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  38. arXiv:2409.14702  [pdf, ps, other

    cs.IT eess.SP

    Rate-Splitting for Cell-Free Massive MIMO: Performance Analysis and Generative AI Approach

    Authors: Jiakang Zheng, Jiayi Zhang, Hongyang Du, Ruichen Zhang, Dusit Niyato, Octavia A. Dobre, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multipleoutput (MIMO) provides a ubiquitous coverage to user equipments (UEs) but it is also susceptible to interference. Ratesplitting (RS) effectively extracts data by decoding interference, yet its effectiveness is limited by the weakest UE. In this paper, we investigate an RS-based CF massive MIMO system, which combines strengths and mitigates weaknesses o… ▽ More

    Submitted 24 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 15 pages, 9 figures, Accepted in IEEE Transactions on Communications

  39. arXiv:2409.12870  [pdf, ps, other

    cs.IT eess.SP

    Joint AP-UE Association and Precoding for SIM-Aided Cell-Free Massive MIMO Systems

    Authors: Enyu Shi, Jiayi Zhang, Jiancheng An, Guangyang Zhang, Ziheng Liu, Chau Yuen, Bo Ai

    Abstract: Cell-free (CF) massive multiple-input multiple-output (mMIMO) systems are emerging as promising alternatives to cellular networks, especially in ultra-dense environments. However, further capacity enhancement requires the deployment of more access points (APs), which will lead to high costs and high energy consumption. To address this issue, in this paper, we explore the integration of low-power,… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  40. arXiv:2409.12851  [pdf, ps, other

    cs.IT eess.SP

    Harnessing Stacked Intelligent Metasurface for Enhanced Cell-Free Massive MIMO Systems: A Low-Power and Cost Approach

    Authors: Enyu Shi, Jiayi Zhang, Yiyang Zhu, Jiancheng An, Chau Yuen, Bo Ai

    Abstract: In this paper, we explore the integration of low-power, low-cost stacked intelligent metasurfaces (SIM) into cell-free (CF) massive multiple-input multiple-output (mMIMO) systems to enhance access point (AP) capabilities and address high power consumption and cost challenges. Specifically, we investigate the uplink performance of a SIM-enhanced CF mMIMO system and propose a novel system framework.… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  41. arXiv:2409.06946  [pdf, other

    cs.IT eess.SP

    Refracting Reconfigurable Intelligent Surface Assisted URLLC for Millimeter Wave High-Speed Train Communication Coverage Enhancement

    Authors: Changzhu Liu, Ruisi He, Yong Niu, Shiwen Mao, Bo Ai, Ruifeng Chen

    Abstract: High-speed train (HST) has garnered significant attention from both academia and industry due to the rapid development of railways worldwide. Millimeter wave (mmWave) communication, known for its large bandwidth is an effective way to address performance bottlenecks in cellular network based HST wireless communication systems. However, mmWave signals suffer from significant path loss when traversi… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 11 figures, accepted by IEEE Transactions on Vehicular Technology

  42. arXiv:2408.15903  [pdf, other

    cs.CL

    LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments

    Authors: Ruirui Chen, Weifeng Jiang, Chengwei Qin, Ishaan Singh Rawal, Cheston Tan, Dongkyu Choi, Bo Xiong, Bo Ai

    Abstract: The important challenge of keeping knowledge in Large Language Models (LLMs) up-to-date has led to the development of various methods for incorporating new facts. However, existing methods for such knowledge editing still face difficulties with multi-hop questions that require accurate fact identification and sequential logical reasoning, particularly among numerous fact updates. To tackle these c… ▽ More

    Submitted 4 December, 2024; v1 submitted 28 August, 2024; originally announced August 2024.

  43. arXiv:2408.05517  [pdf, other

    cs.CL

    SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning

    Authors: Yuze Zhao, Jintao Huang, Jinghan Hu, Xingjun Wang, Yunlin Mao, Daoze Zhang, Hong Zhang, Zeyinzi Jiang, Zhikai Wu, Baole Ai, Ang Wang, Wenmeng Zhou, Yingda Chen

    Abstract: Recent development in Large Language Models (LLMs) and Multi-modal Large Language Models (MLLMs) have leverage Attention-based Transformer architectures and achieved superior performance and generalization capabilities. They have since covered extensive areas of traditional learning tasks. For instance, text-based tasks such as text-classification and sequence-labeling, as well as multi-modal task… ▽ More

    Submitted 19 May, 2025; v1 submitted 10 August, 2024; originally announced August 2024.

  44. Optimal Bilinear Equalizer for Cell-Free Massive MIMO Systems over Correlated Rician Channels

    Authors: Zhe Wang, Jiayi Zhang, Emil Björnson, Dusit Niyato, Bo Ai

    Abstract: In this paper, we explore the low-complexity optimal bilinear equalizer (OBE) combining scheme design for cell-free massive multiple-input multiple-output networks with spatially correlated Rician fading channels. We provide a spectral efficiency (SE) performance analysis framework for both the centralized and distributed processing schemes with bilinear equalizer (BE)-structure combining schemes… ▽ More

    Submitted 2 March, 2025; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 16 pages, 10 figures. This paper has been accepted by IEEE Transactions on Signal Processing

  45. arXiv:2407.18468  [pdf, ps, other

    cs.LG cs.AI

    Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints

    Authors: Lei Guo, Wei Chen, Yuxuan Sun, Bo Ai, Nikolaos Pappas, Tony Q. S. Quek

    Abstract: Diffusion models have been extensively utilized in AI-generated content (AIGC) in recent years, thanks to the superior generation capabilities. Combining with semantic communications, diffusion models are used for tasks such as denoising, data reconstruction, and content generation. However, existing diffusion-based generative models do not consider the stringent bandwidth limitation, which limits… ▽ More

    Submitted 9 July, 2025; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: accepted to IEEE for possible publication

  46. arXiv:2407.10147  [pdf, ps, other

    eess.SP cs.IT

    Near-Field User Localization and Channel Estimation for XL-MIMO Systems: Fundamentals, Recent Advances, and Outlooks

    Authors: Hao Lei, Jiayi Zhang, Zhe Wang, Huahua Xiao, Bo Ai, Emil Björnson

    Abstract: Extremely large-scale multiple-input multipleoutput (XL-MIMO) is believed to be a cornerstone of sixth-generation (6G) wireless networks. XL-MIMO uses more antennas to both achieve unprecedented spatial degrees of freedom (DoFs) and exploit new electromagnetic (EM) phenomena occurring in the radiative near-field. The near-field effects provide the XL-MIMO array with depth perception, enabling prec… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures, 2tables, submitted to IEEE WCM

  47. arXiv:2407.04336  [pdf, ps, other

    eess.SP cs.AI

    AI-Driven Mobility Management for High-Speed Railway Communications: Compressed Measurements and Proactive Handover

    Authors: Wen Li, Wei Chen, Shiyue Wang, Yuanyuan Zhang, Michail Matthaiou, Bo Ai

    Abstract: High-speed railway (HSR) communications are pivotal for ensuring rail safety, operations, maintenance, and delivering passenger information services. The high speed of trains creates rapidly time-varying wireless channels, increases the signaling overhead, and reduces the system throughput, making it difficult to meet the growing and stringent needs of HSR applications. In this article, we explore… ▽ More

    Submitted 5 July, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

  48. arXiv:2407.03122  [pdf, other

    cs.RO

    IntentionNet: Map-Lite Visual Navigation at the Kilometre Scale

    Authors: Wei Gao, Bo Ai, Joel Loo, Vinay, David Hsu

    Abstract: This work explores the challenges of creating a scalable and robust robot navigation system that can traverse both indoor and outdoor environments to reach distant goals. We propose a navigation system architecture called IntentionNet that employs a monolithic neural network as the low-level planner/controller, and uses a general interface that we call intentions to steer the controller. The paper… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  49. arXiv:2407.01418  [pdf, other

    cs.RO cs.AI cs.LG

    RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing

    Authors: Bo Ai, Stephen Tian, Haochen Shi, Yixuan Wang, Cheston Tan, Yunzhu Li, Jiajun Wu

    Abstract: Tactile feedback is critical for understanding the dynamics of both rigid and deformable objects in many manipulation tasks, such as non-prehensile manipulation and dense packing. We introduce an approach that combines visual and tactile sensing for robotic manipulation by learning a neural, tactile-informed dynamics model. Our proposed framework, RoboPack, employs a recurrent graph neural network… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Robotics: Science and Systems (RSS), 2024. Project page: https://robo-pack.github.io/

    ACM Class: I.2.9; I.2.6; I.2.10

  50. arXiv:2406.18538  [pdf, other

    cs.CV cs.AI eess.IV

    VideoQA-SC: Adaptive Semantic Communication for Video Question Answering

    Authors: Jiangyuan Guo, Wei Chen, Yuxuan Sun, Jialong Xu, Bo Ai

    Abstract: Although semantic communication (SC) has shown its potential in efficiently transmitting multimodal data such as texts, speeches and images, SC for videos has focused primarily on pixel-level reconstruction. However, these SC systems may be suboptimal for downstream intelligent tasks. Moreover, SC systems without pixel-level video reconstruction present advantages by achieving higher bandwidth eff… ▽ More

    Submitted 11 February, 2025; v1 submitted 17 May, 2024; originally announced June 2024.