-
Prompting Wireless Networks: Reinforced In-Context Learning for Power Control
Authors:
Hao Zhou,
Chengming Hu,
Dun Yuan,
Ye Yuan,
Di Wu,
Xue Liu,
Jianzhong Zhang
Abstract:
To manage and optimize constantly evolving wireless networks, existing machine learning (ML)-based studies operate as black-box models, leading to increased computational costs during training and a lack of transparency in decision-making, which limits their practical applicability in wireless networks. Motivated by recent advancements in large language model (LLM)-enabled wireless networks, this paper proposes ProWin, a novel framework that leverages reinforced in-context learning to design task-specific demonstration Prompts for Wireless Network optimization, relying on the inference capabilities of LLMs without the need for dedicated model training or fine-tuning. The task-specific prompts are designed to incorporate natural language descriptions of the task and its formulation, enhancing interpretability and eliminating the need for specialized expertise in network optimization. We further propose a reinforced in-context learning scheme that incorporates a set of advisable examples into task-specific prompts, wherein informative examples capturing historical environment states and decisions are adaptively selected to guide current decision-making. Evaluations on a case study of base station power control show that the proposed ProWin outperforms reinforcement learning (RL)-based methods, highlighting its potential for next-generation wireless network optimization.
Submitted 6 June, 2025;
originally announced June 2025.
-
Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management
Authors:
Yuyan Lin,
Hao Zhou,
Chengming Hu,
Xue Liu,
Hao Chen,
Yan Xin,
Jianzhong Zhang
Abstract:
6G networks have become increasingly complicated due to novel network architecture and newly emerging signal processing and transmission techniques, leading to significant burdens to 6G network management. Large language models (LLMs) have recently been considered a promising technique to equip 6G networks with AI-native intelligence. Different from most existing studies that only consider a single LLM, this work involves a multi-LLM debate-based scheme for 6G network management, where multiple LLMs can collaboratively improve the initial solution sequentially. Considering the complex nature of 6G domain, we propose a novel hierarchical debate scheme: LLMs will first debate the sub-task decomposition, and then debate each subtask step-by-step. Such a hierarchical approach can significantly reduce the overall debate difficulty by sub-task decomposition, aligning well with the complex nature of 6G networks and ensuring the final solution qualities. In addition, to better evaluate the proposed technique, we have defined a novel dataset named 6GPlan, including 110 complex 6G network management tasks and 5000 keyword solutions. Finally, the experiments show that the proposed hierarchical debate can significantly improve performance compared to baseline techniques, e.g. more than 30% coverage rate and global recall rate improvement.
Submitted 6 June, 2025;
originally announced June 2025.
-
Understanding 6G through Language Models: A Case Study on LLM-aided Structured Entity Extraction in Telecom Domain
Authors:
Ye Yuan,
Haolun Wu,
Hao Zhou,
Xue Liu,
Hao Chen,
Yan Xin,
Jianzhong Zhang
Abstract:
Knowledge understanding is a foundational part of envisioned 6G networks to advance network intelligence and AI-native network architectures. In this paradigm, information extraction plays a pivotal role in transforming fragmented telecom knowledge into well-structured formats, empowering diverse AI models to better understand network terminologies. This work proposes a novel language model-based information extraction technique, aiming to extract structured entities from the telecom context. The proposed telecom structured entity extraction (TeleSEE) technique applies a token-efficient representation method to predict entity types and attribute keys, aiming to save the number of output tokens and improve prediction accuracy. Meanwhile, TeleSEE involves a hierarchical parallel decoding method, improving the standard encoder-decoder architecture by integrating additional prompting and decoding strategies into entity extraction tasks. In addition, to better evaluate the performance of the proposed technique in the telecom domain, we further designed a dataset named 6GTech, including 2390 sentences and 23747 words from more than 100 6G-related technical publications. Finally, the experiment shows that the proposed TeleSEE method achieves higher accuracy than other baseline techniques, and also presents 5 to 9 times higher sample processing speed.
Submitted 20 May, 2025;
originally announced May 2025.
-
An Arbitrary-Modal Fusion Network for Volumetric Cranial Nerves Tract Segmentation
Authors:
Lei Xie,
Huajun Zhou,
Junxiong Huang,
Jiahao Huang,
Qingrun Zeng,
Jianzhong He,
Jiawei Zhang,
Baohua Fan,
Mingchu Li,
Guoqiang Xie,
Hao Chen,
Yuanjing Feng
Abstract:
The segmentation of cranial nerves (CNs) tract provides a valuable quantitative tool for the analysis of the morphology and trajectory of individual CNs. Multimodal CNs tract segmentation networks, e.g., CNTSeg, which combine structural Magnetic Resonance Imaging (MRI) and diffusion MRI, have achieved promising segmentation performance. However, it is laborious or even infeasible to collect complete multimodal data in clinical practice due to limitations in equipment, user privacy, and working conditions. In this work, we propose a novel arbitrary-modal fusion network for volumetric CNs tract segmentation, called CNTSeg-v2, which trains one model to handle different combinations of available modalities. Instead of directly combining all the modalities, we select T1-weighted (T1w) images as the primary modality, because they are simple to acquire and contribute most to the results, and use them to supervise the information selection of the other auxiliary modalities. Our model encompasses an Arbitrary-Modal Collaboration Module (ACM) designed to effectively extract informative features from the auxiliary modalities, guided by the supervision of T1w images. Meanwhile, we construct a Deep Distance-guided Multi-stage (DDM) decoder to correct small errors and discontinuities through signed distance maps to improve segmentation accuracy. We evaluate our CNTSeg-v2 on the Human Connectome Project (HCP) dataset and the clinical Multi-shell Diffusion MRI (MDM) dataset. Extensive experimental results show that our CNTSeg-v2 achieves state-of-the-art segmentation performance, outperforming all competing methods.
Submitted 5 May, 2025;
originally announced May 2025.
-
Low Complexity Frequency Domain Nonlinear Self-Interference Cancellation for Flexible Duplex
Authors:
Yonghwi Kim,
Kai-Kit Wong,
Jianzhong Zhang,
Chan-Byoung Chae
Abstract:
Nonlinear self-interference (SI) cancellation is essential for mitigating the impact of transmitter-side nonlinearity on overall SI cancellation performance in flexible duplex systems, including in-band full-duplex (IBFD) and sub-band full-duplex (SBFD). Digital SI cancellation (SIC) must address the nonlinearity in the power amplifier (PA) and the in-phase/quadrature-phase (IQ) imbalance from up/down converters at the base station (BS), in addition to analog SIC. In environments with rich signal reflection paths, however, the required number of delayed taps for time-domain nonlinear SI cancellation increases exponentially with the number of multipaths, leading to excessive complexity. This paper introduces a novel, low-complexity, frequency domain nonlinear SIC, suitable for flexible duplex systems with multiple-input and multiple-output (MIMO) configurations. The key approach involves decomposing nonlinear SI into a nonlinear basis and categorizing them based on their effectiveness across any flexible duplex setting. The proposed algorithm is founded on our analytical results of intermodulation distortion (IMD) in the frequency domain and utilizes a specialized pilot sequence. This algorithm is directly applicable to orthogonal frequency division multiplexing (OFDM) multi-carrier systems and offers lower complexity than conventional digital SIC methods. Additionally, we assess the impact of the proposed SIC on flexible duplex systems through system-level simulation (SLS) using 3D ray-tracing and proof-of-concept (PoC) measurement.
Submitted 3 March, 2025;
originally announced March 2025.
-
Beamforming with Joint Phase and Time Array: System Design, Prototyping and Performance
Authors:
Jianhua Mo,
Ahmad AlAmmouri,
Shenggang Dong,
Younghan Nam,
Won-Suk Choi,
Gary Xu,
Jianzhong,
Zhan
Abstract:
Joint phase-time arrays (JPTA) is a new mmWave radio frequency front-end architecture constructed by appending time-delay elements to phase shifters for analog beamforming. JPTA allows the mmWave base station (BS) to form multiple frequency-dependent beams with a single RF chain, exploiting the extra degrees of freedom the time-delay elements offer. Without requiring extra power-hungry RF chains, a BS with JPTA can schedule multiple users in different directions in a frequency-division multiplexing (FDM) manner. A BS with JPTA thus offers several advantages over a traditional analog beamforming system. Simulation results show that JPTA can bring significant system-level benefits, e.g., extending uplink throughput coverage by 100%. To realize these system benefits of JPTA, high-resolution delay elements with a wide delay dynamic range are essential. With newly developed delay elements, we demonstrate that a single TRX RF chain can serve four users in four different directions in the mmWave band.
Submitted 31 January, 2025;
originally announced February 2025.
-
Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset
Authors:
Shanshan Wang,
Shoujun Yu,
Jian Cheng,
Sen Jia,
Changjun Tie,
Jiayu Zhu,
Haohao Peng,
Yijing Dong,
Jianzhong He,
Fan Zhang,
Yaowen Xing,
Xiuqin Jia,
Qi Yang,
Qiyuan Tian,
Hua Guo,
Guobin Li,
Hairong Zheng
Abstract:
Diffusion magnetic resonance imaging (dMRI) provides critical insights into the microstructural and connectional organization of the human brain. However, the availability of high-field, open-access datasets that include raw k-space data for advanced research remains limited. To address this gap, we introduce Diff5T, a first comprehensive 5.0 Tesla diffusion MRI dataset focusing on the human brain. This dataset includes raw k-space data and reconstructed diffusion images, acquired using a variety of imaging protocols. Diff5T is designed to support the development and benchmarking of innovative methods in artifact correction, image reconstruction, image preprocessing, diffusion modelling and tractography. The dataset features a wide range of diffusion parameters, including multiple b-values and gradient directions, allowing extensive research applications in studying human brain microstructure and connectivity. With its emphasis on open accessibility and detailed benchmarks, Diff5T serves as a valuable resource for advancing human brain mapping research using diffusion MRI, fostering reproducibility, and enabling collaboration across the neuroscience and medical imaging communities.
Submitted 9 December, 2024;
originally announced December 2024.
-
A Two-Stage AI-Powered Motif Mining Method for Efficient Power System Topological Analysis
Authors:
Yiyan Li,
Zhenghao Zhou,
Jian Ping,
Xiaoyuan Xu,
Zheng Yan,
Jianzhong Wu
Abstract:
Graph motif, defined as the microstructure that appears repeatedly in a large graph, reveals important topological characteristics of the large graph and has gained increasing attention in power system analysis regarding reliability, vulnerability and resiliency. However, searching motifs within the large-scale power system is extremely computationally challenging and even infeasible, which undermines the value of motif analysis in practice. In this paper, we introduce a two-stage AI-powered motif mining method to enable efficient and wide-range motif analysis in power systems. In the first stage, a representation learning method with specially designed network structure and loss function is proposed to achieve ordered embedding for the power system topology, simplifying the subgraph isomorphic problem into a vector comparison problem. In the second stage, under the guidance of the ordered embedding space, a greedy-search-based motif growing algorithm is introduced to quickly obtain the motifs without traversal searching. A case study based on a power system database including 61 circuit models demonstrates the effectiveness of the proposed method.
Submitted 8 December, 2024;
originally announced December 2024.
-
Joint Phase Time Array: Opportunities, Challenges and System Design Considerations
Authors:
Young-Han Nam,
Ahmad AlAmmouri,
Jianhua Mo,
Jianzhong Charlie Zhang
Abstract:
This paper presents a novel approach to designing millimeter-wave (mmWave) cellular communication systems, based on joint phase time array (JPTA) radio frequency (RF) frontend architecture. JPTA architecture comprises time-delay components appended to conventional phase shifters, which offer extra degrees of freedom to be exploited for designing frequency-selective analog beams. Hence, a mmWave device equipped with JPTA can receive and transmit signals in multiple directions in a single time slot per RF chain, one direction per frequency subband, which alleviates the traditional constraint of one analog beam per transceiver chain per time slot. The utilization of subband-specific analog beams offers a new opportunity in designing mmWave systems, allowing for enhanced cell capacity and reduced pilot overhead. To understand the practical feasibility of JPTA, a few challenges and system design considerations are discussed in relation to the performance and complexity of the JPTA systems. For example, frequency-selective beam gain losses are present for the subband analog beams, e.g., up to 1 dB losses for 2 subband cases, even with the state-of-the-art JPTA delay and phase optimization methods. Despite these side effects, system-level analysis reveals that the JPTA system is capable of improving cell capacity, raising 5th-percentile cell throughput by up to 65 percent. To the best of the authors' knowledge, this paper is the first to explain the system-level benefits and system-design challenges of JPTA, with an analysis of the performance tradeoff based on an intuitive metric of beam gain losses.
Submitted 24 March, 2025; v1 submitted 2 December, 2024;
originally announced December 2024.
-
Revised Optimal design of power electronic transformer based on hybrid MMC under over-modulation operation
Authors:
Yaqian Zhang,
Xudong Zhang,
Jianzhong Zhang,
Fujin Deng
Abstract:
The bridge arm of the hybrid modular multilevel converter (MMC) is composed of half-bridge and full-bridge sub-modules cascaded together. Compared with the half-bridge MMC, it can operate in the boost-AC mode, where the modulation index can be higher than 1, and the DC voltage and the AC voltage level are no longer mutually constrained; compared with the full-bridge MMC, it has lower switching device costs and losses. When the hybrid MMC boost-AC mode is used in the power electronic transformer, the degree of freedom in system design is improved, and the cost and volume of the power electronic transformer system can be further reduced. This paper analyzes how to make full use of the newly added modulation index of freedom introduced by the boost-AC hybrid MMC to optimize the power electronic transformer system, and finally gives the optimal modulation index selection scheme of the hybrid MMC for different optimization objectives.
Submitted 27 May, 2024;
originally announced May 2024.
-
Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI
Authors:
Yirong Zhou,
Chengyan Wang,
Mengtian Lu,
Kunyuan Guo,
Zi Wang,
Dan Ruan,
Rui Guo,
Peijun Zhao,
Jianhua Wang,
Naiming Wu,
Jianzhong Lin,
Yinyin Chen,
Hang Jin,
Lianxin Xie,
Lilan Wu,
Liuhong Zhu,
Jianjun Zhou,
Congbo Cai,
He Wang,
Xiaobo Qu
Abstract:
In cardiac Magnetic Resonance Imaging (MRI) analysis, simultaneous myocardial segmentation and T2 quantification are crucial for assessing myocardial pathologies. Existing methods often address these tasks separately, limiting their synergistic potential. To address this, we propose SQNet, a dual-task network integrating Transformer and Convolutional Neural Network (CNN) components. SQNet features a T2-refine fusion decoder for quantitative analysis, leveraging global features from the Transformer, and a segmentation decoder with multiple local region supervision for enhanced accuracy. A tight coupling module aligns and fuses CNN and Transformer branch features, enabling SQNet to focus on myocardium regions. Evaluation on healthy controls (HC) and acute myocardial infarction patients (AMI) demonstrates superior segmentation dice scores (89.3/89.2) compared to state-of-the-art methods (87.7/87.9). T2 quantification yields strong linear correlations (Pearson coefficients: 0.84/0.93) with label values for HC/AMI, indicating accurate mapping. Radiologist evaluations confirm SQNet's superior image quality scores (4.60/4.58 for segmentation, 4.32/4.42 for T2 quantification) over state-of-the-art methods (4.50/4.44 for segmentation, 3.59/4.37 for T2 quantification). SQNet thus offers accurate simultaneous segmentation and quantification, enhancing cardiac disease diagnosis, such as AMI.
Submitted 29 May, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
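Illustrative note on the Dice scores quoted in the abstract above: a minimal sketch, assuming binary segmentation masks flattened into Python lists of 0/1 values. The function name `dice_score` and the convention of returning 1.0 when both masks are empty are assumptions made for illustration; they are not taken from the SQNet paper.

```python
def dice_score(pred, target):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flat binary masks."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    # Count pixels that are foreground in both masks.
    overlap = sum(1 for p, t in zip(pred, target) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * overlap / total


# Hypothetical 2x2 masks, flattened row by row.
predicted = [1, 1, 0, 0]
reference = [1, 0, 0, 0]
score = dice_score(predicted, reference)
```

A score of 1.0 means the predicted and reference masks coincide, and 0.0 means they share no foreground pixel. Figures such as 89.3 in the abstract presumably express the same quantity as a percentage.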
-
Integrated Monostatic Sensing and Full-Duplex Multiuser Communication for mmWave Systems
Authors:
Murat Bayraktar,
Nuria González-Prelcic,
Mikko Valkama,
Hao Chen,
Charlie Jianzhong Zhang
Abstract:
In this paper, we propose a hybrid precoding/combining framework for communication-centric integrated sensing and full-duplex (FD) communication operating at mmWave bands. The designed precoders and combiners enable multiuser (MU) FD communication while simultaneously supporting monostatic sensing in a frequency-selective setting. The joint design of precoders and combiners involves the mitigation of self-interference (SI) caused by simultaneous transmission and reception at the FD base station (BS). Additionally, MU interference needs to be handled by the precoder/combiner design. The resulting optimization problem involves non-convex constraints since hybrid analog/digital architectures utilize networks of phase shifters. To solve the proposed problem, we separate the optimization of each precoder/combiner, and design each one of them while fixing the others. The precoders at the FD BS are designed by reformulating the communication and sensing constraints as signal-to-leakage-plus-noise ratio (SLNR) maximization problems that consider SI and MU interference as leakage. Furthermore, we design the frequency-flat analog combiner such that the residual SI at the FD BS is minimized under communication and sensing gain constraints. Finally, we design an interference-aware digital combining stage that separates MU signals and target reflections. The communication performance and sensing results show that the proposed framework efficiently supports both functionalities simultaneously.
Submitted 15 May, 2024;
originally announced May 2024.
-
Learned Pulse Shaping Design for PAPR Reduction in DFT-s-OFDM
Authors:
Fabrizio Carpi,
Soheil Rostami,
Joonyoung Cho,
Siddharth Garg,
Elza Erkip,
Charlie Jianzhong Zhang
Abstract:
High peak-to-average power ratio (PAPR) is one of the main factors limiting cell coverage for cellular systems, especially in the uplink direction. Discrete Fourier transform spread orthogonal frequency-domain multiplexing (DFT-s-OFDM) with spectrally-extended frequency-domain spectrum shaping (FDSS) is one of the efficient techniques deployed to lower the PAPR of the uplink waveforms. In this work, we propose a machine learning-based framework to determine the FDSS filter, optimizing a tradeoff between the symbol error rate (SER), the PAPR, and the spectral flatness requirements. Our end-to-end optimization framework considers multiple important design constraints, including the Nyquist zero-ISI (inter-symbol interference) condition. The numerical results show that learned FDSS filters lower the PAPR compared to conventional baselines, with minimal SER degradation. Tuning the parameters of the optimization also helps us understand the fundamental limitations and characteristics of the FDSS filters for PAPR reduction.
Submitted 24 April, 2024;
originally announced April 2024.
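Illustrative note on the PAPR metric named in the abstract above: a minimal sketch, assuming a complex baseband sequence held in a Python list. The helpers `papr_db` and `idft` and the two test signals are hypothetical and only show how the ratio is computed; they do not reproduce the learned FDSS filter design from the paper.

```python
import cmath
import math


def papr_db(samples):
    """Peak-to-average power ratio of a complex sequence, in dB."""
    powers = [abs(s) ** 2 for s in samples]
    mean_power = sum(powers) / len(powers)
    return 10.0 * math.log10(max(powers) / mean_power)


def idft(freq_bins):
    """Naive inverse DFT: frequency-domain symbols to time-domain samples."""
    n = len(freq_bins)
    return [
        sum(x * cmath.exp(2j * math.pi * k * t / n)
            for k, x in enumerate(freq_bins)) / n
        for t in range(n)
    ]


N = 64
# A single complex tone has a constant envelope, so peak equals average.
tone = [cmath.exp(2j * math.pi * 3 * t / N) for t in range(N)]
# Identical symbols on every subcarrier add coherently into one time-domain
# spike, the worst case for an N-subcarrier multicarrier signal.
spike = idft([1.0] * N)

tone_papr = papr_db(tone)
spike_papr = papr_db(spike)
```

The two signals bracket the range for this block length: the tone sits at 0 dB, while the coherent spike reaches 10·log10(N) dB. Shaping the spectrum before the inverse transform, as FDSS does, is one way to pull a waveform toward the lower end.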
-
3D Beamforming Through Joint Phase-Time Arrays
Authors:
Ozlem Yildiz,
Ahmad AlAmmouri,
Jianhua Mo,
Younghan Nam,
Elza Erkip,
Jianzhong Zhang
Abstract:
High-frequency wideband cellular communications over mmWave and sub-THz offer the opportunity for high data rates. However, they also present high path loss, resulting in limited coverage. High-gain beamforming from the antenna array is essential to mitigate the coverage limitations. The conventional phased antenna arrays (PAA) cause high scheduling latency owing to analog beam constraints, i.e., only one frequency-flat beam is generated. The recently introduced joint phase-time array (JPTA) architecture, which utilizes both true-time-delay (TTD) units and phase shifters (PSs), alleviates analog beam constraints by creating multiple frequency-dependent beams for scheduling multiple users in different directions in a frequency-division manner. One class of previous studies offered solutions with "rainbow" beams, which tend to allocate a small bandwidth per beam direction. Another class focused on uniform linear array (ULA) antenna architecture, whose frequency-dependent beams were designed along a single axis of either azimuth or elevation direction. This paper presents a novel 3D beamforming design that maximizes beamforming gain toward desired azimuth and elevation directions and across sub-bands partitioned according to scheduled users' bandwidth requirements. We provide analytical solutions and iterative algorithms to design the PSs and TTD units for a desired subband beam pattern. Through simulations of the beamforming gain, we observe that our proposed solutions outperform the state-of-the-art solutions reported elsewhere.
Submitted 13 August, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
Joint Phase-Time Arrays: A Paradigm for Frequency-Dependent Analog Beamforming in 6G
Authors:
Vishnu V. Ratnam,
Jianhua Mo,
Ahmad AlAmmouri,
Boon L. Ng,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
Hybrid beamforming is an attractive solution to build cost-effective and energy-efficient transceivers for millimeter-wave and terahertz systems. However, conventional hybrid beamforming techniques rely on analog components that generate a frequency flat response such as phase-shifters and switches, which limits the flexibility of the achievable beam patterns. As a novel alternative, this paper proposes a new class of hybrid beamforming called Joint phase-time arrays (JPTA), that additionally use true-time delay elements in the analog beamforming to create frequency-dependent analog beams. Using as an example two important frequency-dependent beam behaviors, the numerous benefits of such flexibility are exemplified. Subsequently, the JPTA beamformer design problem to generate any desired beam behavior is formulated and near-optimal algorithms to the problem are proposed. Simulations show that the proposed algorithms can outperform heuristics solutions for JPTA beamformer update. Furthermore, it is shown that JPTA can achieve the two exemplified beam behaviors with one radio-frequency chain, while conventional hybrid beamforming requires the radio-frequency chains to scale with the number of antennas to achieve similar performance. Finally, a wide range of problems to further tap into the potential of JPTA are also listed as future directions.
Submitted 18 December, 2023;
originally announced December 2023.
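Illustrative note on the frequency-dependent beams described in the abstract above: a minimal sketch, assuming a uniform linear array in which element n applies a phase shift n·phi and a true-time delay n·tau. Solving two linear equations for `phi` and `tau` steers the beam to one angle at frequency `f1` and to another at `f2`. This linear-ramp design, the function names, and the numeric parameters are assumptions chosen for illustration. It is not the near-optimal algorithm proposed in the paper, and it ignores hardware limits such as delay range and the need for non-negative delays.

```python
import cmath
import math

C = 299_792_458.0  # speed of light in m/s


def linear_jpta_design(f1, theta1, f2, theta2, spacing):
    """Per-element phase step and delay step that point the beam at
    theta1 for frequency f1 and at theta2 for frequency f2."""
    tau = (spacing / C) * (f2 * math.sin(theta2) - f1 * math.sin(theta1)) / (f2 - f1)
    phi = 2.0 * math.pi * f1 * (spacing * math.sin(theta1) / C - tau)
    return phi, tau


def array_gain(freq, theta, phi, tau, n_elements, spacing):
    """Normalized array gain (0 to 1) toward angle theta at frequency freq."""
    total = 0j
    for n in range(n_elements):
        # Propagation phase across the array for a plane wave at angle theta.
        steer = 2.0 * math.pi * freq * n * spacing * math.sin(theta) / C
        # Phase applied by the phase shifter plus the time-delay element.
        applied = n * phi + 2.0 * math.pi * freq * n * tau
        total += cmath.exp(1j * (steer - applied))
    return abs(total) / n_elements


# Hypothetical setup: 16 elements, half-wavelength spacing at 28 GHz.
fc = 28e9
spacing = C / (2.0 * fc)
f_low, f_high = 27.8e9, 28.2e9
angle_low, angle_high = math.radians(-20.0), math.radians(30.0)

phi, tau = linear_jpta_design(f_low, angle_low, f_high, angle_high, spacing)
gain_low = array_gain(f_low, angle_low, phi, tau, 16, spacing)
gain_high = array_gain(f_high, angle_high, phi, tau, 16, spacing)
gain_cross = array_gain(f_low, angle_high, phi, tau, 16, spacing)
```

With these settings the array reaches full gain toward each target angle at its own frequency, while the low-frequency gain toward the other angle is small. A phase-only array (tau fixed at 0) has a single free parameter per element step and cannot satisfy both steering conditions at once.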
-
FlexDTI: Flexible diffusion gradient encoding scheme-based highly efficient diffusion tensor imaging using deep learning
Authors:
Zejun Wu,
Jiechao Wang,
Zunquan Chen,
Qinqin Yang,
Zhen Xing,
Dairong Cao,
Jianfeng Bao,
Taishan Kang,
Jianzhong Lin,
Shuhui Cai,
Zhong Chen,
Congbo Cai
Abstract:
Objective: Most deep neural network-based diffusion tensor imaging methods require the diffusion gradients' number and directions in the data to be reconstructed to match those in the training data. This work aims to develop and evaluate a novel dynamic-convolution-based method called FlexDTI for highly efficient diffusion tensor reconstruction with flexible diffusion encoding gradient scheme. Approach: FlexDTI was developed to achieve high-quality DTI parametric mapping with flexible number and directions of diffusion encoding gradients. The method used dynamic convolution kernels to embed diffusion gradient direction information into feature maps of the corresponding diffusion signal. Furthermore, it realized the generalization of a flexible number of diffusion gradient directions by setting the maximum number of input channels of the network. The network was trained and tested using datasets from the Human Connectome Project and local hospitals. Results from FlexDTI and other advanced tensor parameter estimation methods were compared. Main results: Compared to other methods, FlexDTI successfully achieves high-quality diffusion tensor-derived parameters even if the number and directions of diffusion encoding gradients change. It reduces normalized root mean squared error (NRMSE) by about 50% on fractional anisotropy (FA) and 15% on mean diffusivity (MD), compared with the state-of-the-art deep learning method with flexible diffusion encoding gradient scheme. Significance: FlexDTI can well learn diffusion gradient direction information to achieve generalized DTI reconstruction with flexible diffusion gradient scheme. Both flexibility and reconstruction quality can be taken into account in this network.
Submitted 21 December, 2023; v1 submitted 2 August, 2023;
originally announced August 2023.
-
One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction
Authors:
Zi Wang,
Xiaotong Yu,
Chengyan Wang,
Weibo Chen,
Jiazheng Wang,
Ying-Hua Chu,
Hongwei Sun,
Rushuai Li,
Peiyong Li,
Fan Yang,
Haiwei Han,
Taishan Kang,
Jianzhong Lin,
Chen Yang,
Shufu Chang,
Zhang Shi,
Sha Hua,
Yan Li,
Juan Hu,
Liuhong Zhu,
Jianjun Zhou,
Meijing Lin,
Jiefeng Guo,
Congbo Cai,
Zhong Chen
, et al. (3 additional authors not shown)
Abstract:
Magnetic resonance imaging (MRI) is a widely used radiological modality renowned for its radiation-free, comprehensive insights into the human body, facilitating medical diagnoses. However, the drawback of prolonged scan times hinders its accessibility. The k-space undersampling offers a solution, yet the resultant artifacts necessitate meticulous removal during image reconstruction. Although Deep Learning (DL) has proven effective for fast MRI image reconstruction, its broader applicability across various imaging scenarios has been constrained. Challenges include the high cost and privacy restrictions associated with acquiring large-scale, diverse training data, coupled with the inherent difficulty of addressing mismatches between training and target data in existing DL methodologies. Here, we present a novel Physics-Informed Synthetic data learning framework for Fast MRI, called PISF. PISF marks a breakthrough by enabling generalized DL for multi-scenario MRI reconstruction through a single trained model. Our approach separates the reconstruction of a 2D image into many 1D basic problems, commencing with 1D data synthesis to facilitate generalization. We demonstrate that training DL models on synthetic data, coupled with enhanced learning techniques, yields in vivo MRI reconstructions comparable to or surpassing those of models trained on matched realistic datasets, reducing the reliance on real-world MRI data by up to 96%. Additionally, PISF exhibits remarkable generalizability across multiple vendors and imaging centers. Its adaptability to diverse patient populations has been validated through evaluations by ten experienced medical professionals. PISF presents a feasible and cost-effective way to significantly boost the widespread adoption of DL in various fast MRI applications.
Submitted 28 February, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Optimal preprocessing of WiFi CSI for sensing applications
Authors:
Vishnu V. Ratnam,
Hao Chen,
Hao Hsuan Chang,
Abhishek Sehgal,
Jianzhong Zhang
Abstract:
Due to its ubiquitous and contact-free nature, the use of WiFi infrastructure for performing sensing tasks has tremendous potential. However, the channel state information (CSI) measured by a WiFi receiver suffers from errors in both its gain and phase, which can significantly hinder sensing tasks. By analyzing these errors from different WiFi receivers, a mathematical model for these gain and phase errors is developed in this work. Based on these models, several theoretically justified preprocessing algorithms for correcting such errors at a receiver and, thus, obtaining clean CSI are presented. Simulation results show that at typical system parameters, the developed algorithms for cleaning CSI can reduce noise by 40% and 200%, respectively, compared to baseline methods for gain correction and phase correction, without significantly impacting computational cost. The superiority of the proposed methods is also validated in a real-world test bed for respiration rate monitoring (an example sensing task), where they improve the estimation signal-to-noise ratio by 20% compared to baseline methods.
Submitted 21 May, 2024; v1 submitted 22 July, 2023;
originally announced July 2023.
-
Bundle-specific Tractogram Distribution Estimation Using Higher-order Streamline Differential Equation
Authors:
Yuanjing Feng,
Lei Xie,
Jingqiang Wang,
Qiyuan Tian,
Jianzhong He,
Qingrun Zeng,
Fei Gao
Abstract:
Tractography traces the peak directions extracted from the fiber orientation distribution (FOD), which suffer from ambiguous spatial correspondences between diffusion directions and fiber geometry, and is therefore prone to producing erroneous tracks while missing true positive connections. Peaks-based tractography methods reconstruct streamlines 'locally' in a 'single to single' manner, and thus lack global information about the trend of the whole fiber bundle. In this work, we propose a novel tractography method based on a bundle-specific tractogram distribution function by using a higher-order streamline differential equation, which reconstructs the streamline bundles in a 'cluster to cluster' manner. A unified framework for any higher-order streamline differential equation is presented to describe the fiber bundles with disjoint streamlines defined based on the diffusion tensor vector field. At the global level, the tractography process is simplified as the estimation of bundle-specific tractogram distribution (BTD) coefficients by minimizing the energy optimization model, and is used to characterize the relations between BTD and diffusion tensor vector under the prior guidance by introducing the tractogram bundle information to provide anatomic priors. Experiments are performed on simulated Hough, Sine, Circle data, ISMRM 2015 Tractography Challenge data, FiberCup data, and in vivo data from the Human Connectome Project (HCP) for qualitative and quantitative evaluation. The results demonstrate that our approach can reconstruct the complex global fiber bundles directly. BTD reduces the error deviation and accumulation at the local level and shows better results in reconstructing long-range, twisting, and large fanning tracts.
Submitted 17 August, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
CloudBrain-MRS: An Intelligent Cloud Computing Platform for in vivo Magnetic Resonance Spectroscopy Preprocessing, Quantification, and Analysis
Authors:
Xiaodie Chen,
Jiayu Li,
Dicheng Chen,
Yirong Zhou,
Zhangren Tu,
Meijin Lin,
Taishan Kang,
Jianzhong Lin,
Tao Gong,
Liuhong Zhu,
Jianjun Zhou,
Lin Ou-yang,
Jiefeng Guo,
Jiyang Dong,
Di Guo,
Xiaobo Qu
Abstract:
Magnetic resonance spectroscopy (MRS) is an important clinical imaging method for diagnosis of diseases. The MRS spectrum is used to observe the signal intensity of metabolites or further infer their concentrations. Although the magnetic resonance vendors commonly provide basic functions of spectra plots and metabolite quantification, the widespread clinical research of MRS is still limited due to the lack of easy-to-use processing software or platform. To address this issue, we have developed CloudBrain-MRS, a cloud-based online platform that provides powerful hardware and advanced algorithms. The platform can be accessed simply through a web browser, without installing any program on the user side. CloudBrain-MRS also integrates the classic LCModel and advanced artificial intelligence algorithms and supports batch preprocessing, quantification, and analysis of MRS data from different vendors. Additionally, the platform offers useful functions: 1) Automatic statistical analysis to find biomarkers for diseases; 2) Consistency verification between the classic and artificial intelligence quantification algorithms; 3) Colorful three-dimensional visualization for easy observation of individual metabolite spectrum. Finally, both healthy and mild cognitive impairment patient data are used to demonstrate the functions of the platform. To the best of our knowledge, this is the first cloud computing platform for in vivo MRS with artificial intelligence processing. We have shared our cloud platform at MRSHub, providing free access and service for two years. Please visit https://mrshub.org/software_all/#CloudBrain-MRS or https://csrc.xmu.edu.cn/CloudBrain.html.
Submitted 6 September, 2023; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation
Authors:
Yang Zhou,
Yongjian Wu,
Zihua Wang,
Bingzheng Wei,
Maode Lai,
Jianzhong Shou,
Yubo Fan,
Yan Xu
Abstract:
Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which are especially time-consuming and laborious to produce given the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is underexplored for nuclei instance segmentation. Compared with most existing methods using other weak annotations (scribble, point, etc.) for nuclei instance segmentation, our method is more labor-saving. The obstacle to using image-level annotations in nuclei instance segmentation is the lack of adequate location information, leading to severe nuclei omission or overlaps. In this paper, we propose a novel image-level weakly supervised method, called cyclic learning, to solve this problem. Cyclic learning comprises a front-end classification task and a back-end semi-supervised instance segmentation task to benefit from multi-task learning (MTL). We utilize a deep learning classifier with interpretability as the front-end to convert image-level labels to sets of high-confidence pseudo masks and establish a semi-supervised architecture as the back-end to conduct nuclei instance segmentation under the supervision of these pseudo masks. Most importantly, cyclic learning is designed to circularly share knowledge between the front-end classifier and the back-end semi-supervised part, which allows the whole system to fully extract the underlying information from image-level labels and converge to a better optimum. Experiments on three datasets demonstrate the good generality of our method, which outperforms other image-level weakly supervised methods for nuclei instance segmentation, and achieves comparable performance to fully-supervised methods.
Submitted 5 June, 2023;
originally announced June 2023.
-
CloudBrain-ReconAI: An Online Platform for MRI Reconstruction and Image Quality Evaluation
Authors:
Yirong Zhou,
Chen Qian,
Jiayu Li,
Zi Wang,
Yu Hu,
Biao Qu,
Liuhong Zhu,
Jianjun Zhou,
Taishan Kang,
Jianzhong Lin,
Qing Hong,
Jiyang Dong,
Di Guo,
Xiaobo Qu
Abstract:
Efficient collaboration between engineers and radiologists is important for image reconstruction algorithm development and image quality evaluation in magnetic resonance imaging (MRI). Here, we develop CloudBrain-ReconAI, an online cloud computing platform for algorithm deployment and fast, blind reader studies. This platform supports online image reconstruction using state-of-the-art artificial intelligence and compressed sensing algorithms with applications to fast imaging and high-resolution diffusion imaging. By visiting the website, radiologists can easily score and mark the images. Automatic statistical analysis is then provided. CloudBrain-ReconAI is now openly accessible at https://csrc.xmu.edu.cn/CloudBrain.html and will be continually improved to serve the MRI research community.
Submitted 22 September, 2024; v1 submitted 4 December, 2022;
originally announced December 2022.
-
A Faithful Deep Sensitivity Estimation for Accelerated Magnetic Resonance Imaging
Authors:
Zi Wang,
Haoming Fang,
Chen Qian,
Boxuan Shi,
Lijun Bao,
Liuhong Zhu,
Jianjun Zhou,
Wenping Wei,
Jianzhong Lin,
Di Guo,
Xiaobo Qu
Abstract:
Magnetic resonance imaging (MRI) is an essential diagnostic tool that suffers from prolonged scan time. To alleviate this limitation, advanced fast MRI technology has attracted extensive research interest. Deep learning has recently shown great potential in improving image quality and reconstruction speed. Faithful coil sensitivity estimation is vital for MRI reconstruction. However, most deep learning methods still rely on pre-estimated sensitivity maps and ignore their inaccuracy, resulting in significant quality degradation of reconstructed images. In this work, we propose a Joint Deep Sensitivity estimation and Image reconstruction network, called JDSI. During image artifact removal, it gradually provides more faithful sensitivity maps with high-frequency information, leading to improved image reconstructions. To understand the behavior of the network, the mutual promotion of sensitivity estimation and image reconstruction is revealed through the visualization of network intermediate results. Results on in vivo datasets and a radiologist reader study demonstrate that, for both calibration-based and calibrationless reconstruction, the proposed JDSI achieves state-of-the-art performance visually and quantitatively, especially when the acceleration factor is high. Additionally, JDSI shows good robustness to patients and autocalibration signals.
Submitted 24 December, 2023; v1 submitted 23 October, 2022;
originally announced October 2022.
-
Physics-informed Deep Diffusion MRI Reconstruction with Synthetic Data: Break Training Data Bottleneck in Artificial Intelligence
Authors:
Chen Qian,
Haoyu Zhang,
Yuncheng Gao,
Mingyang Han,
Zi Wang,
Dan Ruan,
Yu Shen,
Yaping Wu,
Yirong Zhou,
Chengyan Wang,
Boyu Jiang,
Ran Tao,
Zhigang Wu,
Jiazheng Wang,
Liuhong Zhu,
Yi Guo,
Taishan Kang,
Jianzhong Lin,
Tao Gong,
Chen Yang,
Guoqiang Fei,
Meijin Lin,
Di Guo,
Jianjun Zhou,
Meiyun Wang
, et al. (1 additional author not shown)
Abstract:
Diffusion magnetic resonance imaging (MRI) is the only imaging modality for non-invasive movement detection of in vivo water molecules, with significant clinical and research applications. Diffusion weighted imaging (DWI) MRI acquired by multi-shot techniques can achieve higher resolution, better signal-to-noise ratio, and lower geometric distortion than single-shot, but suffers from inter-shot motion-induced artifacts. These artifacts cannot be removed prospectively, leading to the absence of artifact-free training labels. Thus, the potential of deep learning in multi-shot DWI reconstruction remains largely untapped. To break the training data bottleneck, here, we propose a Physics-Informed Deep DWI reconstruction method (PIDD) to synthesize high-quality paired training data by leveraging the physical diffusion model (magnitude synthesis) and inter-shot motion-induced phase model (motion phase synthesis). The network is trained only once with 100,000 synthetic samples, achieving encouraging results on multiple realistic in vivo data reconstructions. Advantages over conventional methods include: (a) Better motion artifact suppression and reconstruction stability; (b) Outstanding generalization to multi-scenario reconstructions, including multi-resolution, multi-b-value, multi-under-sampling, multi-vendor, and multi-center; (c) Excellent clinical adaptability to patients with verifications by seven experienced doctors (p<0.001). In conclusion, PIDD presents a novel deep learning framework by exploiting the power of MRI physics, providing a cost-effective and explainable way to break the data bottleneck in deep learning medical imaging.
Submitted 3 May, 2025; v1 submitted 20 October, 2022;
originally announced October 2022.
-
A non-invasive fault location method for modular multilevel converters under light load conditions
Authors:
Yaqian Zhang,
Yi Zhang,
Frede Blaabjerg,
Jianzhong Zhang
Abstract:
This paper proposes a non-invasive fault location method for modular multilevel converters (MMC) considering light load conditions. The prior-art fault location methods of the MMC are often developed and verified under full load conditions. However, it is revealed that the faulty arm current will be suppressed to be unipolar when the open-circuit fault happens on the submodule switch under light load. This leads to the capacitor voltage of the healthy and faulty submodules rising or falling with the same variations, increasing the difficulty of fault location. The proposed approach of injecting the second-order circulating current will rebuild the bipolar arm current of the MMC and enlarge the capacitor voltage deviations between the healthy and faulty SMs. As a result, the fault location time is significantly shortened. The simulations are carried out to validate the effectiveness of the proposed approach, showing that the fault location time is reduced to 1/6 compared with the condition without second-order circulating current injection.
Submitted 16 August, 2022;
originally announced August 2022.
-
A Paired Phase and Magnitude Reconstruction for Advanced Diffusion-Weighted Imaging
Authors:
Chen Qian,
Zi Wang,
Xinlin Zhang,
Boxuan Shi,
Boyu Jiang,
Ran Tao,
Jing Li,
Yuwei Ge,
Taishan Kang,
Jianzhong Lin,
Di Guo,
Xiaobo Qu
Abstract:
Objective: Multi-shot interleaved echo planar imaging can obtain diffusion-weighted images (DWI) with high spatial resolution and low distortion, but suffers from ghost artifacts introduced by phase variations between shots. In this work, we aim to solve the challenging reconstructions under inter-shot motions and a low signal-to-noise ratio. Methods: An explicit phase model with paired phase and magnitude priors is proposed to regularize the reconstruction (PAIR). The former prior is derived from the smoothness of the shot phase and enforced with low-rankness in the k-space domain. The latter explores similar edges among multi-b-value and multi-direction DWI with weighted total variation in the image domain. Results: Extensive simulation and in vivo results show that PAIR can remove ghost artifacts very well under a high number of shots (8 shots) and significantly suppress the noise under the ultra-high b-value (4000 s/mm²). Conclusion: The explicit phase model PAIR with complementary priors performs well on challenging reconstructions under inter-shot motions and a low signal-to-noise ratio. Significance: PAIR has great potential in advanced clinical DWI applications and brain function research.
Submitted 8 December, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Multidimensional Orthogonal Matching Pursuit-based RIS-aided Joint Localization and Channel Estimation at mmWave
Authors:
Murat Bayraktar,
Joan Palacios,
Nuria González-Prelcic,
Charlie Jianzhong Zhang
Abstract:
RIS-aided millimeter wave wireless systems benefit from robustness to blockage and enhanced coverage. In this paper, we study the ability of RIS to also provide enhanced localization capabilities as a by-product of communication. We consider sparse reconstruction algorithms to obtain high-resolution channel estimates that are mapped to position information. In RIS-aided mmWave systems, the complexity of sparse recovery becomes a bottleneck, given the large number of elements of the RIS and the large communication arrays. We propose to exploit a multidimensional orthogonal matching pursuit strategy for compressive channel estimation in a RIS-aided millimeter wave system. We show how this algorithm, based on computing the projections on a set of independent dictionaries instead of a single large dictionary, enables high-accuracy channel estimation at reduced complexity. We also combine this strategy with a localization approach which does not rely on the absolute time of arrival of the LoS path. Localization results in a realistic 3D indoor scenario show that RIS-aided wireless systems can also benefit from a significant improvement in localization accuracy.
Submitted 24 March, 2022;
originally announced March 2022.
-
Beam Management with Orientation and RSRP using Deep Learning for Beyond 5G Systems
Authors:
Khuong N. Nguyen,
Anum Ali,
Jianhua Mo,
Boon Loong Ng,
Vutha Va,
Jianzhong Charlie Zhang
Abstract:
Beam management (BM), i.e., the process of finding and maintaining a suitable transmit and receive beam pair, can be challenging, particularly in highly dynamic scenarios. Side-information, e.g., orientation, from on-board sensors can assist the user equipment (UE) BM. In this work, we use the orientation information coming from the inertial measurement unit (IMU) for effective BM. We use a data-driven strategy that fuses the reference signal received power (RSRP) with orientation information using a recurrent neural network (RNN). Simulation results show that the proposed strategy performs much better than the conventional BM and an orientation-assisted BM strategy from another study that utilizes a particle filter. Specifically, the proposed data-driven strategy improves the beam-prediction accuracy by up to 34% and increases mean RSRP by up to 4.2 dB when the UE orientation changes quickly.
Submitted 4 February, 2022;
originally announced February 2022.
-
Sub-Chain Beam for mmWave Devices: A Trade-off between Power Saving and Beam Correspondence
Authors:
Jianhua Mo,
Daehee Park,
Boon Loong Ng,
Vutha Va,
Anum Ali,
Chonghwa Seo,
Jianzhong Charlie Zhang
Abstract:
Beam correspondence, or downlink-uplink (DL-UL) beam reciprocity, refers to the assumption that the best beams in the DL are also the best beams in the UL. This is an important assumption that allows the existing beam management framework in 5G to rely heavily on DL beam sweeping and avoid UL beam sweeping: UL beams are inferred from the measurements of the DL reference signals. Beam correspondence holds when the radio configurations are symmetric in the DL and UL. However, as mmWave technology matures, the DL and the UL face different constraints, often breaking the beam correspondence. For example, power constraints may require a UE to activate only a portion of its antenna array for UL transmission, while still activating the full array for DL reception. Meanwhile, if the UL beam formed with a sub-array, termed the sub-chain beam in this paper, has a radiation pattern similar to that of the DL beam, the beam correspondence can still hold. This paper proposes methods for sub-chain beam codebook design to achieve a trade-off between power saving and beam correspondence.
Submitted 22 December, 2021;
originally announced December 2021.
-
Magnetic Resonance Spectroscopy Deep Learning Denoising Using Few In Vivo Data
Authors:
Dicheng Chen,
Wanqi Hu,
Huiting Liu,
Yirong Zhou,
Tianyu Qiu,
Yihui Huang,
Zi Wang,
Jiazheng Wang,
Liangjie Lin,
Zhigang Wu,
Hao Chen,
Xi Chen,
Gen Yan,
Di Guo,
Jianzhong Lin,
Xiaobo Qu
Abstract:
Magnetic Resonance Spectroscopy (MRS) is a noninvasive tool to reveal metabolic information. One challenge of 1H-MRS is the low Signal-to-Noise Ratio (SNR). To improve the SNR, a typical approach is to perform Signal Averaging (SA) with M repeated samples. The data acquisition time, however, is increased by M times accordingly, and a complete clinical MRS scan takes approximately 10 minutes at a common setting M=128. Recently, deep learning has been introduced to improve the SNR, but most such methods use simulated data as the training set. This may hinder the MRS applications since some potential differences, such as acquisition system imperfections and physiological and psychological conditions, may exist between the simulated and in vivo data. Here, we propose a new scheme that uses only the repeated samples of realistic data. A deep learning model, Refusion Long Short-Term Memory (ReLSTM), was designed to learn the mapping from the low SNR time-domain data (24 SA) to the high SNR one (128 SA). Experiments on the in vivo brain spectra of 7 healthy subjects, 2 brain tumor patients and 1 cerebral infarction patient showed that, using only 20% of the repeated samples, the spectra denoised by ReLSTM could provide estimated concentrations of metabolites comparable to those from 128 SA. Compared with the state-of-the-art low-rank denoising method, ReLSTM achieved lower relative errors and Cramér-Rao lower bounds in quantifying some important biomarkers. In summary, ReLSTM can perform high-fidelity denoising of the spectra under fast acquisition (24 SA), which would be valuable to MRS clinical studies.
Submitted 25 October, 2022; v1 submitted 26 January, 2021;
originally announced January 2021.
-
Robust Non-Coherent Beamforming for FDD Downlink Massive MIMO
Authors:
François Rottenberg,
Ming-Chun Lee,
Thomas Choi,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
Designing beamforming techniques for the downlink (DL) of frequency division duplex (FDD) massive MIMO is known to be a challenging problem due to the difficulty of obtaining channel state information (CSI). Indeed, since the uplink-downlink bands are disjoint, the system cannot rely on channel reciprocity to estimate the channel from uplink (UL) pilots as in time division duplexing (TDD) systems. Still, in this paper, we propose original designs for robust beamformers that do not require any feedback from the users and only rely on the transmission of UL pilots. The price to pay is that the beamformer is non-coherent in the sense that it does not leverage full knowledge of the phase of each multipath component. A large variety of novel designs are proposed under different criteria and partial phase knowledge.
Submitted 1 April, 2020;
originally announced April 2020.
-
Experimental Investigation of Frequency Domain Channel Extrapolation in Massive MIMO Systems for Zero-Feedback FDD
Authors:
Thomas Choi,
François Rottenberg,
Jorge Gomez-Ponce,
Akshay Ramesh,
Peng Luo,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
Estimating downlink (DL) channel state information (CSI) in frequency division duplex (FDD) massive multi-input multi-output (MIMO) systems generally requires downlink pilots and feedback overheads. Accordingly, this paper investigates the feasibility of zero-feedback FDD massive MIMO systems based on channel extrapolation. We use high-resolution parameter estimation (HRPE), specifically the space-alternating generalized expectation-maximization (SAGE) algorithm, to extrapolate the DL CSI based on the extracted parameters of multipath components in the uplink channel. We apply the HRPE to two different channel models: the vector spatial signature (VSS) model and the direction of arrival (DOA) model. We verify these methods through real-world channel data acquired from channel measurement campaigns with two different types of channel sounders: a) a switched array-based, real-time, time-domain, outdoor setup at 3.5 GHz, and b) a virtual array-based, high-accuracy, frequency-domain, indoor setup at 2.4 and 5-7 GHz. The performance metrics of the extrapolated channels that we evaluate include the mean squared error, beamforming efficiency, and spectral efficiency in multiuser MIMO scenarios. The results show that the HRPE-based channel extrapolation performs best under the simple VSS model, which does not require array calibration, and if the BS is in an open outdoor environment having line-of-sight (LOS) paths to well-separated users.
Submitted 29 September, 2020; v1 submitted 24 March, 2020;
originally announced March 2020.
-
RCNet: Incorporating Structural Information into Deep RNN for MIMO-OFDM Symbol Detection with Limited Training
Authors:
Zhou Zhou,
Lingjia Liu,
Shashank Jere,
Jianzhong Zhang,
Yang Yi
Abstract:
In this paper, we investigate learning-based MIMO-OFDM symbol detection strategies focusing on a special recurrent neural network (RNN) -- reservoir computing (RC). We first introduce the Time-Frequency RC to take advantage of the structural information inherent in OFDM signals. Using the time domain RC and the time-frequency RC as the building blocks, we provide two extensions of the shallow RC to RCNet: 1) Stacking multiple time domain RCs; 2) Stacking multiple time-frequency RCs into a deep structure. The combination of RNN dynamics, the time-frequency structure of MIMO-OFDM signals, and the deep network enables RCNet to handle the interference and nonlinear distortion of MIMO-OFDM signals to outperform existing methods. Unlike most existing NN-based detection strategies, RCNet is also shown to provide good generalization performance even with a limited training set (i.e., a similar amount of reference signals/training as standard model-based approaches). Numerical experiments demonstrate that the introduced RCNet can offer faster learning convergence and as much as 20% gain in bit error rate over a shallow RC structure by compensating for the nonlinear distortion of the MIMO-OFDM signal, such as due to power amplifier compression in the transmitter or due to finite quantization resolution in the receiver.
Submitted 15 March, 2020;
originally announced March 2020.
-
Random VLAD based Deep Hashing for Efficient Image Retrieval
Authors:
Li Weng,
Lingzhi Ye,
Jiangmin Tian,
Jiuwen Cao,
Jianzhong Wang
Abstract:
Image hash algorithms generate compact binary representations that can be quickly matched by Hamming distance, and thus become an efficient solution for large-scale image retrieval. This paper proposes RV-SSDH, a deep image hash algorithm that incorporates the classical VLAD (vector of locally aggregated descriptors) architecture into neural networks. Specifically, a novel neural network component is formed by coupling a random VLAD layer with a latent hash layer through a transform layer. This component can be combined with convolutional layers to realize a hash algorithm. We implement RV-SSDH as a point-wise algorithm that can be efficiently trained by minimizing classification error and quantization loss. Comprehensive experiments show this new architecture significantly outperforms baselines such as NetVLAD and SSDH, and offers a cost-effective trade-off in the state-of-the-art. In addition, the proposed random VLAD layer leads to satisfactory accuracy with low complexity, and thus shows promising potential as an alternative to NetVLAD.
Submitted 6 February, 2020;
originally announced February 2020.
-
GMAN: A Graph Multi-Attention Network for Traffic Prediction
Authors:
Chuanpan Zheng,
Xiaoliang Fan,
Cheng Wang,
Jianzhong Qi
Abstract:
Long-term traffic prediction is highly challenging due to the complexity of traffic systems and the constantly changing nature of many impacting factors. In this paper, we focus on the spatio-temporal factors, and propose a graph multi-attention network (GMAN) to predict traffic conditions for time steps ahead at different locations on a road network graph. GMAN adopts an encoder-decoder architecture, where both the encoder and the decoder consist of multiple spatio-temporal attention blocks to model the impact of the spatio-temporal factors on traffic conditions. The encoder encodes the input traffic features and the decoder predicts the output sequence. Between the encoder and the decoder, a transform attention layer is applied to convert the encoded traffic features to generate the sequence representations of future time steps as the input of the decoder. The transform attention mechanism models the direct relationships between historical and future time steps, which helps to alleviate the error propagation problem among prediction time steps. Experimental results on two real-world traffic prediction tasks (i.e., traffic volume prediction and traffic speed prediction) demonstrate the superiority of GMAN. In particular, in the 1 hour ahead prediction, GMAN outperforms state-of-the-art methods by up to 4% in the MAE measure. The source code is available at https://github.com/zhengchuanpan/GMAN.
Submitted 25 November, 2019; v1 submitted 11 November, 2019;
originally announced November 2019.
-
Channel Extrapolation for FDD Massive MIMO: Procedure and Experimental Results
Authors:
Thomas Choi,
François Rottenberg,
Jorge Gomez-Ponce,
Akshay Ramesh,
Peng Luo,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
Application of massive multiple-input multiple-output (MIMO) systems to frequency division duplex (FDD) is challenging mainly due to the considerable overhead required for downlink training and feedback. Channel extrapolation, i.e., estimating the channel response at the downlink frequency band based on measurements in the disjoint uplink band, is a promising solution to overcome this bottleneck. This paper presents measurement campaigns obtained by using a wideband (350 MHz) channel sounder at 3.5 GHz composed of a calibrated 64-element antenna array, in both an anechoic chamber and an outdoor environment. The Space Alternating Generalized Expectation-Maximization (SAGE) algorithm was used to extract the parameters (amplitude, delay, and angular information) of the multipath components from the attained channel data within the training (uplink) band. The channel in the downlink band is then reconstructed based on these path parameters. The performance of the extrapolated channel is evaluated in terms of mean squared error (MSE) and reduction of beamforming gain (RBG) in comparison to the ground truth, i.e., the measured channel at the downlink frequency. We find strong sensitivity to calibration errors and model mismatch, and also find that performance depends on propagation conditions: LOS performs significantly better than NLOS.
Submitted 26 July, 2019;
originally announced July 2019.
-
Artificial Intelligence-Enabled Cellular Networks: A Critical Path to Beyond-5G and 6G
Authors:
Rubayet Shafin,
Lingjia Liu,
Vikram Chandrasekhar,
Hao Chen,
Jeffrey Reed,
Jianzhong Zhang
Abstract:
Mobile Network Operators (MNOs) are in the process of overlaying their conventional macro cellular networks with shorter range cells such as outdoor pico cells. The resultant increase in network complexity creates substantial overhead in terms of operating expenses, time, and labor for their planning and management. Artificial intelligence (AI) offers the potential for MNOs to operate their networks in a more organic and cost-efficient manner. We argue that deploying AI in 5G and Beyond will require surmounting significant technical barriers in terms of robustness, performance, and complexity. We outline future research directions, identify the top 5 challenges, and present a possible roadmap to realize the vision of AI-enabled cellular networks for Beyond-5G and 6G.
Submitted 17 July, 2019;
originally announced July 2019.
-
GeoPrune: Efficiently Finding Shareable Vehicles Based on Geometric Properties
Authors:
Yixin Xu,
Jianzhong Qi,
Renata Borovica-Gajic,
Lars Kulik
Abstract:
On-demand ride-sharing is rapidly growing. Matching trip requests to vehicles efficiently is critical for the service quality of ride-sharing. To match trip requests with vehicles, a prune-and-select scheme is commonly used. The pruning stage identifies feasible vehicles that can satisfy the trip constraints (e.g., trip time). The selection stage selects the optimal one(s) from the feasible vehicles. The pruning stage is crucial to reduce the complexity of the selection stage and to achieve efficient matching. We propose an effective and efficient pruning algorithm called GeoPrune. GeoPrune represents the time constraints of trip requests using circles and ellipses, which can be computed and updated efficiently. Experiments on real-world datasets show that GeoPrune reduces the number of vehicle candidates in nearly all cases by an order of magnitude and the update cost by two to three orders of magnitude compared to the state-of-the-art.
Submitted 19 October, 2019; v1 submitted 3 July, 2019;
originally announced July 2019.
-
Self-Tuning Sectorization: Deep Reinforcement Learning Meets Broadcast Beam Optimization
Authors:
Rubayet Shafin,
Hao Chen,
Young Han Nam,
Sooyoung Hur,
Jeongho Park,
Jianzhong Zhang,
Jeffrey Reed,
Lingjia Liu
Abstract:
Beamforming in multiple input multiple output (MIMO) systems is one of the key technologies for modern wireless communication. Creating appropriate sector-specific broadcast beams is essential for enhancing the coverage of the cellular network and for improving the broadcast operation for control signals. However, in order to maximize the coverage, patterns for broadcast beams need to be adapted based on the users' distribution and movement over time. In this work, we present self-tuning sectorization: a deep reinforcement learning framework to optimize MIMO broadcast beams autonomously and dynamically based on the users' distribution in the network. Taking UE measurement results directly as input, the deep reinforcement learning agent can track and predict the UE distribution pattern and come up with the best broadcast beams for each cell. Extensive simulation results show that the introduced framework can achieve the optimal coverage, and converge to the oracle solution for both single-sector and multiple-sector environments, and for both periodic and Markov mobility patterns.
Submitted 14 June, 2019;
originally announced June 2019.
-
Performance Analysis of Channel Extrapolation in FDD Massive MIMO Systems
Authors:
Francois Rottenberg,
Thomas Choi,
Peng Luo,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
Channel estimation for the downlink of frequency division duplex (FDD) massive MIMO systems is well known to generate a large overhead as the amount of training generally scales with the number of transmit antennas in a MIMO system. In this paper, we consider the solution of extrapolating the channel frequency response from uplink pilot estimates to the downlink frequency band, which completely removes the training overhead. We first show that conventional estimators fail to achieve reasonable accuracy. We propose instead to use high-resolution channel estimation. We derive theoretical lower bounds (LB) for the mean squared error (MSE) of the extrapolated channel. Assuming that the paths are well separated, the LB is simplified into an expression that gives considerable physical insight. It is then shown that the MSE is inversely proportional to the number of receive antennas while the extrapolation performance penalty scales with the square of the ratio of the frequency offset and the training bandwidth. The channel extrapolation performance is validated through numerical simulations and experimental measurements taken in an anechoic chamber. Our main conclusion is that channel extrapolation is a viable solution for FDD massive MIMO systems if accurate system calibration is performed and favorable propagation conditions are present.
Submitted 22 January, 2020; v1 submitted 28 March, 2019;
originally announced April 2019.
-
How Many Antennas Do We Need for Massive MIMO Channel Sounding? - Validating Through Measurement
Authors:
Thomas Choi,
François Rottenberg,
Peng Luo,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
This paper investigates the impact of the number of antennas (8 to 64) and the array configuration on massive MIMO channel parameter estimation for multiple propagation scenarios at 3.5 GHz. Different measurement environments are artificially created by placing several reflectors and absorbers in an anechoic chamber. Ground truth channel parameters, e.g., path angles, are obtained by geometry and trigonometric rules. Then, these are compared to the channel parameters extracted by applying the Space-Alternating Generalized Expectation-Maximization (SAGE) algorithm on the measurements. Overall, the estimation errors for various array configurations and the multiple environments are compared. This paper will help to determine the appropriate configuration of the antenna array and the parameter extraction algorithm for outdoor massive MIMO channel sounding campaigns.
Submitted 19 March, 2019;
originally announced March 2019.
-
Real-Time Millimeter-Wave MIMO Channel Sounder for Dynamic Directional Measurements
Authors:
C. Umit Bas,
Rui Wang,
Seun Sangodoyin,
Dimitris Psychoudakis,
Thomas Henige,
Robert Monroe,
Jeongho Park,
Jianzhong Zhang,
Andreas F. Molisch
Abstract:
In this paper, we present a novel real-time multiple-input-multiple-output (MIMO) channel sounder for the 28 GHz band. Until now, most investigations of the directional characteristics of millimeter-wave channels have used mechanically rotating horn antennas. In contrast, the sounder presented here is capable of performing horizontal and vertical beam steering with the help of phased arrays. Due to its fast beam-switching capability, the proposed sounder can perform measurements that are directionally resolved both at the transmitter (TX) and receiver (RX) in 1.44 milliseconds compared to the minutes or even hours required for rotating horn antenna sounders. This not only enables measurement of more TX-RX locations for better statistical validity but also allows directional analysis in dynamic environments. The short measurement time combined with the high phase stability limits the phase drift between TX and RX, enabling phase-coherent sounding of all beam pairs even when TX and RX have no cabled connection for synchronization without any delay ambiguity. Furthermore, the phase stability over time enables complex RX waveform averaging to improve the signal to noise ratio during high path loss measurements. The paper discusses both the system design and the measurements performed for verification of the sounder performance. Furthermore, we present sample results from double directional measurements in dynamic environments.
Submitted 31 July, 2018;
originally announced July 2018.