Search | arXiv e-print repository

Deep-Learning-Based Pre-Layout Parasitic Capacitance Prediction on SRAM Designs

Authors: Shan Shen, Dingcheng Yang, Yuyang Xie, Chunyan Pei, Wenjian Yu, Bei Yu

Abstract: To achieve higher system energy efficiency, SRAM in SoCs is often customized. The parasitic effects cause notable discrepancies between pre-layout and post-layout circuit simulations, leading to difficulty in converging design parameters and excessive design iterations. Is it possible to well predict the parasitics based on the pre-layout circuit, so as to perform parasitic-aware pre-layout simula… ▽ More To achieve higher system energy efficiency, SRAM in SoCs is often customized. The parasitic effects cause notable discrepancies between pre-layout and post-layout circuit simulations, leading to difficulty in converging design parameters and excessive design iterations. Is it possible to well predict the parasitics based on the pre-layout circuit, so as to perform parasitic-aware pre-layout simulation? In this work, we propose a deep-learning-based 2-stage model to accurately predict these parasitics in pre-layout stages. The model combines a Graph Neural Network (GNN) classifier and Multi-Layer Perceptron (MLP) regressors, effectively managing class imbalance of the net parasitics in SRAM circuits. We also employ Focal Loss to mitigate the impact of abundant internal net samples and integrate subcircuit information into the graph to abstract the hierarchical structure of schematics. Experiments on 4 real SRAM designs show that our approach not only surpasses the state-of-the-art model in parasitic prediction by a maximum of 19X reduction of error but also significantly boosts the simulation process by up to 598X speedup. △ Less

Submitted 9 July, 2025; originally announced July 2025.

Comments: Published in Proceedings of GLSVLSI2024

arXiv:2507.06538 [pdf, ps, other]

Few-shot Learning on AMS Circuits and Its Application to Parasitic Capacitance Prediction

Authors: Shan Shen, Yibin Zhang, Hector Rodriguez Rodriguez, Wenjian Yu

Abstract: Graph representation learning is a powerful method to extract features from graph-structured data, such as analog/mixed-signal (AMS) circuits. However, training deep learning models for AMS designs is severely limited by the scarcity of integrated circuit design data. In this work, we present CircuitGPS, a few-shot learning method for parasitic effect prediction in AMS circuits. The circuit netlis… ▽ More Graph representation learning is a powerful method to extract features from graph-structured data, such as analog/mixed-signal (AMS) circuits. However, training deep learning models for AMS designs is severely limited by the scarcity of integrated circuit design data. In this work, we present CircuitGPS, a few-shot learning method for parasitic effect prediction in AMS circuits. The circuit netlist is represented as a heterogeneous graph, with the coupling capacitance modeled as a link. CircuitGPS is pre-trained on link prediction and fine-tuned on edge regression. The proposed method starts with a small-hop sampling technique that converts a link or a node into a subgraph. Then, the subgraph embeddings are learned with a hybrid graph Transformer. Additionally, CircuitGPS integrates a low-cost positional encoding that summarizes the positional and structural information of the sampled subgraph. CircuitGPS improves the accuracy of coupling existence by at least 20\% and reduces the MAE of capacitance estimation by at least 0.067 compared to existing methods. Our method demonstrates strong inherent scalability, enabling direct application to diverse AMS circuit designs through zero-shot learning. Furthermore, the ablation studies provide valuable insights into graph models for representation learning. △ Less

Submitted 9 July, 2025; originally announced July 2025.

Comments: Published in Proceedings of DAC2025

arXiv:2507.06535 [pdf, ps, other]

Transferable Parasitic Estimation via Graph Contrastive Learning and Label Rebalancing in AMS Circuits

Authors: Shan Shen, Shenglu Hua, Jiajun Zou, Jiawei Liu, Jianwang Zhai, Chuan Shi, Wenjian Yu

Abstract: Graph representation learning on Analog-Mixed Signal (AMS) circuits is crucial for various downstream tasks, e.g., parasitic estimation. However, the scarcity of design data, the unbalanced distribution of labels, and the inherent diversity of circuit implementations pose significant challenges to learning robust and transferable circuit representations. To address these limitations, we propose Ci… ▽ More Graph representation learning on Analog-Mixed Signal (AMS) circuits is crucial for various downstream tasks, e.g., parasitic estimation. However, the scarcity of design data, the unbalanced distribution of labels, and the inherent diversity of circuit implementations pose significant challenges to learning robust and transferable circuit representations. To address these limitations, we propose CircuitGCL, a novel graph contrastive learning framework that integrates representation scattering and label rebalancing to enhance transferability across heterogeneous circuit graphs. CircuitGCL employs a self-supervised strategy to learn topology-invariant node embeddings through hyperspherical representation scattering, eliminating dependency on large-scale data. Simultaneously, balanced mean squared error (MSE) and softmax cross-entropy (bsmCE) losses are introduced to mitigate label distribution disparities between circuits, enabling robust and transferable parasitic estimation. Evaluated on parasitic capacitance estimation (edge-level task) and ground capacitance classification (node-level task) across TSMC 28nm AMS designs, CircuitGCL outperforms all state-of-the-art (SOTA) methods, with the $R^2$ improvement of $33.64\% \sim 44.20\%$ for edge regression and F1-score gain of $0.9\times \sim 2.1\times$ for node classification. Our code is available at \href{https://anonymous.4open.science/r/CircuitGCL-099B/README.md}{here}. △ Less

Submitted 9 July, 2025; originally announced July 2025.

Comments: Accepted by ICCAD2025. This is the initial version. Minor changes will be made

arXiv:2506.02601 [pdf, ps, other]

Hyperspectral Image Generation with Unmixing Guided Diffusion Model

Authors: Shiyu Shen, Bin Pan, Ziye Zhang, Zhenwei Shi

Abstract: Recently, hyperspectral image generation has received increasing attention, but existing generative models rely on conditional generation schemes, which limits the diversity of generated images. Diffusion models are popular for their ability to generate high-quality samples, but adapting these models from RGB to hyperspectral data presents the challenge of high dimensionality and physical constrai… ▽ More Recently, hyperspectral image generation has received increasing attention, but existing generative models rely on conditional generation schemes, which limits the diversity of generated images. Diffusion models are popular for their ability to generate high-quality samples, but adapting these models from RGB to hyperspectral data presents the challenge of high dimensionality and physical constraints. To address these challenges, we propose a novel diffusion model guided by hyperspectral unmixing. Our model comprises two key modules: an unmixing autoencoder module and an abundance diffusion module. The unmixing autoencoder module leverages unmixing guidance to shift the generative task from the image space to the low-dimensional abundance space, significantly reducing computational complexity while preserving high fidelity. The abundance diffusion module generates samples that satisfy the constraints of non-negativity and unity, ensuring the physical consistency of the reconstructed HSIs. Additionally, we introduce two evaluation metrics tailored to hyperspectral data. Empirical results, evaluated using both traditional metrics and our proposed metrics, indicate that our model is capable of generating high-quality and diverse hyperspectral images, offering an advancement in hyperspectral data generation. △ Less

Submitted 3 June, 2025; originally announced June 2025.

arXiv:2505.16504 [pdf, ps, other]

A Tutorial on Beyond-Diagonal Reconfigurable Intelligent Surfaces: Modeling, Architectures, System Design and Optimization, and Applications

Authors: Hongyu Li, Matteo Nerini, Shanpu Shen, Bruno Clerckx

Abstract: Written by its inventors, this first tutorial on Beyond-Diagonal Reconfigurable Intelligent Surfaces (BD-RISs) provides the readers with the basics and fundamental tools necessary to appreciate, understand, and contribute to this emerging and disruptive technology. Conventional (Diagonal) RISs (D-RISs) are characterized by a diagonal scattering matrix $\mathbfΘ$ such that the wave manipulation fle… ▽ More Written by its inventors, this first tutorial on Beyond-Diagonal Reconfigurable Intelligent Surfaces (BD-RISs) provides the readers with the basics and fundamental tools necessary to appreciate, understand, and contribute to this emerging and disruptive technology. Conventional (Diagonal) RISs (D-RISs) are characterized by a diagonal scattering matrix $\mathbfΘ$ such that the wave manipulation flexibility of D-RIS is extremely limited. In contrast, BD-RIS refers to a novel and general framework for RIS where its scattering matrix is not limited to be diagonal (hence, the ``beyond-diagonal'' terminology) and consequently, all entries of $\mathbfΘ$ can potentially help shaping waves for much higher manipulation flexibility. This physically means that BD-RIS can artificially engineer and reconfigure coupling across elements of the surface thanks to inter-element reconfigurable components which allow waves absorbed by one element to flow through other elements. Consequently, BD-RIS opens the door to more general and versatile intelligent surfaces that subsumes existing RIS architectures as special cases. In this tutorial, we share all the secret sauce to model, design, and optimize BD-RIS and make BD-RIS transformative in many different applications. Topics discussed include physics-consistent and multi-port network-aided modeling; transmitting, reflecting, hybrid, and multi-sector mode analysis; reciprocal and non-reciprocal architecture designs and optimal performance-complexity Pareto frontier of BD-RIS; signal processing, optimization, and channel estimation for BD-RIS; hardware impairments (discrete-value impedance and admittance, lossy interconnections and components, wideband effects, mutual coupling) of BD-RIS; benefits and applications of BD-RIS in communications, sensing, power transfer. △ Less

Submitted 22 May, 2025; originally announced May 2025.

Comments: 42 pages, 37 figures, submitted to IEEE journal for future publication

arXiv:2505.02470 [pdf, other]

Deep learning of personalized priors from past MRI scans enables fast, quality-enhanced point-of-care MRI with low-cost systems

Authors: Tal Oved, Beatrice Lena, Chloé F. Najac, Sheng Shen, Matthew S. Rosen, Andrew Webb, Efrat Shimron

Abstract: Magnetic resonance imaging (MRI) offers superb-quality images, but its accessibility is limited by high costs, posing challenges for patients requiring longitudinal care. Low-field MRI provides affordable imaging with low-cost devices but is hindered by long scans and degraded image quality, including low signal-to-noise ratio (SNR) and tissue contrast. We propose a novel healthcare paradigm: usin… ▽ More Magnetic resonance imaging (MRI) offers superb-quality images, but its accessibility is limited by high costs, posing challenges for patients requiring longitudinal care. Low-field MRI provides affordable imaging with low-cost devices but is hindered by long scans and degraded image quality, including low signal-to-noise ratio (SNR) and tissue contrast. We propose a novel healthcare paradigm: using deep learning to extract personalized features from past standard high-field MRI scans and harnessing them to enable accelerated, enhanced-quality follow-up scans with low-cost systems. To overcome the SNR and contrast differences, we introduce ViT-Fuser, a feature-fusion vision transformer that learns features from past scans, e.g. those stored in standard DICOM CDs. We show that \textit{a single prior scan is sufficient}, and this scan can come from various MRI vendors, field strengths, and pulse sequences. Experiments with four datasets, including glioblastoma data, low-field ($50mT$), and ultra-low-field ($6.5mT$) data, demonstrate that ViT-Fuser outperforms state-of-the-art methods, providing enhanced-quality images from accelerated low-field scans, with robustness to out-of-distribution data. Our freely available framework thus enables rapid, diagnostic-quality, low-cost imaging for wide healthcare applications. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2504.09618 [pdf, other]

A Hybrid Transmitting and Reflecting Beyond Diagonal Reconfigurable Intelligent Surface with Independent Beam Control and Power Splitting

Authors: Zhaoyang Ming, Shanpu Shen, Junhui Rao, Zan Li, Jichen Zhang, Chi Yuk Chiu, Ross Murch

Abstract: A hybrid transmitting and reflecting beyond diagonal reconfigurable intelligent surface (BD-RIS) design is proposed. Operating in the same aperture, frequency band and polarization, the proposed BD-RIS features independent beam steering control of its reflected and transmitted waves. In addition it provides a hybrid mode with both reflected and transmitted waves using tunable power splitting betwe… ▽ More A hybrid transmitting and reflecting beyond diagonal reconfigurable intelligent surface (BD-RIS) design is proposed. Operating in the same aperture, frequency band and polarization, the proposed BD-RIS features independent beam steering control of its reflected and transmitted waves. In addition it provides a hybrid mode with both reflected and transmitted waves using tunable power splitting between beams. The BD-RIS comprises two phase reconfigurable antenna arrays interconnected by an array of tunable two-port power splitters. The two-port power splitter in each BD-RIS cell is built upon a varactor in parallel with a bias inductor to exert tunable impedance variations on transmission lines. Provided with variable reverse DC voltages, the two-port power splitter can control the power ratio of S11 over S21 from -20 dB to 20 dB, thus allowing tunable power splitting. Each antenna is 2-bit phase reconfigurable with 200 MHz bandwidth at 2.4 GHz so that each cell of BD-RIS can also achieve independent reflection and transmission phase control. To characterize and optimize the electromagnetic response of the proposed BD-RIS design, a Thévenin equivalent model and corresponding analytical method is provided. A BD-RIS with 4 by 4 cells was also prototyped and tested. Experiments show that in reflection and transmission mode, the fabricated BD-RIS can realize beam steering in reflection and transmission space, respectively. It is also verified that when operating in hybrid mode, the BD-RIS enables independent beam steering of the reflected and transmitted waves. This work helps fill the gap between realizing practical hardware design and establishing an accurate physical model for the hybrid transmitting and reflecting BD-RIS, enabling hybrid transmitting and reflecting BD-RIS assisted wireless communications. △ Less

Submitted 13 April, 2025; originally announced April 2025.

Comments: 15 pages, 16 figures

arXiv:2503.22605 [pdf, ps, other]

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis

Authors: Shuai Shen, Wanhua Li, Yunpeng Zhang, Yap-Peng Tan, Jiwen Lu

Abstract: Talking head synthesis has emerged as a prominent research topic in computer graphics and multimedia, yet most existing methods often struggle to strike a balance between generation quality and computational efficiency, particularly under real-time constraints. In this paper, we propose a novel framework that integrates Gaussian Splatting with a structured Audio Factorization Plane (Audio-Plane) t… ▽ More Talking head synthesis has emerged as a prominent research topic in computer graphics and multimedia, yet most existing methods often struggle to strike a balance between generation quality and computational efficiency, particularly under real-time constraints. In this paper, we propose a novel framework that integrates Gaussian Splatting with a structured Audio Factorization Plane (Audio-Plane) to enable high-quality, audio-synchronized, and real-time talking head generation. For modeling a dynamic talking head, a 4D volume representation, which consists of three axes in 3D space and one temporal axis aligned with audio progression, is typically required. However, directly storing and processing a dense 4D grid is impractical due to the high memory and computation cost, and lack of scalability for longer durations. We address this challenge by decomposing the 4D volume representation into a set of audio-independent spatial planes and audio-dependent planes, forming a compact and interpretable representation for talking head modeling that we refer to as the Audio-Plane. This factorized design allows for efficient and fine-grained audio-aware spatial encoding, and significantly enhances the model's ability to capture complex lip dynamics driven by speech signals. To further improve region-specific motion modeling, we introduce an audio-guided saliency splatting mechanism based on region-aware modulation, which adaptively emphasizes highly dynamic regions such as the mouth area. This allows the model to focus its learning capacity on where it matters most for accurate speech-driven animation. Extensive experiments on both the self-driven and the cross-driven settings demonstrate that our method achieves state-of-the-art visual quality, precise audio-lip synchronization, and real-time performance, outperforming prior approaches across both 2D- and 3D-based paradigms. △ Less

Submitted 26 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

Comments: Demo video at \url{https://sstzal.github.io/Audio-Plane/}

arXiv:2503.03629 [pdf, other]

TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation

Authors: Haowei Sun, Xintao Yan, Zhijie Qiao, Haojie Zhu, Yihao Sun, Jiawei Wang, Shengyin Shen, Darian Hogue, Rajanikant Ananta, Derek Johnson, Greg Stevens, Greg McGuire, Yifan Wei, Wei Zheng, Yong Sun, Yasuo Fukai, Henry X. Liu

Abstract: Traffic simulation is essential for autonomous vehicle (AV) development, enabling comprehensive safety evaluation across diverse driving conditions. However, traditional rule-based simulators struggle to capture complex human interactions, while data-driven approaches often fail to maintain long-term behavioral realism or generate diverse safety-critical events. To address these challenges, we pro… ▽ More Traffic simulation is essential for autonomous vehicle (AV) development, enabling comprehensive safety evaluation across diverse driving conditions. However, traditional rule-based simulators struggle to capture complex human interactions, while data-driven approaches often fail to maintain long-term behavioral realism or generate diverse safety-critical events. To address these challenges, we propose TeraSim, an open-source, high-fidelity traffic simulation platform designed to uncover unknown unsafe events and efficiently estimate AV statistical performance metrics, such as crash rates. TeraSim is designed for seamless integration with third-party physics simulators and standalone AV stacks, to construct a complete AV simulation system. Experimental results demonstrate its effectiveness in generating diverse safety-critical events involving both static and dynamic agents, identifying hidden deficiencies in AV systems, and enabling statistical performance evaluation. These findings highlight TeraSim's potential as a practical tool for AV safety assessment, benefiting researchers, developers, and policymakers. The code is available at https://github.com/mcity/TeraSim. △ Less

Submitted 1 April, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

arXiv:2503.03546 [pdf]

Intermediate Domain-guided Adaptation for Unsupervised Chorioallantoic Membrane Vessel Segmentation

Authors: Pengwu Song, Liang Xu, Peng Yao, Shuwei Shen, Pengfei Shao, Mingzhai Sun, Ronald X. Xu

Abstract: The chorioallantoic membrane (CAM) model is widely employed in angiogenesis research, and distribution of growing blood vessels is the key evaluation indicator. As a result, vessel segmentation is crucial for quantitative assessment based on topology and morphology. However, manual segmentation is extremely time-consuming, labor-intensive, and prone to inconsistency due to its subjective nature. M… ▽ More The chorioallantoic membrane (CAM) model is widely employed in angiogenesis research, and distribution of growing blood vessels is the key evaluation indicator. As a result, vessel segmentation is crucial for quantitative assessment based on topology and morphology. However, manual segmentation is extremely time-consuming, labor-intensive, and prone to inconsistency due to its subjective nature. Moreover, research on CAM vessel segmentation algorithms remains limited, and the lack of public datasets contributes to poor prediction performance. To address these challenges, we propose an innovative Intermediate Domain-guided Adaptation (IDA) method, which utilizes the similarity between CAM images and retinal images, along with existing public retinal datasets, to perform unsupervised training on CAM images. Specifically, we introduce a Multi-Resolution Asymmetric Translation (MRAT) strategy to generate intermediate images to promote image-level interaction. Then, an Intermediate Domain-guided Contrastive Learning (IDCL) module is developed to disentangle cross-domain feature representations. This method overcomes the limitations of existing unsupervised domain adaptation (UDA) approaches, which primarily concentrate on directly source-target alignment while neglecting intermediate domain information. Notably, we create the first CAM dataset to validate the proposed algorithm. Extensive experiments on this dataset show that our method outperforms compared approaches. Moreover, it achieves superior performance in UDA tasks across retinal datasets, highlighting its strong generalization capability. The CAM dataset and source codes are available at https://github.com/Light-47/IDA. △ Less

Submitted 9 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

arXiv:2502.13733 [pdf, other]

Intrinsic Cramér-Rao Bound based 6D Localization and Tracking for 5G/6G Systems

Authors: Xueting Xu, Hui Chen, Shengqiang Shen, Hyowon Kim, Xu Fang, Ao Peng, Fan Jiang, Henk Wymeersch

Abstract: Localization and tracking are critical components of integrated sensing and communication (ISAC) systems, enhancing resource management, beamforming accuracy, and overall system reliability through precise sensing. Due to the high path loss of the high-frequency systems, antenna arrays are required at the transmitter and receiver sides for beamforming gain. However, beam misalignment may occur, wh… ▽ More Localization and tracking are critical components of integrated sensing and communication (ISAC) systems, enhancing resource management, beamforming accuracy, and overall system reliability through precise sensing. Due to the high path loss of the high-frequency systems, antenna arrays are required at the transmitter and receiver sides for beamforming gain. However, beam misalignment may occur, which requires accurate tracking of the six-dimensional (6D) state, namely, 3D position and 3D orientation. In this work, we first address the challenge that the rotation matrix, being part of the Lie group rather than Euclidean space, necessitates the derivation of the ICRB for an intrinsic performance benchmark. Then, leveraging the derived ICRB, we develop two filters-one utilizing pose fusion and the other employing error-state Kalman filter to estimate the UE's 6D state for different computational resource consumption and accuracy requirements. Simulation results validate the ICRB and assess the performance of the proposed filters, demonstrating their effectiveness and improved accuracy in 6D state tracking. △ Less

Submitted 19 February, 2025; originally announced February 2025.

arXiv:2501.07057 [pdf, other]

Optimization with Multi-sourced Reference Information and Unknown Trust: A Distributionally Robust Approach

Authors: Yanru Guo, Ruiwei Jiang, Siqian Shen

Abstract: In problems that involve input parameter information gathered from multiple data sources with varying reliability, incorporating users' trust about different sources in decision-optimization models can potentially improve solution performance and reliability. In this work, we propose a novel multi-reference distributionally robust optimization (MR-DRO) framework, where the model inputs are uncerta… ▽ More In problems that involve input parameter information gathered from multiple data sources with varying reliability, incorporating users' trust about different sources in decision-optimization models can potentially improve solution performance and reliability. In this work, we propose a novel multi-reference distributionally robust optimization (MR-DRO) framework, where the model inputs are uncertain and their probability distributions can be statistically inferred from multiple data sources. Via nonparametric data fusion, we construct a Wasserstein ambiguity set to minimize the worst-case expected value of a stochastic objective function, accounting for both uncertainty and unknown reliability of information sources. We reformulate the MR-DRO model as a linear program given linear objective and constraints in the original problem. We also incorporate a dynamic trust update mechanism that adjusts the trust for each source based on its performance over time. In addition, we introduce the concept of probability dominance to identify sources with dominant trust. Via solving instances of resource allocation and portfolio optimization, we demonstrate the effectiveness of the trust-informed MR-DRO approach compared to traditional optimization frameworks relying on a single data source. Our results highlight the significance of integrating (dynamic) user trust in decision making under uncertainty, particularly when given diverse and potentially conflicting input data. △ Less

Submitted 12 January, 2025; originally announced January 2025.

Comments: 38 pages, 9 figures, 7 tables

arXiv:2412.13778 [pdf]

Fast Link Recovery via PTP-synchronized Nanosecond Optical Switching

Authors: V. Yokar, A. Mehrpooya, Y. Teng, S. Shen, Z. Wu, K. Bardhi, S. Yan, D. Simeonidou

Abstract: This paper proposes and validates a PTP-synchronized 8.4ns optical switching with a 100ns jitter at the switching edges. This approach is adopted and demonstrated for instant network recovery within 2.7ms and scheduled network recovery. This paper proposes and validates a PTP-synchronized 8.4ns optical switching with a 100ns jitter at the switching edges. This approach is adopted and demonstrated for instant network recovery within 2.7ms and scheduled network recovery. △ Less

Submitted 18 December, 2024; originally announced December 2024.

arXiv:2411.06642 [pdf, ps, other]

Antenna Coding Empowered by Pixel Antennas

Authors: Shanpu Shen, Kai-Kit Wong, Ross Murch

Abstract: Pixel antennas, based on discretizing a continuous radiation surface into small elements called pixels, are a flexible reconfigurable antenna technology. By controlling the connections between pixels via switches, the characteristics of pixel antennas can be adjusted to enhance the wireless channel. Inspired by this, we propose a novel technique denoted antenna coding empowered by pixel antennas.… ▽ More Pixel antennas, based on discretizing a continuous radiation surface into small elements called pixels, are a flexible reconfigurable antenna technology. By controlling the connections between pixels via switches, the characteristics of pixel antennas can be adjusted to enhance the wireless channel. Inspired by this, we propose a novel technique denoted antenna coding empowered by pixel antennas. We first derive a physical and electromagnetic based communication model for pixel antennas using microwave multiport network theory and beamspace channel representation. With the model, we optimize the antenna coding to maximize the channel gain in a single-input single-output (SISO) pixel antenna system and develop a codebook design for antenna coding to reduce the computational complexity. We analyze the average channel gain of SISO pixel antenna system and derive the corresponding upper bound. In addition, we jointly optimize the antenna coding and transmit signal covariance matrix to maximize the channel capacity in a multiple-input multiple-output (MIMO) pixel antenna system. Simulation results show that using pixel antennas can enhance the average channel gain by up to 5.4 times and channel capacity by up to 3.1 times, demonstrating the significant potential of pixel antennas as a new dimension to design and optimize wireless communication systems. △ Less

Submitted 10 November, 2024; originally announced November 2024.

Comments: 13 pages, 11 figures

arXiv:2411.01428 [pdf, other]

Distributionally Robust Resource Allocation with Trust-aided Parametric Information Fusion

Authors: Yanru Guo, Bo Zhou, Ruiwei Jiang, Xi, Yang, Siqian Shen

Abstract: Reference information plays an essential role for making decisions under uncertainty, yet may vary across multiple data sources. In this paper, we study resource allocation in stochastic dynamic environments, where we perform information fusion based on trust of different data sources, to design an ambiguity set for attaining distributionally robust resource allocation solutions. We dynamically up… ▽ More Reference information plays an essential role for making decisions under uncertainty, yet may vary across multiple data sources. In this paper, we study resource allocation in stochastic dynamic environments, where we perform information fusion based on trust of different data sources, to design an ambiguity set for attaining distributionally robust resource allocation solutions. We dynamically update the trust parameter to simulate the decision maker's trust change based on losses caused by mis-specified reference information. We show an equivalent tractable linear programming reformulation of the distributionally robust optimization model and demonstrate the performance in a wildfire suppression application, where we use drone and satellite data to estimate the needs of resources in different regions. We demonstrate how our methods can improve trust and decision accuracy. The computational time grows linearly in the number of data sources and problem sizes. △ Less

Submitted 2 November, 2024; originally announced November 2024.

Comments: 6 pages, 5 figures, accepted by the Proceedings of the 63rd IEEE Conference on Decision and Control (CDC 2024), Milan, Italy, December 2024

arXiv:2411.01416 [pdf, other]

Sequential Charging Station Location Optimization under Uncertain Charging Behavior and User Growth

Authors: Wenjia Shen, Bo Zhou, Ruiwei Jiang, Siqian Shen

Abstract: Charging station availability is crucial for a thriving electric vehicle market. Due to budget constraints, locating these stations usually proceeds in phases, which calls for careful consideration of the (random) charging demand growth throughout the planning horizon. This paper integrates user choice behavior into two-stage and multi-stage stochastic programming models for intracity charging sta… ▽ More Charging station availability is crucial for a thriving electric vehicle market. Due to budget constraints, locating these stations usually proceeds in phases, which calls for careful consideration of the (random) charging demand growth throughout the planning horizon. This paper integrates user choice behavior into two-stage and multi-stage stochastic programming models for intracity charging station planning under demand uncertainty. We derive a second-order conic representation for the nonlinear, nonconvex formulation by taking advantage of the binary nature of location variables and propose subgradient inequalities to accelerate computation. Numerical results demonstrate the value of employing multi-stage models, particularly in scenarios of high demand fluctuations, increased demand dispersion, and high user sensitivity to the distance-to-recharge. △ Less

Submitted 2 November, 2024; originally announced November 2024.

Comments: 6 pages, 4 figures, to appear in the Proceedings of the 63rd IEEE Conference on Decision and Control (CDC 2024), Milan, Italy, Dec 2024

arXiv:2407.13401 [pdf, ps, other]

Cooperative Integrated Sensing and Communication Networks: Analysis and Distributed Design

Authors: Bowen Wang, Hongyu Li, Fan Liu, Ziyang Cheng, Shanpu Shen

Abstract: This paper proposes a cooperative integrated sensing and communication network (Co-ISACNet) adopting hybrid beamforming (HBF) architecture, which improves both radar sensing and communication performance. The main contributions of this work are four-fold. First, we introduce a novel cooperative sensing method for the considered Co-ISACNet, followed by a comprehensive analysis of this method. This… ▽ More This paper proposes a cooperative integrated sensing and communication network (Co-ISACNet) adopting hybrid beamforming (HBF) architecture, which improves both radar sensing and communication performance. The main contributions of this work are four-fold. First, we introduce a novel cooperative sensing method for the considered Co-ISACNet, followed by a comprehensive analysis of this method. This analysis mathematically verifies the benefits of Co-ISACNet and provides insightful design guidelines. Second, to show the benefits of Co-ISACNet, we propose to jointly design the HBF to maximize the network communication capacity while satisfying the constraint of beampattern similarity for radar sensing, which results in a highly dimensional and non-convex problem. Third, to facilitate the joint design, we propose a novel distributed optimization framework based on proximal gradient and alternating direction method of multipliers, namely PANDA. Fourth, we further adopt the proposed PANDA framework to solve the joint HBF design problem for the Co-ISACNet. By using the proposed PANDA framework, all access points (APs) optimize the HBF in parallel, where each AP only requires local channel state information and limited message exchange among the APs. Such framework reduces significantly the computational complexity and thus has pronounced benefits in practical scenarios. Simulation results verify the effectiveness of the proposed algorithm compared with the conventional centralized algorithm and show the remarkable performance improvement of radar sensing and communication by deploying Co-ISACNet. △ Less

Submitted 18 June, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

arXiv:2405.07297 [pdf, other]

Beyond Diagonal Reconfigurable Intelligent Surfaces in Wideband OFDM Communications: Circuit-Based Modeling and Optimization

Authors: Hongyu Li, Matteo Nerini, Shanpu Shen, Bruno Clerckx

Abstract: This work investigates the modeling and optimization of beyond diagonal reconfigurable intelligent surface (BD-RIS), which generalizes conventional RIS with diagonal phase shift matrices and provides additional flexibility for manipulating wireless channels, in wideband communication systems. Specifically, we start from the signal modeling of the BD-RIS-aided orthogonal frequency division multiple… ▽ More This work investigates the modeling and optimization of beyond diagonal reconfigurable intelligent surface (BD-RIS), which generalizes conventional RIS with diagonal phase shift matrices and provides additional flexibility for manipulating wireless channels, in wideband communication systems. Specifically, we start from the signal modeling of the BD-RIS-aided orthogonal frequency division multiplexing (OFDM) system, which bridges the time-domain and frequency-domain channels, and explicitly shows the frequency dependence of the BD-RIS response. We next characterize the frequency dependence of the BD-RIS response based on circuit models. Benefiting from the admittance parameter analysis, we model individually each tunable admittance component of BD-RIS and derive an approximated linear expression with respect to the frequency of the transmit signals. With the proposed signal model for the BD-RIS-aided OFDM system and the frequency-dependent BD-RIS model, we propose algorithms to optimize the BD-RIS and the power allocation at the transmitter to maximize the average rate for a BD-RIS-aided OFDM system. Finally, simulation results show that BD-RIS outperforms conventional RIS in the OFDM system. More importantly, the impact of wideband modeling of BD-RIS on the system performance becomes more significant as the circuit complexity of BD-RIS architectures increases. △ Less

Submitted 19 January, 2025; v1 submitted 12 May, 2024; originally announced May 2024.

Comments: 14 pages, 6 figures, accepted by IEEE journal. arXiv admin note: text overlap with arXiv:2403.12893

arXiv:2404.14693 [pdf, other]

DIP-Watermark: A Double Identity Protection Method Based on Robust Adversarial Watermark

Authors: Yunming Zhang, Dengpan Ye, Caiyun Xie, Sipeng Shen, Ziyi Liu, Jiacheng Deng, Long Tang

Abstract: The wide deployment of Face Recognition (FR) systems poses privacy risks. One countermeasure is adversarial attack, deceiving unauthorized malicious FR, but it also disrupts regular identity verification of trusted authorizers, exacerbating the potential threat of identity impersonation. To address this, we propose the first double identity protection scheme based on traceable adversarial watermar… ▽ More The wide deployment of Face Recognition (FR) systems poses privacy risks. One countermeasure is adversarial attack, deceiving unauthorized malicious FR, but it also disrupts regular identity verification of trusted authorizers, exacerbating the potential threat of identity impersonation. To address this, we propose the first double identity protection scheme based on traceable adversarial watermarking, termed DIP-Watermark. DIP-Watermark employs a one-time watermark embedding to deceive unauthorized FR models and allows authorizers to perform identity verification by extracting the watermark. Specifically, we propose an information-guided adversarial attack against FR models. The encoder embeds an identity-specific watermark into the deep feature space of the carrier, guiding recognizable features of the image to deviate from the source identity. We further adopt a collaborative meta-optimization strategy compatible with sub-tasks, which regularizes the joint optimization direction of the encoder and decoder. This strategy enhances the representation of universal carrier features, mitigating multi-objective optimization conflicts in watermarking. Experiments confirm that DIP-Watermark achieves significant attack success rates and traceability accuracy on state-of-the-art FR models, exhibiting remarkable robustness that outperforms the existing privacy protection methods using adversarial attacks and deep watermarking, or simple combinations of the two. Our work potentially opens up new insights into proactive protection for FR privacy. △ Less

Submitted 23 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.12584 [pdf, other]

Multi-Objective Offloading Optimization in MEC and Vehicular-Fog Systems: A Distributed-TD3 Approach

Authors: Frezer Guteta Wakgra, Binayak Kar, Seifu Birhanu Tadele, Shan-Hsiang Shen, Asif Uddin Khan

Abstract: The emergence of 5G networks has enabled the deployment of a two-tier edge and vehicular-fog network. It comprises Multi-access Edge Computing (MEC) and Vehicular-Fogs (VFs), strategically positioned closer to Internet of Things (IoT) devices, reducing propagation latency compared to cloud-based solutions and ensuring satisfactory quality of service (QoS). However, during high-traffic events like… ▽ More The emergence of 5G networks has enabled the deployment of a two-tier edge and vehicular-fog network. It comprises Multi-access Edge Computing (MEC) and Vehicular-Fogs (VFs), strategically positioned closer to Internet of Things (IoT) devices, reducing propagation latency compared to cloud-based solutions and ensuring satisfactory quality of service (QoS). However, during high-traffic events like concerts or athletic contests, MEC sites may face congestion and become overloaded. Utilizing offloading techniques, we can transfer computationally intensive tasks from resource-constrained devices to those with sufficient capacity, for accelerating tasks and extending device battery life. In this research, we consider offloading within a two-tier MEC and VF architecture, involving offloading from MEC to MEC and from MEC to VF. The primary objective is to minimize the average system cost, considering both latency and energy consumption. To achieve this goal, we formulate a multi-objective optimization problem aimed at minimizing latency and energy while considering given resource constraints. To facilitate decision-making for nearly optimal computational offloading, we design an equivalent reinforcement learning environment that accurately represents the network architecture and the formulated problem. To accomplish this, we propose a Distributed-TD3 (DTD3) approach, which builds on the TD3 algorithm. Extensive simulations, demonstrate that our strategy achieves faster convergence and higher efficiency compared to other benchmark solutions. △ Less

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2403.19224 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446974

Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition

Authors: Siyuan Shen, Yu Gao, Feng Liu, Hanyang Wang, Aimin Zhou

Abstract: The mainstream paradigm of speech emotion recognition (SER) is identifying the single emotion label of the entire utterance. This line of works neglect the emotion dynamics at fine temporal granularity and mostly fail to leverage linguistic information of speech signal explicitly. In this paper, we propose Emotion Neural Transducer for fine-grained speech emotion recognition with automatic speech… ▽ More The mainstream paradigm of speech emotion recognition (SER) is identifying the single emotion label of the entire utterance. This line of works neglect the emotion dynamics at fine temporal granularity and mostly fail to leverage linguistic information of speech signal explicitly. In this paper, we propose Emotion Neural Transducer for fine-grained speech emotion recognition with automatic speech recognition (ASR) joint training. We first extend typical neural transducer with emotion joint network to construct emotion lattice for fine-grained SER. Then we propose lattice max pooling on the alignment lattice to facilitate distinguishing emotional and non-emotional frames. To adapt fine-grained SER to transducer inference manner, we further make blank, the special symbol of ASR, serve as underlying emotion indicator as well, yielding Factorized Emotion Neural Transducer. For typical utterance-level SER, our ENT models outperform state-of-the-art methods on IEMOCAP in low word error rate. Experiments on IEMOCAP and the latest speech emotion diarization dataset ZED also demonstrate the superiority of fine-grained emotion modeling. Our code is available at https://github.com/ECNU-Cross-Innovation-Lab/ENT. △ Less

Submitted 28 March, 2024; originally announced March 2024.

Comments: Accepted by 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

arXiv:2403.18087 [pdf, other]

Channel Estimation and Beamforming for Beyond Diagonal Reconfigurable Intelligent Surfaces

Authors: Hongyu Li, Shanpu Shen, Yumeng Zhang, Bruno Clerckx

Abstract: Beyond diagonal reconfigurable intelligent surface (BD-RIS) is a new advance and generalization of the RIS technique. BD-RIS breaks through the isolation between RIS elements by creatively introducing inter-element connections, thereby enabling smarter wave manipulation and enlarging coverage. However, exploring proper channel estimation schemes suitable for BD-RIS aided communication systems stil… ▽ More Beyond diagonal reconfigurable intelligent surface (BD-RIS) is a new advance and generalization of the RIS technique. BD-RIS breaks through the isolation between RIS elements by creatively introducing inter-element connections, thereby enabling smarter wave manipulation and enlarging coverage. However, exploring proper channel estimation schemes suitable for BD-RIS aided communication systems still remains an open problem. In this paper, we study channel estimation and beamforming design for BD-RIS aided multi-antenna systems. We first describe the channel estimation strategy based on the least square (LS) method, derive the mean square error (MSE) of the LS estimation, and formulate the joint pilot sequence and BD-RIS design problem with unique constraints induced by BD-RIS architectures. Specifically, we propose an efficient pilot sequence and BD-RIS design which theoretically guarantees to achieve the minimum MSE. With the estimated channel, we then consider two BD-RIS scenarios and propose beamforming design algorithms. Finally, we provide simulation results to verify the effectiveness of the proposed channel estimation scheme and beamforming design algorithms. We also show that more interelement connections in BD-RIS improves the performance while increasing the training overhead for channel estimation. △ Less

Submitted 10 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

Comments: 14 pages, 10 figures, submitted to IEEE journal

arXiv:2403.12893 [pdf, other]

Wideband Modeling and Beamforming for Beyond Diagonal Reconfigurable Intelligent Surfaces

Authors: Hongyu Li, Matteo Nerini, Shanpu Shen, Bruno Clerckx

Abstract: This work studies the wideband modeling and beamforming design of beyond diagonal reconfigurable intelligent surface (BD-RIS), which generalizes and goes beyond conventional RIS with diagonal phase shift matrices to achieve enhanced channel gain. Specifically, we investigate the response of BD-RIS in wideband systems by going back to its hardware circuit realizations. We propose a novel wideband m… ▽ More This work studies the wideband modeling and beamforming design of beyond diagonal reconfigurable intelligent surface (BD-RIS), which generalizes and goes beyond conventional RIS with diagonal phase shift matrices to achieve enhanced channel gain. Specifically, we investigate the response of BD-RIS in wideband systems by going back to its hardware circuit realizations. We propose a novel wideband model which has simple expressions while capturing the response variations of BD-RIS for signals with different frequencies. With this wideband model, we propose a BD-RIS design algorithm for an orthogonal frequency division multiplexing system to maximize the average rate over all subcarriers. Finally, we provide simulation results to evaluate the performance of the proposed design and show the importance of wideband modeling for BD-RIS. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: Submitted to IEEE for publication

arXiv:2403.09275 [pdf, ps, other]

Static Grouping Strategy Design for Beyond Diagonal Reconfigurable Intelligent Surfaces

Authors: Matteo Nerini, Shanpu Shen, Bruno Clerckx

Abstract: Beyond diagonal reconfigurable intelligent surface (BD-RIS) extends conventional RIS through novel architectures, such as group-connected RIS, with scattering matrix not restricted to being diagonal. However, it remains unexplored how to optimally group the elements in group-connected RISs to maximize the performance while maintaining a low-complexity circuit. In this study, we propose and model B… ▽ More Beyond diagonal reconfigurable intelligent surface (BD-RIS) extends conventional RIS through novel architectures, such as group-connected RIS, with scattering matrix not restricted to being diagonal. However, it remains unexplored how to optimally group the elements in group-connected RISs to maximize the performance while maintaining a low-complexity circuit. In this study, we propose and model BD-RIS with a static grouping strategy optimized based on the channel statistics. After formulating the corresponding problems, we design the grouping in single- and multi-user systems. Numerical results reveal the benefits of grouping optimization, i.e., up to 60% sum rate improvement, especially in highly correlated channels. △ Less

Submitted 7 May, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

Comments: Accepted by IEEE for publication

arXiv:2403.04028 [pdf, other]

doi 10.1109/TWC.2025.3536178

RISnet: A Domain-Knowledge Driven Neural Network Architecture for RIS Optimization with Mutual Coupling and Partial CSI

Authors: Bile Peng, Karl-Ludwig Besser, Shanpu Shen, Finn Siegismund-Poschmann, Ramprasad Raghunath, Daniel Mittleman, Vahid Jamali, Eduard A. Jorswieck

Abstract: Space-division multiple access (SDMA) plays an important role in modern wireless communications. Its performance depends on the channel properties, which can be improved by reconfigurable intelligent surfaces (RISs). In this work, we jointly optimize SDMA precoding at the base station (BS) and RIS configuration. We tackle difficulties of mutual coupling between RIS elements, scalability to more th… ▽ More Space-division multiple access (SDMA) plays an important role in modern wireless communications. Its performance depends on the channel properties, which can be improved by reconfigurable intelligent surfaces (RISs). In this work, we jointly optimize SDMA precoding at the base station (BS) and RIS configuration. We tackle difficulties of mutual coupling between RIS elements, scalability to more than 1000 RIS elements, and high requirement for channel estimation. We first derive an RIS-assisted channel model considering mutual coupling, then propose an unsupervised machine learning (ML) approach to optimize the RIS with a dedicated neural network (NN) architecture RISnet, which has good scalability, desired permutation-invariance, and a low requirement for channel estimation. Moreover, we leverage existing high-performance analytical precoding scheme to propose a hybrid solution of ML-enabled RIS configuration and analytical precoding at BS. More generally, this work is an early contribution to combine ML technique and domain knowledge in communication for NN architecture design. Compared to generic ML, the problem-specific ML can achieve higher performance, lower complexity and permutation-invariance. △ Less

Submitted 7 February, 2025; v1 submitted 6 March, 2024; originally announced March 2024.

Comments: 13 pages, 11 figures

Journal ref: IEEE Transactions on Wireless Communications, vol. 24, no. 5, pp. 4469-4482, May 2025

arXiv:2401.09032 [pdf, other]

Improved Consensus ADMM for Cooperative Motion Planning of Large-Scale Connected Autonomous Vehicles with Limited Communication

Authors: Haichao Liu, Zhenmin Huang, Zicheng Zhu, Yulin Li, Shaojie Shen, Jun Ma

Abstract: This paper investigates a cooperative motion planning problem for large-scale connected autonomous vehicles (CAVs) under limited communications, which addresses the challenges of high communication and computing resource requirements. Our proposed methodology incorporates a parallel optimization algorithm with improved consensus ADMM considering a more realistic locally connected topology network,… ▽ More This paper investigates a cooperative motion planning problem for large-scale connected autonomous vehicles (CAVs) under limited communications, which addresses the challenges of high communication and computing resource requirements. Our proposed methodology incorporates a parallel optimization algorithm with improved consensus ADMM considering a more realistic locally connected topology network, and time complexity of O(N) is achieved by exploiting the sparsity in the dual update process. To further enhance the computational efficiency, we employ a lightweight evolution strategy for the dynamic connectivity graph of CAVs, and each sub-problem split from the consensus ADMM only requires managing a small group of CAVs. The proposed method implemented with the receding horizon scheme is validated thoroughly, and comparisons with existing numerical solvers and approaches demonstrate the efficiency of our proposed algorithm. Also, simulations on large-scale cooperative driving tasks involving 80 vehicles are performed in the high-fidelity CARLA simulator, which highlights the remarkable computational efficiency, scalability, and effectiveness of our proposed development. Demonstration videos are available at https://henryhcliu.github.io/icadmm_cmp_carla. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: 15 pages, 10 figures

arXiv:2401.04968 [pdf, other]

A Universal Cooperative Decision-Making Framework for Connected Autonomous Vehicles with Generic Road Topologies

Authors: Zhenmin Huang, Shaojie Shen, Jun Ma

Abstract: Cooperative decision-making of Connected Autonomous Vehicles (CAVs) presents a longstanding challenge due to its inherent nonlinearity, non-convexity, and discrete characteristics, compounded by the diverse road topologies encountered in real-world traffic scenarios. The majority of current methodologies are only applicable to a single and specific scenario, predicated on scenario-specific assumpt… ▽ More Cooperative decision-making of Connected Autonomous Vehicles (CAVs) presents a longstanding challenge due to its inherent nonlinearity, non-convexity, and discrete characteristics, compounded by the diverse road topologies encountered in real-world traffic scenarios. The majority of current methodologies are only applicable to a single and specific scenario, predicated on scenario-specific assumptions. Consequently, their application in real-world environments is restricted by the innumerable nature of traffic scenarios. In this study, we propose a unified optimization approach that exhibits the potential to address cooperative decision-making problems related to traffic scenarios with generic road topologies. This development is grounded in the premise that the topologies of various traffic scenarios can be universally represented as Directed Acyclic Graphs (DAGs). Particularly, the reference paths and time profiles for all involved CAVs are determined in a fully cooperative manner, taking into account factors such as velocities, accelerations, conflict resolutions, and overall traffic efficiency. The cooperative decision-making of CAVs is approximated as a mixed-integer linear programming (MILP) problem building on the DAGs of road topologies. This favorably facilitates the use of standard numerical solvers and the global optimality can be attained through the optimization. Case studies corresponding to different multi-lane traffic scenarios featuring diverse topologies are scheduled as the test itineraries, and the efficacy of our proposed methodology is corroborated. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2312.03374 [pdf, other]

doi 10.1109/MNET.2023.3332893

Implementing Digital Twin in Field-Deployed Optical Networks: Uncertain Factors, Operational Guidance, and Field-Trial Demonstration

Authors: Yuchen Song, Min Zhang, Yao Zhang, Yan Shi, Shikui Shen, Bingli Guo, Shanguo Huang, Danshi Wang

Abstract: Digital twin has revolutionized optical communication networks by enabling their full life-cycle management, including design, troubleshooting, optimization, upgrade, and prediction. While extensive literature exists on frameworks, standards, and applications of digital twin, there is a pressing need in implementing digital twin in field-deployed optical networks operating in real-world environmen… ▽ More Digital twin has revolutionized optical communication networks by enabling their full life-cycle management, including design, troubleshooting, optimization, upgrade, and prediction. While extensive literature exists on frameworks, standards, and applications of digital twin, there is a pressing need in implementing digital twin in field-deployed optical networks operating in real-world environments, as opposed to controlled laboratory settings. This paper addresses this challenge by examining the uncertain factors behind the inaccuracy of digital twin in field-deployed optical networks from three main challenges and proposing operational guidance for implementing accurate digital twin in field-deployed optical networks. Through the proposed guidance, we demonstrate the effective implementation of digital twin in a field-trial C+L-band optical transmission link, showcasing its capabilities in performance recovery in a fiber cut scenario. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 10 pages, 5 figures Accepted by IEEE Network Magazine, early access

arXiv:2311.10561 [pdf, ps, other]

A Universal Framework for Multiport Network Analysis of Reconfigurable Intelligent Surfaces

Authors: Matteo Nerini, Shanpu Shen, Hongyu Li, Marco Di Renzo, Bruno Clerckx

Abstract: Reconfigurable intelligent surface (RIS) is an emerging paradigm able to control the propagation environment in wireless systems. Most of the research on RIS has been dedicated to system optimization and, with the advent of beyond diagonal RIS (BD-RIS), to RIS architecture design. However, developing general and unified electromagnetic (EM)-consistent models for RIS-aided systems remains an open p… ▽ More Reconfigurable intelligent surface (RIS) is an emerging paradigm able to control the propagation environment in wireless systems. Most of the research on RIS has been dedicated to system optimization and, with the advent of beyond diagonal RIS (BD-RIS), to RIS architecture design. However, developing general and unified electromagnetic (EM)-consistent models for RIS-aided systems remains an open problem. In this study, we propose a universal framework for the multiport network analysis of RIS-aided systems. With our framework, we model RIS-aided systems and RIS architectures through impedance, admittance, and scattering parameter analysis. Based on these analyses, three equivalent models are derived accounting for the effects of impedance mismatching and mutual coupling. The three models are then simplified by assuming large transmission distances, perfect matching, and no mutual coupling to understand the role of the RIS in the communication model. The derived simplified models are consistent with the typical model used in related literature, although we show that an additional approximation is commonly considered in the literature. We discuss the benefits of each analysis in characterizing and optimizing the RIS and how to select the most suitable parameters according to the needs. Numerical results provide additional evidence of the equivalence of the three analyses. △ Less

Submitted 17 June, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

Comments: Accepted by IEEE for publication

arXiv:2310.05290 [pdf, other]

MSight: An Edge-Cloud Infrastructure-based Perception System for Connected Automated Vehicles

Authors: Rusheng Zhang, Depu Meng, Shengyin Shen, Zhengxia Zou, Houqiang Li, Henry X. Liu

Abstract: As vehicular communication and networking technologies continue to advance, infrastructure-based roadside perception emerges as a pivotal tool for connected automated vehicle (CAV) applications. Due to their elevated positioning, roadside sensors, including cameras and lidars, often enjoy unobstructed views with diminished object occlusion. This provides them a distinct advantage over onboard perc… ▽ More As vehicular communication and networking technologies continue to advance, infrastructure-based roadside perception emerges as a pivotal tool for connected automated vehicle (CAV) applications. Due to their elevated positioning, roadside sensors, including cameras and lidars, often enjoy unobstructed views with diminished object occlusion. This provides them a distinct advantage over onboard perception, enabling more robust and accurate detection of road objects. This paper presents MSight, a cutting-edge roadside perception system specifically designed for CAVs. MSight offers real-time vehicle detection, localization, tracking, and short-term trajectory prediction. Evaluations underscore the system's capability to uphold lane-level accuracy with minimal latency, revealing a range of potential applications to enhance CAV safety and efficiency. Presently, MSight operates 24/7 at a two-lane roundabout in the City of Ann Arbor, Michigan. △ Less

Submitted 8 October, 2023; originally announced October 2023.

Comments: Submitted to IEEE T-ITS

arXiv:2310.02708 [pdf, other]

Beyond Diagonal Reconfigurable Intelligent Surfaces with Mutual Coupling: Modeling and Optimization

Authors: Hongyu Li, Shanpu Shen, Matteo Nerini, Marco Di Renzo, Bruno Clerckx

Abstract: This work studies the modeling and optimization of beyond diagonal reconfigurable intelligent surface (BD-RIS) aided wireless communication systems in the presence of mutual coupling among the RIS elements. Specifically, we first derive the mutual coupling aware BD-RIS aided communication model using scattering and impedance parameter analysis. Based on the obtained communication model, we propose… ▽ More This work studies the modeling and optimization of beyond diagonal reconfigurable intelligent surface (BD-RIS) aided wireless communication systems in the presence of mutual coupling among the RIS elements. Specifically, we first derive the mutual coupling aware BD-RIS aided communication model using scattering and impedance parameter analysis. Based on the obtained communication model, we propose a general BD-RIS optimization algorithm applicable to different architectures of BD-RIS to maximize the channel gain. Numerical results validate the effectiveness of the proposed design and demonstrate that the larger the mutual coupling the larger the gain offered by BD-RIS over conventional diagonal RIS. △ Less

Submitted 24 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: 5 pages, 3 figures, accepted by IEEE Commun. Lett

arXiv:2309.03628 [pdf, other]

OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs

Authors: Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz, Salvatore Di Girolamo, Timo Schneider, Daniele De Sensi, Luca Benini, Torsten Hoefler

Abstract: Multi-tenancy is essential for unleashing SmartNIC's potential in datacenters. Our systematic analysis in this work shows that existing on-path SmartNICs have resource multiplexing limitations. For example, existing solutions lack multi-tenancy capabilities such as performance isolation and QoS provisioning for compute and IO resources. Compared to standard NIC data paths with a well-defined set o… ▽ More Multi-tenancy is essential for unleashing SmartNIC's potential in datacenters. Our systematic analysis in this work shows that existing on-path SmartNICs have resource multiplexing limitations. For example, existing solutions lack multi-tenancy capabilities such as performance isolation and QoS provisioning for compute and IO resources. Compared to standard NIC data paths with a well-defined set of offloaded functions, unpredictable execution times of SmartNIC kernels make conventional approaches for multi-tenancy and QoS insufficient. We fill this gap with OSMOSIS, a SmartNICs resource manager co-design. OSMOSIS extends existing OS mechanisms to enable dynamic hardware resource multiplexing of the on-path packet processing data plane. We integrate OSMOSIS within an open-source RISC-V-based 400Gbit/s SmartNIC. Our performance results demonstrate that OSMOSIS fully supports multi-tenancy and enables broader adoption of SmartNICs in datacenters with low overhead. △ Less

Submitted 13 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 12 pages, 14 figures, 103 references

arXiv:2308.02687 [pdf, other]

A Multi-objective Mixed-integer Programming Approach for Supply Chain Disruption Response with Lead-Time Awareness

Authors: Juan-Alberto Estrada-Garcia, Mingjie Bi, Dawn M. Tilbury, Kira Barton, Siqian Shen

Abstract: Supply chain (SC) risk management is influenced by both spatial and temporal attributes of different entities (suppliers, retailers, and customers). Each entity has given capacity and lead time for processing and transporting products to downstream entities. Under disruptive events, lead time and capacities may vary, which affects the overall SC performance. There have been many studies on SC disr… ▽ More Supply chain (SC) risk management is influenced by both spatial and temporal attributes of different entities (suppliers, retailers, and customers). Each entity has given capacity and lead time for processing and transporting products to downstream entities. Under disruptive events, lead time and capacities may vary, which affects the overall SC performance. There have been many studies on SC disruption mitigation, but often without considering lead time and the magnitude of lateness. In this paper, we formulate a mixed-integer programming (MIP) model to optimize SC operations via a routing and scheduling approach, to model the delivery time of products at different entities as they flow throughout the SC network. We minimize a weighted sum of multiple objectives involving costs related to transportation, shortage, and delivery lateness. We also develop a discrete-event simulation framework to evaluate the performance of solutions to the MIP model under lead time uncertainty. Via extensive numerical studies, we show how the attributes of SC entities affect the performance, so that we can improve SC design and operations under various uncertainties. △ Less

Submitted 4 August, 2023; originally announced August 2023.

arXiv:2307.14588 [pdf]

MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation

Authors: Liang Xu, Mingxiao Chen, Yi Cheng, Pengfei Shao, Shuwei Shen, Peng Yao, Ronald X. Xu

Abstract: The UNet architecture, based on Convolutional Neural Networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome t… ▽ More The UNet architecture, based on Convolutional Neural Networks (CNN), has demonstrated its remarkable performance in medical image analysis. However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by effectively capturing global feature correlations. However, the integration of the Transformer modules may result in the loss of local contextual information during the global feature fusion process. To overcome these challenges, we propose a 2D medical image segmentation model called Multi-scale Cross Perceptron Attention Network (MCPA). The MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron. The Cross Perceptron first captures the local correlations using multiple Multi-scale Cross Perceptron modules, facilitating the fusion of features across scales. The resulting multi-scale feature vectors are then spatially unfolded, concatenated, and fed through a Global Perceptron module to model global dependencies. Furthermore, we introduce a Progressive Dual-branch Structure to address the semantic segmentation of the image involving finer tissue structures. This structure gradually shifts the segmentation focus of MCPA network training from large-scale structural features to more sophisticated pixel-level features. We evaluate our proposed MCPA model on several publicly available medical image datasets from different tasks and devices, including the open large-scale dataset of CT (Synapse), MRI (ACDC), fundus camera (DRIVE, CHASE_DB1, HRF), and OCTA (ROSE). The experimental results show that our MCPA model achieves state-of-the-art performance. The code is available at https://github.com/simonustc/MCPA-for-2D-Medical-Image-Segmentation. △ Less

Submitted 26 July, 2023; originally announced July 2023.

arXiv:2306.17302 [pdf, other]

Robust Roadside Perception: an Automated Data Synthesis Pipeline Minimizing Human Annotation

Authors: Rusheng Zhang, Depu Meng, Lance Bassett, Shengyin Shen, Zhengxia Zou, Henry X. Liu

Abstract: Recently, advancements in vehicle-to-infrastructure communication technologies have elevated the significance of infrastructure-based roadside perception systems for cooperative driving. This paper delves into one of its most pivotal challenges: data insufficiency. The lacking of high-quality labeled roadside sensor data with high diversity leads to low robustness, and low transfer-ability of curr… ▽ More Recently, advancements in vehicle-to-infrastructure communication technologies have elevated the significance of infrastructure-based roadside perception systems for cooperative driving. This paper delves into one of its most pivotal challenges: data insufficiency. The lacking of high-quality labeled roadside sensor data with high diversity leads to low robustness, and low transfer-ability of current roadside perception systems. In this paper, a novel solution is proposed to address this problem that creates synthesized training data using Augmented Reality. A Generative Adversarial Network is then applied to enhance the reality further, that produces a photo-realistic synthesized dataset that is capable of training or fine-tuning a roadside perception detector which is robust to different weather and lighting conditions. Our approach was rigorously tested at two key intersections in Michigan, USA: the Mcity intersection and the State St./Ellsworth Rd roundabout. The Mcity intersection is located within the Mcity test field, a controlled testing environment. In contrast, the State St./Ellsworth Rd intersection is a bustling roundabout notorious for its high traffic flow and a significant number of accidents annually. Experimental results demonstrate that detectors trained solely on synthesized data exhibit commendable performance across all conditions. Furthermore, when integrated with labeled data, the synthesized data can notably bolster the performance of pre-existing detectors, especially in adverse conditions. △ Less

Submitted 8 February, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: Accepted by IEEE Transactions on Intelligent Vehicles

arXiv:2306.05946 [pdf, other]

Digital Twin-Assisted Resource Demand Prediction for Multicast Short Video Streaming

Authors: Xinyu Huang, Wen Wu, Xuemin Sherman Shen

Abstract: In this paper, we propose a digital twin (DT)-assisted resource demand prediction scheme to enhance prediction accuracy for multicast short video streaming. Particularly, we construct user DTs (UDTs) for collecting real-time user status, including channel condition, location, watching duration, and preference. A reinforcement learning-empowered K-means++ algorithm is developed to cluster users bas… ▽ More In this paper, we propose a digital twin (DT)-assisted resource demand prediction scheme to enhance prediction accuracy for multicast short video streaming. Particularly, we construct user DTs (UDTs) for collecting real-time user status, including channel condition, location, watching duration, and preference. A reinforcement learning-empowered K-means++ algorithm is developed to cluster users based on the collected user status in UDTs, which can effectively employ the mined users' intrinsic correlation to improve the accuracy of user clustering. We then analyze users' video watching duration and preferences in each multicast group to obtain the swiping probability distribution and recommended videos, respectively. The obtained information is utilized to predict radio and computing resource demand of each multicast group. Initial results demonstrate that the proposed scheme can effectively abstract multicast groups' swiping probability distributions for accurate resource demand prediction. △ Less

Submitted 9 June, 2023; originally announced June 2023.

Comments: 2 pages, 3 figures

arXiv:2305.05013 [pdf, ps, other]

Beyond Diagonal Reconfigurable Intelligent Surfaces Utilizing Graph Theory: Modeling, Architecture Design, and Optimization

Authors: Matteo Nerini, Shanpu Shen, Hongyu Li, Bruno Clerckx

Abstract: Recently, beyond diagonal reconfigurable intelligent surface (BD-RIS) has been proposed to generalize conventional RIS. BD-RIS has a scattering matrix that is not restricted to being diagonal and thus brings a performance improvement over conventional RIS. While different BD-RIS architectures have been proposed, it still remains an open problem to develop a systematic approach to design BD-RIS arc… ▽ More Recently, beyond diagonal reconfigurable intelligent surface (BD-RIS) has been proposed to generalize conventional RIS. BD-RIS has a scattering matrix that is not restricted to being diagonal and thus brings a performance improvement over conventional RIS. While different BD-RIS architectures have been proposed, it still remains an open problem to develop a systematic approach to design BD-RIS architectures achieving the optimal trade-off between performance and circuit complexity. In this work, we propose novel modeling, architecture design, and optimization for BD-RIS based on graph theory. This graph theoretical modeling allows us to develop two new efficient BD-RIS architectures, denoted as tree-connected and forest-connected RIS. Tree-connected RIS, whose corresponding graph is a tree, is proven to be the least complex BD-RIS architecture able to achieve the performance upper bound in multiple-input single-output (MISO) systems. Besides, forest-connected RIS allows us to strike a balance between performance and complexity, further decreasing the complexity over tree-connected RIS. To optimize tree-connected RIS, we derive a closed-form global optimal solution, while forest-connected RIS is optimized through a low-complexity iterative algorithm. Numerical results confirm that tree-connected (resp. forest-connected) RIS achieves the same performance as fully-connected (resp. group-connected) RIS, while reducing the complexity by up to 16.4 times. △ Less

Submitted 20 February, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

Comments: Accepted by IEEE for publication

arXiv:2304.10780 [pdf, other]

Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction

Authors: Binbin Huang, Xingyue Peng, Siyuan Shen, Suan Xia, Ruiqian Li, Yanhua Yu, Yuehan Wang, Shenghua Gao, Wenzheng Chen, Shiying Li, Jingyi Yu

Abstract: We introduce Omni-LOS, a neural computational imaging method for conducting holistic shape reconstruction (HSR) of complex objects utilizing a Single-Photon Avalanche Diode (SPAD)-based time-of-flight sensor. As illustrated in Fig. 1, our method enables new capabilities to reconstruct near-$360^\circ$ surrounding geometry of an object from a single scan spot. In such a scenario, traditional line-o… ▽ More We introduce Omni-LOS, a neural computational imaging method for conducting holistic shape reconstruction (HSR) of complex objects utilizing a Single-Photon Avalanche Diode (SPAD)-based time-of-flight sensor. As illustrated in Fig. 1, our method enables new capabilities to reconstruct near-$360^\circ$ surrounding geometry of an object from a single scan spot. In such a scenario, traditional line-of-sight (LOS) imaging methods only see the front part of the object and typically fail to recover the occluded back regions. Inspired by recent advances of non-line-of-sight (NLOS) imaging techniques which have demonstrated great power to reconstruct occluded objects, Omni-LOS marries LOS and NLOS together, leveraging their complementary advantages to jointly recover the holistic shape of the object from a single scan position. The core of our method is to put the object nearby diffuse walls and augment the LOS scan in the front view with the NLOS scans from the surrounding walls, which serve as virtual ``mirrors'' to trap lights toward the object. Instead of separately recovering the LOS and NLOS signals, we adopt an implicit neural network to represent the object, analogous to NeRF and NeTF. While transients are measured along straight rays in LOS but over the spherical wavefronts in NLOS, we derive differentiable ray propagation models to simultaneously model both types of transient measurements so that the NLOS reconstruction also takes into account the direct LOS measurements and vice versa. We further develop a proof-of-concept Omni-LOS hardware prototype for real-world validation. Comprehensive experiments on various wall settings demonstrate that Omni-LOS successfully resolves shape ambiguities caused by occlusions, achieves high-fidelity 3D scan quality, and manages to recover objects of various scales and complexity. △ Less

Submitted 21 April, 2023; originally announced April 2023.

arXiv:2303.06912 [pdf, other]

Synergizing Beyond Diagonal Reconfigurable Intelligent Surface and Rate-Splitting Multiple Access

Authors: Hongyu Li, Shanpu Shen, Bruno Clerckx

Abstract: This work focuses on the synergy of rate-splitting multiple access (RSMA) and beyond diagonal reconfigurable intelligent surface (BD-RIS) to enlarge the coverage, improve the performance, and save on antennas. Specifically, we employ a multi-sector BD-RIS modeled as a prism, which can achieve highly directional full-space coverage, in a multiuser multiple input single output communication system.… ▽ More This work focuses on the synergy of rate-splitting multiple access (RSMA) and beyond diagonal reconfigurable intelligent surface (BD-RIS) to enlarge the coverage, improve the performance, and save on antennas. Specifically, we employ a multi-sector BD-RIS modeled as a prism, which can achieve highly directional full-space coverage, in a multiuser multiple input single output communication system. With the multi-sector BD-RIS aided RSMA model, we jointly design the transmit precoder and BD-RIS matrix under the imperfect channel state information (CSI) conditions. The robust design is performed by solving a stochastic average sum-rate maximization problem. With sample average approximation and weighted minimum mean square error-rate relationship, the stochastic problem is transformed into a deterministic one with multiple blocks, each of which is iteratively designed. Simulation results show that multi-sector BD-RIS aided RSMA outperforms space division multiple access schemes. More importantly, synergizing multi-sector BD-RIS with RSMA is an efficient strategy to reduce the number of active antennas at the transmitter and the number of passive antennas in BD-RIS. △ Less

Submitted 4 January, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: 13 pages, 11 figures, accepted by IEEE Transactions on Wireless Communications

arXiv:2303.03090 [pdf, other]

Parallel Optimization with Hard Safety Constraints for Cooperative Planning of Connected Autonomous Vehicles

Authors: Zhenmin Huang, Haichao Liu, Shaojie Shen, Jun Ma

Abstract: The development of connected autonomous vehicles (CAVs) facilitates the enhancement of traffic efficiency in complicated scenarios. In unsignalized roundabout scenarios, difficulties remain unsolved in developing an effective and efficient coordination strategy for CAVs. In this paper, we formulate the cooperative autonomous driving problem of CAVs in the roundabout scenario as a constrained optim… ▽ More The development of connected autonomous vehicles (CAVs) facilitates the enhancement of traffic efficiency in complicated scenarios. In unsignalized roundabout scenarios, difficulties remain unsolved in developing an effective and efficient coordination strategy for CAVs. In this paper, we formulate the cooperative autonomous driving problem of CAVs in the roundabout scenario as a constrained optimal control problem, and propose a computationally-efficient parallel optimization framework to generate strategies for CAVs such that the travel efficiency is improved with hard safety guarantees. All constraints involved in the roundabout scenario are addressed appropriately with convex approximation, such that the convexity property of the reformulated optimization problem is exhibited. Then, a parallel optimization algorithm is presented to solve the reformulated optimization problem, where an embodied iterative nearest neighbor search strategy to determine the optimal passing sequence in the roundabout scenario. It is noteworthy that the travel efficiency in the roundabout scenario is enhanced and the computation burden is considerably alleviated with the innovation development. We also examine the proposed method in CARLA simulator and perform thorough comparisons with a rule-based baseline and the commonly used IPOPT optimization solver to demonstrate the effectiveness and efficiency of the proposed approach. △ Less

Submitted 4 May, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

arXiv:2303.00563 [pdf, other]

ROCO: A Roundabout Traffic Conflict Dataset

Authors: Depu Meng, Owen Sayer, Rusheng Zhang, Shengyin Shen, Houqiang Li, Henry X. Liu

Abstract: Traffic conflicts have been studied by the transportation research community as a surrogate safety measure for decades. However, due to the rarity of traffic conflicts, collecting large-scale real-world traffic conflict data becomes extremely challenging. In this paper, we introduce and analyze ROCO - a real-world roundabout traffic conflict dataset. The data is collected at a two-lane roundabout… ▽ More Traffic conflicts have been studied by the transportation research community as a surrogate safety measure for decades. However, due to the rarity of traffic conflicts, collecting large-scale real-world traffic conflict data becomes extremely challenging. In this paper, we introduce and analyze ROCO - a real-world roundabout traffic conflict dataset. The data is collected at a two-lane roundabout at the intersection of State St. and W. Ellsworth Rd. in Ann Arbor, Michigan. We use raw video dataflow captured from four fisheye cameras installed at the roundabout as our input data source. We adopt a learning-based conflict identification algorithm from video to find potential traffic conflicts, and then manually label them for dataset collection and annotation. In total 557 traffic conflicts and 17 traffic crashes are collected from August 2021 to October 2021. We provide trajectory data of the traffic conflict scenes extracted using our roadside perception system. Taxonomy based on traffic conflict severity, reason for the traffic conflict, and its effect on the traffic flow is provided. With the traffic conflict data collected, we discover that failure to yield to circulating vehicles when entering the roundabout is the largest contributing reason for traffic conflicts. ROCO dataset will be made public in the short future. △ Less

Submitted 1 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

Comments: Accepted by TRBAM 2023 presentation

arXiv:2302.13277 [pdf, other]

Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

Authors: Siyuan Shen, Feng Liu, Aimin Zhou

Abstract: Fueled by recent advances of self-supervised models, pre-trained speech representations proved effective for the downstream speech emotion recognition (SER) task. Most prior works mainly focus on exploiting pre-trained representations and just adopt a linear head on top of the pre-trained model, neglecting the design of the downstream network. In this paper, we propose a temporal shift module to m… ▽ More Fueled by recent advances of self-supervised models, pre-trained speech representations proved effective for the downstream speech emotion recognition (SER) task. Most prior works mainly focus on exploiting pre-trained representations and just adopt a linear head on top of the pre-trained model, neglecting the design of the downstream network. In this paper, we propose a temporal shift module to mingle channel-wise information without introducing any parameter or FLOP. With the temporal shift module, three designed baseline building blocks evolve into corresponding shift variants, i.e. ShiftCNN, ShiftLSTM, and Shiftformer. Moreover, to balance the trade-off between mingling and misalignment, we propose two technical strategies, placement of shift and proportion of shift. The family of temporal shift models all outperforms the state-of-the-art methods on the benchmark IEMOCAP dataset under both finetuning and feature extraction settings. Our code is available at https://github.com/ECNU-Cross-Innovation-Lab/ShiftSER. △ Less

Submitted 1 March, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

Comments: Accepted by 48th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

arXiv:2302.00227 [pdf, other]

A Projective Geometric View for 6D Pose Estimation in mmWave MIMO Systems

Authors: Shengqiang Shen, Henk Wymeersch

Abstract: Millimeter-wave (mmWave) systems in the 30--300 GHz bands are among the fundamental enabling technologies of 5G and beyond 5G, providing large bandwidths, not only for high data rate communication, but also for precise positioning services, in support of high accuracy demanding applications such as vehicle positioning. With the possibility to introduce relatively large arrays on user devices with… ▽ More Millimeter-wave (mmWave) systems in the 30--300 GHz bands are among the fundamental enabling technologies of 5G and beyond 5G, providing large bandwidths, not only for high data rate communication, but also for precise positioning services, in support of high accuracy demanding applications such as vehicle positioning. With the possibility to introduce relatively large arrays on user devices with a small footprint, the ability to determine the user orientation becomes unlocked. The estimation of the full user pose (joint 3D position and 3D orientation) is referred to as 6D localization. Conventionally, the problem of 6D localization using antenna arrays has been considered difficult and was solved through a combination of heuristics and optimization. In this paper, we reveal a close connection between the AoA and AoD and the well-studied perspective projection model from computer vision. This connection allows us to solve the 6D localization problem, by adapting state-of-the-art methods from computer vision. More specifically, two problems, namely 6D pose estimation from AoA from multiple single-antenna base stations and 6D SLAM based on single-BS mmWave communication, are first modeled with the perspective projection model, and then solved. Numerical simulations show that the proposed estimators operate close to the theoretical performance bounds. Moreover, the proposed SLAM method is effective even in the absence of the LoS path, or knowledge of the LoS/NLoS condition. △ Less

Submitted 1 February, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

arXiv:2301.03288 [pdf, other]

Reconfigurable Intelligent Surfaces 2.0: Beyond Diagonal Phase Shift Matrices

Authors: Hongyu Li, Shanpu Shen, Matteo Nerini, Bruno Clerckx

Abstract: Reconfigurable intelligent surface (RIS) has been envisioned as a promising technique to enable and enhance future wireless communications due to its potential to engineer the wireless channels in a cost-effective manner. Extensive research attention has been drawn to the use of conventional RIS 1.0 with diagonal phase shift matrices, where each RIS element is connected to its own load to ground b… ▽ More Reconfigurable intelligent surface (RIS) has been envisioned as a promising technique to enable and enhance future wireless communications due to its potential to engineer the wireless channels in a cost-effective manner. Extensive research attention has been drawn to the use of conventional RIS 1.0 with diagonal phase shift matrices, where each RIS element is connected to its own load to ground but not connected to other elements. However, the simple architecture of RIS 1.0 limits its flexibility of manipulating passive beamforming. To fully exploit the benefits of RIS, in this paper, we introduce RIS 2.0 beyond diagonal phase shift matrices, namely beyond diagonal RIS (BD-RIS). We first explain the modeling of BD-RIS based on the scattering parameter network analysis and classify BD-RIS by the mathematical characteristics of the scattering matrix, supported modes, and architectures. Then, we provide simulations to evaluate the sum-rate performance with different modes/architectures of BD-RIS. We summarize the benefits of BD-RIS in providing high flexibility in wave manipulation, enlarging coverage, facilitating the deployment, and requiring low complexity in resolution bit and element numbers. Inspired by the benefits of BD-RIS, we also discuss potential applications of BD-RIS in various wireless systems. Finally, we list key challenges in modeling, designing, and implementing BD-RIS in practice and point to possible future research directions for BD-RIS. △ Less

Submitted 10 October, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

Comments: 7 pages, 5 figures, submitted to IEEE journal for possible publication

arXiv:2301.03286 [pdf, other]

doi 10.1109/TCOMM.2024.3447917.

A Dual-Function Radar-Communication System Empowered by Beyond Diagonal Reconfigurable Intelligent Surface

Authors: Bowen Wang, Hongyu Li, Shanpu Shen, Ziyang Cheng, Bruno Clerckx

Abstract: This work focuses on the use of reconfigurable intelligent surface (RIS) in dual-function radar-communication (DFRC) systems to improve communication capacity and sensing precision, and enhance coverage for both functions. In contrast to most of the existing RIS aided DFRC works where the RIS is modeled as a diagonal phase shift matrix and can only reflect signals to half space, we propose a novel… ▽ More This work focuses on the use of reconfigurable intelligent surface (RIS) in dual-function radar-communication (DFRC) systems to improve communication capacity and sensing precision, and enhance coverage for both functions. In contrast to most of the existing RIS aided DFRC works where the RIS is modeled as a diagonal phase shift matrix and can only reflect signals to half space, we propose a novel beyond diagonal RIS (BD-RIS) aided DFRC system. Specifically, the proposed BD-RIS supports the hybrid reflecting and transmitting mode, and is compatible with flexible architectures, enabling the system to realize full-space coverage and to achieve enhanced performance. To achieve the expected benefits, we jointly optimize the transmit waveform, the BD-RIS matrices, and sensing receive filters, by maximizing the minimum signal-to-clutter-plus-noise ratio for fair target detection, subject to the constraints of the communication quality of service, different BD-RIS architectures and power budget. To solve the non-convex and non-smooth max-min problem, a general solution based on the alternating direction method of multipliers is provided. Numerical simulations validate the efficacy of the proposed algorithm and show the superiority of the BD-RIS aided DFRC system in terms of both communication and sensing compared to conventional RIS aided DFRC. △ Less

Submitted 8 September, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

Comments: IEEE Transactions on Communications, 2024

arXiv:2212.12088 [pdf, other]

doi 10.1109/TPWRS.2023.3335348

Frequency Stability-Constrained Unit Commitment: Tight Approximation using Bernstein Polynomials

Authors: Bo Zhou, Ruiwei Jiang, Siqian Shen

Abstract: As we replace conventional synchronous generators with renewable energy, the frequency security of power systems is at higher risk. This calls for a more careful consideration of unit commitment (UC) and primary frequency response (PFR) reserves. This paper studies frequency-secured UC under significant wind power uncertainty. We coordinate the thermal units and wind farms to provide frequency sup… ▽ More As we replace conventional synchronous generators with renewable energy, the frequency security of power systems is at higher risk. This calls for a more careful consideration of unit commitment (UC) and primary frequency response (PFR) reserves. This paper studies frequency-secured UC under significant wind power uncertainty. We coordinate the thermal units and wind farms to provide frequency support, wherein we optimize the variable inverter droop factors of the wind farms for higher economy. In addition, we adopt distributionally robust chance constraints (DRCCs) to handle the wind power uncertainty. To depict the frequency dynamics, we incorporate a differential-algebraic equation (DAE) with the dead band into the UC model. Notably, we apply Bernstein polynomials to derive tight inner approximation of the DAE and obtain mixed-integer linear constraints, which can be computed in off-the-shelf solvers. Case studies demonstrate the tightness and effectiveness of the proposed method in guaranteeing frequency security. △ Less

Submitted 23 July, 2024; v1 submitted 22 December, 2022; originally announced December 2022.

arXiv:2211.06906 [pdf, other]

Digital Twin-Assisted Collaborative Transcoding for Better User Satisfaction in Live Streaming

Authors: Xinyu Huang, Mushu Li, Wen Wu, Conghao Zhou, Xuemin Sherman Shen

Abstract: In this paper, we propose a digital twin (DT)-assisted cloud-edge collaborative transcoding scheme to enhance user satisfaction in live streaming. We first present a DT-assisted transcoding workload estimation (TWE) model for the cloud-edge collaborative transcoding. Particularly, two DTs are constructed for emulating the cloud-edge collaborative transcoding process by analyzing spatial-temporal i… ▽ More In this paper, we propose a digital twin (DT)-assisted cloud-edge collaborative transcoding scheme to enhance user satisfaction in live streaming. We first present a DT-assisted transcoding workload estimation (TWE) model for the cloud-edge collaborative transcoding. Particularly, two DTs are constructed for emulating the cloud-edge collaborative transcoding process by analyzing spatial-temporal information of individual videos and transcoding configurations of transcoding queues, respectively. Two light-weight Bayesian neural networks are adopted to fit the TWE models in DTs, respectively. We then formulate a transcoding-path selection problem to maximize long-term user satisfaction within an average service delay threshold, taking into account the dynamics of video arrivals and video requests. The problem is transformed into a standard Markov decision process by using the Lyapunov optimization and solved by a deep reinforcement learning algorithm. Simulation results based on the real-world dataset demonstrate that the proposed scheme can effectively enhance user satisfaction compared with benchmark schemes. △ Less

Submitted 13 November, 2022; originally announced November 2022.

Comments: Submitted to ICC 2023

arXiv:2211.06117 [pdf, ps, other]

Closed-Form Global Optimization of Beyond Diagonal Reconfigurable Intelligent Surfaces

Authors: Matteo Nerini, Shanpu Shen, Bruno Clerckx

Abstract: Reconfigurable intelligent surfaces (RISs) allow controlling the propagation environment in wireless networks by tuning multiple reflecting elements. RISs have been traditionally realized through single connected architectures, mathematically characterized by a diagonal scattering matrix. Recently, beyond diagonal RISs (BD-RISs) have been proposed as a novel branch of RISs whose scattering matrix… ▽ More Reconfigurable intelligent surfaces (RISs) allow controlling the propagation environment in wireless networks by tuning multiple reflecting elements. RISs have been traditionally realized through single connected architectures, mathematically characterized by a diagonal scattering matrix. Recently, beyond diagonal RISs (BD-RISs) have been proposed as a novel branch of RISs whose scattering matrix is not limited to be diagonal, which creates new benefits and opportunities for RISs. Efficient BD-RIS architectures have been realized based on group and fully connected reconfigurable impedance networks. However, a closed-form solution for the global optimal scattering matrix of these architectures is not yet available. In this paper, we provide such a closed-form solution proving that the theoretical performance upper bounds can be exactly achieved for any channel realization. We first consider the received signal power maximization in single-user single-input single-output (SISO) systems aided by a BD-RIS working in reflective or transmissive mode. Then, we extend our solution to single-user multiple-input multiple-output (MIMO) and multi-user multiple-input single-output (MISO) systems. We show that our algorithm is less complex than the iterative optimization algorithms employed in the previous literature. The complexity of our algorithm grows linearly (resp. cubically) with the number of RIS elements in the case of group (resp. fully) connected architectures. △ Less

Submitted 14 June, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: Accepted by IEEE for publication

arXiv:2210.09920 [pdf, other]

An Efficient Ratio Detector for Ambient Backscatter Communication

Authors: Wenjing Liu, Shanpu Shen, Danny H. K. Tsang, Ranjan K. Mallik, Ross Murch

Abstract: Ambient backscatter communication (AmBC) leverages the existing ambient radio frequency (RF) environment to implement communication with battery-free devices. One critical challenge of AmBC systems is signal recovery because the transmitted information bits are embedded in the ambient RF signals and these are unknown and uncontrollable. To address this problem, most existing approaches use averagi… ▽ More Ambient backscatter communication (AmBC) leverages the existing ambient radio frequency (RF) environment to implement communication with battery-free devices. One critical challenge of AmBC systems is signal recovery because the transmitted information bits are embedded in the ambient RF signals and these are unknown and uncontrollable. To address this problem, most existing approaches use averaging-based energy detectors and consequently the data rate is low and there is an error floor. Here we propose a new detection strategy based on the ratio between signals received from a multiple-antenna Reader. The advantage of using the ratio is that ambient RF signals are removed directly from the embedded signals without averaging and hence it can increase data rates and avoid the error floor. Different from original ratio detectors that use the magnitude ratio of the signals between two Reader antennas, in our proposed approach, we utilize the complex ratio so that phase information is preserved and propose an accurate linear channel model approximation. This allows the application of existing linear detection techniques from which we can obtain a minimum distance detector and closed-form expressions for bit error rate (BER). In addition, averaging, coding and interleaving can also be included to further enhance the BER. The results are also general, allowing any number of Reader antennas to be utilized in the approach. Numerical results demonstrate that the proposed approach performs better than approaches based on energy detection and original ratio detectors. △ Less

Submitted 18 October, 2022; originally announced October 2022.

arXiv:2210.02499 [pdf, ps, other]

A Dynamic Grouping Strategy for Beyond Diagonal Reconfigurable Intelligent Surfaces with Hybrid Transmitting and Reflecting Mode

Authors: Hongyu Li, Shanpu Shen, Bruno Clerckx

Abstract: Beyond diagonal reconfigurable intelligent surface (BD-RIS) is a novel branch of RIS which breaks through the limitation of traditional RIS with diagonal scattering matrices. However, the existing research focuses on BD-RIS with fixed architectures regardless of channel state information (CSI), which limit the achievable performance of BD-RIS. To solve this issue, in this paper, we propose a novel… ▽ More Beyond diagonal reconfigurable intelligent surface (BD-RIS) is a novel branch of RIS which breaks through the limitation of traditional RIS with diagonal scattering matrices. However, the existing research focuses on BD-RIS with fixed architectures regardless of channel state information (CSI), which limit the achievable performance of BD-RIS. To solve this issue, in this paper, we propose a novel dynamically group-connected BD-RIS based on a dynamic grouping strategy. Specifically, RIS antennas are dynamically divided into several subsets adapting to the CSI, yielding a permuted block-diagonal scattering matrix. To verify the effectiveness of the proposed dynamically group-connected BD-RIS, we propose an efficient algorithm to optimize the BD-RIS with dynamic grouping for a BD-RIS-assisted multi-user multiple-input single-output system. Simulation results show that the proposed dynamically group-connected architecture outperforms fixed group-connected architectures. △ Less

Submitted 20 June, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

Comments: 6 pages, 6 figures, accepted by IEEE Trans. Veh. Technol

Showing 1–50 of 78 results for author: Shen, S