SGD Jittering: A Training Strategy for Robust and Accurate Model-Based Architectures

Authors: Peimeng Guan, Mark A. Davenport

Abstract: Inverse problems aim to reconstruct unseen data from corrupted or perturbed measurements. While most work focuses on improving reconstruction quality, generalization accuracy and robustness are equally important, especially for safety-critical applications. Model-based architectures (MBAs), such as loop unrolling methods, are considered more interpretable and achieve better reconstructions. Empirical evidence suggests that MBAs are more robust to perturbations than black-box solvers, but the accuracy-robustness tradeoff in MBAs remains underexplored. In this work, we propose a simple yet effective training scheme for MBAs, called SGD jittering, which injects noise iteration-wise during reconstruction. We theoretically demonstrate that SGD jittering not only generalizes better than the standard mean squared error training but is also more robust to average-case attacks. We validate SGD jittering using denoising toy examples, seismic deconvolution, and single-coil MRI reconstruction. Both SGD jittering and its SPGD extension yield cleaner reconstructions for out-of-distribution data and demonstrate enhanced robustness against adversarial attacks.

Submitted 6 June, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

Comments: ICML 2025

arXiv:2409.09542 [pdf, other]

MANGO: Learning Disentangled Image Transformation Manifolds with Grouped Operators

Authors: Brighton Ancelin, Yenho Chen, Peimeng Guan, Chiraag Kaushik, Belen Martin-Urcelay, Alex Saad-Falcon, Nakul Singh

Abstract: Learning semantically meaningful image transformations (e.g., rotation, thickness, blur) directly from examples can be a challenging task. Recently, the Manifold Autoencoder (MAE) proposed using a set of Lie group operators to learn image transformations directly from examples. However, this approach has limitations, as the learned operators are not guaranteed to be disentangled and the training routine is prohibitively expensive when scaling up the model. To address these limitations, we propose MANGO (transformation Manifolds with Grouped Operators) for learning disentangled operators that describe image transformations in distinct latent subspaces. Moreover, our approach gives practitioners the ability to define which transformations they aim to model, thus improving the semantic meaning of the learned operators. Through our experiments, we demonstrate that MANGO enables composition of image transformations and introduces a one-phase training routine that leads to a 100x speedup over prior works.

Submitted 20 April, 2025; v1 submitted 14 September, 2024; originally announced September 2024.

Comments: Submitted to SampTA 2025. This work has been submitted to the IEEE for possible publication

ACM Class: I.2.6; I.4.2; I.4.7; I.4.10; I.5.1

arXiv:2408.03446 [pdf, other]

Optimizing NOMA Transmissions to Advance Federated Learning in Vehicular Networks

Authors: Ziru Chen, Zhou Ni, Peiyuan Guan, Lu Wang, Lin X. Cai, Morteza Hashemi, Zongzhi Li

Abstract: Diverse critical data, such as location information and driving patterns, can be collected by IoT devices in vehicular networks to improve driving experiences and road safety. However, drivers are often reluctant to share their data due to privacy concerns. The Federated Vehicular Network (FVN) is a promising technology that tackles these concerns by transmitting model parameters instead of raw data, thereby protecting the privacy of drivers. Nevertheless, the performance of Federated Learning (FL) in a vehicular network depends on the joining ratio, which is restricted by the limited available wireless resources. To address these challenges, this paper proposes to apply Non-Orthogonal Multiple Access (NOMA) to improve the joining ratio in an FVN. Specifically, a vehicle selection and transmission power control algorithm is developed to exploit the power domain differences in the received signal to ensure the maximum number of vehicles capable of joining the FVN. Our simulation results demonstrate that the proposed NOMA-based strategy increases the joining ratio and significantly enhances the performance of the FVN.

Submitted 6 August, 2024; originally announced August 2024.

Comments: The paper is accepted by IEEE Globecom 2024

arXiv:2403.04847 [pdf, other]

Solving Inverse Problems with Model Mismatch using Untrained Neural Networks within Model-based Architectures

Authors: Peimeng Guan, Naveed Iqbal, Mark A. Davenport, Mudassir Masood

Abstract: Model-based deep learning methods such as loop unrolling (LU) and deep equilibrium model (DEQ) extensions offer outstanding performance in solving inverse problems (IP). These methods unroll the optimization iterations into a sequence of neural networks that in effect learn a regularization function from data. While these architectures are currently state-of-the-art in numerous applications, their success heavily relies on the accuracy of the forward model. This assumption can be limiting in many physical applications due to model simplifications or uncertainties in the apparatus. To address forward model mismatch, we introduce an untrained forward model residual block within the model-based architecture to match the data consistency in the measurement domain for each instance. We propose two variants in well-known model-based architectures (LU and DEQ) and prove convergence under mild conditions. Our approach offers a unified solution that is less parameter-sensitive, requires no additional data, and enables simultaneous fitting of the forward model and reconstruction in a single pass, benefiting both linear and nonlinear inverse problems. The experiments show significant quality improvement in removing artifacts and preserving details across three distinct applications, encompassing both linear and nonlinear inverse problems. Moreover, we highlight reconstruction effectiveness in intermediate steps and showcase robustness to random initialization of the residual block and a higher number of iterations during evaluation. Code is available at https://github.com/InvProbs/A-adaptive-model-based-methods.

Submitted 10 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

Comments: Published in Transactions on Machine Learning Research (TMLR)

arXiv:2307.10030 [pdf, other]

Learned Proximal Operator for Solving Seismic Deconvolution Problem

Authors: Peimeng Guan, Naveed Iqbal, Mark A. Davenport, Mudassir Masood

Abstract: Seismic deconvolution is an essential step in seismic data processing that aims to extract layer information from noisy observed traces. In general, this is an ill-posed problem with non-unique solutions. Due to the sparse nature of the reflectivity sequence, spike-promoting regularizers such as the $\ell_1$-norm are frequently used. They either require rigorous coefficient tuning or strong assumptions about reflectivity, such as assuming that reflectivity is a sparse signal with a known sparsity level and that the noise is zero-mean Gaussian with a known noise level. To overcome the limitations of traditional regularizers, learning-based regularizers have been proposed recently. This paper proposes a Learned Proximal operator for Seismic Deconvolution (LP4SD), which leverages a neural network to learn the proximal operator of a regularizer. LP4SD is trained in a loop unrolled manner and is capable of learning complicated structures from the training data. It is worth mentioning that the network is trained with synthetic data and evaluated on both synthetic and real data. LP4SD is shown to generate better reconstruction results in terms of three different metrics as compared to learning a direct inverse.

Submitted 19 July, 2023; originally announced July 2023.

arXiv:2306.12675 [pdf, other]

STAR-RIS-Assisted Privacy Protection in Semantic Communication System

Authors: Yiru Wang, Wanting Yang, Pengxin Guan, Yuping Zhao, Zehui Xiong

Abstract: Semantic communication (SemCom) has emerged as a promising architecture in the realm of intelligent communication paradigms. SemCom involves extracting and compressing the core information at the transmitter while enabling the receiver to interpret it based on established knowledge bases (KBs). This approach enhances communication efficiency greatly. However, the open nature of wireless transmission and the presence of homogeneous KBs among subscribers of identical data type pose a risk of privacy leakage in SemCom. To address this challenge, we propose to leverage the simultaneous transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) to achieve privacy protection in a SemCom system. In this system, the STAR-RIS is utilized to enhance the signal transmission of the SemCom between a base station and a destination user, as well as to convert the signal into interference specifically for the eavesdropper (Eve). Simulation results demonstrate that our generated task-level disturbance outperforms other benchmarks in protecting SemCom privacy, as evidenced by the significantly lower task success rate achieved by Eve.

Submitted 22 June, 2023; originally announced June 2023.

arXiv:2210.04987 [pdf, other]

Loop Unrolled Shallow Equilibrium Regularizer (LUSER) -- A Memory-Efficient Inverse Problem Solver

Authors: Peimeng Guan, Jihui Jin, Justin Romberg, Mark A. Davenport

Abstract: In inverse problems we aim to reconstruct some underlying signal of interest from potentially corrupted and often ill-posed measurements. Classical optimization-based techniques proceed by optimizing a data consistency metric together with a regularizer. Current state-of-the-art machine learning approaches draw inspiration from such techniques by unrolling the iterative updates for an optimization-based solver and then learning a regularizer from data. This loop unrolling (LU) method has shown tremendous success, but often requires a deep model for the best performance leading to high memory costs during training. Thus, to address the balance between computation cost and network expressiveness, we propose an LU algorithm with shallow equilibrium regularizers (LUSER). These implicit models are as expressive as deeper convolutional networks, but far more memory efficient during training. The proposed method is evaluated on image deblurring, computed tomography (CT), as well as single-coil Magnetic Resonance Imaging (MRI) tasks and shows similar, or even better, performance while requiring up to 8 times less computational resources during training when compared against a more typical LU architecture with feedforward convolutional regularizers.

Submitted 13 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

arXiv:2206.08797 [pdf, ps, other]

Secure Wireless Transmission for Reconfigurable Intelligent Surface Aided Full Duplex Systems

Authors: Pengxin Guan, Yiru Wang, Yuping Zhao

Abstract: This letter considers secure communication in a reconfigurable intelligent surface (RIS) aided full duplex (FD) system. An FD base station (BS) serves an uplink (UL) user and a downlink (DL) user simultaneously over the same time-frequency dimension, assisted by a RIS, in the presence of an eavesdropper. In addition, artificial noise (AN) is applied to interfere with the eavesdropper's channel. We aim to maximize the sum secrecy rate of UL and DL users by jointly optimizing the transmit beamforming, receive beamforming and AN covariance matrix at the BS, and passive beamforming at the RIS. To handle the non-convex problem, we decompose it into tractable subproblems and propose an efficient algorithm based on an alternating optimization framework. Specifically, the receive beamforming is derived as a closed-form solution while other variables are obtained by using the semidefinite relaxation (SDR) method and the successive convex approximation (SCA) algorithm. Simulation results demonstrate the superior performance of our proposed scheme compared to other baseline schemes.

Submitted 25 September, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

Comments: The paper has been submitted to an IEEE journal for possible publication

arXiv:2205.12079 [pdf, ps, other]

Reconfigurable Intelligent Surfaces for Energy Efficiency in Full-duplex Communication System

Authors: Yiru Wang, Pengxin Guan, Hongkang Yu, Yuping Zhao

Abstract: In this letter, we study the reconfigurable intelligent surface (RIS) aided full-duplex (FD) communication system. By jointly designing the active beamforming of two multi-antenna sources and the passive beamforming of the RIS, we aim to maximize the energy efficiency of the system, where the extra self-interference cancellation power consumption in the FD system is also considered. We divide the optimization problem into active and passive beamforming design subproblems, and adopt the alternating optimization framework to solve them iteratively. Dinkelbach's method is used to tackle the fractional objective function in the active beamforming problem. The penalty method and successive convex approximation are exploited for passive beamforming design. Simulation results show that the energy efficiency of our scheme outperforms other benchmarks.

Submitted 24 May, 2022; originally announced May 2022.

arXiv:2203.07054 [pdf, ps, other]

Energy Efficiency Maximization of Simultaneous Transmission and Reflection RIS Assisted Full-Duplex Communications

Authors: Pengxin Guan, Yiru Wang, Hongkang Yu, Yuping Zhao

Abstract: This work studies the effectiveness of a novel simultaneous transmission and reflection reconfigurable intelligent surface (STAR-RIS) aided Full-Duplex (FD) communication system. We aim to maximize the energy efficiency by jointly optimizing the transmit power and passive beamforming at the STAR-RIS. We propose an efficient algorithm to optimize them iteratively under the alternating optimization framework. The successive convex approximation (SCA) and Dinkelbach's method are used to solve the power optimization subproblem. The penalty-based method is used to design passive beamforming at the STAR-RIS. Numerical results verify the convergence and effectiveness of the proposed algorithm, and further reveal the benefits of combining the STAR-RIS with FD communication compared to benchmarks.

Submitted 15 April, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

Comments: arXiv admin note: text overlap with arXiv:2203.05411

arXiv:2203.05411 [pdf, other]

Simultaneous Transmission and Reflection Reconfigurable Intelligent Surface Assisted Full-Duplex Communications

Authors: Yiru Wang, Pengxin Guan, Hongkang Yu, Yuping Zhao

Abstract: This work demonstrates the effectiveness of a novel simultaneous transmission and reflection reconfigurable intelligent surface (STAR-RIS) in a Full-Duplex (FD) aided communication system. The objective is to minimize the total transmit power by jointly designing the transmit power and the transmitting and reflecting (T&R) coefficients of the STAR-RIS. To solve the nonconvex problem, an efficient algorithm is proposed by utilizing the alternating optimization framework to iteratively optimize variables. Specifically, in each iteration, we derive the closed-form expression for the optimal power design. The successive convex approximation (SCA) method and semidefinite program (SDP) are used to solve the passive beamforming optimization problem. Numerical results verify the convergence and effectiveness of the proposed algorithm, and further reveal in which scenarios STAR-RIS assisted FD communication outperforms Half-Duplex and conventional RIS.

Submitted 12 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

arXiv:2101.06845 [pdf, ps, other]

Performance Analysis and Codebook Design for mmWave Beamforming System with Beam Squint

Authors: Hongkang Yu, Pengxin Guan, Yiru Wang, Yuping Zhao

Abstract: Beamforming technology is widely used in millimeter wave systems to combat path losses, and beamformers are usually selected from a predefined codebook. Unfortunately, the traditional codebook design neglects the beam squint effect, and this will cause severe performance degradation when the bandwidth is large. In this letter, we consider that a codebook with fixed size is adopted in the wideband beamforming system. First, we analyze how beam squint affects system performance when all beams have the same width. The expression of average spectrum efficiency is derived based on the ideal beam pattern. Next, we formulate the optimization problem to design the optimal codebook. Simulation results demonstrate that the proposed codebook deals with beam squint by spreading the beam coverage and significantly mitigates the performance degradation.

Submitted 20 June, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

arXiv:1902.00837 [pdf]

UAV-aided urban target tracking system based on edge computing

Authors: Yajun Liu, Congxu Zhu, Xiaoheng Deng, Peiyuan Guan, Zhiwen Wan, Jie Luo, Enlu Liu, Honggang Zhang

Abstract: Target tracking is an important issue of social security. In order to track a target, traditionally a large amount of surveillance video data needs to be uploaded into the cloud for processing and analysis, which puts tremendous bandwidth pressure on communication links in access networks and core networks. At the same time, the long delay in a wide area network is very likely to cause a tracking system to lose its target. Often, unmanned aerial vehicles (UAVs) have been adopted for target tracking due to their flexibility, but their limited flight time due to battery constraints and the blocking by various obstacles in the field pose two major challenges to the target tracking task, which are also very likely to result in the loss of the target. A novel target tracking model that coordinates the tracking by UAV and ground nodes in an edge computing environment is proposed in this study. The model can effectively reduce the communication cost and the long delay of the traditional surveillance camera system that relies on cloud computing, and it can improve the probability of finding a target again after a UAV loses track of that target. It has been demonstrated that the proposed system achieves significantly better performance in terms of low latency, high reliability, and optimal quality of experience (QoE).

Submitted 2 February, 2019; originally announced February 2019.

arXiv:1401.3198 [pdf, other]

Online Markov decision processes with Kullback-Leibler control cost

Authors: Peng Guan, Maxim Raginsky, Rebecca Willett

Abstract: This paper considers an online (real-time) control problem that involves an agent performing a discrete-time random walk over a finite state space. The agent's action at each time step is to specify the probability distribution for the next state given the current state. Following the set-up of Todorov, the state-action cost at each time step is a sum of a state cost and a control cost given by the Kullback-Leibler (KL) divergence between the agent's next-state distribution and that determined by some fixed passive dynamics. The online aspect of the problem is due to the fact that the state cost functions are generated by a dynamic environment, and the agent learns the current state cost only after selecting an action. An explicit construction of a computationally efficient strategy with small regret (i.e., expected difference between its actual total cost and the smallest cost attainable using noncausal knowledge of the state costs) under mild regularity conditions is presented, along with a demonstration of the performance of the proposed strategy on a simulated target tracking problem. A number of new results on Markov decision processes with KL control cost are also obtained.

Submitted 14 January, 2014; originally announced January 2014.

Comments: to appear in IEEE Transactions on Automatic Control

Showing 1–14 of 14 results for author: Guan, P