Search | arXiv e-print repository

arXiv:2508.20649 [pdf, ps, other]

Physics-Constrained Machine Learning for Chemical Engineering

Authors: Angan Mukherjee, Victor M. Zavala

Abstract: Physics-constrained machine learning (PCML) combines physical models with data-driven approaches to improve reliability, generalizability, and interpretability. Although PCML has shown significant benefits in diverse scientific and engineering domains, technical and intellectual challenges hinder its applicability in complex chemical engineering applications. Key difficulties include determining the amount and type of physical knowledge to embed, designing effective fusion strategies with ML, scaling models to large datasets and simulators, and quantifying predictive uncertainty. This perspective summarizes recent developments and highlights challenges/opportunities in applying PCML to chemical engineering, emphasizing closed-loop experimental design, real-time dynamics and control, and handling of multi-scale phenomena.

Submitted 28 August, 2025; originally announced August 2025.

arXiv:2508.03327 [pdf, ps, other]

Quantum Deep Learning for Massive MIMO User Scheduling

Authors: Xingyu Huang, Ruining Fan, Mouli Chakraborty, Avishek Nag, Anshu Mukherjee

Abstract: We introduce a hybrid Quantum Neural Network (QNN) architecture for efficient user scheduling in 5G/Beyond 5G (B5G) massive Multiple Input Multiple Output (MIMO) systems, addressing the scalability issues of traditional methods. By leveraging statistical Channel State Information (CSI), our model reduces computational overhead and enhances spectral efficiency. It integrates classical neural networks with a variational quantum circuit kernel, outperforming classical Convolutional Neural Networks (CNNs) and maintaining robust performance in noisy channels. This demonstrates the potential of quantum-enhanced Machine Learning (ML) for wireless scheduling.

Submitted 5 August, 2025; originally announced August 2025.

arXiv:2507.23695 [pdf, ps, other]

On the Achievable Rate of Satellite Quantum Communication Channel using Deep Autoencoder Gaussian Mixture Model

Authors: Mouli Chakraborty, Subhash Chandra, Avishek Nag, Anshu Mukherjee

Abstract: We present a comparative study of the Gaussian mixture model (GMM) and the Deep Autoencoder Gaussian Mixture Model (DAGMM) for estimating satellite quantum channel capacity, considering hybrid quantum noise (HQN) and transmission constraints. While GMM is simple and interpretable, DAGMM better captures non-linear variations and noise distributions. Simulations show that DAGMM provides tighter capacity bounds and improved clustering. This introduces the Deep Cluster Gaussian Mixture Model (DCGMM) for high-dimensional quantum data analysis in quantum satellite communication.

Submitted 31 July, 2025; originally announced July 2025.

arXiv:2506.01737 [pdf, ps, other]

The Promise of Spiking Neural Networks for Ubiquitous Computing: A Survey and New Perspectives

Authors: Hemanth Sabbella, Archit Mukherjee, Thivya Kandappu, Sounak Dey, Arpan Pal, Archan Misra, Dong Ma

Abstract: Spiking neural networks (SNNs) have emerged as a class of bio-inspired networks that leverage sparse, event-driven signaling to achieve low-power computation while inherently modeling temporal dynamics. Such characteristics align closely with the demands of ubiquitous computing systems, which often operate on resource-constrained devices while continuously monitoring and processing time-series sensor data. Despite their unique and promising features, SNNs have received limited attention and remain underexplored (or at least, under-adopted) within the ubiquitous computing community. To address this gap, this paper first introduces the core components of SNNs, both in terms of models and training mechanisms. It then presents a systematic survey of 76 SNN-based studies focused on time-series data analysis, categorizing them into six key application domains. For each domain, we summarize relevant works and subsequent advancements, distill core insights, and highlight key takeaways for researchers and practitioners. To facilitate hands-on experimentation, we also provide a comprehensive review of current software frameworks and neuromorphic hardware platforms, detailing their capabilities and specifications, and then offering tailored recommendations for selecting development tools based on specific application needs. Finally, we identify prevailing challenges within each application domain and propose future research directions that need to be explored in the ubiquitous computing community.
Our survey highlights the transformative potential of SNNs in enabling energy-efficient ubiquitous sensing across diverse application domains, while also serving as an essential introduction for researchers looking to enter this emerging field.

Submitted 2 June, 2025; originally announced June 2025.

Comments: 50 pages

ACM Class: I.2

arXiv:2505.19233 [pdf, ps, other]

RAISE: Realness Assessment for Image Synthesis and Evaluation

Authors: Aniruddha Mukherjee, Spriha Dubey, Somdyuti Paul

Abstract: The rapid advancement of generative AI has enabled the creation of highly photorealistic visual content, offering practical substitutes for real images and videos in scenarios where acquiring real data is difficult or expensive. However, reliably substituting real visual content with AI-generated counterparts requires robust assessment of the perceived realness of AI-generated visual content, a challenging task due to its inherent subjective nature. To address this, we conducted a comprehensive human study evaluating the perceptual realness of both real and AI-generated images, resulting in a new dataset, containing images paired with subjective realness scores, introduced as RAISE in this paper. Further, we develop and train multiple models on RAISE to establish baselines for realness prediction. Our experimental results demonstrate that features derived from deep foundation vision models can effectively capture the subjective realness. RAISE thus provides a valuable resource for developing robust, objective models of perceptual realness assessment.

Submitted 3 August, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

arXiv:2505.11572 [pdf, ps, other]

ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems

Authors: Anand Rai, Satyam Rahangdale, Utkarsh Anand, Animesh Mukherjee

Abstract: Automatic Speech Recognition (ASR) systems have become ubiquitous in everyday applications, yet significant disparities in performance across diverse demographic groups persist. In this work, we introduce the ASR-FAIRBENCH leaderboard, which is designed to assess both the accuracy and equity of ASR models in real time. Leveraging Meta's Fair-Speech dataset, which captures diverse demographic characteristics, we employ a mixed-effects Poisson regression model to derive an overall fairness score. This score is integrated with traditional metrics like Word Error Rate (WER) to compute the Fairness Adjusted ASR Score (FAAS), providing a comprehensive evaluation framework. Our approach reveals significant performance disparities in SOTA ASR models across demographic groups and offers a benchmark to drive the development of more inclusive ASR technologies.

Submitted 16 May, 2025; originally announced May 2025.

Comments: Paper accepted at INTERSPEECH 2025

arXiv:2501.07590 [pdf]

Ultrafast pulsed laser evaluation of Single Event Transients in opto-couplers

Authors: Kavin Dave, Aditya Mukherjee, Hari Shanker Gupta, Deepak Jain, Shalabh Gupta

Abstract: We build a 1064 nm fiber laser system-based testing facility for emulating SETs in different electronics components and ICs. Using these facilities, we tested the 4N35 optocoupler to observe SETs for the first time.

Submitted 8 January, 2025; originally announced January 2025.

Comments: Accepted in CLEO 2023, San Jose, USA and CLEO 2024, North Carolina, USA for poster presentation. However, due to lack of funds, we could not travel

arXiv:2411.12719 [pdf, other]

Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation

Authors: Praveen Srinivasa Varadhan, Amogh Gulati, Ashwin Sankar, Srija Anand, Anirudh Gupta, Anirudh Mukherjee, Shiva Kumar Marepally, Ankur Bhatia, Saloni Jaju, Suvrat Bhooshan, Mitesh M. Khapra

Abstract: Despite rapid advancements in TTS models, a consistent and robust human evaluation framework is still lacking. For example, MOS tests fail to differentiate between similar models, and CMOS's pairwise comparisons are time-intensive. The MUSHRA test is a promising alternative for evaluating multiple TTS systems simultaneously, but in this work we show that its reliance on matching human reference speech unduly penalises the scores of modern TTS systems that can exceed human speech quality. More specifically, we conduct a comprehensive assessment of the MUSHRA test, focusing on its sensitivity to factors such as rater variability, listener fatigue, and reference bias. Based on our extensive evaluation involving 492 human listeners across Hindi and Tamil we identify two primary shortcomings: (i) reference-matching bias, where raters are unduly influenced by the human reference, and (ii) judgement ambiguity, arising from a lack of clear fine-grained guidelines. To address these issues, we propose two refined variants of the MUSHRA test. The first variant enables fairer ratings for synthesized samples that surpass human reference quality. The second variant reduces ambiguity, as indicated by the relatively lower variance across raters. By combining these approaches, we achieve both more reliable and more fine-grained assessments. We also release MANGO, a massive dataset of 246,000 human ratings, the first-of-its-kind collection for Indian languages, aiding in analyzing human preferences and developing automatic metrics for evaluating TTS systems.

Submitted 26 May, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

Comments: Accepted in TMLR

arXiv:2410.16712 [pdf, other]

DENOASR: Debiasing ASRs through Selective Denoising

Authors: Anand Kumar Rai, Siddharth D Jaiswal, Shubham Prakash, Bendi Pragnya Sree, Animesh Mukherjee

Abstract: Automatic Speech Recognition (ASR) systems have been examined and shown to exhibit biases toward particular groups of individuals, influenced by factors such as demographic traits, accents, and speech styles. Noise can disproportionately impact speakers with certain accents, dialects, or speaking styles, leading to biased error rates. In this work, we introduce a novel framework DENOASR, which is a selective denoising technique to reduce the disparity in the word error rates between the two gender groups, male and female. We find that a combination of two popular speech denoising techniques, viz. DEMUCS and LE, can be effectively used to mitigate ASR disparity without compromising their overall performance. Experiments using two state-of-the-art open-source ASRs - OpenAI WHISPER and NVIDIA NEMO - on multiple benchmark datasets, including TIE, VOX-POPULI, TEDLIUM, and FLEURS, show that there is a promising reduction in the average word error rate gap across the two gender groups. For a given dataset, the denoising is selectively applied on speech samples having speech intelligibility below a certain threshold, estimated using a small validation sample, thus ameliorating the need for large-scale human-written ground-truth transcripts. Our findings suggest that selective denoising can be an elegant approach to mitigate biases in present-day ASR systems.

Submitted 22 October, 2024; originally announced October 2024.

Comments: Paper accepted at IEEE ICKG 2024

arXiv:2410.15418 [pdf, other]

A Hybrid Noise Approach to Modelling of Free-Space Satellite Quantum Communication Channel for Continuous-Variable QKD

Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

Abstract: This paper significantly advances the application of Quantum Key Distribution (QKD) in Free-Space Optics (FSO) satellite-based quantum communication. We propose an innovative satellite quantum channel model and derive the secret quantum key distribution rate achievable through this channel. Unlike existing models that approximate the noise in quantum channels as merely Gaussian distributed, our model incorporates a hybrid noise analysis, accounting for both quantum Poissonian noise and classical Additive-White-Gaussian Noise (AWGN). This hybrid approach acknowledges the dual vulnerability of continuous variables (CV) Gaussian quantum channels to both quantum and classical noise, thereby offering a more realistic assessment of the quantum Secret Key Rate (SKR). This paper delves into the variation of SKR with the Signal-to-Noise Ratio (SNR) under various influencing parameters. We identify and analyze critical factors such as reconciliation efficiency, transmission coefficient, transmission efficiency, the quantum Poissonian noise parameter, and the satellite altitude. These parameters are pivotal in determining the SKR in FSO satellite quantum channels, highlighting the challenges of satellite-based quantum communication. Our work provides a comprehensive framework for understanding and optimizing SKR in satellite-based QKD systems, paving the way for more efficient and secure quantum communication networks.

Submitted 20 October, 2024; originally announced October 2024.

arXiv:2409.04746 [pdf, other]

Hybrid Quantum Noise Approximation and Pattern Analysis on Parameterized Component Distributions

Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

Abstract: Noise is a vital factor in determining the accuracy of processing the information of the quantum channel. One must consider classical noise effects associated with quantum noise sources for more realistic modelling of quantum channels. A hybrid quantum noise model incorporating both quantum Poisson noise and classical additive white Gaussian noise (AWGN) can be interpreted as an infinite mixture of Gaussians with weightage from the Poisson distribution. The entropy measure of this function is difficult to calculate. This research developed how the infinite mixture can be well approximated by a finite mixture distribution depending on the Poisson parametric setting compared to the number of mixture components. The mathematical analysis of the characterization of hybrid quantum noise has been demonstrated based on Gaussian and Poisson parametric analysis. This helps in the pattern analysis of the parametric values of the component distribution, and it also helps in the calculation of hybrid noise entropy to understand hybrid quantum noise better.

Submitted 7 September, 2024; originally announced September 2024.

arXiv:2404.08993 [pdf, other]

An Unsupervised Machine Learning to Optimize Hybrid Quantum Noise Clusters for Gaussian Quantum Channel

Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

Abstract: This work focuses on optimizing the hybrid quantum noise model to improve the capacity of Gaussian quantum channels using Machine Learning (ML) generated clusters. The work specifically leverages Gaussian Mixture Model (GMM) and the Expectation-Maximization (EM) algorithm to model the complex noise characteristics of quantum channels. Hybrid quantum noise, which includes both quantum shot noise and classical Additive-White-Gaussian Noise (AWGN), is modeled as an infinite mixture of Gaussian distributions weighted by Poissonian parameters. The study proposes a method to reduce the number of clusters within this noise model, simplifying visualization and improving the accuracy of channel capacity estimations without compromising essential noise characteristics. Key contributions include the reduction of Gaussian clusters while maintaining error tolerances and using the EM algorithm to update quantum channel parameters, leading to more accurate channel capacity. The approach is validated through simulations, demonstrating that ML-enhanced quantum noise clustering significantly improves the channel's performance in satellite-based quantum communication systems, specifically for Quantum Key Distribution (QKD). The work demonstrates that GMM and EM algorithms provide a practical solution for modeling quantum noise in real-time applications, advancing the optimization of quantum communication networks.

Submitted 23 January, 2025; v1 submitted 13 April, 2024; originally announced April 2024.

arXiv:2402.10119 [pdf, other]

Physics-Informed Neural Network Policy Iteration: Algorithms, Convergence, and Verification

Authors: Yiming Meng, Ruikun Zhou, Amartya Mukherjee, Maxwell Fitzsimmons, Christopher Song, Jun Liu

Abstract: Solving nonlinear optimal control problems is a challenging task, particularly for high-dimensional problems. We propose algorithms for model-based policy iterations to solve nonlinear optimal control problems with convergence guarantees. The main component of our approach is an iterative procedure that utilizes neural approximations to solve linear partial differential equations (PDEs), ensuring convergence. We present two variants of the algorithms. The first variant formulates the optimization problem as a linear least square problem, drawing inspiration from extreme learning machine (ELM) for solving PDEs. This variant efficiently handles low-dimensional problems with high accuracy. The second variant is based on a physics-informed neural network (PINN) for solving PDEs and has the potential to address high-dimensional problems. We demonstrate that both algorithms outperform traditional approaches, such as Galerkin methods, by a significant margin. We provide a theoretical analysis of both algorithms in terms of convergence of neural approximations towards the true optimal solutions in a general setting. Furthermore, we employ formal verification techniques to demonstrate the verifiable stability of the resulting controllers.

Submitted 15 February, 2024; originally announced February 2024.

arXiv:2312.07601 [pdf, other]

Non-contact Multimodal Indoor Human Monitoring Systems: A Survey

Authors: Le Ngu Nguyen, Praneeth Susarla, Anirban Mukherjee, Manuel Lage Cañellas, Constantino Álvarez Casado, Xiaoting Wu, Olli Silvén, Dinesh Babu Jayagopi, Miguel Bordallo López

Abstract: Indoor human monitoring systems leverage a wide range of sensors, including cameras, radio devices, and inertial measurement units, to collect extensive data from users and the environment. These sensors contribute diverse data modalities, such as video feeds from cameras, received signal strength indicators and channel state information from WiFi devices, and three-axis acceleration data from inertial measurement units. In this context, we present a comprehensive survey of multimodal approaches for indoor human monitoring systems, with a specific focus on their relevance in elderly care. Our survey primarily highlights non-contact technologies, particularly cameras and radio devices, as key components in the development of indoor human monitoring systems. Throughout this article, we explore well-established techniques for extracting features from multimodal data sources. Our exploration extends to methodologies for fusing these features and harnessing multiple modalities to improve the accuracy and robustness of machine learning models. Furthermore, we conduct comparative analysis across different data modalities in diverse human monitoring tasks and undertake a comprehensive examination of existing multimodal datasets. This extensive survey not only highlights the significance of indoor human monitoring systems but also affirms their versatile applications.
In particular, we emphasize their critical role in enhancing the quality of elderly care, offering valuable insights into the development of non-contact monitoring solutions applicable to the needs of aging populations.

Submitted 11 December, 2023; originally announced December 2023.

Comments: 19 pages, 5 figures

arXiv:2310.00216 [pdf, other]

A Novel U-Net Architecture for Denoising of Real-world Noise Corrupted Phonocardiogram Signal

Authors: Ayan Mukherjee, Rohan Banerjee, Avik Ghose

Abstract: The bio-acoustic information contained within heart sound signals is utilized by physicians world-wide for auscultation. However, heart sounds are inherently susceptible to noise contamination. Various sources of noise, like lung sounds, coughing, sneezing, and other background noises, are involved in such contamination. Such corruption of the heart sound signal often leads to inconclusive or false diagnosis. To address this issue, we have proposed a novel U-Net based deep neural network architecture for denoising of the phonocardiogram (PCG) signal in this paper. For the design, development and validation of the proposed architecture, a novel approach of synthesizing real-world noise corrupted PCG signals has been proposed. For this purpose, an open-access real-world noise sample dataset and an open-access PCG dataset have been utilized. The performance of the proposed denoising methodology has been evaluated on the synthesized noisy PCG dataset. The performance of the proposed algorithm has been compared with existing state-of-the-art (SoA) denoising algorithms qualitatively and quantitatively. The proposed denoising technique has shown improvement in performance in comparison to the SoAs.

Submitted 29 September, 2023; originally announced October 2023.

arXiv:2304.03572 [pdf, other]

Weakly supervised segmentation with point annotations for histopathology images via contrast-based variational model

Authors: Hongrun Zhang, Liam Burrows, Yanda Meng, Declan Sculthorpe, Abhik Mukherjee, Sarah E Coupland, Ke Chen, Yalin Zheng

Abstract: Image segmentation is a fundamental task in the field of imaging and vision. Supervised deep learning for segmentation has achieved unparalleled success when sufficient training data with annotated labels are available. However, annotation is known to be expensive to obtain, especially for histopathology images where the target regions are usually with high morphology variations and irregular shapes. Thus, weakly supervised learning with sparse annotations of points is promising to reduce the annotation workload. In this work, we propose a contrast-based variational model to generate segmentation results, which serve as reliable complementary supervision to train a deep segmentation model for histopathology images. The proposed method considers the common characteristics of target regions in histopathology images and can be trained in an end-to-end manner. It can generate more regionally consistent and smoother boundary segmentation, and is more robust to unlabeled 'novel' regions. Experiments on two different histology datasets demonstrate its effectiveness and efficiency in comparison to previous models.

Submitted 7 April, 2023; originally announced April 2023.

Comments: Accepted to CVPR2023

arXiv:2209.00581 [pdf, other]

On the Energy-Efficiency Maximization for IRS-Assisted MIMOME Wiretap Channels

Authors: Anshu Mukherjee, Vaibhav Kumar, Derrick Wing Kwan Ng, Le-Nam Tran

Abstract: Security and energy efficiency have become crucial features in modern-era wireless communication. In this paper, we consider an energy-efficient design for intelligent reflecting surface (IRS)-assisted multiple-input multiple-output multiple-eavesdropper (MIMOME) wiretap channels (WTC). Our objective is to jointly optimize the transmit covariance matrix and the IRS phase-shifts to maximize the secrecy energy efficiency (SEE) of the considered system subject to a secrecy rate constraint at the legitimate receiver. To tackle this challenging non-convex problem in which the design variables are coupled in the objective and the constraint, we propose a penalty dual decomposition based alternating gradient projection (PDDAPG) method to obtain an efficient solution. We also show that the computational complexity of the proposed algorithm grows only linearly with the number of reflecting elements at the IRS, as well as with the number of antennas at transmitter/receivers' nodes. Our results confirm that using an IRS is helpful to improve the SEE of MIMOME WTC compared to its no-IRS counterpart only when the power consumption at IRS is small. In particular, a large-sized IRS is not always beneficial for the SEE of a MIMOME WTC.

Submitted 1 September, 2022; originally announced September 2022.

Comments: 6 pages, 7 figures

Journal ref: IEEE 96th Vehicular Technology Conference: VTC2022-Fall

arXiv:2108.10688 [pdf, other]

Secrecy Rate Maximization for Intelligent Reflecting Surface Assisted MIMOME Wiretap Channels

Authors: Anshu Mukherjee, Vaibhav Kumar, Le-Nam Tran

Abstract: Intelligent reflecting surface (IRS) has gained tremendous attention recently as a disruptive technology for beyond 5G networks. In this paper, we consider the problem of secrecy rate maximization for an IRS-assisted Gaussian multiple-input multiple-output multi-antenna-eavesdropper (MIMOME) wiretap channel (WTC). In this context, we aim to jointly optimize the input covariance matrix and the IRS phase shifts to maximize the achievable secrecy rate of the considered system. To solve the formulated problem which is non-convex, we propose an iterative method based on the block successive maximization (BSM), where each iteration is done in closed form. More specifically, we maximize a lower bound on the achievable secrecy rate to update the input covariance matrix for fixed phase shifts, and then maximize the (exact) achievable secrecy rate to update phase shifts for a given input covariance. We consider the total free space path loss (FSPL) in this system to emphasize the first-order measure of the applicability of the IRS in the considered communication system. We present a convergence proof and the associated complexity analysis of the proposed algorithm. Numerical results are provided to demonstrate the superiority of the proposed method compared to a known solution, and also to show the effect of different parameters of interest on the achievable secrecy rate of the IRS-assisted MIMOME WTC.

Submitted 24 August, 2021; originally announced August 2021.

arXiv:2105.11415 [pdf, other]

On the Optimality of the Stationary Solution of Secrecy Rate Maximization for MIMO Wiretap Channel

Authors: Anshu Mukherjee, Vaibhav Kumar, Eduard Jorswieck, Björn Ottersten, Le-Nam Tran

Abstract: To achieve perfect secrecy in a multiple-input multiple-output (MIMO) Gaussian wiretap channel (WTC), we need to find its secrecy capacity and optimal signaling, which involves solving a difference of convex functions program known to be non-convex for the non-degraded case. To deal with this, a class of existing solutions has been developed, but only local optimality is guaranteed by standard convergence analysis. Interestingly, our extensive numerical experiments have shown that these local optimization methods indeed achieve global optimality. In this paper, we provide an analytical proof for this observation. To achieve this, we show that the Karush-Kuhn-Tucker (KKT) conditions of the secrecy rate maximization problem admit a unique solution for both degraded and non-degraded cases. Motivated by this, we also propose a low-complexity algorithm to find a stationary point. Numerical results are presented to verify the theoretical analysis.

Submitted 24 May, 2021; originally announced May 2021.

arXiv:2102.10396 [pdf, ps, other]

Efficient Numerical Methods for Secrecy Capacity of Gaussian MIMO Wiretap Channel

Authors: Anshu Mukherjee, Björn Ottersten, Le Nam Tran

Abstract: This paper presents two different low-complexity methods for obtaining the secrecy capacity of multiple-input multiple-output (MIMO) wiretap channel subject to a sum power constraint (SPC). The challenges in deriving computationally efficient solutions to the secrecy capacity problem are due to the fact that the secrecy rate is a difference of convex functions (DC) of the transmit covariance matrix, for which its convexity is only known for the degraded case. In the first method, we capitalize on the accelerated DC algorithm, which requires solving a sequence of convex subproblems. In particular, we show that each subproblem indeed admits a water-filling solution. In the second method, based on the equivalent convex-concave reformulation of the secrecy capacity problem, we develop a so-called partial best response algorithm (PBRA). Each iteration of the PBRA is also done in closed form. Simulation results are provided to demonstrate the superior performance of the proposed methods.

Submitted 20 February, 2021; originally announced February 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2012.05667

arXiv:2012.05667 [pdf, ps, other]

On the Secrecy Capacity of MIMO Wiretap Channels: Convex Reformulation and Efficient Numerical Methods

Authors: Anshu Mukherjee, Björn Ottersten, Le-Nam Tran

Abstract: This paper presents novel numerical approaches to finding the secrecy capacity of the multiple-input multiple-output (MIMO) wiretap channel subject to multiple linear transmit covariance constraints, including sum power constraint, per antenna power constraints and interference power constraint. An analytical solution to this problem is not known and existing numerical solutions suffer from slow convergence rate and/or high per-iteration complexity. Deriving computationally efficient solutions to the secrecy capacity problem is challenging since the secrecy rate is expressed as a difference of convex functions (DC) of the transmit covariance matrix, for which its convexity is only known for some special cases. In this paper we propose two low-complexity methods to compute the secrecy capacity along with a convex reformulation for degraded channels. In the first method we capitalize on the accelerated DC algorithm which requires solving a sequence of convex subproblems, for which we propose an efficient iterative algorithm where each iteration admits a closed-form solution. In the second method, we rely on the concave-convex equivalent reformulation of the secrecy capacity problem which allows us to derive the so-called partial best response algorithm to obtain an optimal solution. Notably, each iteration of the second method can also be done in closed form. The simulation results demonstrate a faster convergence rate of our methods compared to other known solutions. We carry out extensive numerical experiments to evaluate the impact of various parameters on the achieved secrecy capacity.

Submitted 8 July, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

arXiv:2005.02704 [pdf, ps, other]

doi 10.1007/978-3-030-34869-4_45

Fast Geometric Surface based Segmentation of Point Cloud from Lidar Data

Authors: Aritra Mukherjee, Sourya Dipta Das, Jasorsi Ghosh, Ananda S. Chowdhury, Sanjoy Kumar Saha

Abstract: Mapping the environment has been an important task for robot navigation and Simultaneous Localization And Mapping (SLAM). LIDAR provides a fast and accurate 3D point cloud map of the environment which helps in map building. However, processing millions of points in the point cloud becomes a computationally expensive task. In this paper, a methodology is presented to generate the segmented surfaces in real time, and these can be used in modeling the 3D objects. First, an algorithm is proposed for efficient map building from single shot data of spinning Lidar. It is based on fast meshing and sub-sampling. It exploits the physical design and the working principle of the spinning Lidar sensor. The generated mesh surfaces are then segmented by estimating the normal and considering their homogeneity. The segmented surfaces can be used as proposals for predicting geometrically accurate models of objects in the robot's activity environment. The proposed methodology is compared with some popular point cloud segmentation methods to highlight the efficacy in terms of accuracy and speed.

Submitted 6 May, 2020; originally announced May 2020.

Comments: Accepted to PReMI 2019 (Pattern Recognition and Machine Intelligence 2019). International Conference on Pattern Recognition and Machine Intelligence. Springer, Cham, 2019

arXiv:1203.2511 [pdf]

doi 10.5121/ijasuc.2012.3105

A Simple Flood Forecasting Scheme Using Wireless Sensor Networks

Authors: Victor Seal, Arnab Raha, Shovan Maity, Souvik Kr Mitra, Amitava Mukherjee, Mrinal Kanti Naskar

Abstract: This paper presents a forecasting model designed using WSNs (Wireless Sensor Networks) to predict floods in rivers using simple and fast calculations to provide real-time results and save the lives of people who may be affected by the flood. Our prediction model uses multiple variable robust linear regression, which is easy to understand, simple and cost-effective to implement, and speed efficient, with low resource utilization, and yet provides real-time predictions with reliable accuracy, thus having features which are desirable in any real-world algorithm. Our prediction model is independent of the number of parameters, i.e. any number of parameters may be added or removed based on the on-site requirements. When the water level rises, we represent it using a polynomial whose nature is used to determine if the water level may exceed the flood line in the near future. We compare our work with a contemporary algorithm to demonstrate our improvements over it. Then we present our simulation results for the predicted water level compared to the actual water level.

Submitted 9 March, 2012; originally announced March 2012.

Comments: 16 pages, 4 figures, published in International Journal Of Ad-Hoc, Sensor And Ubiquitous Computing, February 2012; V. Seal et al., 'A Simple Flood Forecasting Scheme Using Wireless Sensor Networks', IJASUC, Feb. 2012

arXiv:1202.5692 [pdf]

doi 10.1109/PACC.2011.5979047

Adaptive Gain and Order Scheduling of Optimal Fractional Order PIλDμ Controllers with Radial Basis Function Neural-Network

Authors: Saptarshi Das, Sayan Saha, Ayan Mukherjee, Indranil Pan, Amitava Gupta

Abstract: Gain and order scheduling of fractional order (FO) PIλDμ controllers are studied in this paper considering four different classes of higher order processes. The mapping between the optimum PID/FOPID controller parameters and the reduced order process models is done using a Radial Basis Function (RBF) type Artificial Neural Network (ANN). Simulation studies have been done to show the effectiveness of the RBFNN for online scheduling of such controllers with random change in set-point and process parameters.

Submitted 25 February, 2012; originally announced February 2012.

Comments: 6 pages, 12 figures

Journal ref: Proceedings of 2011 International Conference on Process Automation, Control and Computing, PACC 2011, art. no. 5979047, July 2011, Coimbatore

arXiv:1202.5690 [pdf]

doi 10.1109/PACC.2011.5979045

Embedded Network Test-Bed for Validating Real-Time Control Algorithms to Ensure Optimal Time Domain Performance

Authors: Ayan Mukherjee, Anindya Pakhira, Saptarshi Das, Indranil Pan, Amitava Gupta

Abstract: The paper presents a Stateflow-based network test-bed to validate real-time optimal control algorithms. Genetic Algorithm (GA) based time domain performance index minimization is attempted for tuning a PI controller to handle a balanced lag and delay type First Order Plus Time Delay (FOPTD) process over a network. The tuning performance is validated on a real-time communication network with artificially simulated stochastic delay, packet loss and out-of-order packets characterizing the network.

Submitted 25 February, 2012; originally announced February 2012.

Comments: 6 pages, 12 figures

Journal ref: Proceedings of 2011 International Conference on Process Automation, Control and Computing, PACC 2011, art. no. 5979045, July 2011, Coimbatore

Showing 1–25 of 25 results for author: Mukherjee, A