Skip to main content

Showing 1–25 of 25 results for author: Mukherjee, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2508.20649  [pdf, ps, other

    cs.LG eess.SY

    Physics-Constrained Machine Learning for Chemical Engineering

    Authors: Angan Mukherjee, Victor M. Zavala

    Abstract: Physics-constrained machine learning (PCML) combines physical models with data-driven approaches to improve reliability, generalizability, and interpretability. Although PCML has shown significant benefits in diverse scientific and engineering domains, technical and intellectual challenges hinder its applicability in complex chemical engineering applications. Key difficulties include determining t… ▽ More

    Submitted 28 August, 2025; originally announced August 2025.

  2. arXiv:2508.03327  [pdf, ps, other

    eess.SP

    Quantum Deep Learning for Massive MIMO User Scheduling

    Authors: Xingyu Huang, Ruining Fan, Mouli Chakraborty, Avishek Nag, Anshu Mukherjee

    Abstract: We introduce a hybrid Quantum Neural Networks (QNN) architecture for the efficient user scheduling in 5G/Beyond 5G (B5G) massive Multiple Input Multiple Output (MIMO) systems, addressing the scalability issues of traditional methods. By leveraging statistical Channel State Information (CSI), our model reduces computational overhead and enhances spectral efficiency. It integrates classical neural n… ▽ More

    Submitted 5 August, 2025; originally announced August 2025.

  3. arXiv:2507.23695  [pdf, ps, other

    eess.SP

    On the Achievable Rate of Satellite Quantum Communication Channel using Deep Autoencoder Gaussian Mixture Model

    Authors: Mouli Chakraborty, Subhash Chandra, Avishek Nag, Anshu Mukherjee

    Abstract: We present a comparative study of the Gaussian mixture model (GMM) and the Deep Autoencoder Gaussian Mixture Model (DAGMM) for estimating satellite quantum channel capacity, considering hybrid quantum noise (HQN) and transmission constraints. While GMM is simple and interpretable, DAGMM better captures non-linear variations and noise distributions. Simulations show that DAGMM provides tighter capa… ▽ More

    Submitted 31 July, 2025; originally announced July 2025.

  4. arXiv:2506.01737  [pdf, ps, other

    cs.NE eess.SP

    The Promise of Spiking Neural Networks for Ubiquitous Computing: A Survey and New Perspectives

    Authors: Hemanth Sabbella, Archit Mukherjee, Thivya Kandappu, Sounak Dey, Arpan Pal, Archan Misra, Dong Ma

    Abstract: Spiking neural networks (SNNs) have emerged as a class of bio -inspired networks that leverage sparse, event-driven signaling to achieve low-power computation while inherently modeling temporal dynamics. Such characteristics align closely with the demands of ubiquitous computing systems, which often operate on resource-constrained devices while continuously monitoring and processing time-series se… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 50 pages

    ACM Class: I.2

  5. arXiv:2505.19233  [pdf, ps, other

    cs.CV cs.AI cs.MM eess.IV

    RAISE: Realness Assessment for Image Synthesis and Evaluation

    Authors: Aniruddha Mukherjee, Spriha Dubey, Somdyuti Paul

    Abstract: The rapid advancement of generative AI has enabled the creation of highly photorealistic visual content, offering practical substitutes for real images and videos in scenarios where acquiring real data is difficult or expensive. However, reliably substituting real visual content with AI-generated counterparts requires robust assessment of the perceived realness of AI-generated visual content, a ch… ▽ More

    Submitted 3 August, 2025; v1 submitted 25 May, 2025; originally announced May 2025.

  6. arXiv:2505.11572  [pdf, ps, other

    cs.SD cs.CL eess.AS

    ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems

    Authors: Anand Rai, Satyam Rahangdale, Utkarsh Anand, Animesh Mukherjee

    Abstract: Automatic Speech Recognition (ASR) systems have become ubiquitous in everyday applications, yet significant disparities in performance across diverse demographic groups persist. In this work, we introduce the ASR-FAIRBENCH leaderboard which is designed to assess both the accuracy and equity of ASR models in real-time. Leveraging the Meta's Fair-Speech dataset, which captures diverse demographic ch… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Paper accepted at INTERSPEECH 2025

  7. arXiv:2501.07590  [pdf

    physics.ins-det eess.SY physics.optics physics.space-ph

    Ultrafast pulsed laser evaluation of Single Event Transients in opto-couplers

    Authors: Kavin Dave, Aditya Mukherjee, Hari Shanker Gupta, Deepak Jain, Shalabh Gupta

    Abstract: We build a 1064 nm fiber laser system-based testing facility for emulating SETs in different electronics components and ICs. Using these facilities, we tested the 4N35 optocoupler to observe SETs for the first time.

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted in CLEO 2023, San Jose, USA and CLEO 2024, North Carolina, USA for in poster presentation. However due to lack of funds, we could not travel

  8. arXiv:2411.12719  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation

    Authors: Praveen Srinivasa Varadhan, Amogh Gulati, Ashwin Sankar, Srija Anand, Anirudh Gupta, Anirudh Mukherjee, Shiva Kumar Marepally, Ankur Bhatia, Saloni Jaju, Suvrat Bhooshan, Mitesh M. Khapra

    Abstract: Despite rapid advancements in TTS models, a consistent and robust human evaluation framework is still lacking. For example, MOS tests fail to differentiate between similar models, and CMOS's pairwise comparisons are time-intensive. The MUSHRA test is a promising alternative for evaluating multiple TTS systems simultaneously, but in this work we show that its reliance on matching human reference sp… ▽ More

    Submitted 26 May, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

    Comments: Accepted in TMLR

  9. arXiv:2410.16712  [pdf, other

    cs.SD cs.CL eess.AS

    DENOASR: Debiasing ASRs through Selective Denoising

    Authors: Anand Kumar Rai, Siddharth D Jaiswal, Shubham Prakash, Bendi Pragnya Sree, Animesh Mukherjee

    Abstract: Automatic Speech Recognition (ASR) systems have been examined and shown to exhibit biases toward particular groups of individuals, influenced by factors such as demographic traits, accents, and speech styles. Noise can disproportionately impact speakers with certain accents, dialects, or speaking styles, leading to biased error rates. In this work, we introduce a novel framework DENOASR, which is… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: Paper accepted at IEEE ICKG 2024

  10. arXiv:2410.15418  [pdf, other

    eess.SP

    A Hybrid Noise Approach to Modelling of Free-Space Satellite Quantum Communication Channel for Continuous-Variable QKD

    Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

    Abstract: This paper significantly advances the application of Quantum Key Distribution (QKD) in Free- Space Optics (FSO) satellite-based quantum communication. We propose an innovative satellite quantum channel model and derive the secret quantum key distribution rate achievable through this channel. Unlike existing models that approximate the noise in quantum channels as merely Gaussian distributed, our m… ▽ More

    Submitted 20 October, 2024; originally announced October 2024.

  11. arXiv:2409.04746  [pdf, other

    eess.SP

    Hybrid Quantum Noise Approximation and Pattern Analysis on Parameterized Component Distributions

    Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

    Abstract: Noise is a vital factor in determining the accuracy of processing the information of the quantum channel. One must consider classical noise effects associated with quantum noise sources for more realistic modelling of quantum channels. A hybrid quantum noise model incorporating both quantum Poisson noise and classical additive white Gaussian noise (AWGN) can be interpreted as an infinite mixture o… ▽ More

    Submitted 7 September, 2024; originally announced September 2024.

  12. arXiv:2404.08993  [pdf, other

    eess.SP

    An Unsupervised Machine Learning to Optimize Hybrid Quantum Noise Clusters for Gaussian Quantum Channel

    Authors: Mouli Chakraborty, Anshu Mukherjee, Ioannis Krikidis, Avishek Nag, Subhash Chandra

    Abstract: This work focuses on optimizing the hybrid quantum noise model to improve the capacity of Gaussian quantum channels using Machine Learning (ML) generated clusters. The work specifically leverages Gaussian Mixture Model (GMM) and the Expectation-Maximization (EM) algorithm to model the complex noise characteristics of quantum channels. Hybrid quantum noise, which includes both quantum shot noise an… ▽ More

    Submitted 23 January, 2025; v1 submitted 13 April, 2024; originally announced April 2024.

  13. arXiv:2402.10119  [pdf, other

    eess.SY math.OC

    Physics-Informed Neural Network Policy Iteration: Algorithms, Convergence, and Verification

    Authors: Yiming Meng, Ruikun Zhou, Amartya Mukherjee, Maxwell Fitzsimmons, Christopher Song, Jun Liu

    Abstract: Solving nonlinear optimal control problems is a challenging task, particularly for high-dimensional problems. We propose algorithms for model-based policy iterations to solve nonlinear optimal control problems with convergence guarantees. The main component of our approach is an iterative procedure that utilizes neural approximations to solve linear partial differential equations (PDEs), ensuring… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2312.07601  [pdf, other

    eess.SP cs.LG

    Non-contact Multimodal Indoor Human Monitoring Systems: A Survey

    Authors: Le Ngu Nguyen, Praneeth Susarla, Anirban Mukherjee, Manuel Lage Cañellas, Constantino Álvarez Casado, Xiaoting Wu, Olli~Silvén, Dinesh Babu Jayagopi, Miguel Bordallo López

    Abstract: Indoor human monitoring systems leverage a wide range of sensors, including cameras, radio devices, and inertial measurement units, to collect extensive data from users and the environment. These sensors contribute diverse data modalities, such as video feeds from cameras, received signal strength indicators and channel state information from WiFi devices, and three-axis acceleration data from ine… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 19 pages, 5 figures

  15. arXiv:2310.00216  [pdf, other

    eess.SP cs.SD eess.AS

    A Novel U-Net Architecture for Denoising of Real-world Noise Corrupted Phonocardiogram Signal

    Authors: Ayan Mukherjee, Rohan Banerjee, Avik Ghose

    Abstract: The bio-acoustic information contained within heart sound signals are utilized by physicians world-wide for auscultation purpose. However, the heart sounds are inherently susceptible to noise contamination. Various sources of noises like lung sound, coughing, sneezing, and other background noises are involved in such contamination. Such corruption of the heart sound signal often leads to inconclus… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  16. arXiv:2304.03572  [pdf, other

    eess.IV cs.CV cs.LG

    Weakly supervised segmentation with point annotations for histopathology images via contrast-based variational model

    Authors: Hongrun Zhang, Liam Burrows, Yanda Meng, Declan Sculthorpe, Abhik Mukherjee, Sarah E Coupland, Ke Chen, Yalin Zheng

    Abstract: Image segmentation is a fundamental task in the field of imaging and vision. Supervised deep learning for segmentation has achieved unparalleled success when sufficient training data with annotated labels are available. However, annotation is known to be expensive to obtain, especially for histopathology images where the target regions are usually with high morphology variations and irregular shap… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR2023

  17. arXiv:2209.00581  [pdf, other

    eess.SP

    On the Energy-Efficiency Maximization for IRS-Assisted MIMOME Wiretap Channels

    Authors: Anshu Mukherjee, Vaibhav Kumar, Derrick Wing Kwan Ng, Le-Nam Tran

    Abstract: Security and energy efficiency have become crucial features in the modern-era wireless communication. In this paper, we consider an energy-efficient design for intelligent reflecting surface (IRS)-assisted multiple-input multiple-output multiple-eavesdropper (MIMOME) wiretap channels (WTC). Our objective is to jointly optimize the transmit covariance matrix and the IRS phase-shifts to maximize the… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

    Comments: 6 pages, 7 figures

    Journal ref: IEEE 96th Vehicular Technology Conference: VTC2022-Fall

  18. arXiv:2108.10688  [pdf, other

    cs.IT eess.SP

    Secrecy Rate Maximization for Intelligent Reflecting Surface Assisted MIMOME Wiretap Channels

    Authors: Anshu Mukherjee, Vaibhav Kumar, Le-Nam Tran

    Abstract: Intelligent reflecting surface (IRS) has gained tremendous attention recently as a disruptive technology for beyond 5G networks. In this paper, we consider the problem of secrecy rate maximization for an IRS-assisted Gaussian multiple-input multiple-output multi-antenna-eavesdropper (MIMOME) wiretap channel (WTC). In this context, we aim to jointly optimize the input covariance matrix and the IRS… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

  19. arXiv:2105.11415  [pdf, other

    cs.IT eess.SP

    On the Optimality of the Stationary Solution of Secrecy Rate Maximization for MIMO Wiretap Channel

    Authors: Anshu Mukherjee, Vaibhav Kumar, Eduard Jorswieck, Björn Ottersten, Le-Nam Tran

    Abstract: To achieve perfect secrecy in a multiple-input multiple-output (MIMO) Gaussian wiretap channel (WTC), we need to find its secrecy capacity and optimal signaling, which involves solving a difference of convex functions program known to be non-convex for the non-degraded case. To deal with this, a class of existing solutions have been developed but only local optimality is guaranteed by standard con… ▽ More

    Submitted 24 May, 2021; originally announced May 2021.

  20. arXiv:2102.10396  [pdf, ps, other

    cs.IT eess.SP

    Efficient Numerical Methods for Secrecy Capacity of Gaussian MIMO Wiretap Channel

    Authors: Anshu Mukherjee, Björn Ottersten, Le Nam Tran

    Abstract: This paper presents two different low-complexity methods for obtaining the secrecy capacity of multiple-input multiple-output (MIMO) wiretap channel subject to a sum power constraint (SPC). The challenges in deriving computationally efficient solutions to the secrecy capacity problem are due to the fact that the secrecy rate is a difference of convex functions (DC) of the transmit covariance matri… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.05667

  21. arXiv:2012.05667  [pdf, ps, other

    cs.IT eess.SP

    On the Secrecy Capacity of MIMO Wiretap Channels: Convex Reformulation and Efficient Numerical Methods

    Authors: Anshu Mukherjee, Björn Ottersten, Le-Nam Tran

    Abstract: This paper presents novel numerical approaches to finding the secrecy capacity of the multiple-input multiple-output (MIMO) wiretap channel subject to multiple linear transmit covariance constraints, including sum power constraint, per antenna power constraints and interference power constraint. An analytical solution to this problem is not known and existing numerical solutions suffer from slow c… ▽ More

    Submitted 8 July, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  22. Fast Geometric Surface based Segmentation of Point Cloud from Lidar Data

    Authors: Aritra Mukherjee, Sourya Dipta Das, Jasorsi Ghosh, Ananda S. Chowdhury, Sanjoy Kumar Saha

    Abstract: Mapping the environment has been an important task for robot navigation and Simultaneous Localization And Mapping (SLAM). LIDAR provides a fast and accurate 3D point cloud map of the environment which helps in map building. However, processing millions of points in the point cloud becomes a computationally expensive task. In this paper, a methodology is presented to generate the segmented surfaces… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: Accepted to PReMI 2019( Pattern Recognition and Machine Intelligence 2019). International Conference on Pattern Recognition and Machine Intelligence. Springer, Cham, 2019

  23. arXiv:1203.2511  [pdf

    cs.LG cs.CE cs.NI eess.SY stat.AP

    A Simple Flood Forecasting Scheme Using Wireless Sensor Networks

    Authors: Victor Seal, Arnab Raha, Shovan Maity, Souvik Kr Mitra, Amitava Mukherjee, Mrinal Kanti Naskar

    Abstract: This paper presents a forecasting model designed using WSNs (Wireless Sensor Networks) to predict flood in rivers using simple and fast calculations to provide real-time results and save the lives of people who may be affected by the flood. Our prediction model uses multiple variable robust linear regression which is easy to understand and simple and cost effective in implementation, is speed effi… ▽ More

    Submitted 9 March, 2012; originally announced March 2012.

    Comments: 16 pages, 4 figures, published in International Journal Of Ad-Hoc, Sensor And Ubiquitous Computing, February 2012; V. seal et al, 'A Simple Flood Forecasting Scheme Using Wireless Sensor Networks', IJASUC, Feb.2012

  24. Adaptive Gain and Order Scheduling of Optimal Fractional Order PIλDμ Controllers with Radial Basis Function Neural-Network

    Authors: Saptarshi Das, Sayan Saha, Ayan Mukherjee, Indranil Pan, Amitava Gupta

    Abstract: Gain and order scheduling of fractional order (FO) PIλDμ controllers are studied in this paper considering four different classes of higher order processes. The mapping between the optimum PID/FOPID controller parameters and the reduced order process models are done using Radial Basis Function (RBF) type Artificial Neural Network (ANN). Simulation studies have been done to show the effectiveness o… ▽ More

    Submitted 25 February, 2012; originally announced February 2012.

    Comments: 6 pages, 12 figures

    Journal ref: Proceedings of 2011 International Conference on Process Automation, Control and Computing, PACC 2011, art. no. 5979047, July 2011, Coimbatore

  25. Embedded Network Test-Bed for Validating Real-Time Control Algorithms to Ensure Optimal Time Domain Performance

    Authors: Ayan Mukherjee, Anindya Pakhira, Saptarshi Das, Indranil Pan, Amitava Gupta

    Abstract: The paper presents a Stateflow based network test-bed to validate real-time optimal control algorithms. Genetic Algorithm (GA) based time domain performance index minimization is attempted for tuning of PI controller to handle a balanced lag and delay type First Order Plus Time Delay (FOPTD) process over network. The tuning performance is validated on a real-time communication network with artific… ▽ More

    Submitted 25 February, 2012; originally announced February 2012.

    Comments: 6 pages, 12 figures

    Journal ref: Proceedings of 2011 International Conference on Process Automation, Control and Computing, PACC 2011, art. no. 5979045, July 2011, Coimbatore