Skip to main content

Showing 1–37 of 37 results for author: Dutta, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2505.18217  [pdf, other

    cs.SD cs.AI eess.AS

    ABHINAYA -- A System for Speech Emotion Recognition In Naturalistic Conditions Challenge

    Authors: Soumya Dutta, Smruthi Balaji, Varada R, Viveka Salinamakki, Sriram Ganapathy

    Abstract: Speech emotion recognition (SER) in naturalistic settings remains a challenge due to the intrinsic variability, diverse recording conditions, and class imbalance. As participants in the Interspeech Naturalistic SER Challenge which focused on these complexities, we present Abhinaya, a system integrating speech-based, text-based, and speech-text models. Our approach fine-tunes self-supervised and sp… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 5 pages, 2 figures, 4 tables, accepted at Interspeech 2025

  2. arXiv:2505.17655  [pdf, ps, other

    eess.AS cs.SD

    Audio-to-Audio Emotion Conversion With Pitch And Duration Style Transfer

    Authors: Soumya Dutta, Avni Jain, Sriram Ganapathy

    Abstract: Given a pair of source and reference speech recordings, audio-to-audio (A2A) style transfer involves the generation of an output speech that mimics the style characteristics of the reference while preserving the content and speaker attributes of the source. In this paper, we propose a novel framework, termed as A2A Zero-shot Emotion Style Transfer (A2A-ZEST), that enables the transfer of reference… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 11 pages, 9 figures, 5 tables

  3. arXiv:2503.08759  [pdf, other

    quant-ph cs.CV eess.IV

    QUIET-SR: Quantum Image Enhancement Transformer for Single Image Super-Resolution

    Authors: Siddhant Dutta, Nouhaila Innan, Khadijeh Najafi, Sadok Ben Yahia, Muhammad Shafique

    Abstract: Recent advancements in Single-Image Super-Resolution (SISR) using deep learning have significantly improved image restoration quality. However, the high computational cost of processing high-resolution images due to the large number of parameters in classical models, along with the scalability challenges of quantum algorithms for image processing, remains a major obstacle. In this paper, we propos… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 10 figures, 3 pages

  4. arXiv:2503.07934  [pdf, other

    cs.LG cs.CY eess.SY stat.ME stat.ML

    Counterfactual Explanations for Model Ensembles Using Entropic Risk Measures

    Authors: Erfaun Noorani, Pasan Dissanayake, Faisal Hamman, Sanghamitra Dutta

    Abstract: Counterfactual explanations indicate the smallest change in input that can translate to a different outcome for a machine learning model. Counterfactuals have generated immense interest in high-stakes applications such as finance, education, hiring, etc. In several use-cases, the decision-making process often relies on an ensemble of models rather than just one. Despite significant research on cou… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  5. arXiv:2501.11468  [pdf, other

    eess.AS cs.SD

    LLM supervised Pre-training for Multimodal Emotion Recognition in Conversations

    Authors: Soumya Dutta, Sriram Ganapathy

    Abstract: Emotion recognition in conversations (ERC) is challenging due to the multimodal nature of the emotion expression. In this paper, we propose to pretrain a text-based recognition model from unsupervised speech transcripts with LLM guidance. These transcriptions are obtained from a raw speech dataset with a pre-trained ASR system. A text LLM model is queried to provide pseudo-labels for these transcr… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: ICASSP 2025; 5 pages, 4 figures, 2 tables

  6. arXiv:2411.10429  [pdf, ps, other

    cs.IT cs.CR cs.LG eess.SP

    Private Counterfactual Retrieval With Immutable Features

    Authors: Shreya Meel, Pasan Dissanayake, Mohamed Nomeir, Sanghamitra Dutta, Sennur Ulukus

    Abstract: In a classification task, counterfactual explanations provide the minimum change needed for an input to be classified into a favorable class. We consider the problem of privately retrieving the exact closest counterfactual from a database of accepted samples while enforcing that certain features of the input sample cannot be changed, i.e., they are \emph{immutable}. An applicant (user) whose featu… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  7. arXiv:2411.07483  [pdf, other

    stat.ML cs.CV cs.IT cs.LG eess.IV

    Quantifying Knowledge Distillation Using Partial Information Decomposition

    Authors: Pasan Dissanayake, Faisal Hamman, Barproda Halder, Ilia Sucholutsky, Qiuyi Zhang, Sanghamitra Dutta

    Abstract: Knowledge distillation deploys complex machine learning models in resource-constrained environments by training a smaller student model to emulate internal representations of a complex teacher model. However, the teacher's representations can also encode nuisance or additional information not relevant to the downstream task. Distilling such irrelevant information can actually impede the performanc… ▽ More

    Submitted 4 April, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: Accepted at the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

  8. arXiv:2410.13812  [pdf, ps, other

    cs.IT cs.CR cs.LG eess.SP

    Private Counterfactual Retrieval

    Authors: Mohamed Nomeir, Pasan Dissanayake, Shreya Meel, Sanghamitra Dutta, Sennur Ulukus

    Abstract: Transparency and explainability are two extremely important aspects to be considered when employing black-box machine learning models in high-stake applications. Providing counterfactual explanations is one way of catering this requirement. However, this also poses a threat to the privacy of both the institution that is providing the explanation as well as the user who is requesting it. In this wo… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  9. arXiv:2409.05566  [pdf, other

    eess.AS

    Leveraging Content and Acoustic Representations for Speech Emotion Recognition

    Authors: Soumya Dutta, Sriram Ganapathy

    Abstract: Speech emotion recognition (SER), the task of identifying the expression of emotion from spoken content, is challenging due to the difficulty in extracting representations that capture emotional attributes from speech. The scarcity of labeled datasets further complicates the challenge where large models are prone to over-fitting. In this paper, we propose CARE (Content and Acoustic Representations… ▽ More

    Submitted 17 December, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: 11 pages, 5 figures, 6 tables

  10. arXiv:2407.19677  [pdf, other

    cs.CY cs.CR cs.SD eess.AS

    Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI

    Authors: Satwik Dutta, John H. L. Hansen

    Abstract: Privacy is a hot topic for policymakers across the globe, including the United States. Evolving advances in AI and emerging concerns about the misuse of personal data have pushed policymakers to draft legislation on trustworthy AI and privacy protection for its citizens. This paper presents the state of the privacy legislation at the U.S. Congress and outlines how voice data is considered as part… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 5 pages, 2 figures, accepted at the Interspeech SynData4GenAI 2024 workshop

    ACM Class: I.2; J.1

  11. arXiv:2405.12226  [pdf

    eess.IV cond-mat.dis-nn physics.med-ph

    A novel perspective on denoising using quantum localization with application to medical imaging

    Authors: Amirreza Hashemi, Sayantan Dutta, Bertrand Georgeot, Denis Kouame, Hamid Sabet

    Abstract: Background noise in many fields such as medical imaging poses significant challenges for accurate diagnosis, prompting the development of denoising algorithms. Traditional methodologies, however, often struggle to address the complexities of noisy environments in high dimensional imaging systems. This paper introduces a novel quantum-inspired approach for image denoising, drawing upon principles o… ▽ More

    Submitted 30 January, 2025; v1 submitted 22 April, 2024; originally announced May 2024.

  12. arXiv:2405.01040  [pdf, other

    cs.CV cs.CL eess.IV

    Few Shot Class Incremental Learning using Vision-Language models

    Authors: Anurag Kumar, Chinmay Bharti, Saikat Dutta, Srikrishna Karanam, Biplab Banerjee

    Abstract: Recent advancements in deep learning have demonstrated remarkable performance comparable to human capabilities across various supervised computer vision tasks. However, the prevalent assumption of having an extensive pool of training data encompassing all classes prior to model training often diverges from real-world scenarios, where limited data availability for novel classes is the norm. The cha… ▽ More

    Submitted 15 August, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2404.05581  [pdf, other

    cs.RO eess.SY

    Design and Simulation of Time-energy Optimal Anti-swing Trajectory Planner for Autonomous Tower Cranes

    Authors: Souravik Dutta, Yiyu Cai

    Abstract: For autonomous crane lifting, optimal trajectories of the crane are required as reference inputs to the crane controller to facilitate feedforward control. Reducing the unactuated payload motion is a crucial issue for under-actuated tower cranes with spherical pendulum dynamics. The planned trajectory should be optimal in terms of both operating time and energy consumption, to facilitate optimum o… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 18 pages, 12 figures, 9 tables

  14. Automatic Tuning of Denoising Algorithms Parameters Without Ground Truth

    Authors: Arthur Floquet, Sayantan Dutta, Emmanuel Soubies, Duong Hung Pham, Denis Kouame

    Abstract: Denoising is omnipresent in image processing. It is usually addressed with algorithms relying on a set of hyperparameters that control the quality of the recovered image. Manual tuning of those parameters can be a daunting task, which calls for the development of automatic tuning methods. Given a denoising algorithm, the best set of parameters is the one that minimizes the error between denoised a… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  15. arXiv:2401.04511  [pdf, other

    eess.AS cs.LG cs.SD

    Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement

    Authors: Soumya Dutta, Sriram Ganapathy

    Abstract: The problem of audio-to-audio (A2A) style transfer involves replacing the style features of the source audio with those from the target audio while preserving the content related attributes of the source audio. In this paper, we propose an efficient approach, termed as Zero-shot Emotion Style Transfer (ZEST), that allows the transfer of emotional content present in the given source audio with the… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 5 pages, 3 figures, accepted at ICASSP 2024

  16. Quantum Algorithm for Signal Denoising

    Authors: Sayantan Dutta, Adrian Basarab, Denis Kouamé, Bertrand Georgeot

    Abstract: This letter presents a novel \textit{quantum algorithm} for signal denoising, which performs a thresholding in the frequency domain through amplitude amplification and using an adaptive threshold determined by local mean values. The proposed algorithm is able to process \textit{both classical and quantum} signals. It is parametrically faster than previous classical and quantum denoising algorithms… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figurs

    Journal ref: IEEE Signal Processing Letters, 2023

  17. arXiv:2310.19052  [pdf

    cs.SD cs.AI eess.AS

    Exploring the Emotional Landscape of Music: An Analysis of Valence Trends and Genre Variations in Spotify Music Data

    Authors: Shruti Dutta, Shashwat Mookherjee

    Abstract: This paper conducts an intricate analysis of musical emotions and trends using Spotify music data, encompassing audio features and valence scores extracted through the Spotipi API. Employing regression modeling, temporal analysis, mood transitions, and genre investigation, the study uncovers patterns within music-emotion relationships. Regression models linear, support vector, random forest, and r… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 6 pages, Accepted at the 18th International Conference for Internet Technology and Secured Transactions, 13-15 November, 2023, St Anne's College, Oxford, UK

  18. arXiv:2310.12820  [pdf

    eess.SY

    A Centralized Voltage Controller for Offshore Wind Plants: NY State Grid Case Study

    Authors: Lin Zhu, Bruno Leonardi, Aboutaleb Haddadi, Sudipta Dutta, Alberto Del Rosso, Victor Paduani, Hossein Hooshyar

    Abstract: This paper proposes a centralized multi-plant reactive power and voltage controller to support voltage control in the interconnected onshore power system. This controller utilizes a hierarchical control structure consisting of a master controller and multiple slave controllers. To validate the proposed method, a realistic planning case of the New York State grid is created for the year 2035, in wh… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 5 pages, 13 figures, conference paper, IEEE PES Innovative Smart Grid Technologies - Latin America, San Juan, Puerto Rico, Nov. 6-9, 2023

  19. arXiv:2308.10636  [pdf, other

    eess.IV cs.CV

    Automated Identification of Failure Cases in Organ at Risk Segmentation Using Distance Metrics: A Study on CT Data

    Authors: Amin Honarmandi Shandiz, Attila Rádics, Rajesh Tamada, Makk Árpád, Karolina Glowacka, Lehel Ferenczi, Sandeep Dutta, Michael Fanariotis

    Abstract: Automated organ at risk (OAR) segmentation is crucial for radiation therapy planning in CT scans, but the generated contours by automated models can be inaccurate, potentially leading to treatment planning issues. The reasons for these inaccuracies could be varied, such as unclear organ boundaries or inaccurate ground truth due to annotation errors. To improve the model's performance, it is necess… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 11 pages, 5 figures, 2 tables

  20. arXiv:2305.18745  [pdf, other

    cs.RO eess.SY

    Multi-objective Anti-swing Trajectory Planning of Double-pendulum Tower Crane Operations using Opposition-based Evolutionary Algorithm

    Authors: Souravik Dutta, Yiyu Cai, Jianmin Zheng

    Abstract: Underactuated tower crane lifting requires time-energy optimal trajectories for the trolley/slew operations and reduction of the unactuated swings resulting from the trolley/jib motion. In scenarios involving non-negligible hook mass or long rig-cable, the hook-payload unit exhibits double-pendulum behaviour, making the problem highly challenging. This article introduces an offline multi-objective… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 14 pages, 14 figures, 6 tables

  21. arXiv:2304.06910  [pdf, other

    eess.AS cs.CL cs.SD

    HCAM -- Hierarchical Cross Attention Model for Multi-modal Emotion Recognition

    Authors: Soumya Dutta, Sriram Ganapathy

    Abstract: Emotion recognition in conversations is challenging due to the multi-modal nature of the emotion expression. We propose a hierarchical cross-attention model (HCAM) approach to multi-modal emotion recognition using a combination of recurrent and co-attention neural network models. The input to the model consists of two modalities, i) audio data, processed through a learnable wav2vec approach and, i… ▽ More

    Submitted 9 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 11 pages, 6 figures

  22. arXiv:2304.02692  [pdf, other

    math.OC eess.SY

    A Unified Approach to Optimally Solving Sensor Scheduling and Sensor Selection Problems in Kalman Filtering

    Authors: Shamak Dutta, Nils Wilde, Stephen L. Smith

    Abstract: We consider a general form of the sensor scheduling problem for state estimation of linear dynamical systems, which involves selecting sensors that minimize the trace of the Kalman filter error covariance (weighted by a positive semidefinite matrix) subject to polyhedral constraints on the selected sensors. This general form captures several well-studied problems including sensor placement, sensor… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  23. arXiv:2301.00247  [pdf, other

    eess.IV

    DIVA: Deep Unfolded Network from Quantum Interactive Patches for Image Restoration

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: This paper presents a deep neural network called DIVA unfolding a baseline adaptive denoising algorithm (De-QuIP), relying on the theory of quantum many-body physics. Furthermore, it is shown that with very slight modifications, this network can be enhanced to solve more challenging image restoration tasks such as image deblurring, super-resolution and inpainting. Despite a compact and interpretab… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 18 pages, 18 figures; complements and expands https://ieeexplore.ieee.org/abstract/document/9897959 and https://ieeexplore.ieee.org/abstract/document/9958691

  24. arXiv:2209.07259  [pdf

    physics.ins-det eess.SY

    Design of a Strong-Arm Dynamic-Latch based comparator with high speed, low power and low offset for SAR-ADC

    Authors: Sounak Dutta

    Abstract: Comparators are utilised by Nyquist-rate and oversampling analog to digital converters (ADCs) to accomplish quantization and perhaps sampling. Thus, comparators have a substantial effect on the speed and accuracy of ADCs. This study provides a revised design for a dynamic-latch-based comparator that achieves the lowest latency, maximum area-efficient realisation, reduced power dissipation, and low… ▽ More

    Submitted 25 October, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: 5 pages, 3 figures

  25. arXiv:2204.09571  [pdf, other

    eess.SY

    Informative Path Planning in Random Fields via Mixed Integer Programming

    Authors: Shamak Dutta, Nils Wilde, Stephen L. Smith

    Abstract: We present a new mixed integer formulation for the discrete informative path planning problem in random fields. The objective is to compute a budget constrained path while collecting measurements whose linear estimate results in minimum error over a finite set of prediction locations. The problem is known to be NP-hard. However, we strive to compute optimal solutions by leveraging advances in mixe… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

  26. arXiv:2203.00845  [pdf, other

    eess.IV cs.AI cs.CV

    Can No-reference features help in Full-reference image quality estimation?

    Authors: Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah

    Abstract: Development of perceptual image quality assessment (IQA) metrics has been of significant interest to computer vision community. The aim of these metrics is to model quality of an image as perceived by humans. Recent works in Full-reference IQA research perform pixelwise comparison between deep features corresponding to query and reference images for quality prediction. However, pixelwise feature c… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Code to be updated on: https://github.com/saikatdutta/nr-in-friqa

  27. A Novel Image Denoising Algorithm Using Concepts of Quantum Many-Body Theory

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: Sparse representation of real-life images is a very effective approach in imaging applications, such as denoising. In recent years, with the growth of computing power, data-driven strategies exploiting the redundancy within patches extracted from one or several images to increase sparsity have become more prominent. This paper presents a novel image denoising algorithm exploiting such an image-dep… ▽ More

    Submitted 24 August, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: 24 pages, 14 figures; complements and expands arXiv:2108.13778

    Journal ref: Signal Processing, Volume 201, 2022, 108690

  28. Image Denoising Inspired by Quantum Many-Body physics

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: Decomposing an image through Fourier, DCT or wavelet transforms is still a common approach in digital image processing, in number of applications such as denoising. In this context, data-driven dictionaries and in particular exploiting the redundancy withing patches extracted from one or several images allowed important improvements. This paper proposes an original idea of constructing such an ima… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: 5 pages, 4 figures

    Journal ref: IEEE International Conference on Image Processing (ICIP 2021)

  29. Plug-and-Play Quantum Adaptive Denoiser for Deconvolving Poisson Noisy Images

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: A new Plug-and-Play (PnP) alternating direction of multipliers (ADMM) scheme is proposed in this paper, by embedding a recently introduced adaptive denoiser using the Schroedinger equation's solutions of quantum physics. The potential of the proposed model is studied for Poisson image deconvolution, which is a common problem occurring in number of imaging applications, such as limited photon acqui… ▽ More

    Submitted 20 October, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: 22 pages, 13 figures; complements and expands arXiv:2010.09321

    Journal ref: IEEE Access, vol. 9, pp. 139771-139791, 2021

  30. arXiv:2105.08819  [pdf, other

    eess.IV cs.CV cs.LG

    Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Sheng Chen, Xin Xia, Zhaoyan Liu, Yuwei Zhang, Feng Zhu, Jiashi Li, Xuefeng Xiao, Yuan Tian, Xinglong Wu, Christos Kyrkou, Yixin Chen, Zexin Zhang, Yunbo Peng, Yue Lin, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Himanshu Kumar, Chao Ge, Pei-Lin Wu, Jin-Hua Du, Andrew Batutin , et al. (6 additional authors not shown)

    Abstract: Camera scene detection is among the most popular computer vision problem on smartphones. While many custom solutions were developed for this task by phone vendors, none of the designed models were available publicly up until now. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop quantized deep learning-based camera scene classification solutions th… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.08630; text overlap with arXiv:2105.07825, arXiv:2105.07809, arXiv:2105.08629

  31. arXiv:2104.05778  [pdf, other

    eess.IV cs.CV

    Efficient Space-time Video Super Resolution using Low-Resolution Flow and Mask Upsampling

    Authors: Saikat Dutta, Nisarg A. Shah, Anurag Mittal

    Abstract: This paper explores an efficient solution for Space-time Super-Resolution, aiming to generate High-resolution Slow-motion videos from Low Resolution and Low Frame rate videos. A simplistic solution is the sequential running of Video Super Resolution and Video Frame interpolation models. However, this type of solutions are memory inefficient, have high inference time, and could not make the proper… ▽ More

    Submitted 8 June, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted at NTIRE Workshop, CVPR 2021. Code and models: https://github.com/saikatdutta/FMU_STSR

  32. arXiv:2011.04988  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Rendering Realistic Bokeh

    Authors: Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zijin Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan , et al. (10 additional authors not shown)

    Abstract: This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world bokeh simulation problem, where the goal was to learn a realistic shallow focus technique using a large-scale EBB! bokeh dataset consisting of 5K shallow / wide depth-of-field image pairs captured using th… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Published in ECCV 2020 Workshop (Advances in Image Manipulation), https://data.vision.ee.ethz.ch/cvl/aim20/

  33. arXiv:2010.09321  [pdf, other

    eess.IV eess.SP

    Poisson Image Deconvolution by a Plug-and-Play Quantum Denoising Scheme

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: This paper introduces a new Plug-and-Play (PnP) alternating direction of multipliers (ADMM) scheme based on a recently proposed denoiser using the Schroedinger equation's solutions of quantum physics. The efficiency of the proposed algorithm is evaluated for Poisson image deconvolution, which is very common for imaging applications, such as, for example, limited photon acquisition. Numerical resul… ▽ More

    Submitted 10 May, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 5 pages, 2 figures

  34. arXiv:2005.14214  [pdf, other

    cs.CV cs.AI eess.IV

    Depth-aware Blending of Smoothed Images for Bokeh Effect Generation

    Authors: Saikat Dutta

    Abstract: Bokeh effect is used in photography to capture images where the closer objects look sharp and every-thing else stays out-of-focus. Bokeh photos are generally captured using Single Lens Reflex cameras using shallow depth-of-field. Most of the modern smartphones can take bokeh images by leveraging dual rear cameras or a good auto-focus hardware. However, for smartphones with single-rear camera witho… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    Journal ref: Journal of Visual Communication and Image Representation 2021

  35. Quantum mechanics-based signal and image representation: application to denoising

    Authors: Sayantan Dutta, Adrian Basarab, Bertrand Georgeot, Denis Kouamé

    Abstract: Decomposition of digital signals and images into other basis or dictionaries than time or space domains is a very common approach in signal and image processing and analysis. Such a decomposition is commonly obtained using fixed transforms (e.g., Fourier or wavelet) or dictionaries learned from example databases or from the signal or image itself. In this work, we investigate in detail a new appro… ▽ More

    Submitted 16 March, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: 17 pages, 18 figures; complements and expands arXiv:1802.02358

    Journal ref: IEEE Open Journal of Signal Processing, 2, 190-206 (2021)

  36. arXiv:2001.02034  [pdf, other

    eess.SP

    Energy and Latency of Beamforming Architectures for Initial Access in mmWave Wireless Networks

    Authors: C. Nicolas Barati, Sourjya Dutta, Sundeep Rangan, Ashutosh Sabharwal

    Abstract: Future millimeter-wave (mmWave) systems, 5G cellular or WiFi, must rely on highly directional links to overcome severe pathloss in these frequency bands. Establishing such links requires the mutual discovery of the transmitter and the receiver %in the angular domain potentially leading to a large latency and high energy consumption. In this work, we show that both the discovery latency and energy… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: 30 pages, 11 figures, submitted to Journal of the Indian Institute of Science: Special Issue on 5G

  37. arXiv:1908.08966  [pdf, other

    eess.SP

    Power Efficient Discontinuous Reception in THz and mmWave Wireless Systems

    Authors: Syed Hashim Ali Shah, Sundar Aditya, Sourjya Dutta, Christopher Slezak, Sundeep Rangan

    Abstract: Discontinuous reception (DRX), where a user equip-ment (UE) temporarily disables its receiver, is a critical power saving feature in modern cellular systems. DRX is likely tobe particularly aggressively used in the mmWave and THzfrequencies due to the high front end power consumption. A keychallenge of DRX in these frequencies is that individual links are directional and highly susceptible to bloc… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: The paper has been accepted and presented at IEEE SPAWC 2019. It is yet to be published on IEEE Xplore