Skip to main content

Showing 1–6 of 6 results for author: Premananth, G

.
  1. arXiv:2505.16044  [pdf, ps, other

    eess.AS cs.LG eess.IV eess.SP

    Multimodal Biomarkers for Schizophrenia: Towards Individual Symptom Severity Estimation

    Authors: Gowtham Premananth, Philip Resnik, Sonia Bansal, Deanna L. Kelly, Carol Espy-Wilson

    Abstract: Studies on schizophrenia assessments using deep learning typically treat it as a classification task to detect the presence or absence of the disorder, oversimplifying the condition and reducing its clinical applicability. This traditional approach overlooks the complexity of schizophrenia, limiting its practical value in healthcare settings. This study shifts the focus to individual symptom sever… ▽ More

    Submitted 4 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted to be presented at Interspeech 2025

  2. arXiv:2505.15965  [pdf, ps, other

    eess.AS eess.SP

    Analyzing the Impact of Accent on English Speech: Acoustic and Articulatory Perspectives

    Authors: Gowtham Premananth, Vinith Kugathasan, Carol Espy-Wilson

    Abstract: Advancements in AI-driven speech-based applications have transformed diverse industries ranging from healthcare to customer service. However, the increasing prevalence of non-native accented speech in global interactions poses significant challenges for speech-processing systems, which are often trained on datasets dominated by native speech. This study investigates accented English speech through… ▽ More

    Submitted 4 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted to be presented at Interspeech 2025

  3. arXiv:2411.06033  [pdf, other

    eess.AS

    Speech-Based Estimation of Schizophrenia Severity Using Feature Fusion

    Authors: Gowtham Premananth, Carol Espy-Wilson

    Abstract: Speech-based assessment of the schizophrenia spectrum has been widely researched over in the recent past. In this study, we develop a deep learning framework to estimate schizophrenia severity scores from speech using a feature fusion approach that fuses articulatory features with different self-supervised speech features extracted from pre-trained audio models. We also propose an auto-encoder-bas… ▽ More

    Submitted 20 November, 2024; v1 submitted 8 November, 2024; originally announced November 2024.

    Comments: Submitted to ICASSP-SPADE workshop 2025

  4. arXiv:2409.09733  [pdf, other

    eess.AS cs.SD eess.SP

    Self-supervised Multimodal Speech Representations for the Assessment of Schizophrenia Symptoms

    Authors: Gowtham Premananth, Carol Espy-Wilson

    Abstract: Multimodal schizophrenia assessment systems have gained traction over the last few years. This work introduces a schizophrenia assessment system to discern between prominent symptom classes of schizophrenia and predict an overall schizophrenia severity score. We develop a Vector Quantized Variational Auto-Encoder (VQ-VAE) based Multimodal Representation Learning (MRL) model to produce task-agnosti… ▽ More

    Submitted 17 November, 2024; v1 submitted 15 September, 2024; originally announced September 2024.

  5. A Multimodal Framework for the Assessment of the Schizophrenia Spectrum

    Authors: Gowtham Premananth, Yashish M. Siriwardena, Philip Resnik, Sonia Bansal, Deanna L. Kelly, Carol Espy-Wilson

    Abstract: This paper presents a novel multimodal framework to distinguish between different symptom classes of subjects in the schizophrenia spectrum and healthy controls using audio, video, and text modalities. We implemented Convolution Neural Network and Long Short Term Memory based unimodal models and experimented on various multimodal fusion approaches to come up with the proposed framework. We utilize… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to be presented at Interspeech 2024

  6. arXiv:2309.15136  [pdf, other

    eess.SP cs.MM cs.SD eess.AS eess.IV

    A multi-modal approach for identifying schizophrenia using cross-modal attention

    Authors: Gowtham Premananth, Yashish M. Siriwardena, Philip Resnik, Carol Espy-Wilson

    Abstract: This study focuses on how different modalities of human communication can be used to distinguish between healthy controls and subjects with schizophrenia who exhibit strong positive symptoms. We developed a multi-modal schizophrenia classification system using audio, video, and text. Facial action units and vocal tract variables were extracted as low-level features from video and audio respectivel… ▽ More

    Submitted 18 April, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to Annual International Conference of the IEEE Engineering in Medicine and Biology Society 2024