Skip to main content

Showing 1–25 of 25 results for author: Chandra, S S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.17432  [pdf, other

    eess.AS cs.LG

    SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection

    Authors: Ismail Rasim Ulgen, Shreeram Suresh Chandra, Junchen Lu, Berrak Sisman

    Abstract: Synthesizing the voices of unseen speakers remains a persisting challenge in multi-speaker text-to-speech (TTS). Existing methods model speaker characteristics through speaker conditioning during training, leading to increased model complexity and limiting reproducibility and accessibility. A lower-complexity method would enable speech synthesis research with limited computational and data resourc… ▽ More

    Submitted 6 May, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Submitted to IEEE Signal Processing Letters

  2. arXiv:2406.04494  [pdf, other

    eess.AS

    Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline

    Authors: Ali N. Salman, Zongyang Du, Shreeram Suresh Chandra, Ismail Rasim Ulgen, Carlos Busso, Berrak Sisman

    Abstract: Voice conversion (VC) research traditionally depends on scripted or acted speech, which lacks the natural spontaneity of real-life conversations. While natural speech data is limited for VC, our study focuses on filling in this gap. We introduce a novel data-sourcing pipeline that makes the release of a natural speech dataset for VC, named NaturalVoices. The pipeline extracts rich information in s… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2406.03637  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Style Mixture of Experts for Expressive Text-To-Speech Synthesis

    Authors: Ahad Jawaid, Shreeram Suresh Chandra, Junchen Lu, Berrak Sisman

    Abstract: Recent advances in style transfer text-to-speech (TTS) have improved the expressiveness of synthesized speech. However, encoding stylistic information (e.g., timbre, emotion, and prosody) from diverse and unseen reference speech remains a challenge. This paper introduces StyleMoE, an approach that addresses the issue of learning averaged style representations in the style encoder by creating style… ▽ More

    Submitted 27 October, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Published in Audio Imagination: NeurIPS 2024 Workshop

  4. arXiv:2405.11413  [pdf, other

    eess.AS cs.LG

    Exploring speech style spaces with language models: Emotional TTS without emotion labels

    Authors: Shreeram Suresh Chandra, Zongyang Du, Berrak Sisman

    Abstract: Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain. Learning emotional prosody implicitly presents a tough challenge due to the subjective nature of emotions. In this study, we propose a novel approach that leverages text awareness to acquire emotional styles without the need for explicit emotion labels or t… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted at Speaker Odyssey 2024

  5. arXiv:2401.03621  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Machine Learning Applications in Traumatic Brain Injury: A Spotlight on Mild TBI

    Authors: Hanem Ellethy, Shekhar S. Chandra, Viktor Vegh

    Abstract: Traumatic Brain Injury (TBI) poses a significant global public health challenge, contributing to high morbidity and mortality rates and placing a substantial economic burden on healthcare systems worldwide. The diagnosis of TBI relies on clinical information along with Computed Tomography (CT) scans. Addressing the multifaceted challenges posed by TBI has seen the development of innovative, data-d… ▽ More

    Submitted 11 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: The manuscript has 34 pages, 3 figures, and 4 tables

  6. arXiv:2311.14197  [pdf

    eess.IV cs.CV cs.LG

    Enhancing mTBI Diagnosis with Residual Triplet Convolutional Neural Network Using 3D CT

    Authors: Hanem Ellethy, Shekhar S. Chandra, Viktor Vegh

    Abstract: Mild Traumatic Brain Injury (mTBI) is a common and challenging condition to diagnose accurately. Timely and precise diagnosis is essential for effective treatment and improved patient outcomes. Traditional diagnostic methods for mTBI often have limitations in terms of accuracy and sensitivity. In this study, we introduce an innovative approach to enhance mTBI diagnosis using 3D Computed Tomography… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  7. Single Image Compressed Sensing MRI via a Self-Supervised Deep Denoising Approach

    Authors: Marlon Bran Lorenzana, Feng Liu, Shekhar S. Chandra

    Abstract: Popular methods in compressed sensing (CS) are dependent on deep learning (DL), where large amounts of data are used to train non-linear reconstruction models. However, ensuring generalisability over and access to multiple datasets is challenging to realise for real-world applications. To address these concerns, this paper proposes a single image, self-supervised (SS) CS-MRI framework that enables… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 5 pages, 4 figures, 2 tables, conference

  8. arXiv:2310.04705  [pdf, other

    eess.IV cs.CV

    Multi-scale MRI reconstruction via dilated ensemble networks

    Authors: Wendi Ma, Marlon Bran Lorenzana, Wei Dai, Hongfu Sun, Shekhar S. Chandra

    Abstract: As aliasing artefacts are highly structural and non-local, many MRI reconstruction networks use pooling to enlarge filter coverage and incorporate global context. However, this inadvertently impedes fine detail recovery as downsampling creates a resolution bottleneck. Moreover, real and imaginary features are commonly split into separate channels, discarding phase information particularly importan… ▽ More

    Submitted 30 November, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

  9. arXiv:2309.12572  [pdf

    eess.IV cs.CV cs.LG

    Interpretable 3D Multi-Modal Residual Convolutional Neural Network for Mild Traumatic Brain Injury Diagnosis

    Authors: Hanem Ellethy, Viktor Vegh, Shekhar S. Chandra

    Abstract: Mild Traumatic Brain Injury (mTBI) is a significant public health challenge due to its high prevalence and potential for long-term health effects. Despite Computed Tomography (CT) being the standard diagnostic tool for mTBI, it often yields normal results in mTBI patients despite symptomatic evidence. This fact underscores the complexity of accurate diagnosis. In this study, we introduce an interp… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by the Australasian Joint Conference on Artificial Intelligence 2023 (AJCAI 2023). 12 pages and 5 Figures

    Report number: Part of the Lecture Notes in Computer Science book series (LNAI,volume 14471)

  10. arXiv:2309.08641  [pdf, other

    eess.IV eess.SP

    Fractal Compressive Sensing

    Authors: Marlon Bran Lorenzana, Benjamin Cottier, Matthew Marques, Andrew Kingston, Shekhar S. Chandra

    Abstract: This paper introduces a sparse projection matrix composed of discrete (digital) periodic lines that create a pseudo-random (p.frac) sampling scheme. Our approach enables random Cartesian sampling whilst employing deterministic and one-dimensional (1D) trajectories derived from the discrete Radon transform (DRT). Unlike radial trajectories, DRT projections can be back-projected without interpolatio… ▽ More

    Submitted 6 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 12 pages, 10 figures, 1 table

  11. arXiv:2309.00265  [pdf

    eess.IV cs.CV

    Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review

    Authors: Fatima Al Zegair, Nathasha Naranpanawa, Brigid Betz-Stablein, Monika Janda, H. Peter Soyer, Shekhar S. Chandra

    Abstract: Skin lesions known as naevi exhibit diverse characteristics such as size, shape, and colouration. The concept of an "Ugly Duckling Naevus" comes into play when monitoring for melanoma, referring to a lesion with distinctive features that sets it apart from other lesions in the vicinity. As lesions within the same individual typically share similarities and follow a predictable pattern, an ugly duc… ▽ More

    Submitted 5 September, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

  12. arXiv:2303.13810  [pdf, other

    cs.CV eess.IV

    Evidence-aware multi-modal data fusion and its application to total knee replacement prediction

    Authors: Xinwen Liu, Jing Wang, S. Kevin Zhou, Craig Engstrom, Shekhar S. Chandra

    Abstract: Deep neural networks have been widely studied for predicting a medical condition, such as total knee replacement (TKR). It has shown that data of different modalities, such as imaging data, clinical variables and demographic information, provide complementary information and thus can improve the prediction accuracy together. However, the data sources of various modalities may not always be of high… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  13. arXiv:2303.05696  [pdf, other

    eess.IV cs.CV cs.LG

    Explainable Semantic Medical Image Segmentation with Style

    Authors: Wei Dai, Siyu Liu, Craig B. Engstrom, Shekhar S. Chandra

    Abstract: Semantic medical image segmentation using deep learning has recently achieved high accuracy, making it appealing to clinical problems such as radiation therapy. However, the lack of high-quality semantically labelled data remains a challenge leading to model brittleness to small shifts to input data. Most works require extra data for semi-supervised learning and lack the interpretability of the bo… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  14. arXiv:2302.08861  [pdf

    eess.IV cs.CV cs.LG

    AliasNet: Alias Artefact Suppression Network for Accelerated Phase-Encode MRI

    Authors: Marlon E. Bran Lorenzana, Shekhar S. Chandra, Feng Liu

    Abstract: Sparse reconstruction is an important aspect of MRI, helping to reduce acquisition time and improve spatial-temporal resolution. Popular methods are based mostly on compressed sensing (CS), which relies on the random sampling of k-space to produce incoherent (noise-like) artefacts. Due to hardware constraints, 1D Cartesian phase-encode under-sampling schemes are popular for 2D CS-MRI. However, 1D… ▽ More

    Submitted 10 October, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  15. arXiv:2211.16696  [pdf, other

    eess.IV cs.CV cs.LG

    Automated anomaly-aware 3D segmentation of bones and cartilages in knee MR images from the Osteoarthritis Initiative

    Authors: Boyeong Woo, Craig Engstrom, William Baresic, Jurgen Fripp, Stuart Crozier, Shekhar S. Chandra

    Abstract: In medical image analysis, automated segmentation of multi-component anatomical structures, which often have a spectrum of potential anomalies and pathologies, is a challenging task. In this work, we develop a multi-step approach using U-Net-based neural networks to initially detect anomalies (bone marrow lesions, bone cysts) in the distal femur, proximal tibia and patella from 3D magnetic resonan… ▽ More

    Submitted 1 December, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

  16. arXiv:2210.00255  [pdf

    eess.IV cs.CV

    Cascaded Multi-Modal Mixing Transformers for Alzheimer's Disease Classification with Incomplete Data

    Authors: Linfeng Liu, Siyu Liu, Lu Zhang, Xuan Vinh To, Fatima Nasrallah, Shekhar S. Chandra

    Abstract: Accurate medical classification requires a large number of multi-modal data, and in many cases, different feature types. Previous studies have shown promising results when using multi-modal data, outperforming single-modality models when classifying diseases such as Alzheimer's Disease (AD). However, those models are usually not flexible enough to handle missing modalities. Currently, the most com… ▽ More

    Submitted 16 July, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

  17. Transformer Compressed Sensing via Global Image Tokens

    Authors: Marlon Bran Lorenzana, Craig Engstrom, Shekhar S. Chandra

    Abstract: Convolutional neural networks (CNN) have demonstrated outstanding Compressed Sensing (CS) performance compared to traditional, hand-crafted methods. However, they are broadly limited in terms of generalisability, inductive bias and difficulty to model long distance relationships. Transformer neural networks (TNN) overcome such issues by implementing an attention mechanism designed to capture depen… ▽ More

    Submitted 12 July, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: 4 Pages, 4 Figures, 2 Tables

  18. arXiv:2203.03196  [pdf, other

    eess.IV cs.CV

    Undersampled MRI Reconstruction with Side Information-Guided Normalisation

    Authors: Xinwen Liu, Jing Wang, Cheng Peng, Shekhar S. Chandra, Feng Liu, S. Kevin Zhou

    Abstract: Magnetic resonance (MR) images exhibit various contrasts and appearances based on factors such as different acquisition protocols, views, manufacturers, scanning parameters, etc. This generally accessible appearance-related side information affects deep learning-based undersampled magnetic resonance imaging (MRI) reconstruction frameworks, but has been overlooked in the majority of current works.… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  19. arXiv:2112.02723  [pdf

    eess.IV

    Automated volumetric and statistical shape assessment of cam-type morphology of the femoral head-neck region from 3D magnetic resonance images

    Authors: Jessica M. Bugeja, Ying Xia, Shekhar S. Chandra, Nicholas J. Murphy, Jillian Eyles, Libby Spiers, Stuart Crozier, David J. Hunter, Jurgen Fripp, Craig Engstrom

    Abstract: Femoroacetabular impingement (FAI) cam morphology is routinely assessed using two-dimensional alpha angles which do not provide specific data on cam size characteristics. The purpose of this study is to implement a novel, automated three-dimensional (3D) pipeline, CamMorph, for segmentation and measurement of cam volume, surface area and height from magnetic resonance (MR) images in patients with… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: 24 pages (including appendices), 8 figures, 2 tables, 4 Appendices

    ACM Class: I.4

  20. arXiv:2109.05443  [pdf, other

    eess.IV cs.CV

    CAN3D: Fast 3D Medical Image Segmentation via Compact Context Aggregation

    Authors: Wei Dai, Boyeong Woo, Siyu Liu, Matthew Marques, Craig B. Engstrom, Peter B. Greer, Stuart Crozier, Jason A. Dowling, Shekhar S. Chandra

    Abstract: Direct automatic segmentation of objects from 3D medical imaging, such as magnetic resonance (MR) imaging, is challenging as it often involves accurately identifying a number of individual objects with complex geometries within a large volume under investigation. To address these challenges, most deep learning approaches typically enhance their learning capability by substantially increasing the c… ▽ More

    Submitted 22 September, 2021; v1 submitted 12 September, 2021; originally announced September 2021.

    Comments: 21 pages, 7 figures

  21. Bespoke Fractal Sampling Patterns for Discrete Fourier Space via the Kaleidoscope Transform

    Authors: Jacob M. White, Stuart Crozier, Shekhar S. Chandra

    Abstract: Sampling strategies are important for sparse imaging methodologies, especially those employing the discrete Fourier transform (DFT). Chaotic sensing is one such methodology that employs deterministic, fractal sampling in conjunction with finite, iterative reconstruction schemes to form an image from limited samples. Using a sampling pattern constructed entirely from periodic lines in DFT space, ch… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 6 pages, 7 figures

  22. arXiv:2106.10413  [pdf, other

    eess.SP

    A Survey on Machine Learning Algorithms for Applications in Cognitive Radio Networks

    Authors: Akshay Upadhye, Purushothaman Saravanan, Shreeram Suresh Chandra, Sanjeev Gurugopinath

    Abstract: In this paper, we present a survey on the utility of machine learning (ML) algorithms for applications in cognitive radio networks (CRN). We start with a high-level overview of some of the major challenges in CRNs, and mention the ML architectures and algorithms that can be used to alleviate them. In particular, our focus is on two fundamental applications in CRNs, namely spectrum sensing -- with… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  23. arXiv:2103.16744  [pdf, other

    cs.CV eess.IV

    Deep Simultaneous Optimisation of Sampling and Reconstruction for Multi-contrast MRI

    Authors: Xinwen Liu, Jing Wang, Fangfang Tang, Shekhar S. Chandra, Feng Liu, Stuart Crozier

    Abstract: MRI images of the same subject in different contrasts contain shared information, such as the anatomical structure. Utilizing the redundant information amongst the contrasts to sub-sample and faithfully reconstruct multi-contrast images could greatly accelerate the imaging speed, improve image quality and shorten scanning protocols. We propose an algorithm that generates the optimised sampling pat… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Presented at ISMRM 28th Annual Meeting & Exhibition (Poster #3619)

  24. arXiv:2007.03199  [pdf

    eess.IV cs.CV

    Automatic lesion detection, segmentation and characterization via 3D multiscale morphological sifting in breast MRI

    Authors: Hang Min, Darryl McClymont, Shekhar S. Chandra, Stuart Crozier, Andrew P. Bradley

    Abstract: Previous studies on computer aided detection/diagnosis (CAD) in 4D breast magnetic resonance imaging (MRI) regard lesion detection, segmentation and characterization as separate tasks, and typically require users to manually select 2D MRI slices or regions of interest as the input. In this work, we present a breast MRI CAD system that can handle 4D multimodal breast MRI data, and integrate lesion… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  25. arXiv:2006.15578  [pdf, other

    eess.IV cs.CV

    Generalisable 3D Fabric Architecture for Streamlined Universal Multi-Dataset Medical Image Segmentation

    Authors: Siyu Liu, Wei Dai, Craig Engstrom, Jurgen Fripp, Stuart Crozier, Jason A. Dowling, Shekhar S. Chandra

    Abstract: Data scarcity is common in deep learning models for medical image segmentation. Previous works proposed multi-dataset learning, either simultaneously or via transfer learning to expand training sets. However, medical image datasets have diverse-sized images and features, and developing a model simultaneously for multiple datasets is challenging. This work proposes Fabric Image Representation Encod… ▽ More

    Submitted 28 November, 2022; v1 submitted 28 June, 2020; originally announced June 2020.