Skip to main content

Showing 1–1 of 1 results for author: Tholan, M T

Searching in archive cs. Search in all archives.
.
  1. Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation

    Authors: Masoud Thajudeen Tholan, Vinayaka Hegde, Chetan Sharma, Prasanta Kumar Ghosh

    Abstract: Real-time Magnetic Resonance Imaging (rtMRI) is frequently used in speech production studies as it provides a complete view of the vocal tract during articulation. This study investigates the effectiveness of rtMRI in analyzing vocal tract movements by employing the SegNet and UNet models for Air-Tissue Boundary (ATB)segmentation tasks. We conducted pretraining of a few base models using increasin… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted to ICASSP 2025

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2025, pp. 1-5