Skip to main content

Showing 1–4 of 4 results for author: Dip, S S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2504.03984  [pdf

    eess.SP

    Optimized Feature Selection and Neural Network-Based Classification of Motor Imagery Using EEG Signals

    Authors: Muhammad Sudipto Siam Dip, Mohammod Abdul Motin, Md. Anik Hasan, Sumaiya Kabir

    Abstract: Objective: Machine learning- and deep learning-based models have recently been employed in motor imagery intention classification from electroencephalogram (EEG) signals. Nevertheless, there is a limited understanding of feature selection to assist in identifying the most significant features in different spatial locations. Methods: This study proposes a feature selection technique using sequentia… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

  2. arXiv:2501.00557  [pdf

    eess.SP

    NeuroSleepNet: A Multi-Head Self-Attention Based Automatic Sleep Scoring Scheme with Spatial and Multi-Scale Temporal Representation Learning

    Authors: Muhammad Sudipto Siam Dip, Mohammod Abdul Motin, Chandan Karmakar, Thomas Penzel, Marimuthu Palaniswami

    Abstract: Objective: Automatic sleep scoring is crucial for diagnosing sleep disorders. Existing frameworks based on Polysomnography often rely on long sequences of input signals to predict sleep stages, which can introduce complexity. Moreover, there is limited exploration of simplifying representation learning in sleep scoring methods. Methods: In this study, we propose NeuroSleepNet, an automatic sleep s… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  3. arXiv:2409.10240  [pdf, other

    eess.AS cs.SD

    oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models

    Authors: Muhammad Sudipto Siam Dip, Md Anik Hasan, Sapnil Sarker Bipro, Md Abdur Raiyan, Mohammod Abdul Motin

    Abstract: In this study, we address the challenge of speaker recognition using a novel data augmentation technique of adding noise to enrollment files. This technique efficiently aligns the sources of test and enrollment files, enhancing comparability. Various pre-trained models were employed, with the resnet model achieving the highest DCF of 0.84 and an EER of 13.44. The augmentation technique notably imp… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 5 pages, 2 figures

  4. arXiv:2305.09688  [pdf

    eess.AS cs.CL cs.LG

    OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking

    Authors: Fazle Rabbi Rakib, Souhardya Saha Dip, Samiul Alam, Nazia Tasnim, Md. Istiak Hossain Shihab, Md. Nazmuddoha Ansary, Syed Mobassir Hossen, Marsia Haque Meghla, Mamunur Mamun, Farig Sadeque, Sayma Sultana Chowdhury, Tahsin Reasat, Asif Sushmit, Ahmed Imtiaz Humayun

    Abstract: We present OOD-Speech, the first out-of-distribution (OOD) benchmarking dataset for Bengali automatic speech recognition (ASR). Being one of the most spoken languages globally, Bengali portrays large diversity in dialects and prosodic features, which demands ASR frameworks to be robust towards distribution shifts. For example, islamic religious sermons in Bengali are delivered with a tonality that… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.