Skip to main content

Showing 1–10 of 10 results for author: Usama, M

Searching in archive eess. Search in all archives.
.
  1. Spatial and Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Muhammad Hassaan Farooq Butt, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Muhammad Usama, Swalpa Kumar Roy, Jocelyn Chanussot, Danfeng Hong

    Abstract: Recent advancements in transformers, specifically self-attention mechanisms, have significantly improved hyperspectral image (HSI) classification. However, these models often suffer from inefficiencies, as their computational complexity scales quadratically with sequence length. To address these challenges, we propose the morphological spatial mamba (SMM) and morphological spatial-spectral Mamba (… ▽ More

    Submitted 30 November, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

  2. WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Muhammad Usama, Manuel Mazzara, Salvatore Distefano

    Abstract: Hyperspectral Imaging (HSI) has proven to be a powerful tool for capturing detailed spectral and spatial information across diverse applications. Despite the advancements in Deep Learning (DL) and Transformer architectures for HSI classification, challenges such as computational efficiency and the need for extensive labeled data persist. This paper introduces WaveMamba, a novel approach that integ… ▽ More

    Submitted 22 November, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

  3. arXiv:2407.05163  [pdf, other

    eess.IV cs.CV

    A Domain Adaptation Model for Carotid Ultrasound: Image Harmonization, Noise Reduction, and Impact on Cardiovascular Risk Markers

    Authors: Mohd Usama, Emma Nyman, Ulf Naslund, Christer Gronlund

    Abstract: Deep learning has been used extensively for medical image analysis applications, assuming the training and test data adhere to the same probability distributions. However, a common challenge arises when dealing with medical images generated by different systems or even the same system with varying parameter settings. Such images often contain diverse textures and noise patterns, violating the assu… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 17 pages, 7 figures, 7 tables

  4. arXiv:2405.08277  [pdf, other

    eess.SY

    AI-driven, Model-Free Current Control: A Deep Symbolic Approach for Optimal Induction Machine Performance

    Authors: Muhammad Usama, Yunkyung Hwang, Jaehong Kim

    Abstract: This paper proposed a straightforward and efficient current control solution for induction machines employing deep symbolic regression (DSR). The proposed DSR-based control design offers a simple yet highly effective approach by creating an optimal control model through training and fitting, resulting in an analytical dynamic numerical expression that characterizes the data. Notably, this approach… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: This work has been accepted for potential publication at the IEEE ECCE Asia 2024 International Power Electronics and Motion Control Conference. Please note that copyright may be transferred without prior notice

  5. arXiv:2308.12792  [pdf, other

    cs.SD eess.AS

    Sparks of Large Audio Models: A Survey and Outlook

    Authors: Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller

    Abstract: This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the field of audio signal processing. Audio processing, with its diverse signal representations and a wide range of sources--from human voices to musical instruments and environmental sounds--poses challenges distinct from those found in traditional Natural Language Pr… ▽ More

    Submitted 21 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Under review, Repo URL: https://github.com/EmulationAI/awesome-large-audio-models

  6. arXiv:2307.06090  [pdf, other

    cs.SD eess.AS

    Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

    Authors: Siddique Latif, Muhammad Usama, Mohammad Ibrahim Malik, Björn W. Schuller

    Abstract: Despite recent advancements in speech emotion recognition (SER) models, state-of-the-art deep learning (DL) approaches face the challenge of the limited availability of annotated data. Large language models (LLMs) have revolutionised our understanding of natural language, introducing emergent properties that broaden comprehension in language, speech, and vision. This paper examines the potential o… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE Computational Intelligence Magazine

  7. arXiv:2305.00725  [pdf, other

    cs.SD eess.AS

    Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

    Authors: Ibrahim Malik, Siddique Latif, Sanaullah Manzoor, Muhammad Usama, Junaid Qadir, Raja Jurdak

    Abstract: Non-speech emotion recognition has a wide range of applications including healthcare, crime control and rescue, and entertainment, to name a few. Providing these applications using edge computing has great potential, however, recent studies are focused on speech-emotion recognition using complex architectures. In this paper, a non-speech-based emotion recognition system is proposed, which can rely… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Under review

  8. arXiv:2202.05631  [pdf, other

    eess.IV cs.AI cs.CV

    Vehicle and License Plate Recognition with Novel Dataset for Toll Collection

    Authors: Muhammad Usama, Hafeez Anwar, Abbas Anwar, Saeed Anwar

    Abstract: We propose an automatic framework for toll collection, consisting of three steps: vehicle type recognition, license plate localization, and reading. However, each of the three steps becomes non-trivial due to image variations caused by several factors. The traditional vehicle decorations on the front cause variations among vehicles of the same type. These decorations make license plate localizatio… ▽ More

    Submitted 15 November, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  9. Vector Control Algorithm Based on Different Current Control Switching Techniques for Ac Motor Drives

    Authors: Muhammad Usama, Jaehong Kim

    Abstract: A comparative analysis of vector control scheme based on different current control switching pulses (HC, SPWM, DPWM and SVPWM) for the speed response of motor drive is analysed in this paper. The control system using different switching techniques, are comparatively simulated and analysed. Ac motor drives are progressively used in high-performance application industries due to small size, efficien… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

  10. arXiv:1906.06969  [pdf, other

    cs.RO eess.SY

    Robotic Navigation using Entropy-Based Exploration

    Authors: Muhammad Usama, Dong Eui Chang

    Abstract: Robotic navigation concerns the task in which a robot should be able to find a safe and feasible path and traverse between two points in a complex environment. We approach the problem of robotic navigation using reinforcement learning and use deep $Q$-networks to train agents to solve the task of robotic navigation. We compare the Entropy-Based Exploration (EBE) with the widely used $ε$-greedy exp… ▽ More

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: 5 pages