Showing 1–50 of 111 results for author: Aali, A

Searching in archive eess.
  1. arXiv:2507.08684  [pdf, ps, other]

    eess.SY

    Large-Scale Processing and Validation of Grid Data for Assessing the Fair Spatial Distribution of PV Hosting Capacity

    Authors: Ali Mohamed Ali, Yaser Raeisi, Plouton Grammatikos, Davide Pavanello, Pierre Roduit, Fabrizio Sossan

    Abstract: The integration of PV systems and increased electrification levels present significant challenges to the traditional design and operation of distribution grids. This paper presents a methodology for extracting, validating, and adapting grid data from a distribution system operator's (DSO) database to facilitate large-scale grid studies, including load flow and optimal power flow analyses. The vali…

    Submitted 11 July, 2025; originally announced July 2025.

  2. arXiv:2506.18474  [pdf, ps, other]

    eess.IV cs.AI cs.CV cs.LG

    A Deep Convolutional Neural Network-Based Novel Class Balancing for Imbalance Data Segmentation

    Authors: Atifa Kalsoom, M. A. Iftikhar, Amjad Ali, Zubair Shah, Shidin Balakrishnan, Hazrat Ali

    Abstract: Retinal fundus images provide valuable insights into the human eye's interior structure and crucial features, such as blood vessels, optic disk, macula, and fovea. However, accurate segmentation of retinal blood vessels can be challenging due to imbalanced data distribution and varying vessel thickness. In this paper, we propose BLCB-CNN, a novel pipeline based on deep learning and bi-level class…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: This is preprint of the paper submitted to Scientific Reports journal

  3. arXiv:2506.07722  [pdf, ps, other]

    cs.SD eess.AS

    Towards a Unified Benchmark for Arabic Pronunciation Assessment: Quranic Recitation as Case Study

    Authors: Yassine El Kheir, Omnia Ibrahim, Amit Meghanani, Nada Almarwani, Hawau Olamide Toyin, Sadeen Alharbi, Modar Alfadly, Lamya Alkanhal, Ibrahim Selim, Shehab Elbatal, Salima Mdhaffar, Thomas Hain, Yasser Hifny, Mostafa Shahin, Ahmed Ali

    Abstract: We present a unified benchmark for mispronunciation detection in Modern Standard Arabic (MSA) using Qur'anic recitation as a case study. Our approach lays the groundwork for advancing Arabic pronunciation assessment by providing a comprehensive pipeline that spans data processing, the development of a specialized phoneme set tailored to the nuances of MSA pronunciation, and the creation of the fir…

    Submitted 12 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: Accepted Interspeech 2025 and ArabicNLP Shared Task 2025

  4. TinyML-Based Adaptive Pulse Shaping for Edge Intelligence in IoT/IIoT

    Authors: Afan Ali

    Abstract: Edge intelligence in IoT and IIoT demands lightweight algorithms for data processing on resource-constrained devices. This paper introduces a novel adaptive pulse shape filter based on TinyML for PAPR and SER optimization on edge devices used in uplink IoT communication. Implemented on IoT nodes such as sensors, our pruned neural network provides up to 2 dB PAPR saving over root-raised-cosine (RRC…

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 6 pages, 11 Figures, Accepted in Internet Technology Letters

    Journal ref: Internet Technology Letters, June 2025

  5. arXiv:2505.08328  [pdf, ps, other]

    cs.NI eess.SP

    AI-Driven Digital Twins: Optimizing 5G/6G Network Slicing with NTNs

    Authors: Afan Ali, Huseyin Arslan

    Abstract: Network slicing in 5G/6G Non-Terrestrial Networks (NTNs) is confronted with mobility and traffic variability. An artificial intelligence (AI)-based digital twin (DT) architecture with deep reinforcement learning (DRL) using deep deterministic policy gradient (DDPG) is proposed for dynamic optimization of resource allocation. DT virtualizes network states to enable predictive analysis, while DRL chan…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 5 pages, 3 figures, letter

  6. arXiv:2505.01778  [pdf, other]

    eess.SP

    Spreading the Wave: Low-Complexity PAPR Reduction for AFDM and OCDM in 6G Networks

    Authors: Afan Ali, Abdelali Arous, Huseyin Arslan

    Abstract: High Peak-to-Average Power Ratio (PAPR) is still a common issue in multicarrier signal modulation systems such as Orthogonal Chirp Division Multiplexing (OCDM) and Affine Frequency Division Multiplexing (AFDM), which are envisioned to play a central role in 6G networks. To this end, this paper aims to investigate a novel and low-complexity solution towards minimizing the PAPR with the aid of a uni…

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: 12 pages, 14 figures, 5 tables

  7. arXiv:2505.00399  [pdf, other]

    eess.SP

    Stealth Signals: Multi-Discriminator GANs for Covert Communications Against Diverse Wardens

    Authors: Afan Ali, Md. Jalil Piran, Huseyin Arslan

    Abstract: Covert wireless communications are critical for concealing the existence of any transmission from adversarial wardens, particularly in complex environments with multiple heterogeneous detectors. This paper proposes a novel adversarial AI framework leveraging a multi-discriminator Generative Adversarial Network (GAN) to design signals that evade detection by diverse wardens, while ensuring reliable…

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 13 pages, 12 figures, 6 tables

  8. arXiv:2504.21606  [pdf, ps, other]

    eess.SY

    Measurement-Based Line-Impedance Estimation in the Absence of Phasor Measurement Units

    Authors: Plouton Grammatikos, Ali Mohamed Ali, Fabrizio Sossan

    Abstract: This paper proposes and compares experimentally several methods to estimate the series resistance and reactance (i.e., the longitudinal components of the $π$-model of a line) of low-voltage lines in distribution grids. It first shows that if phasor measurements are available and the grid nodal voltages and power injections are known, the problem can be formulated and solved as a conventional load f…

    Submitted 7 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

    Comments: 2025 IEEE Kiel PowerTech

  9. arXiv:2504.01767  [pdf, other]

    eess.AS cs.AI cs.CV

    Leveraging Embedding Techniques in Multimodal Machine Learning for Mental Illness Assessment

    Authors: Abdelrahaman A. Hassan, Abdelrahman A. Ali, Aya E. Fouda, Radwa J. Hanafy, Mohammed E. Fouda

    Abstract: The increasing global prevalence of mental disorders, such as depression and PTSD, requires objective and scalable diagnostic tools. Traditional clinical assessments often face limitations in accessibility, objectivity, and consistency. This paper investigates the potential of multimodal machine learning to address these challenges, leveraging the complementary information available in text, audio…

    Submitted 2 April, 2025; originally announced April 2025.

  10. arXiv:2503.19886  [pdf, ps, other]

    cs.LG cs.DC cs.IT cs.NI eess.SP

    RCC-PFL: Robust Client Clustering under Noisy Labels in Personalized Federated Learning

    Authors: Abdulmoneam Ali, Ahmed Arafa

    Abstract: We address the problem of cluster identity estimation in a personalized federated learning (PFL) setting in which users aim to learn different personal models. The backbone of effective learning in such a setting is to cluster users into groups whose objectives are similar. A typical approach in the literature is to achieve this by training users' data on different proposed personal models and ass…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: to appear in the 2025 IEEE International Conference on Communications

  11. arXiv:2503.05798  [pdf, other]

    eess.SY

    Automating Hot-Rolling: Designing an Integrated Mechatronics System for Enhanced Efficiency in Sheet Metal Production

    Authors: Mostafa A. Mostafa, Mohamed Khaled, Abdelrahman Ali, Amr Mostafa, Mariam Mohamed, Omar Ahmed, Osama Khalil

    Abstract: The hot-rolling process is a critical stage in sheet metal production within the heavy steel industry. Traditionally, parameter adjustments such as sheet metal velocity and roll gap are performed manually, leading to inefficiencies and limited precision. This project introduces an integrated mechatronics system designed to automate the control of rolling speed and sheet metal thickness, enhancing…

    Submitted 16 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  12. arXiv:2501.05826  [pdf, other]

    eess.IV cs.AI cs.CV

    AI-Driven Diabetic Retinopathy Screening: Multicentric Validation of AIDRSS in India

    Authors: Amit Kr Dey, Pradeep Walia, Girish Somvanshi, Abrar Ali, Sagarnil Das, Pallabi Paul, Minakhi Ghosh

    Abstract: Purpose: Diabetic retinopathy (DR) is a major cause of vision loss, particularly in India, where access to retina specialists is limited in rural areas. This study aims to evaluate the Artificial Intelligence-based Diabetic Retinopathy Screening System (AIDRSS) for DR detection and prevalence assessment, addressing the growing need for scalable, automated screening solutions in resource-limited se…

    Submitted 13 January, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: 22 pages, 5 figures

  13. arXiv:2412.10911  [pdf, other]

    math.NA eess.SY

    Improving Numerical Stability and Accuracy in Partitioned Methods with Algebraic Prediction

    Authors: Ahmad Ali, Haya Monawwar, Hantao Cui

    Abstract: The partitioned approach for the numerical integration of power system differential algebraic equations faces inherent numerical stability challenges due to delays between the computation of state and algebraic variables. Such delays can compromise solution accuracy and computational efficiency, particularly in large-scale system simulations. We present an $O(h^2)$-accurate prediction scheme for a…

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: This paper has been submitted to the IEEE Power & Energy Society General Meeting 2025 and is currently under review

  14. arXiv:2412.10417  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    Leveraging Audio and Text Modalities in Mental Health: A Study of LLMs Performance

    Authors: Abdelrahman A. Ali, Aya E. Fouda, Radwa J. Hanafy, Mohammed E. Fouda

    Abstract: Mental health disorders are increasingly prevalent worldwide, creating an urgent need for innovative tools to support early diagnosis and intervention. This study explores the potential of Large Language Models (LLMs) in multimodal mental health diagnostics, specifically for detecting depression and Post Traumatic Stress Disorder through text and audio modalities. Using the E-DAIC dataset, we comp…

    Submitted 9 December, 2024; originally announced December 2024.

  15. arXiv:2411.12919  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Robust multi-coil MRI reconstruction via self-supervised denoising

    Authors: Asad Aali, Marius Arvinte, Sidharth Kumar, Yamin I. Arefeen, Jonathan I. Tamir

    Abstract: We study the effect of incorporating self-supervised denoising as a pre-processing step for training deep learning (DL) based reconstruction methods on data corrupted by Gaussian noise. K-space data employed for training are typically multi-coil and inherently noisy. Although DL-based reconstruction methods trained on fully sampled data can enable high reconstruction quality, obtaining large, nois…

    Submitted 24 May, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

    Journal ref: MRM, 2025

  16. arXiv:2410.15017  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    DM-Codec: Distilling Multimodal Representations for Speech Tokenization

    Authors: Md Mubtasim Ahasan, Md Fahim, Tasnim Mohiuddin, A K M Mahbubur Rahman, Aman Chadha, Tariq Iqbal, M Ashraful Amin, Md Mofijul Islam, Amin Ahsan Ali

    Abstract: Recent advancements in speech-language models have yielded significant improvements in speech tokenization and synthesis. However, effectively mapping the complex, multidimensional attributes of speech into discrete tokens remains challenging. This process demands acoustic, semantic, and contextual information for precise speech representations. Existing speech representations generally fall into…

    Submitted 19 October, 2024; originally announced October 2024.

  17. arXiv:2410.14131  [pdf]

    eess.IV cs.AI cs.CV

    Deep Learning Applications in Medical Image Analysis: Advancements, Challenges, and Future Directions

    Authors: Aimina Ali Eli, Abida Ali

    Abstract: Medical image analysis has emerged as an essential element of contemporary healthcare, facilitating physicians in achieving expedited and precise diagnosis. Recent breakthroughs in deep learning, a subset of artificial intelligence, have markedly revolutionized the analysis of medical pictures, improving the accuracy and efficiency of clinical procedures. Deep learning algorithms, especially convo…

    Submitted 4 November, 2024; v1 submitted 17 October, 2024; originally announced October 2024.

  18. arXiv:2410.12434  [pdf, other]

    eess.SY

    A Control Theoretic Study on Omnidirectional MAVs with Minimum Number of Actuators and No Internal Forces at Any Orientation

    Authors: Ahmed Ali, Chiara Gabellieri, Antonio Franchi

    Abstract: We propose a new class of multirotor aerial vehicle designs composed of a multi-body structure in which a main body is connected by passive joints to links equipped with propellers. We have investigated some instances of this class, some of which are shown to achieve omnidirectionality while having a minimum number of inputs equal to the main body's Degrees of Freedom (DoFs), only uni-directional pos…

    Submitted 16 October, 2024; originally announced October 2024.

  19. arXiv:2410.02733  [pdf, ps, other]

    cs.LG cs.IT cs.NI eess.SP

    Data Similarity-Based One-Shot Clustering for Multi-Task Hierarchical Federated Learning

    Authors: Abdulmoneam Ali, Ahmed Arafa

    Abstract: We address the problem of cluster identity estimation in a hierarchical federated learning setting in which users work toward learning different tasks. To overcome the challenge of task heterogeneity, users need to be grouped in a way such that users with the same task are in the same group, conducting training together, while sharing the weights of feature extraction layers with the other groups.…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: To appear in Asilomar 2024

  20. arXiv:2409.19448  [pdf]

    cs.SD cs.AI eess.AS

    Advanced Clustering Techniques for Speech Signal Enhancement: A Review and Metanalysis of Fuzzy C-Means, K-Means, and Kernel Fuzzy C-Means Methods

    Authors: Abdulhady Abas Abdullah, Aram Mahmood Ahmed, Tarik Rashid, Hadi Veisi, Yassin Hussein Rassul, Bryar Hassan, Polla Fattah, Sabat Abdulhameed Ali, Ahmed S. Shamsaldin

    Abstract: Speech signal processing is a cornerstone of modern communication technologies, tasked with improving the clarity and comprehensibility of audio data in noisy environments. The primary challenge in this field is the effective separation and recognition of speech from background noise, crucial for applications ranging from voice-activated assistants to automated transcription services. The quality…

    Submitted 28 September, 2024; originally announced September 2024.

  21. arXiv:2408.05645  [pdf]

    eess.IV cs.CV cs.LG

    BeyondCT: A deep learning model for predicting pulmonary function from chest CT scans

    Authors: Kaiwen Geng, Zhiyi Shi, Xiaoyan Zhao, Alaa Ali, Jing Wang, Joseph Leader, Jiantao Pu

    Abstract: Background: Pulmonary function tests (PFTs) and computed tomography (CT) imaging are vital in diagnosing, managing, and monitoring lung diseases. A common issue in practice is the lack of access to recorded pulmonary functions despite available chest CT scans. Purpose: To develop and validate a deep learning algorithm for predicting pulmonary function directly from chest CT scans. M…

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 5 tables, 7 figures, 22 pages

  22. arXiv:2408.02430  [pdf, other]

    eess.AS

    Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic

    Authors: Yassine El Kheir, Hamdy Mubarak, Ahmed Ali, Shammur Absar Chowdhury

    Abstract: This paper presents a novel Dialectal Sound and Vowelization Recovery framework, designed to recognize borrowed and dialectal sounds within phonologically diverse and dialect-rich languages that extend beyond their standard orthographic sound sets. The proposed framework utilized a quantized sequence of input with(out) continuous pretrained self-supervised representation. We show the efficacy of t…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Accepted ACL 2024 Main Conference

  23. arXiv:2407.20387  [pdf, other]

    eess.IV cs.CV cs.LG

    Two-Phase Segmentation Approach for Accurate Left Ventricle Segmentation in Cardiac MRI using Machine Learning

    Authors: Maria Tamoor, Abbas Raza Ali, Philemon Philip, Ruqqayia Adil, Rabia Shahid, Asma Naseer

    Abstract: Accurate segmentation of the Left Ventricle (LV) holds substantial importance due to its implications in disease detection, regional analysis, and the development of complex models for cardiac surgical planning. CMR is a gold standard for diagnosis of several cardiac diseases. LV in CMR comprises three distinct sections: Basal, Mid-Ventricle, and Apical. This research focuses on the precise…

    Submitted 29 July, 2024; originally announced July 2024.

  24. arXiv:2407.11865  [pdf, other]

    eess.IV cs.CV

    Novel Hybrid Integrated Pix2Pix and WGAN Model with Gradient Penalty for Binary Images Denoising

    Authors: Luca Tirel, Ali Mohamed Ali, Hashim A. Hashim

    Abstract: This paper introduces a novel approach to image denoising that leverages the advantages of Generative Adversarial Networks (GANs). Specifically, we propose a model that combines elements of the Pix2Pix model and the Wasserstein GAN (WGAN) with Gradient Penalty (WGAN-GP). This hybrid framework seeks to capitalize on the denoising capabilities of conditional GANs, as demonstrated in the Pix2Pix mode…

    Submitted 31 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Systems and Soft Computing

  25. arXiv:2407.06392  [pdf, other]

    cs.NI eess.SP

    Effects of Small-Scale User Mobility on Highly Directional XR Communications

    Authors: Asad Ali, Olga Galinina, Jiri Hosek, Sergey Andreev

    Abstract: The development of next-generation communication systems promises to enable extended reality (XR) applications, such as XR gaming with ultra-realistic content and human-grade sensory feedback. These demanding applications impose stringent performance requirements on the underlying wireless communication infrastructure. To meet the expected Quality of Experience (QoE) for XR applications, high-capa…

    Submitted 8 July, 2024; originally announced July 2024.

  26. arXiv:2406.17973  [pdf, other]

    eess.SY

    Koopman-LQR Controller for Quadrotor UAVs from Data

    Authors: Zeyad M. Manaa, Ayman M. Abdallah, Mohammad A. Abido, Syed S. Azhar Ali

    Abstract: Quadrotor systems are common and beneficial for many fields, but their intricate behavior often makes it challenging to design effective and optimal control strategies. Some traditional approaches to nonlinear control often rely on local linearizations or complex nonlinear models, which can be inaccurate or computationally expensive. We present a data-driven approach to identify the dynamics of a…

    Submitted 25 June, 2024; originally announced June 2024.

  27. arXiv:2406.16099  [pdf, other]

    cs.SD eess.AS

    Speech Representation Analysis based on Inter- and Intra-Model Similarities

    Authors: Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

    Abstract: Self-supervised models have revolutionized speech processing, achieving new levels of performance in a wide variety of tasks with limited resources. However, the inner workings of these models are still opaque. In this paper, we aim to analyze the encoded contextual representation of these foundation models based on their inter- and intra-model similarity, independent of any external annotation an…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 5 pages, Accepted to appear in ICASSP XAI-SA Workshop

  28. arXiv:2406.05716  [pdf, other]

    eess.SP cs.IT

    Near or far: On determining the appropriate channel estimation strategy in cross-field communication

    Authors: Simon Tarboush, Anum Ali, Tareq Y. Al-Naffouri

    Abstract: The use of ultra-massive multiple-input multiple-output and high-frequency large bandwidth systems is likely in the next-generation wireless communication systems. In such systems, the user moves between near- and far-field regions, and consequently, the channel estimation will need to be carried out in the cross-field scenario. Channel estimation strategies have been proposed for both near- and f…

    Submitted 9 June, 2024; originally announced June 2024.

  29. arXiv:2405.15820  [pdf]

    eess.SY physics.app-ph

    Concurrent Multiphysics and Multiscale Topology Optimization for Lightweight Laser-Driven Porous Actuator Systems

    Authors: Musaddiq Al Ali, Masatoshi Shimoda

    Abstract: In this research, multi-physics topology optimization is employed to achieve the detailed design of a lightweight porous linear actuation mechanism that harnesses energy through laser activation. A multiscale topology optimization methodology is introduced for micro- and macroscale design, considering energy dissipation via heat convection and radiation. This investigation meticulously considers t…

    Submitted 23 May, 2024; originally announced May 2024.

  30. arXiv:2405.14242  [pdf]

    eess.IV cs.CV

    M2ANET: Mobile Malaria Attention Network for efficient classification of plasmodium parasites in blood cells

    Authors: Salam Ahmed Ali, Peshraw Salam Abdulqadir, Shan Ali Abdullah, Haruna Yunusa

    Abstract: Malaria is a life-threatening infectious disease caused by Plasmodium parasites, which poses a significant public health challenge worldwide, particularly in tropical and subtropical regions. Timely and accurate detection of malaria parasites in blood cells is crucial for effective treatment and control of the disease. In recent years, deep learning techniques have demonstrated remarkable success…

    Submitted 23 May, 2024; originally announced May 2024.

  31. arXiv:2405.02563  [pdf, other]

    eess.SP cs.LG

    Deep Representation Learning-Based Dynamic Trajectory Phenotyping for Acute Respiratory Failure in Medical Intensive Care Units

    Authors: Alan Wu, Tilendra Choudhary, Pulakesh Upadhyaya, Ayman Ali, Philip Yang, Rishikesan Kamaleswaran

    Abstract: Sepsis-induced acute respiratory failure (ARF) is a serious complication with a poor prognosis. This paper presents a deep representation learning-based phenotyping method to identify distinct groups of clinical trajectories of septic patients with ARF. For this retrospective study, we created a dataset from electronic medical records (EMR) consisting of data from sepsis patients admitted to medica…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 9 pages

  32. arXiv:2404.10018  [pdf, other]

    cs.RO eess.SY

    A Linear MPC with Control Barrier Functions for Differential Drive Robots

    Authors: Ali Mohamed Ali, Chao Shen, Hashim A. Hashim

    Abstract: The need for fully autonomous mobile robots has surged over the past decade, with the imperative of ensuring safe navigation in a dynamic setting emerging as a primary challenge impeding advancements in this domain. In this paper, a Safety Critical Model Predictive Control based on Dynamic Feedback Linearization tailored to the application of differential drive robots with two wheels is proposed t…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted IET Control Theory & Applications. arXiv admin note: text overlap with arXiv:2404.09320

  33. arXiv:2404.09320  [pdf, other]

    eess.SY

    MPC Based Linear Equivalence with Control Barrier Functions for VTOL-UAVs

    Authors: Ali Mohamed Ali, Hashim A. Hashim, Chao Shen

    Abstract: In this work, we propose a cascaded scheme of linear Model Predictive Control (MPC) based on Control Barrier Functions (CBF) with Dynamic Feedback Linearization (DFL) for Vertical Take-off and Landing (VTOL) Unmanned Aerial Vehicles (UAVs). CBF is a tool that allows enforcement of forward invariance of a set using Lyapunov-like functions to ensure safety. The first control synthesis that employed…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: The 2024 IEEE American Control Conference (ACC)

  34. arXiv:2402.09461  [pdf, other]

    eess.SP cs.LG

    A Novel Approach to WaveNet Architecture for RF Signal Separation with Learnable Dilation and Data Augmentation

    Authors: Yu Tian, Ahmed Alhammadi, Abdullah Quran, Abubakar Sani Ali

    Abstract: In this paper, we address the intricate issue of RF signal separation by presenting a novel adaptation of the WaveNet architecture that introduces learnable dilation parameters, significantly enhancing signal separation in dense RF spectrums. Our focused architectural refinements and innovative data augmentation strategies have markedly improved the model's ability to discern complex signal source…

    Submitted 8 February, 2024; originally announced February 2024.

  35. arXiv:2401.15417  [pdf, other]

    cs.LG eess.SY

    Fault Diagnosis on Induction Motor using Machine Learning and Signal Processing

    Authors: Muhammad Samiullah, Hasan Ali, Shehryar Zahoor, Anas Ali

    Abstract: The detection and identification of induction motor faults using machine learning and signal processing is a valuable approach to avoiding plant disturbances and shutdowns in the context of Industry 4.0. In this work, we present a study on the detection and identification of induction motor faults using machine learning and signal processing with MATLAB Simulink. We developed a model of a three-ph…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 6 pages, 17 figures, 2 tables

  36. arXiv:2312.03989  [pdf, other]

    cs.LG cond-mat.mtrl-sci eess.IV physics.data-an

    Rapid detection of rare events from in situ X-ray diffraction data using machine learning

    Authors: Weijian Zheng, Jun-Sang Park, Peter Kenesei, Ahsan Ali, Zhengchun Liu, Ian T. Foster, Nicholas Schwarz, Rajkumar Kettimuthu, Antonino Miceli, Hemant Sharma

    Abstract: High-energy X-ray diffraction methods can non-destructively map the 3D microstructure and associated attributes of metallic polycrystalline engineering materials in their bulk form. These methods are often combined with external stimuli such as thermo-mechanical loading to take snapshots over time of the evolving microstructure and attributes. However, the extreme data volumes and the high costs o…

    Submitted 6 December, 2023; originally announced December 2023.

  37. arXiv:2311.09413  [pdf]

    eess.IV

    Leveraging machine learning to enhance climate models: a review

    Authors: Ahmed Elsayed, Shrouk Wally, Islam Alkabbany, Asem Ali, Aly Farag

    Abstract: Recent achievements in machine learning (ML) have had a significant impact on various fields, including climate science. Climate modeling is very important and plays a crucial role in shaping the decisions of governments and individuals in mitigating the impact of climate change. Climate change poses a serious threat to humanity; however, current climate models are limited by computational costs,…

    Submitted 15 November, 2023; originally announced November 2023.

  38. arXiv:2310.16851  [pdf, other]

    eess.IV cs.CV

    Deep Learning Models for Classification of COVID-19 Cases by Medical Images

    Authors: Amir Ali

    Abstract: In recent times, the use of chest Computed Tomography (CT) images for detecting coronavirus infections has gained significant attention, owing to their ability to reveal bilateral changes in affected individuals. However, classifying patients from medical images presents a formidable challenge, particularly in identifying such bilateral changes. To tackle this challenge, our study harnesses the po…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Master's thesis

  39. arXiv:2310.13974  [pdf, other]

    cs.CL cs.SD eess.AS

    Automatic Pronunciation Assessment -- A Review

    Authors: Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

    Abstract: Pronunciation assessment and its application in computer-aided pronunciation training (CAPT) have seen impressive progress in recent years. With the rapid growth in language processing and deep learning over the past few years, there is a need for an updated review. In this paper, we review methods employed in pronunciation assessment for both phonemic and prosodic aspects. We categorize the main challeng…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 9 pages, accepted to EMNLP Findings

  40. arXiv:2309.15674  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Speech collage: code-switched audio generation by collaging monolingual corpora

    Authors: Amir Hussein, Dorsa Zeinali, Ondřej Klejch, Matthew Wiesner, Brian Yan, Shammur Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur

    Abstract: Designing effective automatic speech recognition (ASR) systems for Code-Switching (CS) often depends on the availability of the transcribed CS resources. To address data scarcity, this paper introduces Speech Collage, a method that synthesizes CS data from monolingual corpora by splicing audio segments. We further improve the smoothness quality of audio generation using an overlap-add approach. We…

    Submitted 27 September, 2023; originally announced September 2023.

  41. arXiv:2309.15563  [pdf, other]

    cs.CV eess.IV

    Guided Frequency Loss for Image Restoration

    Authors: Bilel Benjdira, Anas M. Ali, Anis Koubaa

    Abstract: Image Restoration has seen remarkable progress in recent years. Many generative models have been adapted to tackle the known restoration cases of images. However, the benefit of exploiting the frequency domain is not well explored despite its being a major factor in these particular cases of image synthesis. In this study, we propose the Guided Frequency Loss (GFL), which helps the model to learn in…

    Submitted 22 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  42. arXiv:2309.07739  [pdf, other]

    cs.CL cs.SD eess.AS

    The complementary roles of non-verbal cues for Robust Pronunciation Assessment

    Authors: Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: Research on pronunciation assessment systems focuses on utilizing phonetic and phonological aspects of non-native (L2) speech, often neglecting the rich layer of information hidden within the non-verbal cues. In this study, we proposed a novel pronunciation assessment framework, IntraVerbalPA. The framework innovatively incorporates both fine-grained frame- and abstract utterance-level non-verba…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 5 pages, submitted to ICASSP 2024

  43. arXiv:2309.07719  [pdf, other]

    cs.CL cs.SD eess.AS

    L1-aware Multilingual Mispronunciation Detection Framework

    Authors: Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: The phonological discrepancies between a speaker's native (L1) and non-native (L2) languages serve as a major factor in mispronunciation. This paper introduces a novel multilingual MDD architecture, L1-MultiMDD, enriched with L1-aware speech representation. An end-to-end speech encoder is trained on the input signal and its corresponding reference phoneme sequence. First, an attention mechani…

    Submitted 21 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 5 pages, submitted to ICASSP 2024

  44. arXiv:2308.15822  [pdf]

    eess.IV cs.CV

    AMDNet23: A combined deep Contour-based Convolutional Neural Network and Long Short Term Memory system to diagnose Age-related Macular Degeneration

    Authors: Md. Aiyub Ali, Md. Shakhawat Hossain, Md. Kawar Hossain, Subhadra Soumi Sikder, Sharun Akter Khushbu, Mirajul Islam

    Abstract: In light of the expanding population, an automated framework for disease detection can assist doctors in the diagnosis of ocular diseases, yield accurate, stable, and rapid outcomes, and improve the success rate of early detection. The work initially aims to enhance the quality of fundus images by employing an adaptive contrast enhancement algorithm (CLAHE) and Gamma correction. In the preproc…

    Submitted 30 August, 2023; originally announced August 2023.

    Report number: ISWA-D-23-00333

  45. arXiv:2308.04355  [pdf, other]

    eess.SP cs.IT

    Evaluation of a Low-Cost Single-Lead ECG Module for Vascular Ageing Prediction and Studying Smoking-induced Changes in ECG

    Authors: S. Anas Ali, M. Saqib Niaz, Mubashir Rehman, Ahsan Mehmood, M. Mahboob Ur Rahman, Kashif Riaz, Qammer H. Abbasi

    Abstract: Vascular age is traditionally measured using invasive methods or through 12-lead electrocardiogram (ECG). This paper utilizes a low-cost single-lead (lead-I) ECG module to predict the vascular age of an apparently healthy young person. We also study the impact of smoking on the ECG traces of light-but-habitual smokers. We begin by collecting (lead-I) ECG data from 42 apparently health…

    Submitted 25 November, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: 12 pages, 7 figures, 5 tables, submitted to a journal for review

  46. arXiv:2308.02503  [pdf, other]

    eess.AS cs.CL cs.SD

    MyVoice: Arabic Speech Resource Collaboration Platform

    Authors: Yousseif Elshahawy, Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: We introduce MyVoice, a crowdsourcing platform designed to collect Arabic speech to enhance dialectal speech technologies. This platform offers an opportunity to design large dialectal speech datasets and make them publicly available. MyVoice allows contributors to select a city/country-level fine-grained dialect and record the displayed utterances. Users can switch roles between contributors and…

    Submitted 23 July, 2023; originally announced August 2023.

    Comments: 2 pages, accepted at InterSpeech23 Show and Tell Session

  47. Measuring Student Behavioral Engagement using Histogram of Actions

    Authors: Ahmed Abdelkawy, Aly Farag, Islam Alkabbany, Asem Ali, Chris Foreman, Thomas Tretter, Nicholas Hindy

    Abstract: In this paper, we propose a novel technique for measuring behavioral engagement through recognition of students' actions. The proposed approach recognizes student actions and then predicts the student's behavioral engagement level. For student action recognition, we use human skeletons to model student postures and upper body movements. To learn the dynamics of the student's upper body, a 3D-CNN model is used. T…

    Submitted 15 May, 2025; v1 submitted 18 July, 2023; originally announced July 2023.

  48. arXiv:2307.06796  [pdf, other]

    eess.SP eess.SY

    Defeating Proactive Jammers Using Deep Reinforcement Learning for Resource-Constrained IoT Networks

    Authors: Abubakar Sani Ali, Shimaa Naser, Sami Muhaidat

    Abstract: Traditional anti-jamming techniques, such as spread spectrum, adaptive power/rate control, and cognitive radio, have demonstrated effectiveness in mitigating jamming attacks. However, their robustness against the growing complexity of Internet-of-Things (IoT) networks and diverse jamming attacks is still limited. To address these challenges, machine learning (ML)-based techniques have emerged as promis…

    Submitted 13 July, 2023; originally announced July 2023.

  49. arXiv:2306.07936  [pdf, other]

    eess.AS cs.CL cs.SD

    FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator

    Authors: Massa Baali, Ahmed Ali

    Abstract: This paper presents FOOCTTS, an automatic pipeline for a football commentator that generates speech with background crowd noise. The application takes text from the user and applies text pre-processing such as vowelization, followed by the commentator's speech synthesizer. Our pipeline includes Arabic automatic speech recognition for data labeling, CTC segmentation, transcription vowelization to m…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted at Interspeech 2023 Show & Tell Demo Session

  50. arXiv:2306.01845  [pdf, other]

    cs.SD eess.AS

    Multi-View Multi-Task Representation Learning for Mispronunciation Detection

    Authors: Yassine El Kheir, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: The disparity in phonology between a learner's native (L1) and target (L2) language poses a significant challenge for mispronunciation detection and diagnosis (MDD) systems. This challenge is further intensified by the lack of annotated L2 data. This paper proposes a novel MDD architecture that exploits multiple 'views' of the same input data assisted by auxiliary tasks to learn more distinctive phoneti…

    Submitted 7 August, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 5 pages, Accepted SLaTE23