-
Deep Reinforcement Learning Optimized Intelligent Resource Allocation in Active RIS-Integrated TN-NTN Networks
Authors:
Muhammad Ahmed Mohsin,
Hassan Rizwan,
Muhammad Jazib,
Muhammad Iqbal,
Muhammad Bilal,
Tabinda Ashraf,
Muhammad Farhan Khan,
Jen-Yi Pan
Abstract:
This work explores the deployment of active reconfigurable intelligent surfaces (A-RIS) in integrated terrestrial and non-terrestrial networks (TN-NTN) while utilizing coordinated multipoint non-orthogonal multiple access (CoMP-NOMA). Our system model incorporates a UAV-assisted RIS in coordination with a terrestrial RIS which aims for signal enhancement. We aim to maximize the sum rate for all us…
▽ More
This work explores the deployment of active reconfigurable intelligent surfaces (A-RIS) in integrated terrestrial and non-terrestrial networks (TN-NTN) while utilizing coordinated multipoint non-orthogonal multiple access (CoMP-NOMA). Our system model incorporates a UAV-assisted RIS in coordination with a terrestrial RIS which aims for signal enhancement. We aim to maximize the sum rate for all users in the network using a custom hybrid proximal policy optimization (H-PPO) algorithm by optimizing the UAV trajectory, base station (BS) power allocation factors, active RIS amplification factor, and phase shift matrix. We integrate edge users into NOMA pairs to achieve diversity gain, further enhancing the overall experience for edge users. Exhaustive comparisons are made with passive RIS-assisted networks to demonstrate the superior efficacy of active RIS in terms of energy efficiency, outage probability, and network sum rate.
△ Less
Submitted 11 January, 2025;
originally announced January 2025.
-
Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism
Authors:
Fan Wu,
Muhammad Bilal,
Haolong Xiang,
Heng Wang,
Jinjun Yu,
Xiaolong Xu
Abstract:
Railway Turnout Machines (RTMs) are mission-critical components of the railway transportation infrastructure, responsible for directing trains onto desired tracks. For safety assurance applications, especially in early-warning scenarios, RTM faults are expected to be detected as early as possible on a continuous 7x24 basis. However, limited emphasis has been placed on distributed model inference f…
▽ More
Railway Turnout Machines (RTMs) are mission-critical components of the railway transportation infrastructure, responsible for directing trains onto desired tracks. For safety assurance applications, especially in early-warning scenarios, RTM faults are expected to be detected as early as possible on a continuous 7x24 basis. However, limited emphasis has been placed on distributed model inference frameworks that can meet the inference latency and reliability requirements of such mission critical fault diagnosis systems. In this paper, an edge-cloud collaborative early-warning system is proposed to enable real-time and downtime-tolerant fault diagnosis of RTMs, providing a new paradigm for the deployment of models in safety-critical scenarios. Firstly, a modular fault diagnosis model is designed specifically for distributed deployment, which utilizes a hierarchical architecture consisting of the prior knowledge module, subordinate classifiers, and a fusion layer for enhanced accuracy and parallelism. Then, a cloud-edge collaborative framework leveraging pipeline parallelism, namely CEC-PA, is developed to minimize the overhead resulting from distributed task execution and context exchange by strategically partitioning and offloading model components across cloud and edge. Additionally, an election consensus mechanism is implemented within CEC-PA to ensure system robustness during coordinator node downtime. Comparative experiments and ablation studies are conducted to validate the effectiveness of the proposed distributed fault diagnosis approach. Our ensemble-based fault diagnosis model achieves a remarkable 97.4% accuracy on a real-world dataset collected by Nanjing Metro in Jiangsu Province, China. Meanwhile, CEC-PA demonstrates superior recovery proficiency during node disruptions and speed-up ranging from 1.98x to 7.93x in total inference time compared to its counterparts.
△ Less
Submitted 4 November, 2024;
originally announced November 2024.
-
MedISure: Towards Assuring Machine Learning-based Medical Image Classifiers using Mixup Boundary Analysis
Authors:
Adam Byfield,
William Poulett,
Ben Wallace,
Anusha Jose,
Shatakshi Tyagi,
Smita Shembekar,
Adnan Qayyum,
Junaid Qadir,
Muhammad Bilal
Abstract:
Machine learning (ML) models are becoming integral in healthcare technologies, presenting a critical need for formal assurance to validate their safety, fairness, robustness, and trustworthiness. These models are inherently prone to errors, potentially posing serious risks to patient health and could even cause irreparable harm. Traditional software assurance techniques rely on fixed code and do n…
▽ More
Machine learning (ML) models are becoming integral in healthcare technologies, presenting a critical need for formal assurance to validate their safety, fairness, robustness, and trustworthiness. These models are inherently prone to errors, potentially posing serious risks to patient health and could even cause irreparable harm. Traditional software assurance techniques rely on fixed code and do not directly apply to ML models since these algorithms are adaptable and learn from curated datasets through a training process. However, adapting established principles, such as boundary testing using synthetic test data can effectively bridge this gap. To this end, we present a novel technique called Mix-Up Boundary Analysis (MUBA) that facilitates evaluating image classifiers in terms of prediction fairness. We evaluated MUBA for two important medical imaging tasks -- brain tumour classification and breast cancer classification -- and achieved promising results. This research aims to showcase the importance of adapting traditional assurance principles for assessing ML models to enhance the safety and reliability of healthcare technologies. To facilitate future research, we plan to publicly release our code for MUBA.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Multivessel Coronary Artery Segmentation and Stenosis Localisation using Ensemble Learning
Authors:
Muhammad Bilal,
Dinis Martinho,
Reiner Sim,
Adnan Qayyum,
Hunaid Vohra,
Massimo Caputo,
Taofeek Akinosho,
Sofiat Abioye,
Zaheer Khan,
Waleed Niaz,
Junaid Qadir
Abstract:
Coronary angiography analysis is a common clinical task performed by cardiologists to diagnose coronary artery disease (CAD) through an assessment of atherosclerotic plaque's accumulation. This study introduces an end-to-end machine learning solution developed as part of our solution for the MICCAI 2023 Automatic Region-based Coronary Artery Disease diagnostics using x-ray angiography imagEs (ARCA…
▽ More
Coronary angiography analysis is a common clinical task performed by cardiologists to diagnose coronary artery disease (CAD) through an assessment of atherosclerotic plaque's accumulation. This study introduces an end-to-end machine learning solution developed as part of our solution for the MICCAI 2023 Automatic Region-based Coronary Artery Disease diagnostics using x-ray angiography imagEs (ARCADE) challenge, which aims to benchmark solutions for multivessel coronary artery segmentation and potential stenotic lesion localisation from X-ray coronary angiograms. We adopted a robust baseline model training strategy to progressively improve performance, comprising five successive stages of binary class pretraining, multivessel segmentation, fine-tuning using class frequency weighted dataloaders, fine-tuning using F1-based curriculum learning strategy (F1-CLS), and finally multi-target angiogram view classifier-based collective adaptation. Unlike many other medical imaging procedures, this task exhibits a notable degree of interobserver variability. %, making it particularly amenable to automated analysis. Our ensemble model combines the outputs from six baseline models using the weighted ensembling approach, which our analysis shows is found to double the predictive accuracy of the proposed solution. The final prediction was further refined, targeting the correction of misclassified blobs. Our solution achieved a mean F1 score of $37.69\%$ for coronary artery segmentation, and $39.41\%$ for stenosis localisation, positioning our team in the 5th position on both leaderboards. This work demonstrates the potential of automated tools to aid CAD diagnosis, guide interventions, and improve the accuracy of stent injections in clinical settings.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
Robust Surgical Tools Detection in Endoscopic Videos with Noisy Data
Authors:
Adnan Qayyum,
Hassan Ali,
Massimo Caputo,
Hunaid Vohra,
Taofeek Akinosho,
Sofiat Abioye,
Ilhem Berrou,
Paweł Capik,
Junaid Qadir,
Muhammad Bilal
Abstract:
Over the past few years, surgical data science has attracted substantial interest from the machine learning (ML) community. Various studies have demonstrated the efficacy of emerging ML techniques in analysing surgical data, particularly recordings of procedures, for digitizing clinical and non-clinical functions like preoperative planning, context-aware decision-making, and operating skill assess…
▽ More
Over the past few years, surgical data science has attracted substantial interest from the machine learning (ML) community. Various studies have demonstrated the efficacy of emerging ML techniques in analysing surgical data, particularly recordings of procedures, for digitizing clinical and non-clinical functions like preoperative planning, context-aware decision-making, and operating skill assessment. However, this field is still in its infancy and lacks representative, well-annotated datasets for training robust models in intermediate ML tasks. Also, existing datasets suffer from inaccurate labels, hindering the development of reliable models. In this paper, we propose a systematic methodology for developing robust models for surgical tool detection using noisy data. Our methodology introduces two key innovations: (1) an intelligent active learning strategy for minimal dataset identification and label correction by human experts; and (2) an assembling strategy for a student-teacher model-based self-training framework to achieve the robust classification of 14 surgical tools in a semi-supervised fashion. Furthermore, we employ weighted data loaders to handle difficult class labels and address class imbalance issues. The proposed methodology achieves an average F1-score of 85.88\% for the ensemble model-based self-training with class weights, and 80.88\% without class weights for noisy labels. Also, our proposed method significantly outperforms existing approaches, which effectively demonstrates its effectiveness.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Communication and Control in Collaborative UAVs: Recent Advances and Future Trends
Authors:
Shumaila Javaid,
Nasir Saeed,
Zakria Qadir,
Hamza Fahim,
Bin He,
Houbing Song,
Muhammad Bilal
Abstract:
The recent progress in unmanned aerial vehicles (UAV) technology has significantly advanced UAV-based applications for military, civil, and commercial domains. Nevertheless, the challenges of establishing high-speed communication links, flexible control strategies, and developing efficient collaborative decision-making algorithms for a swarm of UAVs limit their autonomy, robustness, and reliabilit…
▽ More
The recent progress in unmanned aerial vehicles (UAV) technology has significantly advanced UAV-based applications for military, civil, and commercial domains. Nevertheless, the challenges of establishing high-speed communication links, flexible control strategies, and developing efficient collaborative decision-making algorithms for a swarm of UAVs limit their autonomy, robustness, and reliability. Thus, a growing focus has been witnessed on collaborative communication to allow a swarm of UAVs to coordinate and communicate autonomously for the cooperative completion of tasks in a short time with improved efficiency and reliability. This work presents a comprehensive review of collaborative communication in a multi-UAV system. We thoroughly discuss the characteristics of intelligent UAVs and their communication and control requirements for autonomous collaboration and coordination. Moreover, we review various UAV collaboration tasks, summarize the applications of UAV swarm networks for dense urban environments and present the use case scenarios to highlight the current developments of UAV-based applications in various domains. Finally, we identify several exciting future research direction that needs attention for advancing the research in collaborative UAVs.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Design and Development of Wall Climbing Robot
Authors:
Hafiz Muhammad Bilal
Abstract:
Climbing Robots are being developed for applications ranging from cleaning to the inspection of difficult to reach constructions. Climbing robots should be capable of carrying a light payload and climbing on vertical surfaces with ability to cope with obstacles. Regarding adhesion to the surface, they should be able to operate on different surfaces with different adhesion methods to produce strong…
▽ More
Climbing Robots are being developed for applications ranging from cleaning to the inspection of difficult to reach constructions. Climbing robots should be capable of carrying a light payload and climbing on vertical surfaces with ability to cope with obstacles. Regarding adhesion to the surface, they should be able to operate on different surfaces with different adhesion methods to produce strong gripping force using light weight mechanism consuming minimum power. Bearing these facts in mind this paper presents a 4-legged Wall Climbing Robot in which suction power using on board suction pumps is used as an adhesion technique. A Walking gait was developed to provide the robot with a capability for climbing up the wall. The robot's kinematics and motion can be considered as mimicking a technique commonly used in rock-climbing using four limbs to climb. It uses four legs, each with four-degrees-of-freedom (4-DOF) and specially designed suction cups attached to the end of each leg that enable it to manoeuvre itself up the wall and to move in any direction. The end effector can also be replaced with other end effectors designed for different adhesion methods to climb on variety of surfaces.
△ Less
Submitted 14 January, 2023;
originally announced January 2023.
-
Deep Learning based Prediction of MSI using MMR Markers in Colorectal Cancer
Authors:
Ruqayya Awan,
Mohammed Nimir,
Shan E Ahmed Raza,
Mohsin Bilal,
Johannes Lotz,
David Snead,
Andrew Robinson,
Nasir Rajpoot
Abstract:
The accurate diagnosis and molecular profiling of colorectal cancers are critical for planning the best treatment options for patients. Microsatellite instability (MSI) or mismatch repair (MMR) status plays a vital role in appropriate treatment selection, has prognostic implications and is used to investigate the possibility of patients having underlying genetic disorders (Lynch syndrome). NICE re…
▽ More
The accurate diagnosis and molecular profiling of colorectal cancers are critical for planning the best treatment options for patients. Microsatellite instability (MSI) or mismatch repair (MMR) status plays a vital role in appropriate treatment selection, has prognostic implications and is used to investigate the possibility of patients having underlying genetic disorders (Lynch syndrome). NICE recommends that all CRC patients should be offered MMR/MSI testing. Immunohistochemistry is commonly used to assess MMR status with subsequent molecular testing performed as required. This incurs significant extra costs and requires additional resources. The introduction of automated methods that can predict MSI or MMR status from a target image could substantially reduce the cost associated with MMR testing. Unlike previous studies on MSI prediction involving training a CNN using coarse labels (MSI vs Microsatellite Stable (MSS)), we have utilised fine-grain MMR labels for training purposes. In this paper, we present our work on predicting MSI status in a two-stage process using a single target slide either stained with CK8/18 or H&E. First, we trained a multi-headed convolutional neural network model where each head was responsible for predicting one of the MMR protein expressions. To this end, we performed the registration of MMR stained slides to the target slide as a pre-processing step. In the second stage, statistical features computed from the MMR prediction maps were used for the final MSI prediction. Our results demonstrated that MSI classification can be improved by incorporating fine-grained MMR labels in comparison to the previous approaches in which only coarse labels were utilised.
△ Less
Submitted 26 April, 2022; v1 submitted 24 February, 2022;
originally announced March 2022.
-
Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations
Authors:
Noorul Wahab,
Islam M Miligy,
Katherine Dodd,
Harvir Sahota,
Michael Toss,
Wenqi Lu,
Mostafa Jahanifar,
Mohsin Bilal,
Simon Graham,
Young Park,
Giorgos Hadjigeorghiou,
Abhir Bhalerao,
Ayat Lashen,
Asmaa Ibrahim,
Ayaka Katayama,
Henry O Ebili,
Matthew Parkin,
Tom Sorell,
Shan E Ahmed Raza,
Emily Hero,
Hesham Eldaly,
Yee Wah Tsang,
Kishore Gopalakrishnan,
David Snead,
Emad Rakha
, et al. (2 additional authors not shown)
Abstract:
Recent advances in whole slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence (AI) based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilize information embedded in pathology WSIs beyond what we obtain through visual assessment. For automated analysis of WSIs and…
▽ More
Recent advances in whole slide imaging (WSI) technology have led to the development of a myriad of computer vision and artificial intelligence (AI) based diagnostic, prognostic, and predictive algorithms. Computational Pathology (CPath) offers an integrated solution to utilize information embedded in pathology WSIs beyond what we obtain through visual assessment. For automated analysis of WSIs and validation of machine learning (ML) models, annotations at the slide, tissue and cellular levels are required. The annotation of important visual constructs in pathology images is an important component of CPath projects. Improper annotations can result in algorithms which are hard to interpret and can potentially produce inaccurate and inconsistent results. Despite the crucial role of annotations in CPath projects, there are no well-defined guidelines or best practices on how annotations should be carried out. In this paper, we address this shortcoming by presenting the experience and best practices acquired during the execution of a large-scale annotation exercise involving a multidisciplinary team of pathologists, ML experts and researchers as part of the Pathology image data Lake for Analytics, Knowledge and Education (PathLAKE) consortium. We present a real-world case study along with examples of different types of annotations, diagnostic algorithm, annotation data dictionary and annotation constructs. The analyses reported in this work highlight best practice recommendations that can be used as annotation guidelines over the lifecycle of a CPath project.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
Deep Multi-Resolution Dictionary Learning for Histopathology Image Analysis
Authors:
Nima Hatami,
Mohsin Bilal,
Nasir Rajpoot
Abstract:
The problem of recognizing various types of tissues present in multi-gigapixel histology images is an important fundamental pre-requisite for downstream analysis of the tumor microenvironment in a bottom-up analysis paradigm for computational pathology. In this paper, we propose a deep dictionary learning approach to solve the problem of tissue phenotyping in histology images. We propose deep Mult…
▽ More
The problem of recognizing various types of tissues present in multi-gigapixel histology images is an important fundamental pre-requisite for downstream analysis of the tumor microenvironment in a bottom-up analysis paradigm for computational pathology. In this paper, we propose a deep dictionary learning approach to solve the problem of tissue phenotyping in histology images. We propose deep Multi-Resolution Dictionary Learning (deepMRDL) in order to benefit from deep texture descriptors at multiple different spatial resolutions. We show the efficacy of the proposed approach through extensive experiments on four benchmark histology image datasets from different organs (colorectal cancer, breast cancer and breast lymphnodes) and tasks (namely, cancer grading, tissue phenotyping, tumor detection and tissue type classification). We also show that the proposed framework can employ most off-the-shelf CNNs models to generate effective deep texture descriptors.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Remote Sensing to Control Respiratory Viral Diseases Outbreaks using Internet of Vehicles
Authors:
Yesin Sahraoui,
Ahmed Korichi,
Chaker Abdelaziz Kerrache,
Muhammad Bilal,
Marica Amadeo
Abstract:
The respiratory viral diseases, such as those caused by the family of coronaviruses, can be extremely contagious and spread through saliva droplets generated by coughing, sneezing, or breathing. In humans, the most common symptoms of the infection include fever and difficulty in breathing. In order to reduce the diffusion of the current "Coronavirus disease 2019 (COVID-19)" pandemic, the Internet…
▽ More
The respiratory viral diseases, such as those caused by the family of coronaviruses, can be extremely contagious and spread through saliva droplets generated by coughing, sneezing, or breathing. In humans, the most common symptoms of the infection include fever and difficulty in breathing. In order to reduce the diffusion of the current "Coronavirus disease 2019 (COVID-19)" pandemic, the Internet of Things (IoT) technologies can play an important role; for instance, they can be effectively used for implementing a real-time patient tracking and warning system at a city scale. Crucial places to install the tracking IoT devices are the public/private vehicles that, augmented with multiple connectivity solutions, can implement the Internet of Vehicles (IoV) paradigm. In such a ubiquitous network environment, vehicles are equipped with a variety of sensors, including regular cameras that can be replaced with thermal cameras. Therefore, this paper proposes a new design for widely detecting respiratory viral diseases that leverages IoV to collect real-time body temperature and breathing rate measurements of pedestrians. This information can be used to recognize geographic areas affected by possible COVID-19 cases and to implement proactive preventive strategies that would further limit the spread of the disease.
△ Less
Submitted 13 August, 2020;
originally announced March 2021.
-
Shrinking-shifting and Amplifying-shifting Device Using Transformation Optics
Authors:
Hamza Ahmad Madni,
Muhammad Musavir Bilal,
Farrukh Jaleel,
Ahmed Sohaib,
Shahid Iqbal,
Wei Xiang Jiang
Abstract:
Based on transformation optics (TO), this paper uses geometric divisions and linear coordinate transformations to design shrinking-shifting - and reshaping, and amplifying-shifting - and reshaping devices. The proposed devices can reshape the sizes and locations of the wrapped-objects inside the core-region. The shrinking-shifting device shrinks the larger object into a smaller one and shifts it t…
▽ More
Based on transformation optics (TO), this paper uses geometric divisions and linear coordinate transformations to design shrinking-shifting - and reshaping, and amplifying-shifting - and reshaping devices. The proposed devices can reshape the sizes and locations of the wrapped-objects inside the core-region. The shrinking-shifting device shrinks the larger object into a smaller one and shifts it to different location, whereas the shrinking-reshaping device can generate a smaller-size image with different shape located at different location. In contrast to previously designed shrinking devices, the real object wrapped inside the proposed core-region and the transformed object contains the same material properties, and the location-shifting is another feature. Here, the shifting-region is located inside the physical-space boundaries to achieve the non-negative, homogeneous, and anisotropic material properties of the proposed device, which are easier for real implementations. Thus, we further verified this concept with the amplifying-shifting and -reshaping devices for visually transformation of smaller object into bigger one placed at different location and position. We also applied active scatterer to further validate the working functionality of proposed devices. In addition, the proposed devices behave like the concentrator and or rotator effect in the absence of any scatterer. Our findings highlight the role of TO, suggesting directions for future research on bi-functional devices that will be useful for shrinking and amplifying devices, illusion optics, camouflage, and object protection etc.
Keywords: amplifying, reshaping, shrinking, transformation optics, invisibility cloaks
△ Less
Submitted 7 October, 2020; v1 submitted 15 September, 2020;
originally announced September 2020.
-
Blended Multi-Modal Deep ConvNet Features for Diabetic Retinopathy Severity Prediction
Authors:
J. D. Bodapati,
N. Veeranjaneyulu,
S. N. Shareef,
S. Hakak,
M. Bilal,
P. K. R. Maddikunta,
O. Jo
Abstract:
Diabetic Retinopathy (DR) is one of the major causes of visual impairment and blindness across the world. It is usually found in patients who suffer from diabetes for a long period. The major focus of this work is to derive optimal representation of retinal images that further helps to improve the performance of DR recognition models. To extract optimal representation, features extracted from mult…
▽ More
Diabetic Retinopathy (DR) is one of the major causes of visual impairment and blindness across the world. It is usually found in patients who suffer from diabetes for a long period. The major focus of this work is to derive optimal representation of retinal images that further helps to improve the performance of DR recognition models. To extract optimal representation, features extracted from multiple pre-trained ConvNet models are blended using proposed multi-modal fusion module. These final representations are used to train a Deep Neural Network (DNN) used for DR identification and severity level prediction. As each ConvNet extracts different features, fusing them using 1D pooling and cross pooling leads to better representation than using features extracted from a single ConvNet. Experimental studies on benchmark Kaggle APTOS 2019 contest dataset reveals that the model trained on proposed blended feature representations is superior to the existing methods. In addition, we notice that cross average pooling based fusion of features from Xception and VGG16 is the most appropriate for DR recognition. With the proposed model, we achieve an accuracy of 97.41%, and a kappa statistic of 94.82 for DR identification and an accuracy of 81.7% and a kappa statistic of 71.1% for severity level prediction. Another interesting observation is that DNN with dropout at input layer converges more quickly when trained using blended features, compared to the same model trained using uni-modal deep features.
△ Less
Submitted 30 May, 2020;
originally announced June 2020.
-
Classification of Arrhythmia by Using Deep Learning with 2-D ECG Spectral Image Representation
Authors:
Amin Ullah,
Syed M. Anwar,
Muhammad Bilal,
Raja M Mehmood
Abstract:
The electrocardiogram (ECG) is one of the most extensively employed signals used in the diagnosis and prediction of cardiovascular diseases (CVDs). The ECG signals can capture the heart's rhythmic irregularities, commonly known as arrhythmias. A careful study of ECG signals is crucial for precise diagnoses of patients' acute and chronic heart conditions. In this study, we propose a two-dimensional…
▽ More
The electrocardiogram (ECG) is one of the most extensively employed signals used in the diagnosis and prediction of cardiovascular diseases (CVDs). The ECG signals can capture the heart's rhythmic irregularities, commonly known as arrhythmias. A careful study of ECG signals is crucial for precise diagnoses of patients' acute and chronic heart conditions. In this study, we propose a two-dimensional (2-D) convolutional neural network (CNN) model for the classification of ECG signals into eight classes; namely, normal beat, premature ventricular contraction beat, paced beat, right bundle branch block beat, left bundle branch block beat, atrial premature contraction beat, ventricular flutter wave beat, and ventricular escape beat. The one-dimensional ECG time series signals are transformed into 2-D spectrograms through short-time Fourier transform. The 2-D CNN model consisting of four convolutional layers and four pooling layers is designed for extracting robust features from the input spectrograms. Our proposed methodology is evaluated on a publicly available MIT-BIH arrhythmia dataset. We achieved a state-of-the-art average classification accuracy of 99.11\%, which is better than those of recently reported results in classifying similar types of arrhythmias. The performance is significant in other indices as well, including sensitivity and specificity, which indicates the success of the proposed method.
△ Less
Submitted 25 May, 2020; v1 submitted 14 May, 2020;
originally announced May 2020.
-
Amateur Drones Detection: A machine learning approach utilizing the acoustic signals in the presence of strong interference
Authors:
Zahoor Uddin,
Muhammad Altaf,
Muhammad Bilal,
Lewis Nkenyereye,
Ali Kashif Bashir
Abstract:
Owing to small size, sensing capabilities and autonomous nature, the Unmanned Air Vehicles (UAVs) have enormous applications in various areas, e.g., remote sensing, navigation, archaeology, journalism, environmental science, and agriculture. However, the unmonitored deployment of UAVs called the amateur drones (AmDr) can lead to serious security threats and risk to human life and infrastructure. T…
▽ More
Owing to small size, sensing capabilities and autonomous nature, the Unmanned Air Vehicles (UAVs) have enormous applications in various areas, e.g., remote sensing, navigation, archaeology, journalism, environmental science, and agriculture. However, the unmonitored deployment of UAVs called the amateur drones (AmDr) can lead to serious security threats and risk to human life and infrastructure. Therefore, timely detection of the AmDr is essential for the protection and security of sensitive organizations, human life and other vital infrastructure. AmDrs can be detected using different techniques based on sound, video, thermal, and radio frequencies. However, the performance of these techniques is limited in sever atmospheric conditions. In this paper, we propose an efficient unsupervise machine learning approach of independent component analysis (ICA) to detect various acoustic signals i.e., sounds of bird, airplanes, thunderstorm, rain, wind and the UAVs in practical scenario. After unmixing the signals, the features like Mel Frequency Cepstral Coefficients (MFCC), the power spectral density (PSD) and the Root Mean Square Value (RMS) of the PSD are extracted by using ICA. The PSD and the RMS of PSD signals are extracted by first passing the signals from octave band filter banks. Based on the above features the signals are classified using Support Vector Machines (SVM) and K Nearest Neighbor (KNN) to detect the presence or absence of AmDr. Unique feature of the proposed technique is the detection of a single or multiple AmDrs at a time in the presence of multiple acoustic interfering signals. The proposed technique is verified through extensive simulations and it is observed that the RMS values of PSD with KNN performs better than the MFCC with KNN and SVM.
△ Less
Submitted 28 February, 2020;
originally announced March 2020.
-
Classification of Chest Diseases using Wavelet Transforms and Transfer Learning
Authors:
Ahmed Rasheed,
Muhammad Shahzad Younis,
Muhammad Bilal,
Maha Rasheed
Abstract:
Chest X-ray scan is a most often used modality by radiologists to diagnose many chest related diseases in their initial stages. The proposed system aids the radiologists in making decision about the diseases found in the scans more efficiently. Our system combines the techniques of image processing for feature enhancement and deep learning for classification among diseases. We have used the ChestX…
▽ More
Chest X-ray scan is a most often used modality by radiologists to diagnose many chest related diseases in their initial stages. The proposed system aids the radiologists in making decision about the diseases found in the scans more efficiently. Our system combines the techniques of image processing for feature enhancement and deep learning for classification among diseases. We have used the ChestX-ray14 database in order to train our deep learning model on the 14 different labeled diseases found in it. The proposed research shows the significant improvement in the results by using wavelet transforms as pre-processing technique.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.
-
Secure and Robust Machine Learning for Healthcare: A Survey
Authors:
Adnan Qayyum,
Junaid Qadir,
Muhammad Bilal,
Ala Al-Fuqaha
Abstract:
Recent years have witnessed widespread adoption of machine learning (ML)/deep learning (DL) techniques due to their superior performance for a variety of healthcare applications ranging from the prediction of cardiac arrest from one-dimensional heart signals to computer-aided diagnosis (CADx) using multi-dimensional medical images. Notwithstanding the impressive performance of ML/DL, there are sti…
▽ More
Recent years have witnessed widespread adoption of machine learning (ML)/deep learning (DL) techniques due to their superior performance for a variety of healthcare applications ranging from the prediction of cardiac arrest from one-dimensional heart signals to computer-aided diagnosis (CADx) using multi-dimensional medical images. Notwithstanding the impressive performance of ML/DL, there are still lingering doubts regarding the robustness of ML/DL in healthcare settings (which is traditionally considered quite challenging due to the myriad security and privacy issues involved), especially in light of recent results that have shown that ML/DL are vulnerable to adversarial attacks. In this paper, we present an overview of various application areas in healthcare that leverage such techniques from security and privacy point of view and present associated challenges. In addition, we present potential methods to ensure secure and privacy-preserving ML for healthcare applications. Finally, we provide insight into the current research challenges and promising directions for future research.
△ Less
Submitted 21 January, 2020;
originally announced January 2020.
-
Unsupervised Adversarial Domain Adaptation for Cross-Lingual Speech Emotion Recognition
Authors:
Siddique Latif,
Junaid Qadir,
Muhammad Bilal
Abstract:
Cross-lingual speech emotion recognition (SER) is a crucial task for many real-world applications. The performance of SER systems is often degraded by the differences in the distributions of training and test data. These differences become more apparent when training and test data belong to different languages, which cause a significant performance gap between the validation and test scores. It is…
▽ More
Cross-lingual speech emotion recognition (SER) is a crucial task for many real-world applications. The performance of SER systems is often degraded by the differences in the distributions of training and test data. These differences become more apparent when training and test data belong to different languages, which cause a significant performance gap between the validation and test scores. It is imperative to build more robust models that can fit in practical applications of SER systems. Therefore, in this paper, we propose a Generative Adversarial Network (GAN)-based model for multilingual SER. Our choice of using GAN is motivated by their great success in learning the underlying data distribution. The proposed model is designed in such a way that can learn language invariant representations without requiring target-language data labels. We evaluate our proposed model on four different language emotional datasets, including an Urdu-language dataset to also incorporate alternative languages for which labelled data is difficult to find and which have not been studied much by the mainstream community. Our results show that our proposed model can significantly improve the baseline cross-lingual SER performance for all the considered datasets including the non-mainstream Urdu language data without requiring any labels.
△ Less
Submitted 27 July, 2020; v1 submitted 13 July, 2019;
originally announced July 2019.