Search | arXiv e-print repository

Design of a compact low loss 2-way millimetre wave power divider for future communication

Authors: Muhammad Asfar Saeed, Augustine O. Nwajana, Muneeb Ahmad

Abstract: In this paper, a rectangular-shaped power divider has been presented operating at 27.9 GHz. The power divider has achieved acceptable results for important parameters such as S11, S12, S21, and S22. The substrate employed for the power divider is Roger 3003 which has a thickness of 1.6 mm. This power divider provides a reflection coefficient of -12.2 dB and an insertion loss of 3.1 dB at 28 GHz. T… ▽ More In this paper, a rectangular-shaped power divider has been presented operating at 27.9 GHz. The power divider has achieved acceptable results for important parameters such as S11, S12, S21, and S22. The substrate employed for the power divider is Roger 3003 which has a thickness of 1.6 mm. This power divider provides a reflection coefficient of -12.2 dB and an insertion loss of 3.1 dB at 28 GHz. This ka-band T-junction power divider covers 68% of the bandwidth. Dimensions of the ka-band T-junction power divider are 50x80 mm. Due to its dimensions and bandwidth this power divider is more suitable for millimetre wave applications like RADAR, beamforming, and 5G applications. △ Less

Submitted 7 May, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

Comments: 7 pages, 6 figures, 2 tables

arXiv:2504.04895 [pdf]

Asymmetric 4.77 Three-Way Unequal Filtering Power Divider/Combiner for Communication Systems Application

Authors: Augustine O. Nwajana, Mosammat Rokaiya Akter, Muhammad Asfar Saeed

Abstract: This study presents a novel three-way unequal filtering power divider/combiner, addressing challenges in unequal power distribution while incorporating filtering functions in communication systems. Wilkinson power divider (WPD) is the traditional power division approach using quarter-wavelength transmission lines [1]. This type of power divider is popularly used in communication systems due to its… ▽ More This study presents a novel three-way unequal filtering power divider/combiner, addressing challenges in unequal power distribution while incorporating filtering functions in communication systems. Wilkinson power divider (WPD) is the traditional power division approach using quarter-wavelength transmission lines [1]. This type of power divider is popularly used in communication systems due to its good electrical isolation and simple structure. The problem with WPD is that its operation requires the use of an externally connected bandpass filter (BPF) to achieve filtering functionality. This leads to increased footprint and increased loss coefficients in a system. In contrast to the traditional design approach involving a BPF, a matching transmission line, and a Wilkinson power divider as separate components, the proposed integrated filtering power divider (FPD) consolidates all three components into a single device, leading to lower footprint and lower loss coefficient in a system. Circuit modelling and electromagnetic (EM) simulations were conducted to ensure alignment between theoretical and practical results. The design demonstrates effective unequal power division at the three output ports while maintaining very good filtering performance. Results show a return loss better than 15 dB and a minimum insertion loss of 1.2 dB. The overall size of the device is 32.2 x 50.0 mm. This paper contributes to advancements in power divider design by addressing unequal power division challenges and integrating filtering functions. The findings offer a foundation for future developments in advanced power divider/combiner systems, with insights into potential challenges and areas for further improvements. △ Less

Submitted 7 April, 2025; originally announced April 2025.

Comments: 8 pages, 5 figures

arXiv:2503.17275 [pdf, other]

Vision Transformer Based Semantic Communications for Next Generation Wireless Networks

Authors: Muhammad Ahmed Mohsin, Muhammad Jazib, Zeeshan Alam, Muhmmad Farhan Khan, Muhammad Saad, Muhammad Ali Jamshed

Abstract: In the evolving landscape of 6G networks, semantic communications are poised to revolutionize data transmission by prioritizing the transmission of semantic meaning over raw data accuracy. This paper presents a Vision Transformer (ViT)-based semantic communication framework that has been deliberately designed to achieve high semantic similarity during image transmission while simultaneously minimi… ▽ More In the evolving landscape of 6G networks, semantic communications are poised to revolutionize data transmission by prioritizing the transmission of semantic meaning over raw data accuracy. This paper presents a Vision Transformer (ViT)-based semantic communication framework that has been deliberately designed to achieve high semantic similarity during image transmission while simultaneously minimizing the demand for bandwidth. By equipping ViT as the encoder-decoder framework, the proposed architecture can proficiently encode images into a high semantic content at the transmitter and precisely reconstruct the images, considering real-world fading and noise consideration at the receiver. Building on the attention mechanisms inherent to ViTs, our model outperforms Convolution Neural Network (CNNs) and Generative Adversarial Networks (GANs) tailored for generating such images. The architecture based on the proposed ViT network achieves the Peak Signal-to-noise Ratio (PSNR) of 38 dB, which is higher than other Deep Learning (DL) approaches in maintaining semantic similarity across different communication environments. These findings establish our ViT-based approach as a significant breakthrough in semantic communications. △ Less

Submitted 21 March, 2025; originally announced March 2025.

Comments: Accepted @ ICC 2025

arXiv:2411.18867 [pdf]

doi 10.1109/TMECH.2024.3459644

Comparative Analysis of Control Observer-Based Methods for State Estimation of Lithium-Ion Batteries in Practical Scenarios

Authors: Muhammad Saeed, Arash Khalatbarisoltani, Zhongwei Deng, Wenxue Liu, Faisal Altaf, Shuai Lu, Xiaosong Hu

Abstract: The reliability, lower computational complexity, and ease of implementation of control observers make them one of the most promising methods for the state estimation of Li-ion batteries (LIBs) in commercial applications. To pave their way, this study performs a comprehensive and systematic evaluation of four main categories of control observer-based methods in different practical scenarios conside… ▽ More The reliability, lower computational complexity, and ease of implementation of control observers make them one of the most promising methods for the state estimation of Li-ion batteries (LIBs) in commercial applications. To pave their way, this study performs a comprehensive and systematic evaluation of four main categories of control observer-based methods in different practical scenarios considering estimation accuracy, computational time convergence speed, stability, and robustness against measurement uncertainties. Observers are designed using a second-order equivalent circuit model whose observability against different scenarios is rigorously investigated to verify the feasibility of the proposed analysis. Established techniques then are validated against driving datasets and their comparative usefulness is evaluated using an experimental setup. The analysis also evaluates the adaptability of different techniques to electric vehicle field data. The results indicate better accuracy, stability, robustness, and faster convergence for the PI and PID, while the estimations of the Luenberger observers find it hard to converge against highly dynamic loadfiles. Moreover, this study also discusses the sensitivity of observer-based techniques to battery ohmic polarization and voltage-related measurement uncertainties. The most remarkable contribution of the proposed study lies in providing guidance for researchers when choosing the control observers for online state estimation of LIBs. △ Less

Submitted 2 December, 2024; v1 submitted 27 November, 2024; originally announced November 2024.

Journal ref: IEEE/ASME Transactions on Mechatronics, early access, (09 October 2024)

arXiv:2407.15113 [pdf, other]

Robust Secure ISAC: How RSMA and Active RIS Manage Eavesdropper's Spatial Uncertainty

Authors: A. Abdelaziz Salem, Saeed Abdallah, Mohamed Saad, Khawla Alnajjar, Mahmoud A. Albreem

Abstract: Incorporating rate splitting multiple access (RSMA) into integrated sensing and communication (ISAC) presents a significant security challenge, particularly in scenarios where the location of a potential eavesdropper (Eve) is unidentified. Splitting users' messages into common and private streams exposes them to eavesdropping, with the common stream dedicated for sensing and accessible to multiple… ▽ More Incorporating rate splitting multiple access (RSMA) into integrated sensing and communication (ISAC) presents a significant security challenge, particularly in scenarios where the location of a potential eavesdropper (Eve) is unidentified. Splitting users' messages into common and private streams exposes them to eavesdropping, with the common stream dedicated for sensing and accessible to multiple users. In response to this challenge, this paper proposes a novel approach that leverages active reconfigurable intelligent surface (RIS) aided beamforming and artificial noise (AN) to enhance the security of RSMA-enabled ISAC. Specifically, we first derive the ergodic private secrecy rate (EPSR) based on mathematical approximation of the average Eve channel gain. An optimization problem is then formulated to maximize the minimum EPSR, while satisfying the minimum required thresholds on ergodic common secrecy rate, radar sensing and RIS power budget. To address this non-convex problem, a novel optimization strategy is developed, whereby we alternatively optimize the transmit beamforming matrix for the common and private streams, rate splitting, AN, RIS reflection coefficient matrix, and radar receive beamformer. Successive convex approximation (SCA) and Majorization-Minimization (MM) are employed to convexify the beamforming and RIS sub-problems. Simulations are conducted to showcase the effectiveness of the proposed framework against established benchmarks. △ Less

Submitted 21 July, 2024; originally announced July 2024.

arXiv:2406.07073 [pdf]

Adaptive Control: Algorithms, Analysis and Applications

Authors: Ioan Doré Landau, Rogelio Lozano, Mohammed M Saad, Alireza Karimi

Abstract: Adaptive control provides techniques for adjusting control parameters in real time to maintain system performance despite unknown or changing process parameters. These methods use real data to tune controllers and adjust plant models or controller parameters. The field has progressed significantly since the 1970s, helped by digital computers. Early applications offered essential feedback, and theo… ▽ More Adaptive control provides techniques for adjusting control parameters in real time to maintain system performance despite unknown or changing process parameters. These methods use real data to tune controllers and adjust plant models or controller parameters. The field has progressed significantly since the 1970s, helped by digital computers. Early applications offered essential feedback, and theoretical advances solved many basic problems. This book comprehensively treats adaptive control, guiding readers from basic problems to analytical solutions with practical applications. Presenting a unified view is challenging due to various design steps and applications. However, a coherent presentation of basic techniques is now possible. The book uses a discrete-time approach to reflect the role of digital computers and shares practical experiences and understanding of different control designs. Mathematical aspects of synthesizing and analyzing algorithms are emphasized, though they alone may not solve practical problems. The book includes applications of control techniques but stresses that a solid mathematical understanding is crucial for creatively applying them to new challenges. Mathematical synthesis and analysis are highlighted, but they must be supplemented with practical problem-solving and algorithm modifications for specific applications. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2405.20987 [pdf, other]

Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: Generative Adversarial Networks (GANs) have high computational costs to train their complex architectures. Throughout the training process, GANs' output is analyzed qualitatively based on the loss and synthetic images' diversity and quality. Based on this qualitative analysis, training is manually halted once the desired synthetic images are generated. By utilizing an early stopping criterion, the… ▽ More Generative Adversarial Networks (GANs) have high computational costs to train their complex architectures. Throughout the training process, GANs' output is analyzed qualitatively based on the loss and synthetic images' diversity and quality. Based on this qualitative analysis, training is manually halted once the desired synthetic images are generated. By utilizing an early stopping criterion, the computational cost and dependence on manual oversight can be reduced yet impacted by training problems such as mode collapse, non-convergence, and instability. This is particularly prevalent in biomedical imagery, where training problems degrade the diversity and quality of synthetic images, and the high computational cost associated with training makes complex architectures increasingly inaccessible. This work proposes a novel early stopping criteria to quantitatively detect training problems, halt training, and reduce the computational costs associated with synthesizing biomedical images. Firstly, the range of generator and discriminator loss values is investigated to assess whether mode collapse, non-convergence, and instability occur sequentially, concurrently, or interchangeably throughout the training of GANs. Secondly, utilizing these occurrences in conjunction with the Mean Structural Similarity Index (MS-SSIM) and Fréchet Inception Distance (FID) scores of synthetic images forms the basis of the proposed early stopping criteria. This work helps identify the occurrence of training problems in GANs using low-resource computational cost and reduces training time to generate diversified and high-quality synthetic images. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: This paper is accepted at the 35th IEEE Irish Signals and Systems Conference (ISSC 2024)

arXiv:2405.17616 [pdf]

Design of a Rectangular Linear Microstrip Patch Antenna Array for 5G Communication

Authors: Muhammad Asfar Saeed, Augustine O. Nwajana

Abstract: This paper presents the design and characterization of a rectangular microstrip patch antenna array optimized for operation within the Ku-band frequency range. The antenna array is impedance-matched to 50 Ohms and utilizes a microstrip line feeding mechanism for excitation. The design maintains compact dimensions, with the overall antenna occupying an area of 29.5x7 mm. The antenna structure is mo… ▽ More This paper presents the design and characterization of a rectangular microstrip patch antenna array optimized for operation within the Ku-band frequency range. The antenna array is impedance-matched to 50 Ohms and utilizes a microstrip line feeding mechanism for excitation. The design maintains compact dimensions, with the overall antenna occupying an area of 29.5x7 mm. The antenna structure is modelled on an R03003 substrate material, featuring a dielectric constant of 3, a low-loss tangent of 0.0009, and a thickness of 1.574 mm. The substrate is backed by a conducting ground plane, and the array consists of six radiating patch elements positioned on top. Evaluation of the designed antenna array reveals a resonant frequency of 18GHz, with a -10 dB impedance bandwidth extending over 700MHz. The antenna demonstrates a high gain of 7.51dBi, making it well-suited for applications in 5G and future communication systems. Its compact form factor, cost-effectiveness, and broad impedance and radiation coverage further underscore its potential in these domains. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 4 pages, 5 figures, 2 tables

arXiv:2404.09342 [pdf, other]

Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2… ▽ More The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2024 focuses on exploring face-voice association under a unique condition of multilingual scenario. This condition is inspired from the fact that half of the world's population is bilingual and most often people communicate under multilingual scenario. The challenge uses a dataset namely, Multilingual Audio-Visual (MAV-Celeb) for exploring face-voice association in multilingual environments. This report provides the details of the challenge, dataset, baselines and task details for the FAME Challenge. △ Less

Submitted 22 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

Comments: ACM Multimedia Conference - Grand Challenge

arXiv:2310.03278 [pdf, other]

doi 10.1109/GLOBECOM54140.2023.10437820

Mitigating Pilot Contamination and Enabling IoT Scalability in Massive MIMO Systems

Authors: Muhammad Kamran Saeed, Ahmed E. Kamal, Ashfaq Khokhar

Abstract: Massive MIMO is expected to play an important role in the development of 5G networks. This paper addresses the issue of pilot contamination and scalability in massive MIMO systems. The current practice of reusing orthogonal pilot sequences in adjacent cells leads to difficulty in differentiating incoming inter- and intra-cell pilot sequences. One possible solution is to increase the number of orth… ▽ More Massive MIMO is expected to play an important role in the development of 5G networks. This paper addresses the issue of pilot contamination and scalability in massive MIMO systems. The current practice of reusing orthogonal pilot sequences in adjacent cells leads to difficulty in differentiating incoming inter- and intra-cell pilot sequences. One possible solution is to increase the number of orthogonal pilot sequences, which results in dedicating more space of coherence block to pilot transmission than data transmission. This, in turn, also hinders the scalability of massive MIMO systems, particularly in accommodating a large number of IoT devices within a cell. To overcome these challenges, this paper devises an innovative pilot allocation scheme based on the data transfer patterns of IoT devices. The scheme assigns orthogonal pilot sequences to clusters of devices instead of individual devices, allowing multiple devices to utilize the same pilot for periodically transmitting data. Moreover, we formulate the pilot assignment problem as a graph coloring problem and use the max k-cut graph partitioning approach to overcome the pilot contamination in a multicell massive MIMO system. The proposed scheme significantly improves the spectral efficiency and enables the scalability of massive MIMO systems; for instance, by using ten orthogonal pilot sequences, we are able to accommodate 200 devices with only a 12.5% omission rate. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: Accepted At GLOBECOM 2023

Journal ref: GLOBECOM 2023 - 2023 IEEE Global Communications Conference

arXiv:2309.12245 [pdf, other]

Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag… ▽ More Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imagery. Furthermore, the absence of diverse features in synthetic images can degrade the performance of machine learning classifiers. The mode collapse problem impacts Generative Adversarial Networks' capacity to generate diversified images. Mode collapse comes in two varieties: intra-class and inter-class. In this paper, both varieties of the mode collapse problem are investigated, and their subsequent impact on the diversity of synthetic X-ray images is evaluated. This work contributes an empirical demonstration of the benefits of integrating the adaptive input-image normalization with the Deep Convolutional GAN and Auxiliary Classifier GAN to alleviate the mode collapse problems. Synthetically generated images are utilized for data augmentation and training a Vision Transformer model. The classification performance of the model is evaluated using accuracy, recall, and precision scores. Results demonstrate that the DCGAN and the ACGAN with adaptive input-image normalization outperform the DCGAN and ACGAN with un-normalized X-ray images as evidenced by the superior diversity scores and classification scores. △ Less

Submitted 29 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

Comments: Submitted to the Elsevier Journal

arXiv:2308.02505 [pdf, other]

Assessing Intra-class Diversity and Quality of Synthetically Generated Images in a Biomedical and Non-biomedical Setting

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: In biomedical image analysis, data imbalance is common across several imaging modalities. Data augmentation is one of the key solutions in addressing this limitation. Generative Adversarial Networks (GANs) are increasingly being relied upon for data augmentation tasks. Biomedical image features are sensitive to evaluating the efficacy of synthetic images. These features can have a significant impa… ▽ More In biomedical image analysis, data imbalance is common across several imaging modalities. Data augmentation is one of the key solutions in addressing this limitation. Generative Adversarial Networks (GANs) are increasingly being relied upon for data augmentation tasks. Biomedical image features are sensitive to evaluating the efficacy of synthetic images. These features can have a significant impact on metric scores when evaluating synthetic images across different biomedical imaging modalities. Synthetically generated images can be evaluated by comparing the diversity and quality of real images. Multi-scale Structural Similarity Index Measure and Cosine Distance are used to evaluate intra-class diversity, while Frechet Inception Distance is used to evaluate the quality of synthetic images. Assessing these metrics for biomedical and non-biomedical imaging is important to investigate an informed strategy in evaluating the diversity and quality of synthetic images. In this work, an empirical assessment of these metrics is conducted for the Deep Convolutional GAN in a biomedical and non-biomedical setting. The diversity and quality of synthetic images are evaluated using different sample sizes. This research intends to investigate the variance in diversity and quality across biomedical and non-biomedical imaging modalities. Results demonstrate that the metrics scores for diversity and quality vary significantly across biomedical-to-biomedical and biomedical-to-non-biomedical imaging modalities. △ Less

Submitted 23 July, 2023; originally announced August 2023.

Comments: This work is accepted in 25th Irish Machine Vision and Image Processing (IMVIP) Conference

arXiv:2302.13033 [pdf, other]

Speaker Recognition in Realistic Scenario Using Multimodal Data

Authors: Saqlain Hussain Shah, Muhammad Saad Saeed, Shah Nawaz, Muhammad Haroon Yousaf

Abstract: In recent years, an association is established between faces and voices of celebrities leveraging large scale audio-visual information from YouTube. The availability of large scale audio-visual datasets is instrumental in developing speaker recognition methods based on standard Convolutional Neural Networks. Thus, the aim of this paper is to leverage large scale audio-visual information to improve… ▽ More In recent years, an association is established between faces and voices of celebrities leveraging large scale audio-visual information from YouTube. The availability of large scale audio-visual datasets is instrumental in developing speaker recognition methods based on standard Convolutional Neural Networks. Thus, the aim of this paper is to leverage large scale audio-visual information to improve speaker recognition task. To achieve this task, we proposed a two-branch network to learn joint representations of faces and voices in a multimodal system. Afterwards, features are extracted from the two-branch network to train a classifier for speaker recognition. We evaluated our proposed framework on a large scale audio-visual dataset named VoxCeleb$1$. Our results show that addition of facial information improved the performance of speaker recognition. Moreover, our results indicate that there is an overlap between face and voice. △ Less

Submitted 25 February, 2023; originally announced February 2023.

Comments: Accepted at the International Conference on Artificial Intelligence (ICAI'2023)

arXiv:2210.06334 [pdf, other]

A Self-attention Guided Multi-scale Gradient GAN for Diversified X-ray Image Synthesis

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: Imbalanced image datasets are commonly available in the domain of biomedical image analysis. Biomedical images contain diversified features that are significant in predicting targeted diseases. Generative Adversarial Networks (GANs) are utilized to address the data limitation problem via the generation of synthetic images. Training challenges such as mode collapse, non-convergence, and instability… ▽ More Imbalanced image datasets are commonly available in the domain of biomedical image analysis. Biomedical images contain diversified features that are significant in predicting targeted diseases. Generative Adversarial Networks (GANs) are utilized to address the data limitation problem via the generation of synthetic images. Training challenges such as mode collapse, non-convergence, and instability degrade a GAN's performance in synthesizing diversified and high-quality images. In this work, MSG-SAGAN, an attention-guided multi-scale gradient GAN architecture is proposed to model the relationship between long-range dependencies of biomedical image features and improves the training performance using a flow of multi-scale gradients at multiple resolutions in the layers of generator and discriminator models. The intent is to reduce the impact of mode collapse and stabilize the training of GAN using an attention mechanism with multi-scale gradient learning for diversified X-ray image synthesis. Multi-scale Structural Similarity Index Measure (MS-SSIM) and Frechet Inception Distance (FID) are used to identify the occurrence of mode collapse and evaluate the diversity of synthetic images generated. The proposed architecture is compared with the multi-scale gradient GAN (MSG-GAN) to assess the diversity of generated synthetic images. Results indicate that the MSG-SAGAN outperforms MSG-GAN in synthesizing diversified images as evidenced by the MS-SSIM and FID scores. △ Less

Submitted 12 November, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

Comments: Accepted in AICS-2022 Conference

arXiv:2208.08224 [pdf, other]

doi 10.3390/s22166088

Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture

Authors: Muhammad Muzammel, Mohd Zuki Yusoff, Mohamad Naufal Mohamad Saad, Faryal Sheikh, Muhammad Ahsan Awais

Abstract: Buses and heavy vehicles have more blind spots compared to cars and other road vehicles due to their large sizes. Therefore, accidents caused by these heavy vehicles are more fatal and result in severe injuries to other road users. These possible blind-spot collisions can be identified early using vision-based object detection approaches. Yet, the existing state-of-the-art vision-based object dete… ▽ More Buses and heavy vehicles have more blind spots compared to cars and other road vehicles due to their large sizes. Therefore, accidents caused by these heavy vehicles are more fatal and result in severe injuries to other road users. These possible blind-spot collisions can be identified early using vision-based object detection approaches. Yet, the existing state-of-the-art vision-based object detection models rely heavily on a single feature descriptor for making decisions. In this research, the design of two convolutional neural networks (CNNs) based on high-level feature descriptors and their integration with faster R-CNN is proposed to detect blind-spot collisions for heavy vehicles. Moreover, a fusion approach is proposed to integrate two pre-trained networks (i.e., Resnet 50 and Resnet 101) for extracting high level features for blind-spot vehicle detection. The fusion of features significantly improves the performance of faster R-CNN and outperformed the existing state-of-the-art methods. Both approaches are validated on a self-recorded blind-spot vehicle detection dataset for buses and an online LISA dataset for vehicle detection. For both proposed approaches, a false detection rate (FDR) of 3.05% and 3.49% are obtained for the self recorded dataset, making these approaches suitable for real time applications. △ Less

Submitted 19 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

arXiv:2208.05593 [pdf, other]

Evaluating the Quality and Diversity of DCGAN-based Generatively Synthesized Diabetic Retinopathy Imagery

Authors: Cristina-Madalina Dragan, Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: Publicly available diabetic retinopathy (DR) datasets are imbalanced, containing limited numbers of images with DR. This imbalance contributes to overfitting when training machine learning classifiers. The impact of this imbalance is exacerbated as the severity of the DR stage increases, affecting the classifiers' diagnostic capacity. The imbalance can be addressed using Generative Adversarial Net… ▽ More Publicly available diabetic retinopathy (DR) datasets are imbalanced, containing limited numbers of images with DR. This imbalance contributes to overfitting when training machine learning classifiers. The impact of this imbalance is exacerbated as the severity of the DR stage increases, affecting the classifiers' diagnostic capacity. The imbalance can be addressed using Generative Adversarial Networks (GANs) to augment the datasets with synthetic images. Generating synthetic images is advantageous if high-quality and diversified images are produced. To evaluate the quality and diversity of synthetic images, several evaluation metrics, such as Multi-Scale Structural Similarity Index (MS-SSIM), Cosine Distance (CD), and Fréchet Inception Distance (FID) are used. Understanding the effectiveness of each metric in evaluating the quality and diversity of GAN-based synthetic images is critical to select images for augmentation. To date, there has been limited analysis of the appropriateness of these metrics in the context of biomedical imagery. This work contributes an empirical assessment of these evaluation metrics as applied to synthetic Proliferative DR imagery generated by a Deep Convolutional GAN (DCGAN). Furthermore, the metrics' capacity to indicate the quality and diversity of synthetic images and a correlation with classifier performance is undertaken. This enables a quantitative selection of synthetic imagery and an informed augmentation strategy. Results indicate that FID is suitable for evaluating the quality, while MS-SSIM and CD are suitable for evaluating the diversity of synthetic imagery. Furthermore, the superior performance of Convolutional Neural Network (CNN) and EfficientNet classifiers, as indicated by the F1 and AUC scores, for the augmented datasets demonstrates the efficacy of synthetic imagery to augment the imbalanced dataset. △ Less

Submitted 30 August, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

Comments: 29 Pages, 8 Figures, submitted to MEDAL23: Advances in Deep Generative Models for Medical Artificial Intelligence (Springer Nature series)

arXiv:2208.04705 [pdf, other]

Classification of Stress via Ambulatory ECG and GSR Data

Authors: Zachary Dair, Muhammad Muneeb Saad, Urja Pawar, Samantha Dockray, Ruairi O'Reilly

Abstract: In healthcare, detecting stress and enabling individuals to monitor their mental health and wellbeing is challenging. Advancements in wearable technology now enable continuous physiological data collection. This data can provide insights into mental health and behavioural states through psychophysiological analysis. However, automated analysis is required to provide timely results due to the quant… ▽ More In healthcare, detecting stress and enabling individuals to monitor their mental health and wellbeing is challenging. Advancements in wearable technology now enable continuous physiological data collection. This data can provide insights into mental health and behavioural states through psychophysiological analysis. However, automated analysis is required to provide timely results due to the quantity of data collected. Machine learning has shown efficacy in providing an automated classification of physiological data for health applications in controlled laboratory environments. Ambulatory uncontrolled environments, however, provide additional challenges requiring further modelling to overcome. This work empirically assesses several approaches utilising machine learning classifiers to detect stress using physiological data recorded in an ambulatory setting with self-reported stress annotations. A subset of the training portion SMILE dataset enables the evaluation of approaches before submission. The optimal stress detection approach achieves 90.77% classification accuracy, 91.24 F1-Score, 90.42 Sensitivity and 91.08 Specificity, utilising an ExtraTrees classifier and feature imputation methods. Meanwhile, accuracy on the challenge data is much lower at 59.23% (submission #54 from BEaTS-MTU, username ZacDair). The cause of the performance disparity is explored in this work. △ Less

Submitted 8 June, 2023; v1 submitted 19 July, 2022; originally announced August 2022.

Comments: Associated Code to enable reproducible experimental work - https://github.com/ZacDair/EMBC_Release SMILE dataset provided by Computational Wellbeing Group (COMPWELL) https://compwell.rice.edu/workshops/embc2022/dataset - https://compwell.rice.edu/

ACM Class: I.2.m; J.3; J.4

Journal ref: EMBC 2022 Compwell Workshop

arXiv:2201.10324 [pdf, other]

Addressing the Intra-class Mode Collapse Problem using Adaptive Input Image Normalization in GAN-based X-ray Images

Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag… ▽ More Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imagery. Furthermore, the absence of diverse features in synthetic images can degrade the performance of machine learning classifiers. The mode collapse problem can impact a Generative Adversarial Network's capacity to generate diversified images. Mode collapse comes in two varieties: intra-class and inter-class. In this paper, the intra-class mode collapse problem is investigated, and its subsequent impact on the diversity of synthetic X-ray images is evaluated. This work contributes an empirical demonstration of the benefits of integrating the adaptive input-image normalization for the Deep Convolutional GAN to alleviate the intra-class mode collapse problem. Results demonstrate that the DCGAN with adaptive input-image normalization outperforms DCGAN with un-normalized X-ray images as evident by the superior diversity scores. △ Less

Submitted 12 April, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

Comments: Accepted to the IEEE EMBC22 Conference

arXiv:2201.07219 [pdf, other]

Contrastive Pretraining for Echocardiography Segmentation with Limited Data

Authors: Mohamed Saeed, Rand Muhtaseb, Mohammad Yaqub

Abstract: Contrastive learning has proven useful in many applications where access to labelled data is limited. The lack of annotated data is particularly problematic in medical image segmentation as it is difficult to have clinical experts manually annotate large volumes of data such as cardiac structures in ultrasound images of the heart. In this paper, We propose a self supervised contrastive learning me… ▽ More Contrastive learning has proven useful in many applications where access to labelled data is limited. The lack of annotated data is particularly problematic in medical image segmentation as it is difficult to have clinical experts manually annotate large volumes of data such as cardiac structures in ultrasound images of the heart. In this paper, We propose a self supervised contrastive learning method to segment the left ventricle from echocardiography when limited annotated images exist. Furthermore, we study the effect of contrastive pretraining on two well-known segmentation networks, UNet and DeepLabV3. Our results show that contrastive pretraining helps improve the performance on left ventricle segmentation, particularly when annotated data is scarce. We show how to achieve comparable results to state-of-the-art fully supervised algorithms when we train our models in a self-supervised fashion followed by fine-tuning on just 5\% of the data. We show that our solution outperforms what is currently published on a large public dataset (EchoNet-Dynamic) achieving a Dice score of 0.9252. We also compare the performance of our solution on another smaller dataset (CAMUS) to demonstrate the generalizability of our proposed solution. The code is available at (https://github.com/BioMedIA-MBZUAI/contrastive-echo). △ Less

Submitted 14 July, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

arXiv:2201.06271 [pdf]

Wireless Connectivity in the Sub-THz Spectrum: A Path to 6G

Authors: Simon Bicaïs, Jean-Baptiste Doré, Majed Saad, Mohammad Alawieh, Faouzi Bader, Jacques Palicot, Yoann Corre, Gregory Gougeon, Emmanuel Faussurier

Abstract: Wireless communication in millimetre wave bands, namely above 20 GHz and up to 300 GHz, is foreseen as a key enabler technology for the next generation of wireless systems. The huge available bandwidth is contemplated to achieve high data-rate wireless communications, and hence, to fulfil the requirements of future wireless networks. In this paper, we discuss and illustrate new paradigms for the s… ▽ More Wireless communication in millimetre wave bands, namely above 20 GHz and up to 300 GHz, is foreseen as a key enabler technology for the next generation of wireless systems. The huge available bandwidth is contemplated to achieve high data-rate wireless communications, and hence, to fulfil the requirements of future wireless networks. In this paper, we discuss and illustrate new paradigms for the sub-THz physical layer, which either aim at maximizing the spectral efficiency, minimizing the device complexity, or finding good tradeoff. The solutions offered by appropriate modulation schemes and multi-antenna systems are assessed based on various potential scenarios. △ Less

Submitted 17 January, 2022; originally announced January 2022.

arXiv:2107.13643 [pdf]

Lighter Stacked Hourglass Human Pose Estimation

Authors: Ahmed Elhagry, Mohamed Saeed, Musie Araia

Abstract: Human pose estimation (HPE) is one of the most challenging tasks in computer vision as humans are deformable by nature and thus their pose has so much variance. HPE aims to correctly identify the main joint locations of a single person or multiple people in a given image or video. Locating joints of a person in images or videos is an important task that can be applied in action recognition and obj… ▽ More Human pose estimation (HPE) is one of the most challenging tasks in computer vision as humans are deformable by nature and thus their pose has so much variance. HPE aims to correctly identify the main joint locations of a single person or multiple people in a given image or video. Locating joints of a person in images or videos is an important task that can be applied in action recognition and object tracking. As have many computer vision tasks, HPE has advanced massively with the introduction of deep learning to the field. In this paper, we focus on one of the deep learning-based approaches of HPE proposed by Newell et al., which they named the stacked hourglass network. Their approach is widely used in many applications and is regarded as one of the best works in this area. The main focus of their approach is to capture as much information as it can at all possible scales so that a coherent understanding of the local features and full-body location is achieved. Their findings demonstrate that important cues such as orientation of a person, arrangement of limbs, and adjacent joints' relative location can be identified from multiple scales at different resolutions. To do so, they makes use of a single pipeline to process images in multiple resolutions, which comprises a skip layer to not lose spatial information at each resolution. The resolution of the images stretches as lower as 4x4 to make sure that a smaller spatial feature is included. In this study, we study the effect of architectural modifications on the computational speed and accuracy of the network. △ Less

Submitted 28 July, 2021; originally announced July 2021.

arXiv:2102.09099 [pdf]

doi 10.1093/gigascience/giac037

NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation

Authors: Mohamed Amgad, Lamees A. Atteya, Hagar Hussein, Kareem Hosny Mohammed, Ehab Hafiz, Maha A. T. Elsebaie, Ahmed M. Alhusseiny, Mohamed Atef AlMoslemany, Abdelmagid M. Elmatboly, Philip A. Pappalardo, Rokia Adel Sakr, Pooya Mobadersany, Ahmad Rachid, Anas M. Saad, Ahmad M. Alkashash, Inas A. Ruhban, Anas Alrefai, Nada M. Elgazar, Ali Abdulkarim, Abo-Alela Farag, Amira Etman, Ahmed G. Elsaeed, Yahya Alagha, Yomna A. Amer, Ahmed M. Raslan , et al. (12 additional authors not shown)

Abstract: High-resolution mapping of cells and tissue structures provides a foundation for developing interpretable machine-learning models for computational pathology. Deep learning algorithms can provide accurate mappings given large numbers of labeled instances for training and validation. Generating adequate volume of quality labels has emerged as a critical barrier in computational pathology given the… ▽ More High-resolution mapping of cells and tissue structures provides a foundation for developing interpretable machine-learning models for computational pathology. Deep learning algorithms can provide accurate mappings given large numbers of labeled instances for training and validation. Generating adequate volume of quality labels has emerged as a critical barrier in computational pathology given the time and effort required from pathologists. In this paper we describe an approach for engaging crowds of medical students and pathologists that was used to produce a dataset of over 220,000 annotations of cell nuclei in breast cancers. We show how suggested annotations generated by a weak algorithm can improve the accuracy of annotations generated by non-experts and can yield useful data for training segmentation algorithms without laborious manual tracing. We systematically examine interrater agreement and describe modifications to the MaskRCNN model to improve cell mapping. We also describe a technique we call Decision Tree Approximation of Learned Embeddings (DTALE) that leverages nucleus segmentations and morphologic features to improve the transparency of nucleus classification models. The annotation data produced in this study are freely available for algorithm development and benchmarking at: https://sites.google.com/view/nucls. △ Less

Submitted 17 February, 2021; originally announced February 2021.

Journal ref: GigaScience, 11 (2022)

arXiv:2102.05436 [pdf, other]

Range Estimation of a Moving Target Using Ultrasound Differential Zadoff-Chu Codes

Authors: Mohammed H. AlSharif, Mohamed Saad, Mohamed Siala, Mohanad Ahmed, Tareq Y. Al-Naffouri

Abstract: High accuracy range estimation is an essential tool required in many modern applications and technologies. However, continuous range estimation of a moving target is a challenging task, especially under Doppler effects. This paper presents a novel signal design, which we name differential Zadoff-Chu (DZC). Under Doppler effects, DZC sequences improve the performance of the maximum likelihood (ML)-… ▽ More High accuracy range estimation is an essential tool required in many modern applications and technologies. However, continuous range estimation of a moving target is a challenging task, especially under Doppler effects. This paper presents a novel signal design, which we name differential Zadoff-Chu (DZC). Under Doppler effects, DZC sequences improve the performance of the maximum likelihood (ML)-based range estimation compared to its performance when using regular ZC sequences. Moreover, a reduced-complexity ranging algorithm is proposed utilizing DZC sequences and is shown to outperform the regular ZC ML-based range estimation. The proposed system is evaluated in a typical indoor environment, using low-cost ultrasound hardware. Under a low signal to noise ratio (-10 dB SNR), more than 90% of the range estimates are in less than 1.6 mm error, with a movement range from $0.2$ m to 2.2 m and a maximum velocity of 0.5 m/s. For the same movement range, the system provides range estimates with a root mean square error (RMSE) less than 0.76 mm in a high SNR scenario (10 dB), and an MSE less than 0.85 mm in a low SNR scenario (-10 dB). For a larger movement range from 1.8 m to 4.2 m with a maximum velocity of 1.91 m/s, the proposed system provides range estimates with RMSE less than 7.70 mm at 10 dB SNR. △ Less

Submitted 10 February, 2021; originally announced February 2021.

arXiv:2007.05503 [pdf, other]

Predicting Bit Error Rate from Meta Information using Random Forests

Authors: Jianyuan Yu, Yue Xu, Hussein Metwaly Saad, R. Michael Buehrer

Abstract: With the increasing power of machine learning-based reasoning, the use of meta-information (e.g., digital signal modulation parameters, channel conditions, etc.) to predict the performance of various signal processing techniques has become feasible. One such problem of practical interest is choosing a proper interference mitigation method based on the meta information of the received signal. Since… ▽ More With the increasing power of machine learning-based reasoning, the use of meta-information (e.g., digital signal modulation parameters, channel conditions, etc.) to predict the performance of various signal processing techniques has become feasible. One such problem of practical interest is choosing a proper interference mitigation method based on the meta information of the received signal. Since heuristic table-based methods suffer from limited prediction capability for unseen cases, we propose a recommendation system based on the use of Random Forests (RF). Specifically, RF used to predict the Bit-Error-Rate (BER) of all mitigation approaches so as to determine the approach with the best performance. We found RF can predict BER with high accuracy, and its importance factor demonstrates which input attributes matter most. These BER prediction results can also benefit other functions such as adaptive modulation, channel sensing, beaming selection, etc. △ Less

Submitted 10 July, 2020; originally announced July 2020.

arXiv:2005.01813 [pdf]

Impact of user distribution on optical wireless systems

Authors: Khulood D. Alazwary, Osama Zwaid Alsulami, Sarah O. M. Saeed, Sanaa Hamid Mohamed, T. E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: In this paper, we investigate the impact of user distribution on resource allocation in visible light communication (VLC) systems, using a wavelength division multiple access (WDMA) scheme. Two different room layouts are examined in this study. Three 10-user scenarios are considered, while an optical angle diversity receiver (ADR) with four faces is used. A mixed-integer linear programming (MILP)… ▽ More In this paper, we investigate the impact of user distribution on resource allocation in visible light communication (VLC) systems, using a wavelength division multiple access (WDMA) scheme. Two different room layouts are examined in this study. Three 10-user scenarios are considered, while an optical angle diversity receiver (ADR) with four faces is used. A mixed-integer linear programming (MILP) model is utilized to identify the optimum wavelengths and access point (AP) allocation in each scenario. The results show that a change in user distribution can affect the level of channel bandwidth and SINR. However, a uniform distribution of users in the room can provide a higher channel bandwidth as well as high SINR above the threshold (15.6 dB) for all users compared to clustered users, which is a scenario that has the lowest SINR with supported data rate above 3.2 Gbps. △ Less

Submitted 4 May, 2020; originally announced May 2020.

arXiv:2004.14922 [pdf]

Resilience in Optical Wireless Systems

Authors: Sarah O. M. Saeed, Sanaa Hamid Mohamed, Osama Zwaid Alsulami, Mohammed T. Alresheedi, Taisir E. H. Elgorashi, Jaafar M. H. Elmirghani

Abstract: High reliability and availability of communication services is a key requirement that needs to be ensured by service providers. Since the direct line-of-sight (LOS) beam is prone to blockage in indoor optical wireless communication systems, a backup link needs to be at hand in case of blockage, and hence channel allocation algorithms should be blockage-aware. In this paper, the impact of beam bloc… ▽ More High reliability and availability of communication services is a key requirement that needs to be ensured by service providers. Since the direct line-of-sight (LOS) beam is prone to blockage in indoor optical wireless communication systems, a backup link needs to be at hand in case of blockage, and hence channel allocation algorithms should be blockage-aware. In this paper, the impact of beam blockage due to a disc with varying size and distance from the receiver is studied where blockage is quantitatively evaluated using percentage blockage for 512 room locations at 25 cm separation. It was found that assigning two links with maximum separation between the serving access points can reduce or eliminate blockage compared to the case when resilience is not implemented. Increasing the number of allocated access points per user further increases resilience. △ Less

Submitted 30 April, 2020; originally announced April 2020.

arXiv:2004.13780 [pdf, other]

Cross-modal Speaker Verification and Recognition: A Multilingual Perspective

Authors: Muhammad Saad Saeed, Shah Nawaz, Pietro Morerio, Arif Mahmood, Ignazio Gallo, Muhammad Haroon Yousaf, Alessio Del Bue

Abstract: Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition. Inspired from this, we introduce a challenging task in establishing association between faces and voices across multiple languages spoken by the same set of persons. The aim of this paper is to answer two closely related questions: "Is face-voice… ▽ More Recent years have seen a surge in finding association between faces and voices within a cross-modal biometric application along with speaker recognition. Inspired from this, we introduce a challenging task in establishing association between faces and voices across multiple languages spoken by the same set of persons. The aim of this paper is to answer two closely related questions: "Is face-voice association language independent?" and "Can a speaker be recognised irrespective of the spoken language?". These two questions are very important to understand effectiveness and to boost development of multilingual biometric systems. To answer them, we collected a Multilingual Audio-Visual dataset, containing human speech clips of $154$ identities with $3$ language annotations extracted from various videos uploaded online. Extensive experiments on the three splits of the proposed dataset have been performed to investigate and answer these novel research questions that clearly point out the relevance of the multilingual problem. △ Less

Submitted 22 April, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

Comments: Accepted: CVPRW

arXiv:2004.11159 [pdf]

Beam Blockage in Optical Wireless Systems

Authors: Sarah O. M. Saeed, Sanaa Hamid Mohamed, Osama Zwaid Alsulami, Mohammed T. Alresheedi, Taisir E. H. Elgorashi, Jaafar M. H. Elmirghani

Abstract: In this paper, we use the percentage blockage as a metric when an opaque disc obstructs the Line-of-Sight link from the access point to the receiver in an optical wireless indoor communication system. The effect of the different parameters of the obstructing object are studied, these are the radius, the height, and the horizontal distance from the receiver in the positive y direction. The percenta… ▽ More In this paper, we use the percentage blockage as a metric when an opaque disc obstructs the Line-of-Sight link from the access point to the receiver in an optical wireless indoor communication system. The effect of the different parameters of the obstructing object are studied, these are the radius, the height, and the horizontal distance from the receiver in the positive y direction. The percentage of blocked room locations to the total number of room locations when varying the disc parameters is studied assuming a single serving link. It was found that depending on the dimensions of the obstructing object and the distance from the receiver in addition to which access point is serving the user, that blockage can vary between 0% up to 100%. Furthermore, the service received by a user, in terms of beam blockage depends on the access point they are connected to. The resulting fairness challenges will be addressed in resource allocation optimization in future work. △ Less

Submitted 23 April, 2020; originally announced April 2020.

arXiv:2004.08739 [pdf]

Resource Allocation in Co-existing Optical Wireless HetNets

Authors: Osama Zwaid Alsulami, Sarah O. M. Saeed, Sanaa Hamid Mohamed, T. E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: In multi-user optical wireless communication (OWC) systems interference between users and cells can significantly affect the quality of OWC links. Thus, in this paper, a mixed-integer linear programming (MILP) model is developed to establish the optimum resource allocation in wavelength division multiple access (WDMA) optical wireless systems. Consideration is given to the optimum allocation of wa… ▽ More In multi-user optical wireless communication (OWC) systems interference between users and cells can significantly affect the quality of OWC links. Thus, in this paper, a mixed-integer linear programming (MILP) model is developed to establish the optimum resource allocation in wavelength division multiple access (WDMA) optical wireless systems. Consideration is given to the optimum allocation of wavelengths and access points (APs) to each user to support multiple users in an environment where Micro, Pico and Atto Cells co-exist for downlink communication. The high directionality of light rays in small cells, such as Pico and Atto cells, can offer a very high signal to noise and interference ratio (SINR) at high data rates. Consideration is given in this work to visible light communication links which utilise four wavelengths per access point (red, green, yellow and blue) for Pico and Atto cells systems, while the Micro cell system uses an infrared (IR) transmitter. Two 10-user scenarios are considered in this work. All users in both scenarios achieve a high optical channel bandwidth beyond 7.8 GHz. In addition, all users in the two scenarios achieve high SINR beyond the threshold (15.6 dB) needed for 10-9 on off keying (OOK) bit error rate at a data rate of 7.1 Gbps. △ Less

Submitted 18 April, 2020; originally announced April 2020.

arXiv:2003.01838 [pdf]

Effect of receiver orientation on resource allocation in optical wireless systems

Authors: Osama Zwaid Alsulami, Khulood D. Alazwary, Sarah O. M. Saeed, Sanaa Hamid Mohamed, Taisir E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: Optical wireless communication (OWC) systems have been the subject of a significant amount of interest as they can be used in sixth generation (6G) wireless communication to provide high data rates and support multiple users simultaneously. This paper investigates the impact of receiver orientation on resource allocation in optical wireless systems, using a wavelength division multiple access (WDM… ▽ More Optical wireless communication (OWC) systems have been the subject of a significant amount of interest as they can be used in sixth generation (6G) wireless communication to provide high data rates and support multiple users simultaneously. This paper investigates the impact of receiver orientation on resource allocation in optical wireless systems, using a wavelength division multiple access (WDMA) scheme. Three different systems that have different receiver orientations are examined in this work. Each of these systems considers 8 simultaneous users in two scenarios. WDMA is utilised to support multiple users and is based on four wavelengths offered by Red, Yellow, Green and Blue (RYGB) LDs for each AP. An angle diversity receiver (ADR) is used in each system with different orientations. The optimised resource allocations in terms of wavelengths and access point (AP) is obtained by using a mixed-integer linear programming (MILP) model. The channel bandwidth and SINR are determined in the two scenarios in all systems. The results show that a change in the orientation of the receiver can affect the level of channel bandwidth and SINR. However, SINRs in both scenarios for all users are above the threshold (15.6 dB). The SINR obtained can support t data rate of 5.7 Gbps in both scenarios in all systems. △ Less

Submitted 3 March, 2020; originally announced March 2020.

arXiv:2002.09430 [pdf]

Shared optical wireless cells for in-cabin aircraft links

Authors: Osama Zwaid Alsulami, Sarah O. M. Saeed, Sanaa Hamid Mohamed, T. E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: The design of a wireless communication system that can support multiple users at high data rates inside an aircraft is a key requirement of aircraft manufacturers. This paper examines the design of an on-board visible light communication (VLC) system for transmitting data on board Boeing 747-400 aircraft. The reading light unit of each seat is utilised as an optical transmitter. A red, yellow, gre… ▽ More The design of a wireless communication system that can support multiple users at high data rates inside an aircraft is a key requirement of aircraft manufacturers. This paper examines the design of an on-board visible light communication (VLC) system for transmitting data on board Boeing 747-400 aircraft. The reading light unit of each seat is utilised as an optical transmitter. A red, yellow, green, and blue (RYGB) laser diode (LD) is used in each reading light unit for transmitting data. An angle diversity receiver (ADR), which is an optical receiver that is composed of four branches (in this work), is evaluated. The signal-to-interference-plus-noise ratio (SINR) and data rate are determined. Three scenarios have been examined where, in the first scenario, one device is used, in the second scenario two devices are used and in the third scenario three devices are used by each passenger. The proposed system can offer high SINRs that support high data rates for each passenger by using simple on-off-keying (OOK). △ Less

Submitted 21 February, 2020; originally announced February 2020.

arXiv:2002.09234 [pdf]

Impact of room size on WDM optical wireless links with multiple access points and angle diversity receivers

Authors: Osama Zwaid Alsulami, Mansourah K. A. Aljohani, Sarah O. M. Saeed, Sanaa Hamid Mohamed, T. E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: Optical wireless communication (OWC) systems have been the subject of attention as a promising wireless communication technology that can offer high data rates and support multiple users simultaneously. In this paper, the impact of room size is investigated when using wavelength division multiple access (WDMA) in conjunction with an angle diversity receiver (ADR). Four wavelengths (red, yellow, gr… ▽ More Optical wireless communication (OWC) systems have been the subject of attention as a promising wireless communication technology that can offer high data rates and support multiple users simultaneously. In this paper, the impact of room size is investigated when using wavelength division multiple access (WDMA) in conjunction with an angle diversity receiver (ADR). Four wavelengths (red, yellow, green and blue) can be provided in this work based on the RYGB LDs transmitter used. Three room sizes are considered with two 8-user scenarios. A mixed-integer linear programming (MILP) model is proposed for the purpose of optimising the resource allocation. The optical channel bandwidth, SINR and data rate have been calculated for each user in both scenarios in all rooms. Room A, which is the largest room, can provide a higher channel bandwidth and SINR compared to the other rooms. However, all rooms can provide a data rate above 5 Gbps in both scenarios. △ Less

Submitted 21 February, 2020; originally announced February 2020.

arXiv:2002.01580 [pdf]

Data centre optical wireless downlink with WDM and multi-access point support

Authors: O. Z. Alsulami, S. O. M. Saeed, S. H. Mohamed, T. E. H. El-Gorashi, M. T. Alresheedi, J. M. H. Elmirghani

Abstract: The ability to provide very high data rates is a significant benefit of optical wireless communication (OWC) systems. In this paper, an optical wireless downlink in a data centre that uses wavelength division multiple access (WDMA) is designed. Red, yellow, green and blue (RYGB) laser diodes (LDs) are used as transmitters to provide a high modulation bandwidth. A WDMA scheme based on RYGB LDs is u… ▽ More The ability to provide very high data rates is a significant benefit of optical wireless communication (OWC) systems. In this paper, an optical wireless downlink in a data centre that uses wavelength division multiple access (WDMA) is designed. Red, yellow, green and blue (RYGB) laser diodes (LDs) are used as transmitters to provide a high modulation bandwidth. A WDMA scheme based on RYGB LDs is used to provide communication for multiple racks at the same time from the same light unit. Two types of optical receivers are examined in this study; an angle diversity receiver (ADR) with three branches and a 10 pixel imaging receiver (ImR). The proposed data centre achieves high data rates with a higher signal-to-interference-plus-noise ratio (SINR) for each rack while using simple on-off-keying (OOK) modulation. △ Less

Submitted 4 February, 2020; originally announced February 2020.

arXiv:1907.09544 [pdf]

Networking and processing in optical wireless

Authors: Osama Zwaid Alsulami, Amal A. Alahmadi, Sarah O. M. Saeed, Sanaa Hamid Mohamed, T. E. H. El-Gorashi, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: Optical wireless communication (OWC) is a promising technology that can provide high data rates while supporting multiple users. The Optical Wireless (OW) physical layer has been researched extensively, however less work was devoted to multiple access and how the OW front end is connected to the network. In this paper, an OWC system which employs a wavelength division multiple access (WDMA) scheme… ▽ More Optical wireless communication (OWC) is a promising technology that can provide high data rates while supporting multiple users. The Optical Wireless (OW) physical layer has been researched extensively, however less work was devoted to multiple access and how the OW front end is connected to the network. In this paper, an OWC system which employs a wavelength division multiple access (WDMA) scheme is studied, for the purpose of supporting multiple users. In addition, a cloud/fog architecture is proposed for the first time for OWC to provide processing capabilities. The cloud/fog-integrated architecture uses visible indoor light to create high data rate connections with potential mobile nodes. These optical wireless nodes are further clustered and used as fog mini servers to provide processing services through the optical wireless channel for other users. Additional fog processing units are located in the room, the building, the campus and at the metro level. Further processing capabilities are provided by remote cloud sites. A mixed-integer linear programming (MILP) model was developed and utilised to optimise resource allocation in the indoor OWC system. A second MILP model was developed to optimise the placement of processing tasks in the different fog and cloud nodes available. The optimisation of tasks placement in the cloud-/fog-integrated architecture was analysed using the MILP models. Multiple scenarios were considered where the mobile node locations were varied in the room and the amount of processing and data rate requested by each optical wireless node is varied. The results help identify the optimum colour and access point to use for communication for a given mobile node location and OWC system configuration, the optimum location to place processing and the impact of the network architecture. Areas for future work are identified. △ Less

Submitted 22 July, 2019; originally announced July 2019.

arXiv:1907.07671 [pdf, other]

Electroencephalography based Classification of Long-term Stress using Psychological Labeling

Authors: Sanay Muhammad Umar Saeed, Syed Muhammad Anwar, Humaira Khalid, Muhammad Majid, Ulas Bagci

Abstract: Stress research is a rapidly emerging area in thefield of electroencephalography (EEG) based signal processing.The use of EEG as an objective measure for cost effective andpersonalized stress management becomes important in particularsituations such as the non-availability of mental health facilities.In this study, long-term stress is classified using baseline EEGsignal recordings. The labelling f… ▽ More Stress research is a rapidly emerging area in thefield of electroencephalography (EEG) based signal processing.The use of EEG as an objective measure for cost effective andpersonalized stress management becomes important in particularsituations such as the non-availability of mental health facilities.In this study, long-term stress is classified using baseline EEGsignal recordings. The labelling for the stress and control groupsis performed using two methods (i) the perceived stress scalescore and (ii) expert evaluation. The frequency domain featuresare extracted from five-channel EEG recordings in addition tothe frontal and temporal alpha and beta asymmetries. The alphaasymmetry is computed from four channels and used as a feature.Feature selection is also performed using a t-test to identifystatistically significant features for both stress and control groups.We found that support vector machine is best suited to classifylong-term human stress when used with alpha asymmetry asa feature. It is observed that expert evaluation based labellingmethod has improved the classification accuracy up to 85.20%.Based on these results, it is concluded that alpha asymmetry maybe used as a potential bio-marker for stress classification, when labels are assigned using expert evaluation. △ Less

Submitted 16 July, 2019; originally announced July 2019.

Comments: Submitted to IEEE JBHI

arXiv:1904.04548 [pdf]

Optimized Resource Allocation in Multi-user WDM VLC Systems

Authors: Sarah O. M. Saeed, Sanaa Hamid Mohamed, Osama Zwaid Alsulami, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

Abstract: In this paper, we address the optimization of wavelength resource allocation in multi-user WDM Visible Light Communication (VLC) systems. A Mixed Integer Linear Programming (MILP) model that maximizes the sum of Signal-to-Interference-plus-Noise-Ratio (SINR) for all users is utilized. The results show that optimizing the wavelength allocation in multi-user WDM VLC systems can reduce the impact of… ▽ More In this paper, we address the optimization of wavelength resource allocation in multi-user WDM Visible Light Communication (VLC) systems. A Mixed Integer Linear Programming (MILP) model that maximizes the sum of Signal-to-Interference-plus-Noise-Ratio (SINR) for all users is utilized. The results show that optimizing the wavelength allocation in multi-user WDM VLC systems can reduce the impact of the interference and improve the system throughput in terms of the sum of data rates for up to 7 users. △ Less

Submitted 9 April, 2019; originally announced April 2019.

arXiv:1710.08623 [pdf, other]

Hand Gesture Recognition Using Ultrasonic Waves

Authors: Mohammed H. AlSharif, Mohamed Saad, Tareq Y. Al-Naffouri

Abstract: This paper presents a new method for detecting and classifying a predefined set of hand gestures using a single transmitter and a single receiver utilizing a linearly frequency modulated ultrasonic signal. Gestures are identified based on estimated range and received signal strength (RSS) of reflected signal from the hand. Support Vector Machine (SVM) was used for gesture detection and classificat… ▽ More This paper presents a new method for detecting and classifying a predefined set of hand gestures using a single transmitter and a single receiver utilizing a linearly frequency modulated ultrasonic signal. Gestures are identified based on estimated range and received signal strength (RSS) of reflected signal from the hand. Support Vector Machine (SVM) was used for gesture detection and classification. The system was tested using experimental setup and achieved an average accuracy of 88%. △ Less

Submitted 24 October, 2017; originally announced October 2017.

Comments: 2 pages, Msc thesis paper

Showing 1–37 of 37 results for author: Saad, M