Search | arXiv e-print repository

arXiv:2502.20570 [pdf]

An Integrated Deep Learning Framework Leveraging NASNet and Vision Transformer with MixProcessing for Accurate and Precise Diagnosis of Lung Diseases

Authors: Sajjad Saleem, Muhammad Imran Sharif

Abstract: The lungs are the essential organs of respiration, and this system is significant in the carbon dioxide and exchange between oxygen that occurs in human life. However, several lung diseases, which include pneumonia, tuberculosis, COVID-19, and lung cancer, are serious healthiness challenges and demand early and precise diagnostics. The methodological study has proposed a new deep learning framewor… ▽ More The lungs are the essential organs of respiration, and this system is significant in the carbon dioxide and exchange between oxygen that occurs in human life. However, several lung diseases, which include pneumonia, tuberculosis, COVID-19, and lung cancer, are serious healthiness challenges and demand early and precise diagnostics. The methodological study has proposed a new deep learning framework called NASNet-ViT, which effectively incorporates the convolution capability of NASNet with the global attention mechanism capability of Vision Transformer ViT. The proposed model will classify the lung conditions into five classes: Lung cancer, COVID-19, pneumonia, TB, and normal. A sophisticated multi-faceted preprocessing strategy called MixProcessing has been used to improve diagnostic accuracy. This preprocessing combines wavelet transform, adaptive histogram equalization, and morphological filtering techniques. The NASNet-ViT model performs at state of the art, achieving an accuracy of 98.9%, sensitivity of 0.99, an F1-score of 0.989, and specificity of 0.987, outperforming other state of the art architectures such as MixNet-LD, D-ResNet, MobileNet, and ResNet50. The model's efficiency is further emphasized by its compact size, 25.6 MB, and a low computational time of 12.4 seconds, hence suitable for real-time, clinically constrained environments. These results reflect the high-quality capability of NASNet-ViT in extracting meaningful features and recognizing various types of lung diseases with very high accuracy. This work contributes to medical image analysis by providing a robust and scalable solution for diagnostics in lung diseases. △ Less

Submitted 27 February, 2025; originally announced February 2025.

arXiv:2501.11389 [pdf, other]

Resilience of LTE-A/5G-NR links Against Transient Electromagnetic Interference

Authors: Sharzeel Saleem, Mir Lodro

Abstract: This paper presents a comparative analysis of long-term evolution advanced (LTE-A) and fifth-generation new radio (5G-NR), focusing on the effects of Transient Electromagnetic Interference (EMI) caused by catenary-pantograph contact in a railway environment. We developed a software-defined radio (SDR)-based prototype for the performance evaluation of LTE-A and 5G-NR links in the presence of transi… ▽ More This paper presents a comparative analysis of long-term evolution advanced (LTE-A) and fifth-generation new radio (5G-NR), focusing on the effects of Transient Electromagnetic Interference (EMI) caused by catenary-pantograph contact in a railway environment. We developed a software-defined radio (SDR)-based prototype for the performance evaluation of LTE-A and 5G-NR links in the presence of transient interference. The results show that both links experience considerable degradation due to interference at different center frequencies. Performance degradation is proportional to the gain of interference. The measurement results show that both links experience considerable performance degradation in the presence of transient EM interference. △ Less

Submitted 20 January, 2025; originally announced January 2025.

Comments: 5 pages, 9 figures, 5 tables

arXiv:2312.05187 [pdf, other]

Seamless: Multilingual Expressive and Streaming Speech Translation

Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4T model-SeamlessM4T v2. This newer model, incorporating an updated UnitY2 framework, was trained on more low-resource language data. SeamlessM4T v2 provides the foundation on which our next two models are initiated. SeamlessExpressive enables translation that preserves vocal styles and prosody. Compared to previous efforts in expressive speech research, our work addresses certain underexplored aspects of prosody, such as speech rate and pauses, while also preserving the style of one's voice. As for SeamlessStreaming, our model leverages the Efficient Monotonic Multihead Attention mechanism to generate low-latency target translations without waiting for complete source utterances. As the first of its kind, SeamlessStreaming enables simultaneous speech-to-speech/text translation for multiple source and target languages. To ensure that our models can be used safely and responsibly, we implemented the first known red-teaming effort for multimodal machine translation, a system for the detection and mitigation of added toxicity, a systematic evaluation of gender bias, and an inaudible localized watermarking mechanism designed to dampen the impact of deepfakes. Consequently, we bring major components from SeamlessExpressive and SeamlessStreaming together to form Seamless, the first publicly available system that unlocks expressive cross-lingual communication in real-time. The contributions to this work are publicly released and accessible at https://github.com/facebookresearch/seamless_communication △ Less

Submitted 8 December, 2023; originally announced December 2023.

arXiv:2011.08972 [pdf, ps, other]

Reducing the Mutual Outage Probability of Cooperative Non-Orthogonal Multiple Access

Authors: Sana Riaz, Fahd Ahmed Khan, Sajid Saleem, Qasim Zeeshan Ahmed

Abstract: In this letter, a new power allocation scheme is proposed to improve the reliability of cooperative non-orthogonal multiple access (CO-NOMA). The strong user is allocated the maximum power, whereas the weak user is allocated the minimum power. This power allocation alters the decoding sequence along with the signal-to-interference plus noise ratio (SINR), at the users. The weak user benefits from… ▽ More In this letter, a new power allocation scheme is proposed to improve the reliability of cooperative non-orthogonal multiple access (CO-NOMA). The strong user is allocated the maximum power, whereas the weak user is allocated the minimum power. This power allocation alters the decoding sequence along with the signal-to-interference plus noise ratio (SINR), at the users. The weak user benefits from receiving multiple copies of the signal whereas the strong user benefits from the higher power allocation. Numerical simulation results show that the proposed scheme has a lower mutual outage probability (MOP) and offers better reliability as compared to the conventional power allocation scheme for CONOMA. An exact closed-form expression of MOP is derived for the two-user CO-NOMA system and it is shown that each user achieves full diversity. The proposed allocation is able to achieve approximately 30% higher transmission rate at 15 dB as compared to conventional CO-NOMA in a practical non-power balanced scenario. △ Less

Submitted 24 November, 2020; v1 submitted 28 October, 2020; originally announced November 2020.

arXiv:2007.03497 [pdf, other]

STBC-Aided Cooperative NOMA with Timing Offsets, Imperfect Successive Interference Cancellation, and Imperfect Channel State Information

Authors: Muhammad Waseem Akhtar, Syed Ali Hassan, Sajid Saleem, Haejoon Jung

Abstract: The combination of non-orthogonal multiple access(NOMA) and cooperative communications can be a suitable solution for fifth generation (5G) and beyond 5G (B5G) wireless systems with massive connectivity, because it can provide higher spectral efficiency, lower energy consumption, and improved fairness compared to the non-cooperative NOMA. However,the receiver complexity in the conventional coopera… ▽ More The combination of non-orthogonal multiple access(NOMA) and cooperative communications can be a suitable solution for fifth generation (5G) and beyond 5G (B5G) wireless systems with massive connectivity, because it can provide higher spectral efficiency, lower energy consumption, and improved fairness compared to the non-cooperative NOMA. However,the receiver complexity in the conventional cooperative NOMA increases with increasing number of users owing to successive interference cancellation (SIC) at each user. Space time block code-aided cooperative NOMA (STBC-CNOMA) offers less numbers of SIC as compared to that of conventional cooperative NOMA. In this paper, we evaluate the performance of STBC-CNOMA under practical challenges such as imperfect SIC, imperfect timing synchronization between distributed cooperating users, and imperfect channel state information (CSI). We derive closed-form expressions of the received signals in the presence of such realistic impairments and then use them to evaluate outage probability. Further, we provide intuitive insights into the impact of each impairment on the outage performance through asymptotic analysis at high transmit signal-to-noise ratio. We also compare the complexity of STBC-CNOMA with existing cooperative NOMA protocols for a given number of users. In addition, through analysis and simulation, we observe that the impact of the imperfect SIC on the outage performance of STBC-CNOMA is more significant compared to the other two imperfections. Therefore, considering the smaller number of SIC in STBC-CNOMA compared to the other cooperative NOMA protocols, STBC-CNOMA is an effective solution to achieve high reliability for the same SIC imperfection condition. △ Less

Submitted 7 July, 2020; originally announced July 2020.

Showing 1–5 of 5 results for author: Saleem, S