-
Effectively Identifying Wi-Fi Devices through State Transitions
Authors:
Melissa Safari,
Abhishek K. Mishra,
Mathieu Cunche
Abstract:
Wi-Fi management frames reveal structured communication patterns that persist even under randomization of MAC addresses. Prior approaches to associating randomized MAC addresses with devices primarily focus on probe requests, overlooking the broader set of management frames and their transition dynamics. This narrow focus limits their robustness in dense, real-world environments with high device mobility, where probe activity alone fails to yield stable and distinctive signatures. In this paper, we present a novel framework for fingerprinting Wi-Fi devices based on behavioral dynamics extracted from passively observed management frames. We model each device's behavior as a finite state machine and introduce matrix-based representations that encode both structural (state transition frequencies) and temporal (inter-state delays) characteristics. These matrices are embedded into compact feature vectors, enabling efficient similarity comparison. Through extensive evaluation in diverse real-world settings, our method achieves over 86% identification accuracy for non-randomized devices using only Wi-Fi management frames, with further improvements observed through temporal burst aggregation. Our findings are sufficient to uniquely and consistently characterize devices at scale, outperforming the state-of-the-art.
Submitted 3 July, 2025;
originally announced July 2025.
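The abstract above describes encoding a device's management-frame behavior as state-transition frequency and inter-state delay matrices, then comparing the resulting feature vectors. The following is a minimal sketch of that general idea, not the authors' implementation: the state list, the plain flattening used as an "embedding", and the use of cosine similarity are all assumptions made for illustration.

```python
import math

# Hypothetical state set (assumption): a few 802.11 management frame subtypes.
STATES = ["probe_req", "auth", "assoc_req", "reassoc_req", "disassoc", "deauth"]
IDX = {s: i for i, s in enumerate(STATES)}


def transition_matrices(frames):
    """Build structural and temporal matrices from one device's frames.

    frames: list of (timestamp_seconds, state) tuples sorted by time.
    Returns (freq, delay) where freq[i][j] is the row-normalized frequency of
    the transition i -> j and delay[i][j] is the mean delay of that transition.
    """
    n = len(STATES)
    counts = [[0] * n for _ in range(n)]
    delay_sum = [[0.0] * n for _ in range(n)]
    # Walk over consecutive frame pairs and accumulate counts and delays.
    for (t0, s0), (t1, s1) in zip(frames, frames[1:]):
        i, j = IDX[s0], IDX[s1]
        counts[i][j] += 1
        delay_sum[i][j] += t1 - t0
    freq = [[c / sum(row) if sum(row) else 0.0 for c in row] for row in counts]
    delay = [
        [delay_sum[i][j] / counts[i][j] if counts[i][j] else 0.0 for j in range(n)]
        for i in range(n)
    ]
    return freq, delay


def feature_vector(frames):
    """Flatten both matrices into one vector (a stand-in for the paper's embedding)."""
    freq, delay = transition_matrices(frames)
    return [v for row in freq for v in row] + [v for row in delay for v in row]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0
```

Two captures attributed to randomized MAC addresses could then be compared with `cosine_similarity(feature_vector(a), feature_vector(b))`; any decision threshold would have to be chosen empirically.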
-
Unifying Biomedical Vision-Language Expertise: Towards a Generalist Foundation Model via Multi-CLIP Knowledge Distillation
Authors:
Shansong Wang,
Zhecheng Jin,
Mingzhe Hu,
Mojtaba Safari,
Feng Zhao,
Chih-Wei Chang,
Richard LJ Qiu,
Justin Roper,
David S. Yu,
Xiaofeng Yang
Abstract:
CLIP models pretrained on natural images with billion-scale image-text pairs have demonstrated impressive capabilities in zero-shot classification, cross-modal retrieval, and open-ended visual answering. However, transferring this success to biomedicine is hindered by the scarcity of large-scale biomedical image-text corpora, the heterogeneity of image modalities, and fragmented data standards across institutions. These limitations hinder the development of a unified and generalizable biomedical foundation model trained from scratch. To overcome this, we introduce MMKD-CLIP, a generalist biomedical foundation model developed via Multiple Medical CLIP Knowledge Distillation. Rather than relying on billion-scale raw data, MMKD-CLIP distills knowledge from nine state-of-the-art domain-specific or generalist biomedical CLIP models, each pretrained on millions of biomedical image-text pairs. Our two-stage training pipeline first performs CLIP-style pretraining on over 2.9 million biomedical image-text pairs from 26 image modalities, followed by feature-level distillation using over 19.2 million feature pairs extracted from teacher models. We evaluate MMKD-CLIP on 58 diverse biomedical datasets, encompassing over 10.8 million biomedical images across nine image modalities. The evaluation spans six core task types: zero-shot classification, linear probing, cross-modal retrieval, visual question answering, survival prediction, and cancer diagnosis. MMKD-CLIP consistently outperforms all teacher models while demonstrating remarkable robustness and generalization across image domains and task settings. These results underscore that multi-teacher knowledge distillation is a scalable and effective paradigm for building high-performing biomedical foundation models under the practical constraints of real-world data availability.
Submitted 27 June, 2025;
originally announced June 2025.
-
Towards Scalable Schema Mapping using Large Language Models
Authors:
Christopher Buss,
Mahdis Safari,
Arash Termehchy,
Stefan Lee,
David Maier
Abstract:
The growing need to integrate information from a large number of diverse sources poses significant scalability challenges for data integration systems. These systems often rely on manually written schema mappings, which are complex, source-specific, and costly to maintain as sources evolve. While recent advances suggest that large language models (LLMs) can assist in automating schema matching by leveraging both structural and natural language cues, key challenges remain. In this paper, we identify three core issues with using LLMs for schema mapping: (1) inconsistent outputs due to sensitivity to input phrasing and structure, which we propose methods to address through sampling and aggregation techniques; (2) the need for more expressive mappings (e.g., GLaV), which strain the limited context windows of LLMs; and (3) the computational cost of repeated LLM calls, which we propose to mitigate through strategies like data type prefiltering.
Submitted 30 May, 2025;
originally announced May 2025.
-
PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems
Authors:
Nima Sedghiyeh,
Sara Sadeghi,
Reza Khodadadi,
Farzin Kashani,
Omid Aghdaei,
Somayeh Rahimi,
Mohammad Sadegh Safari
Abstract:
Although Automatic Speech Recognition (ASR) systems have become an integral part of modern technology, their evaluation remains challenging, particularly for low-resource languages such as Persian. This paper introduces the Persian Speech Recognition Benchmark (PSRB), a comprehensive benchmark designed to address this gap by incorporating diverse linguistic and acoustic conditions. We evaluate ten ASR systems, including state-of-the-art commercial and open-source models, to examine performance variations and inherent biases. Additionally, we conduct an in-depth analysis of Persian ASR transcriptions, identifying key error types and proposing a novel metric that weights substitution errors. This metric enhances evaluation robustness by reducing the impact of minor and partial errors, thereby improving the precision of performance assessment. Our findings indicate that while ASR models generally perform well on standard Persian, they struggle with regional accents, children's speech, and specific linguistic challenges. These results highlight the necessity of fine-tuning and incorporating diverse, representative training datasets to mitigate biases and enhance overall ASR performance. PSRB provides a valuable resource for advancing ASR research in Persian and serves as a framework for developing benchmarks in other low-resource languages. A subset of the PSRB dataset is publicly available at https://huggingface.co/datasets/PartAI/PSRB.
Submitted 27 May, 2025;
originally announced May 2025.
-
MRI motion correction via efficient residual-guided denoising diffusion probabilistic models
Authors:
Mojtaba Safari,
Shansong Wang,
Qiang Li,
Zach Eidex,
Richard L. J. Qiu,
Chih-Wei Chang,
Hui Mao,
Xiaofeng Yang
Abstract:
Purpose: Motion artifacts in magnetic resonance imaging (MRI) significantly degrade image quality and impair quantitative analysis. Conventional mitigation strategies, such as repeated acquisitions or motion tracking, are costly and workflow-intensive. This study introduces Res-MoCoDiff, an efficient denoising diffusion probabilistic model tailored for MRI motion artifact correction. Methods: Res-MoCoDiff incorporates a novel residual error shifting mechanism in the forward diffusion process, aligning the noise distribution with motion-corrupted data and enabling an efficient four-step reverse diffusion. A U-net backbone enhanced with Swin-Transformer blocks in place of conventional attention layers improves adaptability across resolutions. Training employs a combined l1+l2 loss, which promotes image sharpness and reduces pixel-level errors. Res-MoCoDiff was evaluated on a synthetic dataset generated using a realistic motion simulation framework and on an in-vivo dataset. Comparative analyses were conducted against established methods, including CycleGAN, Pix2pix, and MT-DDPM, using quantitative metrics such as peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and normalized mean squared error (NMSE). Results: The proposed method demonstrated superior performance in removing motion artifacts across all motion severity levels. Res-MoCoDiff consistently achieved the highest SSIM and the lowest NMSE values, with a PSNR of up to 41.91 ± 2.94 dB for minor distortions. Notably, the average sampling time was reduced to 0.37 seconds per batch of two image slices, compared with 101.74 seconds for conventional approaches.
Submitted 6 May, 2025;
originally announced May 2025.
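Several abstracts in this listing report PSNR and NMSE. As a reminder of what those numbers measure, here is a minimal sketch using the common textbook definitions, PSNR = 10·log10(MAX² / MSE) and NMSE = Σ(ref − est)² / Σ ref². The exact normalization used in each paper is not stated in the abstracts, so the `max_value` default and the NMSE denominator below are assumptions.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equal-length sequences of pixel values."""
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference, estimate, max_value=1.0):
    """Peak signal-to-noise ratio in dB; max_value is the assumed intensity peak."""
    err = mse(reference, estimate)
    if err == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / err)


def nmse(reference, estimate):
    """Squared error normalized by the energy of the reference image."""
    num = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    den = sum(r ** 2 for r in reference)
    return num / den


# Example on a flattened 2x2 "image" with intensities in [0, 1].
ref = [0.0, 0.5, 1.0, 0.5]
est = [0.0, 0.5, 0.9, 0.5]
print(round(psnr(ref, est), 2), round(nmse(ref, est) * 100, 3))
```

Higher PSNR and lower NMSE both indicate an estimate closer to the reference; NMSE is often reported as a percentage, as in the example's second printed value.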
-
MRI super-resolution reconstruction using efficient diffusion probabilistic model with residual shifting
Authors:
Mojtaba Safari,
Shansong Wang,
Zach Eidex,
Qiang Li,
Erik H. Middlebrooks,
David S. Yu,
Xiaofeng Yang
Abstract:
Objective: This study introduces a residual error-shifting mechanism that drastically reduces sampling steps while preserving critical anatomical details, thus accelerating MRI reconstruction. Approach: We propose a novel diffusion-based SR framework called Res-SRDiff, which integrates residual error shifting into the forward diffusion process. This enables efficient HR image reconstruction by aligning the degraded HR and LR distributions. We evaluated Res-SRDiff on ultra-high-field brain T1 MP2RAGE maps and T2-weighted prostate images, comparing it with Bicubic, Pix2pix, CycleGAN, and a conventional denoising diffusion probabilistic model with vision transformer backbone (TM-DDPM), using quantitative metrics such as peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), gradient magnitude similarity deviation (GMSD), and learned perceptual image patch similarity (LPIPS). Main results: Res-SRDiff significantly outperformed all comparative methods in terms of PSNR, SSIM, and GMSD across both datasets, with statistically significant improvements (p-values << 0.05). The model achieved high-fidelity image restoration with only four sampling steps, drastically reducing computational time to under one second per slice, which is substantially faster than conventional TM-DDPM with around 20 seconds per slice. Qualitative analyses further demonstrated that Res-SRDiff effectively preserved fine anatomical details and lesion morphology in both brain and pelvic MRI images. Significance: Our findings show that Res-SRDiff is an efficient and accurate MRI SR method, markedly improving computational efficiency and image quality. Integrating residual error shifting into the diffusion process allows for rapid and robust HR image reconstruction, enhancing clinical MRI workflows and advancing medical imaging research. The source code is available at: https://github.com/mosaf/Res-SRDiff
Submitted 26 April, 2025; v1 submitted 3 March, 2025;
originally announced March 2025.
-
Enhancing DNA Foundation Models to Address Masking Inefficiencies
Authors:
Monireh Safari,
Pablo Millan Arias,
Scott C. Lowe,
Lila Kari,
Angel X. Chang,
Graham W. Taylor
Abstract:
Masked language modelling (MLM) as a pretraining objective has been widely adopted in genomic sequence modelling. While pretrained models can successfully serve as encoders for various downstream tasks, the distribution shift between pretraining and inference detrimentally impacts performance, as the pretraining task is to map [MASK] tokens to predictions, yet the [MASK] is absent during downstream applications. This means the encoder does not prioritize its encodings of non-[MASK] tokens, and expends parameters and compute on work only relevant to the MLM task, despite this being irrelevant at deployment time. In this work, we propose a modified encoder-decoder architecture based on the masked autoencoder framework, designed to address this inefficiency within a BERT-based transformer. We empirically show that the resulting mismatch is particularly detrimental in genomic pipelines where models are often used for feature extraction without fine-tuning. We evaluate our approach on the BIOSCAN-5M dataset, comprising over 2 million unique DNA barcodes. We achieve substantial performance gains in both closed-world and open-world classification tasks when compared against causal models and bidirectional architectures pretrained with MLM tasks.
Submitted 25 February, 2025;
originally announced February 2025.
-
Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging
Authors:
Shansong Wang,
Mojtaba Safari,
Qiang Li,
Chih-Wei Chang,
Richard LJ Qiu,
Justin Roper,
David S. Yu,
Xiaofeng Yang
Abstract:
Vision foundation models (VFMs) are pre-trained on extensive image datasets to learn general representations for diverse types of data. These models can subsequently be fine-tuned for specific downstream tasks, significantly boosting performance across a broad range of applications. However, existing vision foundation models that claim to be applicable to various clinical tasks are mostly pre-trained on 3D computed tomography (CT), which benefits from the availability of extensive 3D CT databases. Significant differences between CT and magnetic resonance imaging (MRI) in imaging principles, signal characteristics, and data distribution may hinder their practical performance and versatility in MRI-specific applications. Here, we propose Triad, a vision foundation model for 3D MRI. Triad adopts a widely used autoencoder architecture to learn robust representations from 131,170 3D MRI volumes and uses organ-independent imaging descriptions to constrain the semantic distribution of the visual modality. The above pre-training dataset is called Triad-131K, which is currently the largest 3D MRI pre-training dataset. We evaluate Triad across three tasks, namely, organ/tumor segmentation, organ/cancer classification, and medical image registration, in two data-modality settings (within-domain and out-of-domain) using 25 downstream datasets. By initializing models with Triad's pre-trained weights, nnUNet-Triad improves segmentation performance by 2.51% compared to nnUNet-Scratch across 17 datasets. Swin-B-Triad achieves a 3.97% improvement over Swin-B-Scratch in classification tasks across five datasets. SwinUNETR-Triad improves by 4.00% compared to SwinUNETR-Scratch in registration tasks across two datasets. Our study demonstrates that pre-training can improve performance when the data modalities and organs of upstream and downstream tasks are consistent.
Submitted 22 February, 2025; v1 submitted 19 February, 2025;
originally announced February 2025.
-
A Physics-Informed Deep Learning Model for MRI Brain Motion Correction
Authors:
Mojtaba Safari,
Shansong Wang,
Zach Eidex,
Richard Qiu,
Chih-Wei Chang,
David S. Yu,
Xiaofeng Yang
Abstract:
Background: MRI is crucial for brain imaging but is highly susceptible to motion artifacts due to long acquisition times. This study introduces PI-MoCoNet, a physics-informed motion correction network that integrates spatial and k-space information to remove motion artifacts without explicit motion parameter estimation, enhancing image fidelity and diagnostic reliability. Materials and Methods: PI-MoCoNet consists of a motion detection network (U-net with spatial averaging) to identify corrupted k-space lines and a motion correction network (U-net with Swin Transformer blocks) to reconstruct motion-free images. The correction is guided by three loss functions: reconstruction (L1), perceptual (LPIPS), and data consistency (Ldc). Motion artifacts were simulated via rigid phase encoding perturbations and evaluated on IXI and MR-ART datasets against Pix2Pix, CycleGAN, and U-net using PSNR, SSIM, and NMSE. Results: PI-MoCoNet significantly improved image quality. On IXI, for minor artifacts, PSNR increased from 34.15 dB to 45.95 dB, SSIM from 0.87 to 1.00, and NMSE reduced from 0.55% to 0.04%. For moderate artifacts, PSNR improved from 30.23 dB to 42.16 dB, SSIM from 0.80 to 0.99, and NMSE from 1.32% to 0.09%. For heavy artifacts, PSNR rose from 27.99 dB to 36.01 dB, SSIM from 0.75 to 0.97, and NMSE decreased from 2.21% to 0.36%. On MR-ART, PI-MoCoNet achieved PSNR gains of ~10 dB and SSIM improvements of up to 0.20, with NMSE reductions of ~6%. Ablation studies confirmed the importance of data consistency and perceptual losses, yielding a 1 dB PSNR gain and 0.17% NMSE reduction. Conclusions: PI-MoCoNet effectively mitigates motion artifacts in brain MRI, outperforming existing methods. Its ability to integrate spatial and k-space information makes it a promising tool for clinical use in motion-prone settings. Code: https://github.com/mosaf/PI-MoCoNet.git.
Submitted 13 February, 2025;
originally announced February 2025.
-
Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics
Authors:
Indrashis Das,
Mahmoud Safari,
Steven Adriaensen,
Frank Hutter
Abstract:
Activation functions are fundamental elements of deep learning architectures as they significantly influence training dynamics. ReLU, while widely used, is prone to the dying neuron problem, which has been mitigated by variants such as LeakyReLU, PReLU, and ELU that better handle negative neuron outputs. Recently, self-gated activations like GELU and Swish have emerged as state-of-the-art alternatives, leveraging their smoothness to ensure stable gradient flow and prevent neuron inactivity. In this work, we introduce the Gompertz Linear Unit (GoLU), a novel self-gated activation function defined as $\mathrm{GoLU}(x) = x \, \mathrm{Gompertz}(x)$, where $\mathrm{Gompertz}(x) = e^{-e^{-x}}$. The GoLU activation leverages the right-skewed asymmetry in the Gompertz function to reduce variance in the latent space more effectively compared to GELU and Swish, while preserving robust gradient flow. Extensive experiments across diverse tasks, including Image Classification, Language Modeling, Semantic Segmentation, Object Detection, Instance Segmentation, and Diffusion, highlight GoLU's superior performance relative to state-of-the-art activation functions, establishing GoLU as a robust alternative to existing activation functions.
Submitted 21 May, 2025; v1 submitted 5 February, 2025;
originally announced February 2025.
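The GoLU definition quoted in the abstract above, GoLU(x) = x · Gompertz(x) with Gompertz(x) = e^{-e^{-x}}, can be written directly in a few lines. This is a scalar sketch in plain Python for illustration only; the overflow guard for very negative inputs is an added assumption and not something the abstract describes.

```python
import math


def gompertz(x: float) -> float:
    """Gompertz gate e^{-e^{-x}}, as defined in the abstract."""
    try:
        return math.exp(-math.exp(-x))
    except OverflowError:
        # exp(-x) overflows for very negative x; the gate tends to 0 there.
        return 0.0


def golu(x: float) -> float:
    """GoLU(x) = x * Gompertz(x), a self-gated activation."""
    return x * gompertz(x)


# Inspect the activation at a few points.
for value in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(value, golu(value))
```

For large positive inputs the gate approaches 1, so GoLU behaves like the identity; for large negative inputs the gate approaches 0, so the output is close to zero.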
-
FPGA Innovation Research in the Netherlands: Present Landscape and Future Outlook
Authors:
Nikolaos Alachiotis,
Sjoerd van den Belt,
Steven van der Vlugt,
Reinier van der Walle,
Mohsen Safari,
Bruno Endres Forlin,
Tiziano De Matteis,
Zaid Al-Ars,
Roel Jordans,
António J. Sousa de Almeida,
Federico Corradi,
Christiaan Baaij,
Ana-Lucia Varbanescu
Abstract:
FPGAs have transformed digital design by enabling versatile and customizable solutions that balance performance and power efficiency, making them essential for today's diverse computing challenges. Research in the Netherlands, both in academia and industry, plays a major role in developing new innovative FPGA solutions. This survey presents the current landscape of FPGA innovation research in the Netherlands by delving into ongoing projects, advancements, and breakthroughs in the field. Focusing on recent research outcomes (within the past 5 years), we have identified five key research areas: a) FPGA architecture, b) FPGA robustness, c) data center infrastructure and high-performance computing, d) programming models and tools, and e) applications. This survey provides in-depth insights beyond a mere snapshot of the current innovation research landscape by highlighting future research directions within each key area; these insights can serve as a foundational resource to inform potential national-level investments in FPGA technology.
Submitted 4 February, 2025;
originally announced February 2025.
-
End-to-End Target Speaker Speech Recognition Using Context-Aware Attention Mechanisms for Challenging Enrollment Scenario
Authors:
Mohsen Ghane,
Mohammad Sadegh Safari
Abstract:
This paper presents a novel streaming end-to-end target-speaker speech recognition system that addresses two critical limitations of current systems: the handling of noisy enrollment utterances and specific enrollment phrase requirements. We propose a robust Target-Speaker Recurrent Neural Network Transducer (TS-RNNT) with dual attention mechanisms for contextual biasing and overlapping enrollment processing. The model incorporates a text decoder and attention mechanism specifically designed to extract relevant speaker characteristics from noisy, overlapping enrollment audio. Experimental results on a synthesized dataset demonstrate the model's resilience, maintaining a Word Error Rate (WER) of 16.44% even with overlapping enrollment at 5 dB Signal-to-Interference Ratio (SIR), compared to conventional approaches that degrade to WERs above 75% under similar conditions. This significant performance improvement, coupled with the model's semi-text-dependent enrollment capabilities, represents a substantial advancement toward more practical and versatile voice-controlled devices.
Submitted 26 January, 2025;
originally announced January 2025.
-
Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing Integration
Authors:
Mojtaba Safari,
Zach Eidex,
Chih-Wei Chang,
Richard L. J. Qiu,
Xiaofeng Yang
Abstract:
Magnetic resonance imaging (MRI) is a non-invasive imaging modality that provides comprehensive anatomical and functional insights into the human body. However, its long acquisition times can lead to patient discomfort and motion artifacts, and limit real-time applications. To address these challenges, strategies such as parallel imaging have been applied, which utilize multiple receiver coils to speed up the data acquisition process. Additionally, compressed sensing (CS) is a method that facilitates image reconstruction from sparse data, significantly reducing image acquisition time by minimizing the amount of data collection needed. Recently, deep learning (DL) has emerged as a powerful tool for improving MRI reconstruction. It has been integrated with parallel imaging and CS principles to achieve faster and more accurate MRI reconstructions. This review comprehensively examines DL-based techniques for MRI reconstruction. We categorize and discuss various DL-based methods, including end-to-end approaches, unrolled optimization, and federated learning, highlighting their potential benefits. Our systematic review highlights significant contributions and underscores the potential of DL in MRI reconstruction. Additionally, we summarize key results and trends in DL-based MRI reconstruction, including quantitative metrics, the dataset, acceleration factors, and the progress of and research interest in DL techniques over time. Finally, we discuss potential future directions and the importance of DL-based MRI reconstruction in advancing medical imaging. To facilitate further research in this area, we provide a GitHub repository that includes up-to-date DL-based MRI reconstruction publications and public datasets: https://github.com/mosaf/Awesome-DL-based-CS-MRI.
Submitted 1 February, 2025; v1 submitted 23 January, 2025;
originally announced January 2025.
-
T1-contrast Enhanced MRI Generation from Multi-parametric MRI for Glioma Patients with Latent Tumor Conditioning
Authors:
Zach Eidex,
Mojtaba Safari,
Richard L. J. Qiu,
David S. Yu,
Hui-Kuo Shu,
Hui Mao,
Xiaofeng Yang
Abstract:
Objective: Gadolinium-based contrast agents (GBCAs) are commonly used in MRI scans of patients with gliomas to enhance brain tumor characterization using T1-weighted (T1W) MRI. However, there is growing concern about GBCA toxicity. This study develops a deep-learning framework to generate T1-postcontrast (T1C) images from pre-contrast multiparametric MRI. Approach: We propose the tumor-aware vision transformer (TA-ViT) model that predicts high-quality T1C images. The predicted tumor region is significantly improved (P < .001) by conditioning the transformer layers on predicted segmentation maps through an adaptive layer norm zero mechanism. The predicted segmentation maps were generated with the multi-parametric residual (MPR) ViT model and transformed into a latent space to produce compressed, feature-rich representations. The TA-ViT model predicted T1C MRI images of 501 glioma cases. Selected patients were split into training (N=400), validation (N=50), and test (N=51) sets. Main Results: Both qualitative and quantitative results demonstrate that the TA-ViT model outperforms the benchmark MPR-ViT model. Our method produces synthetic T1C MRI with high soft tissue contrast and more accurately reconstructs both the tumor and whole brain volumes. The synthesized T1C images achieved remarkable improvements in both tumor and healthy tissue regions compared to the MPR-ViT model. For healthy tissue and tumor regions, the results were as follows: NMSE: 8.53 +/- 4.61E-4; PSNR: 31.2 +/- 2.2; NCC: 0.908 +/- 0.041 and NMSE: 1.22 +/- 1.27E-4, PSNR: 41.3 +/- 4.7, and NCC: 0.879 +/- 0.042, respectively. Significance: The proposed method generates synthetic T1C images that closely resemble real T1C images. Future development and application of this approach may enable contrast-agent-free MRI for brain tumor patients, eliminating the risk of GBCA toxicity and simplifying the MRI scan protocol.
Submitted 3 September, 2024;
originally announced September 2024.
-
Efficient Search for Customized Activation Functions with Gradient Descent
Authors:
Lukas Strack,
Mahmoud Safari,
Frank Hutter
Abstract:
Different activation functions work best for different deep learning models. To exploit this, we leverage recent advancements in gradient-based search techniques for neural architectures to efficiently identify high-performing activation functions for a given application. We propose a fine-grained search cell that combines basic mathematical operations to model activation functions, allowing for the exploration of novel activations. Our approach enables the identification of specialized activations, leading to improved performance in every model we tried, from image classification to language models. Moreover, the identified activations exhibit strong transferability to larger models of the same type, as well as new datasets. Importantly, our automated process for creating customized activation functions is orders of magnitude more efficient than previous approaches. It can easily be applied on top of arbitrary deep learning pipelines and thus offers a promising practical avenue for enhancing deep learning architectures.
Submitted 13 August, 2024;
originally announced August 2024.
-
Deep Learning Based Apparent Diffusion Coefficient Map Generation from Multi-parametric MR Images for Patients with Diffuse Gliomas
Authors:
Zach Eidex,
Mojtaba Safari,
Jacob Wynne,
Richard L. J. Qiu,
Tonghe Wang,
David Viar Hernandez,
Hui-Kuo Shu,
Hui Mao,
Xiaofeng Yang
Abstract:
Purpose: Apparent diffusion coefficient (ADC) maps derived from diffusion-weighted MRI (DWI) provide functional measurements about the water molecules in tissues. However, DWI is time-consuming and very susceptible to image artifacts, leading to inaccurate ADC measurements. This study aims to develop a deep learning framework to synthesize ADC maps from multi-parametric MR images. Methods: We proposed the multiparametric residual vision transformer model (MPR-ViT) that leverages the long-range context of ViT layers along with the precision of convolutional operators. Residual blocks throughout the network significantly increase the representational power of the model. The MPR-ViT model was applied to T1w and T2-fluid-attenuated inversion recovery (T2-FLAIR) images of 501 glioma cases from a publicly available dataset including preprocessed ADC maps. Selected patients were divided into training (N=400), validation (N=50), and test (N=51) sets. Using the preprocessed ADC maps as ground truth, model performance was evaluated and compared against the Vision Convolutional Transformer (VCT) and residual vision transformer (ResViT) models. Results: The results are as follows using T1w + T2-FLAIR MRI as inputs: MPR-ViT - PSNR: 31.0 +/- 2.1, MSE: 0.009 +/- 0.0005, SSIM: 0.950 +/- 0.015. In addition, ablation studies showed the relative impact on performance of each input sequence. Both qualitative and quantitative results indicate that the proposed MPR-ViT model performs favorably against the ground truth data. Conclusion: We show that high-quality ADC maps can be synthesized from structural MRI using an MPR-ViT model. Our predicted images show better conformality to the ground truth volume than ResViT and VCT predictions. These high-quality synthetic ADC maps would be particularly useful for disease diagnosis and intervention, especially when ADC maps have artifacts or are unavailable.
Submitted 4 July, 2024; v1 submitted 2 July, 2024;
originally announced July 2024.
-
Self-Supervised Adversarial Diffusion Models for Fast MRI Reconstruction
Authors:
Mojtaba Safari,
Zach Eidex,
Shaoyan Pan,
Richard L. J. Qiu,
Xiaofeng Yang
Abstract:
Purpose: To propose a self-supervised deep learning-based compressed sensing MRI (DL-based CS-MRI) method named "Adaptive Self-Supervised Consistency Guided Diffusion Model (ASSCGD)" to accelerate data acquisition without requiring fully sampled datasets. Materials and Methods: We used the fastMRI multi-coil brain axial T2-weighted (T2-w) dataset from 1,376 cases and single-coil brain quantitative magnetization prepared 2 rapid acquisition gradient echoes (MP2RAGE) T1 maps from 318 cases to train and test our model. Robustness against domain shift was evaluated using two out-of-distribution (OOD) datasets: a multi-coil brain axial postcontrast T1-weighted (T1c) dataset from 50 cases and an axial T1-weighted (T1-w) dataset from 50 patients. Data were retrospectively subsampled at acceleration rates R in {2x, 4x, 8x}. ASSCGD partitions a random sampling pattern into two disjoint sets, ensuring data consistency during training. We compared our method with ReconFormer Transformer and SS-MRI, assessing performance using normalized mean squared error (NMSE), peak signal-to-noise ratio (PSNR), and structural similarity index (SSIM). Statistical tests included one-way analysis of variance (ANOVA) and multi-comparison Tukey's Honestly Significant Difference (HSD) tests. Results: ASSCGD preserved fine structures and brain abnormalities visually better than comparative methods at R = 8x for both multi-coil and single-coil datasets. It achieved the lowest NMSE at R in {4x, 8x}, and the highest PSNR and SSIM values at all acceleration rates for the multi-coil dataset. Similar trends were observed for the single-coil dataset, though SSIM values were comparable to ReconFormer at R in {2x, 8x}. These results were further confirmed by the voxel-wise correlation scatter plots. OOD results showed significant (p << 10^-5) improvements in undersampled image quality after reconstruction.
Submitted 20 November, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
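The abstract above states that ASSCGD partitions a random sampling pattern into two disjoint sets. The following is a minimal sketch of such a partition, assuming a 1D pattern of sampled k-space line indices and an even random split; the split ratio, the helper name `partition_mask`, and the toy sizes are assumptions, not details from the paper.

```python
import random

def partition_mask(sampled_indices, fraction=0.5, seed=0):
    """Randomly split sampled k-space line indices into two disjoint sets."""
    rng = random.Random(seed)
    shuffled = list(sampled_indices)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * fraction)
    return set(shuffled[:cut]), set(shuffled[cut:])

# Hypothetical setup: 32 phase-encoding lines, retrospectively subsampled at R = 4x.
num_lines = 32
acceleration = 4
pattern_rng = random.Random(42)
sampled = sorted(pattern_rng.sample(range(num_lines), num_lines // acceleration))

set_a, set_b = partition_mask(sampled)
print(sorted(set_a), sorted(set_b))
```

In self-supervised reconstruction schemes of this kind, one subset typically feeds the network while the other is held out to supervise the output, so no fully sampled reference is needed; whether ASSCGD uses the two sets exactly this way is not stated in the abstract.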
-
Surprisingly Strong Performance Prediction with Neural Graph Features
Authors:
Gabriela Kadlecová,
Jovita Lukasik,
Martin Pilát,
Petra Vidnerová,
Mahmoud Safari,
Roman Neruda,
Frank Hutter
Abstract:
Performance prediction has been a key part of the neural architecture search (NAS) process, speeding up NAS algorithms by avoiding resource-consuming network training. Although many performance predictors correlate well with ground truth performance, they require training data in the form of trained networks. Recently, zero-cost proxies have been proposed as an efficient method to estimate network performance without any training. However, they are still poorly understood, exhibit biases with network properties, and their performance is limited. Inspired by the drawbacks of zero-cost proxies, we propose neural graph features (GRAF), simple-to-compute properties of architectural graphs. GRAF offers fast and interpretable performance prediction while outperforming zero-cost proxies and other common encodings. In combination with other zero-cost proxies, GRAF outperforms most existing performance predictors at a fraction of the cost.
Submitted 13 August, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
Weight-Entanglement Meets Gradient-Based Neural Architecture Search
Authors:
Rhea Sanjay Sukthanker,
Arjun Krishnakumar,
Mahmoud Safari,
Frank Hutter
Abstract:
Weight sharing is a fundamental concept in neural architecture search (NAS), enabling gradient-based methods to explore cell-based architecture spaces significantly faster than traditional blackbox approaches. In parallel, weight entanglement has emerged as a technique for intricate parameter sharing among architectures within macro-level search spaces. Since weight-entanglement poses compatibility challenges for gradient-based NAS methods, these two paradigms have largely developed independently in parallel sub-communities. This paper aims to bridge the gap between these sub-communities by proposing a novel scheme to adapt gradient-based methods for weight-entangled spaces. This enables us to conduct an in-depth comparative assessment and analysis of the performance of gradient-based NAS in weight-entangled search spaces. Our findings reveal that this integration of weight-entanglement and gradient-based NAS brings forth the various benefits of gradient-based methods (enhanced performance, improved supernet training properties and superior any-time performance), while preserving the memory efficiency of weight-entangled spaces. The code for our work is openly accessible at https://anonymous.4open.science/r/TangleNAS-527C.
Submitted 16 December, 2023;
originally announced December 2023.
-
EU COST Action on future generation optical wireless communication technologies, 2nd White paper
Authors:
Z. Ghassemlooy,
M. A. Khalighi,
S. Zvanovec,
A. Shrestha,
B. Ortega,
M. Petkovic,
X. Pang,
C. Sirtori,
D. Orsucci,
A. Shrestha,
F. Moll,
G. Cossu,
V. Spirito,
M. P. Ninos,
E. Ciaramella,
J. Bas,
M. Amay,
S. Huang,
M. Safari,
T. Gutema,
W. Popoola,
Vicente Matus,
Jose Rabadan,
Rafael Perez-Jimenez,
E. Panayirci
, et al. (3 additional authors not shown)
Abstract:
NEWFOCUS is an EU COST Action targeted at exploring radical solutions that could influence the design of future wireless networks. The project aims to address some of the challenges associated with optical wireless communication (OWC) and to establish it as a complementary technology to the radio frequency (RF)-based wireless systems in order to meet the demanding requirements of the fifth generation (5G) and the future sixth generation (6G) backhaul and access networks. Only 6G will be able to widely serve the exponential growth in connected devices (i.e., more than 500 billion) in 2030, real-time holographic communication, future virtual reality, etc. Space is emerging as the new frontier in 5G, 6G, and beyond communication networks, where it offers high-speed wireless coverage to remote areas both on land and at sea. This activity is supported by the recent development of low-altitude Earth orbit satellite mega-constellations. The focus of this 2nd White Paper is on the use of OWC as an enabling technology for medium- and long-range links for deployment in (i) smart-cities and intelligent transportation systems; (ii) first- and last-mile access and backhaul/fronthaul wireless networks; (iii) hybrid free-space optics/RF adaptive wireless connections; (iv) space-to-ground, inter-satellite, ground-to-air, and air-to-air communications; and (v) underwater communications.
Submitted 14 June, 2023;
originally announced November 2023.
-
BarcodeBERT: Transformers for Biodiversity Analysis
Authors:
Pablo Millan Arias,
Niousha Sadjadi,
Monireh Safari,
ZeMing Gong,
Austin T. Wang,
Joakim Bruslund Haurum,
Iuliia Zarubiieva,
Dirk Steinke,
Lila Kari,
Angel X. Chang,
Scott C. Lowe,
Graham W. Taylor
Abstract:
In the global challenge of understanding and characterizing biodiversity, short species-specific genomic sequences known as DNA barcodes play a critical role, enabling fine-grained comparisons among organisms within the same kingdom of life. Although machine learning algorithms specifically designed for the analysis of DNA barcodes are becoming more popular, most existing methodologies rely on generic supervised training algorithms. We introduce BarcodeBERT, a family of models tailored to biodiversity analysis and trained exclusively on data from a reference library of 1.5M invertebrate DNA barcodes. We compared the performance of BarcodeBERT on taxonomic identification tasks against a spectrum of machine learning approaches including supervised training of classical neural architectures and fine-tuning of general DNA foundation models. Our self-supervised pretraining strategies on domain-specific data outperform fine-tuned foundation models, especially in identification tasks involving lower taxa such as genera and species. We also compared BarcodeBERT with BLAST, one of the most widely used bioinformatics tools for sequence searching, and found that our method matched BLAST's performance in species-level classification while being 55 times faster. Our analysis of masking and tokenization strategies also provides practical guidance for building customized DNA language models, emphasizing the importance of aligning model training strategies with dataset characteristics and domain knowledge. The code repository is available at https://github.com/bioscan-ml/BarcodeBERT.
Submitted 21 January, 2025; v1 submitted 4 November, 2023;
originally announced November 2023.
-
Single-Photon Counting Receivers for Optical Wireless Communications in Future 6G Networks
Authors:
Shenjie Huang,
Danial Chitnis,
Cheng Chen,
Harald Haas,
Mohammad-Ali Khalighi,
Robert K. Henderson,
Majid Safari
Abstract:
Optical wireless communication (OWC) offers several complementary advantages to radio-frequency wireless networks such as its massive available spectrum; hence, it is widely anticipated that OWC will assume a pivotal role in the forthcoming sixth generation wireless communication networks. Although significant progress has been achieved in OWC over the past decades, the outage induced by occasionally low received optical power continues to pose a key limiting factor for its deployment. In this work, we discuss the potential role of single-photon counting (SPC) receivers as a promising solution to overcome this limitation. We present an overview of the applications of SPC-based OWC systems in 6G networks, introduce their major performance-limiting factors, propose a performance enhancement framework to tackle these issues, and identify critical areas of open problems for future research.
Submitted 30 October, 2023; v1 submitted 16 May, 2023;
originally announced May 2023.
-
Neural Architecture Search: Insights from 1000 Papers
Authors:
Colin White,
Mahmoud Safari,
Rhea Sukthanker,
Binxin Ru,
Thomas Elsken,
Arber Zela,
Debadeepta Dey,
Frank Hutter
Abstract:
In the past decade, advances in deep learning have resulted in breakthroughs in a variety of areas, including computer vision, natural language understanding, speech recognition, and reinforcement learning. Specialized, high-performing neural architectures are crucial to the success of deep learning in these areas. Neural architecture search (NAS), the process of automating the design of neural architectures for a given task, is an inevitable next step in automating machine learning and has already outpaced the best human-designed architectures on many tasks. In the past few years, research in NAS has been progressing rapidly, with over 1000 papers released since 2020 (Deng and Lindauer, 2021). In this survey, we provide an organized and comprehensive guide to neural architecture search. We give a taxonomy of search spaces, algorithms, and speedup techniques, and we discuss resources such as benchmarks, best practices, other surveys, and open-source libraries.
Submitted 25 January, 2023; v1 submitted 20 January, 2023;
originally announced January 2023.
-
Design and Optimisation of High-Speed Receivers for 6G Optical Wireless Networks
Authors:
Elham Sarbazi,
Hossein Kazemi,
Michael Crisp,
Taisir El-Gorashi,
Jaafar Elmirghani,
Richard Penty,
Ian White,
Majid Safari,
Harald Haas
Abstract:
To achieve multi-Gb/s data rates in 6G optical wireless access networks based on narrow infrared (IR) laser beams, a high-speed receiver with two key specifications is needed: a sufficiently large aperture to collect the required optical power and a wide field of view (FOV) to avoid strict alignment issues. This paper puts forward the systematic design and optimisation of multi-tier non-imaging angle diversity receivers (ADRs) composed of compound parabolic concentrators (CPCs) coupled with photodiode (PD) arrays for laser-based optical wireless communication (OWC) links. Design tradeoffs include the gain-FOV tradeoff for each receiver element and the area-bandwidth tradeoff for each PD array. The rate maximisation is formulated as a non-convex optimisation problem under constraints on the minimum required FOV and the overall ADR dimensions to find the optimum configuration of the receiver bandwidth and FOV, and a low-complexity optimal solution is proposed. The ADR performance is studied using computer simulations and insightful design guidelines are provided through various numerical examples. An efficient technique is also proposed to reduce the ADR dimensions based on CPC length truncation. It is shown that a compact ADR with a height of $\leq0.5$ cm and an effective area of $\leq0.5$ cm$^2$ reaches a data rate of $12$ Gb/s with a half-angle FOV of $30^\circ$ over a $3$ m link distance.
Submitted 30 December, 2022;
originally announced December 2022.
-
SPAD-Based Optical Wireless Communication with ACO-OFDM
Authors:
Shenjie Huang,
Cheng Chen,
Mohammad Dehghani Soltani,
Robert Henderson,
Harald Haas,
Majid Safari
Abstract:
The sensitivity of optical wireless communication (OWC) systems can be effectively improved by employing highly sensitive single-photon avalanche diode (SPAD) arrays. However, the nonlinear distortion introduced by the dead time strongly limits the throughput of SPAD-based OWC systems. Optical orthogonal frequency division multiplexing (OFDM) can be employed in systems with SPAD arrays to improve the spectral efficiency. In this work, a theoretical performance analysis of a SPAD-based OWC system with asymmetrically-clipped optical OFDM (ACO-OFDM) is presented. The impact of the SPAD nonlinearity on the system performance is investigated. In addition, a comparison of the considered scheme with direct-current-biased optical OFDM (DCO-OFDM) is presented, showing the distinct reliable operation regimes of the two schemes. In the low power regimes, ACO-OFDM outperforms DCO-OFDM, whereas the latter is preferable in the high power regimes.
Submitted 25 October, 2022;
originally announced October 2022.
-
NAS-Bench-Suite-Zero: Accelerating Research on Zero Cost Proxies
Authors:
Arjun Krishnakumar,
Colin White,
Arber Zela,
Renbo Tu,
Mahmoud Safari,
Frank Hutter
Abstract:
Zero-cost proxies (ZC proxies) are a recent architecture performance prediction technique aiming to significantly speed up algorithms for neural architecture search (NAS). Recent work has shown that these techniques show great promise, but certain aspects, such as evaluating and exploiting their complementary strengths, are under-studied. In this work, we create NAS-Bench-Suite-Zero: we evaluate 13 ZC proxies across 28 tasks, creating by far the largest dataset (and unified codebase) for ZC proxies, enabling orders-of-magnitude faster experiments on ZC proxies, while avoiding confounding factors stemming from different implementations. To demonstrate the usefulness of NAS-Bench-Suite-Zero, we run a large-scale analysis of ZC proxies, including a bias analysis, and the first information-theoretic analysis which concludes that ZC proxies capture substantial complementary information. Motivated by these findings, we present a procedure to improve the performance of ZC proxies by reducing biases such as cell size, and we also show that incorporating all 13 ZC proxies into the surrogate models used by NAS algorithms can improve their predictive performance by up to 42%. Our code and datasets are available at https://github.com/automl/naslib/tree/zerocost.
Submitted 6 October, 2022;
originally announced October 2022.
-
Optimal Power Allocation for Integrated Visible Light Positioning and Communication System with a Single LED-Lamp
Authors:
Shuai Ma,
Ruixin Yang,
Bing Li,
Yongyan Chen,
Hang Li,
Youlong Wu,
Majid Safari,
Shiyin Li,
Naofal Al-Dhahir
Abstract:
In this paper, we investigate an integrated visible light positioning and communication (VLPC) system with a single LED-lamp. First, by leveraging the fact that the VLC channel model is a function of the receiver's location, we propose a system model that estimates the channel state information (CSI) based on the positioning information without transmitting pilot sequences. Second, we derive the Cramer-Rao lower bound (CRLB) on the positioning error variance and a lower bound on the achievable rate with on-off keying modulation. Third, based on the derived performance metrics, we optimize the power allocation to minimize the CRLB, while satisfying the rate outage probability constraint. To tackle this non-convex optimization problem, we apply the worst-case distribution of the Conditional Value-at-Risk (CVaR) and the block coordinate descent (BCD) methods to obtain feasible solutions. Finally, the effects of critical system parameters, such as outage probability, rate threshold, and total power threshold, are revealed by numerical results.
Submitted 30 August, 2022;
originally announced August 2022.
-
Terabit Indoor Laser-Based Wireless Communications: LiFi 2.0 for 6G
Authors:
Mohammad Dehghani Soltani,
Hossein Kazemi,
Elham Sarbazi,
Ahmad Adnan Qidan,
Barzan Yosuf,
Sanaa Mohamed,
Ravinder Singh,
Bela Berde,
Dominique Chiaroni,
Bastien Béchadergue,
Fathi Abdeldayem,
Hardik Soni,
Jose Tabu,
Micheline Perrufel,
Nikola Serafimovski,
Taisir E. H. El-Gorashi,
Jaafar Elmirghani,
Richard Penty,
Ian H. White,
Harald Haas,
Majid Safari
Abstract:
This paper provides a summary of available technologies required for implementing indoor laser-based wireless networks capable of achieving aggregate data-rates of terabits per second, widely accepted as a sixth generation (6G) key performance indicator. The main focus of this paper is on the technologies supporting the near infrared region of the optical spectrum. The main challenges in the design of the transmitter and receiver systems and communication/networking schemes are identified and new insights are provided. This paper also covers the previous and recent standards as well as industrial applications for optical wireless communications (OWC) and LiFi.
Submitted 21 June, 2022;
originally announced June 2022.
-
Performance Analysis of SPAD-Based Optical Wireless Communication with OFDM
Authors:
Shenjie Huang,
Yichen Li,
Cheng Chen,
Mohammad Dehghani Soltani,
Robert Henderson,
Majid Safari,
Harald Haas
Abstract:
In recent years, there has been a growing interest in the use of single-photon avalanche diodes (SPADs) in optical wireless communication (OWC). A SPAD operates in the Geiger mode and can act as a photon counting receiver, obviating the need for a transimpedance amplifier (TIA). Although a SPAD receiver can provide higher sensitivity compared to traditional linear photodetectors, it suffers from dead-time-induced nonlinearity. To improve the data rates of SPAD-based OWC systems, optical orthogonal frequency division multiplexing (OFDM) can be employed. This paper provides a comprehensive theoretical analysis of SPAD-based OWC systems using OFDM signalling, considering the effects of signal clipping, SPAD nonlinearity, and signal-dependent shot noise. An equivalent additive Gaussian noise channel model is proposed to describe the performance of the SPAD-based OFDM system. The statistics of the proposed channel model and the analytical expressions of the signal-to-noise ratio (SNR) and bit error rate (BER) are derived in closed form. By means of extensive numerical results, the impact of the unique receiver nonlinearity on the system performance is investigated. The results demonstrate new insights into different optical power regimes of reliable operation for SPAD-based OFDM systems even well beyond the SPAD saturation level.
Submitted 4 June, 2022;
originally announced June 2022.
-
High-Speed Imaging Receiver Design for 6G Optical Wireless Communications: A Rate-FOV Trade-Off
Authors:
Mohammad Dehghani Soltani,
Hossein Kazemi,
Elham Sarbazi,
Taisir E. H. El-Gorashi,
Jaafar M. H. Elmirghani,
Richard V. Penty,
Ian H. White,
Harald Haas,
Majid Safari
Abstract:
The design of a compact high-speed and wide field of view (FOV) receiver is challenging due to the presence of two well-known trade-offs. The first one is the area-bandwidth trade-off of photodetectors (PDs) and the second one is the gain-FOV trade-off due to the use of optics. The combined effects of these two trade-offs imply that the achievable data rate of an imaging optical receiver is limited by its FOV, i.e., a rate-FOV trade-off. To control the area-bandwidth trade-off, an array of small PDs can be used instead of a single PD. Moreover, in practice, a large-area lens is required to ensure sufficient power collection, which in turn limits the receiver FOV (i.e., gain-FOV trade-off). We propose an imaging receiver design in the form of an array of arrays. To achieve a reasonable receiver FOV, we use an individual focusing lens for each PD array rather than a single collection lens for the whole receiver. The proposed array of arrays structure provides an effective method to control both the gain-FOV trade-off (via an array of lenses) and the area-bandwidth trade-off (via arrays of PDs). We first derive a tractable analytical model for the SNR of an array of PDs where maximum ratio combining has been employed. Then, we extend the model to the proposed array of arrays structure, and the accuracy of the analytical model is verified based on several Optic Studio-based simulations. Next, we formulate an optimization problem to maximize the achievable data rate of the imaging receiver subject to a minimum required FOV. The optimization problem is solved for two commonly used modulation techniques, namely, OOK and direct current biased optical orthogonal frequency division multiplexing with variable rate quadrature amplitude modulation. It is demonstrated that a data rate of ~24 Gbps with a FOV of 15° is achievable using OOK with a total receiver size of 2 cm by 2 cm.
Submitted 11 May, 2022;
originally announced May 2022.
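The abstract above relies on maximum ratio combining (MRC) across an array of PDs. The following is a minimal textbook sketch of MRC, not the paper's analytical model: it assumes unit signal power, independent noise per branch, and hypothetical channel gains and noise variances.

```python
def mrc_snr(gains, noise_vars):
    """Output SNR of maximum ratio combining with weights w_i = h_i / sigma_i^2."""
    weights = [h / s for h, s in zip(gains, noise_vars)]
    signal = sum(w * h for w, h in zip(weights, gains)) ** 2
    noise = sum(w ** 2 * s for w, s in zip(weights, noise_vars))
    return signal / noise

def equal_gain_snr(gains, noise_vars):
    """Output SNR when all branches are summed with equal weights."""
    signal = sum(gains) ** 2
    noise = sum(noise_vars)
    return signal / noise

# Hypothetical per-PD channel gains and noise variances (illustrative only).
gains = [1.0, 0.5, 0.2]
noise_vars = [0.1, 0.1, 0.2]
branch_snrs = [h ** 2 / s for h, s in zip(gains, noise_vars)]

print(mrc_snr(gains, noise_vars), sum(branch_snrs), equal_gain_snr(gains, noise_vars))
```

With these weights the combined SNR equals the sum of the per-branch SNRs, which is the property that makes an array of small PDs competitive with a single large PD in collected signal quality.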
-
NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Authors:
Yash Mehta,
Colin White,
Arber Zela,
Arjun Krishnakumar,
Guri Zabergja,
Shakiba Moradian,
Mahmoud Safari,
Kaicheng Yu,
Frank Hutter
Abstract:
The release of tabular benchmarks, such as NAS-Bench-101 and NAS-Bench-201, has significantly lowered the computational overhead for conducting scientific research in neural architecture search (NAS). Although they have been widely adopted and used to tune real-world NAS algorithms, these benchmarks are limited to small search spaces and focus solely on image classification. Recently, several new NAS benchmarks have been introduced that cover significantly larger search spaces over a wide range of tasks, including object detection, speech recognition, and natural language processing. However, substantial differences among these NAS benchmarks have so far prevented their widespread adoption, limiting researchers to using just a few benchmarks. In this work, we present an in-depth analysis of popular NAS algorithms and performance prediction methods across 25 different combinations of search spaces and datasets, finding that many conclusions drawn from a few NAS benchmarks do not generalize to other benchmarks. To help remedy this problem, we introduce NAS-Bench-Suite, a comprehensive and extensible collection of NAS benchmarks, accessible through a unified interface, created with the aim to facilitate reproducible, generalizable, and rapid NAS research. Our code is available at https://github.com/automl/naslib.
Submitted 11 February, 2022; v1 submitted 31 January, 2022;
originally announced January 2022.
-
5 Gbps Optical Wireless Communication Using Commercial SPAD Array Receivers
Authors:
Shenjie Huang,
Cheng Chen,
Rui Bian,
Harald Haas,
Majid Safari
Abstract:
Photon counting detectors such as single-photon avalanche diode (SPAD) arrays can be utilized to improve the sensitivity of optical wireless communication (OWC) systems. However, the achievable data rate of SPAD-based OWC systems is strongly limited by the nonlinearity induced by SPAD dead time. In this work, the performance of SPAD-based OWC system with orthogonal frequency division multiplexing (OFDM) is investigated and compared with that of on-off keying (OOK). We employ nonlinear equalization, peak-to-average power ratio optimization by adjusting the OFDM clipping level, and adaptive bit and energy loading to achieve a record experimental data rate of 5 Gbps. The contrasting optimal regimes of operation of the two modulation schemes are also demonstrated.
Submitted 4 April, 2022; v1 submitted 13 November, 2021;
originally announced November 2021.
-
A VCSEL Array Transmission System with Novel Beam Activation Mechanisms
Authors:
Zhihong Zeng,
Mohammad Dehghani Soltani,
Majid Safari,
Harald Haas
Abstract:
Optical wireless communication (OWC) is considered to be a promising technology which will alleviate traffic burden caused by the increasing number of mobile devices. In this study, a novel vertical-cavity surface-emitting laser (VCSEL) array is proposed for indoor OWC systems. To activate the best beam for a mobile user, two beam activation methods are proposed for the system. The method based on a corner-cube retroreflector (CCR) provides very low latency and allows real-time activation for high-speed users. The other method uses the omnidirectional transmitter (ODTx). The ODTx can serve the purpose of uplink transmission and beam activation simultaneously. Moreover, systems with ODTx are very robust to the random orientation of a user equipment (UE). System level analyses are carried out for the proposed VCSEL array system. For a single user scenario, the probability density function (PDF) of the signal-to-noise ratio (SNR) for the central beam of the VCSEL array system can be approximated as a uniform distribution. In addition, the average data rate of the central beam and its upper bound are given analytically and verified by Monte-Carlo simulations. For a multi-user scenario, an analytical upper bound for the average data rate is given. The effects of the cell size and the full width at half maximum (FWHM) angle on the system performance are studied. The results show that the system with a FWHM angle of $4^\circ$ outperforms the others.
Submitted 13 August, 2021;
originally announced August 2021.
-
Interference Mitigation using Optimized Angle Diversity Receiver in LiFi Cellular Network
Authors:
Zhihong Zeng,
Chen Chen,
Svetislav Savovi,
Mohammad Dehghani Soltani,
Cheng Chen,
Majid Safari,
Harald Haas
Abstract:
Light-fidelity (LiFi) is an emerging technology for high-speed short-range mobile communications. Inter-cell interference (ICI) is an important issue that limits the system performance in an optical attocell network. Angle diversity receivers (ADRs) have been proposed to mitigate ICI. In this paper, the structure of pyramid receivers (PRs) and truncated pyramid receivers (TPRs) are studied. The coverage problems of PRs and TPRs are defined and investigated, and the lower bound of field of view (FOV) for each PD is given analytically. The impact of random device orientation and diffuse link signal propagation are taken into consideration. The performances of PRs and TPRs are compared and then optimized ADR structures are proposed. The performance comparison between the select best combining (SBC) and maximum ratio combining (MRC) is given under different noise levels. It is shown that SBC will outperform MRC in an interference limited system, otherwise, MRC is a preferred scheme. In addition, the double source system, where each LiFi AP consists of two sources transmitting the same information signals but with opposite polarity, is proved to outperform the single source (SS) system under certain conditions.
Submitted 12 June, 2024; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Progressive Transmission using Recurrent Neural Networks
Authors:
Mohammad Sadegh Safari,
Vahid Pourahmadi,
Patrick Mitran,
Hamid Sheikhzadeh
Abstract:
In this paper, we investigate a new machine learning-based transmission strategy called progressive transmission or ProgTr. In ProgTr, there are b variables that should be transmitted using at most T channel uses. The transmitter aims to send the data to the receiver as fast as possible and with as few channel uses as possible (as channel conditions permit) while the receiver refines its estimate after each channel use. We use recurrent neural networks as the building block of both the transmitter and receiver where the SNR is provided as an input that represents the channel conditions. To show how ProgTr works, the proposed scheme was simulated in different scenarios including single/multi-user settings, different channel conditions, and for both discrete and continuous input data. The results show that ProgTr can achieve better performance compared to conventional modulation methods. In addition to performance metrics such as BER, bit-wise mutual information is used to provide some interpretation to how the transmitter and receiver operate in ProgTr.
Submitted 3 August, 2021;
originally announced August 2021.
-
Time-Gated Photon Counting Receivers for Optical Wireless Communication
Authors:
Shenjie Huang,
Majid Safari
Abstract:
Photon counting detectors such as single-photon avalanche diode (SPAD) arrays are commonly considered for reliable optical wireless communication at power limited regimes. However, SPAD-based receivers suffer from significant dead time induced intersymbol interference (ISI) especially when the incident photon rate is relatively high and the dead time is comparable or even larger than the symbol duration, i.e., sub-dead-time regime. In this work, we propose a novel time-gated SPAD receiver to mitigate such ISI effects and improve the communication performance. When operated in the gated mode, the SPAD can be activated and deactivated in well-defined time intervals. We investigate the statistics of the detected photon count for the proposed time-gated SPAD receiver. It is demonstrated that the gate-ON time interval can be optimized to achieve the best bit error rate (BER) performance. Our extensive performance analysis illustrates the superiority of the time-gated SPAD receiver over the traditional free-running receiver in terms of the BER performance and the tolerance to background light.
Submitted 21 May, 2021;
originally announced May 2021.
-
A Tb/s Indoor MIMO Optical Wireless Backhaul System Using VCSEL Arrays
Authors:
Hossein Kazemi,
Elham Sarbazi,
Mohammad Dehghani Soltani,
Taisir E. H. El-Gorashi,
Jaafar M. H. Elmirghani,
Richard V. Penty,
Ian H. White,
Majid Safari,
Harald Haas
Abstract:
In this paper, the design of a multiple-input multiple-output (MIMO) optical wireless communication (OWC) link based on vertical cavity surface emitting laser (VCSEL) arrays is systematically carried out with the aim to support data rates in excess of 1 Tb/s for the backhaul of sixth generation (6G) indoor wireless networks. The proposed design combines direct current optical orthogonal frequency division multiplexing (DCO-OFDM) and a spatial multiplexing MIMO architecture. For such an ultra-high-speed line-of-sight (LOS) OWC link with low divergence laser beams, maintaining alignment is of high importance. In this paper, two types of misalignment error between the transmitter and receiver are distinguished, namely, radial displacement error and orientation angle error, and they are thoroughly modeled in a unified analytical framework assuming Gaussian laser beams, resulting in a generalized misalignment model (GMM). The derived GMM is then extended to MIMO arrays and the performance of the MIMO-OFDM OWC system is analyzed in terms of the aggregate data rate. Novel insights are provided into the system performance based on computer simulations by studying various influential factors such as beam waist, array configuration and different misalignment errors, which can be used as guidelines for designing short range Tb/s MIMO OWC systems.
Submitted 4 April, 2022; v1 submitted 19 February, 2021;
originally announced February 2021.
-
Safety Analysis for Laser-based Optical Wireless Communications: A Tutorial
Authors:
Mohammad Dehghani Soltani,
Elham Sarbazi,
Nikolaos Bamiedakis,
Priyanka de Souza,
Hossein Kazemi,
Jaafar M. H. Elmirghani,
Ian H. White,
Richard V. Penty,
Harald Haas,
Majid Safari
Abstract:
Light amplification by stimulated emission of radiation (laser) sources have many advantages for use in high data rate optical wireless communications. In particular, the low cost and high-bandwidth properties of laser sources such as vertical-cavity surface-emitting lasers (VCSELs) make them attractive for future indoor optical wireless communications. In order to be integrated into future indoor networks, such lasers should conform to eye safety regulations determined by the international electrotechnical commission (IEC) standards for laser safety. In this paper, we provide a detailed study of beam propagation to evaluate the received power of various laser sources, based on which as well as the maximum permissible exposure (MPE) defined by the IEC 60825-1:2014 standard, we establish a comprehensive framework for eye safety analyses. This framework allows us to calculate the maximum allowable transmit power, which is crucial in the design of a reliable and safe laser-based wireless communication system. Initially, we consider a single-mode Gaussian beam and calculate the maximum permissible transmit power. Subsequently, we generalize this approach for higher-mode beams. It is shown that the M-squared-based approach for analysis of multimode lasers ensures the IEC eye safety limits, however, in some scenarios, it can be too conservative compared to the precise beam decomposition method. Laser safety analyses with consideration of optical elements such as lens and diffuser, as well as for VCSEL array have been also presented. Skin safety, as another significant factor of laser safety, has also been investigated in this paper. We have studied the impacts of various parameters such as wavelength, exposure duration and the divergence angle of laser sources on the safety analysis by presenting insightful results.
Submitted 5 May, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
SPAD-Based Optical Wireless Communication with Signal Pre-Distortion and Noise Normalization
Authors:
Shenjie Huang,
Majid Safari
Abstract:
In recent years, there has been a growing interest in exploring the application of single-photon avalanche diode (SPAD) in optical wireless communication (OWC). As a photon counting detector, SPAD can provide much higher sensitivity compared to the other commonly used photodetectors. However, SPAD-based receivers suffer from significant dead-time-induced non-linear distortion and signal dependent noise. In this work, we propose a novel SPAD-based OWC system in which the non-linear distortion caused by dead time can be successfully eliminated by the pre-distortion of the signal at the transmitter. In addition, another system with joint pre-distortion and noise normalization functionality is proposed. Thanks to the additional noise normalization process, for the transformed signal at the receiver, the originally signal dependent noise becomes signal independent so that the conventional signal detection techniques designed for AWGN channels can be employed to decode the signal. Our numerical results demonstrate the superiority of the proposed SPAD-based systems compared to the existing systems in terms of BER performance and achievable data rate.
Submitted 10 February, 2022; v1 submitted 22 January, 2021;
originally announced January 2021.
-
SuperCoder: Program Learning Under Noisy Conditions From Superposition of States
Authors:
Ali Davody,
Mahmoud Safari,
Răzvan V. Florian
Abstract:
We propose a new method of program learning in a Domain Specific Language (DSL) which is based on gradient descent with no direct search. The first component of our method is a probabilistic representation of the DSL variables. At each timestep in the program sequence, different DSL functions are applied on the DSL variables with a certain probability, leading to different possible outcomes. Rather than handling all these outputs separately, whose number grows exponentially with each timestep, we collect them into a superposition of variables which captures the information in a single, but fuzzy, state. This state is to be contrasted at the final timestep with the ground-truth output, through a loss function. The second component of our method is an attention-based recurrent neural network, which provides an appropriate initialization point for the gradient descent that optimizes the probabilistic representation. The method we have developed surpasses the state-of-the-art for synthesising long programs and is able to learn programs under noise.
Submitted 7 December, 2020;
originally announced December 2020.
-
Hybrid LiFi and WiFi Networks: A Survey
Authors:
Xiping Wu,
Mohammad Dehghani Soltani,
Lai Zhou,
Majid Safari,
Harald Haas
Abstract:
To tackle the rapidly growing number of mobile devices and their expanding demands for Internet services, network convergence is envisaged to integrate different technology domains. A recently proposed and promising approach to indoor wireless communications is integrating light fidelity (LiFi) and wireless fidelity (WiFi), namely a hybrid LiFi and WiFi network (HLWNet). This type of network combines the high-speed data transmission of LiFi and the ubiquitous coverage of WiFi. In this paper, we present a survey-style introduction to HLWNets, starting with a framework including the network structure, cell deployment, multiple access schemes, modulation techniques, illumination requirements and backhauling. Then, key performance metrics and recent achievements are reviewed. Further, the unique challenges faced by HLWNets are elaborated in many research directions, including user behavior modeling, interference management, handover and load balancing. Finally, we discuss the potential of HLWNets in application areas such as indoor positioning and physical layer security.
Submitted 14 January, 2020;
originally announced January 2020.
-
Off-the-grid Recovery of Time and Frequency Shifts with Multiple Measurement Vectors
Authors:
Maral Safari,
Sajad Daei,
Farzan Haddadi
Abstract:
We address the problem of estimating time and frequency shifts of a known waveform in the presence of multiple measurement vectors (MMVs). This problem naturally arises in radar imaging and wireless communications. Specifically, a signal ensemble is observed, where each signal of the ensemble is formed by a superposition of a small number of scaled, time-delayed, and frequency-shifted versions of a known waveform sharing the same continuous-valued time and frequency components. The goal is to recover the continuous-valued time-frequency pairs from a small number of observations. In this work, we propose a semidefinite program that exactly recovers $s$ pairs of time-frequency shifts from $L$ regularly spaced samples per measurement vector under a minimum separation condition between the time-frequency shifts. Moreover, we prove that the number $s$ of time-frequency shifts scales linearly with the number $L$ of samples up to a log-factor. Extensive numerical results are also provided to validate the effectiveness of the proposed method over the single measurement vector (SMV) problem. In particular, we find that our approach leads to a relaxed minimum separation condition and a reduced number of required samples.
Submitted 26 February, 2021; v1 submitted 30 October, 2019;
originally announced October 2019.
-
Multi-Hop Wireless Optical Backhauling for LiFi Attocell Networks: Bandwidth Scheduling and Power Control
Authors:
Hossein Kazemi,
Majid Safari,
Harald Haas
Abstract:
The backhaul of hundreds of light fidelity (LiFi) base stations (BSs) constitutes a major challenge. Indoor wireless optical backhauling is a novel approach whereby the interconnections between adjacent LiFi BSs are provided by way of directed line-of-sight (LOS) wireless infrared (IR) links. Building on the aforesaid approach, this paper presents the top-down design of a multi-hop wireless backhaul configuration for multi-tier optical attocell networks by proposing the novel idea of super cells. Such cells incorporate multiple clusters of attocells that are connected to the core network via a single gateway based on multi-hop decode-and-forward (DF) relaying. Consequently, new challenges arise for managing the bandwidth and power resources of the bottleneck backhaul. By putting forward user-based bandwidth scheduling (UBS) and cell-based bandwidth scheduling (CBS) policies, the system-level modeling and analysis of the end-to-end multi-user sum rate is elaborated. In addition, optimal bandwidth scheduling under both UBS and CBS policies are formulated as constrained convex optimization problems, which are solved by using the projected subgradient method. Furthermore, the transmission power of the backhaul system is opportunistically reduced by way of an innovative fixed power control (FPC) strategy. The notion of backhaul bottleneck occurrence (BBO) is introduced. An accurate approximate expression of the probability of BBO is derived, and then verified using Monte Carlo simulations. Several insights are provided into the offered gains of the proposed schemes through extensive computer simulations, by studying different aspects of the performance of super cells including the average sum rate, the BBO probability and the backhaul power efficiency (PE).
Submitted 12 July, 2019;
originally announced July 2019.
-
Performance Analysis of SPAD-based OFDM
Authors:
Yichen Li,
Majid Safari,
Robert Henderson,
Harald Haas
Abstract:
In this paper, an analytical approach for the nonlinear distorted bit error rate performance of optical orthogonal frequency division multiplexing (O-OFDM) with single photon avalanche diode (SPAD) receivers is presented. Major distortion effects of passive quenching (PQ) and active quenching (AQ) SPAD receivers are analysed in this study. The performance analyses of DC-biased O-OFDM and asymmetrically clipped O-OFDM with PQ and AQ SPADs are derived. The comparison results show the maximum optical irradiance caused by the nonlinear distortion, which limits the transmission power and bit rate. The theoretical maximum bit rate of SPAD-based OFDM is found to be up to 1 Gbit/s. This approach supplies a closed-form analytical solution for designing an optimal SPAD-based system.
Submitted 15 May, 2019;
originally announced May 2019.
-
Deep UL2DL: Channel Knowledge Transfer from Uplink to Downlink
Authors:
Mohammad Sadegh Safari,
Vahid Pourahmadi,
Shabnam Sodagari
Abstract:
Knowledge of the channel state information (CSI) at the transmitter side is one of the primary sources of information that can be used for the efficient allocation of wireless resources. Obtaining downlink (DL) CSI in Frequency Division Duplexing (FDD) systems from uplink (UL) CSI is not as straightforward as in TDD systems. Therefore, users usually feed the DL-CSI back to the transmitter. To remove the need for feedback (and thus having less signaling overhead), we propose to use two recent deep neural network structures, i.e., convolutional neural networks and generative adversarial networks (GANs) to infer the DL-CSI by observing the UL-CSI. The core idea of our data-driven scheme is exploiting the fact that both DL and UL channels share the same propagation environment. As such, we extracted the environment information from the UL channel response to a latent domain and then transferred the derived environment information from the latent domain to predict the DL channel. To overcome incorrect latent domain and the problem of oversimplistic assumptions, in this work, we did not use any specific parametric model and instead used data-driven approaches to discover the underlying structure of data without any prior model assumptions. To overcome the challenge of capturing the UL-DL joint distribution, we used a mean square error-based variant of the GAN structure with improved convergence properties called boundary equilibrium GAN (BEGAN). For training and testing we used simulated data of Extended Vehicular-A (EVA) and Extended Typical Urban (ETU) models. Simulation results verified that our methods can accurately infer and predict the downlink CSI from the uplink CSI for different multipath environments in FDD communications.
Submitted 30 November, 2019; v1 submitted 15 December, 2018;
originally announced December 2018.
-
Impact of Device Orientation on Error Performance of LiFi Systems
Authors:
Mohammad Dehghani Soltani,
Ardimas Andi Purwita,
Iman Tavakkolnia,
Harald Haas,
Majid Safari
Abstract:
Most studies on optical wireless communications (OWCs) have neglected the effect of random orientation in their performance analysis due to the lack of a proper model for the random orientation. Our recent empirical-based research illustrates that the random orientation follows a Laplace distribution for a static user equipment (UE). In this paper, we analyze the device orientation and assess its importance on system performance. The reliability of an OWC channel highly depends on the availability and alignment of line-of-sight (LOS) links. In this study, the effect of receiver orientation including both polar and azimuth angles on the LOS channel gain are analyzed. The probability of establishing a LOS link is investigated and the probability density function (PDF) of signal-to-noise ratio (SNR) for a randomly-oriented device is derived. By means of the PDF of SNR, the bit-error ratio (BER) of DC-biased optical orthogonal frequency division multiplexing (DCO-OFDM) in additive white Gaussian noise (AWGN) channels is evaluated. A closed-form approximation for the BER of UE with random orientation is presented which shows a good match with Monte-Carlo simulation results. Furthermore, the impact of the UE's random motion on the BER performance has been assessed. Finally, the effect of random orientation on the average signal-to-interference-plus-noise ratio (SINR) in a multiple access points (APs) scenario is investigated.
Submitted 25 February, 2019; v1 submitted 30 August, 2018;
originally announced August 2018.
-
Game-Theoretic Spectrum Trading in RF Relay-Assisted Free-Space Optical Communications
Authors:
Shenjie Huang,
Vahid Shah-Mansouri,
Majid Safari
Abstract:
This work proposes a novel hybrid RF/FSO system based on a game-theoretic spectrum trading process. It is assumed that no RF spectrum is preallocated to the FSO link, and only when the link availability is severely impaired by infrequent adverse weather conditions (e.g., fog) can the source borrow a portion of licensed RF spectrum from one of the surrounding RF nodes. Using the leased spectrum, the source establishes a dual-hop RF/FSO hybrid link to maintain its throughput to the destination. The proposed system is considered to be both spectrum- and power-efficient. A market-equilibrium-based pricing process is proposed for the spectrum trading between the source and RF nodes. Through extensive performance analysis, it is demonstrated that the proposed scheme can significantly improve the average capacity of the system, especially when the surrounding RF nodes have low traffic loads. In addition, the system benefits from involving more RF nodes in the spectrum trading process by means of diversity, particularly when the surrounding RF nodes have a high probability of being under heavy traffic loads. Furthermore, the application of the proposed system in a realistic scenario is presented based on the weather statistics in the city of Edinburgh, UK. It is demonstrated that the proposed system can substantially enhance the link availability towards the carrier-class requirement.
Submitted 27 June, 2018;
originally announced June 2018.
-
Modeling the Random Orientation of Mobile Devices: Measurement, Analysis and LiFi Use Case
Authors:
Mohammad Dehghani Soltani,
Ardimas Andi Purwita,
Zhihong Zeng,
Harald Haas,
Majid Safari
Abstract:
Light-fidelity (LiFi) is a networked optical wireless communication (OWC) solution for high-speed indoor connectivity for fixed and mobile optical communications. Unlike conventional radio frequency wireless systems, the OWC channel is not isotropic, meaning that the device orientation affects the channel gain significantly, particularly for mobile users. However, due to the lack of a proper model for device orientation, many studies have assumed that the receiver is vertically upward and fixed. In this paper, a novel model for device orientation based on experimental measurements of forty participants has been proposed. It is shown that the probability density function (PDF) of the polar angle can be modeled either based on a Laplace (for static users) or a Gaussian (for mobile users) distribution. In addition, a closed-form expression is obtained for the PDF of the cosine of the incidence angle based on which line-of-sight (LOS) channel gain is described in OWC channels. An approximation of this PDF based on the truncated Laplace is proposed and the accuracy of this approximation is confirmed by the Kolmogorov-Smirnov distance (KSD). Moreover, the statistics of the LOS channel gain are calculated and the random orientation of a user equipment (UE) is modeled as a random process. The influence of the random orientation on signal-to-noise-ratio (SNR) performance of OWC systems has been evaluated. Finally, an orientation-based random waypoint (ORWP) mobility model is proposed by considering the random orientation of the UE during the user's movement. The performance of ORWP is assessed on the handover rate and it is shown that it is important to take the random orientation into account.
Submitted 28 September, 2018; v1 submitted 21 May, 2018;
originally announced May 2018.
-
Spatial-Mode Diversity and Multiplexing for FSO Communication with Direct Detection
Authors:
Shenjie Huang,
Gilda Raoof Mehrpoor,
Majid Safari
Abstract:
This work investigates spatial-mode multiplexing (SMM) for practical free-space optical (FSO) communication systems using direct detection. Unlike several works in the literature where mutually incoherent channels are assumed, we consider mutually coherent channels that accurately describe SMM FSO systems employing a single laser source at the transmitter with a narrow linewidth. We develop an analytical model for such mutually coherent channels and derive expressions for the aggregate achievable rate (AAR). Through numerical simulations, it is shown that there exist optimal transmit mode sets which result in the maximal asymptotic AAR at high transmitted power. Moreover, in order to resolve the reliability issues of such SMM FSO systems in the presence of turbulence, a so-called mode diversity scheme is proposed that can be easily implemented along with SMM FSO systems. It is demonstrated that mode diversity can significantly improve the outage probability and the outage achievable rate performance of the multiplexed channels in SMM FSO systems degraded by turbulence.
Submitted 1 September, 2017;
originally announced September 2017.
-
Bidirectional User Throughput Maximization Based on Feedback Reduction in LiFi Networks
Authors:
Mohammad Dehghani Soltani,
Xiping Wu,
Majid Safari,
Harald Haas
Abstract:
Channel adaptive signalling, which is based on feedback, can enhance almost any performance metric. Unlike the radio frequency (RF) channel, the optical wireless communication (OWC) channel is fairly static. This feature enables a potential improvement of the bidirectional user throughput by reducing the amount of feedback. Light-Fidelity (LiFi) is a subset of OWC, and it is a bidirectional, high-speed and fully networked wireless communication technology where visible light and infrared are used in the downlink and uplink, respectively. In this paper, two techniques for reducing the amount of feedback in LiFi cellular networks are proposed: i) a limited-content feedback (LCF) scheme based on reducing the content of the feedback information, and ii) a limited-frequency feedback (LFF) scheme based on an update interval, which lets the receiver transmit feedback information only after a number of data frames have been transmitted. Furthermore, based on the random waypoint (RWP) mobility model, the optimum update interval, which provides the maximum bidirectional user equipment (UE) throughput, is derived. Results show that the proposed schemes can achieve better average overall throughput compared to the benchmark one-bit feedback and full-feedback mechanisms.
Submitted 10 August, 2017;
originally announced August 2017.