Search | arXiv e-print repository

Learning to Interfere in Non-Orthogonal Multiple-Access Joint Source-Channel Coding

Authors: Selim F. Yilmaz, Can Karamanli, Deniz Gunduz

Abstract: We consider multiple transmitters aiming to communicate their source signals (e.g., images) over a multiple access channel (MAC). Conventional communication systems minimize interference by orthogonally allocating resources (time and/or bandwidth) among users, which limits their capacity. We introduce a machine learning (ML)-aided wireless image transmission method that merges compression and chan… ▽ More We consider multiple transmitters aiming to communicate their source signals (e.g., images) over a multiple access channel (MAC). Conventional communication systems minimize interference by orthogonally allocating resources (time and/or bandwidth) among users, which limits their capacity. We introduce a machine learning (ML)-aided wireless image transmission method that merges compression and channel coding using a multi-view autoencoder, which allows the transmitters to use all the available channel resources simultaneously, resulting in a non-orthogonal multiple access (NOMA) scheme. The receiver must recover all the images from the received superposed signal, while also associating each image with its transmitter. Traditional ML models deal with individual samples, whereas our model allows signals from different users to interfere in order to leverage gains from NOMA under limited bandwidth and power constraints. We introduce a progressive fine-tuning algorithm that doubles the number of users at each iteration, maintaining initial performance with orthogonalized user-specific projections, which is then improved through fine-tuning steps. Remarkably, our method scales up to 16 users and beyond, with only a 0.6% increase in the number of trainable parameters compared to a single-user model, significantly enhancing recovered image quality and outperforming existing NOMA-based methods over a wide range of datasets, metrics, and channel conditions. Our approach paves the way for more efficient and robust multi-user communication systems, leveraging innovative ML components and strategies. △ Less

Submitted 23 March, 2025; originally announced April 2025.

Comments: 18 pages, 19 figures

arXiv:2503.12484 [pdf, other]

SING: Semantic Image Communications using Null-Space and INN-Guided Diffusion Models

Authors: Jiakang Chen, Selim F. Yilmaz, Di You, Pier Luigi Dragotti, Deniz Gündüz

Abstract: Joint source-channel coding systems based on deep neural networks (DeepJSCC) have recently demonstrated remarkable performance in wireless image transmission. Existing methods primarily focus on minimizing distortion between the transmitted image and the reconstructed version at the receiver, often overlooking perceptual quality. This can lead to severe perceptual degradation when transmitting ima… ▽ More Joint source-channel coding systems based on deep neural networks (DeepJSCC) have recently demonstrated remarkable performance in wireless image transmission. Existing methods primarily focus on minimizing distortion between the transmitted image and the reconstructed version at the receiver, often overlooking perceptual quality. This can lead to severe perceptual degradation when transmitting images under extreme conditions, such as low bandwidth compression ratios (BCRs) and low signal-to-noise ratios (SNRs). In this work, we propose SING, a novel two-stage JSCC framework that formulates the recovery of high-quality source images from corrupted reconstructions as an inverse problem. Depending on the availability of information about the DeepJSCC encoder/decoder and the channel at the receiver, SING can either approximate the stochastic degradation as a linear transformation, or leverage invertible neural networks (INNs) for precise modeling. Both approaches enable the seamless integration of diffusion models into the reconstruction process, enhancing perceptual quality. Experimental results demonstrate that SING outperforms DeepJSCC and other approaches, delivering superior perceptual quality even under extremely challenging conditions, including scenarios with significant distribution mismatches between the training and test data. △ Less

Submitted 16 March, 2025; originally announced March 2025.

arXiv:2412.04081 [pdf, other]

Federated Learning in Mobile Networks: A Comprehensive Case Study on Traffic Forecasting

Authors: Nikolaos Pavlidis, Vasileios Perifanis, Selim F. Yilmaz, Francesc Wilhelmi, Marco Miozzo, Pavlos S. Efraimidis, Remous-Aris Koutsiamanis, Pavol Mulinka, Paolo Dini

Abstract: The increasing demand for efficient resource allocation in mobile networks has catalyzed the exploration of innovative solutions that could enhance the task of real-time cellular traffic prediction. Under these circumstances, federated learning (FL) stands out as a distributed and privacy-preserving solution to foster collaboration among different sites, thus enabling responsive near-the-edge solu… ▽ More The increasing demand for efficient resource allocation in mobile networks has catalyzed the exploration of innovative solutions that could enhance the task of real-time cellular traffic prediction. Under these circumstances, federated learning (FL) stands out as a distributed and privacy-preserving solution to foster collaboration among different sites, thus enabling responsive near-the-edge solutions. In this paper, we comprehensively study the potential benefits of FL in telecommunications through a case study on federated traffic forecasting using real-world data from base stations (BSs) in Barcelona (Spain). Our study encompasses relevant aspects within the federated experience, including model aggregation techniques, outlier management, the impact of individual clients, personalized learning, and the integration of exogenous sources of data. The performed evaluation is based on both prediction accuracy and sustainability, thus showcasing the environmental impact of employed FL algorithms in various settings. The findings from our study highlight FL as a promising and robust solution for mobile traffic prediction, emphasizing its twin merits as a privacy-conscious and environmentally sustainable approach, while also demonstrating its capability to overcome data heterogeneity and ensure high-quality predictions, marking a significant stride towards its integration in mobile traffic management systems. △ Less

Submitted 5 December, 2024; originally announced December 2024.

arXiv:2410.05172 [pdf, ps, other]

Unlocking Potential: Integrating Multihop, CRC, and GRAND for Wireless 5G-Beyond/6G Networks

Authors: Bora Bozkurt, Emirhan Zor, Ferkan Yilmaz

Abstract: As future wireless networks move towards millimeter wave (mmWave) and terahertz (THz) frequencies for 6G, multihop transmission using Integrated Access Backhaul (IABs) and Network-Controlled Repeaters (NCRs) will be highly essential to overcome coverage limitations. This paper examines the use of Guessing Random Additive Noise (GRAND) decoding for multihop transmissions in 3GPP networks. We explor… ▽ More As future wireless networks move towards millimeter wave (mmWave) and terahertz (THz) frequencies for 6G, multihop transmission using Integrated Access Backhaul (IABs) and Network-Controlled Repeaters (NCRs) will be highly essential to overcome coverage limitations. This paper examines the use of Guessing Random Additive Noise (GRAND) decoding for multihop transmissions in 3GPP networks. We explore two scenarios: one where only the destination uses GRAND decoding, and another where both relays and the destination leverage it. Interestingly, in the latter scenario, the Bit Error Rate (BER) curves for all hop counts intersect at a specific Signal-to-Noise Ratio (SNR), which we term the GRAND barrier. This finding offers valuable insights for future research and 3GPP standard development. Simulations confirm the effectiveness of GRAND in improving communication speed and quality, contributing to the robustness and interconnectivity of future wireless systems, particularly relevant for the migration towards mmWave and THz bands in 6G networks. Finally, we investigate the integration of multihop transmission, CRC detection, and GRAND decoding within 3GPP networks, demonstrating their potential to overcome coverage limitations and enhance overall network performance. △ Less

Submitted 7 October, 2024; originally announced October 2024.

Comments: Best Paper Awarded, 6 Pages, 8 Figures, ICCSPA'24 Conference. This work is supported by Istanbul Technical University BAPSIS MAB- 2023-44565 project. The publication is funded by TUBITAK-BILGEM

Journal ref: Publication ID: 979-8-3503-8481-9/24/$31.00 \c{opyright}2024 IEEE

arXiv:2407.21151 [pdf, other]

doi 10.1109/TMLCN.2025.3526551

Private Collaborative Edge Inference via Over-the-Air Computation

Authors: Selim F. Yilmaz, Burak Hasircioglu, Li Qiao, Deniz Gunduz

Abstract: We consider collaborative inference at the wireless edge, where each client's model is trained independently on its local dataset. Clients are queried in parallel to make an accurate decision collaboratively. In addition to maximizing the inference accuracy, we also want to ensure the privacy of local models. To this end, we leverage the superposition property of the multiple access channel to imp… ▽ More We consider collaborative inference at the wireless edge, where each client's model is trained independently on its local dataset. Clients are queried in parallel to make an accurate decision collaboratively. In addition to maximizing the inference accuracy, we also want to ensure the privacy of local models. To this end, we leverage the superposition property of the multiple access channel to implement bandwidth-efficient multi-user inference methods. We propose different methods for ensemble and multi-view classification that exploit over-the-air computation (OAC). We show that these schemes perform better than their orthogonal counterparts with statistically significant differences while using fewer resources and providing privacy guarantees. We also provide experimental results verifying the benefits of the proposed OAC approach to multi-user inference, and perform an ablation study to demonstrate the effectiveness of our design choices. We share the source code of the framework publicly on Github to facilitate further research and reproducibility. △ Less

Submitted 14 January, 2025; v1 submitted 30 July, 2024; originally announced July 2024.

Comments: 17 pages, 8 figures. This work extends from our preliminary study presented at the 2022 IEEE International Symposium on Information Theory [1]. arXiv admin note: text overlap with arXiv:2202.03129

arXiv:2310.04311 [pdf, other]

Distributed Deep Joint Source-Channel Coding with Decoder-Only Side Information

Authors: Selim F. Yilmaz, Ezgi Ozyilkan, Deniz Gunduz, Elza Erkip

Abstract: We consider low-latency image transmission over a noisy wireless channel when correlated side information is present only at the receiver side (the Wyner-Ziv scenario). In particular, we are interested in developing practical schemes using a data-driven joint source-channel coding (JSCC) approach, which has been previously shown to outperform conventional separation-based approaches in the practic… ▽ More We consider low-latency image transmission over a noisy wireless channel when correlated side information is present only at the receiver side (the Wyner-Ziv scenario). In particular, we are interested in developing practical schemes using a data-driven joint source-channel coding (JSCC) approach, which has been previously shown to outperform conventional separation-based approaches in the practical finite blocklength regimes, and to provide graceful degradation with channel quality. We propose a novel neural network architecture that incorporates the decoder-only side information at multiple stages at the receiver side. Our results demonstrate that the proposed method succeeds in integrating the side information, yielding improved performance at all channel conditions in terms of the various quality measures considered here, especially at low channel signal-to-noise ratios (SNRs) and small bandwidth ratios (BRs). We have made the source code of the proposed method public to enable further research, and the reproducibility of the results. △ Less

Submitted 27 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: To appear in IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN) 2024

arXiv:2309.15889 [pdf, other]

doi 10.1109/INFOCOMWKSHPS61880.2024.10620904

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

Authors: Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz

Abstract: We consider the image transmission problem over a noisy wireless channel via deep learning-based joint source-channel coding (DeepJSCC) along with a denoising diffusion probabilistic model (DDPM) at the receiver. Specifically, we are interested in the perception-distortion trade-off in the practical finite block length regime, in which separate source and channel coding can be highly suboptimal. W… ▽ More We consider the image transmission problem over a noisy wireless channel via deep learning-based joint source-channel coding (DeepJSCC) along with a denoising diffusion probabilistic model (DDPM) at the receiver. Specifically, we are interested in the perception-distortion trade-off in the practical finite block length regime, in which separate source and channel coding can be highly suboptimal. We introduce a novel scheme, where the conventional DeepJSCC encoder targets transmitting a lower resolution version of the image, which later can be refined thanks to the generative model available at the receiver. In particular, we utilize the range-null space decomposition of the target image; DeepJSCC transmits the range-space of the image, while DDPM progressively refines its null space contents. Through extensive experiments, we demonstrate significant improvements in distortion and perceptual quality of reconstructed images compared to standard DeepJSCC and the state-of-the-art generative learning-based method. △ Less

Submitted 20 September, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

Comments: 6 pages, 5 figures. Published at INFOCOM 2024 Workshops

arXiv:2309.10645 [pdf, other]

Towards Energy-Aware Federated Traffic Prediction for Cellular Networks

Authors: Vasileios Perifanis, Nikolaos Pavlidis, Selim F. Yilmaz, Francesc Wilhelmi, Elia Guerra, Marco Miozzo, Pavlos S. Efraimidis, Paolo Dini, Remous-Aris Koutsiamanis

Abstract: Cellular traffic prediction is a crucial activity for optimizing networks in fifth-generation (5G) networks and beyond, as accurate forecasting is essential for intelligent network design, resource allocation and anomaly mitigation. Although machine learning (ML) is a promising approach to effectively predict network traffic, the centralization of massive data in a single data center raises issues… ▽ More Cellular traffic prediction is a crucial activity for optimizing networks in fifth-generation (5G) networks and beyond, as accurate forecasting is essential for intelligent network design, resource allocation and anomaly mitigation. Although machine learning (ML) is a promising approach to effectively predict network traffic, the centralization of massive data in a single data center raises issues regarding confidentiality, privacy and data transfer demands. To address these challenges, federated learning (FL) emerges as an appealing ML training framework which offers high accurate predictions through parallel distributed computations. However, the environmental impact of these methods is often overlooked, which calls into question their sustainability. In this paper, we address the trade-off between accuracy and energy consumption in FL by proposing a novel sustainability indicator that allows assessing the feasibility of ML models. Then, we comprehensively evaluate state-of-the-art deep learning (DL) architectures in a federated scenario using real-world measurements from base station (BS) sites in the area of Barcelona, Spain. Our findings indicate that larger ML models achieve marginally improved performance but have a significant environmental impact in terms of carbon footprint, which make them impractical for real-world applications. △ Less

Submitted 19 September, 2023; originally announced September 2023.

Comments: International Symposium on Federated Learning Technologies and Applications (FLTA), 2023

arXiv:2211.09920 [pdf, other]

Distributed Deep Joint Source-Channel Coding over a Multiple Access Channel

Authors: Selim F. Yilmaz, Can Karamanli, Deniz Gunduz

Abstract: We consider distributed image transmission over a noisy multiple access channel (MAC) using deep joint source-channel coding (DeepJSCC). It is known that Shannon's separation theorem holds when transmitting independent sources over a MAC in the asymptotic infinite block length regime. However, we are interested in the practical finite block length regime, in which case separate source and channel… ▽ More We consider distributed image transmission over a noisy multiple access channel (MAC) using deep joint source-channel coding (DeepJSCC). It is known that Shannon's separation theorem holds when transmitting independent sources over a MAC in the asymptotic infinite block length regime. However, we are interested in the practical finite block length regime, in which case separate source and channel coding is known to be suboptimal. We introduce a novel joint image compression and transmission scheme, where the devices send their compressed image representations in a non-orthogonal manner. While non-orthogonal multiple access (NOMA) is known to achieve the capacity region, to the best of our knowledge, non-orthogonal joint source channel coding (JSCC) scheme for practical systems has not been studied before. Through extensive experiments, we show significant improvements in terms of the quality of the reconstructed images compared to orthogonal transmission employing current DeepJSCC approaches particularly for low bandwidth ratios. We publicly share source code to facilitate further research and reproducibility. △ Less

Submitted 2 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

Comments: To appear in IEEE International Conference on Communications (ICC) 2023

arXiv:2210.04166 [pdf, ps, other]

Test-time Recalibration of Conformal Predictors Under Distribution Shift Based on Unlabeled Examples

Authors: Fatih Furkan Yilmaz, Reinhard Heckel

Abstract: Modern image classifiers are very accurate, but the predictions come without uncertainty estimates. Conformal predictors provide uncertainty estimates by computing a set of classes containing the correct class with a user-specified probability based on the classifier's probability estimates. To provide such sets, conformal predictors often estimate a cutoff threshold for the probability estimates… ▽ More Modern image classifiers are very accurate, but the predictions come without uncertainty estimates. Conformal predictors provide uncertainty estimates by computing a set of classes containing the correct class with a user-specified probability based on the classifier's probability estimates. To provide such sets, conformal predictors often estimate a cutoff threshold for the probability estimates based on a calibration set. Conformal predictors guarantee reliability only when the calibration set is from the same distribution as the test set. Therefore, conformal predictors need to be recalibrated for new distributions. However, in practice, labeled data from new distributions is rarely available, making calibration infeasible. In this work, we consider the problem of predicting the cutoff threshold for a new distribution based on unlabeled examples. While it is impossible in general to guarantee reliability when calibrating based on unlabeled examples, we propose a method that provides excellent uncertainty estimates under natural distribution shifts, and provably works for a specific model of a distribution shift. △ Less

Submitted 3 June, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

arXiv:2206.01378 [pdf, ps, other]

Regularization-wise double descent: Why it occurs and how to eliminate it

Authors: Fatih Furkan Yilmaz, Reinhard Heckel

Abstract: The risk of overparameterized models, in particular deep neural networks, is often double-descent shaped as a function of the model size. Recently, it was shown that the risk as a function of the early-stopping time can also be double-descent shaped, and this behavior can be explained as a super-position of bias-variance tradeoffs. In this paper, we show that the risk of explicit L2-regularized mo… ▽ More The risk of overparameterized models, in particular deep neural networks, is often double-descent shaped as a function of the model size. Recently, it was shown that the risk as a function of the early-stopping time can also be double-descent shaped, and this behavior can be explained as a super-position of bias-variance tradeoffs. In this paper, we show that the risk of explicit L2-regularized models can exhibit double descent behavior as a function of the regularization strength, both in theory and practice. We find that for linear regression, a double descent shaped risk is caused by a superposition of bias-variance tradeoffs corresponding to different parts of the model and can be mitigated by scaling the regularization strength of each part appropriately. Motivated by this result, we study a two-layer neural network and show that double descent can be eliminated by adjusting the regularization strengths for the first and second layer. Lastly, we study a 5-layer CNN and ResNet-18 trained on CIFAR-10 with label noise, and CIFAR-100 without label noise, and demonstrate that all exhibit double descent behavior as a function of the regularization strength. △ Less

Submitted 2 June, 2022; originally announced June 2022.

Comments: To be published in the 2022 IEEE International Symposium on Information Theory (ISIT) Proceedings

arXiv:2203.10472 [pdf, other]

Federated Spatial Reuse Optimization in Next-Generation Decentralized IEEE 802.11 WLANs

Authors: Francesc Wilhelmi, Jernej Hribar, Selim F. Yilmaz, Emre Ozfatura, Kerem Ozfatura, Ozlem Yildiz, Deniz Gündüz, Hao Chen, Xiaoying Ye, Lizhao You, Yulin Shao, Paolo Dini, Boris Bellalta

Abstract: As wireless standards evolve, more complex functionalities are introduced to address the increasing requirements in terms of throughput, latency, security, and efficiency. To unleash the potential of such new features, artificial intelligence (AI) and machine learning (ML) are currently being exploited for deriving models and protocols from data, rather than by hand-programming. In this paper, we… ▽ More As wireless standards evolve, more complex functionalities are introduced to address the increasing requirements in terms of throughput, latency, security, and efficiency. To unleash the potential of such new features, artificial intelligence (AI) and machine learning (ML) are currently being exploited for deriving models and protocols from data, rather than by hand-programming. In this paper, we explore the feasibility of applying ML in next-generation wireless local area networks (WLANs). More specifically, we focus on the IEEE 802.11ax spatial reuse (SR) problem and predict its performance through federated learning (FL) models. The set of FL solutions overviewed in this work is part of the 2021 International Telecommunication Union (ITU) AI for 5G Challenge. △ Less

Submitted 7 June, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

arXiv:2202.03129 [pdf, other]

Over-the-Air Ensemble Inference with Model Privacy

Authors: Selim F. Yilmaz, Burak Hasircioglu, Deniz Gunduz

Abstract: We consider distributed inference at the wireless edge, where multiple clients with an ensemble of models, each trained independently on a local dataset, are queried in parallel to make an accurate decision on a new sample. In addition to maximizing inference accuracy, we also want to maximize the privacy of local models. We exploit the superposition property of the air to implement bandwidth-effi… ▽ More We consider distributed inference at the wireless edge, where multiple clients with an ensemble of models, each trained independently on a local dataset, are queried in parallel to make an accurate decision on a new sample. In addition to maximizing inference accuracy, we also want to maximize the privacy of local models. We exploit the superposition property of the air to implement bandwidth-efficient ensemble inference methods. We introduce different over-the-air ensemble methods and show that these schemes perform significantly better than their orthogonal counterparts, while using less resources and providing privacy guarantees. We also provide experimental results verifying the benefits of the proposed over-the-air inference approach, whose source code is shared publicly on Github. △ Less

Submitted 15 May, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: To appear in IEEE International Symposium on Information Theory (ISIT) 2022

arXiv:2107.11786 [pdf, other]

Deep Learning-based Frozen Section to FFPE Translation

Authors: Kutsev Bengisu Ozyoruk, Sermet Can, Guliz Irem Gokceler, Kayhan Basak, Derya Demir, Gurdeniz Serin, Uguray Payam Hacisalihoglu, Emirhan Kurtuluş, Berkan Darbaz, Ming Y. Lu, Tiffany Y. Chen, Drew F. K. Williamson, Funda Yilmaz, Faisal Mahmood, Mehmet Turan

Abstract: Frozen sectioning (FS) is the preparation method of choice for microscopic evaluation of tissues during surgical operations. The high speed of the procedure allows pathologists to rapidly assess the key microscopic features, such as tumour margins and malignant status to guide surgical decision-making and minimise disruptions to the course of the operation. However, FS is prone to introducing many… ▽ More Frozen sectioning (FS) is the preparation method of choice for microscopic evaluation of tissues during surgical operations. The high speed of the procedure allows pathologists to rapidly assess the key microscopic features, such as tumour margins and malignant status to guide surgical decision-making and minimise disruptions to the course of the operation. However, FS is prone to introducing many misleading artificial structures (histological artefacts), such as nuclear ice crystals, compression, and cutting artefacts, hindering timely and accurate diagnostic judgement of the pathologist. Additional training and prolonged experience is often required to make highly effective and time-critical diagnosis on frozen sections. On the other hand, the gold standard tissue preparation technique of formalin-fixation and paraffin-embedding (FFPE) provides significantly superior image quality, but is a very time-consuming process (12-48 hours), making it unsuitable for intra-operative use. In this paper, we propose an artificial intelligence (AI) method that improves FS image quality by computationally transforming frozen-sectioned whole-slide images (FS-WSIs) into whole-slide FFPE-style images in minutes. AI-FFPE rectifies FS artefacts with the guidance of an attention mechanism that puts a particular emphasis on artefacts while utilising a self-regularization mechanism established between FS input image and synthesized FFPE-style image that preserves clinically relevant features. As a result, AI-FFPE method successfully generates FFPE-style images without significantly extending tissue processing time and consequently improves diagnostic accuracy. We demonstrate the efficacy of AI-FFPE on lung and brain frozen sections using a variety of different qualitative and quantitative metrics including visual Turing tests from 20 board certified pathologists. △ Less

Submitted 2 November, 2021; v1 submitted 25 July, 2021; originally announced July 2021.

arXiv:2009.02572 [pdf, other]

PySAD: A Streaming Anomaly Detection Framework in Python

Authors: Selim F. Yilmaz, Suleyman S. Kozat

Abstract: Streaming anomaly detection requires algorithms that operate under strict constraints: bounded memory, single-pass processing, and constant-time complexity. We present PySAD, a comprehensive Python framework addressing these challenges through a unified architecture. The framework implements 17+ streaming algorithms (LODA, Half-Space Trees, xStream) with specialized components including projectors… ▽ More Streaming anomaly detection requires algorithms that operate under strict constraints: bounded memory, single-pass processing, and constant-time complexity. We present PySAD, a comprehensive Python framework addressing these challenges through a unified architecture. The framework implements 17+ streaming algorithms (LODA, Half-Space Trees, xStream) with specialized components including projectors, probability calibrators, and postprocessors. Unlike existing batch-focused frameworks, PySAD enables efficient real-time processing with bounded memory while maintaining compatibility with PyOD and scikit-learn. Supporting all learning paradigms for univariate and multivariate streams, PySAD provides the most comprehensive streaming anomaly detection toolkit in Python. The source code is publicly available at github.com/selimfirat/pysad. △ Less

Submitted 24 May, 2025; v1 submitted 5 September, 2020; originally announced September 2020.

Comments: 7 pages, 1 figure

arXiv:2008.11573 [pdf, other]

doi 10.1109/TNNLS.2021.3094304

Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting for Label Imbalance

Authors: Selim F. Yilmaz, E. Batuhan Kaynak, Aykut Koç, Hamdi Dibeklioğlu, Suleyman S. Kozat

Abstract: We investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics and social sciences. In particular, we introduce a sentiment analysis framework in multi-label setting as it obeys Plutchik wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution from each cl… ▽ More We investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics and social sciences. In particular, we introduce a sentiment analysis framework in multi-label setting as it obeys Plutchik wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution from each class during training, unlike previous static weighting methods that assign non-changing weights based on their class frequency. Moreover, we adapt the focal loss that favors harder instances from single-label object recognition literature to our multi-label setting. Furthermore, we derive a method to choose optimal class-specific thresholds that maximize the macro-f1 score in linear time complexity. Through an extensive set of experiments, we show that our method obtains the state-of-the-art performance in 7 of 9 metrics in 3 different languages using a single model compared to the common baselines and the best-performing methods in the SemEval competition. We publicly share our code for our model, which can perform sentiment analysis in 100 languages, to facilitate further research. △ Less

Submitted 26 August, 2020; originally announced August 2020.

Comments: 11 pages, 6 figures

arXiv:2007.10099 [pdf, ps, other]

Early Stopping in Deep Networks: Double Descent and How to Eliminate it

Authors: Reinhard Heckel, Fatih Furkan Yilmaz

Abstract: Over-parameterized models, such as large deep networks, often exhibit a double descent phenomenon, whereas a function of model size, error first decreases, increases, and decreases at last. This intriguing double descent behavior also occurs as a function of training epochs and has been conjectured to arise because training epochs control the model complexity. In this paper, we show that such epoc… ▽ More Over-parameterized models, such as large deep networks, often exhibit a double descent phenomenon, whereas a function of model size, error first decreases, increases, and decreases at last. This intriguing double descent behavior also occurs as a function of training epochs and has been conjectured to arise because training epochs control the model complexity. In this paper, we show that such epoch-wise double descent arises for a different reason: It is caused by a superposition of two or more bias-variance tradeoffs that arise because different parts of the network are learned at different epochs, and eliminating this by proper scaling of stepsizes can significantly improve the early stopping performance. We show this analytically for i) linear regression, where differently scaled features give rise to a superposition of bias-variance tradeoffs, and for ii) a two-layer neural network, where the first and second layer each govern a bias-variance tradeoff. Inspired by this theory, we study two standard convolutional networks empirically and show that eliminating epoch-wise double descent through adjusting stepsizes of different layers improves the early stopping performance significantly. △ Less

Submitted 19 September, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

Comments: 37 pages, 8 figures; changes from version 1: additional numerical results and clarifications

arXiv:2005.08948 [pdf, other]

Achieving Online Regression Performance of LSTMs with Simple RNNs

Authors: N. Mert Vural, Fatih Ilhan, Selim F. Yilmaz, Salih Ergüt, Suleyman S. Kozat

Abstract: Recurrent Neural Networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies. As an RNN model, Long-Short-Term-Memory Networks (LSTMs) are commonly preferred in practice, as these networks are capable of learning long-term dependencies while avoiding the vanishing gradient problem. However, due to their large number of parameters, traini… ▽ More Recurrent Neural Networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies. As an RNN model, Long-Short-Term-Memory Networks (LSTMs) are commonly preferred in practice, as these networks are capable of learning long-term dependencies while avoiding the vanishing gradient problem. However, due to their large number of parameters, training LSTMs requires considerably longer training time compared to simple RNNs (SRNNs). In this paper, we achieve the online regression performance of LSTMs with SRNNs efficiently. To this end, we introduce a first-order training algorithm with a linear time complexity in the number of parameters. We show that when SRNNs are trained with our algorithm, they provide very similar regression performance with the LSTMs in two to three times shorter training time. We provide strong theoretical analysis to support our experimental results by providing regret bounds on the convergence rate of our algorithm. Through an extensive set of experiments, we verify our theoretical work and demonstrate significant performance improvements of our algorithm with respect to LSTMs and the other state-of-the-art learning models. △ Less

Submitted 31 May, 2021; v1 submitted 16 May, 2020; originally announced May 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:2003.03601

arXiv:2005.05865 [pdf, other]

Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization

Authors: Selim F. Yilmaz, Suleyman S. Kozat

Abstract: We investigate unsupervised anomaly detection for high-dimensional data and introduce a deep metric learning (DML) based framework. In particular, we learn a distance metric through a deep neural network. Through this metric, we project the data into the metric space that better separates the anomalies from the normal data and reduces the effect of the curse of dimensionality for high-dimensional… ▽ More We investigate unsupervised anomaly detection for high-dimensional data and introduce a deep metric learning (DML) based framework. In particular, we learn a distance metric through a deep neural network. Through this metric, we project the data into the metric space that better separates the anomalies from the normal data and reduces the effect of the curse of dimensionality for high-dimensional data. We present a novel data distillation method through self-supervision to remedy the conventional practice of assuming all data as normal. We also employ the hard mining technique from the DML literature. We show these components improve the performance of our model and significantly reduce the running time. Through an extensive set of experiments on the 14 real-world datasets, our method demonstrates significant performance gains compared to the state-of-the-art unsupervised anomaly detection methods, e.g., an absolute improvement between 4.44% and 11.74% on the average over the 14 datasets. Furthermore, we share the source code of our method on Github to facilitate further research. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: 11 pages, 3 figures

arXiv:2003.03601

RNN-based Online Learning: An Efficient First-Order Optimization Algorithm with a Convergence Guarantee

Authors: N. Mert Vural, Selim F. Yilmaz, Fatih Ilhan, Suleyman S. Kozat

Abstract: We investigate online nonlinear regression with continually running recurrent neural network networks (RNNs), i.e., RNN-based online learning. For RNN-based online learning, we introduce an efficient first-order training algorithm that theoretically guarantees to converge to the optimum network parameters. Our algorithm is truly online such that it does not make any assumption on the learning envi… ▽ More We investigate online nonlinear regression with continually running recurrent neural network networks (RNNs), i.e., RNN-based online learning. For RNN-based online learning, we introduce an efficient first-order training algorithm that theoretically guarantees to converge to the optimum network parameters. Our algorithm is truly online such that it does not make any assumption on the learning environment to guarantee convergence. Through numerical simulations, we verify our theoretical results and illustrate significant performance improvements achieved by our algorithm with respect to the state-of-the-art RNN training methods. △ Less

Submitted 31 May, 2021; v1 submitted 7 March, 2020; originally announced March 2020.

Comments: This paper was an early draft of the presented results. We have written and published another paper (arXiv:2005.08948) where we have improved the material in this paper. The published paper covers most of the material presented in this paper as well. Therefore, we remove this paper from Arxiv and kindly refer the interested readers to arXiv:2005.08948

arXiv:1910.09055 [pdf, other]

Image recognition from raw labels collected without annotators

Authors: Fatih Furkan Yilmaz, Reinhard Heckel

Abstract: Image classification problems are typically addressed by first collecting examples with candidate labels, second cleaning the candidate labels manually, and third training a deep neural network on the clean examples. The manual labeling step is often the most expensive one as it requires workers to label millions of images. In this paper we propose to work without any explicitly labeled data by i)… ▽ More Image classification problems are typically addressed by first collecting examples with candidate labels, second cleaning the candidate labels manually, and third training a deep neural network on the clean examples. The manual labeling step is often the most expensive one as it requires workers to label millions of images. In this paper we propose to work without any explicitly labeled data by i) directly training the deep neural network on the noisy candidate labels, and ii) early stopping the training to avoid overfitting. With this procedure we exploit an intriguing property of standard overparameterized convolutional neural networks trained with (stochastic) gradient descent: Clean labels are fitted faster than noisy ones. We consider two classification problems, a subset of ImageNet and CIFAR-10. For both, we construct large candidate datasets without any explicit human annotations, that only contain 10%-50% correctly labeled examples per class. We show that training on the candidate examples and regularizing through early stopping gives higher test performance for both problems than when training on the original, clean data. This is possible because the candidate datasets contain a huge number of clean examples, and, as we show in this paper, the noise generated through the label collection process is not nearly as adversarial for learning as the noise generated by randomly flipping labels. △ Less

Submitted 25 February, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

Comments: Version changelog: Added content on ImageNet related experiments; Re-structured the document to incorporate the new content

arXiv:1910.07675 [pdf, ps, other]

On the asymptotic analysis of the high-order statistics of the channel capacity over generalized fading channels

Authors: Ferkan Yilmaz

Abstract: In this article, we provide further asymptotic analysis to the higher-order statistics (HOS) of the channel capacity over generalized fading channels, especially by proposing simple and closed-form expressions each of which can be easily computed as a tight-bound revealing the existence of constant gap between the actual and asymptotic HOS of the channel capacity in the limit of both high- and low… ▽ More In this article, we provide further asymptotic analysis to the higher-order statistics (HOS) of the channel capacity over generalized fading channels, especially by proposing simple and closed-form expressions each of which can be easily computed as a tight-bound revealing the existence of constant gap between the actual and asymptotic HOS of the channel capacity in the limit of both high- and low-signal to noise ratios (SNRs). As such, we show that these closed-form asymptotic expressions are insightful enough to comprehend the diversity gains. The mathematical formalism we followed in this article is illustrated with some selected numerical examples that validate the correctness of our newly derived asymptotic results. △ Less

Submitted 12 October, 2019; originally announced October 2019.

Comments: 15 pages

arXiv:1910.05592 [pdf, ps, other]

McLeish Distribution: Performance of Digital Communications over Additive White McLeish Noise (AWMN) Channels

Authors: Ferkan Yilmaz

Abstract: The objective of this article is to propose and statistically validate a more general additive non-Gaussian noise distribution, which we term McLeish distribution, whose random nature can model different impulsive noise environments commonly encountered in practice and provides a robust alternative to Gaussian noise distribution. In particular, for the first time in the literature, we establish th… ▽ More The objective of this article is to propose and statistically validate a more general additive non-Gaussian noise distribution, which we term McLeish distribution, whose random nature can model different impulsive noise environments commonly encountered in practice and provides a robust alternative to Gaussian noise distribution. In particular, for the first time in the literature, we establish the laws of McLeish distribution and therefrom derive the laws of the sum of McLeish distributions by obtaining closed-form expressions for their PDF, CDF, complementary CDF (C$^2$DF), MGF and higher-order moments. Further, for certain problems related to the envelope of complex random signals, we extend McLeish distribution to complex McLeish distribution and thereby propose circularly/elliptically symmetric (CS/ES) complex McLeish distributions with closed-form PDF, CDF, MGF and higher-order moments. For generalization of one-dimensional distribution to multi-dimensional distribution, we develop and propose both multivariate McLeish distribution and multivariate complex CS/ES (CCS/CES) McLeish distribution with analytically tractable and closed-form PDF, CDF, C$^2$DF and MGF. In addition to the proposed McLeish distribution framework and for its practical illustration, we theoretically investigate and prove the existence of McLeish distribution as additive noise in communication systems. Accordingly, we introduce additive white McLeish noise (AWMN) channels. For coherent/non-coherent signaling over AWMN channels, we propose novel expressions for MAP and ML symbol decisions and thereby obtain closed-form expressions for both BER of binary modulation schemes and SER of various M-ary modulation schemes. Further, we verify the validity and accuracy of our novel BER/SER expressions with some selected numerical examples and some computer-based simulations. △ Less

Submitted 5 March, 2020; v1 submitted 12 October, 2019; originally announced October 2019.

Comments: Single column, 173 pages

arXiv:1907.06634 [pdf, ps, other]

On the Relationships Between Average Channel Capacity, Average Bit Error Rate, Outage probability and Outage Capacity over Additive White Gaussian Noise Channels

Authors: Ferkan Yilmaz

Abstract: In the theory of wireless communications, average performance measures (APMs) are widely utilized to quantify the performance gains/impairments in various fading environments under various scenarios, and to comprehend how the factors arising from design/implementation affect system performance. To the best of our knowledge, it has not been yet discovered in the literature how these APMs relate to… ▽ More In the theory of wireless communications, average performance measures (APMs) are widely utilized to quantify the performance gains/impairments in various fading environments under various scenarios, and to comprehend how the factors arising from design/implementation affect system performance. To the best of our knowledge, it has not been yet discovered in the literature how these APMs relate to each other. In this article, having been inspired by the work of Verdu et al. [1], we propose that one APM can be calculated using the other APMs instead of using the end-to-end SNR distribution. Particularly, using the Lamperti's transformation (LT), we propose a tractable approach, which we call LT-based APM analysis, to identify a relationship between any two given APMs such that it is irrespective of SNR distribution. Thereby, we introduce some novel relationships among average channel capacity (ACC), average bit error rate (ABER) and outage probability/capacity (OP/OC) performances, and accordingly present how to obtain ACC from ABER performance and how to obtain OP/OC from ACC performance in fading environments. We demonstrate that the ACC of any communications system can be evaluated empirically without using end-to-end SNR distribution. We consider some numerical examples and simulations to validate our newly derived relationships. △ Less

Submitted 12 October, 2019; v1 submitted 14 July, 2019; originally announced July 2019.

Comments: 14 pages, 3 figures

arXiv:1812.10558 [pdf, other]

Deception Detection by 2D-to-3D Face Reconstruction from Videos

Authors: Minh Ngô, Burak Mandira, Selim Fırat Yılmaz, Ward Heij, Sezer Karaoglu, Henri Bouma, Hamdi Dibeklioglu, Theo Gevers

Abstract: Lies and deception are common phenomena in society, both in our private and professional lives. However, humans are notoriously bad at accurate deception detection. Based on the literature, human accuracy of distinguishing between lies and truthful statements is 54% on average, in other words it is slightly better than a random guess. While people do not much care about this issue, in high-stakes… ▽ More Lies and deception are common phenomena in society, both in our private and professional lives. However, humans are notoriously bad at accurate deception detection. Based on the literature, human accuracy of distinguishing between lies and truthful statements is 54% on average, in other words it is slightly better than a random guess. While people do not much care about this issue, in high-stakes situations such as interrogations for series crimes and for evaluating the testimonies in court cases, accurate deception detection methods are highly desirable. To achieve a reliable, covert, and non-invasive deception detection, we propose a novel method that jointly extracts reliable low- and high-level facial features namely, 3D facial geometry, skin reflectance, expression, head pose, and scene illumination in a video sequence. Then these features are modeled using a Recurrent Neural Network to learn temporal characteristics of deceptive and honest behavior. We evaluate the proposed method on the Real-Life Trial (RLT) dataset that contains high-stake deceptive and honest videos recorded in courtrooms. Our results show that the proposed method (with an accuracy of 72.8%) improves the state of the art as well as outperforming the use of manually coded facial attributes 67.6%) in deception detection. △ Less

Submitted 26 December, 2018; originally announced December 2018.

Comments: 9 pages, 3 figures

arXiv:1805.05572 [pdf, ps, other]

doi 10.1109/TWC.2015.2467386

Performance Analysis of Free-Space Optical Links Over Málaga ($\mathcal{M}$) Turbulence Channels with Pointing Errors

Authors: Imran Shafique Ansari, Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: In this work, we present a unified performance analysis of a free-space optical (FSO) link that accounts for pointing errors and both types of detection techniques (i.e. intensity modulation/direct detection (IM/DD) as well as heterodyne detection). More specifically, we present unified exact closed-form expressions for the cumulative distribution function, the probability density function, the mo… ▽ More In this work, we present a unified performance analysis of a free-space optical (FSO) link that accounts for pointing errors and both types of detection techniques (i.e. intensity modulation/direct detection (IM/DD) as well as heterodyne detection). More specifically, we present unified exact closed-form expressions for the cumulative distribution function, the probability density function, the moment generating function, and the moments of the end-to-end signal-to-noise ratio (SNR) of a single link FSO transmission system, all in terms of the Meijer's G function except for the moments that is in terms of simple elementary functions. We then capitalize on these unified results to offer unified exact closed-form expressions for various performance metrics of FSO link transmission systems, such as, the outage probability, the scintillation index (SI), the average error rate for binary and $M$-ary modulation schemes, and the ergodic capacity (except for IM/DD technique, where we present closed-form lower bound results), all in terms of Meijer's G functions except for the SI that is in terms of simple elementary functions. Additionally, we derive the asymptotic results for all the expressions derived earlier in terms of Meijer's G function in the high SNR regime in terms of simple elementary functions via an asymptotic expansion of the Meijer's G function. We also derive new asymptotic expressions for the ergodic capacity in the low as well as high SNR regimes in terms of simple elementary functions via utilizing moments. All the presented results are verified via computer-based Monte-Carlo simulations. △ Less

Submitted 15 May, 2018; originally announced May 2018.

arXiv:1302.4225 [pdf, ps, other]

Impact of Pointing Errors on the Performance of Mixed RF/FSO Dual-Hop Transmission Systems

Authors: Imran Shafique Ansari, Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: In this work, the performance analysis of a dual-hop relay transmission system composed of asymmetric radio-frequency (RF)/free-space optical (FSO) links with pointing errors is presented. More specifically, we build on the system model presented in [1] to derive new exact closed-form expressions for the cumulative distribution function, probability density function, moment generating function, an… ▽ More In this work, the performance analysis of a dual-hop relay transmission system composed of asymmetric radio-frequency (RF)/free-space optical (FSO) links with pointing errors is presented. More specifically, we build on the system model presented in [1] to derive new exact closed-form expressions for the cumulative distribution function, probability density function, moment generating function, and moments of the end-to-end signal-to-noise ratio in terms of the Meijer's G function. We then capitalize on these results to offer new exact closed-form expressions for the higher-order amount of fading, average error rate for binary and M-ary modulation schemes, and the ergodic capacity, all in terms of Meijer's G functions. Our new analytical results were also verified via computer-based Monte-Carlo simulation results. △ Less

Submitted 18 February, 2013; originally announced February 2013.

Comments: 6 pages, 3 figures

arXiv:1211.4372 [pdf, ps, other]

A Framework for Uplink Intercell Interference Modeling with Channel-Based Scheduling

Authors: Hina Tabassum, Ferkan Yilmaz, Zaher Dawy, Mohamed-Slim Alouini

Abstract: This paper presents a novel framework for modeling the uplink intercell interference (ICI) in a multiuser cellular network. The proposed framework assists in quantifying the impact of various fading channel models and state-of-the-art scheduling schemes on the uplink ICI. Firstly, we derive a semianalytical expression for the distribution of the location of the scheduled user in a given cell consi… ▽ More This paper presents a novel framework for modeling the uplink intercell interference (ICI) in a multiuser cellular network. The proposed framework assists in quantifying the impact of various fading channel models and state-of-the-art scheduling schemes on the uplink ICI. Firstly, we derive a semianalytical expression for the distribution of the location of the scheduled user in a given cell considering a wide range of scheduling schemes. Based on this, we derive the distribution and moment generating function (MGF) of the uplink ICI considering a single interfering cell. Consequently, we determine the MGF of the cumulative ICI observed from all interfering cells and derive explicit MGF expressions for three typical fading models. Finally, we utilize the obtained expressions to evaluate important network performance metrics such as the outage probability, ergodic capacity, and average fairness numerically. Monte-Carlo simulation results are provided to demonstrate the efficacy of the derived analytical expressions. △ Less

Submitted 19 November, 2012; originally announced November 2012.

Comments: IEEE Transactions on Wireless Communications, 2013. arXiv admin note: substantial text overlap with arXiv:1206.2292

arXiv:1210.0100 [pdf, ps, other]

On the Sum of Squared η-μRandom Variates With Application to the Performance of Wireless Communication Systems

Authors: Imran Shafique Ansari, Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: The probability density function (PDF) and cumulative distribution function of the sum of L independent but not necessarily identically distributed squared η-μvariates, applicable to the output statistics of maximal ratio combining (MRC) receiver operating over η-μfading channels that includes the Hoyt and the Nakagami-m models as special cases, is presented in closed-form in terms of the Fox's H-… ▽ More The probability density function (PDF) and cumulative distribution function of the sum of L independent but not necessarily identically distributed squared η-μvariates, applicable to the output statistics of maximal ratio combining (MRC) receiver operating over η-μfading channels that includes the Hoyt and the Nakagami-m models as special cases, is presented in closed-form in terms of the Fox's H-bar function. Further analysis, particularly on the bit error rate via PDF-based approach, is also represented in closed form in terms of the extended Fox's H-bar function (H-hat). The proposed new analytical results complement previous results and are illustrated by extensive numerical and Monte Carlo simulation results. △ Less

Submitted 29 September, 2012; originally announced October 2012.

Comments: 6 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1202.2576

arXiv:1209.4065 [pdf, ps, other]

On the Performance of Transmit Antenna Selection Based on Shadowing Side Information

Authors: Ahmet Yilmaz, Ferkan Yilmaz, Mohamed-Slim Alouini, Oğuz Kucur

Abstract: In this paper, a transmit antenna selection scheme, which is based on shadowing side information, is investigated. In this scheme, the selected single transmit antenna provides the highest shadowing coefficient between transmitter and receiver. By the proposed technique, the frequency of the usage of the feedback channel from the receiver to the transmitter and also channel estimation complexity a… ▽ More In this paper, a transmit antenna selection scheme, which is based on shadowing side information, is investigated. In this scheme, the selected single transmit antenna provides the highest shadowing coefficient between transmitter and receiver. By the proposed technique, the frequency of the usage of the feedback channel from the receiver to the transmitter and also channel estimation complexity at the receiver can be reduced. We study the performance of our proposed technique and in the analysis, we consider an independent but not identically distributed Generalized-K composite fading model. More specifically exact and closed-form expressions for the outage probability, the moment generating function, the moments of signal-to-noise ratio, and the average symbol error probability are derived. In addition, asymptotic outage probability and symbol error probability expressions are also presented in order to investigate the diversity order and the array gain. Finally, our theoretical performance results are validated by Monte Carlo simulations. △ Less

Submitted 19 September, 2012; v1 submitted 18 September, 2012; originally announced September 2012.

Comments: 7 pages, 5 figures, journal

arXiv:1207.1805 [pdf, ps, other]

A Novel Ergodic Capacity Analysis of Diversity Combining and Multihop Transmission Systems over Generalized Composite Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: Ergodic capacity is an important performance measure associated with reliable communication at the highest rate at which information can be sent over the channel with a negligible probability of error. In the shadow of this definition, diversity receivers (such as selection combining, equal-gain combining and maximal-ratio combining) and transmission techniques (such as cascaded fading channels, a… ▽ More Ergodic capacity is an important performance measure associated with reliable communication at the highest rate at which information can be sent over the channel with a negligible probability of error. In the shadow of this definition, diversity receivers (such as selection combining, equal-gain combining and maximal-ratio combining) and transmission techniques (such as cascaded fading channels, amplify-and-forward multihop transmission) are deployed in mitigating various performance impairing effects such as fading and shadowing in digital radio communication links. However, the exact analysis of ergodic capacity is in general not always possible for all of these forms of diversity receivers and transmission techniques over generalized composite fading environments due to it's mathematical intractability. In the literature, published papers concerning the exact analysis of ergodic capacity have been therefore scarce (i.e., only [1] and [2]) when compared to those concerning the exact analysis of average symbol error probability. In addition, they are essentially targeting to the ergodic capacity of the maximal ratio combining diversity receivers and are not readily applicable to the capacity analysis of the other diversity combiners / transmission techniques. In this paper, we propose a novel moment generating function-based approach for the exact ergodic capacity analysis of both diversity receivers and transmission techniques over generalized composite fading environments. As such, we demonstrate how to simultaneously treat the ergodic capacity analysis of all forms of both diversity receivers and multihop transmission techniques. △ Less

Submitted 7 July, 2012; originally announced July 2012.

Comments: 7 pages, no figure, published in IEEE International Conference on Communications (ICC 2012), Ottawa, Canada, 10th-15th June, 2012

arXiv:1206.2292 [pdf, ps, other]

An Intercell Interference Model based on Scheduling for Future Generation Wireless Networks (Part 1 and Part 2)

Authors: Hina Tabassum, Ferkan Yilmaz, Zaher Dawy, Mohamed-Slim Alouini

Abstract: This technical report is divided into two parts. The first part of the technical report presents a novel framework for modeling the uplink and downlink intercell interference (ICI) in a multiuser cellular network. The proposed framework assists in quantifying the impact of various fading channel models and multiuser scheduling schemes on the uplink and downlink ICI. Firstly, we derive a semi-analy… ▽ More This technical report is divided into two parts. The first part of the technical report presents a novel framework for modeling the uplink and downlink intercell interference (ICI) in a multiuser cellular network. The proposed framework assists in quantifying the impact of various fading channel models and multiuser scheduling schemes on the uplink and downlink ICI. Firstly, we derive a semi-analytical expression for the distribution of the location of the scheduled user in a given cell considering a wide range of scheduling schemes. Based on this, we derive the distribution and moment generating function (MGF) of the ICI considering a single interfering cell. Consequently, we determine the MGF of the cumulative ICI observed from all interfering cells and derive explicit MGF expressions for three typical fading models. Finally, we utilize the obtained expressions to evaluate important network performance metrics such as the outage probability, ergodic capacity and average fairness numerically. Monte-Carlo simulation results are provided to demonstrate the efficacy of the derived analytical expressions {\bf The first part of the technical report is currently submitted to IEEE Transactions on Wireless Communications}. The second part of the technical report deals with the statistical modeling of uplink inter-cell interference (ICI) considering greedy scheduling with power adaptation based on channel conditions. The derived model is utilized to evaluate important network performance metrics such as ergodic capacity, average fairness and average power preservation numerically. In parallel to the literature, we have shown that greedy scheduling with power adaptation reduces the ICI, average power consumption of users, and enhances the average fairness among users, compared to the case without power adaptation. △ Less

Submitted 13 June, 2012; v1 submitted 11 June, 2012; originally announced June 2012.

arXiv:1206.0399 [pdf, ps, other]

On the Computation of the Higher-Order Statistics of the Channel Capacity for Amplify-and-Forward Multihop Transmission

Authors: Ferkan Yilmaz, Hina Tabassum, Mohamed-Slim Alouini

Abstract: Higher-order statistics (HOS) of the channel capacity provide useful information regarding the level of reliability of the signal transmission at a particular rate. We propose in this letter a novel and unified analysis, which is based on the moment-generating function (MGF) approach, to efficiently and accurately compute the HOS of the channel capacity for amplify-and-forward multihop transmissio… ▽ More Higher-order statistics (HOS) of the channel capacity provide useful information regarding the level of reliability of the signal transmission at a particular rate. We propose in this letter a novel and unified analysis, which is based on the moment-generating function (MGF) approach, to efficiently and accurately compute the HOS of the channel capacity for amplify-and-forward multihop transmission over generalized fading channels. More precisely, our mathematical formulism is easy-to-use and tractable specifically requiring only the reciprocal MGFs of the instantaneous signal-to-noise ratio distributions of the transmission hops. Numerical and simulation results, performed to exemplify the usefulness of the proposed MGF-based analysis, are shown to be in perfect agreement. △ Less

Submitted 20 August, 2012; v1 submitted 2 June, 2012; originally announced June 2012.

Comments: Two Figures, one table, ad submitted to a possible publication

arXiv:1204.3719 [pdf, ps, other]

On the Computation of the Higher Order Statistics of the Channel Capacity over Generalized Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: The higher-order statistics (HOS) of the channel capacity $μ_n=\mathbb{E}[\log^n(1+γ_{end})]$, where $n\in\mathbb{N}$ denotes the order of the statistics, has received relatively little attention in the literature, due in part to the intractability of its analysis. In this letter, we propose a novel and unified analysis, which is based on the moment generating function (MGF) technique, to exactly… ▽ More The higher-order statistics (HOS) of the channel capacity $μ_n=\mathbb{E}[\log^n(1+γ_{end})]$, where $n\in\mathbb{N}$ denotes the order of the statistics, has received relatively little attention in the literature, due in part to the intractability of its analysis. In this letter, we propose a novel and unified analysis, which is based on the moment generating function (MGF) technique, to exactly compute the HOS of the channel capacity. More precisely, our mathematical formalism can be readily applied to maximal-ratio-combining (MRC) receivers operating in generalized fading environments (i.e., the sum of the correlated noncentral chi-squared distributions / the correlated generalized Rician distributions). The mathematical formalism is illustrated by some numerical examples focussing on the correlated generalized fading environments. △ Less

Submitted 27 July, 2012; v1 submitted 17 April, 2012; originally announced April 2012.

Comments: Submitted to IEEE Wireless Communications Letter, February 18, 2012

arXiv:1202.3910 [pdf, ps, other]

Performance of Amplify-and-Forward Multihop Transmission over Relay Clusters with Different Routing Strategies

Authors: Ferkan Yilmaz, Fahd Ahmed Khan, Mohamed-Slim Alouini

Abstract: We Consider a multihop relay network in which two terminals are communicating with each other via a number of cluster of relays. Performance of such networks depends on the routing protocols employed. In this paper, we find the expressions for the average symbol error probability (ASEP) performance of amplify-and-forward (AF) multihop transmission for the simplest routing protocol in which the rel… ▽ More We Consider a multihop relay network in which two terminals are communicating with each other via a number of cluster of relays. Performance of such networks depends on the routing protocols employed. In this paper, we find the expressions for the average symbol error probability (ASEP) performance of amplify-and-forward (AF) multihop transmission for the simplest routing protocol in which the relay transmits using the channel having the best symbol to noise ratio (SNR). The ASEP performance of a better protocol proposed in [1] known as the adhoc protocol is also analyzed. The derived expressions for the performance are a convenient tool to analyze the performance of AF multihop transmission over relay clusters. Monte-Carlo simulations verify the correctness of the proposed formulation and are in agreement with analytical results. Furthermore, we propose new generalized protocols termed as last-n-hop selection protocol, the dual path protocol, the forward- backward last-n-hop selection protocol, and the forward-backward dual path protocol, to get improved ASEP performances. The ASEP performance of these proposed schemes is analysed by computer simulations. It is shown that close to optimal performance can be achieved by using the last-n-hop selection protocol and its forward-backward variant. The complexity of the protocols is also studied. △ Less

Submitted 17 February, 2012; originally announced February 2012.

Comments: To appear in IJAACS

arXiv:1202.2576 [pdf, ps, other]

New Results on the Sum of Gamma Random Variates With Application to the Performance of Wireless Communication Systems over Nakagami-m Fading Channels

Authors: Imran Shafique Ansari, Ferkan Yilmaz, Mohamed-Slim Alouini, Oğuz Kucur

Abstract: The probability density function (PDF) and cumulative distribution function of the sum of L independent but not necessarily identically distributed Gamma variates, applicable to the output statistics of maximal ratio combining (MRC) receiver operating over Nakagami-m fading channels or in other words to the statistical analysis of the scenario where the sum of squared Nakagami-m distributions are… ▽ More The probability density function (PDF) and cumulative distribution function of the sum of L independent but not necessarily identically distributed Gamma variates, applicable to the output statistics of maximal ratio combining (MRC) receiver operating over Nakagami-m fading channels or in other words to the statistical analysis of the scenario where the sum of squared Nakagami-m distributions are user-of-interest, is presented in closed-form in terms of well-known Meijer's G function and easily computable Fox's H-bar function for integer valued and non-integer valued m fading parameters. Further analysis, particularly on bit error rate via a PDF-based approach is also offered in closed form in terms of Meijer's G function and Fox's H-bar function for integer valued fading parameters, and extended Fox's H-bar function (H-hat) for non-integer valued fading parameters. Our proposed results complement previous known results that are either expressed in terms of infinite sums, nested sums, or higher order derivatives of the fading parameter m. △ Less

Submitted 18 July, 2012; v1 submitted 12 February, 2012; originally announced February 2012.

Comments: 5 figures, 4 tables, Accepted in SPAWC 2012

arXiv:1201.1278 [pdf, ps, other]

Novel Relations between the Ergodic Capacity and the Average Bit Error Rate

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: Ergodic capacity and average bit error rate have been widely used to compare the performance of different wireless communication systems. As such recent scientific research and studies revealed strong impact of designing and implementing wireless technologies based on these two performance indicators. However and to the best of our knowledge, the direct links between these two performance indicato… ▽ More Ergodic capacity and average bit error rate have been widely used to compare the performance of different wireless communication systems. As such recent scientific research and studies revealed strong impact of designing and implementing wireless technologies based on these two performance indicators. However and to the best of our knowledge, the direct links between these two performance indicators have not been explicitly proposed in the literature so far. In this paper, we propose novel relations between the ergodic capacity and the average bit error rate of an overall communication system using binary modulation schemes for signaling with a limited bandwidth and operating over generalized fading channels. More specifically, we show that these two performance measures can be represented in terms of each other, without the need to know the exact end-to-end statistical characterization of the communication channel. We validate the correctness and accuracy of our newly proposed relations and illustrated their usefulness by considering some classical examples. △ Less

Submitted 5 January, 2012; originally announced January 2012.

Comments: This work has been presented by Ferkan Yilmaz in IEEE International Symposium on Wireless Communication Systems (ISWCS 2011), Aachen, Germany, 6th-9th November, 2011. (Including 6 pages, 2 figures)

arXiv:1109.6510 [pdf, ps, other]

Exact Performance Analysis of Partial Relay Selection Based on Shadowing Side Information over Generalized Composite Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: Relay technology has recently gained great interest in millimeter wave (60 GHz or above) radio frequencies as a promising transmission technique improving the quality of service, providing high data rate, and extending the coverage area without additional transmit power in deeply shadowed fading environments. The performance of relay-based systems considerably depends on which relay selection prot… ▽ More Relay technology has recently gained great interest in millimeter wave (60 GHz or above) radio frequencies as a promising transmission technique improving the quality of service, providing high data rate, and extending the coverage area without additional transmit power in deeply shadowed fading environments. The performance of relay-based systems considerably depends on which relay selection protocols (RSPs) are used. These RSPs are typically using the channel side information (CSI). Specifically, the relay terminal (RT) is chosen among all available RTs by a central entity (CE) which receives all RTs' CSI via feedback channels. However, in the millimeter wave radio frequencies, the rate of the CSI variation is much higher than that of the CSI variation in 6 GHz frequencies under the same mobility conditions, which evidently results in a serious problem causing that the CSI at the CE is inaccurate for the RSP since the feedback channels have a backhaul / transmission delay. However and fortunately, the shadowing side information (SSI) varies very slowly in comparison to the rate of the CSI variation. In this context, we propose in this paper a partial-RSP in dual-hop amplify-and-forward relaying system, which utilize only the SSI of the RTs instead of their CSI. Then for the performance analysis, we obtain an exact average unified performance (AUP) of the proposed SSI-based partial-RSP for a variety shadowed fading environments. In particular, we offer a generic AUP expression whose special cases include the average bit error probability (ABEP) analysis for binary modulation schemes, the ergodic capacity analysis and the moments-generating function (MGF)-based characterization. The correctness of our newly theoretical results is validated with some selected numerical examples in an extended generalized-K fading environment. △ Less

Submitted 9 July, 2012; v1 submitted 29 September, 2011; originally announced September 2011.

Comments: Number of Figures: 5, Number of Tables: 1, Keywords: Partial relay selection, unified performance expression, average bit error probability, ergodic capacity, moments-generating function, shadowing side information, and extended generalized-K fading

arXiv:1101.5317 [pdf, ps, other]

A Novel Unified Expression for the Capacity and Bit Error Probability of Wireless Communication Systems over Generalized Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: Analysis of the average binary error probabilities (ABEP) and average capacity (AC) of wireless communications systems over generalized fading channels have been considered separately in the past. This paper introduces a novel moment generating function (MGF)-based \emph{unified expression} for the ABEP and AC of single and multiple link communication with maximal ratio combining. In addition, thi… ▽ More Analysis of the average binary error probabilities (ABEP) and average capacity (AC) of wireless communications systems over generalized fading channels have been considered separately in the past. This paper introduces a novel moment generating function (MGF)-based \emph{unified expression} for the ABEP and AC of single and multiple link communication with maximal ratio combining. In addition, this paper proposes the hyper-Fox's H fading model as a unified fading distribution of a majority of the well-known generalized fading models. As such, we offer a generic unified performance expression that can be easily calculated and that is applicable to a wide variety of fading scenarios. The mathematical formalism is illustrated with some selected numerical examples that validate the correctness of our newly derived results. △ Less

Submitted 26 January, 2011; originally announced January 2011.

Comments: In this paper (including 5 Tables and 6 Figures), we presented a unified performance expression combining the ABEP and AC of wireless communication systems over generalized fading channels. In addition, the hyper-Fox's H fading model is proposed as a unified fading distribution for a majority of the well-known generalized fading models

arXiv:1012.3788 [pdf, ps, other]

doi 10.1109/TCOMM.2011.063011.100303A

A New Formula for the BER of Binary Modulations with Dual-Branch Selection over Generalized-K Composite Fading Channels

Authors: Imran Shafique Ansari, Saad Al-Ahmadi, Ferkan Yilmaz, Mohamed-Slim Alouini, Halim Yanikomeroglu

Abstract: Error performance is one of the main performance measures and derivation of its closed-form expression has proved to be quite involved for certain systems. In this letter, a unified closed-form expression, applicable to different binary modulation schemes, for the bit error rate of dual-branch selection diversity based systems undergoing independent but not necessarily identically distributed gene… ▽ More Error performance is one of the main performance measures and derivation of its closed-form expression has proved to be quite involved for certain systems. In this letter, a unified closed-form expression, applicable to different binary modulation schemes, for the bit error rate of dual-branch selection diversity based systems undergoing independent but not necessarily identically distributed generalized-K fading is derived in terms of the extended generalized bivariate Meijer G-function. △ Less

Submitted 16 December, 2010; originally announced December 2010.

Comments: Diversity schemes, selection combining, dual-branch selection diversity, binary modulation schemes, generalized-K (GK) model, composite fading, bit error rate (BER), and Meijer G-function distribution

arXiv:1012.2598 [pdf, ps, other]

Extended Generalized-K (EGK): A New Simple and General Model for Composite Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: In this paper, we introduce a generalized composite fading distribution (termed extended generalized-K (EGK)) to model the envelope and the power of the received signal in millimeter wave (60 GHz or above) and free-space optical channels. We obtain the first and the second-order statistics of the received signal envelope characterized by the EGK composite fading distribution. In particular, expres… ▽ More In this paper, we introduce a generalized composite fading distribution (termed extended generalized-K (EGK)) to model the envelope and the power of the received signal in millimeter wave (60 GHz or above) and free-space optical channels. We obtain the first and the second-order statistics of the received signal envelope characterized by the EGK composite fading distribution. In particular, expressions for probability density function, cumulative distribution function, level crossing rate and average fade duration, and fractional moments are derived. In addition performance measures such as amount of fading, average bit error probability, outage probability, average capacity, and outage capacity are offered in closed-form. Selected numerical and computer simulation examples validate the accuracy of the presented mathematical analysis. △ Less

Submitted 12 December, 2010; originally announced December 2010.

Comments: Composite fading distribution, generalized-K distribution, probability density function, cumulative distribution function, fractional moments, level crossing rate, amount of fade duration, moments, amount of fading, average bit error probability, average capacity

arXiv:1012.2596 [pdf, ps, other]

A Unified MGF-Based Capacity Analysis of Diversity Combiners over Generalized Fading Channels

Authors: Ferkan Yilmaz, Mohamed-Slim Alouini

Abstract: Unified exact average capacity results for L-branch coherent diversity receivers including equal-gain combining (EGC) and maximal-ratio combining (MRC) are not known. This paper develops a novel generic framework for the capacity analysis of $L$-branch EGC/MRC over generalized fading channels. The framework is used to derive new results for the Gamma shadowed generalized Nakagami-m fading model wh… ▽ More Unified exact average capacity results for L-branch coherent diversity receivers including equal-gain combining (EGC) and maximal-ratio combining (MRC) are not known. This paper develops a novel generic framework for the capacity analysis of $L$-branch EGC/MRC over generalized fading channels. The framework is used to derive new results for the Gamma shadowed generalized Nakagami-m fading model which can be a suitable model for the fading environments encountered by high frequency (60 GHz and above) communications. The mathematical formalism is illustrated with some selected numerical and simulation results confirming the correctness of our newly proposed framework. △ Less

Submitted 12 December, 2010; originally announced December 2010.

Showing 1–42 of 42 results for author: Yilmaz, F