-
Advancing Experimental Platforms for UAV Communications: Insights from AERPAW'S Digital Twin
Authors:
Joshua Moore,
Aly Sabri Abdalla,
Charles Ueltschey,
Anıl Gürses,
Özgür Özdemir,
Mihail L. Sichitiu,
İsmail Güvenç,
Vuk Marojevic
Abstract:
The rapid evolution of 5G and beyond has advanced space-air-terrestrial networks, with unmanned aerial vehicles (UAVs) offering enhanced coverage, flexible configurations, and cost efficiency. However, deploying UAV-based systems presents challenges including varying propagation conditions and hardware limitations. While simulators and theoretical models have been developed, real-world experimenta…
▽ More
The rapid evolution of 5G and beyond has advanced space-air-terrestrial networks, with unmanned aerial vehicles (UAVs) offering enhanced coverage, flexible configurations, and cost efficiency. However, deploying UAV-based systems presents challenges including varying propagation conditions and hardware limitations. While simulators and theoretical models have been developed, real-world experimentation is critically important to validate the research. Digital twins, virtual replicas of physical systems, enable emulation that bridge theory and practice. This paper presents our experimental results from AERPAW's digital twin, showcasing its ability to simulate UAV communication scenarios and providing insights into system performance and reliability.
△ Less
Submitted 12 October, 2024;
originally announced October 2024.
-
A Survey of Anomaly Detection in In-Vehicle Networks
Authors:
Övgü Özdemir,
M. Tuğberk İşyapar,
Pınar Karagöz,
Klaus Werner Schmidt,
Demet Demir,
N. Alpay Karagöz
Abstract:
Modern vehicles are equipped with Electronic Control Units (ECU) that are used for controlling important vehicle functions including safety-critical operations. ECUs exchange information via in-vehicle communication buses, of which the Controller Area Network (CAN bus) is by far the most widespread representative. Problems that may occur in the vehicle's physical parts or malicious attacks may cau…
▽ More
Modern vehicles are equipped with Electronic Control Units (ECU) that are used for controlling important vehicle functions including safety-critical operations. ECUs exchange information via in-vehicle communication buses, of which the Controller Area Network (CAN bus) is by far the most widespread representative. Problems that may occur in the vehicle's physical parts or malicious attacks may cause anomalies in the CAN traffic, impairing the correct vehicle operation. Therefore, the detection of such anomalies is vital for vehicle safety. This paper reviews the research on anomaly detection for in-vehicle networks, more specifically for the CAN bus. Our main focus is the evaluation of methods used for CAN bus anomaly detection together with the datasets used in such analysis. To provide the reader with a more comprehensive understanding of the subject, we first give a brief review of related studies on time series-based anomaly detection. Then, we conduct an extensive survey of recent deep learning-based techniques as well as conventional techniques for CAN bus anomaly detection. Our comprehensive analysis delves into anomaly detection algorithms employed in in-vehicle networks, specifically focusing on their learning paradigms, inherent strengths, and weaknesses, as well as their efficacy when applied to CAN bus datasets. Lastly, we highlight challenges and open research problems in CAN bus anomaly detection.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
A UAV-assisted Wireless Localization Challenge on AERPAW
Authors:
Paul Kudyba,
Jaya Sravani Mandapaka,
Weijie Wang,
Logan McCorkendale,
Zachary McCorkendale,
Mathias Kidane,
Haijian Sun,
Eric Adams,
Kamesh Namuduri,
Fraida Fund,
Mihail Sichitiu,
Ozgur Ozdemir
Abstract:
As wireless researchers are tasked to enable wireless communication as infrastructure in more dynamic aerial settings, there is a growing need for large-scale experimental platforms that provide realistic, reproducible, and reliable experimental validation. To bridge the research-to-implementation gap, the Aerial Experimentation and Research Platform for Advanced Wireless (AERPAW) offers open-sour…
▽ More
As wireless researchers are tasked to enable wireless communication as infrastructure in more dynamic aerial settings, there is a growing need for large-scale experimental platforms that provide realistic, reproducible, and reliable experimental validation. To bridge the research-to-implementation gap, the Aerial Experimentation and Research Platform for Advanced Wireless (AERPAW) offers open-source tools, reference experiments, and hardware to facilitate and evaluate the development of wireless research in controlled digital twin environments and live testbed flights. The inaugural AERPAW Challenge, "Find a Rover," was issued to spark collaborative efforts and test the platform's capabilities. The task involved localizing a narrowband wireless signal, with teams given ten minutes to find the "rover" within a twenty-acre area. By engaging in this exercise, researchers can validate the platform's value as a tool for innovation in wireless communications research within aerial robotics. This paper recounts the methods and experiences of the top three teams in automating and rapidly locating a wireless signal by automating and controlling an aerial drone in a realistic testbed scenario.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Authors:
Övgü Özdemir,
Erdem Akagündüz
Abstract:
Visual question answering (VQA) is known as an AI-complete task as it requires understanding, reasoning, and inferring about the vision and the language content. Over the past few years, numerous neural architectures have been suggested for the VQA problem. However, achieving success in zero-shot VQA remains a challenge due to its requirement for advanced generalization and reasoning skills. This…
▽ More
Visual question answering (VQA) is known as an AI-complete task as it requires understanding, reasoning, and inferring about the vision and the language content. Over the past few years, numerous neural architectures have been suggested for the VQA problem. However, achieving success in zero-shot VQA remains a challenge due to its requirement for advanced generalization and reasoning skills. This study explores the impact of incorporating image captioning as an intermediary process within the VQA pipeline. Specifically, we explore the efficacy of utilizing image captions instead of images and leveraging large language models (LLMs) to establish a zero-shot setting. Since image captioning is the most crucial step in this process, we compare the impact of state-of-the-art image captioning models on VQA performance across various question types in terms of structure and semantics. We propose a straightforward and efficient question-driven image captioning approach within this pipeline to transfer contextual information into the question-answering (QA) model. This method involves extracting keywords from the question, generating a caption for each image-question pair using the keywords, and incorporating the question-driven caption into the LLM prompt. We evaluate the efficacy of using general-purpose and question-driven image captions in the VQA pipeline. Our study highlights the potential of employing image captions and harnessing the capabilities of LLMs to achieve competitive performance on GQA under the zero-shot setting. Our code is available at \url{https://github.com/ovguyo/captions-in-VQA}.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Digital Twins and Testbeds for Supporting AI Research with Autonomous Vehicle Networks
Authors:
Anıl Gürses,
Gautham Reddy,
Saad Masrur,
Özgür Özdemir,
İsmail Güvenç,
Mihail L. Sichitiu,
Alphan Şahin,
Ahmed Alkhateeb,
Magreth Mushi,
Rudra Dutta
Abstract:
Digital twins (DTs), which are virtual environments that simulate, predict, and optimize the performance of their physical counterparts, hold great promise in revolutionizing next-generation wireless networks. While DTs have been extensively studied for wireless networks, their use in conjunction with autonomous vehicles featuring programmable mobility remains relatively under-explored. In this pa…
▽ More
Digital twins (DTs), which are virtual environments that simulate, predict, and optimize the performance of their physical counterparts, hold great promise in revolutionizing next-generation wireless networks. While DTs have been extensively studied for wireless networks, their use in conjunction with autonomous vehicles featuring programmable mobility remains relatively under-explored. In this paper, we study DTs used as a development environment to design, deploy, and test artificial intelligence (AI) techniques that utilize real-world (RW) observations, e.g. radio key performance indicators, for vehicle trajectory and network optimization decisions in autonomous vehicle networks (AVN). We first compare and contrast the use of simulation, digital twin (software in the loop (SITL)), sandbox (hardware-in-the-loop (HITL)), and physical testbed (PT) environments for their suitability in developing and testing AI algorithms for AVNs. We then review various representative use cases of DTs for AVN scenarios. Finally, we provide an example from the NSF AERPAW platform where a DT is used to develop and test AI-aided solutions for autonomous unmanned aerial vehicles for localizing a signal source based solely on link quality measurements. Our results in the physical testbed show that SITL DTs, when supplemented with data from RW measurements and simulations, can serve as an ideal environment for developing and testing innovative AI solutions for AVNs.
△ Less
Submitted 8 August, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets
Authors:
Ahmet Alp Kindiroglu,
Ozgur Kara,
Ogulcan Ozdemir,
Lale Akarun
Abstract:
Sign language recognition (SLR) has recently achieved a breakthrough in performance thanks to deep neural networks trained on large annotated sign datasets. Of the many different sign languages, these annotated datasets are only available for a select few. Since acquiring gloss-level labels on sign language videos is difficult, learning by transferring knowledge from existing annotated sources is…
▽ More
Sign language recognition (SLR) has recently achieved a breakthrough in performance thanks to deep neural networks trained on large annotated sign datasets. Of the many different sign languages, these annotated datasets are only available for a select few. Since acquiring gloss-level labels on sign language videos is difficult, learning by transferring knowledge from existing annotated sources is useful for recognition in under-resourced sign languages. This study provides a publicly available cross-dataset transfer learning benchmark from two existing public Turkish SLR datasets. We use a temporal graph convolution-based sign language recognition approach to evaluate five supervised transfer learning approaches and experiment with closed-set and partial-set cross-dataset transfer learning. Experiments demonstrate that improvement over finetuning based transfer learning is possible with specialized supervised transfer learning methods.
△ Less
Submitted 15 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Open RAN Testbeds with Controlled Air Mobility
Authors:
Magreth Mushi,
Yuchen Liu,
Shreyas Sreenivasa,
Ozgur Ozdemir,
Ismail Guvenc,
Mihail Sichitiu,
Rudra Dutta,
Russ Gyurek
Abstract:
With its promise of increasing softwarization, improving disaggregability, and creating an open-source based ecosystem in the area of Radio Access Networks, the idea of Open RAN has generated rising interest in the community. Even as the community races to provide and verify complete Open RAN systems, the importance of verification of systems based on Open RAN under real-world conditions has becom…
▽ More
With its promise of increasing softwarization, improving disaggregability, and creating an open-source based ecosystem in the area of Radio Access Networks, the idea of Open RAN has generated rising interest in the community. Even as the community races to provide and verify complete Open RAN systems, the importance of verification of systems based on Open RAN under real-world conditions has become clear, and testbed facilities for general use have been envisioned, in addition to private testing facilities. Aerial robots, including autonomous ones, are among the increasingly important and interesting clients of RAN systems, but also present a challenge for testbeds. Based on our experience in architecting and operating an advanced wireless testbed with aerial robots as a primary citizen, we present considerations relevant to the design of Open RAN testbeds, with particular attention to making such a testbed capable of controlled experimentation with aerial clients. We also present representative results from the NSF AERPAW testbed on Open RAN slicing, programmable vehicles, and programmable radios.
△ Less
Submitted 26 January, 2023;
originally announced January 2023.
-
Learning Bidirectional Action-Language Translation with Limited Supervision and Incongruent Input
Authors:
Ozan Özdemir,
Matthias Kerzel,
Cornelius Weber,
Jae Hee Lee,
Muhammad Burhan Hafez,
Patrick Bruns,
Stefan Wermter
Abstract:
Human infant learning happens during exploration of the environment, by interaction with objects, and by listening to and repeating utterances casually, which is analogous to unsupervised learning. Only occasionally, a learning infant would receive a matching verbal description of an action it is committing, which is similar to supervised learning. Such a learning mechanism can be mimicked with de…
▽ More
Human infant learning happens during exploration of the environment, by interaction with objects, and by listening to and repeating utterances casually, which is analogous to unsupervised learning. Only occasionally, a learning infant would receive a matching verbal description of an action it is committing, which is similar to supervised learning. Such a learning mechanism can be mimicked with deep learning. We model this weakly supervised learning paradigm using our Paired Gated Autoencoders (PGAE) model, which combines an action and a language autoencoder. After observing a performance drop when reducing the proportion of supervised training, we introduce the Paired Transformed Autoencoders (PTAE) model, using Transformer-based crossmodal attention. PTAE achieves significantly higher accuracy in language-to-action and action-to-language translations, particularly in realistic but difficult cases when only few supervised training samples are available. We also test whether the trained model behaves realistically with conflicting multimodal input. In accordance with the concept of incongruence in psychology, conflict deteriorates the model output. Conflicting action input has a more severe impact than conflicting language input, and more conflicting features lead to larger interference. PTAE can be trained on mostly unlabelled data where labeled data is scarce, and it behaves plausibly when tested with incongruent input.
△ Less
Submitted 22 February, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Which is the best model for my data?
Authors:
Gonzalo Nápoles,
Isel Grau,
Çiçek Güven,
Orçun Özdemir,
Yamisleydi Salgueiro
Abstract:
In this paper, we tackle the problem of selecting the optimal model for a given structured pattern classification dataset. In this context, a model can be understood as a classifier and a hyperparameter configuration. The proposed meta-learning approach purely relies on machine learning and involves four major steps. Firstly, we present a concise collection of 62 meta-features that address the pro…
▽ More
In this paper, we tackle the problem of selecting the optimal model for a given structured pattern classification dataset. In this context, a model can be understood as a classifier and a hyperparameter configuration. The proposed meta-learning approach purely relies on machine learning and involves four major steps. Firstly, we present a concise collection of 62 meta-features that address the problem of information cancellation when aggregation measure values involving positive and negative measurements. Secondly, we describe two different approaches for synthetic data generation intending to enlarge the training data. Thirdly, we fit a set of pre-defined classification models for each classification problem while optimizing their hyperparameters using grid search. The goal is to create a meta-dataset such that each row denotes a multilabel instance describing a specific problem. The features of these meta-instances denote the statistical properties of the generated datasets, while the labels encode the grid search results as binary vectors such that best-performing models are positively labeled. Finally, we tackle the model selection problem with several multilabel classifiers, including a Convolutional Neural Network designed to handle tabular data. The simulation results show that our meta-learning approach can correctly predict an optimal model for 91% of the synthetic datasets and for 87% of the real-world datasets. Furthermore, we noticed that most meta-classifiers produced better results when using our meta-features. Overall, our proposal differs from other meta-learning approaches since it tackles the algorithm selection and hyperparameter tuning problems in a single step. Toward the end, we perform a feature importance analysis to determine which statistical features drive the model selection mechanism.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
Learning Flexible Translation between Robot Actions and Language Descriptions
Authors:
Ozan Özdemir,
Matthias Kerzel,
Cornelius Weber,
Jae Hee Lee,
Stefan Wermter
Abstract:
Handling various robot action-language translation tasks flexibly is an essential requirement for natural interaction between a robot and a human. Previous approaches require change in the configuration of the model architecture per task during inference, which undermines the premise of multi-task learning. In this work, we propose the paired gated autoencoders (PGAE) for flexible translation betw…
▽ More
Handling various robot action-language translation tasks flexibly is an essential requirement for natural interaction between a robot and a human. Previous approaches require change in the configuration of the model architecture per task during inference, which undermines the premise of multi-task learning. In this work, we propose the paired gated autoencoders (PGAE) for flexible translation between robot actions and language descriptions in a tabletop object manipulation scenario. We train our model in an end-to-end fashion by pairing each action with appropriate descriptions that contain a signal informing about the translation direction. During inference, our model can flexibly translate from action to language and vice versa according to the given language signal. Moreover, with the option to use a pretrained language model as the language encoder, our model has the potential to recognise unseen natural language input. Another capability of our model is that it can recognise and imitate actions of another agent by utilising robot demonstrations. The experiment results highlight the flexible bidirectional translation capabilities of our approach alongside with the ability to generalise to the actions of the opposite-sitting agent.
△ Less
Submitted 12 September, 2022; v1 submitted 15 July, 2022;
originally announced July 2022.
-
Language Model-Based Paired Variational Autoencoders for Robotic Language Learning
Authors:
Ozan Özdemir,
Matthias Kerzel,
Cornelius Weber,
Jae Hee Lee,
Stefan Wermter
Abstract:
Human infants learn language while interacting with their environment in which their caregivers may describe the objects and actions they perform. Similar to human infants, artificial agents can learn language while interacting with their environment. In this work, first, we present a neural model that bidirectionally binds robot actions and their language descriptions in a simple object manipulat…
▽ More
Human infants learn language while interacting with their environment in which their caregivers may describe the objects and actions they perform. Similar to human infants, artificial agents can learn language while interacting with their environment. In this work, first, we present a neural model that bidirectionally binds robot actions and their language descriptions in a simple object manipulation scenario. Building on our previous Paired Variational Autoencoders (PVAE) model, we demonstrate the superiority of the variational autoencoder over standard autoencoders by experimenting with cubes of different colours, and by enabling the production of alternative vocabularies. Additional experiments show that the model's channel-separated visual feature extraction module can cope with objects of different shapes. Next, we introduce PVAE-BERT, which equips the model with a pretrained large-scale language model, i.e., Bidirectional Encoder Representations from Transformers (BERT), enabling the model to go beyond comprehending only the predefined descriptions that the network has been trained on; the recognition of action descriptions generalises to unconstrained natural language as the model becomes capable of understanding unlimited variations of the same descriptions. Our experiments suggest that using a pretrained language model as the language encoder allows our approach to scale up for real-world scenarios with instructions from human users.
△ Less
Submitted 6 May, 2024; v1 submitted 17 January, 2022;
originally announced January 2022.
-
A Framework for Developing Algorithms for Estimating Propagation Parameters from Measurements
Authors:
Akbar Sayeed,
Peter Vouras,
Camillo Gentile,
Alec Weiss,
Jeanne Quimby,
Zihang Cheng,
Bassel Modad,
Yuning Zhang,
Chethan Anjinappa,
Fatih Erden,
Ozgur Ozdemir,
Robert Muller,
Diego Dupleich,
Han Niu,
6David Michelson,
6Aidan Hughes
Abstract:
A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by sounders at millimeter-wave (mmW) frequencies. To focus on algorithmic performance, an idealized model is proposed for the spatial frequency response of the propagation environment measured by a sounder. The input to the sounder model is a pre-deter…
▽ More
A framework is proposed for developing and evaluating algorithms for extracting multipath propagation components (MPCs) from measurements collected by sounders at millimeter-wave (mmW) frequencies. To focus on algorithmic performance, an idealized model is proposed for the spatial frequency response of the propagation environment measured by a sounder. The input to the sounder model is a pre-determined set of MPC parameters that serve as the "ground truth." A three-dimensional angle-delay (beamspace) representation of the measured spatial frequency response serves as a natural domain for implementing and analyzing MPC extraction algorithms. Metrics for quantifying the error in estimated MPC parameters are introduced. Initial results are presented for a greedy matching pursuit algorithm that performs a least-squares (LS) reconstruction of the MPC path gains within the iterations. The results indicate that the simple greedy-LS algorithm has the ability to extract MPCs over a large dynamic range, and suggest several avenues for further performance improvement through extensions of the greedy-LS algorithm as well as by incorporating features of other algorithms, such as SAGE and RIMAX.
△ Less
Submitted 13 September, 2021;
originally announced September 2021.
-
Score-level Multi Cue Fusion for Sign Language Recognition
Authors:
Çağrı Gökçe,
Oğulcan Özdemir,
Ahmet Alp Kındıroğlu,
Lale Akarun
Abstract:
Sign Languages are expressed through hand and upper body gestures as well as facial expressions. Therefore, Sign Language Recognition (SLR) needs to focus on all such cues. Previous work uses hand-crafted mechanisms or network aggregation to extract the different cue features, to increase SLR performance. This is slow and involves complicated architectures. We propose a more straightforward approa…
▽ More
Sign Languages are expressed through hand and upper body gestures as well as facial expressions. Therefore, Sign Language Recognition (SLR) needs to focus on all such cues. Previous work uses hand-crafted mechanisms or network aggregation to extract the different cue features, to increase SLR performance. This is slow and involves complicated architectures. We propose a more straightforward approach that focuses on training separate cue models specializing on the dominant hand, hands, face, and upper body regions. We compare the performance of 3D Convolutional Neural Network (CNN) models specializing in these regions, combine them through score-level fusion, and use the weighted alternative. Our experimental results have shown the effectiveness of mixed convolutional models. Their fusion yields up to 19% accuracy improvement over the baseline using the full upper body. Furthermore, we include a discussion for fusion settings, which can help future work on Sign Language Translation (SLT).
△ Less
Submitted 29 September, 2020;
originally announced September 2020.
-
Reconfigurable Intelligent Surfaces for the Connectivity of Autonomous Vehicles
Authors:
Y. Ugur Ozcan,
Ozgur Ozdemir,
Gunes Karabulut Kurt
Abstract:
The use of real-time software-controlled reconfigurable intelligent surface (RIS) units is proposed to increase the reliability of vehicle-to-everything (V2X) communications. The optimum placement problem of the RIS units is formulated by considering their sizes and operating modes. The solution of the problem is given, where it is shown that the placement of the RIS depends on the locations of th…
▽ More
The use of real-time software-controlled reconfigurable intelligent surface (RIS) units is proposed to increase the reliability of vehicle-to-everything (V2X) communications. The optimum placement problem of the RIS units is formulated by considering their sizes and operating modes. The solution of the problem is given, where it is shown that the placement of the RIS depends on the locations of the transmitter and the receiver. The proposed RIS-supported highway deployment can combat the high path loss experienced by the use of higher frequency bands, including the millimeter-wave and the terahertz bands, that are expected to be used in the next-generation wireless networks, enabling the use of the existing base station deployment plans to remain operational, while providing reliable and energy-efficient connectivity for autonomous vehicles.
△ Less
Submitted 20 July, 2020;
originally announced July 2020.
-
Software Radios for Unmanned Aerial Systems
Authors:
Keith Powell,
Aly Sabri,
Daniel Brennan,
Vuk Marojevic,
R. Michael Barts,
Ashwin Panicker,
Ozgur Ozdemir,
Ismail Guvenc
Abstract:
As new use cases are emerging for unmanned aerial systems (UAS), advanced wireless communications technologies and systems need to be implemented and widely tested. This requires a flexible platform for development, deployment, testing and demonstration of wireless systems with ground and aerial nodes, enabling effective 3D mobile communications and networking. In this paper, we provide a comparat…
▽ More
As new use cases are emerging for unmanned aerial systems (UAS), advanced wireless communications technologies and systems need to be implemented and widely tested. This requires a flexible platform for development, deployment, testing and demonstration of wireless systems with ground and aerial nodes, enabling effective 3D mobile communications and networking. In this paper, we provide a comparative overview of software-defined radios (SDRs), with a specific focus on SDR hardware and software that can be used for aerial wireless experimentation and research. We discuss SDR hardware requirements, features of available SDR hardware that can be suitable for small UAS, and power measurements carried out with a subset of these SDR hardware. We also present SDR software requirements, available open-source SDR software, and calibration/benchmarking of SDR software. As a case study, we present AERPAW: Aerial Experimentation and Research Platform for Advanced Wireless, and discuss various different experiments that can be supported in that platform using SDRs, for verification/testing of future wireless innovations, protocols, and technologies.
△ Less
Submitted 20 May, 2020; v1 submitted 4 April, 2020;
originally announced April 2020.
-
BosphorusSign22k Sign Language Recognition Dataset
Authors:
Oğulcan Özdemir,
Ahmet Alp Kındıroğlu,
Necati Cihan Camgöz,
Lale Akarun
Abstract:
Sign Language Recognition is a challenging research domain. It has recently seen several advancements with the increased availability of data. In this paper, we introduce the BosphorusSign22k, a publicly available large scale sign language dataset aimed at computer vision, video recognition and deep learning research communities. The primary objective of this dataset is to serve as a new benchmark…
▽ More
Sign Language Recognition is a challenging research domain. It has recently seen several advancements with the increased availability of data. In this paper, we introduce the BosphorusSign22k, a publicly available large scale sign language dataset aimed at computer vision, video recognition and deep learning research communities. The primary objective of this dataset is to serve as a new benchmark in Turkish Sign Language Recognition for its vast lexicon, the high number of repetitions by native signers, high recording quality, and the unique syntactic properties of the signs it encompasses. We also provide state-of-the-art human pose estimates to encourage other tasks such as Sign Language Production. We survey other publicly available datasets and expand on how BosphorusSign22k can contribute to future research that is being made possible through the widespread availability of similar Sign Language resources. We have conducted extensive experiments and present baseline results to underpin future research on our dataset.
△ Less
Submitted 9 April, 2020; v1 submitted 2 April, 2020;
originally announced April 2020.
-
Temporal Accumulative Features for Sign Language Recognition
Authors:
Ahmet Alp Kındıroğlu,
Oğulcan Özdemir,
Lale Akarun
Abstract:
In this paper, we propose a set of features called temporal accumulative features (TAF) for representing and recognizing isolated sign language gestures. By incorporating sign language specific constructs to better represent the unique linguistic characteristic of sign language videos, we have devised an efficient and fast SLR method for recognizing isolated sign language gestures. The proposed me…
▽ More
In this paper, we propose a set of features called temporal accumulative features (TAF) for representing and recognizing isolated sign language gestures. By incorporating sign language specific constructs to better represent the unique linguistic characteristic of sign language videos, we have devised an efficient and fast SLR method for recognizing isolated sign language gestures. The proposed method is an HSV based accumulative video representation where keyframes based on the linguistic movement-hold model are represented by different colors. We also incorporate hand shape information and using a small scale convolutional neural network, demonstrate that sequential modeling of accumulative features for linguistic subunits improves upon baseline classification results.
△ Less
Submitted 2 April, 2020;
originally announced April 2020.
-
Neural Academic Paper Generation
Authors:
Samet Demir,
Uras Mutlu,
Özgur Özdemir
Abstract:
In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is…
▽ More
In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is preparing a dataset with $\LaTeX{}$ source files on recent open-source computer vision papers. Our second contribution is experimenting with recent methods of language modeling and text generation such as Transformer and Transformer-XL to generate consistent $\LaTeX{}$ code. We report cross-entropy and bits-per-character (BPC) results of the trained models, and we also discuss interesting points on some examples of the generated $\LaTeX{}$ code.
△ Less
Submitted 2 December, 2019;
originally announced December 2019.
-
A 3D Probabilistic Deep Learning System for Detection and Diagnosis of Lung Cancer Using Low-Dose CT Scans
Authors:
Onur Ozdemir,
Rebecca L. Russell,
Andrew A. Berlin
Abstract:
We introduce a new computer aided detection and diagnosis system for lung cancer screening with low-dose CT scans that produces meaningful probability assessments. Our system is based entirely on 3D convolutional neural networks and achieves state-of-the-art performance for both lung nodule detection and malignancy classification tasks on the publicly available LUNA16 and Kaggle Data Science Bowl…
▽ More
We introduce a new computer aided detection and diagnosis system for lung cancer screening with low-dose CT scans that produces meaningful probability assessments. Our system is based entirely on 3D convolutional neural networks and achieves state-of-the-art performance for both lung nodule detection and malignancy classification tasks on the publicly available LUNA16 and Kaggle Data Science Bowl challenges. While nodule detection systems are typically designed and optimized on their own, we find that it is important to consider the coupling between detection and diagnosis components. Exploiting this coupling allows us to develop an end-to-end system that has higher and more robust performance and eliminates the need for a nodule detection false positive reduction stage. Furthermore, we characterize model uncertainty in our deep learning systems, a first for lung CT analysis, and show that we can use this to provide well-calibrated classification probabilities for both nodule detection and patient malignancy diagnosis. These calibrated probabilities informed by model uncertainty can be used for subsequent risk-based decision making towards diagnostic interventions or disease treatments, as we demonstrate using a probability-based patient referral strategy to further improve our results.
△ Less
Submitted 20 January, 2020; v1 submitted 8 February, 2019;
originally announced February 2019.
-
Automated Vulnerability Detection in Source Code Using Deep Representation Learning
Authors:
Rebecca L. Russell,
Louis Kim,
Lei H. Hamilton,
Tomo Lazovich,
Jacob A. Harer,
Onur Ozdemir,
Paul M. Ellingwood,
Marc W. McConley
Abstract:
Increasing numbers of software vulnerabilities are discovered every year whether they are reported publicly or discovered internally in proprietary code. These vulnerabilities can pose serious risk of exploit and result in system compromise, information leaks, or denial of service. We leveraged the wealth of C and C++ open-source code available to develop a large-scale function-level vulnerability…
▽ More
Increasing numbers of software vulnerabilities are discovered every year whether they are reported publicly or discovered internally in proprietary code. These vulnerabilities can pose serious risk of exploit and result in system compromise, information leaks, or denial of service. We leveraged the wealth of C and C++ open-source code available to develop a large-scale function-level vulnerability detection system using machine learning. To supplement existing labeled vulnerability datasets, we compiled a vast dataset of millions of open-source functions and labeled it with carefully-selected findings from three different static analyzers that indicate potential exploits. The labeled dataset is available at: https://osf.io/d45bw/. Using these datasets, we developed a fast and scalable vulnerability detection tool based on deep feature representation learning that directly interprets lexed source code. We evaluated our tool on code from both real software packages and the NIST SATE IV benchmark dataset. Our results demonstrate that deep feature representation learning on source code is a promising approach for automated software vulnerability detection.
△ Less
Submitted 27 November, 2018; v1 submitted 11 July, 2018;
originally announced July 2018.
-
Learning to Repair Software Vulnerabilities with Generative Adversarial Networks
Authors:
Jacob Harer,
Onur Ozdemir,
Tomo Lazovich,
Christopher P. Reale,
Rebecca L. Russell,
Louis Y. Kim,
Peter Chin
Abstract:
Motivated by the problem of automated repair of software vulnerabilities, we propose an adversarial learning approach that maps from one discrete source domain to another target domain without requiring paired labeled examples or source and target domains to be bijections. We demonstrate that the proposed adversarial learning approach is an effective technique for repairing software vulnerabilitie…
▽ More
Motivated by the problem of automated repair of software vulnerabilities, we propose an adversarial learning approach that maps from one discrete source domain to another target domain without requiring paired labeled examples or source and target domains to be bijections. We demonstrate that the proposed adversarial learning approach is an effective technique for repairing software vulnerabilities, performing close to seq2seq approaches that require labeled pairs. The proposed Generative Adversarial Network approach is application-agnostic in that it can be applied to other problems similar to code repair, such as grammar correction or sentiment translation.
△ Less
Submitted 28 October, 2018; v1 submitted 18 May, 2018;
originally announced May 2018.
-
The Role-Relevance Model for Enhanced Semantic Targeting in Unstructured Text
Authors:
Christopher A. George,
Onur Ozdemir,
Connie Fournelle,
Kendra E. Moore
Abstract:
Personalized search provides a potentially powerful tool, however, it is limited due to the large number of roles that a person has: parent, employee, consumer, etc. We present the role-relevance algorithm: a search technique that favors search results relevant to the user's current role. The role-relevance algorithm uses three factors to score documents: (1) the number of keywords each document c…
▽ More
Personalized search provides a potentially powerful tool, however, it is limited due to the large number of roles that a person has: parent, employee, consumer, etc. We present the role-relevance algorithm: a search technique that favors search results relevant to the user's current role. The role-relevance algorithm uses three factors to score documents: (1) the number of keywords each document contains; (2) each document's geographic relevance to the user's role (if applicable); and (3) each document's topical relevance to the user's role (if applicable). Topical relevance is assessed using a novel extension to Latent Dirichlet Allocation (LDA) that allows standard LDA to score document relevance to user-defined topics. Overall results on a pre-labeled corpus show an average improvement in search precision of approximately 20% compared to keyword search alone.
△ Less
Submitted 29 April, 2018; v1 submitted 20 April, 2018;
originally announced April 2018.
-
Coverage Enhancement for mmWave Communications using Passive Reflectors
Authors:
Wahab Khawaja,
Ozgur Ozdemir,
Yavuz Yapici,
Ismail Guvenc,
Yuichi Kakishima
Abstract:
Millimeter wave (mmWave) technology is expected to dominate the future 5G networks mainly due to large spectrum available at these frequencies. However, coverage deteriorates significantly at mmWave frequencies due to higher path loss, especially for the non-line-of-sight (NLOS) scenarios. In this work, we explore the use of passive reflectors for improving mmWave signal coverage in NLOS indoor ar…
▽ More
Millimeter wave (mmWave) technology is expected to dominate the future 5G networks mainly due to large spectrum available at these frequencies. However, coverage deteriorates significantly at mmWave frequencies due to higher path loss, especially for the non-line-of-sight (NLOS) scenarios. In this work, we explore the use of passive reflectors for improving mmWave signal coverage in NLOS indoor areas. Measurements are carried out using the PXI-based mmWave transceiver platforms from National Instruments operating at 28 GHz, and the results are compared with the outcomes of ray tracing (RT) simulations in a similar environment. For both the measurements and RT simulations, different shapes of metallic passive reflectors are used to observe the coverage (signal strength) statistics on a receiver grid in an NLOS area. For a square metallic sheet reflector of size 24 by 24 in and 33 by 33 in , we observe a significant increase in the received power in the NLOS region, with a median gain of 20 dB when compared to no reflector case. The cylindrical reflector shows more uniform coverage on the receiver grid as compared to flat reflectors that are more directional.
△ Less
Submitted 25 May, 2018; v1 submitted 22 March, 2018;
originally announced March 2018.
-
Automated software vulnerability detection with machine learning
Authors:
Jacob A. Harer,
Louis Y. Kim,
Rebecca L. Russell,
Onur Ozdemir,
Leonard R. Kosta,
Akshay Rangamani,
Lei H. Hamilton,
Gabriel I. Centeno,
Jonathan R. Key,
Paul M. Ellingwood,
Erik Antelman,
Alan Mackay,
Marc W. McConley,
Jeffrey M. Opper,
Peter Chin,
Tomo Lazovich
Abstract:
Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, the…
▽ More
Thousands of security vulnerabilities are discovered in production software each year, either reported publicly to the Common Vulnerabilities and Exposures database or discovered internally in proprietary code. Vulnerabilities often manifest themselves in subtle ways that are not obvious to code reviewers or the developers themselves. With the wealth of open source code available for analysis, there is an opportunity to learn the patterns of bugs that can lead to security vulnerabilities directly from data. In this paper, we present a data-driven approach to vulnerability detection using machine learning, specifically applied to C and C++ programs. We first compile a large dataset of hundreds of thousands of open-source functions labeled with the outputs of a static analyzer. We then compare methods applied directly to source code with methods applied to artifacts extracted from the build process, finding that source-based models perform better. We also compare the application of deep neural network models with more traditional models such as random forests and find the best performance comes from combining features learned by deep models with tree-based models. Ultimately, our highest performing model achieves an area under the precision-recall curve of 0.49 and an area under the ROC curve of 0.87.
△ Less
Submitted 2 August, 2018; v1 submitted 14 February, 2018;
originally announced March 2018.
-
Propagating Uncertainty in Multi-Stage Bayesian Convolutional Neural Networks with Application to Pulmonary Nodule Detection
Authors:
Onur Ozdemir,
Benjamin Woodward,
Andrew A. Berlin
Abstract:
Motivated by the problem of computer-aided detection (CAD) of pulmonary nodules, we introduce methods to propagate and fuse uncertainty information in a multi-stage Bayesian convolutional neural network (CNN) architecture. The question we seek to answer is "can we take advantage of the model uncertainty provided by one deep learning model to improve the performance of the subsequent deep learning…
▽ More
Motivated by the problem of computer-aided detection (CAD) of pulmonary nodules, we introduce methods to propagate and fuse uncertainty information in a multi-stage Bayesian convolutional neural network (CNN) architecture. The question we seek to answer is "can we take advantage of the model uncertainty provided by one deep learning model to improve the performance of the subsequent deep learning models and ultimately of the overall performance in a multi-stage Bayesian deep learning architecture?". Our experiments show that propagating uncertainty through the pipeline enables us to improve the overall performance in terms of both final prediction accuracy and model confidence.
△ Less
Submitted 1 December, 2017;
originally announced December 2017.
-
Sparsity-Aware Joint Frame Synchronization and Channel Estimation: Algorithm and USRP Implementation
Authors:
Ozgur Ozdemir,
Ridha Hamila,
Naofal Al-Dhahir,
Ismail Guvenc
Abstract:
Conventional correlation-based frame synchronization techniques can suffer significant performance degradation over multi-path frequency-selective channels. As a remedy, in this paper we consider joint frame synchronization and channel estimation. This, however, increases the length of the resulting combined channel and its estimation becomes more challenging. On the other hand, since the combined…
▽ More
Conventional correlation-based frame synchronization techniques can suffer significant performance degradation over multi-path frequency-selective channels. As a remedy, in this paper we consider joint frame synchronization and channel estimation. This, however, increases the length of the resulting combined channel and its estimation becomes more challenging. On the other hand, since the combined channel is a sparse vector, sparse channel estimation methods can be applied. We propose a joint frame synchronization and channel estimation method using the orthogonal matching pursuit (OMP) algorithm which exploits the sparsity of the combined channel vector. Subsequently, the channel estimate is used to design the equalizer. Our simulation results and experimental outcomes using software defined radios show that the proposed approach improves the overall system performance in terms of the mean square error (MSE) between the transmitted and the equalized symbols compared to the conventional method.
△ Less
Submitted 5 September, 2017;
originally announced September 2017.
-
UAV Air-to-Ground Channel Characterization for mmWave Systems
Authors:
Wahab Khawaja,
Ozgur Ozdemir,
Ismail Guvenc
Abstract:
Communication at mmWave bands carries critical importance for 5G wireless networks. In this paper, we study the characterization of mmWave air-to-ground (AG) channels for unmanned aerial vehicle (UAV) communications. In particular, we use ray tracing simulations using Remcom Wireless InSite software to study the behavior of AG mmWave bands at two different frequencies: 28~GHz and 60~GHz. Received…
▽ More
Communication at mmWave bands carries critical importance for 5G wireless networks. In this paper, we study the characterization of mmWave air-to-ground (AG) channels for unmanned aerial vehicle (UAV) communications. In particular, we use ray tracing simulations using Remcom Wireless InSite software to study the behavior of AG mmWave bands at two different frequencies: 28~GHz and 60~GHz. Received signal strength (RSS) and root mean square delay spread (RMS-DS) of multipath components (MPCs) are analyzed for different UAV heights considering four different environments: urban, suburban, rural, and over sea. It is observed that the RSS mostly follows the two ray propagation model along the UAV flight path for higher altitudes. This two ray propagation model is affected by the presence of high rise scatterers in urban scenario. Moreover, we present details of a universal serial radio peripheral (USRP) based channel sounder that can be used for AG channel measurements for mmWave (60 GHz) UAV communications.
△ Less
Submitted 15 October, 2017; v1 submitted 1 July, 2017;
originally announced July 2017.
-
Asynchronous Linear Modulation Classification with Multiple Sensors via Generalized EM Algorithm
Authors:
O. Ozdemir,
T. Wimalajeewa,
B. Dulek,
P. K. Varshney,
W. Su
Abstract:
In this paper, we consider the problem of automatic modulation classification with multiple sensors in the presence of unknown time offset, phase offset and received signal amplitude. We develop a novel hybrid maximum likelihood (HML) classification scheme based on a generalized expectation maximization (GEM) algorithm. GEM is capable of finding ML estimates numerically that are extremely hard to…
▽ More
In this paper, we consider the problem of automatic modulation classification with multiple sensors in the presence of unknown time offset, phase offset and received signal amplitude. We develop a novel hybrid maximum likelihood (HML) classification scheme based on a generalized expectation maximization (GEM) algorithm. GEM is capable of finding ML estimates numerically that are extremely hard to obtain otherwise. Assuming a good initialization technique is available for GEM, we show that the classification performance can be greatly improved with multiple sensors compared to that with a single sensor, especially when the signal-to-noise ratio (SNR) is low. We further demonstrate the superior performance of our approach when simulated annealing (SA) with uniform as well as nonuniform grids is employed for initialization of GEM in low SNR regions. The proposed GEM based approach employs only a small number of samples (in the order of hundreds) at a given sensor node to perform both time and phase synchronization, signal power estimation, followed by modulation classification. We provide simulation results to show the computational efficiency and effectiveness of the proposed algorithm.
△ Less
Submitted 3 February, 2015; v1 submitted 29 September, 2014;
originally announced September 2014.
-
Permutation Trellis Coded Multi-level FSK Signaling to Mitigate Primary User Interference in Cognitive Radio Networks
Authors:
Raghed El-Bardan,
Engin Masazade,
Onur Ozdemir,
Yunghsiang S. Han,
Pramod K. Varshney
Abstract:
We employ Permutation Trellis Code (PTC) based multi-level Frequency Shift Keying signaling to mitigate the impact of Primary Users (PUs) on the performance of Secondary Users (SUs) in Cognitive Radio Networks (CRNs). The PUs are assumed to be dynamic in that they appear intermittently and stay active for an unknown duration. Our approach is based on the use of PTC combined with multi-level FSK mo…
▽ More
We employ Permutation Trellis Code (PTC) based multi-level Frequency Shift Keying signaling to mitigate the impact of Primary Users (PUs) on the performance of Secondary Users (SUs) in Cognitive Radio Networks (CRNs). The PUs are assumed to be dynamic in that they appear intermittently and stay active for an unknown duration. Our approach is based on the use of PTC combined with multi-level FSK modulation so that an SU can improve its data rate by increasing its transmission bandwidth while operating at low power and not creating destructive interference for PUs. We evaluate system performance by obtaining an approximation for the actual Bit Error Rate (BER) using properties of the Viterbi decoder and carry out a thorough performance analysis in terms of BER and throughput. The results show that the proposed coded system achieves i) robustness by ensuring that SUs have stable throughput in the presence of heavy PU interference and ii) improved resiliency of SU links to interference in the presence of multiple dynamic PUs.
△ Less
Submitted 12 December, 2014; v1 submitted 11 July, 2014;
originally announced August 2014.
-
Hybrid Maximum Likelihood Modulation Classification Using Multiple Radios
Authors:
Onur Ozdemir,
Ruoyu Li,
Pramod K. Varshney
Abstract:
The performance of a modulation classifier is highly sensitive to channel signal-to-noise ratio (SNR). In this paper, we focus on amplitude-phase modulations and propose a modulation classification framework based on centralized data fusion using multiple radios and the hybrid maximum likelihood (ML) approach. In order to alleviate the computational complexity associated with ML estimation, we ado…
▽ More
The performance of a modulation classifier is highly sensitive to channel signal-to-noise ratio (SNR). In this paper, we focus on amplitude-phase modulations and propose a modulation classification framework based on centralized data fusion using multiple radios and the hybrid maximum likelihood (ML) approach. In order to alleviate the computational complexity associated with ML estimation, we adopt the Expectation Maximization (EM) algorithm. Due to SNR diversity, the proposed multi-radio framework provides robustness to channel SNR. Numerical results show the superiority of the proposed approach with respect to single radio approaches as well as to modulation classifiers using moments based estimators.
△ Less
Submitted 10 June, 2013; v1 submitted 4 March, 2013;
originally announced March 2013.
-
Asymptotic Properties of Likelihood Based Linear Modulation Classification Systems
Authors:
Onur Ozdemir,
Pramod K. Varshney,
Wei Su,
Andrew L. Drozd
Abstract:
The problem of linear modulation classification using likelihood based methods is considered. Asymptotic properties of most commonly used classifiers in the literature are derived. These classifiers are based on hybrid likelihood ratio test (HLRT) and average likelihood ratio test (ALRT), respectively. Both a single-sensor setting and a multi-sensor setting that uses a distributed decision fusion…
▽ More
The problem of linear modulation classification using likelihood based methods is considered. Asymptotic properties of most commonly used classifiers in the literature are derived. These classifiers are based on hybrid likelihood ratio test (HLRT) and average likelihood ratio test (ALRT), respectively. Both a single-sensor setting and a multi-sensor setting that uses a distributed decision fusion approach are analyzed. For a modulation classification system using a single sensor, it is shown that HLRT achieves asymptotically vanishing probability of error (Pe) whereas the same result cannot be proven for ALRT. In a multi-sensor setting using soft decision fusion, conditions are derived under which Pe vanishes asymptotically. Furthermore, the asymptotic analysis of the fusion rule that assumes independent sensor decisions is carried out.
△ Less
Submitted 28 November, 2012;
originally announced November 2012.