Feature Selection Using Reinforcement Learning
Authors:
Sali Rasoul,
Sodiq Adewole,
Alphonse Akakpo
Abstract:
With the decreasing cost of data collection, the space of variables or features that can be used to characterize a particular predictor of interest continues to grow exponentially. Therefore, identifying the most characterizing features that minimizes the variance without jeopardizing the bias of our models is critical to successfully training a machine learning model. In addition, identifying suc…
▽ More
With the decreasing cost of data collection, the space of variables or features that can be used to characterize a particular predictor of interest continues to grow exponentially. Therefore, identifying the most characterizing features that minimizes the variance without jeopardizing the bias of our models is critical to successfully training a machine learning model. In addition, identifying such features is critical for interpretability, prediction accuracy and optimal computation cost. While statistical methods such as subset selection, shrinkage, dimensionality reduction have been applied in selecting the best set of features, some other approaches in literature have approached feature selection task as a search problem where each state in the search space is a possible feature subset. In this paper, we solved the feature selection problem using Reinforcement Learning. Formulating the state space as a Markov Decision Process (MDP), we used Temporal Difference (TD) algorithm to select the best subset of features. Each state was evaluated using a robust and low cost classifier algorithm which could handle any non-linearities in the dataset.
△ Less
Submitted 23 January, 2021;
originally announced January 2021.
Detecting, identifying, and localizing radiological material in urban environments using scan statistics
Authors:
Michael D. Porter,
Alphonse Akakpo
Abstract:
A method is proposed, based on scan statistics, to detect, identify, and localize illicit radiological material using mobile sensors in an urban environment. Our method handles varying levels of background radiation that change according to an (unknown) environment. Our method can accurately determine if a source is present along a street segment as well as identify which of six possible sources g…
▽ More
A method is proposed, based on scan statistics, to detect, identify, and localize illicit radiological material using mobile sensors in an urban environment. Our method handles varying levels of background radiation that change according to an (unknown) environment. Our method can accurately determine if a source is present along a street segment as well as identify which of six possible sources generated the radiation. Our method can also localize the source, when detected, to within a few seconds. We have presented our results across a range of decision thresholds allowing stakeholders to evaluate the performance at different false alarm rates. Due to the simplicity of our approach, our models can be trained in a few minutes with very little training data and holds the potential to score a run in real-time. Our method was one of the top performing submissions in the 'Detecting Radiological Threats in Urban Areas' competition.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.