-
Quality of service based radar resource management using deep reinforcement learning
Authors:
Sebastian Durst,
Stefan Brüggenwirth
Abstract:
An intelligent radar resource management is an essential milestone in the development of a cognitive radar system. The quality of service based resource allocation model (Q-RAM) is a framework allowing for intelligent decision making but classical solutions seem insufficient for real-time application in a modern radar system. In this paper, we present a solution for the Q-RAM radar resource manage…
▽ More
An intelligent radar resource management is an essential milestone in the development of a cognitive radar system. The quality of service based resource allocation model (Q-RAM) is a framework allowing for intelligent decision making but classical solutions seem insufficient for real-time application in a modern radar system. In this paper, we present a solution for the Q-RAM radar resource management problem using deep reinforcement learning considerably improving on runtime performance.
△ Less
Submitted 20 October, 2020;
originally announced October 2020.
-
Avoiding Jammers: A Reinforcement Learning Approach
Authors:
Serkan Ak,
Stefan Bruggenwirth
Abstract:
This paper investigates the anti-jamming performance of a cognitive radar under a partially observable Markov decision process (POMDP) model. First, we obtain an explicit expression for uncertainty of jammer dynamics, which paves the way for illuminating the performance metric of probability of being jammed for the radar beyond a conventional signal-to-noise ratio ($\mathsf{SNR}$) based analysis.…
▽ More
This paper investigates the anti-jamming performance of a cognitive radar under a partially observable Markov decision process (POMDP) model. First, we obtain an explicit expression for uncertainty of jammer dynamics, which paves the way for illuminating the performance metric of probability of being jammed for the radar beyond a conventional signal-to-noise ratio ($\mathsf{SNR}$) based analysis. Considering two frequency hopping strategies developed in the framework of reinforcement learning (RL), this performance metric is analyzed with deep Q-network (DQN) and long short term memory (LSTM) networks under various uncertainty values. Finally, the requirement of the target network in the RL algorithm for both network architectures is replaced with a softmax operator. Simulation results show that this operator improves upon the performance of the traditional target network.
△ Less
Submitted 27 November, 2019; v1 submitted 20 November, 2019;
originally announced November 2019.
-
Robotic Control for Cognitive UWB Radar
Authors:
Stefan Brüggenwirth,
Fernando Rial
Abstract:
In the article, we describe a trajectory planning problem for a 6-DOF robotic manipulator arm that carries an ultra-wideband (UWB) radar sensor with synthetic aperture (SAR). The resolution depends on the trajectory and velocity profile of the sensor head. The constraints can be modelled as an optimization problem to obtain a feasible, collision-free target trajectory of the end-effector of the ma…
▽ More
In the article, we describe a trajectory planning problem for a 6-DOF robotic manipulator arm that carries an ultra-wideband (UWB) radar sensor with synthetic aperture (SAR). The resolution depends on the trajectory and velocity profile of the sensor head. The constraints can be modelled as an optimization problem to obtain a feasible, collision-free target trajectory of the end-effector of the manipulator arm in Cartesian coordinates that minimizes observation time. For 3D-reconstruction, the target is observed in multiple height slices. For Through-the-Wall radar the sensor can be operated in sliding mode for scanning larger areas. For IED inspection the spot-light mode is preferred, constantly pointing the antennas towards the target to obtain maximum azimuth resolution.
△ Less
Submitted 11 October, 2017;
originally announced October 2017.