Skip to main content

Showing 1–14 of 14 results for author: Furfaro, R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2205.00085  [pdf

    cs.RO eess.SY

    Line of Sight Curvature for Missile Guidance using Reinforcement Meta-Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We use reinforcement meta learning to optimize a line of sight curvature policy that increases the effectiveness of a guidance system against maneuvering targets. The policy is implemented as a recurrent neural network that maps navigation system outputs to a Euler 321 attitude representation. The attitude representation is then used to construct a direction cosine matrix that biases the observed… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: Submitted to 2023 Scitech Guidance and Control conference. arXiv admin note: substantial text overlap with arXiv:2109.03880; text overlap with arXiv:2004.09978

  2. arXiv:2112.08540  [pdf

    eess.SY cs.AI cs.RO

    Integrated Guidance and Control for Lunar Landing using a Stabilized Seeker

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We develop an integrated guidance and control system that in conjunction with a stabilized seeker and landing site detection software can achieve precise and safe planetary landing. The seeker tracks the designated landing site by adjusting seeker elevation and azimuth angles to center the designated landing site in the sensor field of view. The seeker angles, closing speed, and range to the desig… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted for 2022 AIAA Scitech GN&C. arXiv admin note: text overlap with arXiv:2107.14764, arXiv:2004.09978, arXiv:2110.00634, arXiv:2109.03880

  3. Terminal Adaptive Guidance for Autonomous Hypersonic Strike Weapons via Reinforcement Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: An adaptive guidance system suitable for the terminal phase trajectory of a hypersonic strike weapon is optimized using reinforcement meta learning. The guidance system maps observations directly to commanded bank angle, angle of attack, and sideslip angle rates. Importantly, the observations are directly measurable from radar seeker outputs with minimal processing. The optimization framework impl… ▽ More

    Submitted 16 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.14764; text overlap with arXiv:2109.03880

  4. Integrated and Adaptive Guidance and Control for Endoatmospheric Missiles via Reinforcement Learning

    Authors: Brian Gaudet, Roberto Furfaro

    Abstract: We apply a reinforcement meta-learning framework to optimize an integrated and adaptive guidance and flight control system for an air-to-air missile. The system is implemented as a policy that maps navigation system outputs directly to commanded rates of change for the missile's control surface deflections. The system induces intercept trajectories against a maneuvering target that satisfy control… ▽ More

    Submitted 3 May, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Preprint for 2023 Scitech GN&C submission

  5. arXiv:2004.09978  [pdf, other

    eess.SY cs.LG

    Reinforcement Meta-Learning for Interception of Maneuvering Exoatmospheric Targets with Parasitic Attitude Loop

    Authors: Brian Gaudet, Roberto Furfaro, Richard Linares, Andrea Scorsoglio

    Abstract: We use Reinforcement Meta-Learning to optimize an adaptive integrated guidance, navigation, and control system suitable for exoatmospheric interception of a maneuvering target. The system maps observations consisting of strapdown seeker angles and rate gyro measurements directly to thruster on-off commands. Using a high fidelity six degree-of-freedom simulator, we demonstrate that the optimized po… ▽ More

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: Under Consideration for publication in Journal of Spacecraft and Rockets. arXiv admin note: text overlap with arXiv:1906.02113

  6. Adaptive Generalized ZEM-ZEV Feedback Guidance for Planetary Landing via a Deep Reinforcement Learning Approach

    Authors: Roberto Furfaro, Andrea Scorsoglio, Richard Linares, Mauro Massari

    Abstract: Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field of active research. The algorithm, although powerful in terms of accuracy and ease of implementation… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: 46 pages, 14 figures, Acta Astronautica, Pre-proof, Available March 4, 2020

  7. Fuel-Efficient Powered Descent Guidance on Large Planetary Bodies via Theory of Functional Connections

    Authors: Hunter Johnston, Enrico Schiassi, Roberto Furfaro, Daniele Mortari

    Abstract: In this paper we present a new approach to solve the fuel-efficient powered descent guidance problem on large planetary bodies with no atmosphere (e.g. the Moon or Mars) using the recently developed Theory of Functional Connections. The problem is formulated using the indirect method which casts the optimal guidance problem as a system of nonlinear two-point boundary value problems. Using the Theo… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: 17 pages, 10 figures, 6 tables

    MSC Class: 49M99

    Journal ref: The Journal of the Astronautical Sciences (2020)

  8. Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via LIDAR Altimetry and Reinforcement Meta-Learning

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown… ▽ More

    Submitted 8 February, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: Earlier version presented at 2020 AIAA Scitech conference. arXiv admin note: substantial text overlap with arXiv:1907.06098, arXiv:1906.02113

  9. arXiv:1911.00489  [pdf, other

    eess.SY

    Space Objects Maneuvering Prediction via Maximum Causal Entropy Inverse Reinforcement Learning

    Authors: Bryce Doerr, Richard Linares, Roberto Furfaro

    Abstract: Inverse Reinforcement Learning (RL) can be used to determine the behavior of Space Objects (SOs) by estimating the reward function that an SO is using for control. The approach discussed in this work can be used to analyze maneuvering of SOs from observational data. The inverse RL problem is solved using maximum causal entropy. This approach determines the optimal reward function that a SO is usin… ▽ More

    Submitted 6 December, 2019; v1 submitted 1 November, 2019; originally announced November 2019.

  10. arXiv:1907.06098  [pdf, other

    eess.SY astro-ph.IM cs.LG

    Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied to Asteroid Close Proximity Operations

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: Current practice for asteroid close proximity maneuvers requires extremely accurate characterization of the environmental dynamics and precise spacecraft positioning prior to the maneuver. This creates a delay of several months between the spacecraft's arrival and the ability to safely complete close proximity maneuvers. In this work we develop an adaptive integrated guidance, navigation, and cont… ▽ More

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: Accepted for 2020 AAS Conference

  11. Reinforcement Learning for Angle-Only Intercept Guidance of Maneuvering Targets

    Authors: Brian Gaudet, Roberto Furfaro, Richard Linares

    Abstract: We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importantly, the guidance law does not require range estimation, making it particularly suitable for passiv… ▽ More

    Submitted 15 November, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Also in 2020 AIAA Scitech Guidance Navigation and Control Conference

  12. Adaptive Guidance and Integrated Navigation with Reinforcement Meta-Learning

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a r… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.04473

  13. arXiv:1901.03895  [pdf, other

    cs.LG eess.SY

    Learning Accurate Extended-Horizon Predictions of High Dimensional Trajectories

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: We present a novel predictive model architecture based on the principles of predictive coding that enables open loop prediction of future observations over extended horizons. There are two key innovations. First, whereas current methods typically learn to make long-horizon open-loop predictions using a multi-step cost function, we instead run the model open loop in the forward pass during training… ▽ More

    Submitted 12 January, 2019; originally announced January 2019.

  14. arXiv:1810.08719  [pdf, other

    eess.SY

    Deep Reinforcement Learning for Six Degree-of-Freedom Planetary Powered Descent and Landing

    Authors: Brian Gaudet, Richard Linares, Roberto Furfaro

    Abstract: Future Mars missions will require advanced guidance, navigation, and control algorithms for the powered descent phase to target specific surface locations and achieve pinpoint accuracy (landing error ellipse $<$ 5 m radius). The latter requires both a navigation system capable of estimating the lander's state in real-time and a guidance and control system that can map the estimated lander state to… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 37 pages