Trajectory Tracking Control of Skid-Steering Mobile Robots with Slip and Skid Compensation using Sliding-Mode Control and Deep Learning
Authors:
Payam Nourizadeh,
Fiona J Stevens McFadden,
Will N Browne
Abstract:
Compensating for slip and skid is crucial for mobile robots navigating outdoor terrains. In these challenging environments, slipping and skidding introduce uncertainties into trajectory tracking systems, potentially compromising the safety of the vehicle. Despite research in this field, having a real-world feasible online slip and skid compensation remains challenging due to the complexity of wheel-terrain interaction in outdoor environments. This paper proposes a novel trajectory tracking technique featuring real-world feasible online slip and skid compensation at the vehicle level for skid-steering mobile robots operating outdoors. The approach employs sliding-mode control to design a robust trajectory tracking system, accounting for the inherent uncertainties in this type of robot. To estimate the robot's slipping and undesired skidding and compensate for them in real-time, two previously developed deep learning models are integrated into the control-feedback loop. The main advantages of the proposed technique are that it (1) considers two slip-related parameters for the entire robot, as opposed to the conventional approach involving two slip components for each wheel along with the robot's skidding, and (2) has an online real-world feasible slip and skid compensator, reducing the tracking errors in unforeseen environments. Experimental results demonstrate a significant improvement, enhancing the trajectory tracking system's performance by over 27%.
Submitted 23 October, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
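As a rough illustration of the sliding-mode idea named in the abstract above, the following minimal sketch tracks a reference with a scalar first-order system under a bounded disturbance. It is not the authors' controller: the plant, the function name `simulate_smc`, the gain, the boundary-layer width, and the sinusoidal disturbance (standing in for unmodeled slip) are all assumptions chosen for illustration, and the deep-learning slip and skid estimators are not modeled.

```python
import math


def simulate_smc(steps=2000, dt=0.001, gain=2.0, boundary=0.05):
    """Track x_ref(t) = sin(t) with the toy plant x' = u + d, |d| <= 0.5.

    Returns the absolute tracking error after `steps` Euler steps.
    All numeric values here are illustrative assumptions.
    """
    x = 1.0  # start with a tracking error of 1
    t = 0.0
    for _ in range(steps):
        x_ref = math.sin(t)
        x_ref_dot = math.cos(t)
        error = x - x_ref  # sliding variable s = e for a first-order plant
        # Saturated switching term: a boundary layer that limits chattering.
        sat = max(-1.0, min(1.0, error / boundary))
        # Feedforward on the reference plus a switching term whose gain
        # exceeds the disturbance bound (2.0 > 0.5).
        u = x_ref_dot - gain * sat
        disturbance = 0.5 * math.sin(5.0 * t)  # hypothetical bounded disturbance
        x += (u + disturbance) * dt
        t += dt
    return abs(x - math.sin(t))


if __name__ == "__main__":
    print(f"final tracking error: {simulate_smc():.4f}")
```

Because the switching gain is larger than the disturbance bound, the error in this toy setup is driven into the boundary layer and stays there. A real skid-steering controller would act on the vehicle's kinematic model rather than a scalar state.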
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology
Authors:
Eugene Ie,
Vihan Jain,
Jing Wang,
Sanmit Narvekar,
Ritesh Agarwal,
Rui Wu,
Heng-Tze Cheng,
Morgane Lustman,
Vince Gatto,
Paul Covington,
Jim McFadden,
Tushar Chandra,
Craig Boutilier
Abstract:
Most practical recommender systems focus on estimating immediate user engagement without considering the long-term effects of recommendations on user behavior. Reinforcement learning (RL) methods offer the potential to optimize recommendations for long-term user engagement. However, since users are often presented with slates of multiple items - which may have interacting effects on user choice - methods are required to deal with the combinatorics of the RL action space. In this work, we address the challenge of making slate-based recommendations to optimize long-term value using RL. Our contributions are three-fold. (i) We develop SLATEQ, a decomposition of value-based temporal-difference and Q-learning that renders RL tractable with slates. Under mild assumptions on user choice behavior, we show that the long-term value (LTV) of a slate can be decomposed into a tractable function of its component item-wise LTVs. (ii) We outline a methodology that leverages existing myopic learning-based recommenders to quickly develop a recommender that handles LTV. (iii) We demonstrate our methods in simulation, and validate the scalability of decomposed TD-learning using SLATEQ in live experiments on YouTube.
Submitted 31 May, 2019; v1 submitted 29 May, 2019;
originally announced May 2019.
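The abstract above states that a slate's long-term value (LTV) can be decomposed into a function of its item-wise LTVs under assumptions on user choice. One form such a decomposition can take is a choice-probability-weighted sum of item-wise LTVs. The sketch below assumes a conditional-logit choice model with a no-click option; the function names, scores, and LTV values are hypothetical and are not taken from the abstract.

```python
import math


def choice_probabilities(scores, no_click_score=0.0):
    """Conditional-logit choice over the slate items plus a no-click option.

    The choice model is an assumption made for this sketch.
    """
    weights = [math.exp(s) for s in scores]
    denom = sum(weights) + math.exp(no_click_score)
    return [w / denom for w in weights]


def slate_ltv(scores, item_ltvs, no_click_score=0.0):
    """Slate LTV as the sum over items of P(item chosen | slate) * item LTV."""
    probs = choice_probabilities(scores, no_click_score)
    return sum(p * q for p, q in zip(probs, item_ltvs))


if __name__ == "__main__":
    # Two equally attractive items with hypothetical item-wise LTVs 1.0 and 3.0.
    print(slate_ltv([0.0, 0.0], [1.0, 3.0]))
```

With equal scores for both items and the no-click option, each item is chosen with probability 1/3, so the slate value in this example is (1.0 + 3.0) / 3. The point of the decomposition is that only item-wise values need to be learned, instead of one value per possible slate.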