Skip to main content

Showing 1–4 of 4 results for author: Kermanshah, M

.
  1. arXiv:2411.17861  [pdf, other

    cs.LG cs.AI

    Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards

    Authors: Ahmad Ahmad, Mehdi Kermanshah, Kevin Leahy, Zachary Serlin, Ho Chit Siu, Makai Mann, Cristian-Ioan Vasile, Roberto Tron, Calin Belta

    Abstract: In this paper, we tackle the challenging problem of delayed rewards in reinforcement learning (RL). While Proximal Policy Optimization (PPO) has emerged as a leading Policy Gradient method, its performance can degrade under delayed rewards. We introduce two key enhancements to PPO: a hybrid policy architecture that combines an offline policy (trained on expert demonstrations) with an online PPO po… ▽ More

    Submitted 4 December, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

  2. arXiv:2406.02722  [pdf, other

    cs.RO

    Model Predictive Control for Magnetically-Actuated Cellbots

    Authors: Mehdi Kermanshah, Logan E. Beaver, Max Sokolich, Fatma Ceren Kirmizitas, Sambeeta Das, Roberto Tron, Ron Weiss, Calin Belta

    Abstract: This paper presents a control framework for magnetically actuated cellbots, which combines Model Predictive Control (MPC) with Gaussian Processes (GPs) as a disturbance estimator for precise trajectory tracking. To address the challenges posed by unmodeled dynamics, we integrate data-driven modeling with model-based control to accurately track desired trajectories using relatively small data. To t… ▽ More

    Submitted 26 September, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2403.14519  [pdf, other

    eess.SY

    Designing Robust Linear Output Feedback Controller based on CLF-CBF framework via Linear~Programming(LP-CLF-CBF)

    Authors: Mahroo Bahreinian, Mehdi Kermanshah, Roberto Tron

    Abstract: We consider the problem of designing output feedback controllers that use measurements from a set of landmarks to navigate through a cell-decomposable environment using duality, Control Lyapunov and Barrier Functions (CLF, CBF), and Linear Programming. We propose two objectives for navigating in an environment, one to traverse the environment by making loops and one by converging to a stabilizatio… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2203.04416

  4. arXiv:2310.08413  [pdf, other

    eess.SY

    Control-Based Planning over Probability Mass Function Measurements via Robust Linear Programming

    Authors: Mehdi Kermanshah, Calin Belta, Roberto Tron

    Abstract: We propose an approach to synthesize linear feedback controllers for linear systems in polygonal environments. Our method focuses on designing a robust controller that can account for uncertainty in measurements. Its inputs are provided by a perception module that generates probability mass functions (PMFs) for predefined landmarks in the environment, such as distinguishable geometric features. We… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.