Showing 1–8 of 8 results for author: Rezaei-Shoshtari, S

  1. arXiv:2412.17123  [pdf, other]

    cs.LG cs.CY

    Fairness in Reinforcement Learning with Bisimulation Metrics

    Authors: Sahand Rezaei-Shoshtari, Hanna Yurchyk, Scott Fujimoto, Doina Precup, David Meger

    Abstract: Ensuring long-term fairness is crucial when developing automated decision making systems, specifically in dynamic and sequential environments. By maximizing their reward without consideration of fairness, AI agents can introduce disparities in their treatment of groups or individuals. In this paper, we establish the connection between bisimulation metrics and group fairness in reinforcement learni…

    Submitted 31 December, 2024; v1 submitted 22 December, 2024; originally announced December 2024.

  2. arXiv:2305.05666  [pdf, other]

    cs.LG cs.AI

    Policy Gradient Methods in the Presence of Symmetries and State Abstractions

    Authors: Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup

    Abstract: Reinforcement learning (RL) on high-dimensional and complex problems relies on abstraction for improved efficiency and generalization. In this paper, we study abstraction in the continuous-control setting, and extend the definition of Markov decision process (MDP) homomorphisms to the setting of continuous state and action spaces. We derive a policy gradient theorem on the abstract MDP for both st…

    Submitted 7 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Published in the Journal of Machine Learning Research (JMLR). arXiv admin note: text overlap with arXiv:2209.07364

  3. arXiv:2211.15457  [pdf, other]

    cs.LG

    Hypernetworks for Zero-shot Transfer in Reinforcement Learning

    Authors: Sahand Rezaei-Shoshtari, Charlotte Morissette, Francois Robert Hogan, Gregory Dudek, David Meger

    Abstract: In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks. This work relates to meta RL, contextual RL, and transfer learning, with a particular focus on zero-shot performance at test time, enabled by knowledge of the task parameters (also known…

    Submitted 2 January, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: AAAI 2023

  4. arXiv:2209.07364  [pdf, other]

    cs.LG

    Continuous MDP Homomorphisms and Homomorphic Policy Gradient

    Authors: Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, Doina Precup

    Abstract: Abstraction has been widely studied as a way to improve the efficiency and generalization of reinforcement learning algorithms. In this paper, we study abstraction in the continuous-control setting. We extend the definition of MDP homomorphisms to encompass continuous actions in continuous state spaces. We derive a policy gradient theorem on the abstract MDP, which allows us to leverage approximat…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  5. arXiv:2101.04454  [pdf, other]

    cs.LG cs.AI

    Learning Intuitive Physics with Multimodal Generative Models

    Authors: Sahand Rezaei-Shoshtari, Francois Robert Hogan, Michael Jenkin, David Meger, Gregory Dudek

    Abstract: Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions. This paper presents a perception framework that fuses visual and tactile feedback to make predictions about the expected motion of objects in dynamic scenes. Visual information captures object properties such as 3D shape and loca…

    Submitted 19 January, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Comments: AAAI 2021

  6. arXiv:2011.09552  [pdf, other]

    cs.RO

    Seeing Through your Skin: Recognizing Objects with a Novel Visuotactile Sensor

    Authors: Francois Robert Hogan, Michael Jenkin, Sahand Rezaei-Shoshtari, Yogesh Girdhar, David Meger, Gregory Dudek

    Abstract: We introduce a new class of vision-based sensor and associated algorithmic processes that combine visual imaging with high-resolution tactile sensing, all in a uniform hardware and computational architecture. We demonstrate the sensor's efficacy for both multi-modal object recognition and metrology. Object recognition is typically formulated as a unimodal task, but by combining two sensor modalit…

    Submitted 14 December, 2020; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: A version of this paper appears in WACV 2021

  7. arXiv:2007.11167  [pdf, other]

    cs.RO cs.LG

    Learning the Latent Space of Robot Dynamics for Cutting Interaction Inference

    Authors: Sahand Rezaei-Shoshtari, David Meger, Inna Sharf

    Abstract: Utilization of latent space to capture a lower-dimensional representation of a complex dynamics model is explored in this work. The targeted application is of a robotic manipulator executing a complex environment interaction task, in particular, cutting a wooden object. We train two flavours of Variational Autoencoders---standard and Vector-Quantised---to learn the latent space which is then used…

    Submitted 21 July, 2020; originally announced July 2020.

    Comments: IROS2020. Copyright 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  8. arXiv:1910.02291  [pdf, other]

    cs.RO cs.LG

    Cascaded Gaussian Processes for Data-efficient Robot Dynamics Learning

    Authors: Sahand Rezaei-Shoshtari, David Meger, Inna Sharf

    Abstract: Motivated by the recursive Newton-Euler formulation, we propose a novel cascaded Gaussian process learning framework for the inverse dynamics of robot manipulators. This approach leads to a significant dimensionality reduction which in turn results in better learning and data efficiency. We explore two formulations for the cascading: the inward and outward, both along the manipulator chain topolog…

    Submitted 5 October, 2019; originally announced October 2019.

    Comments: IROS2019.