Showing 1–2 of 2 results for author: Kouzehgar, M
-
Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys
Authors:
Maryam Kouzehgar,
Malika Meghjani,
Roland Bouffanais
Abstract:
Autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem which can only be effectively carried out by a multi-robot system. In this paper, we focus on robotic swarms that are typically operated and controlled by means of simple swarming behaviors obtained from a subtle, yet ad hoc combination of bio-inspired strategies. We propose a novel and structured…
▽ More
Autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem which can only be effectively carried out by a multi-robot system. In this paper, we focus on robotic swarms that are typically operated and controlled by means of simple swarming behaviors obtained from a subtle, yet ad hoc combination of bio-inspired strategies. We propose a novel and structured approach for area coverage using multi-agent reinforcement learning (MARL) which effectively deals with the non-stationarity of environmental features. Specifically, we propose two dynamic area coverage approaches: (1) swarm-based MARL, and (2) coverage-range-based MARL. The former is trained using the multi-agent deep deterministic policy gradient (MADDPG) approach whereas, a modified version of MADDPG is introduced for the latter with a reward function that intrinsically leads to a collective behavior. Both methods are tested and validated with different geometric shaped regions with equal surface area (square vs. rectangle) yielding acceptable area coverage, and benefiting from the structured learning in non-stationary environments. Both approaches are advantageous compared to a naïve swarming method. However, coverage-range-based MARL outperforms the swarm-based MARL with stronger convergence features in learning criteria and higher spreading of agents for area coverage.
△ Less
Submitted 21 December, 2020;
originally announced December 2020.
-
Fuzzy Petri Nets for Human Behavior Verification and Validation
Authors:
M. Kouzehgar,
M. A. Badamchizadeh,
S. Khanmohammadi
Abstract:
Regarding the rapid growth of the size and complexity of simulation applications, designing applicable and affordable verification and validation (V&V) structures is an important problem. On the other hand, nowadays human behavior models are principles to make decision in many simulations and in order to have valid decisions based on a reliable human decision model, first the model must pass the v…
▽ More
Regarding the rapid growth of the size and complexity of simulation applications, designing applicable and affordable verification and validation (V&V) structures is an important problem. On the other hand, nowadays human behavior models are principles to make decision in many simulations and in order to have valid decisions based on a reliable human decision model, first the model must pass the validation and verification criteria. Usually human behavior models are represented as fuzzy rule bases. In all the recent works, V&V process is applied on a ready given rule-base. In this work, we are first supposed to construct a fuzzy rule-base and then apply the V&V process on it. Considering the professor-student interaction as the case-study, in order to construct the rule base, a questionnaire is designed in a special way to be transformed to a hierarchical fuzzy rule-base. The constructed fuzzy rule base is then mapped to a fuzzy Petri net and then within the verification (generating and searching the reachability graph) and validation (reasoning the Petri net) process is searched for probable structural and semantic errors.
△ Less
Submitted 5 March, 2013;
originally announced March 2013.