Skip to main content

Showing 1–8 of 8 results for author: Angelotti, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05703  [pdf, other

    cs.MA cs.AI cs.HC cs.LG cs.RO

    Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming

    Authors: Giorgio Angelotti, Caroline P. C. Chanel, Adam H. M. Pinto, Christophe Lounis, Corentin Chauffaut, Nicolas Drougard

    Abstract: The integration of physiological computing into mixed-initiative human-robot interaction systems offers valuable advantages in autonomous task allocation by incorporating real-time features as human state observations into the decision-making system. This approach may alleviate the cognitive load on human operators by intelligently allocating mission tasks between agents. Nevertheless, accommodati… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted as a full paper at AAMAS 2024

  2. arXiv:2401.06091  [pdf, other

    cs.LG stat.ME

    A Closer Look at AUROC and AUPRC under Class Imbalance

    Authors: Matthew B. A. McDermott, Haoran Zhang, Lasse Hyldig Hansen, Giovanni Angelotti, Jack Gallifant

    Abstract: In machine learning (ML), a widespread claim is that the area under the precision-recall curve (AUPRC) is a superior metric for model comparison to the area under the receiver operating characteristic (AUROC) for tasks with class imbalance. This paper refutes this notion on two fronts. First, we theoretically characterize the behavior of AUROC and AUPRC in the presence of model mistakes, establish… ▽ More

    Submitted 13 January, 2025; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: NeurIPS 2024 (https://openreview.net/forum?id=S3HvA808gk)

  3. arXiv:2310.19676  [pdf, ps, other

    cs.LG

    HyPE: Attention with Hyperbolic Biases for Relative Positional Encoding

    Authors: Giorgio Angelotti

    Abstract: In Transformer-based architectures, the attention mechanism is inherently permutation-invariant with respect to the input sequence's tokens. To impose sequential order, token positions are typically encoded using a scheme with either fixed or learnable parameters. We introduce Hyperbolic Positional Encoding (HyPE), a novel method that utilizes hyperbolic functions' properties to encode tokens' rel… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Independent Research

  4. Towards a more efficient computation of individual attribute and policy contribution for post-hoc explanation of cooperative multi-agent systems using Myerson values

    Authors: Giorgio Angelotti, Natalia Díaz-Rodríguez

    Abstract: A quantitative assessment of the global importance of an agent in a team is as valuable as gold for strategists, decision-makers, and sports coaches. Yet, retrieving this information is not trivial since in a cooperative task it is hard to isolate the performance of an individual from the one of the whole team. Moreover, it is not always clear the relationship between the role of an agent and his… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted for publication in Elsevier's Knowledge-Based Systems

    Journal ref: Knowledge-Based Systems 260 (2023) 110189

  5. Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning

    Authors: Giorgio Angelotti, Nicolas Drougard, Caroline P. C. Chanel

    Abstract: Offline estimation of the dynamical model of a Markov Decision Process (MDP) is a non-trivial task that greatly depends on the data available in the learning phase. Sometimes the dynamics of the model is invariant with respect to some transformations of the current state and action. Recent works showed that an expert-guided pipeline relying on Density Estimation methods as Deep Neural Network base… ▽ More

    Submitted 12 April, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: Accepted at ICAART 2023

  6. Expert-Guided Symmetry Detection in Markov Decision Processes

    Authors: Giorgio Angelotti, Nicolas Drougard, Caroline P. C. Chanel

    Abstract: Learning a Markov Decision Process (MDP) from a fixed batch of trajectories is a non-trivial task whose outcome's quality depends on both the amount and the diversity of the sampled regions of the state-action space. Yet, many MDPs are endowed with invariant reward and transition functions with respect to some transformations of the current state and action. Being able to detect and exploit these… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Accepted to the 14th International Conference on Agents and Artificial Intelligence - ICAART 2022

  7. arXiv:2105.13431  [pdf, other

    cs.LG cs.AI eess.SY

    An Offline Risk-aware Policy Selection Method for Bayesian Markov Decision Processes

    Authors: Giorgio Angelotti, Nicolas Drougard, Caroline Ponzoni Carvalho Chanel

    Abstract: In Offline Model Learning for Planning and in Offline Reinforcement Learning, the limited data set hinders the estimate of the Value function of the relative Markov Decision Process (MDP). Consequently, the performance of the obtained policy in the real world is bounded and possibly risky, especially when the deployment of a wrong policy can lead to catastrophic consequences. For this reason, seve… ▽ More

    Submitted 11 April, 2023; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Preprint, under review

  8. arXiv:2010.01931  [pdf, ps, other

    cs.LG cs.AI

    Offline Learning for Planning: A Summary

    Authors: Giorgio Angelotti, Nicolas Drougard, Caroline Ponzoni Carvalho Chanel

    Abstract: The training of autonomous agents often requires expensive and unsafe trial-and-error interactions with the environment. Nowadays several data sets containing recorded experiences of intelligent agents performing various tasks, spanning from the control of unmanned vehicles to human-robot interaction and medical applications are accessible on the internet. With the intention of limiting the costs… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, ICAPS 2020 Conference - Bridging the Gap Between AI Planning and Reinforcement Learning (PRL) Workshop