Skip to main content

Showing 1–24 of 24 results for author: Paolo, G

.
  1. arXiv:2505.15354  [pdf, ps, other

    cs.LG stat.ML

    Human in the Loop Adaptive Optimization for Improved Time Series Forecasting

    Authors: Malik Tiomoko, Hamza Cherkaoui, Giuseppe Paolo, Zhang Yili, Yu Meng, Zhang Keli, Hafiz Tiomoko Ali

    Abstract: Time series forecasting models often produce systematic, predictable errors even in critical domains such as energy, finance, and healthcare. We introduce a novel post training adaptive optimization framework that improves forecast accuracy without retraining or architectural changes. Our method automatically applies expressive transformations optimized via reinforcement learning, contextual bandi… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2503.01780  [pdf, other

    econ.TH

    Price Impact of Health Insurance

    Authors: Andrea Di Giovan Paolo, Jose Higueras

    Abstract: This paper examines the equilibrium effects of insurance contracts on healthcare markets using a mechanism design framework. A population of risk-averse agents with preferences as in Yaari (1987) face the risk of developing an illness of unknown severity, which can be treated in a competitive hospital services market at the prevailing market price. After privately observing their health risk, but… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  3. arXiv:2502.15425  [pdf, other

    cs.AI cs.LG eess.SY

    TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning

    Authors: Giuseppe Paolo, Abdelhakim Benechehab, Hamza Cherkaoui, Albert Thomas, Balázs Kégl

    Abstract: Hierarchical organization is fundamental to biological systems and human societies, yet artificial intelligence systems often rely on monolithic architectures that limit adaptability and scalability. Current hierarchical reinforcement learning (HRL) approaches typically restrict hierarchies to two levels or require centralized training, which limits their practical applicability. We introduce TAME… ▽ More

    Submitted 5 March, 2025; v1 submitted 21 February, 2025; originally announced February 2025.

  4. arXiv:2502.10235  [pdf, other

    stat.ML cs.LG

    AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting

    Authors: Abdelhakim Benechehab, Vasilii Feofanov, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: Pre-trained foundation models (FMs) have shown exceptional performance in univariate time series forecasting tasks. However, several practical challenges persist, including managing intricate dependencies among features and quantifying uncertainty in predictions. This study aims to tackle these critical limitations by introducing adapters; feature-space transformations that facilitate the effectiv… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  5. arXiv:2502.06483  [pdf, ps, other

    astro-ph.IM astro-ph.SR

    Sunrise III: Overview of Observatory and Instruments

    Authors: Andreas Korpi-Lagg, Achim Gandorfer, Sami K. Solanki, Jose Carlos del Toro Iniesta, Yukio Katsukawa, Pietro Bernasconi, Thomas Berkefeld, Alex Feller, Tino L. Riethmüller, Alberto Álvarez-Herrero, Masahito Kubo, Valentín Martínez Pillet, H. N. Smitha, David Orozco Suárez, Bianca Grauf, Michael Carpenter, Alexander Bell, María-Teresa Álvarez-Alonso, Daniel Álvarez García, Beatriz Aparicio del Moral, Daniel Ayoub, Francisco Javier Bailén, Eduardo Bailón Martínez, Maria Balaguer Jiménez, Peter Barthol , et al. (95 additional authors not shown)

    Abstract: In July 2024, Sunrise completed its third successful science flight. The Sunrise III observatory had been upgraded significantly after the two previous successful flights in 2009 and 2013. Three completely new instruments focus on the small-scale physical processes and their complex interaction from the deepest observable layers in the photosphere up to chromospheric heights. Previously poorly exp… ▽ More

    Submitted 30 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 74 pages, 26 figures. Published as part of the Solar Physics Topical Collection "The Sunrise III Solar Observatory" (https://link.springer.com/collections/jegdciedig)

    Journal ref: Sol Phys 300, 75 (2025)

  6. arXiv:2411.03562  [pdf, other

    cs.LG cs.AI

    Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

    Authors: Antoine Grosnit, Alexandre Maraval, James Doran, Giuseppe Paolo, Albert Thomas, Refinath Shahul Hameed Nabeezath Beevi, Jonas Gonzalez, Khyati Khandelwal, Ignacio Iacobacci, Abdelhakim Benechehab, Hamza Cherkaoui, Youssef Attia El-Hili, Kun Shao, Jianye Hao, Jun Yao, Balazs Kegl, Haitham Bou-Ammar, Jun Wang

    Abstract: We introduce Agent K v1.0, an end-to-end autonomous data science agent designed to automate, optimise, and generalise across diverse data science tasks. Fully automated, Agent K v1.0 manages the entire data science life cycle by learning from experience. It leverages a highly flexible structured reasoning framework to enable it to dynamically process memory in a nested structure, effectively learn… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  7. arXiv:2410.11711  [pdf, other

    stat.ML cs.LG

    Zero-shot Model-based Reinforcement Learning using Large Language Models

    Authors: Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat, Oussama Zekri, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Ievgen Redko, Balázs Kégl

    Abstract: The emerging zero-shot capabilities of Large Language Models (LLMs) have led to their applications in areas extending well beyond natural language processing tasks. In reinforcement learning, while LLMs have been extensively used in text-based environments, their integration with continuous state spaces remains understudied. In this paper, we investigate how pre-trained LLMs can be leveraged to pr… ▽ More

    Submitted 13 February, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Journal ref: The Thirteenth International Conference on Learning Representations (ICLR 2025)

  8. arXiv:2402.10198  [pdf, other

    cs.LG stat.ML

    SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

    Authors: Romain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko

    Abstract: Transformer-based architectures achieved breakthrough performance in natural language processing and computer vision, yet they remain inferior to simpler linear baselines in multivariate long-term forecasting. To better understand this phenomenon, we start by studying a toy linear forecasting problem for which we show that transformers are incapable of converging to their true solution despite the… ▽ More

    Submitted 3 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted as an Oral at ICML 2024, Vienna. The first two authors contributed equally

  9. arXiv:2402.03824  [pdf, ps, other

    cs.AI

    A call for embodied AI

    Authors: Giuseppe Paolo, Jonas Gonzalez-Billandon, Balázs Kégl

    Abstract: We propose Embodied AI as the next fundamental step in the pursuit of Artificial General Intelligence, juxtaposing it against current AI advancements, particularly Large Language Models. We traverse the evolution of the embodiment concept across diverse fields - philosophy, psychology, neuroscience, and robotics - to highlight how EAI distinguishes itself from the classical paradigm of static lear… ▽ More

    Submitted 13 September, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Published in ICML 2024 Position paper track

    Journal ref: PMLR 235:39493-39508, 2024

  10. arXiv:2402.03146  [pdf, other

    cs.LG stat.ML

    A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the length of the trajectory grows. In this paper we tackle this issue by using a multi-step objective to train one-step models. Our objective is a weighted sum of the m… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  11. arXiv:2401.10107  [pdf

    eess.SP cs.LG physics.med-ph

    Comparison analysis between standard polysomnographic data and in-ear-EEG signals: A preliminary study

    Authors: Gianpaolo Palo, Luigi Fiorillo, Giuliana Monachino, Michal Bechny, Michel Walti, Elias Meier, Francesca Pentimalli Biscaretti di Ruffia, Mark Melnykowycz, Athina Tzovara, Valentina Agostini, Francesca Dalia Faraci

    Abstract: Study Objectives: Polysomnography (PSG) currently serves as the benchmark for evaluating sleep disorders. Its discomfort makes long-term monitoring unfeasible, leading to bias in sleep quality assessment. Hence, less invasive, cost-effective, and portable alternatives need to be explored. One promising contender is the in-ear-EEG sensor. This study aims to establish a methodology to assess the sim… ▽ More

    Submitted 6 August, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 20 figures, 6 tables

    Journal ref: Sleep Advances, 2025

  12. arXiv:2310.05672  [pdf, other

    cs.LG stat.ML

    Multi-timestep models for Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as length of the trajectory grows. In this paper we tackle this issue by using a multi-timestep objective to train one-step models. Our objective is a weighted sum of a los… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  13. arXiv:2206.09743  [pdf, other

    cs.AI cs.LG eess.SY

    Guided Safe Shooting: model based reinforcement learning with safety constraints

    Authors: Giuseppe Paolo, Jonas Gonzalez-Billandon, Albert Thomas, Balázs Kégl

    Abstract: In the last decade, reinforcement learning successfully solved complex control tasks and decision-making problems, like the Go board game. Yet, there are few success stories when it comes to deploying those algorithms to real-world scenarios. One of the reasons is the lack of guarantees when dealing with and avoiding unsafe states, a fundamental requirement in critical control engineering systems.… ▽ More

    Submitted 12 September, 2024; v1 submitted 20 June, 2022; originally announced June 2022.

  14. arXiv:2203.01027  [pdf, other

    cs.LG cs.AI cs.NE

    Learning in Sparse Rewards settings through Quality-Diversity algorithms

    Authors: Giuseppe Paolo

    Abstract: In the Reinforcement Learning (RL) framework, the learning is guided through a reward signal. This means that in situations of sparse rewards the agent has to focus on exploration, in order to discover which action, or set of actions leads to the reward. RL agents usually struggle with this. Exploration is the focus of Quality-Diversity (QD) methods. In this thesis, we approach the problem of spar… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: PhD Thesis

  15. arXiv:2111.01919  [pdf, other

    cs.LG cs.AI cs.NE cs.RO

    Discovering and Exploiting Sparse Rewards in a Learned Behavior Space

    Authors: Giuseppe Paolo, Miranda Coninx, Alban Laflaquière, Stephane Doncieux

    Abstract: Learning optimal policies in sparse rewards settings is difficult as the learning agent has little to no feedback on the quality of its actions. In these situations, a good strategy is to focus on exploration, hopefully leading to the discovery of a reward signal to improve on. A learning algorithm capable of dealing with this kind of settings has to be able to (1) explore possible agent behaviors… ▽ More

    Submitted 26 September, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 25 pages. Published by the Evolutionary Computation Journal, MIT Press

  16. arXiv:2102.03140  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Sparse Reward Exploration via Novelty Search and Emitters

    Authors: Giuseppe Paolo, Alexandre Coninx, Stephane Doncieux, Alban Laflaquière

    Abstract: Reward-based optimization algorithms require both exploration, to find rewards, and exploitation, to maximize performance. The need for efficient exploration is even more significant in sparse reward settings, in which performance feedback is given sparingly, thus rendering it unsuitable for guiding the search process. In this work, we introduce the SparsE Reward Exploration via Novelty and Emitte… ▽ More

    Submitted 16 April, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: In 2021 Genetic and Evolutionary Computation Conference (GECCO 21), July, 2021, Lille, France. ACM, New York, NY, USA, 11 pages

  17. arXiv:2005.06224  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Novelty Search makes Evolvability Inevitable

    Authors: Stephane Doncieux, Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx

    Abstract: Evolvability is an important feature that impacts the ability of evolutionary processes to find interesting novel solutions and to deal with changing conditions of the problem to solve. The estimation of evolvability is not straightforward and is generally too expensive to be directly used as selective pressure in the evolutionary process. Indirectly promoting evolvability as a side effect of othe… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  18. arXiv:1909.05508  [pdf

    cs.RO cs.AI cs.LG cs.NE

    Unsupervised Learning and Exploration of Reachable Outcome Space

    Authors: Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx, Stephane Doncieux

    Abstract: Performing Reinforcement Learning in sparse rewards settings, with very little prior knowledge, is a challenging problem since there is no signal to properly guide the learning process. In such situations, a good search strategy is fundamental. At the same time, not having to adapt the algorithm to every single problem is very desirable. Here we introduce TAXONS, a Task Agnostic eXploration of Out… ▽ More

    Submitted 4 May, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at IEEE International Conference on Robotics and Automation (ICRA) 2020

  19. A Critical-like Collective State Leads to Long-range Cell Communication in Dictyostelium discoideum Aggregation

    Authors: Giovanna De Palo, Darvin Yi, Robert G. Endres

    Abstract: The transition from single-cell to multicellular behavior is important in early development but rarely studied. The starvation-induced aggregation of the social amoeba Dictyostelium discoideum into a multicellular slug is known to result from single-cell chemotaxis towards emitted pulses of cyclic adenosine monophosphate (cAMP). However, how exactly do transient short-range chemical gradients lead… ▽ More

    Submitted 11 January, 2018; originally announced January 2018.

    Comments: 19 pages, 4 figures. This is an earlier version which contains cell steering by applied perturbations in Fig. 4

    Journal ref: Final version (mainly different Fig. 4 and a bit less technical): De Palo G, Yi D, Endres RG. PLoS Biol. 15(4): e1002602 (2017)

  20. arXiv:1709.08528  [pdf, other

    cs.RO

    A Data-driven Model for Interaction-aware Pedestrian Motion Prediction in Object Cluttered Environments

    Authors: Mark Pfeiffer, Giuseppe Paolo, Hannes Sommer, Juan Nieto, Roland Siegwart, Cesar Cadena

    Abstract: This paper reports on a data-driven, interaction-aware motion prediction approach for pedestrians in environments cluttered with static obstacles. When navigating in such workspaces shared with humans, robots need accurate motion predictions of the surrounding pedestrians. Human navigation behavior is mostly influenced by their surrounding pedestrians and by the static obstacles in their vicinity.… ▽ More

    Submitted 26 February, 2018; v1 submitted 25 September, 2017; originally announced September 2017.

    Comments: 8 pages, accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA) 2018

  21. arXiv:1709.08430  [pdf, other

    cs.RO cs.AI cs.LG

    Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning

    Authors: Giuseppe Paolo, Lei Tai, Ming Liu

    Abstract: In this paper we focus on developing a control algorithm for multi-terrain tracked robots with flippers using a reinforcement learning (RL) approach. The work is based on the deep deterministic policy gradient (DDPG) algorithm, proven to be very successful in simple simulation environments. The algorithm works in an end-to-end fashion in order to control the continuous position of the flippers. Th… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 12 pages, single column, submitted to International Journal of Robotics and Automation (IJRA)

  22. arXiv:1703.00420  [pdf, other

    cs.RO cs.AI cs.LG

    Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation

    Authors: Lei Tai, Giuseppe Paolo, Ming Liu

    Abstract: We present a learning-based mapless motion planner by taking the sparse 10-dimensional range findings and the target position with respect to the mobile robot coordinate frame as input and the continuous steering commands as output. Traditional motion planners for mobile ground robots with a laser range sensor mostly depend on the obstacle map of the navigation environment where both the highly pr… ▽ More

    Submitted 21 July, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    Comments: video: https://www.youtube.com/watch?v=9AOIwBYIBbs, 6 pages, 9 figures, to appear in he 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017), final submission version

  23. Unraveling Adaptation in Eukaryotic Pathways: Lessons from Protocells

    Authors: Giovanna De Palo, Robert G. Endres

    Abstract: Eukaryotic adaptation pathways operate within wide-ranging environmental conditions without stimulus saturation. Despite numerous differences in the adaptation mechanisms employed by bacteria and eukaryotes, all require energy consumption. Here, we present two minimal models showing that expenditure of energy by the cell is not essential for adaptation. Both models share important features with la… ▽ More

    Submitted 13 September, 2013; originally announced September 2013.

    Comments: accepted for publication in PLoS Computational Biology; 19 pages, 8 figures

  24. Properties of a family of n reggeized gluon states in multicolour QCD

    Authors: Vacca Gian Paolo

    Abstract: A general relation between families of (n+1) gluon and n gluon eigenstates of the BKP evolution kernels in the multicolour limit of QCD is derived. It allows to construct an (n+1) gluon eigenstate if an n gluon eigenstate is known; this solution is Bose symmetric and thus physical for even n. A recently found family of odderon solutions corresponds to the particular case n=2.

    Submitted 7 July, 2000; originally announced July 2000.

    Comments: 11 pages, 1 figure

    Report number: DESY 00-077

    Journal ref: Phys.Lett.B489:337-344,2000