Skip to main content

Showing 1–3 of 3 results for author: Gautron, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.04537  [pdf, other

    cs.AI

    Towards an efficient and risk aware strategy for guiding farmers in identifying best crop management

    Authors: Romain Gautron, Dorian Baudry, Myriam Adam, Gatien N Falconnier, Marc Corbeels

    Abstract: Identification of best performing fertilizer practices among a set of contrasting practices with field trials is challenging as crop losses are costly for farmers. To identify best management practices, an ''intuitive strategy'' would be to set multi-year field trials with equal proportion of each practice to test. Our objective was to provide an identification strategy using a bandit algorithm th… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  2. arXiv:2207.03270  [pdf, other

    cs.AI

    gym-DSSAT: a crop model turned into a Reinforcement Learning environment

    Authors: Romain Gautron, Emilio J. PadrĂ³n, Philippe Preux, Julien Bigot, Odalric-Ambrym Maillard, David Emukpere

    Abstract: Addressing a real world sequential decision problem with Reinforcement Learning (RL) usually starts with the use of a simulated environment that mimics real conditions. We present a novel open source RL environment for realistic crop management tasks. gym-DSSAT is a gym interface to the Decision Support System for Agrotechnology Transfer (DSSAT), a high fidelity crop simulator. DSSAT has been deve… ▽ More

    Submitted 27 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Report number: Report-no: Inria RR-9460

  3. arXiv:2012.05754  [pdf, other

    cs.LG

    Optimal Thompson Sampling strategies for support-aware CVaR bandits

    Authors: Dorian Baudry, Romain Gautron, Emilie Kaufmann, Odalric-Ambryn Maillard

    Abstract: In this paper we study a multi-arm bandit problem in which the quality of each arm is measured by the Conditional Value at Risk (CVaR) at some level alpha of the reward distribution. While existing works in this setting mainly focus on Upper Confidence Bound algorithms, we introduce a new Thompson Sampling approach for CVaR bandits on bounded rewards that is flexible enough to solve a variety of p… ▽ More

    Submitted 21 March, 2022; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: Presented at the Thirty-eighth International Conference on Machine Learning (ICML 2021). In this version we refine Lemma 2 and correct its proof (does not change the main theorems)