Skip to main content

Showing 1–15 of 15 results for author: Côté, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.14916  [pdf, other

    stat.ML cs.LG

    From Point to probabilistic gradient boosting for claim frequency and severity prediction

    Authors: Dominik Chevalier, Marie-Pier Côté

    Abstract: Gradient boosting for decision tree algorithms are increasingly used in actuarial applications as they show superior predictive performance over traditional generalised linear models. Many enhancements to the first gradient boosting machine algorithm exist. We present in a unified notation, and contrast, all the existing point and probabilistic gradient boosting for decision tree algorithms: GBM,… ▽ More

    Submitted 28 April, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: 39 pages, 12 figures, 25 tables, 7 algorithms

    MSC Class: 62P05; 68T05 ACM Class: I.2.6; I.5.1; G.3; A.1

  2. arXiv:2201.13267  [pdf, other

    cs.LG econ.EM stat.AP

    Micro-level Reserving for General Insurance Claims using a Long Short-Term Memory Network

    Authors: Ihsan Chaoubi, Camille Besse, Hélène Cossette, Marie-Pier Côté

    Abstract: Detailed information about individual claims are completely ignored when insurance claims data are aggregated and structured in development triangles for loss reserving. In the hope of extracting predictive power from the individual claims characteristics, researchers have recently proposed to move away from these macro-level methods in favor of micro-level loss reserving approaches. We introduce… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  3. arXiv:2008.06110  [pdf, other

    stat.ML cs.LG

    Synthesizing Property & Casualty Ratemaking Datasets using Generative Adversarial Networks

    Authors: Marie-Pier Cote, Brian Hartman, Olivier Mercier, Joshua Meyers, Jared Cummings, Elijah Harmon

    Abstract: Due to confidentiality issues, it can be difficult to access or share interesting datasets for methodological development in actuarial science, or other fields where personal data are important. We show how to design three different types of generative adversarial networks (GANs) that can build a synthetic insurance dataset from a confidential original dataset. The goal is to obtain synthetic data… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

  4. arXiv:2007.06894  [pdf, other

    stat.ML cs.LG

    When stakes are high: balancing accuracy and transparency with Model-Agnostic Interpretable Data-driven suRRogates

    Authors: Roel Henckaerts, Katrien Antonio, Marie-Pier Côté

    Abstract: Highly regulated industries, like banking and insurance, ask for transparent decision-making algorithms. At the same time, competitive markets are pushing for the use of complex black box models. We therefore present a procedure to develop a Model-Agnostic Interpretable Data-driven suRRogate (maidrr) suited for structured tabular data. Knowledge is extracted from a black box via partial dependence… ▽ More

    Submitted 10 December, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  5. arXiv:2006.13463  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Policy Network for Transferable Active Learning on Graphs

    Authors: Shengding Hu, Zheng Xiong, Meng Qu, Xingdi Yuan, Marc-Alexandre Côté, Zhiyuan Liu, Jian Tang

    Abstract: Graph neural networks (GNNs) have been attracting increasing popularity due to their simplicity and effectiveness in a variety of fields. However, a large number of labeled data is generally required to train these networks, which could be very expensive to obtain in some domains. In this paper, we study active learning for GNNs, i.e., how to efficiently label the nodes on a graph to reduce the an… ▽ More

    Submitted 23 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    ACM Class: I.2

  6. arXiv:1910.08215  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    A Deep Learning-based Framework for the Detection of Schools of Herring in Echograms

    Authors: Alireza Rezvanifar, Tunai Porto Marques, Melissa Cote, Alexandra Branzan Albu, Alex Slonimer, Thomas Tolhurst, Kaan Ersahin, Todd Mudge, Stephane Gauthier

    Abstract: Tracking the abundance of underwater species is crucial for understanding the effects of climate change on marine ecosystems. Biologists typically monitor underwater sites with echosounders and visualize data as 2D images (echograms); they interpret these data manually or semi-automatically, which is time-consuming and prone to inconsistencies. This paper proposes a deep learning framework for the… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted to NeurIPS 2019 workshop on Tackling Climate Change with Machine Learning, Vancouver, Canada

  7. arXiv:1910.03880  [pdf, other

    cs.LG cs.AI stat.ML

    Compatible features for Monotonic Policy Improvement

    Authors: Marcin B. Tomczak, Sergio Valcarcel Macua, Enrique Munoz de Cote, Peter Vrancx

    Abstract: Recent policy optimization approaches have achieved substantial empirical success by constructing surrogate optimization objectives. The Approximate Policy Iteration objective (Schulman et al., 2015a; Kakade and Langford, 2002) has become a standard optimization target for reinforcement learning problems. Using this objective in practice requires an estimator of the advantage function. Policy opti… ▽ More

    Submitted 30 October, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

  8. arXiv:1908.10449  [pdf, other

    cs.CL cs.LG stat.ML

    Interactive Machine Comprehension with Information Seeking Agents

    Authors: Xingdi Yuan, Jie Fu, Marc-Alexandre Cote, Yi Tay, Christopher Pal, Adam Trischler

    Abstract: Existing machine reading comprehension (MRC) models do not scale effectively to real-world applications like web-level information retrieval and question answering (QA). We argue that this stems from the nature of MRC datasets: most of these are static environments wherein the supporting documents and all necessary information are fully observed. In this paper, we propose a simple method that refr… ▽ More

    Submitted 16 April, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: ACL2020

  9. arXiv:1906.08226  [pdf, other

    cs.LG stat.ML

    Unsupervised State Representation Learning in Atari

    Authors: Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm

    Abstract: State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporall… ▽ More

    Submitted 5 November, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; v6 fixes a broken figure reference

  10. arXiv:1905.06821  [pdf, other

    stat.ML cs.LG

    Adaptive Sensor Placement for Continuous Spaces

    Authors: James A Grant, Alexis Boukouvalas, Ryan-Rhys Griffiths, David S Leslie, Sattar Vakili, Enrique Munoz de Cote

    Abstract: We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events. We present a new formulation of the problem as a continuum-armed bandit problem with feedback in the form of partial observations of realisations of an inhomogeneous Poisson process. We design a solution method by combining Thompson sampling with nonparametric inference via increasing… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: 13 pages, accepted to ICML 2019

  11. arXiv:1904.10890  [pdf, other

    stat.AP cs.LG

    Boosting insights in insurance tariff plans with tree-based machine learning methods

    Authors: Roel Henckaerts, Marie-Pier Côté, Katrien Antonio, Roel Verbelen

    Abstract: Pricing actuaries typically operate within the framework of generalized linear models (GLMs). With the upswing of data analytics, our study puts focus on machine learning methods to develop full tariff plans built from both the frequency and severity of claims. We adapt the loss functions used in the algorithms such that the specific characteristics of insurance data are carefully incorporated: hi… ▽ More

    Submitted 2 March, 2020; v1 submitted 12 April, 2019; originally announced April 2019.

  12. arXiv:1812.00855  [pdf, other

    cs.LG cs.CL stat.ML

    Towards Solving Text-based Games by Producing Adaptive Action Spaces

    Authors: Ruo Yu Tao, Marc-Alexandre Côté, Xingdi Yuan, Layla El Asri

    Abstract: To solve a text-based game, an agent needs to formulate valid text commands for a given context and find the ones that lead to success. Recent attempts at solving text-based games with deep reinforcement learning have focused on the latter, i.e., learning to act optimally when valid actions are known in advance. In this work, we propose to tackle the first task and train a model that generates the… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

  13. arXiv:1806.11532  [pdf, other

    cs.LG cs.CL stat.ML

    TextWorld: A Learning Environment for Text-based Games

    Authors: Marc-Alexandre Côté, Ákos Kádár, Xingdi Yuan, Ben Kybartas, Tavian Barnes, Emery Fine, James Moore, Ruo Yu Tao, Matthew Hausknecht, Layla El Asri, Mahmoud Adada, Wendy Tay, Adam Trischler

    Abstract: We introduce TextWorld, a sandbox learning environment for the training and evaluation of RL agents on text-based games. TextWorld is a Python library that handles interactive play-through of text games, as well as backend functions like state tracking and reward assignment. It comes with a curated list of games whose features and challenges we have analyzed. More significantly, it enables users t… ▽ More

    Submitted 8 November, 2019; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: Presented at the Computer Games Workshop at IJCAI 2018, Stockholm

  14. arXiv:1711.05411  [pdf, other

    stat.ML cs.LG

    Z-Forcing: Training Stochastic Recurrent Networks

    Authors: Anirudh Goyal, Alessandro Sordoni, Marc-Alexandre Côté, Nan Rosemary Ke, Yoshua Bengio

    Abstract: Many efforts have been devoted to training generative latent variable models with autoregressive decoders, such as recurrent neural networks (RNN). Stochastic recurrent models have been successful in capturing the variability observed in natural sequential data such as speech. We unify successful ideas from recently proposed architectures into a stochastic recurrent model: each step in the sequenc… ▽ More

    Submitted 16 November, 2017; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: To appear in NIPS'17

  15. arXiv:1710.10363  [pdf, other

    cs.LG cs.MA math.OC stat.ML

    Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning

    Authors: Sergio Valcarcel Macua, Aleksi Tukiainen, Daniel García-Ocaña Hernández, David Baldazo, Enrique Munoz de Cote, Santiago Zazo

    Abstract: We propose a fully distributed actor-critic algorithm approximated by deep neural networks, named \textit{Diff-DAC}, with application to single-task and to average multitask reinforcement learning (MRL). Each agent has access to data from its local task only, but it aims to learn a policy that performs well on average for the whole set of tasks. During the learning process, agents communicate thei… ▽ More

    Submitted 25 October, 2020; v1 submitted 27 October, 2017; originally announced October 2017.

    Journal ref: Presented at Adaptive Learning Agents workshop (ALA2018), July 14th, 2018, Stockholm, Sweden