Skip to main content

Showing 1–7 of 7 results for author: Colombo, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.11112  [pdf, other

    stat.AP

    Simple yet effective: a comparative study of statistical models for yearly hurricane forecasting

    Authors: Pietro Colombo, Raffaele Mattera, Philipp Otto

    Abstract: In this paper, we study the problem of forecasting the next year's number of Atlantic hurricanes, which is relevant in many fields of applications such as land-use planning, hazard mitigation, reinsurance and long-term weather derivative market. Considering a set of well-known predictors, we compare the forecasting accuracy of both machine learning and simpler models, showing that the latter may b… ▽ More

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: 16 pages, 7 figures, submitted to Environmetrics, Repository of the project: https://github.com/Pietrostat193/Hurricane-forecasting

    MSC Class: 62P12; 62M10

  2. arXiv:2407.20295  [pdf, other

    stat.AP stat.CO stat.ME

    Warped multifidelity Gaussian processes for data fusion of skewed environmental data

    Authors: Pietro Colombo, Claire Miller, Xiaochen Yang, Ruth O'Donnell, Paolo Maranzano

    Abstract: Understanding the dynamics of climate variables is paramount for numerous sectors, like energy and environmental monitoring. This study focuses on the critical need for a precise mapping of environmental variables for national or regional monitoring networks, a task notably challenging when dealing with skewed data. To address this issue, we propose a novel data fusion approach, the \textit{warped… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2306.03522  [pdf, other

    cs.LG cs.CV stat.ML

    A Functional Data Perspective and Baseline On Multi-Layer Out-of-Distribution Detection

    Authors: Eduardo Dadalto, Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida

    Abstract: A key feature of out-of-distribution (OOD) detection is to exploit a trained neural network by extracting statistical patterns and relationships through the multi-layer classifier to detect shifts in the expected input data distribution. Despite achieving solid results, several state-of-the-art methods rely on the penultimate or last layer outputs only, leaving behind valuable information for OOD… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  4. arXiv:2202.06618  [pdf, other

    cs.LG stat.ML

    A Differential Entropy Estimator for Training Neural Networks

    Authors: Georg Pichler, Pierre Colombo, Malik Boudiaf, Günther Koliander, Pablo Piantanida

    Abstract: Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when learn disentangled or compressed representations of high dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of a… ▽ More

    Submitted 19 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: to be presented at ICML2022 in Baltimore, MD

  5. arXiv:2103.12711  [pdf, other

    stat.ML cs.LG

    A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Pierre Colombo, Stéphan Clémençon, Florence d'Alché-Buc

    Abstract: The design of a metric between probability distributions is a longstanding problem motivated by numerous applications in Machine Learning. Focusing on continuous probability distributions on the Euclidean space $\mathbb{R}^d$, we introduce a novel pseudo-metric between probability distributions by leveraging the extension of univariate quantiles to multivariate spaces. Data depth is a nonparametri… ▽ More

    Submitted 10 October, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

  6. arXiv:2101.04430  [pdf

    physics.med-ph stat.ML

    A patient-specific approach for quantitative and automatic analysis of computed tomography images in lung disease: application to COVID-19 patients

    Authors: L. Berta, C. De Mattia, F. Rizzetto, S. Carrazza, P. E. Colombo, R. Fumagalli, T. Langer, D. Lizio, A. Vanzulli, A. Torresin

    Abstract: Quantitative metrics in lung computed tomography (CT) images have been widely used, often without a clear connection with physiology. This work proposes a patient-independent model for the estimation of well-aerated volume of lungs in CT images (WAVE). A Gaussian fit, with mean (Mu.f) and width (Sigma.f) values, was applied to the lower CT histogram data points of the lung to provide the estimatio… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 31 pages, 7 figures, accepted in EJMP

    Report number: TIF-UNIMI-2020-26

  7. arXiv:2003.11593  [pdf, other

    stat.ML cs.CL cs.LG

    Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

    Authors: Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

    Abstract: The dominant approaches to text representation in natural language rely on learning embeddings on massive corpora which have convenient properties such as compositionality and distance preservation. In this paper, we develop a novel method to learn a heavy-tailed embedding with desirable regularity properties regarding the distributional tails, which allows to analyze the points far away from the… ▽ More

    Submitted 25 March, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2020