Skip to main content

Showing 1–11 of 11 results for author: Bassetti, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.15267  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.LG math.PR

    Proportional infinite-width infinite-depth limit for deep linear neural networks

    Authors: Federico Bassetti, Lucia Ladelli, Pietro Rotondo

    Abstract: We study the distributional properties of linear neural networks with random parameters in the context of large networks, where the number of layers diverges in proportion to the number of neurons per layer. Prior works have shown that in the infinite-width regime, where the number of neurons per layer grows to infinity while the depth remains fixed, neural networks converge to a Gaussian process,… ▽ More

    Submitted 22 November, 2024; originally announced November 2024.

    MSC Class: 60F05; 60H05; 62E2

  2. arXiv:2406.03260  [pdf, ps, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Feature learning in finite-width Bayesian deep linear networks with multiple outputs and convolutional layers

    Authors: Federico Bassetti, Marco Gherardi, Alessandro Ingrosso, Mauro Pastore, Pietro Rotondo

    Abstract: Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript, we provide rigorous results for the statistics of functions implemented by the aforementioned class of networks, thus moving closer to a complete characterizatio… ▽ More

    Submitted 16 June, 2025; v1 submitted 5 June, 2024; originally announced June 2024.

    MSC Class: 62E20; 62E15; 82B44

    Journal ref: Journal of Machine Learning Research 26(88):1-35, 2025. URL: http://jmlr.org/papers/v26/24-1158.html

  3. arXiv:2308.08481  [pdf, ps, other

    stat.ME

    A Spatiotemporal Gamma Shot Noise Cox Process

    Authors: Federico Bassetti, Roberto Casarin, Matteo Iacopini

    Abstract: A new discrete-time shot noise Cox process for spatiotemporal data is proposed. The random intensity is driven by a dependent sequence of latent gamma random measures. Some properties of the latent process are derived, such as an autoregressive representation and the Laplace functional. Moreover, these results are used to derive the moment, predictive, and pair correlation measures of the proposed… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  4. arXiv:2202.02029  [pdf, other

    stat.ME econ.EM

    First-order integer-valued autoregressive processes with Generalized Katz innovations

    Authors: Ovielt Baltodano Lopez, Federico Bassetti, Giulia Carallo, Roberto Casarin

    Abstract: A new integer--valued autoregressive process (INAR) with Generalised Lagrangian Katz (GLK) innovations is defined. This process family provides a flexible modelling framework for count data, allowing for under and over--dispersion, asymmetry, and excess of kurtosis and includes standard INAR models such as Generalized Poisson and Negative Binomial as special cases. We show that the GLK--INAR proce… ▽ More

    Submitted 17 December, 2024; v1 submitted 4 February, 2022; originally announced February 2022.

    MSC Class: 62F15; 62M10

  5. arXiv:1805.07416  [pdf, other

    math.OC stat.ML

    Computing Kantorovich-Wasserstein Distances on $d$-dimensional histograms using $(d+1)$-partite graphs

    Authors: Gennaro Auricchio, Federico Bassetti, Stefano Gualandi, Marco Veneroni

    Abstract: This paper presents a novel method to compute the exact Kantorovich-Wasserstein distance between a pair of $d$-dimensional histograms having $n$ bins each. We prove that this problem is equivalent to an uncapacitated minimum cost flow problem on a $(d+1)$-partite graph with $(d+1)n$ nodes and $dn^{\frac{d+1}{d}}$ arcs, whenever the cost is separable along the principal $d$-dimensional directions.… ▽ More

    Submitted 11 January, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: 12 pages, 4 figures, 3 tables

  6. arXiv:1804.00445  [pdf, other

    math.OC stat.ML

    On the Computation of Kantorovich-Wasserstein Distances between 2D-Histograms by Uncapacitated Minimum Cost Flows

    Authors: Federico Bassetti, Stefano Gualandi, Marco Veneroni

    Abstract: In this work, we present a method to compute the Kantorovich-Wasserstein distance of order one between a pair of two-dimensional histograms. Recent works in Computer Vision and Machine Learning have shown the benefits of measuring Wasserstein distances of order one between histograms with $n$ bins, by solving a classical transportation problem on very large complete bipartite graphs with $n$ nodes… ▽ More

    Submitted 26 July, 2019; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: 27 pages, 35 figures, 5 tables

    MSC Class: 90C06; 90C08

  7. arXiv:1803.05793  [pdf, ps, other

    stat.ME math.ST

    Hierarchical Species Sampling Models

    Authors: Federico Bassetti, Roberto Casarin, Luca Rossini

    Abstract: This paper introduces a general class of hierarchical nonparametric prior distributions. The random probability measures are constructed by a hierarchy of generalized species sampling processes with possibly non-diffuse base measures. The proposed framework provides a general probabilistic foundation for hierarchical random measures with either atomic or mixed base measures and allows for studying… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

  8. arXiv:1502.07246  [pdf, other

    stat.AP stat.ME

    Bayesian Nonparametric Calibration and Combination of Predictive Distributions

    Authors: Federico Bassetti, Roberto Casarin, Francesco Ravazzolo

    Abstract: We introduce a Bayesian approach to predictive density calibration and combination that accounts for parameter uncertainty and model set incompleteness through the use of random calibration functionals and random combination weights. Building on the work of Ranjan, R. and Gneiting, T. (2010) and Gneiting, T. and Ranjan, R. (2013), we use infinite beta mixtures for the calibration. The proposed Bay… ▽ More

    Submitted 25 October, 2016; v1 submitted 25 February, 2015; originally announced February 2015.

    Comments: arXiv admin note: text overlap with arXiv:1305.2026 by other authors

  9. arXiv:1109.4777  [pdf, ps, other

    math.ST math.PR stat.CO

    Beta-Product Poisson-Dirichlet Processes

    Authors: Federico Bassetti, Roberto Casarin, Fabrizio Leisen

    Abstract: Time series data may exhibit clustering over time and, in a multiple time series context, the clustering behavior may differ across the series. This paper is motivated by the Bayesian non--parametric modeling of the dependence between the clustering structures and the distributions of different time series. We follow a Dirichlet process mixture approach and introduce a new class of multivariate de… ▽ More

    Submitted 22 September, 2011; originally announced September 2011.

  10. arXiv:1012.0866  [pdf, other

    math.ST cs.LG stat.ME

    Generalized Species Sampling Priors with Latent Beta reinforcements

    Authors: Edoardo M. Airoldi, Thiago Costa, Federico Bassetti, Fabrizio Leisen, Michele Guindani

    Abstract: Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a {novel and probabilistically coherent family of non-exchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of indepen… ▽ More

    Submitted 1 August, 2014; v1 submitted 3 December, 2010; originally announced December 2010.

    Comments: For correspondence purposes, Edoardo M. Airoldi's email is [email protected]; Federico Bassetti's email is [email protected]; Michele Guindani's email is [email protected] ; Fabrizo Leisen's email is [email protected]. To appear in the Journal of the American Statistical Association

  11. arXiv:0807.1201  [pdf, ps, other

    q-fin.ST math.PR math.ST stat.ME

    Quantitative comparisons between finitary posterior distributions and Bayesian posterior distributions

    Authors: Federico Bassetti

    Abstract: The main object of Bayesian statistical inference is the determination of posterior distributions. Sometimes these laws are given for quantities devoid of empirical value. This serious drawback vanishes when one confines oneself to considering a finite horizon framework. However, assuming infinite exchangeability gives rise to fairly tractable {\it a posteriori} quantities, which is very attract… ▽ More

    Submitted 8 July, 2008; originally announced July 2008.

    MSC Class: 62C10; 62F15; 60G09