Skip to main content

Showing 1–9 of 9 results for author: F., P C M

.
  1. arXiv:2505.12578  [pdf, ps, other

    stat.ML cs.LG

    Stacked conformal prediction

    Authors: Paulo C. Marques F

    Abstract: We consider a method for conformalizing a stacked ensemble of predictive models, showing that the potentially simple form of the meta-learner at the top of the stack enables a procedure with manageable computational cost that achieves approximate marginal validity without requiring the use of a separate calibration sample. Empirical results indicate that the method compares favorably to a standard… ▽ More

    Submitted 7 July, 2025; v1 submitted 18 May, 2025; originally announced May 2025.

    Comments: 12 pages, 2 figures

  2. arXiv:2410.24145  [pdf, other

    stat.ML cs.LG stat.ME

    Projected random forests and conformal prediction of circular data

    Authors: Paulo C. Marques F., Rinaldo Artes, Helton Graziadei

    Abstract: We apply split conformal prediction techniques to regression problems with circular responses by introducing a suitable conformity score, leading to prediction sets with adaptive arc length and finite-sample coverage guarantees for any circular predictive model under exchangeable data. Leveraging the high performance of existing predictive models designed for linear responses, we analyze a general… ▽ More

    Submitted 25 December, 2024; v1 submitted 31 October, 2024; originally announced October 2024.

    Comments: 7 pages; 4 figures

  3. arXiv:2307.13124  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Conformal prediction for frequency-severity modeling

    Authors: Helton Graziadei, Paulo C. Marques F., Eduardo F. L. de Melo, Rodrigo S. Targino

    Abstract: We present a model-agnostic framework for the construction of prediction intervals of insurance claims, with finite sample statistical guarantees, extending the technique of split conformal prediction to the domain of two-stage frequency-severity modeling. The framework effectiveness is showcased with simulated and real datasets using classical parametric models and contemporary machine learning m… ▽ More

    Submitted 19 June, 2025; v1 submitted 24 July, 2023; originally announced July 2023.

  4. arXiv:2303.02770  [pdf, ps, other

    math.ST cs.LG stat.ML

    Universal distribution of the empirical coverage in split conformal prediction

    Authors: Paulo C. Marques F

    Abstract: When split conformal prediction operates in batch mode with exchangeable data, we determine the exact distribution of the empirical coverage of prediction sets produced for a finite batch of future observables, as well as the exact distribution of its almost sure limit when the batch size goes to infinity. Both distributions are universal, being determined solely by the nominal miscoverage level a… ▽ More

    Submitted 21 September, 2024; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: 6 pages, 1 table

    Journal ref: Statistics & Probability Letters, Volume 219, 2025, 110350.

  5. arXiv:2112.06101  [pdf, ps, other

    stat.ML cs.LG

    Confidence intervals for the random forest generalization error

    Authors: Paulo C. Marques F

    Abstract: We show that the byproducts of the standard training process of a random forest yield not only the well known and almost computationally free out-of-bag point estimate of the model generalization error, but also give a direct path to compute confidence intervals for the generalization error which avoids processes of data splitting and model retraining. Besides the low computational cost involved i… ▽ More

    Submitted 11 March, 2022; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: 10 pages

  6. arXiv:1907.03155  [pdf, other

    stat.ME

    Learning a latent pattern of heterogeneity in the innovation rates of a time series of counts

    Authors: Helton Graziadei, Hedibert F. Lopes, Paulo C. Marques F

    Abstract: We develop a Bayesian hierarchical semiparametric model for phenomena related to time series of counts. The main feature of the model is its capability to learn a latent pattern of heterogeneity in the distribution of the process innovation rates, which are softly clustered through time with the help of a Dirichlet process placed at the top of the model hierarchy. The probabilistic forecasting cap… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

  7. arXiv:1312.2291  [pdf, ps, other

    stat.ME stat.AP

    Predictive analysis of microarray data

    Authors: Paulo C. Marques F., Carlos A. de B. Pereira

    Abstract: Microarray gene expression data are analyzed by means of a Bayesian nonparametric model, with emphasis on prediction of future observables, yielding a method for selection of differentially expressed genes and a classifier.

    Submitted 10 June, 2014; v1 submitted 8 December, 2013; originally announced December 2013.

  8. arXiv:1306.1170  [pdf, ps, other

    math.ST

    On the computation of the marginal likelihood

    Authors: Paulo C. Marques F

    Abstract: We describe briefly in this note a procedure for consistently estimating the marginal likelihood of a statistical model through a sample from the posterior distribution of the model parameters.

    Submitted 10 June, 2014; v1 submitted 3 June, 2013; originally announced June 2013.

  9. arXiv:1209.4947  [pdf, ps, other

    math.ST

    Bayesian Analysis of Simple Random Densities

    Authors: Paulo C. Marques F., Carlos A. de B. Pereira

    Abstract: A tractable nonparametric prior over densities is introduced which is closed under sampling and exhibits proper posterior asymptotics.

    Submitted 10 June, 2014; v1 submitted 21 September, 2012; originally announced September 2012.

    Comments: 19 pages; 6 figures