Showing 1–17 of 17 results for author: Campos, C

  1. arXiv:2209.10584  [pdf, other]

    cs.LG cs.AI stat.ML

    Continuous Mixtures of Tractable Probabilistic Models

    Authors: Alvaro H. C. Correia, Gennaro Gala, Erik Quaeghebeur, Cassio de Campos, Robert Peharz

    Abstract: Probabilistic models based on continuous latent spaces, such as variational autoencoders, can be understood as uncountable mixture models where components depend continuously on the latent code. They have proven to be expressive tools for generative and probabilistic modelling, but are at odds with tractable probabilistic inference, that is, computing marginals and conditionals of the represented…

    Submitted 24 March, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  2. arXiv:2208.00027  [pdf, other]

    stat.AP

    Hypothesis tests for multiple responses regression: effect of probiotics on addiction and binge eating disorder

    Authors: Lineu Alberto Cavazani de Freitas, Ligia de Oliveira Carlos, Antônio Carlos Ligocki Campos, Wagner Hugo Bonat

    Abstract: Clinical trials are common in medical research where multiple non-Gaussian responses and time-dependent observations are frequent. The analysis of data from these studies requires statistical modeling techniques that take these characteristics into account. We propose a general strategy based on the Wald statistics to perform hypothesis tests like ANOVAs, MANOVAs and multiple comparison tests on r…

    Submitted 29 July, 2022; originally announced August 2022.

  3. Bayesian Modelling of Multivalued Power Curves from an Operational Wind Farm

    Authors: L. A. Bull, P. A. Gardner, T. J. Rogers, N. Dervilis, E. J. Cross, E. Papatheou, A. E. Maguire, C. Campos, K. Worden

    Abstract: Power curves capture the relationship between wind speed and output power for a specific wind turbine. Accurate regression models of this function prove useful in monitoring, maintenance, design, and planning. In practice, however, the measurements do not always correspond to the ideal curve: power curtailments will appear as (additional) functional components. Such multivalued relationships canno…

    Submitted 30 November, 2021; originally announced November 2021.

    Journal ref: Mechanical Systems and Signal Processing (2021): 108530

  4. arXiv:2105.04001  [pdf, other]

    stat.ML cs.LG

    Bayesian Kernelised Test of (In)dependence with Mixed-type Variables

    Authors: Alessio Benavoli, Cassio de Campos

    Abstract: A fundamental task in AI is to assess (in)dependence between mixed-type variables (text, image, sound). We propose a Bayesian kernelised correlation test of (in)dependence using a Dirichlet process model. The new measure of (in)dependence allows us to answer some fundamental questions: Based on data, are (mixed-type) variables independent? How likely is dependence/independence to hold? How high is…

    Submitted 9 May, 2021; originally announced May 2021.

  5. arXiv:2007.05721  [pdf, other]

    stat.ML cs.LG

    Towards Robust Classification with Deep Generative Forests

    Authors: Alvaro H. C. Correia, Robert Peharz, Cassio de Campos

    Abstract: Decision Trees and Random Forests are among the most widely used machine learning models, and often achieve state-of-the-art performance in tabular, domain-agnostic datasets. Nonetheless, being primarily discriminative models they lack principled methods to manipulate the uncertainty of predictions. In this paper, we exploit Generative Forests (GeFs), a recent class of deep probabilistic models th…

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: Presented at the ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  6. arXiv:2006.14937  [pdf, other]

    cs.LG cs.AI stat.ML

    Joints in Random Forests

    Authors: Alvaro H. C. Correia, Robert Peharz, Cassio de Campos

    Abstract: Decision Trees (DTs) and Random Forests (RFs) are powerful discriminative learners and tools of central importance to the everyday machine learning practitioner and data scientist. Due to their discriminative nature, however, they lack principled methods to process inputs with missing features or to detect outliers, which requires pairing them with imputation techniques or a separate generative mo…

    Submitted 19 November, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  7. arXiv:1905.09943  [pdf, ps, other]

    stat.ML cs.LG

    On Pruning for Score-Based Bayesian Network Structure Learning

    Authors: Alvaro H. C. Correia, James Cussens, Cassio de Campos

    Abstract: Many algorithms for score-based Bayesian network structure learning (BNSL), in particular exact ones, take as input a collection of potentially optimal parent sets for each variable in the data. Constructing such collections naively is computationally intensive since the number of parent sets grows exponentially with the number of variables. Thus, pruning techniques are not only desirable but esse…

    Submitted 2 August, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics (AISTATS 2020), in PMLR 108:2709-2718

  8. arXiv:1707.06194  [pdf, ps, other]

    cs.AI stat.ML

    Entropy-based Pruning for Learning Bayesian Networks using BIC

    Authors: Cassio P. de Campos, Mauro Scanagatta, Giorgio Corani, Marco Zaffalon

    Abstract: For decomposable score-based structure learning of Bayesian networks, existing approaches first compute a collection of candidate parent sets for each variable and then optimize over this collection by choosing one parent set for each variable without creating directed cycles while maximizing the total score. We target the task of constructing the collection of candidate parent sets when the score…

    Submitted 19 July, 2017; originally announced July 2017.

  9. arXiv:1608.07734  [pdf, ps, other]

    cs.AI stat.ML

    Learning Bayesian Networks with Incomplete Data by Augmentation

    Authors: Tameem Adel, Cassio P. de Campos

    Abstract: We present new algorithms for learning Bayesian networks from data with missing values using a data augmentation approach. An exact Bayesian network learning algorithm is obtained by recasting the problem into a standard Bayesian network learning problem without missing data. To the best of our knowledge, this is the first exact algorithm for this problem. As expected, the exact algorithm does not…

    Submitted 8 October, 2016; v1 submitted 27 August, 2016; originally announced August 2016.

  10. arXiv:1406.1411  [pdf, other]

    cs.AI cs.LG stat.ML

    Advances in Learning Bayesian Networks of Bounded Treewidth

    Authors: Siqi Nie, Denis Deratani Maua, Cassio Polpo de Campos, Qiang Ji

    Abstract: This work presents novel algorithms for learning Bayesian network structures with bounded treewidth. Both exact and approximate methods are developed. The exact method combines mixed-integer linear programming formulations for structure learning and treewidth computation. The approximate method consists in uniformly sampling $k$-trees (maximal graphs of treewidth $k$), and subsequently selecting,…

    Submitted 6 June, 2014; v1 submitted 5 June, 2014; originally announced June 2014.

    Comments: 23 pages, 2 figures, 3 tables

    MSC Class: 68T37

  11. Confidence Statements for Ordering Quantiles

    Authors: Carlos A. de B. Pereira, Cassio P. de Campos, Adriano Polpo

    Abstract: This work proposes Quor, a simple yet effective nonparametric method to compare independent samples with respect to corresponding quantiles of their populations. The method is solely based on the order statistics of the samples, and independence is its only requirement. All computations are performed using exact distributions with no need for any asymptotic considerations, and yet can be run using…

    Submitted 17 July, 2014; v1 submitted 21 December, 2012; originally announced December 2012.

    Journal ref: Entropy 2016, 18, 357

  12. arXiv:1212.2458  [pdf]

    cs.AI stat.CO

    Inference in Polytrees with Sets of Probabilities

    Authors: Jose Carlos Ferreira da Rocha, Fabio Gagliardi Cozman, Cassio Polpo de Campos

    Abstract: Inferences in directed acyclic graphs associated with probability sets and probability intervals are NP-hard, even for polytrees. In this paper we focus on such inferences, and propose: 1) a substantial improvement on Tessem's A/R algorithm for polytrees with probability intervals; 2) a new algorithm for direction-based local search (in sets of probability) that improves on e…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-217-224

  13. arXiv:1207.1367  [pdf]

    cs.AI stat.ML

    Belief Updating and Learning in Semi-Qualitative Probabilistic Networks

    Authors: Cassio Polpo de Campos, Fabio Gagliardi Cozman

    Abstract: This paper explores semi-qualitative probabilistic networks (SQPNs) that combine numeric and qualitative information. We first show that exact inferences with SQPNs are $\mathrm{NP}^{\mathrm{PP}}$-complete. We then show that existing qualitative relations in SQPNs (plus probabilistic logic and imprecise assessments) can be dealt with effectively through multilinear programming. We then discuss learning: we consider a maximum…

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-153-160

  14. arXiv:1206.6424  [pdf]

    cs.AI stat.ML

    Anytime Marginal MAP Inference

    Authors: Denis Maua, Cassio De Campos

    Abstract: This paper presents a new anytime algorithm for the marginal MAP problem in graphical models. The algorithm is described in detail, its complexity and convergence rate are studied, and relations to previous theoretical results for the problem are discussed. It is shown that the algorithm runs in polynomial-time if the underlying graph of the model has bounded tree-width, and that it provides guara…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  15. arXiv:1110.3239  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Improving parameter learning of Bayesian nets from incomplete data

    Authors: Giorgio Corani, Cassio P. De Campos

    Abstract: This paper addresses the estimation of parameters of a Bayesian network from incomplete data. The task is usually tackled by running the Expectation-Maximization (EM) algorithm several times in order to obtain a high log-likelihood estimate. We argue that choosing the maximum log-likelihood estimate (as well as the maximum penalized log-likelihood and the maximum a posteriori estimate) has severe…

    Submitted 12 October, 2011; originally announced October 2011.

  16. arXiv:1109.1754  [pdf, ps, other]

    cs.AI cs.CC stat.ML

    Solving Limited Memory Influence Diagrams

    Authors: Denis Deratani Mauá, Cassio Polpo de Campos, Marco Zaffalon

    Abstract: We present a new algorithm for exactly solving decision making problems represented as influence diagrams. We do not require the usual assumptions of no forgetting and regularity; this allows us to solve problems with simultaneous decisions and limited information. The algorithm is empirically shown to outperform a state-of-the-art algorithm on randomly generated problems of up to 150 variables an…

    Submitted 9 September, 2011; v1 submitted 8 September, 2011; originally announced September 2011.

    Comments: 43 pages, 8 figures

    MSC Class: 68T37 ACM Class: I.2.1; I.2.8; F.2

  17. arXiv:1007.3884  [pdf, ps, other]

    cs.AI cs.CC stat.ML

    New Results for the MAP Problem in Bayesian Networks

    Authors: Cassio P. de Campos

    Abstract: This paper presents new results for the (partial) maximum a posteriori (MAP) problem in Bayesian networks, which is the problem of querying the most probable state configuration of some of the network variables given evidence. First, it is demonstrated that the problem remains hard even in networks with very simple topology, such as binary polytrees and simple trees (including the Naive Bayes stru…

    Submitted 29 July, 2010; v1 submitted 22 July, 2010; originally announced July 2010.

    Comments: A couple of typos were fixed, as well as the notation in part of section 4, which was misleading. Theoretical and empirical results have not changed