Skip to main content

Showing 1–23 of 23 results for author: Gomes, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2504.11516  [pdf, other

    stat.ML cs.LG physics.chem-ph physics.comp-ph

    FEAT: Free energy Estimators with Adaptive Transport

    Authors: Jiajun He, Yuanqi Du, Francisco Vargas, Yuanqing Wang, Carla P. Gomes, José Miguel Hernández-Lobato, Eric Vanden-Eijnden

    Abstract: We present Free energy Estimators with Adaptive Transport (FEAT), a novel framework for free energy estimation -- a critical challenge across scientific domains. FEAT leverages learned transports implemented via stochastic interpolants and provides consistent, minimum-variance estimators based on escorted Jarzynski equality and controlled Crooks theorem, alongside variational upper and lower bound… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 29 pages, 2 tables, 3 figures

  2. arXiv:2502.06685  [pdf, other

    cs.LG stat.ML

    No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers

    Authors: Jiajun He, Yuanqi Du, Francisco Vargas, Dinghuai Zhang, Shreyas Padhy, RuiKang OuYang, Carla Gomes, José Miguel Hernández-Lobato

    Abstract: We consider the sampling problem, where the aim is to draw samples from a distribution whose density is known only up to a normalization constant. Recent breakthroughs in generative modeling to approximate a high-dimensional data distribution have sparked significant interest in developing neural network-based methods for this challenging problem. However, neural samplers typically incur heavy com… ▽ More

    Submitted 9 April, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: 21 pages, 5 figures, 6 tables

  3. arXiv:2206.08366  [pdf, other

    cs.LG cs.AI cs.MS math.OC stat.ML

    Scalable First-Order Bayesian Optimization via Structured Automatic Differentiation

    Authors: Sebastian Ament, Carla Gomes

    Abstract: Bayesian Optimization (BO) has shown great promise for the global optimization of functions that are expensive to evaluate, but despite many successes, standard approaches can struggle in high dimensions. To improve the performance of BO, prior work suggested incorporating gradient information into a Gaussian process surrogate of the objective, giving rise to kernel matrices of size… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  4. arXiv:2203.07798  [pdf, other

    stat.ML cs.LG

    Igeood: An Information Geometry Approach to Out-of-Distribution Detection

    Authors: Eduardo Dadalto Camara Gomes, Florence Alberge, Pierre Duhamel, Pablo Piantanida

    Abstract: Reliable out-of-distribution (OOD) detection is fundamental to implementing safer modern machine learning (ML) systems. In this paper, we introduce Igeood, an effective method for detecting OOD samples. Igeood applies to any pre-trained neural network, works under various degrees of access to the ML model, does not require OOD samples or assumptions on the OOD data but can also benefit (if availab… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted in ICLR 2022

  5. arXiv:2112.12101  [pdf, other

    cs.SI stat.AP

    Faster indicators of dengue fever case counts using Google and Twitter

    Authors: Giovanni Mizzi, Tobias Preis, Leonardo Soares Bastos, Marcelo Ferreira da Costa Gomes, Claudia Torres Codeço, Helen Susannah Moat

    Abstract: Dengue is a major threat to public health in Brazil, the world's sixth biggest country by population, with over 1.5 million cases recorded in 2019 alone. Official data on dengue case counts is delivered incrementally and, for many reasons, often subject to delays of weeks. In contrast, data on dengue-related Google searches and Twitter messages is available in full with no delay. Here, we describe… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 25 pages, 7 figures (3 in supplementary information)

  6. arXiv:2106.06095  [pdf, other

    cs.LG cs.IT math.OC stat.CO stat.ML

    Sparse Bayesian Learning via Stepwise Regression

    Authors: Sebastian Ament, Carla Gomes

    Abstract: Sparse Bayesian Learning (SBL) is a powerful framework for attaining sparsity in probabilistic models. Herein, we propose a coordinate ascent algorithm for SBL termed Relevance Matching Pursuit (RMP) and show that, as its noise variance parameter goes to zero, RMP exhibits a surprising connection to Stepwise Regression. Further, we derive novel guarantees for Stepwise Regression algorithms, which… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

  7. arXiv:2106.03235  [pdf, other

    math.OC cs.IT math.NA stat.CO

    On the Optimality of Backward Regression: Sparse Recovery and Subset Selection

    Authors: Sebatian Ament, Carla Gomes

    Abstract: Sparse recovery and subset selection are fundamental problems in varied communities, including signal processing, statistics and machine learning. Herein, we focus on an important greedy algorithm for these problems: Backward Stepwise Regression. We present novel guarantees for the algorithm, propose an efficient, numerically stable implementation, and put forth Stepwise Regression with Replacemen… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

  8. arXiv:2102.08427  [pdf, other

    cs.LG stat.ML

    Evaluating Multi-label Classifiers with Noisy Labels

    Authors: Wenting Zhao, Carla Gomes

    Abstract: Multi-label classification (MLC) is a generalization of standard classification where multiple labels may be assigned to a given sample. In the real world, it is more common to deal with noisy datasets than clean datasets, given how modern datasets are labeled by a large group of annotators on crowdsourcing platforms, but little attention has been given to evaluating multi-label classifiers with n… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  9. arXiv:2012.13841  [pdf, other

    cs.LG stat.ML

    Understanding Decoupled and Early Weight Decay

    Authors: Johan Bjorck, Kilian Weinberger, Carla Gomes

    Abstract: Weight decay (WD) is a traditional regularization technique in deep learning, but despite its ubiquity, its behavior is still an area of active research. Golatkar et al. have recently shown that WD only matters at the start of the training in computer vision, upending traditional wisdom. Loshchilov et al. show that for adaptive optimizers, manually decaying weights can outperform adding an $l_2$ p… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

  10. arXiv:2010.16040  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Hurdle Networks for Zero-Inflated Multi-Target Regression: Application to Multiple Species Abundance Estimation

    Authors: Shufeng Kong, Junwen Bai, Jae Hee Lee, Di Chen, Andrew Allyn, Michelle Stuart, Malin Pinsky, Katherine Mills, Carla P. Gomes

    Abstract: A key problem in computational sustainability is to understand the distribution of species across landscapes over time. This question gives rise to challenging large-scale prediction problems since (i) hundreds of species have to be simultaneously modeled and (ii) the survey data are usually inflated with zeros due to the absence of species for a large number of sites. The problem of tackling both… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: Accepted by IJCAI 2020

  11. arXiv:2009.02980  [pdf, other

    cs.LG cs.AI stat.ML

    Efficient Projection Algorithms onto the Weighted l1 Ball

    Authors: Guillaume Perez, Sebastian Ament, Carla Gomes, Michel Barlaud

    Abstract: Projected gradient descent has been proved efficient in many optimization and machine learning problems. The weighted $\ell_1$ ball has been shown effective in sparse system identification and features selection. In this paper we propose three new efficient algorithms for projecting any vector of finite length onto the weighted $\ell_1$ ball. The first two algorithms have a linear worst case compl… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 19 pages

  12. Disentangled Variational Autoencoder based Multi-Label Classification with Covariance-Aware Multivariate Probit Model

    Authors: Junwen Bai, Shufeng Kong, Carla Gomes

    Abstract: Multi-label classification is the challenging task of predicting the presence and absence of multiple targets, involving representation learning and label correlation modeling. We propose a novel framework for multi-label classification, Multivariate Probit Variational AutoEncoder (MPVAE), that effectively learns latent embedding spaces as well as label correlations. MPVAE learns and aligns two pr… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

  13. arXiv:1910.09357  [pdf, other

    cs.LG stat.ML

    Task-Based Learning via Task-Oriented Prediction Network with Applications in Finance

    Authors: Di Chen, Yada Zhu, Xiaodong Cui, Carla P. Gomes

    Abstract: Real-world applications often involve domain-specific and task-based performance objectives that are not captured by the standard machine learning losses, but are critical for decision making. A key challenge for direct integration of more meaningful domain and task-based evaluation criteria into an end-to-end gradient-based training process is the fact that often such performance objectives are n… ▽ More

    Submitted 26 June, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

  14. arXiv:1906.05433  [pdf, other

    cs.CY cs.AI cs.LG stat.ML

    Tackling Climate Change with Machine Learning

    Authors: David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

    Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine lea… ▽ More

    Submitted 5 November, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: For additional resources, please visit the website that accompanies this paper: https://www.climatechange.ai/

  15. arXiv:1906.00855  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Reasoning Networks: Thinking Fast and Slow

    Authors: Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John M. Gregoire, Carla P. Gomes

    Abstract: We introduce Deep Reasoning Networks (DRNets), an end-to-end framework that combines deep learning with reasoning for solving complex tasks, typically in an unsupervised or weakly-supervised setting. DRNets exploit problem structure and prior knowledge by tightly combining logic and constraint reasoning with stochastic-gradient-based neural network optimization. We illustrate the power of DRNets o… ▽ More

    Submitted 4 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

  16. arXiv:1902.05601  [pdf, other

    stat.ML cs.LG

    Exponentially-Modified Gaussian Mixture Model: Applications in Spectroscopy

    Authors: Sebastian Ament, John Gregoire, Carla Gomes

    Abstract: We propose a novel exponentially-modified Gaussian (EMG) mixture residual model. The EMG mixture is well suited to model residuals that are contaminated by a distribution with positive support. This is in contrast to commonly used robust residual models, like the Huber loss or $\ell_1$, which assume a symmetric contaminating distribution and are otherwise asymptotically biased. We propose an expec… ▽ More

    Submitted 14 February, 2019; originally announced February 2019.

  17. arXiv:1811.00458  [pdf, other

    cs.LG cs.AI stat.ML

    Bias Reduction via End-to-End Shift Learning: Application to Citizen Science

    Authors: Di Chen, Carla P. Gomes

    Abstract: Citizen science projects are successful at gathering rich datasets for various applications. However, the data collected by citizen scientists are often biased --- in particular, aligned more with the citizens' preferences than with scientific objectives. We propose the Shift Compensation Network (SCN), an end-to-end learning scheme which learns the shift from the scientific objectives to the bias… ▽ More

    Submitted 14 November, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

  18. arXiv:1806.02375  [pdf, other

    cs.LG cs.AI stat.ML

    Understanding Batch Normalization

    Authors: Johan Bjorck, Carla Gomes, Bart Selman, Kilian Q. Weinberger

    Abstract: Batch normalization (BN) is a technique to normalize activations in intermediate layers of deep neural networks. Its tendency to improve accuracy and speed up training have established BN as a favorite technique in deep learning. Yet, despite its enormous success, there remains little consensus on the exact reason and mechanism behind these improvements. In this paper we take a step towards a bett… ▽ More

    Submitted 30 November, 2018; v1 submitted 31 May, 2018; originally announced June 2018.

  19. arXiv:1803.08591  [pdf, other

    cs.LG stat.ML

    End-to-End Learning for the Deep Multivariate Probit Model

    Authors: Di Chen, Yexiang Xue, Carla P. Gomes

    Abstract: The multivariate probit model (MVP) is a popular classic model for studying binary responses of multiple entities. Nevertheless, the computational challenge of learning the MVP model, given that its likelihood involves integrating over a multidimensional constrained space of latent variables, significantly limits its application in practice. We propose a flexible deep generalization of the classic… ▽ More

    Submitted 13 July, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

  20. arXiv:1709.05612  [pdf, other

    cs.LG stat.ML

    Multi-Entity Dependence Learning with Rich Context via Conditional Variational Auto-encoder

    Authors: Luming Tang, Yexiang Xue, Di Chen, Carla P. Gomes

    Abstract: Multi-Entity Dependence Learning (MEDL) explores conditional correlations among multiple entities. The availability of rich contextual information requires a nimble learning scheme that tightly integrates with deep neural networks and has the ability to capture correlation structures among exponentially many outcomes. We propose MEDL_CVAE, which encodes a conditional multivariate distribution as a… ▽ More

    Submitted 17 September, 2017; originally announced September 2017.

    Comments: The first two authors contribute equally

  21. arXiv:1609.09353  [pdf, other

    cs.LG q-bio.PE stat.ML

    Deep Multi-Species Embedding

    Authors: Di Chen, Yexiang Xue, Shuo Chen, Daniel Fink, Carla Gomes

    Abstract: Understanding how species are distributed across landscapes over time is a fundamental question in biodiversity research. Unfortunately, most species distribution models only target a single species at a time, despite strong ecological evidence that species are not independently distributed. We propose Deep Multi-Species Embedding (DMSE), which jointly embeds vectors corresponding to multiple spec… ▽ More

    Submitted 21 February, 2017; v1 submitted 27 September, 2016; originally announced September 2016.

    Comments: 13 pages

  22. arXiv:1411.7441  [pdf, other

    cs.AI cs.LG stat.ML

    Pattern Decomposition with Complex Combinatorial Constraints: Application to Materials Discovery

    Authors: Stefano Ermon, Ronan Le Bras, Santosh K. Suram, John M. Gregoire, Carla Gomes, Bart Selman, Robert B. van Dover

    Abstract: Identifying important components or factors in large amounts of noisy data is a key problem in machine learning and data mining. Motivated by a pattern decomposition problem in materials discovery, aimed at discovering new materials for renewable energy, e.g. for fuel and solar cells, we introduce CombiFD, a framework for factor based pattern decomposition that allows the incorporation of a-priori… ▽ More

    Submitted 26 November, 2014; originally announced November 2014.

  23. arXiv:1302.6677  [pdf, other

    cs.LG cs.AI stat.ML

    Taming the Curse of Dimensionality: Discrete Integration by Hashing and Optimization

    Authors: Stefano Ermon, Carla P. Gomes, Ashish Sabharwal, Bart Selman

    Abstract: Integration is affected by the curse of dimensionality and quickly becomes intractable as the dimensionality of the problem grows. We propose a randomized algorithm that, with high probability, gives a constant-factor approximation of a general discrete integral defined over an exponentially large set. This algorithm relies on solving only a small number of instances of a discrete combinatorial op… ▽ More

    Submitted 27 February, 2013; originally announced February 2013.