Skip to main content

Showing 1–9 of 9 results for author: Thomas, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2006.08875  [pdf, other

    cs.LG cs.AI stat.ML

    Model-based Adversarial Meta-Reinforcement Learning

    Authors: Zichuan Lin, Garrett Thomas, Guangwen Yang, Tengyu Ma

    Abstract: Meta-reinforcement learning (meta-RL) aims to learn from multiple training tasks the ability to adapt efficiently to unseen test tasks. Despite the success, existing meta-RL algorithms are known to be sensitive to the task distribution shift. When the test task distribution is different from the training task distribution, the performance may degrade significantly. To address this issue, this pape… ▽ More

    Submitted 27 February, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted by NeurIPS 2020. Code at https://github.com/LinZichuan/AdMRL

  2. arXiv:2005.13239  [pdf, other

    cs.LG cs.AI stat.ML

    MOPO: Model-based Offline Policy Optimization

    Authors: Tianhe Yu, Garrett Thomas, Lantao Yu, Stefano Ermon, James Zou, Sergey Levine, Chelsea Finn, Tengyu Ma

    Abstract: Offline reinforcement learning (RL) refers to the problem of learning policies entirely from a large batch of previously collected data. This problem setting offers the promise of utilizing such datasets to acquire policies without any costly or dangerous active exploration. However, it is also challenging, due to the distributional shift between the offline training data and those states visited… ▽ More

    Submitted 22 November, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: NeurIPS 2020. First two authors contributed equally. Last two authors advised equally

  3. arXiv:1910.08857  [pdf, other

    astro-ph.IM physics.ed-ph stat.AP

    LRP2020: Astrostatistics in Canada

    Authors: Gwendolyn Eadie, Arash Bahramian, Pauline Barmby, Radu Craiu, Derek Bingham, Renée Hložek, JJ Kavelaars, David Stenning, Samantha Benincasa, Guillaume Thomas, Karun Thanjavur, Jo Bovy, Jan Cami, Ray Carlberg, Sam Lawler, Adrian Liu, Henry Ngo, Mubdi Rahman, Michael Rupen

    Abstract: (Abridged from Executive Summary) This white paper focuses on the interdisciplinary fields of astrostatistics and astroinformatics, in which modern statistical and computational methods are applied to and developed for astronomical data. Astrostatistics and astroinformatics have grown dramatically in the past ten years, with international organizations, societies, conferences, workshops, and summe… ▽ More

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: White paper E017 submitted to the Canadian Long Range Plan LRP2020

  4. arXiv:1907.04964  [pdf, other

    cs.LG cs.AI stat.ML

    A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning

    Authors: Nicholas C. Landolfi, Garrett Thomas, Tengyu Ma

    Abstract: The aim of multi-task reinforcement learning is two-fold: (1) efficiently learn by training against multiple tasks and (2) quickly adapt, using limited samples, to a variety of new tasks. In this work, the tasks correspond to reward functions for environments with the same (or similar) dynamical models. We propose to learn a dynamical model during the training process and use this model to perform… ▽ More

    Submitted 3 November, 2019; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: 13 pages, 3 figures

  5. arXiv:1901.06261  [pdf, other

    cs.LG cs.SE stat.ML

    NeuNetS: An Automated Synthesis Engine for Neural Network Design

    Authors: Atin Sood, Benjamin Elder, Benjamin Herta, Chao Xue, Costas Bekas, A. Cristiano I. Malossi, Debashish Saha, Florian Scheidegger, Ganesh Venkataraman, Gegi Thomas, Giovanni Mariani, Hendrik Strobelt, Horst Samulowitz, Martin Wistuba, Matteo Manica, Mihir Choudhury, Rong Yan, Roxana Istrate, Ruchir Puri, Tejaswini Pedapati

    Abstract: Application of neural networks to a vast variety of practical applications is transforming the way AI is applied in practice. Pre-trained neural network models available through APIs or capability to custom train pre-built neural network architectures with customer data has made the consumption of AI by developers much simpler and resulted in broad adoption of these complex AI models. While prebui… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: 14 pages, 12 figures. arXiv admin note: text overlap with arXiv:1806.00250

  6. arXiv:1812.01719  [pdf, other

    cs.CV cs.LG stat.ML

    Knowing what you know in brain segmentation using Bayesian deep neural networks

    Authors: Patrick McClure, Nao Rho, John A. Lee, Jakub R. Kaczmarzyk, Charles Zheng, Satrajit S. Ghosh, Dylan Nielson, Adam G. Thomas, Peter Bandettini, Francisco Pereira

    Abstract: In this paper, we describe a Bayesian deep neural network (DNN) for predicting FreeSurfer segmentations of structural MRI volumes, in minutes rather than hours. The network was trained and evaluated on a large dataset (n = 11,480), obtained by combining data from more than a hundred different sites, and also evaluated on another completely held-out dataset (n = 418). The network was trained using… ▽ More

    Submitted 18 September, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: Submitted to Frontiers in Neuroinformatics

  7. arXiv:1810.03085  [pdf, other

    stat.AP

    Analysis of a longitudinal multilevel experiment using GAMLSSs

    Authors: Gustavo Thomas, Alexandre Igor de Azevedo Pereira, Cristian Marcelo Villegas Lobos, Clarice G. B. Demétrio.

    Abstract: The standard procedures for analysing hierarquical or grouped data are by (non)linear mixed models or generalized mixed models. However, the generalized additive models for location, scale and shape (GAMLSSs) also allow different types of random effects to be included in the model formulation. Even though already popular in many areas of research, this type of models have not been found to be used… ▽ More

    Submitted 7 October, 2018; originally announced October 2018.

    Comments: 30 pages, 16 figures, received the prize of 2nd best oral presentation at the XV MGEST, Minas Gerais meeting of Statistics, Belo Horizonte, Brazil in 2017

  8. arXiv:1810.02618  [pdf, other

    stat.AP

    Modeling data with zero inflation and overdispersion using GAMLSSs

    Authors: Gustavo Thomas, Luiz R. Nakamura, Rafael A. Moral, Clarice G. B. Demétrio

    Abstract: Count data with high frequencies of zeros are found in many areas, specially in biology. Statistical models to analyze such data started to be developed in the 80s and are still a topic of active research. Such models usually assume a response distribution that belongs to the exponential family of distributions and the analysis is performed under the generalized linear models framework. However, t… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: 19 pages, 7 figures, presented at the 62nd International Biometric Society Reunion (RBras) and 17th Symposium of Applied Statistics in Agronomy (SEAGRO) in 2017 at Lavras-MG, Brazil

  9. arXiv:1602.02867  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    Value Iteration Networks

    Authors: Aviv Tamar, Yi Wu, Garrett Thomas, Sergey Levine, Pieter Abbeel

    Abstract: We introduce the value iteration network (VIN): a fully differentiable neural network with a `planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a… ▽ More

    Submitted 20 March, 2017; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: Fixed missing table values

    Journal ref: Advances in Neural Information Processing Systems 29 pages 2154--2162, 2016