Skip to main content

Showing 1–9 of 9 results for author: Barnes, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.07657  [pdf, other

    stat.AP stat.ME

    Probabilistic Modelling of Multiple Long-Term Condition Onset Times

    Authors: Kieran Richards, Kelly Fleetwood, Regina Prigge, Paolo Missier, Michael Barnes, Nick J. Reynolds, Bruce Guthrie, Sohan Seth

    Abstract: The co-occurrence of multiple long-term conditions (MLTC), or multimorbidity, in an individual can reduce their lifespan and severely impact their quality of life. Exploring the longitudinal patterns, e.g. clusters, of disease accrual can help better understand the genetic and environmental drivers of multimorbidity, and potentially identify individuals who may benefit from early targeted interven… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  2. arXiv:2409.01369  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Imitating Language via Scalable Inverse Reinforcement Learning

    Authors: Markus Wulfmeier, Michael Bloesch, Nino Vieillard, Arun Ahuja, Jorg Bornschein, Sandy Huang, Artem Sokolov, Matt Barnes, Guillaume Desjardins, Alex Bewley, Sarah Maria Elisabeth Bechtle, Jost Tobias Springenberg, Nikola Momchev, Olivier Bachem, Matthieu Geist, Martin Riedmiller

    Abstract: The majority of language model training builds on imitation learning. It covers pretraining, supervised fine-tuning, and affects the starting conditions for reinforcement learning from human feedback (RLHF). The simplicity and scalability of maximum likelihood estimation (MLE) for next token prediction led to its role as predominant paradigm. However, the broader field of imitation learning can mo… ▽ More

    Submitted 9 December, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

    Comments: Published at NeurIPS 2024

  3. arXiv:1912.01649  [pdf, other

    cs.LG stat.ML

    Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation

    Authors: Samuel Ainsworth, Matt Barnes, Siddhartha Srinivasa

    Abstract: In many environments, only a relatively small subset of the complete state space is necessary in order to accomplish a given task. We develop a simple technique using emergency stops (e-stops) to exploit this phenomenon. Using e-stops significantly improves sample complexity by reducing the amount of required exploration, while retaining a performance bound that efficiently trades off the rate of… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Journal ref: NeurIPS 2019

  4. arXiv:1905.12888  [pdf, other

    cs.LG cs.IT cs.RO stat.ML

    Imitation Learning as $f$-Divergence Minimization

    Authors: Liyiming Ke, Sanjiban Choudhury, Matt Barnes, Wen Sun, Gilwoo Lee, Siddhartha Srinivasa

    Abstract: We address the problem of imitation learning with multi-modal demonstrations. Instead of attempting to learn all modes, we argue that in many tasks it is sufficient to imitate any one of them. We show that the state-of-the-art methods such as GAIL and behavior cloning, due to their choice of loss function, often incorrectly interpolate between such modes. Our key insight is to minimize the right d… ▽ More

    Submitted 31 May, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: International Workshop on the Algorithmic Foundations of Robotics (WAFR) 2020

  5. arXiv:1807.06713  [pdf, ps, other

    stat.ML cs.LG

    On the Interaction Effects Between Prediction and Clustering

    Authors: Matt Barnes, Artur Dubrawski

    Abstract: Machine learning systems increasingly depend on pipelines of multiple algorithms to provide high quality and well structured predictions. This paper argues interaction effects between clustering and prediction (e.g. classification, regression) algorithms can cause subtle adverse behaviors during cross-validation that may not be initially apparent. In particular, we focus on the problem of estimati… ▽ More

    Submitted 28 December, 2018; v1 submitted 17 July, 2018; originally announced July 2018.

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Volume 89

  6. arXiv:1703.02679  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Performance Bounds for Graphical Record Linkage

    Authors: Rebecca C. Steorts, Matt Barnes, Willie Neiswanger

    Abstract: Record linkage involves merging records in large, noisy databases to remove duplicate entities. It has become an important area because of its widespread occurrence in bibliometrics, public health, official statistics production, political science, and beyond. Traditional linkage methods directly linking records to one another are computationally infeasible as the number of records grows. As a res… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: 11 pages with supplement; 4 figures and 2 tables; to appear in AISTATS 2017

  7. arXiv:1605.01779  [pdf, other

    stat.ML

    Clustering on the Edge: Learning Structure in Graphs

    Authors: Matt Barnes, Artur Dubrawski

    Abstract: With the recent popularity of graphical clustering methods, there has been an increased focus on the information between samples. We show how learning cluster structure using edge features naturally and simultaneously determines the most likely number of clusters and addresses data scale issues. These results are particularly useful in instances where (a) there are a large number of clusters and (… ▽ More

    Submitted 5 May, 2016; originally announced May 2016.

  8. arXiv:1509.04238  [pdf, ps, other

    cs.DB stat.ML

    A Practioner's Guide to Evaluating Entity Resolution Results

    Authors: Matt Barnes

    Abstract: Entity resolution (ER) is the task of identifying records belonging to the same entity (e.g. individual, group) across one or multiple databases. Ironically, it has multiple names: deduplication and record linkage, among others. In this paper we survey metrics used to evaluate ER results in order to iteratively improve performance and guarantee sufficient quality prior to deployment. Some of these… ▽ More

    Submitted 14 September, 2015; originally announced September 2015.

    Comments: Technical report

  9. arXiv:1509.03302  [pdf, ps, other

    stat.ML cs.CY cs.DB cs.LG

    Performance Bounds for Pairwise Entity Resolution

    Authors: Matt Barnes, Kyle Miller, Artur Dubrawski

    Abstract: One significant challenge to scaling entity resolution algorithms to massive datasets is understanding how performance changes after moving beyond the realm of small, manually labeled reference datasets. Unlike traditional machine learning tasks, when an entity resolution algorithm performs well on small hold-out datasets, there is no guarantee this performance holds on larger hold-out datasets. W… ▽ More

    Submitted 10 September, 2015; originally announced September 2015.