Skip to main content

Showing 1–3 of 3 results for author: Hagenbuch, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.18572  [pdf, other

    stat.AP math.ST stat.OT

    Bernoulli amputation

    Authors: Marius Hofert, James Jackson, Niels Hagenbuch

    Abstract: An approach to amputation, the process of introducing missing values to a complete dataset, is presented. It allows to construct missingness indicators in a flexible and principled way via copulas and Bernoulli margins and to incorporate dependence in missingness patterns. Besides more classical missingness models such as missing completely at random, missing at random, and missing not at random,… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    MSC Class: 62D10; 62H99; 65C60

  2. arXiv:2307.02650  [pdf, other

    stat.ME stat.AP stat.ML

    A Complete Characterisation of Structured Missingness

    Authors: James Jackson, Robin Mitra, Niels Hagenbuch, Sarah McGough, Chris Harbron

    Abstract: Our capacity to process large complex data sources is ever-increasing, providing us with new, important applied research questions to address, such as how to handle missing values in large-scale databases. Mitra et al. (2023) noted the phenomenon of Structured Missingness (SM), which is where missingness has an underlying structure. Existing taxonomies for defining missingness mechanisms typically… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  3. arXiv:2304.01429  [pdf, other

    stat.ML cs.LG

    Learning from data with structured missingness

    Authors: Robin Mitra, Sarah F. McGough, Tapabrata Chakraborti, Chris Holmes, Ryan Copping, Niels Hagenbuch, Stefanie Biedermann, Jack Noonan, Brieuc Lehmann, Aditi Shenvi, Xuan Vinh Doan, David Leslie, Ginestra Bianconi, Ruben Sanchez-Garcia, Alisha Davies, Maxine Mackintosh, Eleni-Rosalina Andrinopoulou, Anahid Basiri, Chris Harbron, Ben D. MacArthur

    Abstract: Missing data are an unavoidable complication in many machine learning tasks. When data are `missing at random' there exist a range of tools and techniques to deal with the issue. However, as machine learning studies become more ambitious, and seek to learn from ever-larger volumes of heterogeneous data, an increasingly encountered problem arises in which missing values exhibit an association or st… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.