Skip to main content

Showing 1–3 of 3 results for author: Föge, N

.
  1. arXiv:2412.13020  [pdf, other

    math.ST stat.ML

    A Central Limit Theorem for the permutation importance measure

    Authors: Nico Föge, Lena Schmid, Marc Ditzhaus, Markus Pauly

    Abstract: Random Forests have become a widely used tool in machine learning since their introduction in 2001, known for their strong performance in classification and regression tasks. One key feature of Random Forests is the Random Forest Permutation Importance Measure (RFPIM), an internal, non-parametric measure of variable importance. While widely used, theoretical work on RFPIM is sparse, and most resea… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  2. arXiv:2404.06850   

    math.ST

    Even naive trees are consistent

    Authors: Nico Föge, Markus Pauly, Lena Schmid, Marc Ditzhaus

    Abstract: The last decade has shed some light on theoretical properties such as their consistency for regression tasks. In the current paper, we propose a new class of very simple learners based on so-called naive trees. These naive trees partition the feature space completely at random and independent of the data. Although counter-intuitive, we prove these naive trees and ensembles are consistent under fai… ▽ More

    Submitted 17 December, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Wrong proof

    MSC Class: Primary 62G05; secondary 62G20

  3. arXiv:2401.14161  [pdf, other

    stat.AP stat.ML

    Adapting tree-based multiple imputation methods for multi-level data? A simulation study

    Authors: Nico Föge, Jakob Schwerter, Ketevan Gurtskaia, Markus Pauly, Philipp Doebler

    Abstract: When data have a hierarchical structure, such as students nested within classrooms, ignoring dependencies between observations can compromise the validity of imputation procedures. Standard tree-based imputation methods implicitly assume independence between observations, limiting their applicability in multilevel data settings. Although Multivariate Imputation by Chained Equations (MICE) is widel… ▽ More

    Submitted 19 March, 2025; v1 submitted 25 January, 2024; originally announced January 2024.