Showing 1–10 of 10 results for author: Klaassen, S

  1. arXiv:2503.17290  [pdf, other]

    stat.ML cs.LG econ.EM stat.ME

    Calibration Strategies for Robust Causal Estimation: Theoretical and Empirical Insights on Propensity Score-Based Estimators

    Authors: Sven Klaassen, Jan Rabenseifner, Jannis Kueck, Philipp Bach

    Abstract: The partitioning of data for estimation and calibration critically impacts the performance of propensity score-based estimators like inverse probability weighting (IPW) and double/debiased machine learning (DML) frameworks. We extend recent advances in calibration techniques for propensity score estimation, improving the robustness of propensity scores in challenging settings such as limited overl…

    Submitted 19 May, 2025; v1 submitted 21 March, 2025; originally announced March 2025.

  2. arXiv:2501.00382  [pdf, other]

    econ.GN cs.AI stat.AP stat.ML

    Adventures in Demand Analysis Using AI

    Authors: Philipp Bach, Victor Chernozhukov, Sven Klaassen, Martin Spindler, Jan Teichert-Kluge, Suhas Vijaykumar

    Abstract: This paper advances empirical demand analysis by integrating multimodal product representations derived from artificial intelligence (AI). Using a detailed dataset of toy cars on Amazon.com, we combine text descriptions, images, and tabular covariates to represent each product using transformer-based embedding models. These embeddings capture nuanced attributes, such as quality, branding,…

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 42 pages, 9 figures

  3. arXiv:2406.11308  [pdf, other]

    cs.LG cs.AI econ.EM stat.ML

    Management Decisions in Manufacturing using Causal Machine Learning -- To Rework, or not to Rework?

    Authors: Philipp Schwarz, Oliver Schacht, Sven Klaassen, Daniel Grünbaum, Sebastian Imhof, Martin Spindler

    Abstract: In this paper, we present a data-driven model for estimating optimal rework policies in manufacturing systems. We consider a single production stage within a multistage, lot-based system that allows for optional rework steps. While the rework decision depends on an intermediate state of the lot and system, the final product inspection, and thus the assessment of the actual yield, is delayed until…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 30 pages, 10 figures

  4. arXiv:2402.04674  [pdf, other]

    econ.EM stat.ML

    Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study

    Authors: Philipp Bach, Oliver Schacht, Victor Chernozhukov, Sven Klaassen, Martin Spindler

    Abstract: Proper hyperparameter tuning is essential for achieving optimal performance of modern machine learning (ML) methods in predictive tasks. While there is an extensive literature on tuning ML learners for prediction, there is little guidance available on tuning ML learners for causal machine learning and how to select among different ML learners. In this paper, we empirically assess the relation…

    Submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2402.01785  [pdf, other]

    cs.LG cs.AI econ.EM stat.ME stat.ML

    DoubleMLDeep: Estimation of Causal Effects with Multimodal Data

    Authors: Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar

    Abstract: This paper explores the use of unstructured, multimodal data, namely text and images, in causal inference and treatment effect estimation. We propose a neural network architecture that is adapted to the double machine learning (DML) framework, specifically the partially linear model. An additional contribution of our paper is a new method to generate a semi-synthetic dataset which can be used to e…

    Submitted 1 February, 2024; originally announced February 2024.

    MSC Class: 62; 91

    ACM Class: I.2.0

  6. arXiv:2306.04223  [pdf, other]

    stat.ML cs.AI cs.LG

    Causally Learning an Optimal Rework Policy

    Authors: Oliver Schacht, Sven Klaassen, Philipp Schwarz, Martin Spindler, Daniel Grünbaum, Sebastian Imhof

    Abstract: In manufacturing, rework refers to an optional step of a production process which aims to eliminate errors or remedy products that do not meet the desired quality standards. Reworking a production lot involves repeating a previous production stage with adjustments to ensure that the final product meets the required specifications. While offering the chance to improve the yield and thus increase th…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 22 pages, 15 figures

  7. arXiv:2103.09603  [pdf, other]

    stat.ML cs.LG econ.EM

    DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R

    Authors: Philipp Bach, Victor Chernozhukov, Malte S. Kurz, Martin Spindler, Sven Klaassen

    Abstract: The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consists of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance compo…

    Submitted 5 June, 2024; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: 56 pages, 8 Figures, 1 Table; Updated version for DoubleML 1.0.0; Updated version due to changes in R package paradox (for parameter tuning with mlr3)

    MSC Class: 62-04

    Journal ref: Journal of Statistical Software 2024

  8. arXiv:2004.01623  [pdf, other]

    stat.ME econ.EM stat.ML

    Estimation and Uniform Inference in Sparse High-Dimensional Additive Models

    Authors: Philipp Bach, Sven Klaassen, Jannis Kueck, Martin Spindler

    Abstract: We develop a novel method to construct uniformly valid confidence bands for a nonparametric component $f_1$ in the sparse additive model $Y=f_1(X_1)+\ldots + f_p(X_p) + \varepsilon$ in a high-dimensional setting. Our method integrates sieve estimation into a high-dimensional Z-estimation framework, facilitating the construction of uniformly valid confidence bands for the target component $f_1$. To…

    Submitted 23 April, 2024; v1 submitted 3 April, 2020; originally announced April 2020.

    MSC Class: 62G08; 62-07

  9. arXiv:1808.10532  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Uniform Inference in High-Dimensional Gaussian Graphical Models

    Authors: Sven Klaassen, Jannis Kück, Martin Spindler, Victor Chernozhukov

    Abstract: Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models with the number of target parameters $d$ being possibly much larger than the sample size. This is particularly important when certain features or structures of a caus…

    Submitted 3 December, 2018; v1 submitted 30 August, 2018; originally announced August 2018.

    Comments: 59 pages, 2 figures, 6 tables

    MSC Class: 62H15; 62J07

  10. arXiv:1712.07364  [pdf, other]

    stat.ME econ.EM math.ST stat.ML

    Transformation Models in High-Dimensions

    Authors: Sven Klaassen, Jannis Kueck, Martin Spindler

    Abstract: Transformation models are a very important tool for applied statisticians and econometricians. In many applications, the dependent variable is transformed so that homogeneity or normal distribution of the error holds. In this paper, we analyze transformation models in a high-dimensional setting, where the set of potential covariates is large. We propose an estimator for the transformation paramete…

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: 63 pages, 4 figures

    MSC Class: 62H; 62F