Skip to main content

Showing 1–7 of 7 results for author: Heinze-Deml, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.19575  [pdf, other

    stat.ML cs.LG

    Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare

    Authors: Arno Blaas, Adam Goliński, Andrew Miller, Luca Zappella, Jörn-Henrik Jacobsen, Christina Heinze-Deml

    Abstract: We consider robustness to distribution shifts in the context of diagnostic models in healthcare, where the prediction target $Y$, e.g., the presence of a disease, is causally upstream of the observations $X$, e.g., a biomarker. Distribution shifts may occur, for instance, when the training data is collected in a domain with patients having particular demographic characteristics while the model is… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  2. arXiv:2410.14582  [pdf, other

    cs.AI cs.CL

    Do LLMs estimate uncertainty well in instruction-following?

    Authors: Juyeon Heo, Miao Xiong, Christina Heinze-Deml, Jaya Narain

    Abstract: Large language models (LLMs) could be valuable personal AI agents across various domains, provided they can precisely follow user instructions. However, recent studies have shown significant limitations in LLMs' instruction-following capabilities, raising concerns about their reliability in high-stakes applications. Accurately estimating LLMs' uncertainty in adhering to instructions is critical to… ▽ More

    Submitted 28 March, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  3. arXiv:2410.14516  [pdf, other

    cs.AI cs.CL

    Do LLMs "know" internally when they follow instructions?

    Authors: Juyeon Heo, Christina Heinze-Deml, Oussama Elachqar, Kwan Ho Ryan Chan, Shirley Ren, Udhay Nallasamy, Andy Miller, Jaya Narain

    Abstract: Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided constraints and guidelines. However, LLMs often fail to follow even simple and clear instructions. To improve instruction-following behavior and prevent undesirable outputs, a deeper understanding of how LLMs' internal states relate to these outcomes is r… ▽ More

    Submitted 28 March, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  4. arXiv:2009.13962  [pdf, other

    cs.LG stat.ML

    Think before you act: A simple baseline for compositional generalization

    Authors: Christina Heinze-Deml, Diane Bouchacourt

    Abstract: Contrarily to humans who have the ability to recombine familiar expressions to create novel ones, modern neural networks struggle to do so. This has been emphasized recently with the introduction of the benchmark dataset "gSCAN" (Ruis et al. 2020), aiming to evaluate models' performance at compositional generalization in grounded language understanding. In this work, we challenge the gSCAN benchma… ▽ More

    Submitted 1 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

  5. arXiv:1906.11235  [pdf, other

    cs.LG cs.CV stat.ML

    Invariance-inducing regularization using worst-case transformations suffices to boost accuracy and spatial robustness

    Authors: Fanny Yang, Zuowen Wang, Christina Heinze-Deml

    Abstract: This work provides theoretical and empirical evidence that invariance-inducing regularizers can increase predictive accuracy for worst-case spatial transformations (spatial robustness). Evaluated on these adversarially transformed examples, we demonstrate that adding regularization on top of standard or adversarial training reduces the relative error by 20% for CIFAR10 without increasing the compu… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  6. arXiv:1710.11469  [pdf, other

    stat.ML cs.LG

    Conditional Variance Penalties and Domain Shift Robustness

    Authors: Christina Heinze-Deml, Nicolai Meinshausen

    Abstract: When training a deep neural network for image classification, one can broadly distinguish between two types of latent features of images that will drive the classification. We can divide latent features into (i) "core" or "conditionally invariant" features $X^\text{core}$ whose distribution $X^\text{core}\vert Y$, conditional on the class $Y$, does not change substantially across domains and (ii)… ▽ More

    Submitted 13 April, 2019; v1 submitted 31 October, 2017; originally announced October 2017.

  7. arXiv:1703.00403  [pdf, other

    stat.ML cs.CR cs.DC cs.LG

    Preserving Differential Privacy Between Features in Distributed Estimation

    Authors: Christina Heinze-Deml, Brian McWilliams, Nicolai Meinshausen

    Abstract: Privacy is crucial in many applications of machine learning. Legal, ethical and societal issues restrict the sharing of sensitive data making it difficult to learn from datasets that are partitioned between many parties. One important instance of such a distributed setting arises when information about each record in the dataset is held by different data owners (the design matrix is "vertically-pa… ▽ More

    Submitted 27 June, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    Journal ref: Stat 7 (1), 2018