Skip to main content

Showing 1–5 of 5 results for author: Aerni, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05126  [pdf, ps, other

    cs.CR cs.LG

    Membership Inference Attacks on Sequence Models

    Authors: Lorenzo Rossi, Michael Aerni, Jie Zhang, Florian Tramèr

    Abstract: Sequence models, such as Large Language Models (LLMs) and autoregressive image generators, have a tendency to memorize and inadvertently leak sensitive information. While this tendency has critical legal implications, existing tools are insufficient to audit the resulting risks. We hypothesize that those tools' shortcomings are due to mismatched assumptions. Thus, we argue that effectively measuri… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Accepted to the 8th Deep Learning Security and Privacy Workshop (DLSP) workshop (best paper award)

  2. arXiv:2411.10242  [pdf, other

    cs.CL cs.LG

    Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

    Authors: Michael Aerni, Javier Rando, Edoardo Debenedetti, Nicholas Carlini, Daphne Ippolito, Florian Tramèr

    Abstract: Large language models memorize parts of their training data. Memorizing short snippets and facts is required to answer questions about the world and to be fluent in any language. But models have also been shown to reproduce long verbatim sequences of memorized text when prompted by a motivated adversary. In this work, we investigate an intermediate regime of memorization that we call non-adversari… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  3. Evaluations of Machine Learning Privacy Defenses are Misleading

    Authors: Michael Aerni, Jie Zhang, Florian Tramèr

    Abstract: Empirical defenses for machine learning privacy forgo the provable guarantees of differential privacy in the hope of achieving higher utility while resisting realistic adversaries. We identify severe pitfalls in existing empirical privacy evaluations (based on membership inference attacks) that result in misleading conclusions. In particular, we show that prior evaluations fail to characterize the… ▽ More

    Submitted 5 September, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted at ACM CCS 2024

  4. arXiv:2301.07605  [pdf, other

    stat.ML cs.LG

    Strong inductive biases provably prevent harmless interpolation

    Authors: Michael Aerni, Marco Milanta, Konstantin Donhauser, Fanny Yang

    Abstract: Classical wisdom suggests that estimators should avoid fitting noise to achieve good generalization. In contrast, modern overparameterized models can yield small test error despite interpolating noise -- a phenomenon often called "benign overfitting" or "harmless interpolation". This paper argues that the degree to which interpolation is harmless hinges upon the strength of an estimator's inductiv… ▽ More

    Submitted 1 March, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted at ICLR 2023

  5. arXiv:2108.02883  [pdf, other

    stat.ML cs.LG

    Interpolation can hurt robust generalization even when there is no noise

    Authors: Konstantin Donhauser, Alexandru Ţifrea, Michael Aerni, Reinhard Heckel, Fanny Yang

    Abstract: Numerous recent works show that overparameterization implicitly reduces variance for min-norm interpolators and max-margin classifiers. These findings suggest that ridge regularization has vanishing benefits in high dimensions. We challenge this narrative by showing that, even in the absence of noise, avoiding interpolation through ridge regularization can significantly improve generalization. We… ▽ More

    Submitted 16 December, 2021; v1 submitted 5 August, 2021; originally announced August 2021.