Skip to main content

Showing 1–3 of 3 results for author: Dern, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2410.16201  [pdf, ps, other

    stat.ML cs.LG

    Theoretical Limitations of Ensembles in the Age of Overparameterization

    Authors: Niclas Dern, John P. Cunningham, Geoff Pleiss

    Abstract: Classic ensembles generalize better than any single component model. In contrast, recent empirical studies find that modern ensembles of (overparameterized) neural networks may not provide any inherent generalization advantage over single but larger neural networks. This paper clarifies how modern overparameterized ensembles differ from their classic underparameterized counterparts, using ensemble… ▽ More

    Submitted 9 June, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted for publication at ICML 2025. 33 pages, 17 figures

  2. arXiv:2405.19985  [pdf, other

    stat.ME cs.LG

    Targeted Sequential Indirect Experiment Design

    Authors: Elisabeth Ailer, Niclas Dern, Jason Hartford, Niki Kilbertus

    Abstract: Scientific hypotheses typically concern specific aspects of complex, imperfectly understood or entirely unknown mechanisms, such as the effect of gene expression levels on phenotypes or how microbial communities influence environmental health. Such queries are inherently causal (rather than purely associational), but in many settings, experiments can not be conducted directly on the target variabl… ▽ More

    Submitted 3 March, 2025; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: published at NeurIPS 2024

  3. arXiv:2307.02301  [pdf, other

    cs.LG cs.CL stat.ML

    Sumformer: Universal Approximation for Efficient Transformers

    Authors: Silas Alberti, Niclas Dern, Laura Thesing, Gitta Kutyniok

    Abstract: Natural language processing (NLP) made an impressive jump with the introduction of Transformers. ChatGPT is one of the most famous examples, changing the perception of the possibilities of AI even outside the research community. However, besides the impressive performance, the quadratic time and space complexity of Transformers with respect to sequence length pose significant limitations for handl… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.