Skip to main content

Showing 1–10 of 10 results for author: Bloecher, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18082  [pdf, ps, other

    cs.CL stat.AP

    Statistical Multicriteria Evaluation of LLM-Generated Text

    Authors: Esteban Garces Arias, Hannah Blocher, Julian Rodemann, Matthias Aßenmacher, Christoph Jansen

    Abstract: Assessing the quality of LLM-generated text remains a fundamental challenge in natural language processing. Current evaluation approaches often rely on isolated metrics or simplistic aggregations that fail to capture the nuanced trade-offs between coherence, diversity, fluency, and other relevant indicators of text quality. In this work, we adapt a recently proposed framework for statistical infer… ▽ More

    Submitted 24 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

  2. arXiv:2410.18653  [pdf, ps, other

    cs.CL cs.LG

    Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework

    Authors: Esteban Garces Arias, Hannah Blocher, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

    Abstract: Open-ended text generation has become a prominent task in natural language processing due to the rise of powerful (large) language models. However, evaluating the quality of these models and the employed decoding strategies remains challenging due to trade-offs among widely used metrics such as coherence, diversity, and perplexity. This paper addresses the specific problem of multicriteria evaluat… ▽ More

    Submitted 17 June, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: Accepted at the $GEM^2$ Workshop (co-located with ACL 2025)

  3. arXiv:2406.03924  [pdf, other

    stat.ML cs.LG stat.ME

    Statistical Multicriteria Benchmarking via the GSD-Front

    Authors: Christoph Jansen, Georg Schollmeyer, Julian Rodemann, Hannah Blocher, Thomas Augustin

    Abstract: Given the vast number of classifiers that have been (and continue to be) proposed, reliable methods for comparing them are becoming increasingly important. The desire for reliability is broken down into three main aspects: (1) Comparisons should allow for different quality metrics simultaneously. (2) Comparisons should take into account the statistical uncertainty induced by the choice of benchmar… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: CJ, GS,JR and HB equally contributed to this work

    MSC Class: 62G05; 62G35; 62G09; 62G10 ACM Class: G.3

  4. arXiv:2402.16565  [pdf, other

    cs.LG stat.ML

    Partial Rankings of Optimizers

    Authors: Julian Rodemann, Hannah Blocher

    Abstract: We introduce a framework for benchmarking optimizers according to multiple criteria over various test functions. Based on a recently introduced union-free generic depth function for partial orders/rankings, it fully exploits the ordinal information and allows for incomparability. Our method describes the distribution of all partial orders/rankings, avoiding the notorious shortcomings of aggregatio… ▽ More

    Submitted 6 September, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2312.12839  [pdf, other

    cs.LG stat.ML

    Comparing Machine Learning Algorithms by Union-Free Generic Depth

    Authors: Hannah Blocher, Georg Schollmeyer, Malte Nalenz, Christoph Jansen

    Abstract: We propose a framework for descriptively analyzing sets of partial orders based on the concept of depth functions. Despite intensive studies in linear and metric spaces, there is very little discussion on depth functions for non-standard data types such as partial orders. We introduce an adaptation of the well-known simplicial depth to the set of all partial orders, the union-free generic (ufg) de… ▽ More

    Submitted 21 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2304.09872

  6. arXiv:2309.04362  [pdf, ps, other

    eess.SP cs.IT

    Sparse Codesigned Communication and Radar Systems

    Authors: Hyeon Seok Rou, Giuseppe Thadeu Freitas de Abreu, Saravanan Nagesh, Andreas Bathelt, David González G., Osvaldo Gonsa, Hans-Ludwig Bloecher

    Abstract: In the envisioned beyond-fifth-generation (B5G) and sixth-generation (6G) scenarios which expect massive multiple-input multiple-output (mMIMO) and high frequency communications in the millimeter-wave (mmWave) and Terahertz (THz) bands, efficiency in both energy and spectrum is of increasing significance. To that extent, a novel ISAC framework called "sparse codesigned communication and radar (SCC… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  7. arXiv:2306.12803  [pdf, other

    stat.ML cs.LG math.ST

    Robust Statistical Comparison of Random Variables with Locally Varying Scale of Measurement

    Authors: Christoph Jansen, Georg Schollmeyer, Hannah Blocher, Julian Rodemann, Thomas Augustin

    Abstract: Spaces with locally varying scale of measurement, like multidimensional structures with differently scaled dimensions, are pretty common in statistics and machine learning. Nevertheless, it is still understood as an open question how to exploit the entire information encoded in them properly. We address this problem by considering an order based on (sets of) expectations of random variables mappin… ▽ More

    Submitted 4 March, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

    MSC Class: 62G10; 62G35

  8. arXiv:2304.10549  [pdf, ps, other

    cs.LG cs.AI math.LO

    A note on the connectedness property of union-free generic sets of partial orders

    Authors: Georg Schollmeyer, Hannah Blocher

    Abstract: This short note describes and proves a connectedness property which was introduced in Blocher et al. [2023] in the context of data depth functions for partial orders. The connectedness property gives a structural insight into union-free generic sets. These sets, presented in Blocher et al. [2023], are defined by using a closure operator on the set of all partial orders which naturally appears with… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

  9. arXiv:2304.09872  [pdf, other

    cs.LG stat.ME

    Depth Functions for Partial Orders with a Descriptive Analysis of Machine Learning Algorithms

    Authors: Hannah Blocher, Georg Schollmeyer, Christoph Jansen, Malte Nalenz

    Abstract: We propose a framework for descriptively analyzing sets of partial orders based on the concept of depth functions. Despite intensive studies of depth functions in linear and metric spaces, there is very little discussion on depth functions for non-standard data types such as partial orders. We introduce an adaptation of the well-known simplicial depth to the set of all partial orders, the union-fr… ▽ More

    Submitted 9 February, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted to ISIPTA 2023; Forthcoming in: Proceedings of Machine Learning Research

  10. arXiv:2110.12879  [pdf, other

    cs.AI stat.ME

    Information efficient learning of complexly structured preferences: Elicitation procedures and their application to decision making under uncertainty

    Authors: Christoph Jansen, Hannah Blocher, Thomas Augustin, Georg Schollmeyer

    Abstract: In this paper we propose efficient methods for elicitation of complexly structured preferences and utilize these in problems of decision making under (severe) uncertainty. Based on the general framework introduced in Jansen, Schollmeyer and Augustin (2018, Int. J. Approx. Reason), we now design elicitation procedures and algorithms that enable decision makers to reveal their underlying preference… ▽ More

    Submitted 1 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.