Skip to main content

Showing 1–11 of 11 results for author: Kroll, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.04910  [pdf, other

    cs.CL stat.ME

    Maximizing Signal in Human-Model Preference Alignment

    Authors: Kelsey Kraus, Margaret Kroll

    Abstract: The emergence of powerful LLMs has led to a paradigm shift in Natural Language Understanding and Natural Language Generation. The properties that make LLMs so valuable for these tasks -- creativity, ability to produce fluent speech, and ability to quickly and effectively abstract information from large corpora -- also present new challenges to evaluating their outputs. The rush to market has led t… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Presented at AAAI 2025, special track on AI Alignment

  2. arXiv:2410.18218  [pdf, ps, other

    cs.AI cs.CL cs.SD eess.AS

    Optimizing the role of human evaluation in LLM-based spoken document summarization systems

    Authors: Margaret Kroll, Kelsey Kraus

    Abstract: The emergence of powerful LLMs has led to a paradigm shift in abstractive summarization of spoken documents. The properties that make LLMs so valuable for this task -- creativity, ability to produce fluent speech, and ability to abstract information from large corpora -- also present new challenges to evaluating their content. Quick, cost-effective automatic evaluations such as ROUGE and BERTScore… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

    Journal ref: Proc. Interspeech 2024, 1935-1939 (2024)

  3. arXiv:1907.06233  [pdf, other

    math.ST cs.CR

    Pointwise adaptive kernel density estimation under local approximate differential privacy

    Authors: Martin Kroll

    Abstract: We consider non-parametric density estimation in the framework of local approximate differential privacy. In contrast to centralized privacy scenarios with a trusted curator, in the local setup anonymization must be guaranteed already on the individual data owners' side and therefore must precede any data mining tasks. Thus, the published anonymized data should be compatible with as many statistic… ▽ More

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: 24 pages, 1 figure

    MSC Class: 62G05; 68P25

  4. arXiv:1903.01927  [pdf, ps, other

    math.ST cs.CR

    Local differential privacy: Elbow effect in optimal density estimation and adaptation over Besov ellipsoids

    Authors: Cristina Butucea, Amandine Dubois, Martin Kroll, Adrien Saumard

    Abstract: We address the problem of non-parametric density estimation under the additional constraint that only privatised data are allowed to be published and available for inference. For this purpose, we adopt a recent generalisation of classical minimax theory to the framework of local $α$-differential privacy and provide a lower bound on the rate of convergence over Besov spaces $B^s_{pq}$ under mean in… ▽ More

    Submitted 5 March, 2019; originally announced March 2019.

    MSC Class: 62G07 (primary); 62G20 (secondary)

  5. arXiv:1901.04182  [pdf, ps, other

    cs.DB

    Complexity Bounds for Relational Algebra over Document Spanners

    Authors: Liat Peterfreund, Dominik D. Freydenberger, Benny Kimelfeld, Markus Kröll

    Abstract: We investigate the complexity of evaluating queries in Relational Algebra (RA) over the relations extracted by regex formulas (i.e., regular expressions with capture variables) over text documents. Such queries, also known as the regular document spanners, were shown to have an evaluation with polynomial delay for every positive RA expression (i.e., consisting of only natural joins, projections an… ▽ More

    Submitted 6 February, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

  6. arXiv:1812.03831  [pdf, ps, other

    cs.DB

    On the Enumeration Complexity of Unions of Conjunctive Queries

    Authors: Nofar Carmeli, Markus Kröll

    Abstract: We study the enumeration complexity of Unions of Conjunctive Queries(UCQs). We aim to identify the UCQs that are tractable in the sense that the answer tuples can be enumerated with a linear preprocessing phase and a constant delay between every successive tuples. It has been established that, in the absence of self-joins and under conventional complexity assumptions, the CQs that admit such an ev… ▽ More

    Submitted 6 May, 2021; v1 submitted 10 December, 2018; originally announced December 2018.

  7. arXiv:1712.07880  [pdf, ps, other

    cs.DB cs.CC

    Enumeration Complexity of Conjunctive Queries with Functional Dependencies

    Authors: Nofar Carmeli, Markus Kröll

    Abstract: We study the complexity of enumerating the answers of Conjunctive Queries (CQs) in the presence of Functional Dependencies (FDs). Our focus is on the ability to list output tuples with a constant delay in between, following a linear-time preprocessing. A known dichotomy classifies the acyclic self-join-free CQs into those that admit such enumeration, and those that do not. However, this classifica… ▽ More

    Submitted 26 September, 2021; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: Published in ICDT 2018 and TOCS 2019

    ACM Class: H.2.3

  8. arXiv:1610.05493  [pdf, ps, other

    cs.CC

    A Complexity Theory for Hard Enumeration Problems

    Authors: Nadia Creignou, Markus Kröll, Reinhard Pichler, Sebastian Skritek, Heribert Vollmer

    Abstract: Complexity theory provides a wealth of complexity classes for analyzing the complexity of decision and counting problems. Despite the practical relevance of enumeration problems, the tools provided by complexity theory for this important class of problems are very limited. In particular, complexity classes analogous to the polynomial hierarchy and an appropriate notion of problem reduction are mis… ▽ More

    Submitted 24 October, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: Preprint submitted to Elsevier

    ACM Class: F.1.3

  9. arXiv:1604.02833  [pdf, other

    cs.DS

    Efficiently Enumerating Minimal Triangulations

    Authors: Nofar Carmeli, Batya Kenig, Benny Kimelfeld, Markus Kröll

    Abstract: We present an algorithm that enumerates all the minimal triangulations of a graph in incremental polynomial time. Consequently, we get an algorithm for enumerating all the proper tree decompositions, in incremental polynomial time, where "proper" means that the tree decomposition cannot be improved by removing or splitting a bag. The algorithm can incorporate any method for (ordinary, single resul… ▽ More

    Submitted 27 July, 2023; v1 submitted 11 April, 2016; originally announced April 2016.

  10. arXiv:1410.6739  [pdf, other

    cs.CR

    Automated Cryptanalysis of Bloom Filter Encryptions of Health Records

    Authors: Martin Kroll, Simone Steinmetzer

    Abstract: Privacy-preserving record linkage with Bloom filters has become increasingly popular in medical applications, since Bloom filters allow for probabilistic linkage of sensitive personal data. However, since evidence indicates that Bloom filters lack sufficiently high security where strong security guarantees are required, several suggestions for their improvement have been made in literature. One of… ▽ More

    Submitted 24 October, 2014; originally announced October 2014.

    Comments: Contribution to the 8th International Conference on Health Informatics, Lisbon 2015

  11. arXiv:1402.3198  [pdf, other

    cs.CR

    A graph theoretic linkage attack on microdata in a metric space

    Authors: Martin Kroll

    Abstract: Certain methods of analysis require the knowledge of the spatial distances between entities whose data are stored in a microdata table. For instance, such knowledge is necessary and sufficient to perform data mining tasks such as nearest neighbour searches or clustering. However, when inter-record distances are published in addition to the microdata for research purposes, the risk of identity disc… ▽ More

    Submitted 13 February, 2014; originally announced February 2014.

    Comments: 24 pages, 15 figures