Skip to main content

Showing 1–10 of 10 results for author: Engelmann, B

.
  1. arXiv:2504.07584  [pdf, other

    cs.IR

    REANIMATOR: Reanimate Retrieval Test Collections with Extracted and Synthetic Resources

    Authors: Björn Engelmann, Fabian Haak, Philipp Schaer, Mani Erfanian Abdoust, Linus Netze, Meik Bittkowski

    Abstract: Retrieval test collections are essential for evaluating information retrieval systems, yet they often lack generalizability across tasks. To overcome this limitation, we introduce REANIMATOR, a versatile framework designed to enable the repurposing of existing test collections by enriching them with extracted and synthetic resources. REANIMATOR enhances test collections from PDF files by parsing f… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  2. Investigating Bias in Political Search Query Suggestions by Relative Comparison with LLMs

    Authors: Fabian Haak, Björn Engelmann, Christin Katharina Kreutz, Philipp Schaer

    Abstract: Search query suggestions affect users' interactions with search engines, which then influences the information they encounter. Thus, bias in search query suggestions can lead to exposure to biased search results and can impact opinion formation. This is especially critical in the political domain. Detecting and quantifying bias in web search engines is difficult due to its topic dependency, comple… ▽ More

    Submitted 31 October, 2024; originally announced October 2024.

    MSC Class: 94-02 ACM Class: H.3.3

  3. arXiv:2404.07501  [pdf, other

    cs.CL

    Leveraging Data Augmentation for Process Information Extraction

    Authors: Julian Neuberger, Leonie Doll, Benedict Engelmann, Lars Ackermann, Stefan Jablonski

    Abstract: Business Process Modeling projects often require formal process models as a central component. High costs associated with the creation of such formal process models motivated many different fields of research aimed at automated generation of process models from readily available data. These include process mining on event logs, and generating business process models from natural language texts. Re… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted at BPMDS 2024 (https://sites.google.com/view/bpmds/), to be printed

  4. arXiv:2401.08892  [pdf, other

    q-fin.RM

    Spurious Default Probability Projections in Credit Risk Stress Testing Models

    Authors: Bernd Engelmann

    Abstract: Credit risk stress testing has become an important risk management device which is used both by banks internally and by regulators. Stress testing is complex because it essentially means projecting a bank's full balance sheet conditional on a macroeconomic scenario over multiple years. Part of the complexity stems from using a wide range of model parameters for, e.g., rating transition, write-off… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 15 pages, 4 figures

    MSC Class: 91

  5. arXiv:2312.09631  [pdf, other

    cs.IR

    Context-Driven Interactive Query Simulations Based on Generative Large Language Models

    Authors: Björn Engelmann, Timo Breuer, Jana Isabelle Friese, Philipp Schaer, Norbert Fuhr

    Abstract: Simulating user interactions enables a more user-oriented evaluation of information retrieval (IR) systems. While user simulations are cost-efficient and reproducible, many approaches often lack fidelity regarding real user behavior. Most notably, current user models neglect the user's context, which is the primary driver of perceived relevance and the interactions with the search results. To this… ▽ More

    Submitted 25 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at ECIR 2024 (Full Paper)

  6. Simulating Users in Interactive Web Table Retrieval

    Authors: Björn Engelmann, Timo Breuer, Philipp Schaer

    Abstract: Considering the multimodal signals of search items is beneficial for retrieval effectiveness. Especially in web table retrieval (WTR) experiments, accounting for multimodal properties of tables boosts effectiveness. However, it still remains an open question how the single modalities affect user experience in particular. Previous work analyzed WTR performance in ad-hoc retrieval benchmarks, which… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 4 pages + references; accepted at CIKM'23

    Journal ref: CIKM 2023

  7. arXiv:2307.03569  [pdf, other

    cs.CL cs.CY

    Text Simplification of Scientific Texts for Non-Expert Readers

    Authors: Björn Engelmann, Fabian Haak, Christin Katharina Kreutz, Narjes Nikzad Khasmakhi, Philipp Schaer

    Abstract: Reading levels are highly individual and can depend on a text's language, a person's cognitive abilities, or knowledge on a topic. Text simplification is the task of rephrasing a text to better cater to the abilities of a specific target reader group. Simplification of scientific abstracts helps non-experts to access the core information by bypassing formulations that require domain or expert know… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Paper accepted at SimpleText@CLEF'23, 12 pages, 1 Figure, 4 Tables

  8. arXiv:2306.04876  [pdf, ps, other

    stat.ME

    Comprehensive Stepwise Selection for Logistic Regression

    Authors: Bernd Engelmann

    Abstract: Automated variable selection is widely applied in statistical model development. Algorithms like forward, backward or stepwise selection are available in statistical software packages like R and SAS. Many researchers have criticized the use of these algorithms because the models resulting from automated selection algorithms are not based on theory and tend to be unstable. Furthermore, simulation s… ▽ More

    Submitted 15 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  9. arXiv:1509.08605  [pdf, other

    cs.PL cs.LO

    A Sound and Complete Hoare Logic for Dynamically-Typed, Object-Oriented Programs -- Extended Version --

    Authors: Björn Engelmann, Ernst-Rüdiger Olderog

    Abstract: A simple dynamically-typed, (purely) object-oriented language is defined. A structural operational semantics as well as a Hoare-style program logic for reasoning about programs in the language in multiple notions of correctness are given. The Hoare logic is proved to be both sound and (relative) complete and is -- to the best of our knowledge -- the first such logic presented for a dynamically-typ… ▽ More

    Submitted 8 January, 2016; v1 submitted 29 September, 2015; originally announced September 2015.

    Comments: Extended Version -- contains all proofs, proof rules and additional information; new version -- elaborated explanations in section 7, added reference, minor visual improvements; new version -- incorporated reviews & improved formalizations

  10. arXiv:1501.02699  [pdf, other

    cs.PL cs.LO cs.SE

    Closing the Gap -- Formally Verifying Dynamically Typed Programs like Statically Typed Ones Using Hoare Logic -- Extended Version --

    Authors: Björn Engelmann, Ernst-Rüdiger Olderog, Nils Erik Flick

    Abstract: Dynamically typed object-oriented languages enable programmers to write elegant, reusable and extensible programs. However, with the current methodology for program verification, the absence of static type information creates significant overhead. Our proposal is two-fold: First, we propose a layer of abstraction hiding the complexity of dynamic typing when provided with sufficient type informat… ▽ More

    Submitted 12 January, 2015; originally announced January 2015.

    Comments: includes all appendices