Skip to main content

Showing 1–10 of 10 results for author: Peinl, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11604  [pdf

    cs.AI cs.CL cs.CV

    VLM@school -- Evaluation of AI image understanding on German middle school knowledge

    Authors: René Peinl, Vincent Tischler

    Abstract: This paper introduces a novel benchmark dataset designed to evaluate the capabilities of Vision Language Models (VLMs) on tasks that combine visual reasoning with subject-specific background knowledge in the German language. In contrast to widely used English-language benchmarks that often rely on artificially difficult or decontextualized problems, this dataset draws from real middle school curri… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  2. arXiv:2504.11108  [pdf

    cs.CL

    Benchmarking Vision Language Models on German Factual Data

    Authors: René Peinl, Vincent Tischler

    Abstract: Similar to LLMs, the development of vision language models is mainly driven by English datasets and models trained in English and Chinese language, whereas support for other languages, even those considered high-resource languages such as German, remains significantly weaker. In this work we present an analysis of open-weight VLMs on factual knowledge in the German and English language. We disenta… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  3. arXiv:2504.11104  [pdf

    cs.CL cs.CV cs.CY

    Using LLMs as prompt modifier to avoid biases in AI image generators

    Authors: René Peinl

    Abstract: This study examines how Large Language Models (LLMs) can reduce biases in text-to-image generation systems by modifying user prompts. We define bias as a model's unfair deviation from population statistics given neutral prompts. Our experiments with Stable Diffusion XL, 3.5 and Flux demonstrate that LLM-modified prompts significantly increase image diversity and reduce bias without the need to cha… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  4. arXiv:2305.11991  [pdf

    cs.CL

    Evaluation of medium-large Language Models at zero-shot closed book generative question answering

    Authors: René Peinl, Johannes Wirth

    Abstract: Large language models (LLMs) have garnered significant attention, but the definition of "large" lacks clarity. This paper focuses on medium-sized language models (MLMs), defined as having at least six billion parameters but less than 100 billion. The study evaluates MLMs regarding zero-shot generative question answering, which requires models to provide elaborate answers without external document… ▽ More

    Submitted 3 July, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

    Journal ref: Under review in ARIA 2023

  5. arXiv:2302.06008  [pdf

    cs.CL cs.AI cs.LG cs.SD eess.AS

    ASR Bundestag: A Large-Scale political debate dataset in German

    Authors: Johannes Wirth, René Peinl

    Abstract: We present ASR Bundestag, a dataset for automatic speech recognition in German, consisting of 610 hours of aligned audio-transcript pairs for supervised training as well as 1,038 hours of unlabeled audio snippets for self-supervised learning, based on raw audio data and transcriptions from plenary sessions and committee meetings of the German parliament. In addition, we discuss utilized approaches… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 13 pages, 2 tables, 4 figures

  6. arXiv:2204.05617  [pdf

    cs.CL cs.AI

    ASR in German: A Detailed Error Analysis

    Authors: Johannes Wirth, Rene Peinl

    Abstract: The amount of freely available systems for automatic speech recognition (ASR) based on neural networks is growing steadily, with equally increasingly reliable predictions. However, the evaluation of trained models is typically exclusively based on statistical metrics such as WER or CER, which do not provide any insight into the nature or impact of the errors produced when predicting transcripts fr… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    ACM Class: C.4; I.2.7

  7. arXiv:2106.06309  [pdf

    cs.SD cs.CL eess.AS

    HUI-Audio-Corpus-German: A high quality TTS dataset

    Authors: Pascal Puchtler, Johannes Wirth, René Peinl

    Abstract: The increasing availability of audio data on the internet lead to a multitude of datasets for development and training of text to speech applications, based on neural networks. Highly differing quality of voice, low sampling rates, lack of text normalization and disadvantageous alignment of audio samples to corresponding transcript sentences still limit the performance of deep neural networks trai… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  8. arXiv:2106.06230  [pdf

    cs.CL cs.AI

    Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache

    Authors: René Peinl

    Abstract: Reading text aloud is an important feature for modern computer applications. It not only facilitates access to information for visually impaired people, but is also a pleasant convenience for non-impaired users. In this article, the state of the art of speech synthesis is presented separately for mel-spectrogram generation and vocoders. It concludes with an overview of available data sets for Engl… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: in German

  9. arXiv:2002.07576  [pdf

    cs.HC

    Presence in VR experiences -- an empirical cost-benefit-analysis

    Authors: René Peinl, Tobias Wirth

    Abstract: Virtual reality (VR) is on the edge of getting a mainstream platform for gaming, education and product design. The feeling of being present in the virtual world is influenced by many factors and even more intriguing a single negative influence can destroy the illusion that was created with a lot of effort by other measures. Therefore, it is crucial to have a balance between the influencing factors… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: empirical study

    MSC Class: I.6.3

  10. ClouNS - A Cloud-native Application Reference Model for Enterprise Architects

    Authors: Nane Kratzke, René Peinl

    Abstract: The capability to operate cloud-native applications can generate enormous business growth and value. But enterprise architects should be aware that cloud-native applications are vulnerable to vendor lock-in. We investigated cloud-native application design principles, public cloud service providers, and industrial cloud standards. All results indicate that most cloud service categories seem to fost… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.