Showing 1–19 of 19 results for author: Kuhlmann, M

Searching in archive cs.

  1. arXiv:2506.14980  [pdf, ps, other]

    cs.CV cs.RO

    Advances in Compliance Detection: Novel Models Using Vision-Based Tactile Sensors

    Authors: Ziteng Li, Malte Kuhlmann, Ilana Nisky, Nicolás Navarro-Guerrero

    Abstract: Compliance is a critical parameter for describing objects in engineering, agriculture, and biomedical applications. Traditional compliance detection methods are limited by their lack of portability and scalability, rely on specialized, often expensive equipment, and are unsuitable for robotic applications. Moreover, existing neural network-based approaches using vision-based tactile sensors still…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Accepted in the IEEE International Conference on Development and Learning (ICDL). The paper contains 8 pages and 7 figures

    ACM Class: I.2.9

  2. arXiv:2505.14309  [pdf, ps, other]

    cs.CL

    Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency

    Authors: Ehsan Doostmohammadi, Marco Kuhlmann

    Abstract: Retrieval-augmented language models have demonstrated performance comparable to much larger models while requiring fewer computational resources. The effectiveness of these models crucially depends on the overlap between query and retrieved context, but the optimal degree of this overlap remains unexplored. In this paper, we systematically investigate how varying levels of query–context overlap a…

    Submitted 20 May, 2025; originally announced May 2025.

  3. arXiv:2501.08791  [pdf, other]

    eess.AS cs.SD

    Speech Synthesis along Perceptual Voice Quality Dimensions

    Authors: Frederik Rautenberg, Michael Kuhlmann, Fritz Seebauer, Jana Wiechmann, Petra Wagner, Reinhold Haeb-Umbach

    Abstract: While expressive speech synthesis or voice conversion systems mainly focus on controlling or manipulating abstract prosodic characteristics of speech, such as emotion or accent, we here address the control of perceptual voice qualities (PVQs) recognized by phonetic experts, which are speech properties at a lower level of abstraction. The ability to manipulate PVQs can be a valuable tool for teachi…

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: Accepted by ICASSP 2025

  4. arXiv:2411.08533  [pdf, other]

    cs.RO cs.AI

    ACROSS: A Deformation-Based Cross-Modal Representation for Robotic Tactile Perception

    Authors: Wadhah Zai El Amri, Malte Kuhlmann, Nicolás Navarro-Guerrero

    Abstract: Tactile perception is essential for human interaction with the environment and is becoming increasingly crucial in robotics. Tactile sensors like the BioTac mimic human fingertips and provide detailed interaction data. Despite its utility in applications like slip detection and object identification, this sensor is now deprecated, making many valuable datasets obsolete. However, recreating similar…

    Submitted 19 February, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: Accepted to 2025 IEEE Conference on Robotics and Automation (ICRA 2025). arXiv admin note: text overlap with arXiv:2410.14310

  5. arXiv:2410.14405  [pdf, ps, other]

    cs.CL

    Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion

    Authors: Denitsa Saynova, Lovisa Hagström, Moa Johansson, Richard Johansson, Marco Kuhlmann

    Abstract: Language models (LMs) can make a correct prediction based on many possible signals in a prompt, not all corresponding to recall of factual associations. However, current interpretations of LMs fail to take this into account. For example, given the query "Astrid Lindgren was born in" with the corresponding completion "Sweden", no difference is made between whether the prediction was based on knowin…

    Submitted 1 July, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: Accepted to ACL Findings 2025

  6. arXiv:2410.14310  [pdf, other]

    cs.RO cs.AI

    Transferring Tactile Data Across Sensors

    Authors: Wadhah Zai El Amri, Malte Kuhlmann, Nicolás Navarro-Guerrero

    Abstract: Tactile perception is essential for human interaction with the environment and is becoming increasingly crucial in robotics. Tactile sensors like the BioTac mimic human fingertips and provide detailed interaction data. Despite its utility in applications like slip detection and object identification, this sensor is now deprecated, making many existing datasets obsolete. This article introduces a n…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: Extended Abstract. Accepted in ICRA@40 (40th Anniversary of the IEEE International Conference on Robotics and Automation) 23-26 September, 2024 Rotterdam, Netherlands

  7. arXiv:2405.04215  [pdf, other]

    cs.AI

    NL2Plan: Robust LLM-Driven Planning from Minimal Text Descriptions

    Authors: Elliot Gestrin, Marco Kuhlmann, Jendrik Seipp

    Abstract: Today's classical planners are powerful, but modeling input tasks in formats such as PDDL is tedious and error-prone. In contrast, planning with Large Language Models (LLMs) allows for almost any input text, but offers no guarantees on plan quality or even soundness. In an attempt to merge the best of these two approaches, some work has begun to use LLMs to automate parts of the PDDL creation proc…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted for the ICAPS 2024 Workshop on Human-Aware and Explainable Planning

  8. arXiv:2402.10770  [pdf, other]

    cs.CL cs.AI

    How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?

    Authors: Ehsan Doostmohammadi, Oskar Holmström, Marco Kuhlmann

    Abstract: Work on instruction-tuned Large Language Models (LLMs) has used automatic methods based on text overlap and LLM judgments as cost-effective alternatives to human evaluation. In this paper, we perform a meta-evaluation of such methods and assess their reliability across a broad range of tasks. In evaluating how well automatic methods align with human evaluations, correlation metrics are the most co…

    Submitted 2 October, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  9. arXiv:2402.10532  [pdf, other]

    cs.CL cs.AI cs.CY cs.HC cs.LG

    Properties and Challenges of LLM-Generated Explanations

    Authors: Jenny Kunz, Marco Kuhlmann

    Abstract: The self-rationalising capabilities of large language models (LLMs) have been explored in restricted settings, using task-specific data sets. However, current LLMs do not (only) rely on specifically annotated data; nonetheless, they frequently explain their outputs. The properties of the generated explanations are influenced by the pre-training corpus and by the target data used for instruction fi…

    Submitted 16 February, 2024; originally announced February 2024.

  10. arXiv:2310.04445  [pdf, other]

    cs.CL cs.AI cs.LG

    LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model

    Authors: Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphael Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh

    Abstract: It has been shown that Large Language Model (LLM) alignments can be circumvented by appending specially crafted attack suffixes with harmful queries to elicit harmful responses. To conduct attacks against private target models whose characterization is unknown, public models can be used as proxies to fashion the attack, with successful attacks being transferred from public proxies to private targe…

    Submitted 21 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

  11. arXiv:2306.04621  [pdf, other]

    cs.LG cs.CV

    Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration

    Authors: Emanuel Sanchez Aimar, Nathaniel Helgesen, Yonghao Xu, Marco Kuhlmann, Michael Felsberg

    Abstract: Long-tailed semi-supervised learning (LTSSL) represents a practical scenario for semi-supervised applications, challenged by skewed labeled distributions that bias classifiers. This problem is often aggravated by discrepancies between labeled and unlabeled class distributions, leading to biased pseudo-labels, neglect of rare classes, and poorly calibrated probabilities. To address these issues, we…

    Submitted 15 July, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted at ECCV2024, 25 pages, 6 figures

  12. arXiv:2305.16243  [pdf, other]

    cs.CL

    Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models

    Authors: Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann, Richard Johansson

    Abstract: Augmenting language models with a retrieval mechanism has been shown to significantly improve their performance while keeping the number of parameters low. Retrieval-augmented models commonly rely on a semantic retrieval mechanism based on the similarity between dense representations of the query chunk and potential neighbors. In this paper, we study the state-of-the-art Retro model and observe th…

    Submitted 4 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  13. arXiv:2302.12128  [pdf, other]

    cs.CL

    On the Generalization Ability of Retrieval-Enhanced Transformers

    Authors: Tobias Norlund, Ehsan Doostmohammadi, Richard Johansson, Marco Kuhlmann

    Abstract: Recent work on the Retrieval-Enhanced Transformer (RETRO) model has shown that off-loading memory from trainable weights to a retrieval database can significantly improve language modeling and match the performance of non-retrieval models that are an order of magnitude larger in size. It has been suggested that at least some of this performance gain is due to non-trivial generalization based on bo…

    Submitted 23 February, 2023; originally announced February 2023.

  14. arXiv:2206.05260  [pdf, other]

    cs.CV cs.LG

    Balanced Product of Calibrated Experts for Long-Tailed Recognition

    Authors: Emanuel Sanchez Aimar, Arvi Jonnarth, Michael Felsberg, Marco Kuhlmann

    Abstract: Many real-world recognition problems are characterized by long-tailed label distributions. These distributions make representation learning highly challenging due to limited generalization over the tail classes. If the test distribution differs from the training distribution, e.g. uniform versus long-tailed, the problem of the distribution shift needs to be addressed. A recent line of work propose…

    Submitted 7 June, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted at CVPR 2023, 19 pages

  15. arXiv:2005.12963  [pdf, ps, other]

    eess.AS cs.SD

    Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations

    Authors: Janek Ebbers, Michael Kuhlmann, Tobias Cord-Landwehr, Reinhold Haeb-Umbach

    Abstract: In this work we address disentanglement of style and content in speech signals. We propose a fully convolutional variational autoencoder employing two encoders: a content encoder and a style encoder. To foster disentanglement, we propose adversarial contrastive predictive coding. This new disentanglement method needs neither parallel data nor any supervision. We show that the proposed techniqu…

    Submitted 11 March, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted by ICASSP 2021

  16. arXiv:1702.06594  [pdf, ps, other]

    cs.CL

    On the Complexity of CCG Parsing

    Authors: Marco Kuhlmann, Giorgio Satta, Peter Jonsson

    Abstract: We study the parsing complexity of Combinatory Categorial Grammar (CCG) in the formalism of Vijay-Shanker and Weir (1994). As our main result, we prove that any parsing algorithm for this formalism will take in the worst case exponential time when the size of the grammar, and not only the length of the input sentence, is included in the analysis. This sets the formalism of Vijay-Shanker and Weir (…

    Submitted 4 May, 2018; v1 submitted 21 February, 2017; originally announced February 2017.

    Comments: 39 pages, 17 figures

  17. arXiv:1504.05908  [pdf, ps, other]

    cs.CC

    Maximum Pagenumber-k Subgraph is NP-Complete

    Authors: Peter Jonsson, Marco Kuhlmann

    Abstract: Given a graph $G$ with a total order defined on its vertices, the Maximum Pagenumber-$k$ Subgraph Problem asks for a maximum subgraph $G'$ of $G$ such that $G'$ can be embedded into a $k$-book when the vertices are placed on the spine according to the specified total order. We show that this problem is NP-complete for $k \geq 2$.

    Submitted 23 April, 2015; v1 submitted 21 April, 2015; originally announced April 2015.

    Comments: 6 pages, 1 figure

  18. arXiv:1504.04993  [pdf, ps, other]

    cs.DS math.CO

    Tabulation of Noncrossing Acyclic Digraphs

    Authors: Marco Kuhlmann

    Abstract: I present an algorithm that, given a number $n \geq 1$, computes a compact representation of the set of all noncrossing acyclic digraphs with $n$ nodes. This compact representation can be used as the basis for a wide range of dynamic programming algorithms on these graphs. As an illustration, along with this note I am releasing the implementation of an algorithm for counting the number of noncross…

    Submitted 20 April, 2015; originally announced April 2015.

    Comments: 9 pages, several figures

  19. arXiv:0810.4249  [pdf, other]

    cs.CC

    Ogden's Lemma for Regular Tree Languages

    Authors: Marco Kuhlmann

    Abstract: We motivate and prove a strong pumping lemma for regular tree languages. The new lemma can be seen as the natural correspondent of Ogden's lemma for context-free string languages.

    Submitted 23 October, 2008; originally announced October 2008.

    ACM Class: F.4.3