Skip to main content

Showing 1–50 of 150 results for author: Muller, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11829  [pdf, ps, other

    cs.RO cs.HC stat.ME

    The Space Between Us: A Methodological Framework for Researching Bonding and Proxemics in Situated Group-Agent Interactions

    Authors: Ana Müller, Anja Richert

    Abstract: This paper introduces a multimethod framework for studying spatial and social dynamics in real-world group-agent interactions with socially interactive agents. Drawing on proxemics and bonding theories, the method combines subjective self-reports and objective spatial tracking. Applied in two field studies in a museum (N = 187) with a robot and a virtual agent, the paper addresses the challenges i… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: Accepted for presentation at the Workshop on Advancing Group Understanding and Robots' Adaptive Behavior (GROUND), held at the Intelligent Autonomous Systems (IAS) Conference 2025, Genoa, Italy

  2. arXiv:2506.10686  [pdf, ps, other

    cs.RO cs.SC math.GR math.OC physics.class-ph

    An $O(n$)-Algorithm for the Higher-Order Kinematics and Inverse Dynamics of Serial Manipulators using Spatial Representation of Twists

    Authors: Andreas Mueller

    Abstract: Optimal control in general, and flatness-based control in particular, of robotic arms necessitate to compute the first and second time derivatives of the joint torques/forces required to achieve a desired motion. In view of the required computational efficiency, recursive $O(n)$-algorithms were proposed to this end. Aiming at compact yet efficient formulations, a Lie group formulation was recently… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Journal ref: IEEE ROBOTICS AND AUTOMATION LETTERS, VOL. 6, NO. 2, APRIL 2021

  3. arXiv:2506.10462  [pdf, ps, other

    cs.RO

    Are We Generalizing from the Exception? An In-the-Wild Study on Group-Sensitive Conversation Design in Human-Agent Interactions

    Authors: Ana Müller, Sabina Jeschke, Anja Richert

    Abstract: This paper investigates the impact of a group-adaptive conversation design in two socially interactive agents (SIAs) through two real-world studies. Both SIAs - Furhat, a social robot, and MetaHuman, a virtual agent - were equipped with a conversational artificial intelligence (CAI) backend combining hybrid retrieval and generative models. The studies were carried out in an in-the-wild setting wit… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Accepted as a regular paper at the 2025 IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). \c{opyright} IEEE. This is the preprint version. The final version will appear in the IEEE proceedings

  4. arXiv:2506.05584  [pdf, ps, other

    cs.LG

    TabFlex: Scaling Tabular Learning to Millions with Linear Attention

    Authors: Yuchen Zeng, Tuan Dinh, Wonjun Kang, Andreas C Mueller

    Abstract: Leveraging the in-context learning (ICL) capability of Large Language Models (LLMs) for tabular classification has gained significant attention for its training-free adaptability across diverse datasets. Recent advancements, like TabPFN, excel in small-scale tabular datasets but struggle to scale for large and complex datasets. Our work enhances the efficiency and scalability of TabPFN for larger… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 30 pages, ICML 2025

  5. arXiv:2505.20209  [pdf, other

    cs.CL

    How to Improve the Robustness of Closed-Source Models on NLI

    Authors: Joe Stacey, Lisa Alazraki, Aran Ubhi, Beyza Ermis, Aaron Mueller, Marek Rei

    Abstract: Closed-source Large Language Models (LLMs) have become increasingly popular, with impressive performance across a wide range of natural language tasks. These models can be fine-tuned to further improve performance, but this often results in the models learning from dataset-specific heuristics that reduce their robustness on out-of-distribution (OOD) data. Existing methods to improve robustness eit… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    ACM Class: I.2.7

  6. arXiv:2505.20063  [pdf, other

    cs.LG cs.AI cs.CL

    SAEs Are Good for Steering -- If You Select the Right Features

    Authors: Dana Arad, Aaron Mueller, Yonatan Belinkov

    Abstract: Sparse Autoencoders (SAEs) have been proposed as an unsupervised approach to learn a decomposition of a model's latent space. This enables useful applications such as steering - influencing the output of a model towards a desired concept - without requiring labeled data. Current methods identify SAE features to steer by analyzing the input tokens that activate them. However, recent work has highli… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  7. arXiv:2504.13151  [pdf, ps, other

    cs.LG cs.AI cs.CL

    MIB: A Mechanistic Interpretability Benchmark

    Authors: Aaron Mueller, Atticus Geiger, Sarah Wiegreffe, Dana Arad, Iván Arcuschin, Adam Belfki, Yik Siu Chan, Jaden Fiotto-Kaufman, Tal Haklay, Michael Hanna, Jing Huang, Rohan Gupta, Yaniv Nikankin, Hadas Orgad, Nikhil Prakash, Anja Reusch, Aruna Sankaranarayanan, Shun Shao, Alessandro Stolfo, Martin Tutek, Amir Zur, David Bau, Yonatan Belinkov

    Abstract: How can we know whether new mechanistic interpretability methods achieve real improvements? In pursuit of lasting evaluation standards, we propose MIB, a Mechanistic Interpretability Benchmark, with two tracks spanning four tasks and five models. MIB favors methods that precisely and concisely recover relevant causal pathways or causal variables in neural language models. The circuit localization… ▽ More

    Submitted 9 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Accepted to ICML 2025. Project website at https://mib-bench.github.io

  8. arXiv:2504.11011  [pdf, other

    cs.IR cs.AI

    Document Quality Scoring for Web Crawling

    Authors: Francesca Pezzuti, Ariane Mueller, Sean MacAvaney, Nicola Tonellotto

    Abstract: The internet contains large amounts of low-quality content, yet users expect web search engines to deliver high-quality, relevant results. The abundant presence of low-quality pages can negatively impact retrieval and crawling processes by wasting resources on these documents. Therefore, search engines can greatly benefit from techniques that leverage efficient quality estimation methods to mitiga… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Presented at WOWS2025

  9. Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

    Authors: Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjape, Adina Williams, Tal Linzen, Ryan Cotterell

    Abstract: Children can acquire language from less than 100 million words of input. Large language models are far less data-efficient: they typically require 3 or 4 orders of magnitude more data and still do not perform as well as humans on many evaluations. These intensive resource demands limit the ability of researchers to train new models and use existing models as developmentally plausible cognitive mod… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Published in Proceedings of BabyLM. Please cite the published version on ACL anthology: http://aclanthology.org/2023.conll-babylm.1/

    Journal ref: 2023. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 1-34, Singapore. Association for Computational Linguistics

  10. arXiv:2503.23760  [pdf, other

    cs.RO cs.CL cs.HC

    Towards a cognitive architecture to enable natural language interaction in co-constructive task learning

    Authors: Manuel Scheibl, Birte Richter, Alissa Müller, Michael Beetz, Britta Wrede

    Abstract: This research addresses the question, which characteristics a cognitive architecture must have to leverage the benefits of natural language in Co-Constructive Task Learning (CCTL). To provide context, we first discuss Interactive Task Learning (ITL), the mechanisms of the human memory system, and the significance of natural language and multi-modality. Next, we examine the current state of cogniti… ▽ More

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: 8 pages, 5 figures, submitted to: IEEE RO-MAN 2025

  11. arXiv:2503.11404  [pdf, other

    cs.CR cs.AI cs.CV

    Towards A Correct Usage of Cryptography in Semantic Watermarks for Diffusion Models

    Authors: Jonas Thietke, Andreas Müller, Denis Lukovnikov, Asja Fischer, Erwin Quiring

    Abstract: Semantic watermarking methods enable the direct integration of watermarks into the generation process of latent diffusion models by only modifying the initial latent noise. One line of approaches building on Gaussian Shading relies on cryptographic primitives to steer the sampling process of the latent noise. However, we identify several issues in the usage of cryptographic techniques in Gaussian… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 8 pages, 3 figures, WMark@ICLR

  12. arXiv:2503.02922  [pdf, other

    cs.IR

    Optimizing open-domain question answering with graph-based retrieval augmented generation

    Authors: Joyce Cahoon, Prerna Singh, Nick Litombe, Jonathan Larson, Ha Trinh, Yiwen Zhu, Andreas Mueller, Fotis Psallidas, Carlo Curino

    Abstract: In this work, we benchmark various graph-based retrieval-augmented generation (RAG) systems across a broad spectrum of query types, including OLTP-style (fact-based) and OLAP-style (thematic) queries, to address the complex demands of open-domain question answering (QA). Traditional RAG methods often fall short in handling nuanced, multi-document synthesis tasks. By structuring knowledge as graphs… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    ACM Class: H.3.3; I.2.7

  13. arXiv:2502.11673  [pdf, ps, other

    cs.LG stat.ML

    Best of Both Worlds: Regret Minimization versus Minimax Play

    Authors: Adrian Müller, Jon Schneider, Stratis Skoulakis, Luca Viano, Volkan Cevher

    Abstract: In this paper, we investigate the existence of online learning algorithms with bandit feedback that simultaneously guarantee $O(1)$ regret compared to a given comparator strategy, and $\tilde{O}(\sqrt{T})$ regret compared to any fixed strategy, where $T$ is the number of rounds. We provide the first affirmative answer to this question whenever the comparator strategy supports every action. In the… ▽ More

    Submitted 4 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

  14. arXiv:2502.10645  [pdf, other

    cs.CL

    BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop

    Authors: Lucas Charpentier, Leshem Choshen, Ryan Cotterell, Mustafa Omer Gul, Michael Hu, Jaap Jumelet, Tal Linzen, Jing Liu, Aaron Mueller, Candace Ross, Raj Sanjay Shah, Alex Warstadt, Ethan Wilcox, Adina Williams

    Abstract: BabyLM aims to dissolve the boundaries between cognitive modeling and language modeling. We call for both workshop papers and for researchers to join the 3rd BabyLM competition. As in previous years, we call for participants in the data-efficient pretraining challenge in the general track. This year, we also offer a new track: INTERACTION. This new track encourages interactive behavior, learning f… ▽ More

    Submitted 24 February, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

    Comments: EMNLP 2025 BabyLM Workshop. arXiv admin note: text overlap with arXiv:2404.06214

  15. arXiv:2502.05392  [pdf, other

    cs.LG

    Open Challenges in Time Series Anomaly Detection: An Industry Perspective

    Authors: Andreas Mueller

    Abstract: Current research in time-series anomaly detection is using definitions that miss critical aspects of how anomaly detection is commonly used in practice. We list several areas that are of practical relevance and that we believe are either under-investigated or missing entirely from the current discourse. Based on an investigation of systems deployed in a cloud environment, we motivate the areas of… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  16. arXiv:2502.04577  [pdf, other

    cs.LG cs.CL

    Position-aware Automatic Circuit Discovery

    Authors: Tal Haklay, Hadas Orgad, David Bau, Aaron Mueller, Yonatan Belinkov

    Abstract: A widely used strategy to discover and understand language model mechanisms is circuit analysis. A circuit is a minimal subgraph of a model's computation graph that executes a specific task. We identify a gap in existing circuit discovery methods: they assume circuits are position-invariant, treating model components as equally relevant across input positions. This limits their ability to capture… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    MSC Class: 68T50 ACM Class: I.2.7

  17. arXiv:2502.03376  [pdf

    cs.CY cs.CV

    Ethical Considerations for the Military Use of Artificial Intelligence in Visual Reconnaissance

    Authors: Mathias Anneken, Nadia Burkart, Fabian Jeschke, Achim Kuwertz-Wolf, Almuth Mueller, Arne Schumann, Michael Teutsch

    Abstract: This white paper underscores the critical importance of responsibly deploying Artificial Intelligence (AI) in military contexts, emphasizing a commitment to ethical and legal standards. The evolving role of AI in the military goes beyond mere technical applications, necessitating a framework grounded in ethical principles. The discussion within the paper delves into ethical AI principles, particul… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: White Paper, 30 pages, 7 figures

  18. arXiv:2501.15849  [pdf, ps, other

    eess.SY cs.LG

    Gaussian Process-Based Prediction and Control of Hammerstein-Wiener Systems

    Authors: Mingzhou Yin, Matthias A. Müller

    Abstract: This work investigates data-driven prediction and control of Hammerstein-Wiener systems using physics-informed Gaussian process models. Data-driven prediction algorithms have been developed for structured nonlinear systems based on Willems' fundamental lemma. However, existing frameworks cannot treat output nonlinearities and require a dictionary of basis functions for Hammerstein systems. In this… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  19. arXiv:2501.10713  [pdf

    cs.HC cs.RO

    Human-like Nonverbal Behavior with MetaHumans in Real-World Interaction Studies: An Architecture Using Generative Methods and Motion Capture

    Authors: Oliver Chojnowski, Alexander Eberhard, Michael Schiffmann, Ana Müller, Anja Richert

    Abstract: Socially interactive agents are gaining prominence in domains like healthcare, education, and service contexts, particularly virtual agents due to their inherent scalability. To facilitate authentic interactions, these systems require verbal and nonverbal communication through e.g., facial expressions and gestures. While natural language processing technologies have rapidly advanced, incorporating… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: Accepted for presentation at the ACM/IEEE International Conference on Human-Robot Interaction (HRI 2025) as a Late-Breaking Report

  20. arXiv:2501.08618  [pdf, other

    cs.CL cs.AI

    Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models

    Authors: Aruna Sankaranarayanan, Dylan Hadfield-Menell, Aaron Mueller

    Abstract: All natural languages are structured hierarchically. In humans, this structural restriction is neurologically coded: when two grammars are presented with identical vocabularies, brain areas responsible for language processing are only sensitive to hierarchical grammars. Using large language models (LLMs), we investigate whether such functionally distinct hierarchical processing regions can arise s… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

  21. arXiv:2501.06346  [pdf, other

    cs.CL

    Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages

    Authors: Jannik Brinkmann, Chris Wendler, Christian Bartelt, Aaron Mueller

    Abstract: Human bilinguals often use similar brain regions to process multiple languages, depending on when they learned their second language and their proficiency. In large language models (LLMs), how are multiple languages learned and encoded? In this work, we explore the extent to which LLMs share representations of morphsyntactic concepts such as grammatical number, gender, and tense across languages.… ▽ More

    Submitted 23 May, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

  22. arXiv:2412.20409  [pdf, other

    cs.RO math.GR math.NA math.RA

    Analytically Informed Inverse Kinematics Solution at Singularities

    Authors: Andreas Mueller

    Abstract: Near kinematic singularities of a serial manipulator, the inverse kinematics (IK) problem becomes ill-conditioned, which poses computational problems for the numerical solution. Computational methods to tackle this issue are based on various forms of a pseudoinverse (PI) solution to the velocity IK problem. The damped least squares (DLS) method provides a robust solution with controllable converge… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

    Journal ref: In: Lenarcic, J., Husty, M. (eds) Advances in Robot Kinematics 2024. ARK 2024. Springer Proceedings in Advanced Robotics, vol 31. Springer, Cham

  23. Dynamics of Parallel Manipulators with Hybrid Complex Limbs -- Modular Modeling and Parallel Computing

    Authors: Andreas Mueller

    Abstract: Parallel manipulators, also called parallel kinematics machines (PKM), enable robotic solutions for highly dynamic handling and machining applications. The safe and accurate design and control necessitates high-fidelity dynamics models. Such modeling approaches have already been presented for PKM with simple limbs (i.e. each limb is a serial kinematic chain). A systematic modeling approach for PKM… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Journal ref: Mechanism and Machine Theory, Volume 167, January 2022

  24. arXiv:2412.13638  [pdf, other

    cs.RO math.DS math.GR physics.app-ph

    A Constraint Embedding Approach for Dynamics Modeling of Parallel Kinematic Manipulators with Hybrid Limbs

    Authors: Andreas Mueller

    Abstract: Parallel kinematic manipulators (PKM) are characterized by closed kinematic loops, due to the parallel arrangement of limbs but also due to the existence of kinematic loops within the limbs. Moreover, many PKM are built with limbs constructed by serially combining kinematic loops. Such limbs are called hybrid, which form a particular class of complex limbs. Design and model-based control requires… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Journal ref: Robotics and Autonomous Systems, Volume 155, September 2022

  25. arXiv:2412.05353  [pdf, other

    cs.CL

    Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models

    Authors: Michael Hanna, Aaron Mueller

    Abstract: Autoregressive transformer language models (LMs) possess strong syntactic abilities, often successfully handling phenomena from agreement to NPI licensing. However, the features they use to incrementally process language inputs are not well understood. In this paper, we fill this gap by studying the mechanisms underlying garden path sentence processing in LMs. We ask: (1) Do LMs use syntactic feat… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: Code and data available at https://github.com/hannamw/GP-mechanisms

  26. arXiv:2412.05149  [pdf, other

    cs.CL

    Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

    Authors: Michael Y. Hu, Aaron Mueller, Candace Ross, Adina Williams, Tal Linzen, Chengxu Zhuang, Ryan Cotterell, Leshem Choshen, Alex Warstadt, Ethan Gotlieb Wilcox

    Abstract: The BabyLM Challenge is a community effort to close the data-efficiency gap between human and computational language learners. Participants compete to optimize language model training on a fixed language data budget of 100 million words or less. This year, we released improved text corpora, as well as a vision-and-language corpus to facilitate research into cognitively plausible vision language mo… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  27. arXiv:2412.03283  [pdf, ps, other

    cs.CR cs.AI cs.CV

    Black-Box Forgery Attacks on Semantic Watermarks for Diffusion Models

    Authors: Andreas Müller, Denis Lukovnikov, Jonas Thietke, Asja Fischer, Erwin Quiring

    Abstract: Integrating watermarking into the generation process of latent diffusion models (LDMs) simplifies detection and attribution of generated content. Semantic watermarks, such as Tree-Rings and Gaussian Shading, represent a novel class of watermarking techniques that are easy to implement and highly robust against various perturbations. However, our work demonstrates a fundamental security vulnerabili… ▽ More

    Submitted 7 June, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: CVPR 2025

    Journal ref: Proc. IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 2025, pp. 20937-20946

  28. arXiv:2411.09826  [pdf, other

    cs.CL

    Evaluating Gender Bias in Large Language Models

    Authors: Michael Döll, Markus Döhring, Andreas Müller

    Abstract: Gender bias in artificial intelligence has become an important issue, particularly in the context of language models used in communication-oriented applications. This study examines the extent to which Large Language Models (LLMs) exhibit gender bias in pronoun selection in occupational contexts. The analysis evaluates the models GPT-4, GPT-4o, PaLM 2 Text Bison and Gemini 1.0 Pro using a self-gen… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: 13 pages, 12 figures, 1 table

  29. arXiv:2410.22590  [pdf, other

    cs.CL

    Characterizing the Role of Similarity in the Property Inferences of Language Models

    Authors: Juan Diego Rodriguez, Aaron Mueller, Kanishka Misra

    Abstract: Property inheritance -- a phenomenon where novel properties are projected from higher level categories (e.g., birds) to lower level ones (e.g., sparrows) -- provides a unique window into how humans organize and deploy conceptual knowledge. It is debated whether this ability arises due to explicitly stored taxonomic knowledge vs. simple computations of similarity between mental representations. How… ▽ More

    Submitted 9 March, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: Published at NAACL 2025

  30. arXiv:2410.21272  [pdf, other

    cs.CL

    Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

    Authors: Yaniv Nikankin, Anja Reusch, Aaron Mueller, Yonatan Belinkov

    Abstract: Do large language models (LLMs) solve reasoning tasks by learning robust generalizable algorithms, or do they memorize training data? To investigate this question, we use arithmetic reasoning as a representative task. Using causal analysis, we identify a subset of the model (a circuit) that explains most of the model's behavior for basic arithmetic logic and examine its functionality. By zooming i… ▽ More

    Submitted 20 May, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

    MSC Class: 68T5 ACM Class: I.2.7

  31. arXiv:2410.14463  [pdf, ps, other

    quant-ph cs.DM math.SG

    An abstract structure determines the contextuality degree of observable-based Kochen-Specker proofs

    Authors: Axel Muller, Alain Giorgetti

    Abstract: This article delves into the concept of quantum contextuality, specifically focusing on proofs of the Kochen-Specker theorem obtained by assigning Pauli observables to hypergraph vertices satisfying a given commutation relation. The abstract structure composed of this hypergraph and the graph of anticommutations is named a hypergram. Its labelings with Pauli observables generalize the well-known m… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 18 pages, 3 figures, 1 table

  32. arXiv:2410.06029  [pdf, other

    quant-ph cs.CR

    Unclonable Functional Encryption

    Authors: Arthur Mehta, Anne Müller

    Abstract: In a functional encryption (FE) scheme, a user that holds a ciphertext and a function key can learn the result of applying the function to the plaintext message. Security requires that the user does not learn anything beyond the function evaluation. We extend this notion to the quantum setting by providing definitions and a construction for a quantum functional encryption (QFE) scheme which allows… ▽ More

    Submitted 14 March, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

  33. arXiv:2410.04560  [pdf, other

    cs.LG stat.ML

    GAMformer: In-Context Learning for Generalized Additive Models

    Authors: Andreas Mueller, Julien Siems, Harsha Nori, David Salinas, Arber Zela, Rich Caruana, Frank Hutter

    Abstract: Generalized Additive Models (GAMs) are widely recognized for their ability to create fully interpretable machine learning models for tabular data. Traditionally, training GAMs involves iterative learning algorithms, such as splines, boosted trees, or neural networks, which refine the additive components through repeated error reduction. In this paper, we introduce GAMformer, the first method to le… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 20 pages, 12 figures

  34. arXiv:2409.11933  [pdf, other

    cs.LG

    Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling

    Authors: Arthur Müller, Lukas Vollenkemper

    Abstract: The integration of Reinforcement Learning (RL) with heuristic methods is an emerging trend for solving optimization problems, which leverages RL's ability to learn from the data generated during the search process. One promising approach is to train an RL agent as an improvement heuristic, starting with a suboptimal solution that is iteratively improved by applying small changes. We apply this app… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: This paper was accepted at the ICMLA 2024

  35. arXiv:2408.09841  [pdf, other

    cs.AI

    Demystifying Reinforcement Learning in Production Scheduling via Explainable AI

    Authors: Daniel Fischer, Hannah M. Hüsener, Felix Grumbach, Lukas Vollenkemper, Arthur Müller, Pascal Reusch

    Abstract: Deep Reinforcement Learning (DRL) is a frequently employed technique to solve scheduling problems. Although DRL agents ace at delivering viable results in short computing times, their reasoning remains opaque. We conduct a case study where we systematically apply two explainable AI (xAI) frameworks, namely SHAP (DeepSHAP) and Captum (Input x Gradient), to describe the reasoning behind scheduling d… ▽ More

    Submitted 30 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  36. arXiv:2408.01416  [pdf, other

    cs.LG cs.AI

    The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability

    Authors: Aaron Mueller, Jannik Brinkmann, Millicent Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov

    Abstract: Interpretability provides a toolset for understanding how and why neural networks behave in certain ways. However, there is little unity in the field: most studies employ ad-hoc evaluations and do not share theoretical foundations, making it difficult to measure progress and compare the pros and cons of different techniques. Furthermore, while mechanistic understanding is frequently discussed, the… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  37. arXiv:2407.19427  [pdf

    cs.CY cs.HC

    The influence of Automated Decision-Making systems in the context of street-level bureaucrats' practices

    Authors: Manuel Portela, A. Paula Rodriguez Müller, Luca Tangi

    Abstract: In an era of digital governance, the use of automation for individual and cooperative work is increasing in public administrations (Tangi et al., 2022). Despite the promises of efficiency and cost reduction, automation could bring new challenges to the governance schemes. Regional, national, and local governments are taking measures to regulate and measure the impact of automated decision-making s… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  38. arXiv:2407.18009  [pdf, other

    cs.RO

    Egocentric Robots in a Human-Centric World? Exploring Group-Robot-Interaction in Public Spaces

    Authors: Ana Müller, Anja Richert

    Abstract: The deployment of social robots in real-world scenarios is increasing, supporting humans in various contexts. However, they still struggle to grasp social dynamics, especially in public spaces, sometimes resulting in violations of social norms, such as interrupting human conversations. This behavior, originating from a limited processing of social norms, might be perceived as robot-centered. Under… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted at the workshop on advancing Group Understanding and robots' adaptive behavior (GROUND), held at the Robotics Science and Systems (RSS) Conference, 2024

  39. arXiv:2407.14561  [pdf, other

    cs.LG cs.AI

    NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

    Authors: Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau

    Abstract: We introduce NNsight and NDIF, technologies that work in tandem to enable scientific study of the representations and computations learned by very large neural networks. NNsight is an open-source system that extends PyTorch to introduce deferred remote execution. The National Deep Inference Fabric (NDIF) is a scalable inference service that executes NNsight requests, allowing users to share GPU re… ▽ More

    Submitted 1 April, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Code at https://nnsight.net

  40. arXiv:2407.04690  [pdf, other

    cs.LG cs.CL

    Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks

    Authors: Aaron Mueller

    Abstract: Interpretability research takes counterfactual theories of causality for granted. Most causal methods rely on counterfactual interventions to inputs or the activations of particular model components, followed by observations of the change in models' output logits or behaviors. While this yields more faithful evidence than correlational methods, counterfactuals nonetheless have key problems that bi… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  41. Is there an optimal choice of configuration space for Lie group integration schemes applied to constrained MBS?

    Authors: Andreas Mueller, Zdravko Terze

    Abstract: Recently various numerical integration schemes have been proposed for numerically simulating the dynamics of constrained multibody systems (MBS) operating. These integration schemes operate directly on the MBS configuration space considered as a Lie group. For discrete spatial mechanical systems there are two Lie group that can be used as configuration space: $SE\left( 3\right) $ and… ▽ More

    Submitted 18 June, 2024; originally announced July 2024.

    Journal ref: Proceedings of the ASME 2013 International Design Engineering Technical Conferences & Computers and Information in Engineering Conference, IDETC/CIE 2013, August 12-15, 2013, Portland, OR, USA

  42. arXiv:2407.02928  [pdf, other

    quant-ph cs.DM math-ph math.CO

    A new heuristic approach for contextuality degree estimates and its four- to six-qubit portrayals

    Authors: Axel Muller, Metod Saniga, Alain Giorgetti, Frédéric Holweck, Colm Kelleher

    Abstract: We introduce and describe a new heuristic method for finding an upper bound on the degree of contextuality and the corresponding unsatisfied part of a quantum contextual configuration with three-element contexts (i.e., lines) located in a multi-qubit symplectic polar space of order two. While the previously used method based on a SAT solver was limited to three qubits, this new method is much fast… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 35 pages, 14 figures

    MSC Class: 81P13 ACM Class: J.2

    Journal ref: J. Phys. A: Math. Theor. 58 (2025) 215302

  43. The significance of the configuration space Lie group for the constraint satisfaction in numerical time integration of multibody systems

    Authors: Andreas Mueller, Zdravko Terze

    Abstract: The dynamics simulation of multibody systems (MBS) using spatial velocities (non-holonomic velocities) requires time integration of the dynamics equations together with the kinematic reconstruction equations (relating time derivatives of configuration variables to rigid body velocities). The latter are specific to the geometry of the rigid body motion underlying a particular formulation, and thus… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: The significance of the configuration space Lie group for the constraint satisfaction in numerical time integration of multibody systems, Mechanism and Machine Theory, Vol. 82, 2014, pp. 173-202

  44. arXiv:2406.03348  [pdf, other

    cs.LG

    Position: A Call to Action for a Human-Centered AutoML Paradigm

    Authors: Marius Lindauer, Florian Karl, Anne Klier, Julia Moosbauer, Alexander Tornede, Andreas Mueller, Frank Hutter, Matthias Feurer, Bernd Bischl

    Abstract: Automated machine learning (AutoML) was formed around the fundamental objectives of automatically and efficiently configuring machine learning (ML) workflows, aiding the research of new ML algorithms, and contributing to the democratization of ML by making it accessible to a broader audience. Over the past decade, commendable achievements in AutoML have primarily focused on optimizing predictive p… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  45. arXiv:2406.02294  [pdf, other

    cs.LG

    Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling

    Authors: Arthur Müller, Felix Grumbach, Matthia Sabatelli

    Abstract: Production scheduling is an essential task in manufacturing, with Reinforcement Learning (RL) emerging as a key solution. In a previous work, RL was utilized to solve an extended permutation flow shop scheduling problem (PFSSP) for a real-world production line with two stages, linked by a central buffer. The RL agent was trained to sequence equallysized product batches to minimize setup efforts an… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: This paper was accepted at the ETFA 2024 conference

  46. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  47. arXiv:2404.06214  [pdf, other

    cs.CL

    [Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

    Authors: Leshem Choshen, Ryan Cotterell, Michael Y. Hu, Tal Linzen, Aaron Mueller, Candace Ross, Alex Warstadt, Ethan Wilcox, Adina Williams, Chengxu Zhuang

    Abstract: After last year's successful BabyLM Challenge, the competition will be hosted again in 2024/2025. The overarching goals of the challenge remain the same; however, some of the competition rules will be different. The big changes for this year's competition are as follows: First, we replace the loose track with a paper track, which allows (for example) non-model-based submissions, novel cognitively-… ▽ More

    Submitted 27 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  48. arXiv:2403.19647  [pdf, other

    cs.LG cs.AI cs.CL

    Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

    Authors: Samuel Marks, Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller

    Abstract: We introduce methods for discovering and applying sparse feature circuits. These are causally implicated subnetworks of human-interpretable features for explaining language model behaviors. Circuits identified in prior work consist of polysemantic and difficult-to-interpret units like attention heads or neurons, rendering them unsuitable for many downstream applications. In contrast, sparse featur… ▽ More

    Submitted 27 March, 2025; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Code and data at https://github.com/saprmarks/feature-circuits. Demonstration at https://feature-circuits.xyz

    Journal ref: International Conference on Learning Representations, 2025

  49. arXiv:2403.18587  [pdf, other

    cs.CR cs.CV cs.LG

    The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision

    Authors: Andreas Müller, Erwin Quiring

    Abstract: Resource efficiency plays an important role for machine learning nowadays. The energy and decision latency are two critical aspects to ensure a sustainable and practical application. Unfortunately, the energy consumption and decision latency are not robust against adversaries. Researchers have recently demonstrated that attackers can compute and submit so-called sponge examples at inference time t… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at the DLSP 2024

  50. arXiv:2403.09988  [pdf, other

    cs.RO

    Interactive Distance Field Mapping and Planning to Enable Human-Robot Collaboration

    Authors: Usama Ali, Lan Wu, Adrian Mueller, Fouad Sukkar, Tobias Kaupp, Teresa Vidal-Calleja

    Abstract: Human-robot collaborative applications require scene representations that are kept up-to-date and facilitate safe motions in dynamic scenes. In this letter, we present an interactive distance field mapping and planning (IDMP) framework that handles dynamic objects and collision avoidance through an efficient representation. We define interactive mapping and planning as the process of creating and… ▽ More

    Submitted 22 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.