Skip to main content

Showing 1–50 of 155 results for author: Hoffmann, J

.
  1. arXiv:2505.21410  [pdf, ps, other

    cs.AI cs.LG cs.RO

    MRSD: Multi-Resolution Skill Discovery for HRL Agents

    Authors: Shashank Sharma, Janina Hoffmann, Vinay Namboodiri

    Abstract: Hierarchical reinforcement learning (HRL) relies on abstract skills to solve long-horizon tasks efficiently. While existing skill discovery methods learns these skills automatically, they are limited to a single skill per task. In contrast, humans learn and use both fine-grained and coarse motor skills simultaneously. Inspired by human motor control, we propose Multi-Resolution Skill Discovery (MR… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  2. Proceedings 16th International Workshop on Programming Language Approaches to Concurrency and Communication-cEntric Software

    Authors: Farzaneh Derakhshan, Jan Hoffmann

    Abstract: This volume contains the proceedings of PLACES 2025, the 16th edition of the Workshop on Programming Language Approaches to Concurrency and Communication-cEntric Software. The workshop is scheduled to take place in Hamilton, Canada, on May 4, 2025, as a satellite event of ETAPS, the European Joint Conferences on Theory and Practice of Software. PLACES offers a forum for exchanging new ideas on how… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Journal ref: EPTCS 420, 2025

  3. arXiv:2505.01353  [pdf, other

    math.OC cs.AI cs.LG

    Differentiable Nonlinear Model Predictive Control

    Authors: Jonathan Frey, Katrin Baumgärtner, Gianluca Frison, Dirk Reinhardt, Jasper Hoffmann, Leonard Fichtner, Sebastien Gros, Moritz Diehl

    Abstract: The efficient computation of parametric solution sensitivities is a key challenge in the integration of learning-enhanced methods with nonlinear model predictive control (MPC), as their availability is crucial for many learning algorithms. While approaches presented in the machine learning community are limited to convex or unconstrained formulations, this paper discusses the computation of soluti… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

    Comments: 19 page, 4 figures, 2 tables

  4. arXiv:2505.00439  [pdf, other

    cs.LG cs.AI

    Per-Domain Generalizing Policies: On Validation Instances and Scaling Behavior

    Authors: Timo P. Gros, Nicola J. Müller, Daniel Fiser, Isabel Valera, Verena Wolf, Jörg Hoffmann

    Abstract: Recent work has shown that successful per-domain generalizing action policies can be learned. Scaling behavior, from small training instances to large test instances, is the key objective; and the use of validation instances larger than training instances is one key to achieve it. Prior work has used fixed validation sets. Here, we introduce a method generating the validation set dynamically, on t… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 7 pages, 3 tables, 3 figures, 3 algorithms

  5. arXiv:2504.01431  [pdf, other

    math.OC cs.CE cs.LG

    Multi-convex Programming for Discrete Latent Factor Models Prototyping

    Authors: Hao Zhu, Shengchao Yan, Jasper Hoffmann, Joschka Boedecker

    Abstract: Discrete latent factor models (DLFMs) are widely used in various domains such as machine learning, economics, neuroscience, psychology, etc. Currently, fitting a DLFM to some dataset relies on a customized solver for individual models, which requires lots of effort to implement and is limited to the targeted specific instance of DLFMs. In this paper, we propose a generic framework based on CVXPY,… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    MSC Class: 90C25 (Primary); 90C59; 90C90

  6. arXiv:2503.05662  [pdf, other

    stat.ML cs.LG

    On Mitigating Affinity Bias through Bandits with Evolving Biased Feedback

    Authors: Matthew Faw, Constantine Caramanis, Jessica Hoffmann

    Abstract: Unconscious bias has been shown to influence how we assess our peers, with consequences for hiring, promotions and admissions. In this work, we focus on affinity bias, the component of unconscious bias which leads us to prefer people who are similar to us, despite no deliberate intention of favoritism. In a world where the people hired today become part of the hiring committee of tomorrow, we are… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  7. arXiv:2503.03654  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset

    Authors: Jessica Hoffmann, Christiane Ahlheim, Zac Yu, Aria Walfrand, Jarvis Jin, Marie Tano, Ahmad Beirami, Erin van Liemt, Nithum Thain, Hakim Sidahmed, Lucas Dixon

    Abstract: This paper describes the construction of a dataset and the evaluation of training methods to improve generative large language models' (LLMs) ability to answer queries on sensitive topics with a Neutral Point of View (NPOV), i.e., to provide significantly more informative, diverse and impartial answers. The dataset, the SHQ-NPOV dataset, comprises 300 high-quality, human-written quadruplets: a que… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  8. Soft phonon and the central peak at the cubic-to-tetragonal phase transition in SrTiO$_3$

    Authors: Avishek Maity, Klaus Habicht, Michael Merz, Ayman H. Said, Christo Guguschev, Danny Kojda, Britta Ryll, Jan-Ekkehard Hoffmann, Andrea Dittmar, Thomas Keller, Frank Weber

    Abstract: The continuous displacive phase transition in SrTiO$_3$ near $T_c \approx 105$ K features a central elastic peak in neutron scattering investigations at temperatures above $T_c$, i.e., before the corresponding soft phonon mode is overdamped upon cooling. The origin of this central peak is still not understood. Here, we report an inelastic x-ray scattering investigation of the cubic-to-tetragonal p… ▽ More

    Submitted 29 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: Manuscript contains 9 pages, 4 figures. Supplementary information contains 31 pages, 17 figures, 3 tables

    Journal ref: Physical Review B, 111, 134108 (2025)

  9. arXiv:2502.12676  [pdf

    cond-mat.mtrl-sci

    A thin film source in a solid-state diffusion experiment: CoO on SrTiO3

    Authors: Qian Ma, Jan Erik Rybak, Natalie Jacqueline Ottinger, Timo Kassubek, Jörg Hoffmann, Karl-Michael Weitzel, Cynthia A. Volkert, Christian Jooss

    Abstract: To realize a chemical diffusion experiment for simple quantitative analysis of one-dimensional diffusion profiles requires the fabrication of a planar and chemically sharp interface between two phases, one serving as the diffusion source and the other as the material to be studied. We demonstrate a thin film source on top of single crystals or epitaxial films for the example of cobalt (II) oxide (… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  10. arXiv:2502.02133  [pdf, other

    eess.SY cs.AI cs.LG

    Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification

    Authors: Rudolf Reiter, Jasper Hoffmann, Dirk Reinhardt, Florian Messerer, Katrin Baumgärtner, Shamburaj Sawant, Joschka Boedecker, Moritz Diehl, Sebastien Gros

    Abstract: The fields of MPC and RL consider two successful control techniques for Markov decision processes. Both approaches are derived from similar fundamental principles, and both are widely used in practical applications, including robotics, process control, energy systems, and autonomous driving. Despite their similarities, MPC and RL follow distinct paradigms that emerged from diverse communities and… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

  11. arXiv:2502.01956  [pdf, other

    cs.RO cs.AI cs.LG

    DHP: Discrete Hierarchical Planning for Hierarchical Reinforcement Learning Agents

    Authors: Shashank Sharma, Janina Hoffmann, Vinay Namboodiri

    Abstract: Hierarchical Reinforcement Learning (HRL) agents often struggle with long-horizon visual planning due to their reliance on error-prone distance metrics. We propose Discrete Hierarchical Planning (DHP), a method that replaces continuous distance estimates with discrete reachability checks to evaluate subgoal feasibility. DHP recursively constructs tree-structured plans by decomposing long-term goal… ▽ More

    Submitted 27 May, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  12. arXiv:2501.16188  [pdf, other

    hep-lat

    $\bar{b}\bar{b}ud$ Tetraquarks with $I(J^P)=0(1^-)$ and $\bar{b}\bar{c}ud$ Tetraquarks with $I(J^P)=0(0^+)$ and $I(J^P)=0(1^+)$ from Lattice QCD Antistatic-Antistatic Potentials

    Authors: Jakob Hoffmann, Lasse Müller, Marc Wagner

    Abstract: We study heavy spin effects in $\bar{b}\bar{b}ud$ and $\bar{b}\bar{c}ud$ four-quark systems using the Born-Oppenheimer approximation and existing antistatic-antistatic potentials computed with lattice QCD. We report about a recent refined investigation of the $\bar{b}\bar{b}ud$ system with $I(J^P)=0(1^-)$, where we predicted a tetraquark resonance slightly above the $B^{*}B^{*}$ threshold. Further… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 9 pages, 2 figures

  13. Prediction of an $I(J^{P})=0(1^{-})$ $\bar{b}\bar{b}ud$ Tetraquark Resonance Close to the $B^\ast B^\ast$ Threshold Using Lattice QCD Potentials

    Authors: Jakob Hoffmann, Marc Wagner

    Abstract: We use antistatic-antistatic potentials computed with lattice QCD and a coupled-channel Born-Oppenheimer approach to explore the existence of a $\bar{b} \bar{b} u d$ tetraquark resonance with quantum numbers $I(J^P) = 0(1^-)$. A pole in the $\mbox{T}$ matrix signals a resonance with mass $m = 2 m_B + 94.0^{+1.3}_{-5.4} \, \text{MeV}$ and decay width $Γ= 140^{+86}_{-66} \, \text{MeV}$, i.e. very cl… ▽ More

    Submitted 21 March, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: 20 pages, 6 figures

    Journal ref: Phys. Rev. D 111 (2025), 054507

  14. Comparison of Impedance Matching Networks for Scanning Microwave Microscopy

    Authors: Johannes Hoffmann, Sophie de Preville, Bruno Eckmann, Hung-Ju Lin, Benedikt Herzog, Kamel Haddadi, Didier Theron, Georg Gramse, Damien Richert, Jose Moran-Meza, Francois Piquemal

    Abstract: In this paper, a definition of the gain and added noise of impedance matching networks for scanning microwave microscopy is given. This definition can be used to compare different impedance matching techniques independently of the instrument used to measure the S-parameter. As a demonstration, impedance matching devices consisting of a Beatty line, a tuner, and interferometric setups with and with… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

    Comments: IEEE Transactions on Instrumentation and Measurement (2024)

  15. arXiv:2408.06876  [pdf, other

    cs.AI cs.RO

    Decision-Focused Learning to Predict Action Costs for Planning

    Authors: Jayanta Mandi, Marco Foschini, Daniel Holler, Sylvie Thiebaux, Jorg Hoffmann, Tias Guns

    Abstract: In many automated planning applications, action costs can be hard to specify. An example is the time needed to travel through a certain road segment, which depends on many factors, such as the current weather conditions. A natural way to address this issue is to learn to predict these parameters based on input features (e.g., weather forecasts) and use the predicted action costs in automated plann… ▽ More

    Submitted 26 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  16. arXiv:2408.05937  [pdf, other

    astro-ph.HE astro-ph.CO

    The impact of the FREDDA dedispersion algorithm on $H_0$ estimations with FRBs

    Authors: Jordan Hoffmann, Clancy W. James, Hao Qiu, Marcin Glowacki, Keith W. Bannister, Vivek Gupta, Jason X. Prochaska, Apurba Bera, Adam T. Deller, Kelly Gourdji, Lachlan Marnoch, Stuart D. Ryder, Danica R. Scott, Ryan M. Shannon, Nicolas Tejos

    Abstract: Fast radio bursts (FRBs) are transient radio signals of extragalactic origins that are subjected to propagation effects such as dispersion and scattering. It follows then that these signals hold information regarding the medium they have traversed and are hence useful as cosmological probes of the Universe. Recently, FRBs were used to make an independent measure of the Hubble Constant $H_0$, promi… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures, Published in MNRAS

  17. arXiv:2408.04878  [pdf, other

    astro-ph.CO astro-ph.HE

    Modelling DSA, FAST and CRAFT surveys in a z-DM analysis and constraining a minimum FRB energy

    Authors: Jordan Hoffmann, Clancy W. James, Marcin Glowacki, Jason X. Prochaska, Alexa C. Gordon, Adam T. Deller, Ryan M. Shannon, Stuart D. Ryder

    Abstract: Fast radio burst (FRB) science primarily revolves around two facets: the origin of these bursts and their use in cosmological studies. This work follows from previous redshift-dispersion measure ($z$-DM) analyses in which we model instrumental biases and simultaneously fit population parameters and cosmological parameters to the observed population of FRBs. This sheds light on both the progenitors… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 17 pages, 7 figures, submitted to PASA

  18. arXiv:2406.03995  [pdf, other

    eess.SY cs.AI

    AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control

    Authors: Rudolf Reiter, Andrea Ghezzi, Katrin Baumgärtner, Jasper Hoffmann, Robert D. McAllister, Moritz Diehl

    Abstract: \Ac{MPC} and \ac{RL} are two powerful control strategies with, arguably, complementary advantages. In this work, we show how actor-critic \ac{RL} techniques can be leveraged to improve the performance of \ac{MPC}. The \ac{RL} critic is used as an approximation of the optimal value function, and an actor roll-out provides an initial guess for primal variables of the \ac{MPC}. A parallel control arc… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  19. arXiv:2405.21008  [pdf, other

    gr-qc

    Continuation of Bianchi Spacetimes Through The Big Bang

    Authors: Josh Hoffmann, David Sloan

    Abstract: In this paper we present a framework in which the relational description of General Relativity can be used to smoothly continue cosmological dynamical systems through the Big Bang without invoking quantum gravity effects. Cosmological spacetimes contain as a key dynamical variable a notion of scale through the volume factor $ν$. However no cosmological observer is ever able to separate their measu… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 52 pages, 38 figures

  20. arXiv:2405.08178  [pdf, ps, other

    gr-qc math-ph

    A Theoretical Framework for Self-Gravitating k-Form Boson Stars with Internal Symmetries

    Authors: Jakob Hoffmann, Cédric Jockel

    Abstract: Current boson star models are largely restricted to global symmetries and lower spin fields. In this work, we generalize these systems of self-gravitating bosonic fields to allow for arbitrary totally antisymmetric tensor fields and arbitrary internal gauge symmetries. We construct a generalized formalism for Yang-Mills-like theories, which allows for arbitrary k-form fields, instead of just vecto… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 58 pages including appendix, both authors are first authors

  21. arXiv:2405.02598  [pdf, other

    cs.LG

    UDUC: An Uncertainty-driven Approach for Learning-based Robust Control

    Authors: Yuan Zhang, Jasper Hoffmann, Joschka Boedecker

    Abstract: Learning-based techniques have become popular in both model predictive control (MPC) and reinforcement learning (RL). Probabilistic ensemble (PE) models offer a promising approach for modelling system dynamics, showcasing the ability to capture uncertainty and scalability in high-dimensional control scenarios. However, PE models are susceptible to mode collapse, resulting in non-robust control whe… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  22. arXiv:2404.18863  [pdf, other

    cs.RO math.OC

    PlanNetX: Learning an Efficient Neural Network Planner from MPC for Longitudinal Control

    Authors: Jasper Hoffmann, Diego Fernandez, Julien Brosseit, Julian Bernhard, Klemens Esterle, Moritz Werling, Michael Karg, Joschka Boedecker

    Abstract: Model predictive control (MPC) is a powerful, optimization-based approach for controlling dynamical systems. However, the computational complexity of online optimization can be problematic on embedded devices. Especially, when we need to guarantee fixed control frequencies. Thus, previous work proposed to reduce the computational burden using imitation learning (IL) approximating the MPC policy by… ▽ More

    Submitted 22 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 6th Annual Learning for Dynamics & Control Conference (L4DC 2024)

  23. arXiv:2403.10704  [pdf, other

    cs.LG cs.AI cs.CL

    Parameter Efficient Reinforcement Learning from Human Feedback

    Authors: Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Simral Chaudhary, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) effectively aligns pretrained Large Language and Vision-Language Models (LLMs, and VLMs) with human preferences, its computational cost and complexity hamper its wider adoption. To alleviate some of the computational burden of fine-tuning, parameter efficient methods, like LoRA were introduced. In this work, we empirically evaluate the setup… ▽ More

    Submitted 12 September, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  24. arXiv:2403.08904  [pdf, other

    cs.CL

    Detecting Hallucination and Coverage Errors in Retrieval Augmented Generation for Controversial Topics

    Authors: Tyler A. Chang, Katrin Tomanek, Jessica Hoffmann, Nithum Thain, Erin van Liemt, Kathleen Meier-Hellstern, Lucas Dixon

    Abstract: We explore a strategy to handle controversial topics in LLM-based chatbots based on Wikipedia's Neutral Point of View (NPOV) principle: acknowledge the absence of a single true answer and surface multiple perspectives. We frame this as retrieval augmented generation, where perspectives are retrieved from a knowledge base and the LLM is tasked with generating a fluent and faithful response from the… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  25. arXiv:2402.02992  [pdf, other

    cs.LG cs.AI cs.CL

    Decoding-time Realignment of Language Models

    Authors: Tianlin Liu, Shangmin Guo, Leonardo Bianco, Daniele Calandriello, Quentin Berthet, Felipe Llinares, Jessica Hoffmann, Lucas Dixon, Michal Valko, Mathieu Blondel

    Abstract: Aligning language models with human preferences is crucial for reducing errors and biases in these models. Alignment techniques, such as reinforcement learning from human feedback (RLHF), are typically cast as optimizing a tradeoff between human preference rewards and a proximity regularization term that encourages staying close to the unaligned model. Selecting an appropriate level of regularizat… ▽ More

    Submitted 24 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  26. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  27. arXiv:2312.11270  [pdf, other

    physics.med-ph

    Modelling the Lymphatic Metastatic Progression Pathways of OPSCC from Multi-Institutional Datasets

    Authors: Roman Ludwig, Adrian Schubert, Dorothea Barbatei, Lauence Bauwens, Jean-Marc Hoffmann, Sandrine Werlen, Olgun Elicin, Matthias Dettmer, Philippe Zrounba, Bertrand Pouymayou, Panagiotis Balermpas, Vincent Grégoire, Roland Giger, Jan Unkelbach

    Abstract: The elective clinical target volume (CTV-N) in oropharyngeal squamous cell carcinoma (OPSCC) is currently based mostly on the prevalence of lymph node metastases in different lymph node levels (LNLs) for a given primary tumor location. We present a probabilistic model for ipsilateral lymphatic spread that can quantify the microscopic nodal involvement risk based on an individual patient's T-catego… ▽ More

    Submitted 21 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 17 pages, 12 figures, 7 tables, submitted to Physics in Medicine and Biology

  28. arXiv:2311.09830  [pdf, other

    cs.AI cs.CL

    Automating the Generation of Prompts for LLM-based Action Choice in PDDL Planning

    Authors: Katharina Stein, Daniel Fišer, Jörg Hoffmann, Alexander Koller

    Abstract: Large language models (LLMs) have revolutionized a large variety of NLP tasks. An active debate is to what extent they can do reasoning and planning. Prior work has assessed the latter in the specific context of PDDL planning, based on manually converting three PDDL domains into natural language (NL) prompts. Here we automate this conversion step, showing how to leverage an LLM to automatically ge… ▽ More

    Submitted 2 May, 2025; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Extended version of the paper from the ICAPS'25 proceedings (same main part + additional appendix)

  29. arXiv:2310.10199  [pdf, other

    eess.IV

    Impact of Data Synthesis Strategies for the Classification of Craniosynostosis

    Authors: Matthias Schaufelberger, Reinald Peter Kühle, Andreas Wachter, Frederic Weichel, Niclas Hagen, Friedemann Ringwald, Urs Eisenmann, Jürgen Hoffmann, Michael Engel, Christian Freudlsperger, Werner Nahm

    Abstract: Introduction: Photogrammetric surface scans provide a radiation-free option to assess and classify craniosynostosis. Due to the low prevalence of craniosynostosis and high patient restrictions, clinical data is rare. Synthetic data could support or even replace clinical data for the classification of craniosynostosis, but this has never been studied systematically. Methods: We test the combination… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  30. arXiv:2309.08042  [pdf, other

    cs.CV cs.AI

    Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved

    Authors: Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan Auer, Xiao Xiang Zhu

    Abstract: Crowdsourced platforms provide huge amounts of street-view images that contain valuable building information. This work addresses the challenges in applying Scene Text Recognition (STR) in crowdsourced street-view images for building attribute mapping. We use Flickr images, particularly examining texts on building facades. A Berlin Flickr dataset is created, and pre-trained STR models are used for… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  31. Worst-Case Input Generation for Concurrent Programs under Non-Monotone Resource Metrics

    Authors: Long Pham, Jan Hoffmann

    Abstract: Worst-case input generation aims to automatically generate inputs that exhibit the worst-case performance of programs. It has several applications, and can, for example, detect vulnerabilities to denial-of-service (DoS) attacks. However, it is non-trivial to generate worst-case inputs for concurrent programs, particularly for resources like memory where the peak cost depends on how processes are s… ▽ More

    Submitted 21 December, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Journal ref: Logical Methods in Computer Science, Volume 20, Issue 4 (December 24, 2024) lmcs:12242

  32. arXiv:2308.15470  [pdf, other

    cs.LG

    Policy composition in reinforcement learning via multi-objective policy optimization

    Authors: Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin Riedmiller, Abbas Abdolmaleki, Doina Precup

    Abstract: We enable reinforcement learning agents to learn successful behavior policies by utilizing relevant pre-existing teacher policies. The teacher policies are introduced as objectives, in addition to the task objective, in a multi-objective policy optimization setting. Using the Multi-Objective Maximum a Posteriori Policy Optimization algorithm (Abdolmaleki et al. 2020), we show that teacher policies… ▽ More

    Submitted 30 August, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  33. Computing the noncommutative inner rank by means of operator-valued free probability theory

    Authors: Johannes Hoffmann, Tobias Mai, Roland Speicher

    Abstract: We address the noncommutative version of the Edmonds' problem, which asks to determine the inner rank of a matrix in noncommuting variables. We provide an algorithm for the calculation of this inner rank by relating the problem with the distribution of a basic object in free probability theory, namely operator-valued semicircular elements. We have to solve a matrix-valued quadratic equation, for w… ▽ More

    Submitted 28 June, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: In the second version we have not only improved the presentation of the results, but we supply in addition now actually also a certificate for the termination of our algorithm (this relies on recent theoretical results in the paper arxiv.org/abs/2406.15922)

    MSC Class: 46L54; 65J15; 12E15

    Journal ref: Found Comput Math (2024)

  34. arXiv:2308.02041  [pdf

    cs.CY cs.AI

    Regulating AI: Applying insights from behavioural economics and psychology to the application of article 5 of the EU AI Act

    Authors: Huixin Zhong, Eamonn O'Neill, Janina A. Hoffmann

    Abstract: Article 5 of the European Union's Artificial Intelligence Act is intended to regulate AI use to prevent potentially harmful consequences. Nevertheless, applying this legislation practically is likely to be challenging because of ambiguously used terminologies and because it fails to specify which manipulation techniques may be invoked by AI, potentially leading to significant harm. This paper aims… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 July, 2023; originally announced August 2023.

    Comments: This paper was accepted for publication by AAAI 2024 paper on December of 2023

  35. arXiv:2305.17300  [pdf, other

    cs.NE cs.AI cs.LG

    Exploiting Large Neuroimaging Datasets to Create Connectome-Constrained Approaches for more Robust, Efficient, and Adaptable Artificial Intelligence

    Authors: Erik C. Johnson, Brian S. Robinson, Gautam K. Vallabha, Justin Joyce, Jordan K. Matelsky, Raphael Norman-Tenazas, Isaac Western, Marisel Villafañe-Delgado, Martha Cervantes, Michael S. Robinette, Arun V. Reddy, Lindsey Kitchell, Patricia K. Rivlin, Elizabeth P. Reilly, Nathan Drenkow, Matthew J. Roos, I-Jeng Wang, Brock A. Wester, William R. Gray-Roncal, Joan A. Hoffmann

    Abstract: Despite the progress in deep learning networks, efficient learning at the edge (enabling adaptable, low-complexity machine learning solutions) remains a critical need for defense and commercial applications. We envision a pipeline to utilize large neuroimaging datasets, including maps of the brain which capture neuron and synapse connectivity, to improve machine learning approaches. We have pursue… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 11 pages, 4 figures

  36. arXiv:2304.13627  [pdf, ps, other

    cs.PL cs.LO

    Automatic Amortized Resource Analysis with Regular Recursive Types

    Authors: Jessie Grosen, David M. Kahn, Jan Hoffmann

    Abstract: The goal of automatic resource bound analysis is to statically infer symbolic bounds on the resource consumption of the evaluation of a program. A longstanding challenge for automatic resource analysis is the inference of bounds that are functions of complex custom data structures. This article builds on type-based automatic amortized resource analysis (AARA) to address this challenge. AARA is bas… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 15 pages, 5 figures; to be published in LICS'23

  37. arXiv:2302.06541  [pdf, other

    cs.CL

    Towards Agile Text Classifiers for Everyone

    Authors: Maximilian Mozes, Jessica Hoffmann, Katrin Tomanek, Muhamed Kouate, Nithum Thain, Ann Yuan, Tolga Bolukbasi, Lucas Dixon

    Abstract: Text-based safety classifiers are widely used for content moderation and increasingly to tune generative language model behavior - a topic of growing concern for the safety of digital assistants and chatbots. However, different policies require different classifiers, and safety policies themselves improve from iteration and adaptation. This paper introduces and evaluates methods for agile text cla… ▽ More

    Submitted 21 October, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2023

  38. arXiv:2212.01607  [pdf, other

    cs.RO eess.SY

    A Hierarchical Approach for Strategic Motion Planning in Autonomous Racing

    Authors: Rudolf Reiter, Jasper Hoffmann, Joschka Boedecker, Moritz Diehl

    Abstract: We present an approach for safe trajectory planning, where a strategic task related to autonomous racing is learned sample-efficient within a simulation environment. A high-level policy, represented as a neural network, outputs a reward specification that is used within the cost function of a parametric nonlinear model predictive controller (NMPC). By including constraints and vehicle kinematics… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

  39. arXiv:2211.15765  [pdf, other

    hep-lat hep-ph

    Inclusion of heavy spin effects in the $u d \bar{b} \bar{b}$ $I(J^{P})=0(1^{-})$ four-quark channel in the Born-Oppenheimer approximation

    Authors: Jakob Hoffmann, André Zimermmane-Santos, Marc Wagner

    Abstract: We refine our previous study of a $u d \bar{b} \bar{b}$ tetraquark resonance with quantum numbers $I(J^{P})=0(1^{-})$, which is based on antiheavy-antiheavy lattice QCD potentials, by including heavy quark spin effects via the mass difference of the $B$ and the $B^{*}$ meson. This leads to a coupled channel Schrödinger equation, where the two channels correspond to $BB$ and $B^{*}B^{*}$, respectiv… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 9 pages, 4 figures, talk given at "The 39th International Symposium on Lattice Field Theory", 08th-13th August 2022, Bonn, Germany

  40. arXiv:2211.00543  [pdf

    cs.CV

    Geo-Information Harvesting from Social Media Data

    Authors: Xiao Xiang Zhu, Yuanyuan Wang, Mrinalini Kochupillai, Martin Werner, Matthias Häberle, Eike Jens Hoffmann, Hannes Taubenböck, Devis Tuia, Alex Levering, Nathan Jacobs, Anna Kruspe, Karam Abdulahhad

    Abstract: As unconventional sources of geo-information, massive imagery and text messages from open platforms and social media form a temporally quasi-seamless, spatially multi-perspective stream, but with unknown and diverse quality. Due to its complementarity to remote sensing data, geo-information from these sources offers promising perspectives, but harvesting is not trivial due to its data characterist… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted for publication IEEE Geoscience and Remote Sensing Magazine

  41. Regularization of Single Field Inflation Models

    Authors: Josh Hoffmann, David Sloan

    Abstract: There are many single field inflationary models that are consistent with the recent Planck 2018 measurements of the spectral index $n_s$ and tensor-to-scalar ratio $r$. Despite good agreement with observational data some of these models suffer from having unregularized potentials which would produce a collapsing universe shortly after the end of inflation. In this paper we show that how one choose… ▽ More

    Submitted 23 November, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 25 pages, 25 figures

  42. arXiv:2206.06054  [pdf, other

    cs.LG cs.SE

    Specifying and Testing $k$-Safety Properties for Machine-Learning Models

    Authors: Maria Christakis, Hasan Ferit Eniser, Jörg Hoffmann, Adish Singla, Valentin Wüstholz

    Abstract: Machine-learning models are becoming increasingly prevalent in our lives, for instance assisting in image-classification or decision-making tasks. Consequently, the reliability of these models is of critical importance and has resulted in the development of numerous approaches for validating and verifying their robustness and fairness. However, beyond such specific properties, it is challenging to… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  43. arXiv:2204.08524  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    So2Sat POP -- A Curated Benchmark Data Set for Population Estimation from Space on a Continental Scale

    Authors: Sugandha Doda, Yuanyuan Wang, Matthias Kahl, Eike Jens Hoffmann, Kim Ouan, Hannes Taubenböck, Xiao Xiang Zhu

    Abstract: Obtaining a dynamic population distribution is key to many decision-making processes such as urban planning, disaster management and most importantly helping the government to better allocate socio-technical supply. For the aspiration of these objectives, good population data is essential. The traditional method of collecting population data through the census is expensive and tedious. In recent y… ▽ More

    Submitted 10 November, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  44. arXiv:2203.15556  [pdf, other

    cs.CL cs.LG

    Training Compute-Optimal Large Language Models

    Authors: Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre

    Abstract: We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large language models are significantly undertrained, a consequence of the recent focus on scaling language models whilst keeping the amount of training data constant. By training over 400 language models ranging from 70 million to over 16 billion… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  45. arXiv:2203.09361  [pdf, other

    cs.AI cs.CC cs.LO

    Expressivity of Planning with Horn Description Logic Ontologies (Technical Report)

    Authors: Stefan Borgwardt, Jörg Hoffmann, Alisa Kovtunova, Markus Krötzsch, Bernhard Nebel, Marcel Steinmetz

    Abstract: State constraints in AI Planning globally restrict the legal environment states. Standard planning languages make closed-domain and closed-world assumptions. Here we address open-world state constraints formalized by planning over a description logic (DL) ontology. Previously, this combination of DL and planning has been investigated for the light-weight DL DL-Lite. Here we propose a novel compila… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 16 pages with appendix

    MSC Class: 68 ACM Class: I.2.4; I.2.8

  46. arXiv:2202.07315  [pdf, other

    cs.CV

    Using Social Media Images for Building Function Classification

    Authors: Eike Jens Hoffmann, Karam Abdulahhad, Xiao Xiang Zhu

    Abstract: Urban land use on a building instance level is crucial geo-information for many applications, yet difficult to obtain. An intuitive approach to close this gap is predicting building functions from ground level imagery. Social media image platforms contain billions of images, with a large variety of motifs including but not limited to street perspectives. To cope with this issue this study proposes… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  47. arXiv:2202.01169  [pdf, other

    cs.CL cs.LG

    Unified Scaling Laws for Routed Language Models

    Authors: Aidan Clark, Diego de las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew Johnson, Katie Millican, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Jack Rae, Erich Elsen, Koray Kavukcuoglu , et al. (1 additional authors not shown)

    Abstract: The performance of a language model has been shown to be effectively modeled as a power-law in its parameter count. Here we study the scaling behaviors of Routing Networks: architectures that conditionally use only a subset of their parameters while processing an input. For these models, parameter count and computational requirement form two independent axes along which an increase leads to better… ▽ More

    Submitted 9 February, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Fixing typos and affiliation clarity

  48. AT2019azh: an unusually long-lived, radio-bright thermal tidal disruption event

    Authors: A. J. Goodwin, S. van Velzen, J. C. A. Miller-Jones, A. Mummery, M. F. Bietenholz, A. Wederfoort, E. Hammerstein, C. Bonnerot, J. Hoffmann, L. Yan

    Abstract: Tidal disruption events (TDEs) occur when a star is destroyed by a supermassive black hole at the center of a galaxy, temporarily increasing the accretion rate onto the black hole and producing a bright flare across the electromagnetic spectrum. Radio observations of TDEs trace outflows and jets that may be produced. Radio detections of the outflows from TDEs are uncommon, with only about one thir… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: 17 pages, 8 figures. Submitted to MNRAS. Comments welcome!

  49. arXiv:2201.03288  [pdf, other

    eess.IV cs.CV

    A statistical shape model for radiation-free assessment and classification of craniosynostosis

    Authors: Matthias Schaufelberger, Reinald Peter Kühle, Andreas Wachter, Frederic Weichel, Niclas Hagen, Friedemann Ringwald, Urs Eisenmann, Jürgen Hoffmann, Michael Engel, Christian Freudlsperger, Werner Nahm

    Abstract: The assessment of craniofacial deformities requires patient data which is sparsely available. Statistical shape models provide realistic and synthetic data enabling comparisons of existing methods on a common dataset. We build the first publicly available statistical 3D head model of craniosynostosis patients and the first model focusing on infants younger than 1.5 years. We further present a sh… ▽ More

    Submitted 28 March, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  50. arXiv:2112.11446  [pdf, other

    cs.CL cs.AI

    Scaling Language Models: Methods, Analysis & Insights from Training Gopher

    Authors: Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor , et al. (55 additional authors not shown)

    Abstract: Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gop… ▽ More

    Submitted 21 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 120 pages