Showing 1–15 of 15 results for author: Huebotter, J

  1. arXiv:2505.19850

    cs.LG cs.AI cs.RO

    DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

    Authors: Leander Diaz-Bone, Marco Bagatella, Jonas Hübotter, Andreas Krause

    Abstract: Sparse-reward reinforcement learning (RL) can model a wide range of highly complex tasks. Solving sparse-reward tasks is RL's core premise - requiring efficient exploration coupled with long-horizon credit assignment - and overcoming these challenges is key for building self-improving agents with superhuman ability. We argue that solving complex and high-dimensional tasks requires solving simpler…

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2505.14136

    cs.LG cs.AI

    Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

    Authors: Ryo Bertolissi, Jonas Hübotter, Ido Hakimi, Andreas Krause

    Abstract: Mixture of expert (MoE) models are a promising approach to increasing model capacity without increasing inference cost, and are core components of many state-of-the-art language models. However, current MoE models typically use only few experts due to prohibitive training and inference cost. We propose Test-Time Model Merging (TTMM) which scales the MoE paradigm to an order of magnitude more exper…

    Submitted 20 May, 2025; originally announced May 2025.

  3. arXiv:2502.05244

    cs.AI cs.LG

    Probabilistic Artificial Intelligence

    Authors: Andreas Krause, Jonas Hübotter

    Abstract: Artificial intelligence commonly refers to the science and engineering of artificial systems that can carry out tasks generally associated with requiring aspects of human intelligence, such as playing games, translating languages, and driving cars. In recent years, there have been exciting advances in learning-based, data-driven approaches towards AI, and machine learning and deep learning have en…

    Submitted 7 February, 2025; originally announced February 2025.

  4. arXiv:2501.13535

    stat.ML cs.LG stat.CO stat.ME

    LITE: Efficiently Estimating Gaussian Probability of Maximality

    Authors: Nicolas Menet, Jonas Hübotter, Parnian Kassraie, Andreas Krause

    Abstract: We consider the problem of computing the probability of maximality (PoM) of a Gaussian random vector, i.e., the probability for each dimension to be maximal. This is a key challenge in applications ranging from Bayesian optimization to reinforcement learning, where the PoM not only helps with finding an optimal action, but yields a fine-grained analysis of the action domain, crucial in tasks such…

    Submitted 15 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

    Comments: accepted in AISTATS 2025

  5. arXiv:2411.17438

    cs.AI cs.CV cs.LG cs.NE

    Object-centric proto-symbolic behavioural reasoning from pixels

    Authors: Ruben van Bergen, Justus Hübotter, Pablo Lanillos

    Abstract: Autonomous intelligent agents must bridge computational challenges at disparate levels of abstraction, from the low-level spaces of sensory input and motor commands to the high-level domain of abstract reasoning and planning. A key question in designing such agents is how best to instantiate the representational space that will interface between these two levels -- ideally without requiring superv…

    Submitted 11 February, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    ACM Class: I.2.0; I.2.6; I.2.10

  6. arXiv:2410.08020

    cs.LG cs.AI

    Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs

    Authors: Jonas Hübotter, Sascha Bongni, Ido Hakimi, Andreas Krause

    Abstract: Recent efforts in fine-tuning language models often rely on automatic data selection, commonly using Nearest Neighbors retrieval from large datasets. However, we theoretically show that this approach tends to select redundant data, limiting its effectiveness or even hurting performance. To address this, we introduce SIFT, a data selection algorithm designed to reduce uncertainty about the model's…

    Submitted 8 February, 2025; v1 submitted 10 October, 2024; originally announced October 2024.

    Comments: accepted in ICLR 2025

  7. arXiv:2410.05026

    cs.LG cs.RO

    Active Multi-task Policy Fine-tuning

    Authors: Marco Bagatella, Jonas Hübotter, Georg Martius, Andreas Krause

    Abstract: Pre-trained generalist policies are rapidly gaining relevance in robot learning due to their promise of fast adaptation to novel, in-domain tasks. This adaptation often relies on collecting new demonstrations for a specific task of interest and applying imitation learning algorithms, such as behavioral cloning. However, as soon as several tasks need to be learned, we must decide which tasks should…

    Submitted 31 May, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

  8. arXiv:2402.15898

    cs.LG cs.AI

    Transductive Active Learning: Theory and Applications

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We study a generalization of classical active learning to real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assum…

    Submitted 8 February, 2025; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: accepted in NeurIPS 2024. arXiv admin note: text overlap with arXiv:2402.15441

  9. arXiv:2402.15441

    cs.LG cs.AI

    Active Few-Shot Fine-Tuning

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for information-based transductive learning, an approach which samples adaptively to maximize information gained…

    Submitted 21 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  10. arXiv:2310.19848

    cs.LG cs.RO math.OC

    Efficient Exploration in Continuous-time Model-based Reinforcement Learning

    Authors: Lenart Treven, Jonas Hübotter, Bhavya Sukhija, Florian Dörfler, Andreas Krause

    Abstract: Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents continuous-time dynamics using nonlinear ordinary differential equations (ODEs). We capture epistemic uncertainty using well-calibrated probabilistic models, and use t…

    Submitted 30 October, 2023; originally announced October 2023.

  11. arXiv:2306.07092

    cs.RO cs.AI

    Tuning Legged Locomotion Controllers via Safe Bayesian Optimization

    Authors: Daniel Widmer, Dongho Kang, Bhavya Sukhija, Jonas Hübotter, Andreas Krause, Stelian Coros

    Abstract: This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms. Our approach leverages a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system. This method substantially mitigates the risk of hazardou…

    Submitted 25 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted to the 2023 Conference on Robot Learning (CoRL 2023). The first two authors contributed equally. The supplementary video is available at https://youtu.be/zDBouUgegrU and the code implementation is available at https://github.com/lasgroup/gosafeopt

  12. arXiv:2211.11726

    cs.DS cs.DC math.CO

    A Cut-Matching Game for Constant-Hop Expanders

    Authors: Bernhard Haeupler, Jonas Huebotter, Mohsen Ghaffari

    Abstract: This paper extends and generalizes the well-known cut-matching game framework and provides a novel cut-strategy that produces constant-hop expanders. Constant-hop expanders are a significant strengthening of regular expanders with the additional guarantee that any demand can be (obliviously) routed along constant-hop flow-paths - in contrast to the $\Omega(\log n)$-hop paths in expanders. Cut-match…

    Submitted 28 October, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

  13. arXiv:2209.08033

    cs.RO cs.AI cs.LG eess.SY

    Learning Policies for Continuous Control via Transition Models

    Authors: Justus Huebotter, Serge Thill, Marcel van Gerven, Pablo Lanillos

    Abstract: It is doubtful that animals have perfect inverse models of their limbs (e.g., what muscle contraction must be applied to every joint to reach a particular location in space). However, in robot control, moving an arm's end-effector to a target position or along a target trajectory requires accurate forward and inverse models. Here we show that by learning the transition (forward) model from interac…

    Submitted 16 September, 2022; originally announced September 2022.

  14. arXiv:2109.11045

    cs.NE cs.LG

    Training Deep Spiking Auto-encoders without Bursting or Dying Neurons through Regularization

    Authors: Justus F. Hübotter, Pablo Lanillos, Jakub M. Tomczak

    Abstract: Spiking neural networks are a promising approach towards next-generation models of the brain in computational neuroscience. Moreover, compared to classic artificial neural networks, they could serve as an energy-efficient deployment of AI by enabling fast computation in specialized neuromorphic hardware. However, training deep spiking neural networks, especially in an unsupervised manner, is chall…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Under review

  15. arXiv:2108.09489

    cs.DS

    Implementation of Algorithms for Right-Sizing Data Centers

    Authors: Jonas Hübotter

    Abstract: The energy consumption of data centers assumes a significant fraction of the world's overall energy consumption. Most data centers are statically provisioned, leading to a very low average utilization of servers. In this work, we survey uni-dimensional and high-dimensional approaches for dynamically powering up and powering down servers to reduce the energy footprint of data centers while ensuring…

    Submitted 21 August, 2021; originally announced August 2021.