Skip to main content

Showing 1–9 of 9 results for author: Böther, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.19790  [pdf, other

    cs.LG cs.AI cs.DB

    Mixtera: A Data Plane for Foundation Model Training

    Authors: Maximilian Böther, Xiaozhe Yao, Tolga Kerimoglu, Dan Graur, Viktor Gsteiger, Ana Klimovic

    Abstract: State-of-the-art large language and vision models are trained over trillions of tokens that are aggregated from a large variety of sources. As training data collections grow, manually managing the samples becomes time-consuming, tedious, and prone to errors. Yet recent research shows that the data mixture and the order in which samples are visited during training can significantly influence model… ▽ More

    Submitted 3 April, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

    Comments: under submission

  2. arXiv:2402.16442  [pdf, other

    cs.LG cs.AI cs.CV cs.DC math.OC

    On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions

    Authors: Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingam

    Abstract: Modern datasets span billions of samples, making training on all available data infeasible. Selecting a high quality subset helps in reducing training costs and enhancing model quality. Submodularity, a discrete analogue of convexity, is commonly used for solving such subset selection problems. However, existing algorithms for optimizing submodular functions are sequential, and the prior distribut… ▽ More

    Submitted 3 April, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: accepted at MLSys 2025

  3. arXiv:2312.06254  [pdf, other

    cs.LG cs.AI cs.DB cs.DC stat.ML

    Modyn: Data-Centric Machine Learning Pipeline Orchestration

    Authors: Maximilian Böther, Ties Robroek, Viktor Gsteiger, Robin Holzinger, Xianzhe Ma, Pınar Tözün, Ana Klimovic

    Abstract: In real-world machine learning (ML) pipelines, datasets are continuously growing. Models must incorporate this new training data to improve generalization and adapt to potential distribution shifts. The cost of model retraining is proportional to how frequently the model is retrained and how much data it is trained on, which makes the naive approach of retraining from scratch each time impractical… ▽ More

    Submitted 24 January, 2025; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: final version published at SIGMOD'25; 30 pages

  4. arXiv:2208.02590  [pdf, other

    cs.DS

    Efficiently Computing Directed Minimum Spanning Trees

    Authors: Maximilian Böther, Otto Kißig, Christopher Weyand

    Abstract: Computing a directed minimum spanning tree, called arborescence, is a fundamental algorithmic problem, although not as common as its undirected counterpart. In 1967, Edmonds discussed an elegant solution. It was refined to run in $O(\min(n^2, m\log n))$ by Tarjan which is optimal for very dense and very sparse graphs. Gabow et al.~gave a version of Edmonds' algorithm that runs in $O(n\log n + m)$,… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  5. arXiv:2201.10494  [pdf, other

    cs.LG cs.AI math.OC

    What's Wrong with Deep Learning in Tree Search for Combinatorial Optimization

    Authors: Maximilian Böther, Otto Kißig, Martin Taraz, Sarel Cohen, Karen Seidel, Tobias Friedrich

    Abstract: Combinatorial optimization lies at the core of many real-world problems. Especially since the rise of graph neural networks (GNNs), the deep learning community has been developing solvers that derive solutions to NP-hard problems by learning the problem-specific solution structure. However, reproducing the results of these publications proves to be difficult. We make three contributions. First, we… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 25 pages, accepted for publication at ICLR 2022

  6. arXiv:2110.11984  [pdf, other

    cs.IR cs.CL cs.CY cs.SE cs.SI

    Law Smells: Defining and Detecting Problematic Patterns in Legal Drafting

    Authors: Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian Böther, Daniel Martin Katz

    Abstract: Building on the computer science concept of code smells, we initiate the study of law smells, i.e., patterns in legal texts that pose threats to the comprehensibility and maintainability of the law. With five intuitive law smells as running examples - namely, duplicated phrase, long element, large reference tree, ambiguous syntax, and natural language obsession -, we develop a comprehensive law sm… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 36 pages, 11 figures

  7. arXiv:2011.09866  [pdf, other

    cs.LO cs.CL cs.FL cs.LG

    Learning Languages with Decidable Hypotheses

    Authors: Julian Berger, Maximilian Böther, Vanja Doskoč, Jonathan Gadea Harder, Nicolas Klodt, Timo Kötzing, Winfried Lötzsch, Jannik Peters, Leon Schiller, Lars Seifert, Armin Wells, Simon Wietheger

    Abstract: In language learning in the limit, the most common type of hypothesis is to give an enumerator for a language. This so-called $W$-index allows for naming arbitrary computably enumerable languages, with the drawback that even the membership problem is undecidable. In this paper we use a different system which allows for naming arbitrary decidable languages, namely programs for characteristic functi… ▽ More

    Submitted 15 October, 2020; originally announced November 2020.

  8. arXiv:2010.09460  [pdf, other

    cs.LG cs.FL

    Maps for Learning Indexable Classes

    Authors: Julian Berger, Maximilian Böther, Vanja Doskoč, Jonathan Gadea Harder, Nicolas Klodt, Timo Kötzing, Winfried Lötzsch, Jannik Peters, Leon Schiller, Lars Seifert, Armin Wells, Simon Wietheger

    Abstract: We study learning of indexed families from positive data where a learner can freely choose a hypothesis space (with uniformly decidable membership) comprising at least the languages to be learned. This abstracts a very universal learning task which can be found in many areas, for example learning of (subsets of) regular languages or learning of natural languages. We are interested in various restr… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  9. arXiv:2008.10316  [pdf, other

    cs.DS

    A Strategic Routing Framework and Algorithms for Computing Alternative Paths

    Authors: Thomas Bläsius, Maximilian Böther, Philipp Fischbeck, Tobias Friedrich, Alina Gries, Falk Hüffner, Otto Kißig, Pascal Lenzner, Louise Molitor, Leon Schiller, Armin Wells, Simon Wietheger

    Abstract: Traditional navigation services find the fastest route for a single driver. Though always using the fastest route seems desirable for every individual, selfish behavior can have undesirable effects such as higher energy consumption and avoidable congestion, even leading to higher overall and individual travel times. In contrast, strategic routing aims at optimizing the traffic for all agents regar… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: 19 pages, 7 figures, full version of paper accepted at ATMOS 2020