
Showing 1–13 of 13 results for author: Buschjäger, S

  1. arXiv:2509.18962  [pdf, ps, other]

    cs.LG

    Lift What You Can: Green Online Learning with Heterogeneous Ensembles

    Authors: Kirsten Köbschall, Sebastian Buschjäger, Raphael Fischer, Lisa Hartung, Stefan Kramer

    Abstract: Ensemble methods for stream mining necessitate managing multiple models and updating them as data distributions evolve. Considering the calls for more sustainability, established methods are however not sufficiently considerate of ensemble members' computational expenses and instead overly focus on predictive capabilities. To address these challenges and enable green online learning, we propose he…

    Submitted 23 September, 2025; originally announced September 2025.

  2. arXiv:2504.17617  [pdf, other]

    cs.LG cs.AI

    Decentralized Time Series Classification with ROCKET Features

    Authors: Bruno Casella, Matthias Jakobs, Marco Aldinucci, Sebastian Buschjäger

    Abstract: Time series classification (TSC) is a critical task with applications in various domains, including healthcare, finance, and industrial monitoring. Due to privacy concerns and data regulations, Federated Learning has emerged as a promising approach for learning from distributed time series data without centralizing raw information. However, most FL solutions rely on a client-server architecture, w…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: Submitted to Workshop on Federated Learning Advancements 2025, in conjunction with ECML-PKDD, WAFL25

    MSC Class: 68T07

    ACM Class: I.2.11; I.2.6

  3. arXiv:2503.17497  [pdf, other]

    cs.CV cs.LG

    You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection

    Authors: Daniel Kuhse, Harun Teper, Sebastian Buschjäger, Chien-Yao Wang, Jian-Jia Chen

    Abstract: We introduce AnytimeYOLO, a family of variants of the YOLO architecture that enables anytime object detection. Our AnytimeYOLO networks allow for interruptible inference, i.e., they provide a prediction at any point in time, a property desirable for safety-critical real-time applications. We present structured explorations to modify the YOLO architecture, enabling early termination to obtain int…

    Submitted 21 March, 2025; originally announced March 2025.

  4. arXiv:2305.08579  [pdf, other]

    cs.LG

    Fast Inference of Tree Ensembles on ARM Devices

    Authors: Simon Koschel, Sebastian Buschjäger, Claudio Lucchese, Katharina Morik

    Abstract: With the ongoing integration of Machine Learning models into everyday life, e.g. in the form of the Internet of Things (IoT), the evaluation of learned models becomes more and more an important issue. Tree ensembles are one of the best black-box classifiers available and routinely outperform more complex classifiers. While the fast application of tree ensembles has already been studied in the lite…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 12 pages, 2 figures, 4 algorithms

  5. arXiv:2112.03723  [pdf, other]

    cs.LG cs.AI cs.NI

    Shrub Ensembles for Online Classification

    Authors: Sebastian Buschjäger, Sibylle Hess, Katharina Morik

    Abstract: Online learning algorithms have become a ubiquitous tool in the machine learning toolbox and are frequently used in small, resource-constraint environments. Among the most successful online learning methods are Decision Tree (DT) ensembles. DT ensembles provide excellent performance while adapting to changes in the data, but they are not resource efficient. Incremental tree learners keep adding ne…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 9 pages main content, 13 pages appendix, accepted at AAAI-2022

  6. arXiv:2111.04409  [pdf, other]

    cs.LG stat.ML

    There is no Double-Descent in Random Forests

    Authors: Sebastian Buschjäger, Katharina Morik

    Abstract: Random Forests (RFs) are among the state-of-the-art in machine learning and offer excellent performance with nearly zero parameter tuning. Remarkably, RFs seem to be impervious to overfitting even though their basic building blocks are well-known to overfit. Recently, a broadly received study argued that a RF exhibits a so-called double-descent curve: First, the model overfits the data in a u-shap…

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 11 pages, 3 figures, 3 algorithms

  7. arXiv:2110.10075  [pdf, other]

    cs.LG cs.AR

    Improving the Accuracy-Memory Trade-Off of Random Forests Via Leaf-Refinement

    Authors: Sebastian Buschjäger, Katharina Morik

    Abstract: Random Forests (RF) are among the state-of-the-art in many machine learning applications. With the ongoing integration of ML models into everyday life, the deployment and continuous application of models becomes more and more an important issue. Hence, small models which offer good predictive performance but use small amounts of memory are required. Ensemble pruning is a standard technique to remo…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 2 algorithms, 6 tables, 4 plots and a very long appendix

  8. arXiv:2105.12026  [pdf, ps, other]

    cs.DC cs.AI

    Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0

    Authors: Philipp-Jan Honysz, Alexander Schulze-Struchtrup, Sebastian Buschjäger, Katharina Morik

    Abstract: Data summarizations are a valuable tool to derive knowledge from large data streams and have proven their usefulness in a great number of applications. Summaries can be found by optimizing submodular functions. These functions map subsets of data to real values, which indicate their "representativeness" and which should be maximized to find a diverse summary of the underlying data. In this paper,…

    Submitted 18 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2101.08763

  9. arXiv:2102.01344  [pdf]

    cs.LG cs.NE

    Bit Error Tolerance Metrics for Binarized Neural Networks

    Authors: Sebastian Buschjäger, Jian-Jia Chen, Kuan-Hsun Chen, Mario Günzel, Katharina Morik, Rodion Novkin, Lukas Pfahler, Mikail Yayla

    Abstract: To reduce the resource demand of neural network (NN) inference systems, it has been proposed to use approximate memory, in which the supply voltage and the timing parameters are tuned trading accuracy with energy consumption and performance. Tuning these parameters aggressively leads to bit errors, which can be tolerated by NNs when bit flips are injected during training. However, bit flip trainin…

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: Presented at DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous Architectures (SLOHA 2021) (arXiv:2102.00818)

    Report number: SLOHA/2021/02

  10. arXiv:2101.08763  [pdf, ps, other]

    cs.DC cs.AI

    GPU-Accelerated Optimizer-Aware Evaluation of Submodular Exemplar Clustering

    Authors: Philipp-Jan Honysz, Sebastian Buschjäger, Katharina Morik

    Abstract: The optimization of submodular functions constitutes a viable way to perform clustering. Strong approximation guarantees and feasible optimization w.r.t. streaming data make this clustering approach favorable. Technically, submodular functions map subsets of data to real values, which indicate how "representative" a specific subset is. Optimal sets might then be used to partition the data space an…

    Submitted 21 January, 2021; originally announced January 2021.

  11. arXiv:2011.02952  [pdf, other]

    cs.LG stat.ML

    Generalized Negative Correlation Learning for Deep Ensembling

    Authors: Sebastian Buschjäger, Lukas Pfahler, Katharina Morik

    Abstract: Ensemble algorithms offer state of the art performance in many machine learning applications. A common explanation for their excellent performance is due to the bias-variance decomposition of the mean squared error which shows that the algorithm's error can be decomposed into its bias and variance. Both quantities are often opposed to each other and ensembles offer an effective way to manage them…

    Submitted 9 December, 2020; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: 12 (+8) pages, 1(+1) figures

  12. arXiv:2010.10059  [pdf, other]

    cs.LG cs.DM stat.ML

    Very Fast Streaming Submodular Function Maximization

    Authors: Sebastian Buschjäger, Philipp-Jan Honysz, Lukas Pfahler, Katharina Morik

    Abstract: Data summarization has become a valuable tool in understanding even terabytes of data. Due to their compelling theoretical properties, submodular functions have been in the focus of summarization algorithms. These algorithms offer worst-case approximations guarantees to the expense of higher computation and memory requirements. However, many practical applications do not fall under this worst-case…

    Submitted 8 May, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 9 pages, 14 pages appendix, 5 figures, 2 tables, 10 algorithms

  13. arXiv:2002.00909  [pdf, ps, other]

    cs.LG cs.ET stat.ML

    Towards Explainable Bit Error Tolerance of Resistive RAM-Based Binarized Neural Networks

    Authors: Sebastian Buschjäger, Jian-Jia Chen, Kuan-Hsun Chen, Mario Günzel, Christian Hakert, Katharina Morik, Rodion Novkin, Lukas Pfahler, Mikail Yayla

    Abstract: Non-volatile memory, such as resistive RAM (RRAM), is an emerging energy-efficient storage, especially for low-power machine learning models on the edge. It is reported, however, that the bit error rate of RRAMs can be up to 3.3% in the ultra low-power setting, which might be crucial for many use cases. Binary neural networks (BNNs), a resource efficient variant of neural networks (NNs), can toler…

    Submitted 3 February, 2020; originally announced February 2020.

    Comments: 6 pages, 2 figures

    MSC Class: 68T05

    ACM Class: I.2.6; B.7.1