Skip to main content

Showing 1–30 of 30 results for author: Wistuba, M

.
  1. arXiv:2504.08703  [pdf, other

    cs.SE

    SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

    Authors: Muhammad Shihab Rashid, Christian Bock, Yuan Zhuang, Alexander Buchholz, Tim Esler, Simon Valentin, Luca Franceschi, Martin Wistuba, Prabhu Teja Sivaprasad, Woo Jung Kim, Anoop Deoras, Giovanni Zappella, Laurent Callot

    Abstract: Coding agents powered by large language models have shown impressive capabilities in software engineering tasks, but evaluating their performance across diverse programming languages and real-world scenarios remains challenging. We introduce SWE-PolyBench, a new multi-language benchmark for repository-level, execution-based evaluation of coding agents. SWE-PolyBench contains 2110 instances from 21… ▽ More

    Submitted 23 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: 20 pages, 6 figures, corrected author name spelling

  2. arXiv:2412.07820  [pdf, ps, other

    cs.LG cs.AI

    Hyperband-based Bayesian Optimization for Black-box Prompt Selection

    Authors: Lennart Schneider, Martin Wistuba, Aaron Klein, Jacek Golebiowski, Giovanni Zappella, Felice Antonio Merra

    Abstract: Optimal prompt selection is crucial for maximizing large language model (LLM) performance on downstream tasks, especially in black-box settings where models are only accessible via APIs. Black-box prompt selection is challenging due to potentially large, combinatorial search spaces, absence of gradient information, and high evaluation cost of prompts on a validation set. We propose HbBoPs, a novel… ▽ More

    Submitted 3 June, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted at ICML 2025. 26 pages, 11 tables, 7 figures

  3. arXiv:2406.03216  [pdf, other

    cs.LG cs.AI

    Choice of PEFT Technique in Continual Learning: Prompt Tuning is Not All You Need

    Authors: Martin Wistuba, Prabhu Teja Sivaprasad, Lukas Balles, Giovanni Zappella

    Abstract: Recent Continual Learning (CL) methods have combined pretrained Transformers with prompt tuning, a parameter-efficient fine-tuning (PEFT) technique. We argue that the choice of prompt tuning in prior works was an undefended and unablated decision, which has been uncritically adopted by subsequent research, but warrants further research to understand its implications. In this paper, we conduct this… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2311.17601  [pdf, ps, other

    cs.LG cs.AI

    Continual Learning with Low Rank Adaptation

    Authors: Martin Wistuba, Prabhu Teja Sivaprasad, Lukas Balles, Giovanni Zappella

    Abstract: Recent work using pretrained transformers has shown impressive performance when fine-tuned with data from the downstream problem of interest. However, they struggle to retain that performance when the data characteristics changes. In this paper, we focus on continual learning, where a pre-trained transformer is updated to perform well on new data, while retaining its performance on data it was pre… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at Workshop on Distribution Shifts (DistShift), NeurIPS 2023

  5. arXiv:2304.12067  [pdf, other

    cs.LG cs.AI cs.CV

    Renate: A Library for Real-World Continual Learning

    Authors: Martin Wistuba, Martin Ferianc, Lukas Balles, Cedric Archambeau, Giovanni Zappella

    Abstract: Continual learning enables the incremental training of machine learning models on non-stationary data streams.While academic interest in the topic is high, there is little indication of the use of state-of-the-art continual learning algorithms in practical machine learning deployment. This paper presents Renate, a continual learning library designed to build real-world updating pipelines for PyTor… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Paper accepted at the CLVision workshop at CVPR 2023

  6. arXiv:2302.10706  [pdf, other

    cs.LG stat.ML

    Variational Boosted Soft Trees

    Authors: Tristan Cinquin, Tammo Rukat, Philipp Schmidt, Martin Wistuba, Artur Bekasov

    Abstract: Gradient boosting machines (GBMs) based on decision trees consistently demonstrate state-of-the-art results on regression and classification tasks with tabular data, often outperforming deep neural networks. However, these models do not provide well-calibrated predictive uncertainties, which prevents their use for decision making in high-risk applications. The Bayesian treatment is known to improv… ▽ More

    Submitted 22 February, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  7. arXiv:2302.00441  [pdf, other

    cs.LG

    Scaling Laws for Hyperparameter Optimization

    Authors: Arlind Kadra, Maciej Janowski, Martin Wistuba, Josif Grabocka

    Abstract: Hyperparameter optimization is an important subfield of machine learning that focuses on tuning the hyperparameters of a chosen algorithm to achieve peak performance. Recently, there has been a stream of methods that tackle the issue of hyperparameter optimization, however, most of the methods do not exploit the dominant power law nature of learning curves for Bayesian optimization. In this work,… ▽ More

    Submitted 25 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted at NeurIPS 2023

  8. arXiv:2207.06940  [pdf, other

    cs.LG stat.ML

    PASHA: Efficient HPO and NAS with Progressive Resource Allocation

    Authors: Ondrej Bohdal, Lukas Balles, Martin Wistuba, Beyza Ermis, Cédric Archambeau, Giovanni Zappella

    Abstract: Hyperparameter optimization (HPO) and neural architecture search (NAS) are methods of choice to obtain the best-in-class machine learning models, but in practice they can be costly to run. When models are trained on large datasets, tuning them with HPO or NAS rapidly becomes prohibitively expensive for practitioners, even when efficient multi-fidelity methods are employed. We propose an approach t… ▽ More

    Submitted 8 March, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at ICLR 2023

  9. arXiv:2206.14085  [pdf, other

    cs.LG cs.CV

    Continual Learning with Transformers for Image Classification

    Authors: Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

    Abstract: In many real-world scenarios, data to train machine learning models become available over time. However, neural network models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is known as catastrophic forgetting and it is often difficult to prevent due to practical constraints, such as the amount of data that can be stored or the limit… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Appeared in CVPR CLVision workshop. arXiv admin note: substantial text overlap with arXiv:2203.04640

  10. arXiv:2203.04640  [pdf, other

    cs.CL cs.AI stat.ML

    Memory Efficient Continual Learning with Transformers

    Authors: Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

    Abstract: In many real-world scenarios, data to train machine learning models becomes available over time. Unfortunately, these models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is known as catastrophic forgetting and it is difficult to prevent due to practical constraints. For instance, the amount of data that can be stored or the computa… ▽ More

    Submitted 13 January, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: This paper was published at NeurIPS 2022

  11. arXiv:2202.09774  [pdf, other

    cs.LG cs.AI

    Supervising the Multi-Fidelity Race of Hyperparameter Configurations

    Authors: Martin Wistuba, Arlind Kadra, Josif Grabocka

    Abstract: Multi-fidelity (gray-box) hyperparameter optimization techniques (HPO) have recently emerged as a promising direction for tuning Deep Learning methods. However, existing methods suffer from a sub-optimal allocation of the HPO budget to the hyperparameter configurations. In this work, we introduce DyHPO, a Bayesian Optimization method that learns to decide which hyperparameter configuration to trai… ▽ More

    Submitted 1 June, 2023; v1 submitted 20 February, 2022; originally announced February 2022.

    Comments: Accepted at NeurIPS 2022

  12. arXiv:2106.06257  [pdf, other

    cs.LG

    HPO-B: A Large-Scale Reproducible Benchmark for Black-Box HPO based on OpenML

    Authors: Sebastian Pineda Arango, Hadi S. Jomaa, Martin Wistuba, Josif Grabocka

    Abstract: Hyperparameter optimization (HPO) is a core problem for the machine learning community and remains largely unsolved due to the significant computational resources required to evaluate hyperparameter configurations. As a result, a series of recent related works have focused on the direction of transfer learning for quickly fine-tuning hyperparameters on a dataset. Unfortunately, the community does… ▽ More

    Submitted 11 October, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 10 pages

  13. arXiv:2101.09336  [pdf, other

    cs.LG cs.CC

    A Comprehensive Survey on Hardware-Aware Neural Architecture Search

    Authors: Hadjer Benmeziane, Kaoutar El Maghraoui, Hamza Ouarnoughi, Smail Niar, Martin Wistuba, Naigang Wang

    Abstract: Neural Architecture Search (NAS) methods have been growing in popularity. These techniques have been fundamental to automate and speed up the time consuming and error-prone process of synthesizing novel Deep Learning (DL) architectures. NAS has been extensively studied in the past few years. Arguably their most significant impact has been in image classification and object detection tasks where th… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Submitted to Proceedings of IEEE

  14. arXiv:2101.07667  [pdf, other

    cs.LG

    Few-Shot Bayesian Optimization with Deep Kernel Surrogates

    Authors: Martin Wistuba, Josif Grabocka

    Abstract: Hyperparameter optimization (HPO) is a central pillar in the automation of machine learning solutions and is mainly performed via Bayesian optimization, where a parametric surrogate is learned to approximate the black box response function (e.g. validation error). Unfortunately, evaluating the response function is computationally intensive. As a remedy, earlier work emphasizes the need for transfe… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: Published as a conference paper at ICLR 2021

  15. arXiv:2006.03361  [pdf, other

    cs.LG cs.CV stat.ML

    Learning to Rank Learning Curves

    Authors: Martin Wistuba, Tejaswini Pedapati

    Abstract: Many automated machine learning methods, such as those for hyperparameter and neural architecture optimization, are computationally expensive because they involve training many different model configurations. In this work, we present a new method that saves computational budget by terminating poor configurations early on in the training. In contrast to existing methods, we consider this task as a… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2020

  16. arXiv:1910.14436  [pdf, other

    cs.AI cs.LG

    How can AI Automate End-to-End Data Science?

    Authors: Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray

    Abstract: Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it. This makes data science time consuming and restricted to experts with the resulting quality heavily dependent on their experience and skills. To make data science more accessible and scalable, we need its democratization. Automated Data Science (AutoDS) is aimed towards that goal and is emergin… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  17. arXiv:1907.08307  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    XferNAS: Transfer Neural Architecture Search

    Authors: Martin Wistuba

    Abstract: The term Neural Architecture Search (NAS) refers to the automatic optimization of network architectures for a new, previously unknown task. Since testing an architecture is computationally very expensive, many optimizers need days or even weeks to find suitable architectures. However, this search time can be significantly reduced if knowledge from previous searches on different tasks is reused. In… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

  18. arXiv:1905.01392  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    A Survey on Neural Architecture Search

    Authors: Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati

    Abstract: The growing interest in both the automation of machine learning and deep learning has inevitably led to the development of a wide variety of automated methods for neural architecture search. The choice of the network architecture has proven to be critical, and many advances in deep learning spring from its immediate improvements. However, deep learning techniques are computationally intensive and… ▽ More

    Submitted 18 June, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

  19. arXiv:1903.03536  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Inductive Transfer for Neural Architecture Optimization

    Authors: Martin Wistuba, Tejaswini Pedapati

    Abstract: The recent advent of automated neural network architecture search led to several methods that outperform state-of-the-art human-designed architectures. However, these approaches are computationally expensive, in extreme cases consuming GPU years. We propose two novel methods which aim to expedite this optimization problem by transferring knowledge acquired from previous tasks to new ones. First, w… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

  20. arXiv:1901.06261  [pdf, other

    cs.LG cs.SE stat.ML

    NeuNetS: An Automated Synthesis Engine for Neural Network Design

    Authors: Atin Sood, Benjamin Elder, Benjamin Herta, Chao Xue, Costas Bekas, A. Cristiano I. Malossi, Debashish Saha, Florian Scheidegger, Ganesh Venkataraman, Gegi Thomas, Giovanni Mariani, Hendrik Strobelt, Horst Samulowitz, Martin Wistuba, Matteo Manica, Mihir Choudhury, Rong Yan, Roxana Istrate, Ruchir Puri, Tejaswini Pedapati

    Abstract: Application of neural networks to a vast variety of practical applications is transforming the way AI is applied in practice. Pre-trained neural network models available through APIs or capability to custom train pre-built neural network architectures with customer data has made the consumption of AI by developers much simpler and resulted in broad adoption of these complex AI models. While prebui… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: 14 pages, 12 figures. arXiv admin note: text overlap with arXiv:1806.00250

  21. arXiv:1807.01069  [pdf, other

    cs.LG stat.ML

    Adversarial Robustness Toolbox v1.0.0

    Authors: Maria-Irina Nicolae, Mathieu Sinn, Minh Ngoc Tran, Beat Buesser, Ambrish Rawat, Martin Wistuba, Valentina Zantedeschi, Nathalie Baracaldo, Bryant Chen, Heiko Ludwig, Ian M. Molloy, Ben Edwards

    Abstract: Adversarial Robustness Toolbox (ART) is a Python library supporting developers and researchers in defending Machine Learning models (Deep Neural Networks, Gradient Boosted Decision Trees, Support Vector Machines, Random Forests, Logistic Regression, Gaussian Processes, Decision Trees, Scikit-learn Pipelines, etc.) against adversarial threats and helps making AI systems more secure and trustworthy.… ▽ More

    Submitted 15 November, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 34 pages

  22. arXiv:1806.05886  [pdf, other

    cs.CV

    Automated Image Data Preprocessing with Deep Reinforcement Learning

    Authors: Tran Ngoc Minh, Mathieu Sinn, Hoang Thanh Lam, Martin Wistuba

    Abstract: Data preparation, i.e. the process of transforming raw data into a format that can be used for training effective machine learning models, is a tedious and time-consuming task. For image data, preprocessing typically involves a sequence of basic transformations such as cropping, filtering, rotating or flipping images. Currently, data scientists decide manually based on their experience which trans… ▽ More

    Submitted 29 April, 2021; v1 submitted 15 June, 2018; originally announced June 2018.

  23. arXiv:1806.02659  [pdf, other

    cs.LG cs.CR stat.ML

    Scalable Multi-Class Bayesian Support Vector Machines for Structured and Unstructured Data

    Authors: Martin Wistuba, Ambrish Rawat

    Abstract: We introduce a new Bayesian multi-class support vector machine by formulating a pseudo-likelihood for a multi-class hinge loss in the form of a location-scale mixture of Gaussians. We derive a variational-inference-based training objective for gradient-based learning. Additionally, we employ an inducing point approximation which scales inference to large data sets. Furthermore, we develop hybrid B… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

  24. arXiv:1801.05372  [pdf, other

    cs.AI cs.LG

    Neural Feature Learning From Relational Database

    Authors: Hoang Thanh Lam, Tran Ngoc Minh, Mathieu Sinn, Beat Buesser, Martin Wistuba

    Abstract: Feature engineering is one of the most important but most tedious tasks in data science. This work studies automation of feature learning from relational database. We first prove theoretically that finding the optimal features from relational data for predictive tasks is NP-hard. We propose an efficient rule-based approach based on heuristics and a deep neural network to automatically learn approp… ▽ More

    Submitted 15 June, 2019; v1 submitted 16 January, 2018; originally announced January 2018.

  25. arXiv:1712.07420  [pdf, other

    cs.LG cs.CV stat.ML

    Finding Competitive Network Architectures Within a Day Using UCT

    Authors: Martin Wistuba

    Abstract: The design of neural network architectures for a new data set is a laborious task which requires human deep learning expertise. In order to make deep learning available for a broader audience, automated methods for finding a neural network architecture are vital. Recently proposed methods can already achieve human expert level performances. However, these methods have run times of months or even y… ▽ More

    Submitted 23 July, 2018; v1 submitted 20 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 5th IEEE International Conference on Data Science and Advanced Analytics, pages 263-272, 2018

  26. arXiv:1711.08244  [pdf, other

    stat.ML cs.LG

    Adversarial Phenomenon in the Eyes of Bayesian Deep Learning

    Authors: Ambrish Rawat, Martin Wistuba, Maria-Irina Nicolae

    Abstract: Deep Learning models are vulnerable to adversarial examples, i.e.\ images obtained via deliberate imperceptible perturbations, such that the model misclassifies them with high confidence. However, class confidence by itself is an incomplete picture of uncertainty. We therefore use principled Bayesian methods to capture model uncertainty in prediction for observing adversarial misclassification. We… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.

    Comments: 13 pages, 7 figures

  27. arXiv:1610.03996  [pdf, other

    cs.LG cs.AI

    Bank Card Usage Prediction Exploiting Geolocation Information

    Authors: Martin Wistuba, Nghia Duong-Trung, Nicolas Schilling, Lars Schmidt-Thieme

    Abstract: We describe the solution of team ISMLL for the ECML-PKDD 2016 Discovery Challenge on Bank Card Usage for both tasks. Our solution is based on three pillars. Gradient boosted decision trees as a strong regression and classification model, an intensive search for good hyperparameter configurations and strong features that exploit geolocation information. This approach achieved the best performance o… ▽ More

    Submitted 13 October, 2016; originally announced October 2016.

    Comments: Describes the winning solution for the ECML-PKDD 2016 Discovery Challenge on Bank Card Usage Analysis. Final results on the private leaderboard are available here: https://dms.sztaki.hu/ecml-pkkd-2016/#/app/privateleaderboard

  28. arXiv:1503.05018  [pdf, other

    cs.LG

    Ultra-Fast Shapelets for Time Series Classification

    Authors: Martin Wistuba, Josif Grabocka, Lars Schmidt-Thieme

    Abstract: Time series shapelets are discriminative subsequences and their similarity to a time series can be used for time series classification. Since the discovery of time series shapelets is costly in terms of time, the applicability on long or multivariate time series is difficult. In this work we propose Ultra-Fast Shapelets that uses a number of random shapelets. It is shown that Ultra-Fast Shapelets… ▽ More

    Submitted 17 March, 2015; originally announced March 2015.

    Comments: Preprint submitted to Journal of Data & Knowledge Engineering January 24, 2015

  29. arXiv:1503.03238  [pdf, other

    cs.LG

    Scalable Discovery of Time-Series Shapelets

    Authors: Josif Grabocka, Martin Wistuba, Lars Schmidt-Thieme

    Abstract: Time-series classification is an important problem for the data mining community due to the wide range of application domains involving time-series data. A recent paradigm, called shapelets, represents patterns that are highly predictive for the target variable. Shapelets are discovered by measuring the prediction accuracy of a set of potential (shapelet) candidates. The candidates typically consi… ▽ More

    Submitted 11 March, 2015; originally announced March 2015.

    Comments: Under review in the journal "Knowledge and Information Systems" (KAIS)

  30. arXiv:1307.6365  [pdf, other

    cs.AI cs.DB cs.LG

    Time-Series Classification Through Histograms of Symbolic Polynomials

    Authors: Josif Grabocka, Martin Wistuba, Lars Schmidt-Thieme

    Abstract: Time-series classification has attracted considerable research attention due to the various domains where time-series data are observed, ranging from medicine to econometrics. Traditionally, the focus of time-series classification has been on short time-series data composed of a unique pattern with intraclass pattern distortions and variations, while recently there have been attempts to focus on l… ▽ More

    Submitted 23 December, 2013; v1 submitted 24 July, 2013; originally announced July 2013.