Showing 1–2 of 2 results for author: Schürholt, K

Searching in archive stat.
  1. arXiv:2406.16988  [pdf, other]

    cs.LG stat.ML

    MD tree: a model-diagnostic tree grown on loss landscape

    Authors: Yefan Zhou, Jianlong Chen, Qinxue Cao, Konstantin Schürholt, Yaoqing Yang

    Abstract: This paper considers "model diagnosis", which we formulate as a classification problem. Given a pre-trained neural network (NN), the goal is to predict the source of failure from a set of failure modes (such as a wrong hyperparameter, inadequate model size, and insufficient data) without knowing the training configuration of the pre-trained NN. The conventional diagnosis approach uses training and…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: ICML 2024, first two authors contributed equally

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:61825-61853, 2024

  2. arXiv:2006.10424  [pdf, other]

    cs.LG cs.AI stat.ML

    An Investigation of the Weight Space to Monitor the Training Progress of Neural Networks

    Authors: Konstantin Schürholt, Damian Borth

    Abstract: Safe use of Deep Neural Networks (DNNs) requires careful testing. However, deployed models are often trained further to improve in performance. As rigorous testing and evaluation is expensive, triggers are in need to determine the degree of change of a model. In this paper we investigate the weight space of DNN models for structure that can be exploited to that end. Our results show that DNN model…

    Submitted 17 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 8 pages, 9 figures