Skip to main content

Showing 1–10 of 10 results for author: Biehl, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2008.13454  [pdf, ps, other

    cs.LG stat.ML

    Complex-valued embeddings of generic proximity data

    Authors: Maximilian Münch, Michiel Straat, Michael Biehl, Frank-Michael Schleif

    Abstract: Proximities are at the heart of almost all machine learning methods. If the input data are given as numerical vectors of equal lengths, euclidean distance, or a Hilbertian inner product is frequently used in modeling algorithms. In a more generic view, objects are compared by a (symmetric) similarity or dissimilarity measure, which may not obey particular mathematical properties. This renders many… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

    Comments: Proximity learning, embedding, complex values, complex-valued embedding, learning vector quantization

  2. arXiv:2005.10531  [pdf, ps, other

    cs.LG cond-mat.stat-mech stat.ML

    Supervised Learning in the Presence of Concept Drift: A modelling framework

    Authors: Michiel Straat, Fthi Abadi, Zhuoyun Kan, Christina Göpfert, Barbara Hammer, Michael Biehl

    Abstract: We present a modelling framework for the investigation of supervised learning in non-stationary environments. Specifically, we model two example types of learning systems: prototype-based Learning Vector Quantization (LVQ) for classification and shallow, layered neural networks for regression tasks. We investigate so-called student teacher scenarios in which the systems are trained from a stream o… ▽ More

    Submitted 27 February, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 17 pages in twocolumn

    Journal ref: Neural Computing and Applications 2021

  3. Feature Relevance Determination for Ordinal Regression in the Context of Feature Redundancies and Privileged Information

    Authors: Lukas Pfannschmidt, Jonathan Jakob, Fabian Hinder, Michael Biehl, Peter Tino, Barbara Hammer

    Abstract: Advances in machine learning technologies have led to increasingly powerful models in particular in the context of big data. Yet, many application scenarios demand for robustly interpretable models rather than optimum model accuracy; as an example, this is the case if potential biomarkers or causal factors should be discovered based on a set of given measurements. In this contribution, we focus on… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: Preprint accepted at Neurocomputing

  4. arXiv:1910.07476  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Hidden Unit Specialization in Layered Neural Networks: ReLU vs. Sigmoidal Activation

    Authors: Elisa Oostwal, Michiel Straat, Michael Biehl

    Abstract: We study layered neural networks of rectified linear units (ReLU) in a modelling framework for stochastic training processes. The comparison with sigmoidal activation functions is in the center of interest. We compute typical learning curves for shallow networks with K hidden units in matching student teacher scenarios. The systems exhibit sudden changes of the generalization performance via the p… ▽ More

    Submitted 27 May, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: Main changes compared to first version: Added a section on supporting Monte Carlo simulations, results and additional figures are presented and discussed. Some references added. Layout changed to single column layout for better readability. Minor textual changes and typos corrected

    Journal ref: Physica A: Statistical Mechanics and its Applications 564: 125517, 2020

  5. arXiv:1903.07749  [pdf, other

    astro-ph.GA cs.LG stat.ML

    Galaxy classification: A machine learning analysis of GAMA catalogue data

    Authors: Aleke Nolte, Lingyu Wang, Maciej Bilicki, Benne Holwerda, Michael Biehl

    Abstract: We present a machine learning analysis of five labelled galaxy catalogues from the Galaxy And Mass Assembly (GAMA): The SersicCatVIKING and SersicCatUKIDSS catalogues containing morphological features, the GaussFitSimple catalogue containing spectroscopic features, the MagPhys catalogue including physical parameters for galaxies, and the Lambdar catalogue, which contains photometric measurements.… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted for the ESANN 2018 Special Issue of Neurocomputing

    Journal ref: Neurocomputing 342: 172-190, 2019

  6. arXiv:1903.07378  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    On-line learning dynamics of ReLU neural networks using statistical physics techniques

    Authors: Michiel Straat, Michael Biehl

    Abstract: We introduce exact macroscopic on-line learning dynamics of two-layer neural networks with ReLU units in the form of a system of differential equations, using techniques borrowed from statistical physics. For the first experiments, numerical solutions reveal similar behavior compared to sigmoidal activation researched in earlier work. In these experiments the theoretical results show good correspo… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted contribution: ESANN 2019, 6 pages European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning 2019

  7. arXiv:1903.07273  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Prototype-based classifiers in the presence of concept drift: A modelling framework

    Authors: Michael Biehl, Fthi Abadi, Christina Göpfert, Barbara Hammer

    Abstract: We present a modelling framework for the investigation of prototype-based classifiers in non-stationary environments. Specifically, we study Learning Vector Quantization (LVQ) systems trained from a stream of high-dimensional, clustered data.We consider standard winner-takes-all updates known as LVQ1. Statistical properties of the input data change on the time scale defined by the training process… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted contribution to WSOM+ 2019, Barcelona/Spain, June 2019 13th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization 11 pages

  8. arXiv:1902.07662  [pdf, ps, other

    cs.LG stat.ML

    Feature Relevance Bounds for Ordinal Regression

    Authors: Lukas Pfannschmidt, Jonathan Jakob, Michael Biehl, Peter Tino, Barbara Hammer

    Abstract: The increasing occurrence of ordinal data, mainly sociodemographic, led to a renewed research interest in ordinal regression, i.e. the prediction of ordered classes. Besides model accuracy, the interpretation of these models itself is of high relevance, and existing approaches therefore enforce e.g. model sparsity. For high dimensional or highly correlated data, however, this might be misleading d… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: preprint of a paper accepted for oral presentation at the 27th European Symposium on Artificial Neural Networks (ESANN 2019)

  9. arXiv:1806.00201  [pdf, other

    cs.AI cs.NE stat.ML

    Being curious about the answers to questions: novelty search with learned attention

    Authors: Nicholas Guttenberg, Martin Biehl, Nathaniel Virgo, Ryota Kanai

    Abstract: We investigate the use of attentional neural network layers in order to learn a `behavior characterization' which can be used to drive novelty search and curiosity-based policies. The space is structured towards answering a particular distribution of questions, which are used in a supervised way to train the attentional neural network. We find that in a 2d exploration task, the structure of the sp… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

    Comments: 8 pages, 7 figures, ALife 2018

  10. arXiv:1609.00116  [pdf, other

    cs.AI cs.LG stat.ML

    Neural Coarse-Graining: Extracting slowly-varying latent degrees of freedom with neural networks

    Authors: Nicholas Guttenberg, Martin Biehl, Ryota Kanai

    Abstract: We present a loss function for neural networks that encompasses an idea of trivial versus non-trivial predictions, such that the network jointly determines its own prediction goals and learns to satisfy them. This permits the network to choose sub-sets of a problem which are most amenable to its abilities to focus on solving, while discarding 'distracting' elements that interfere with its learning… ▽ More

    Submitted 1 September, 2016; originally announced September 2016.

    Comments: 9 pages, 5 figures, 3 tables