Showing 1–9 of 9 results for author: Haunschmid, V

  1. arXiv:2011.02949

    eess.AS cs.LG cs.SD

    Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples

    Authors: Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer

    Abstract: Unsupervised anomalous sound detection is concerned with identifying sounds that deviate from what is defined as 'normal', without explicitly specifying the types of anomalies. A significant obstacle is the diversity and rareness of outliers, which typically prevent us from collecting a representative set of anomalous sounds. As a consequence, most anomaly detection methods use unsupervised rather…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: published in DCASE 2020 Workshop

  2. arXiv:2009.02051

    cs.SD cs.IR cs.LG eess.AS

    Towards Musically Meaningful Explanations Using Source Separation

    Authors: Verena Haunschmid, Ethan Manilow, Gerhard Widmer

    Abstract: Deep neural networks (DNNs) are successfully applied in a wide variety of music information retrieval (MIR) tasks. Such models are usually considered "black boxes", meaning that their predictions are not interpretable. Prior work on explainable models in MIR has generally used image processing tools to produce explanations for DNN predictions, but these are not necessarily musically meaningful, or…

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: 6+2 pages, 4 figures; Submitted to International Society for Music Information Retrieval Conference 2020

  3. arXiv:2008.00582

    cs.SD cs.IR cs.LG eess.AS

    audioLIME: Listenable Explanations Using Source Separation

    Authors: Verena Haunschmid, Ethan Manilow, Gerhard Widmer

    Abstract: Deep neural networks (DNNs) are successfully applied in a wide variety of music information retrieval (MIR) tasks but their predictions are usually not interpretable. We propose audioLIME, a method based on Local Interpretable Model-agnostic Explanations (LIME) extended by a musical definition of locality. The perturbations used in LIME are created by switching on/off components extracted by sourc…

    Submitted 7 September, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: In The 13th International Workshop on Machine Learning and Music, ECML-PKDD 2020

  4. arXiv:2007.13503

    eess.AS cs.LG cs.SD

    Receptive-Field Regularized CNNs for Music Classification and Tagging

    Authors: Khaled Koutini, Hamid Eghbal-Zadeh, Verena Haunschmid, Paul Primus, Shreyan Chowdhury, Gerhard Widmer

    Abstract: Convolutional Neural Networks (CNNs) have been successfully used in various Music Information Retrieval (MIR) tasks, both as end-to-end models and as feature extractors for more complex systems. However, the MIR field is still dominated by the classical VGG-based CNN architecture variants, often in combination with more complex modules such as attention, and/or techniques such as pre-training on l…

    Submitted 27 July, 2020; originally announced July 2020.

  5. arXiv:2007.02650

    cs.LG stat.ML

    On Data Augmentation and Adversarial Risk: An Empirical Analysis

    Authors: Hamid Eghbal-zadeh, Khaled Koutini, Paul Primus, Verena Haunschmid, Michal Lewandowski, Werner Zellinger, Bernhard A. Moser, Gerhard Widmer

    Abstract: Data augmentation techniques have become standard practice in deep learning, as they have been shown to greatly improve the generalisation abilities of models. These techniques rely on different ideas such as invariance-preserving transformations (e.g., expert-defined augmentation), statistical heuristics (e.g., Mixup), and learning the data distribution (e.g., GANs). However, in the adversarial setting…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: 21 pages, 15 figures, 3 tables

  6. arXiv:1911.05833

    cs.SD cs.LG cs.MM eess.AS

    Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs

    Authors: Khaled Koutini, Shreyan Chowdhury, Verena Haunschmid, Hamid Eghbal-zadeh, Gerhard Widmer

    Abstract: We present the CP-JKU submission to MediaEval 2019; a Receptive Field-(RF)-regularized and Frequency-Aware CNN approach for tagging music with emotion/mood labels. We perform an investigation regarding the impact of the RF of the CNNs on their performance on this dataset. We observe that ResNets with smaller receptive fields -- originally adapted for acoustic scene classification -- also perform well…

    Submitted 28 October, 2019; originally announced November 2019.

    Comments: MediaEval'19, 27-29 October 2019, Sophia Antipolis, France

  7. arXiv:1907.03572

    cs.SD cs.LG stat.ML

    Towards Explainable Music Emotion Recognition: The Route via Mid-level Features

    Authors: Shreyan Chowdhury, Andreu Vall, Verena Haunschmid, Gerhard Widmer

    Abstract: Emotional aspects play an important part in our interaction with music. However, modelling these aspects in MIR systems has been notoriously challenging since emotion is an inherently abstract and subjective experience, thus making it difficult to quantify or predict in the first place, and to make sense of the predictions in the next. In an attempt to create a model that can give a musically mea…

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: International Society for Music Information Retrieval Conference, Delft, The Netherlands, 2019

  8. arXiv:1905.11760

    cs.SD cs.LG eess.AS

    Two-level Explanations in Music Emotion Recognition

    Authors: Verena Haunschmid, Shreyan Chowdhury, Gerhard Widmer

    Abstract: Current ML models for music emotion recognition, while generally working quite well, do not give meaningful or intuitive explanations for their predictions. In this work, we propose a 2-step procedure to arrive at spectrogram-level explanations that connect certain aspects of the audio to interpretable mid-level perceptual features, and these to the actual emotion prediction. That makes it possibl…

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: ML4MD Workshop of the 36th International Conference on Machine Learning

  9. arXiv:1707.08776

    cs.NE

    An Evolutionary Stochastic-Local-Search Framework for One-Dimensional Cutting-Stock Problems

    Authors: Georgios C. Chasparis, Michael Rossbory, Verena Haunschmid

    Abstract: We introduce an evolutionary stochastic-local-search (SLS) algorithm for addressing a generalized version of the so-called 1/V/D/R cutting-stock problem. Cutting-stock problems are encountered often in industrial environments and the ability to address them efficiently usually results in large economic benefits. Traditionally, linear-programming-based techniques have been utilized to address such p…

    Submitted 27 July, 2017; originally announced July 2017.