Showing 1–6 of 6 results for author: Haunschmid, V

  1. arXiv:2011.02949

    eess.AS cs.LG cs.SD

    Anomalous Sound Detection as a Simple Binary Classification Problem with Careful Selection of Proxy Outlier Examples

    Authors: Paul Primus, Verena Haunschmid, Patrick Praher, Gerhard Widmer

    Abstract: Unsupervised anomalous sound detection is concerned with identifying sounds that deviate from what is defined as 'normal', without explicitly specifying the types of anomalies. A significant obstacle is the diversity and rareness of outliers, which typically prevent us from collecting a representative set of anomalous sounds. As a consequence, most anomaly detection methods use unsupervised rather…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: published in DCASE 2020 Workshop

  2. arXiv:2009.02051

    cs.SD cs.IR cs.LG eess.AS

    Towards Musically Meaningful Explanations Using Source Separation

    Authors: Verena Haunschmid, Ethan Manilow, Gerhard Widmer

    Abstract: Deep neural networks (DNNs) are successfully applied in a wide variety of music information retrieval (MIR) tasks. Such models are usually considered "black boxes", meaning that their predictions are not interpretable. Prior work on explainable models in MIR has generally used image processing tools to produce explanations for DNN predictions, but these are not necessarily musically meaningful, or…

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: 6+2 pages, 4 figures; Submitted to International Society for Music Information Retrieval Conference 2020

  3. arXiv:2008.00582

    cs.SD cs.IR cs.LG eess.AS

    audioLIME: Listenable Explanations Using Source Separation

    Authors: Verena Haunschmid, Ethan Manilow, Gerhard Widmer

    Abstract: Deep neural networks (DNNs) are successfully applied in a wide variety of music information retrieval (MIR) tasks but their predictions are usually not interpretable. We propose audioLIME, a method based on Local Interpretable Model-agnostic Explanations (LIME) extended by a musical definition of locality. The perturbations used in LIME are created by switching on/off components extracted by sourc…

    Submitted 7 September, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: In The 13th International Workshop on Machine Learning and Music, ECML-PKDD 2020

  4. arXiv:2007.13503

    eess.AS cs.LG cs.SD

    Receptive-Field Regularized CNNs for Music Classification and Tagging

    Authors: Khaled Koutini, Hamid Eghbal-Zadeh, Verena Haunschmid, Paul Primus, Shreyan Chowdhury, Gerhard Widmer

    Abstract: Convolutional Neural Networks (CNNs) have been successfully used in various Music Information Retrieval (MIR) tasks, both as end-to-end models and as feature extractors for more complex systems. However, the MIR field is still dominated by the classical VGG-based CNN architecture variants, often in combination with more complex modules such as attention, and/or techniques such as pre-training on l…

    Submitted 27 July, 2020; originally announced July 2020.

  5. arXiv:1911.05833

    cs.SD cs.LG cs.MM eess.AS

    Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNs

    Authors: Khaled Koutini, Shreyan Chowdhury, Verena Haunschmid, Hamid Eghbal-zadeh, Gerhard Widmer

    Abstract: We present CP-JKU submission to MediaEval 2019; a Receptive Field-(RF)-regularized and Frequency-Aware CNN approach for tagging music with emotion/mood labels. We perform an investigation regarding the impact of the RF of the CNNs on their performance on this dataset. We observe that ResNets with smaller receptive fields -- originally adapted for acoustic scene classification -- also perform well…

    Submitted 28 October, 2019; originally announced November 2019.

    Comments: MediaEval'19, 27-29 October 2019, Sophia Antipolis, France

  6. arXiv:1905.11760

    cs.SD cs.LG eess.AS

    Two-level Explanations in Music Emotion Recognition

    Authors: Verena Haunschmid, Shreyan Chowdhury, Gerhard Widmer

    Abstract: Current ML models for music emotion recognition, while generally working quite well, do not give meaningful or intuitive explanations for their predictions. In this work, we propose a 2-step procedure to arrive at spectrogram-level explanations that connect certain aspects of the audio to interpretable mid-level perceptual features, and these to the actual emotion prediction. That makes it possibl…

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: ML4MD Workshop of the 36th International Conference on Machine Learning