Showing 1–14 of 14 results for author: Arzt, A

  1. arXiv:1909.02869

    eess.AS cs.LG cs.SD stat.ML

    Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

    Authors: Paul Primus, Hamid Eghbal-zadeh, David Eitelsebner, Khaled Koutini, Andreas Arzt, Gerhard Widmer

    Abstract: Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of machine listening (Task 1b of the DCASE 2019 Challenge). We propose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-…

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Published at the Workshop on Detection and Classification of Acoustic Scenes and Events, 25-26 October 2019, New York, USA

  2. arXiv:1907.05982

    cs.SD cs.CV cs.LG eess.AS

    Learning Complex Basis Functions for Invariant Representations of Audio

    Authors: Stefan Lattner, Monika Dörfler, Andreas Arzt

    Abstract: Learning features from data has shown to be more successful than using hand-crafted features for many machine learning tasks. In music information retrieval (MIR), features learned from windowed spectrograms are highly variant to transformations like transposition or time-shift. Such variances are undesirable when they are irrelevant for the respective MIR task. We propose an architecture called C…

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: Paper accepted at the 20th International Society for Music Information Retrieval Conference, ISMIR 2019, Delft, The Netherlands, November 4-8; 8 pages, 4 figures, 4 tables

  3. arXiv:1906.10996

    cs.IR cs.CV cs.LG cs.SD eess.AS

    Learning Soft-Attention Models for Tempo-invariant Audio-Sheet Music Retrieval

    Authors: Stefan Balke, Matthias Dorfer, Luis Carvalho, Andreas Arzt, Gerhard Widmer

    Abstract: Connecting large libraries of digitized audio recordings to their corresponding sheet music images has long been a motivation for researchers to develop new cross-modal retrieval systems. In recent years, retrieval systems based on embedding space learning with deep neural networks got a step closer to fulfilling this vision. However, global and local tempo deviations in the music recordings still…

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at ISMIR 2019

  4. Cross-Modal Music Retrieval and Applications: An Overview of Key Methodologies

    Authors: Meinard Müller, Andreas Arzt, Stefan Balke, Matthias Dorfer, Gerhard Widmer

    Abstract: There has been a rapid growth of digitally available music data, including audio recordings, digitized images of sheet music, album covers and liner notes, and video clips. This huge amount of data calls for retrieval strategies that allow users to explore large music collections in a convenient way. More precisely, there is a need for cross-modal retrieval algorithms that, given a query in one mo…

    Submitted 12 February, 2019; originally announced February 2019.

    Journal ref: IEEE Signal Processing Magazine (Volume: 36, Issue: 1, Jan. 2019)

  5. arXiv:1807.07278

    cs.SD cs.MM eess.AS

    Audio-to-Score Alignment using Transposition-invariant Features

    Authors: Andreas Arzt, Stefan Lattner

    Abstract: Audio-to-score alignment is an important pre-processing step for in-depth analysis of classical music. In this paper, we apply novel transposition-invariant audio features to this task. These low-dimensional features represent local pitch intervals and are learned in an unsupervised fashion by a gated autoencoder. Our results show that the proposed features are indeed fully transposition-invariant…

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: 19th International Society for Music Information Retrieval Conference, Paris, France, 2018

  6. arXiv:1711.02427

    cs.SD cs.HC eess.AS

    The ACCompanion v0.1: An Expressive Accompaniment System

    Authors: Carlos Cancino-Chacón, Martin Bonev, Amaury Durand, Maarten Grachten, Andreas Arzt, Laura Bishop, Werner Goebl, Gerhard Widmer

    Abstract: In this paper we present a preliminary version of the ACCompanion, an expressive accompaniment system for MIDI input. The system uses a probabilistic monophonic score follower to track the position of the soloist in the score, and a linear Gaussian model to compute tempo updates. The expressiveness of the system is powered by the Basis-Mixer, a state-of-the-art computational model of expressive mu…

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: Presented at the Late-Breaking Demo Session of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017), Suzhou, China, 2017

  7. arXiv:1708.02100

    cs.MM

    Aktuelle Entwicklungen in der Automatischen Musikverfolgung (Current Developments in Automatic Music Tracking)

    Authors: Andreas Arzt, Matthias Dorfer

    Abstract: In this paper we present current trends in real-time music tracking (a.k.a. score following). Casually speaking, these algorithms "listen" to a live performance of music, compare the audio signal to an abstract representation of the score, and "read" along in the sheet music. In this way at any given time the exact position of the musician(s) in the sheet music is computed. Here, we focus on the a…

    Submitted 7 August, 2017; originally announced August 2017.

    Comments: In German. Published in Maximilian Eibl, Martin Gaedke (Eds.): INFORMATIK 2017. Lecture Notes in Informatics (LNI), Gesellschaft für Informatik, Bonn 2017

  8. arXiv:1708.00733

    cs.IR

    Piece Identification in Classical Piano Music Without Reference Scores

    Authors: Andreas Arzt, Gerhard Widmer

    Abstract: In this paper we describe an approach to identify the name of a piece of piano music, based on a short audio excerpt of a performance. Given only a description of the pieces in text format (i.e. no score information is provided), a reference database is automatically compiled by acquiring a number of audio representations (performances of the pieces) from internet sources. These are transcribed, p…

    Submitted 2 August, 2017; originally announced August 2017.

    Comments: In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017)

  9. arXiv:1707.09887

    cs.IR cs.SD

    Learning Audio - Sheet Music Correspondences for Score Identification and Offline Alignment

    Authors: Matthias Dorfer, Andreas Arzt, Gerhard Widmer

    Abstract: This work addresses the problem of matching short excerpts of audio with their respective counterparts in sheet music images. We show how to employ neural network-based cross-modality embedding spaces for solving the following two sheet music-related tasks: retrieving the correct piece of sheet music from a database when given a music audio as a search query; and aligning an audio recording of a p…

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017)

  10. arXiv:1707.04457

    cs.IR cs.SD

    Modeling Harmony with Skip-Grams

    Authors: David R. W. Sears, Andreas Arzt, Harald Frostel, Reinhard Sonnleitner, Gerhard Widmer

    Abstract: String-based (or viewpoint) models of tonal harmony often struggle with data sparsity in pattern discovery and prediction tasks, particularly when modeling composite events like triads and seventh chords, since the number of distinct n-note combinations in polyphonic textures is potentially enormous. To address this problem, this study examines the efficacy of skip-grams in music research, an alte…

    Submitted 18 July, 2017; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: 7 pages, 5 figures. To appear in Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR), Suzhou, China

  11. arXiv:1612.05153

    cs.SD cs.LG

    On the Potential of Simple Framewise Approaches to Piano Transcription

    Authors: Rainer Kelz, Matthias Dorfer, Filip Korzeniowski, Sebastian Böck, Andreas Arzt, Gerhard Widmer

    Abstract: In an attempt at exploring the limitations of simple approaches to the task of piano transcription (as usually defined in MIR), we conduct an in-depth analysis of neural network-based framewise transcription. We systematically compare different popular input representations for transcription systems to determine the ones most suitable for use with neural networks. Exploiting recent advances in tra…

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR 2016), New York, NY

  12. arXiv:1612.05076

    cs.SD

    Live Score Following on Sheet Music Images

    Authors: Matthias Dorfer, Andreas Arzt, Sebastian Böck, Amaury Durand, Gerhard Widmer

    Abstract: In this demo we show a novel approach to score following. Instead of relying on some symbolic representation, we are using a multi-modal convolutional neural network to match the incoming audio stream directly to sheet music images. This approach is in an early stage and should be seen as proof of concept. Nonetheless, the audience will have the opportunity to test our implementation themselves vi…

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: 17th International Society for Music Information Retrieval Conference (ISMIR 2016), Late Breaking/Demo Papers, New York, NY

  13. arXiv:1612.05070

    cs.SD cs.IR cs.LG

    Towards End-to-End Audio-Sheet-Music Retrieval

    Authors: Matthias Dorfer, Andreas Arzt, Gerhard Widmer

    Abstract: This paper demonstrates the feasibility of learning to retrieve short snippets of sheet music (images) when given a short query excerpt of music (audio) -- and vice versa --, without any symbolic representation of music or scores. This would be highly useful in many content-based musical retrieval scenarios. Our approach is based on Deep Canonical Correlation Analysis (DCCA) and learns correlated…

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: In NIPS 2016 End-to-end Learning for Speech and Audio Processing Workshop, Barcelona, Spain

  14. arXiv:1612.05050

    cs.LG cs.CV

    Towards Score Following in Sheet Music Images

    Authors: Matthias Dorfer, Andreas Arzt, Gerhard Widmer

    Abstract: This paper addresses the matching of short music audio snippets to the corresponding pixel location in images of sheet music. A system is presented that simultaneously learns to read notes, listens to music and matches the currently played music to its corresponding notes in the sheet. It consists of an end-to-end multi-modal convolutional neural network that takes as input images of sheet music a…

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: Published in Proceedings of the 17th International Society for Music Information Retrieval Conference (2016)