Barwise Compression Schemes for Audio-Based Music Structure Analysis

Marmoret, Axel; Cohen, Jérémy E.; Bimbot, Frédéric

Computer Science > Sound

arXiv:2202.04981 (cs)

[Submitted on 10 Feb 2022 (v1), last revised 15 Apr 2022 (this version, v2)]

Title:Barwise Compression Schemes for Audio-Based Music Structure Analysis

Authors:Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot

View PDF

Abstract:Music Structure Analysis (MSA) consists in segmenting a music piece in several distinct sections. We approach MSA within a compression framework, under the hypothesis that the structure is more easily revealed by a simplified representation of the original content of the song. More specifically, under the hypothesis that MSA is correlated with similarities occurring at the bar scale, this article introduces the use of linear and non-linear compression schemes on barwise audio signals. Compressed representations capture the most salient components of the different bars in the song and are then used to infer the song structure using a dynamic programming algorithm. This work explores both low-rank approximation models such as Principal Component Analysis or Nonnegative Matrix Factorization and "piece-specific" Auto-Encoding Neural Networks, with the objective to learn latent representations specific to a given song. Such approaches do not rely on supervision nor annotations, which are well-known to be tedious to collect and possibly ambiguous in MSA description. In our experiments, several unsupervised compression schemes achieve a level of performance comparable to that of state-of-the-art supervised methods (for 3s tolerance) on the RWC-Pop dataset, showcasing the importance of the barwise compression processing for MSA.

Comments:	Published at the 2022 Sound and Music Computing (SMC) conference, 8 pages, 6 figures, 1 table, code available at this https URL. arXiv admin note: substantial text overlap with arXiv:2110.14437
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
ACM classes:	H.5.5
Cite as:	arXiv:2202.04981 [cs.SD]
	(or arXiv:2202.04981v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2202.04981

Submission history

From: Axel Marmoret [view email]
[v1] Thu, 10 Feb 2022 12:23:57 UTC (1,536 KB)
[v2] Fri, 15 Apr 2022 15:52:46 UTC (1,540 KB)

Computer Science > Sound

Title:Barwise Compression Schemes for Audio-Based Music Structure Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Barwise Compression Schemes for Audio-Based Music Structure Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators