Using Generic Summarization to Improve Music Information Retrieval Tasks

Raposo, Francisco; Ribeiro, Ricardo; de Matos, David Martins

doi:10.1109/TASLP.2016.2541299

Computer Science > Information Retrieval

arXiv:1503.06666 (cs)

[Submitted on 23 Mar 2015 (v1), last revised 9 Mar 2016 (this version, v3)]

Title:Using Generic Summarization to Improve Music Information Retrieval Tasks

Authors:Francisco Raposo, Ricardo Ribeiro, David Martins de Matos

View PDF

Abstract:In order to satisfy processing time constraints, many MIR tasks process only a segment of the whole music signal. This practice may lead to decreasing performance, since the most important information for the tasks may not be in those processed segments. In this paper, we leverage generic summarization algorithms, previously applied to text and speech summarization, to summarize items in music datasets. These algorithms build summaries, that are both concise and diverse, by selecting appropriate segments from the input signal which makes them good candidates to summarize music as well. We evaluate the summarization process on binary and multiclass music genre classification tasks, by comparing the performance obtained using summarized datasets against the performances obtained using continuous segments (which is the traditional method used for addressing the previously mentioned time constraints) and full songs of the same original dataset. We show that GRASSHOPPER, LexRank, LSA, MMR, and a Support Sets-based Centrality model improve classification performance when compared to selected 30-second baselines. We also show that summarized datasets lead to a classification performance whose difference is not statistically significant from using full songs. Furthermore, we make an argument stating the advantages of sharing summarized datasets for future MIR research.

Comments:	24 pages, 10 tables; Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing
Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD)
ACM classes:	H.5.5
Cite as:	arXiv:1503.06666 [cs.IR]
	(or arXiv:1503.06666v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.1503.06666
Journal reference:	IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 24, n. 6, March 2016
Related DOI:	https://doi.org/10.1109/TASLP.2016.2541299

Submission history

From: David Martins de Matos [view email]
[v1] Mon, 23 Mar 2015 14:48:24 UTC (40 KB)
[v2] Thu, 3 Dec 2015 18:38:22 UTC (346 KB)
[v3] Wed, 9 Mar 2016 16:24:42 UTC (3,170 KB)

Computer Science > Information Retrieval

Title:Using Generic Summarization to Improve Music Information Retrieval Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Using Generic Summarization to Improve Music Information Retrieval Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators