Skip to main content

Showing 1–8 of 8 results for author: Oramas, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.06556  [pdf, other

    cs.IR

    Contrastive Learning for Cross-modal Artist Retrieval

    Authors: Andres Ferraro, Jaehun Kim, Sergio Oramas, Andreas Ehmann, Fabien Gouyon

    Abstract: Music retrieval and recommendation applications often rely on content features encoded as embeddings, which provide vector representations of items in a music dataset. Numerous complementary embeddings can be derived from processing items originally represented in several modalities, e.g., audio signals, user interaction data, or editorial data. However, data of any given modality might not be ava… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

  2. arXiv:2210.03799  [pdf, other

    cs.SD cs.AI cs.IR cs.LG cs.MM eess.AS

    Supervised and Unsupervised Learning of Audio Representations for Music Understanding

    Authors: Matthew C. McCallum, Filip Korzeniowski, Sergio Oramas, Fabien Gouyon, Andreas F. Ehmann

    Abstract: In this work, we provide a broad comparative analysis of strategies for pre-training audio understanding models for several tasks in the music domain, including labelling of genre, era, origin, mood, instrumentation, key, pitch, vocal characteristics, tempo and sonority. Specifically, we explore how the domain of pre-training datasets (music or generic audio) and the pre-training methodology (supe… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  3. arXiv:2107.14541  [pdf, other

    cs.IR cs.LG cs.SD eess.AS

    Artist Similarity with Graph Neural Networks

    Authors: Filip Korzeniowski, Sergio Oramas, Fabien Gouyon

    Abstract: Artist similarity plays an important role in organizing, understanding, and subsequently, facilitating discovery in large collections of music. In this paper, we present a hybrid approach to computing similarity between artists using graph neural networks trained with triplet loss. The novelty of using a graph neural network architecture is to combine the topology of a graph of artist connections… ▽ More

    Submitted 30 July, 2021; originally announced July 2021.

    Comments: Appears in Proc. of the International Society for Music Information Retrieval Conference 2021 (ISMIR 2021)

  4. arXiv:2010.16030  [pdf, other

    cs.IR cs.MM cs.SD eess.AS

    Multimodal Metric Learning for Tag-based Music Retrieval

    Authors: Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra

    Abstract: Tag-based music retrieval is crucial to browse large-scale music libraries efficiently. Hence, automatic music tagging has been actively explored, mostly as a classification task, which has an inherent limitation: a fixed vocabulary. On the other hand, metric learning enables flexible vocabularies by using pretrained word embeddings as side information. Also, metric learning has already proven its… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: 5 pages, 2 figures, submitted to ICASSP 2021

  5. arXiv:2010.11512  [pdf, other

    cs.SD cs.IR eess.AS

    Mood Classification Using Listening Data

    Authors: Filip Korzeniowski, Oriol Nieto, Matthew McCallum, Minz Won, Sergio Oramas, Erik Schmidt

    Abstract: The mood of a song is a highly relevant feature for exploration and recommendation in large collections of music. These collections tend to require automatic methods for predicting such moods. In this work, we show that listening-based features outperform content-based ones when classifying moods: embeddings obtained through matrix factorization of listening data appear to be more informative of a… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Appears in Proc. of the International Society for Music Information Retrieval Conference 2020 (ISMIR 2020)

  6. Natural Language Processing for Music Knowledge Discovery

    Authors: Sergio Oramas, Luis Espinosa-Anke, Francisco Gómez, Xavier Serra

    Abstract: Today, a massive amount of musical knowledge is stored in written form, with testimonies dated as far back as several centuries ago. In this work, we present different Natural Language Processing (NLP) approaches to harness the potential of these text collections for automatic music knowledge discovery, covering different phases in a prototypical NLP pipeline, namely corpus compilation, text-minin… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Journal ref: Journal of New Music Research (2018)

  7. arXiv:1707.04916  [pdf, other

    cs.IR

    Multi-label Music Genre Classification from Audio, Text, and Images Using Deep Features

    Authors: Sergio Oramas, Oriol Nieto, Francesco Barbieri, Xavier Serra

    Abstract: Music genres allow to categorize musical items that share common characteristics. Although these categories are not mutually exclusive, most related research is traditionally focused on classifying tracks into a single class. Furthermore, these categories (e.g., Pop, Rock) tend to be too broad for certain applications. In this work we aim to expand this task by categorizing musical items into mult… ▽ More

    Submitted 16 July, 2017; originally announced July 2017.

    Comments: In Proceedings of the 18th International Society of Music Information Retrieval Conference (ISMIR 2017)

  8. A Deep Multimodal Approach for Cold-start Music Recommendation

    Authors: Sergio Oramas, Oriol Nieto, Mohamed Sordo, Xavier Serra

    Abstract: An increasing amount of digital music is being published daily. Music streaming services often ingest all available music, but this poses a challenge: how to recommend new artists for which prior knowledge is scarce? In this work we aim to address this so-called cold-start problem by combining text and audio information with user feedback data using deep network architectures. Our method is divide… ▽ More

    Submitted 24 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: In Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems (DLRS 2017), collocated with RecSys 2017