Skip to main content

Showing 1–3 of 3 results for author: Martinsson, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.02422  [pdf, other

    cs.SD cs.LG eess.AS

    Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning

    Authors: Richard Lindholm, Oscar Marklund, Olof Mogren, John Martinsson

    Abstract: The vast amounts of audio data collected in Sound Event Detection (SED) applications require efficient annotation strategies to enable supervised learning. Manual labeling is expensive and time-consuming, making Active Learning (AL) a promising approach for reducing annotation effort. We introduce Top K Entropy, a novel uncertainty aggregation strategy for AL that prioritizes the most uncertain se… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  2. arXiv:2403.08525  [pdf, other

    cs.SD cs.LG eess.AS

    From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning

    Authors: John Martinsson, Olof Mogren, Maria Sandsten, Tuomas Virtanen

    Abstract: We propose an adaptive change point detection method (A-CPD) for machine guided weak label annotation of audio recording segments. The goal is to maximize the amount of information gained about the temporal activations of the target sounds. For each unlabeled audio recording, we use a prediction model to derive a probability curve used to guide annotation. The prediction model is initially pre-tra… ▽ More

    Submitted 26 August, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at EUSIPCO 2024 (nominated best student paper)

  3. arXiv:2006.09114  [pdf, other

    eess.AS cs.LG cs.SD

    Adversarial representation learning for private speech generation

    Authors: David Ericsson, Adam Östberg, Edvin Listo Zec, John Martinsson, Olof Mogren

    Abstract: As more and more data is collected in various settings across organizations, companies, and countries, there has been an increase in the demand of user privacy. Developing privacy preserving methods for data analytics is thus an important area of research. In this work we present a model based on generative adversarial networks (GANs) that learns to obfuscate specific sensitive attributes in speec… ▽ More

    Submitted 17 June, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Submitted to ICML 2020 Workshop on Self-supervision in Audio and Speech (SAS)