Showing 1–1 of 1 results for author: Malan, S

  1. arXiv:2409.14486

    eess.AS cs.CL cs.SD

    Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming

    Authors: Simon Malan, Benjamin van Niekerk, Herman Kamper

    Abstract: We look at the long-standing problem of segmenting unlabeled speech into word-like segments and clustering these into a lexicon. Several previous methods use a scoring model coupled with dynamic programming to find an optimal segmentation. Here we propose a much simpler strategy: we predict word boundaries using the dissimilarity between adjacent self-supervised features, then we cluster the predi…

    Submitted 12 January, 2025; v1 submitted 22 September, 2024; originally announced September 2024.

    Comments: Accepted at ICASSP 2025
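
The boundary step described in the abstract (dissimilarity between adjacent features) can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine distance, the fixed threshold, the local-maximum peak picking, and the function names `cosine_distance`, `adjacent_dissimilarity`, and `pick_boundaries` are all assumptions made for illustration, and the toy 2-D vectors stand in for real self-supervised speech features. The clustering step is not sketched because the abstract is truncated at that point.

```python
import math


def cosine_distance(u, v):
    """Cosine distance between two feature vectors (assumed dissimilarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as "no evidence of change"
    return 1.0 - dot / (norm_u * norm_v)


def adjacent_dissimilarity(frames):
    """Dissimilarity curve: entry i compares frame i with frame i + 1."""
    return [cosine_distance(frames[i], frames[i + 1])
            for i in range(len(frames) - 1)]


def pick_boundaries(curve, threshold=0.5):
    """Return frame indices that start a new segment.

    A boundary is placed before frame i + 1 when curve[i] is a local
    maximum at or above `threshold` (both choices are hypothetical).
    """
    boundaries = []
    for i, value in enumerate(curve):
        left = curve[i - 1] if i > 0 else float("-inf")
        right = curve[i + 1] if i + 1 < len(curve) else float("-inf")
        if value >= threshold and value > left and value >= right:
            boundaries.append(i + 1)
    return boundaries


# Toy example: two frames pointing one way, then two pointing another.
toy_frames = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0]]
toy_boundaries = pick_boundaries(adjacent_dissimilarity(toy_frames))
```

In the toy example the only large jump in dissimilarity falls between the second and third frames, so the sketch places a single boundary there. A real system would likely need smoothing and a tuned threshold, neither of which is specified in the listing.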