Showing 1–2 of 2 results for author: Neo, V W

  1. arXiv:2312.16763  [pdf, other]

    eess.AS cs.SD

    Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification

    Authors: Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor

    Abstract: This paper studies modulation spectrum features ($Φ$) and mel-frequency cepstral coefficients ($Ψ$) in joint speaker diarization and identification (JSID). JSID is important as speaker diarization on its own to distinguish speakers is insufficient for many applications, it is often necessary to identify speakers as well. Machine learning models are set up using convolutional neural networks (CNNs)…

    Submitted 30 December, 2023; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: 12 pages, 7 figures

  2. arXiv:2308.04169  [pdf, other]

    cs.SD cs.LG eess.AS

    Dual input neural networks for positional sound source localization

    Authors: Eric Grinstein, Vincent W. Neo, Patrick A. Naylor

    Abstract: In many signal processing applications, metadata may be advantageously used in conjunction with a high dimensional signal to produce a desired output. In the case of classical Sound Source Localization (SSL) algorithms, information from a high dimensional, multichannel audio signals received by many distributed microphones is combined with information describing acoustic properties of the scene, s…

    Submitted 8 August, 2023; originally announced August 2023.