Showing 1–5 of 5 results for author: Schatz, T

  1. arXiv:2408.04363

    eess.AS cs.CL

    Simulating Articulatory Trajectories with Phonological Feature Interpolation

    Authors: Angelo Ortiz Tandazo, Thomas Schatz, Thomas Hueber, Emmanuel Dupoux

    Abstract: As a first step towards a complete computational model of speech learning involving perception-production loops, we investigate the forward mapping between pseudo-motor commands and articulatory trajectories. Two phonological feature sets, based respectively on generative and articulatory phonology, are used to encode a phonetic target sequence. Different interpolation techniques are compared to g…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: accepted at Interspeech 2024

  2. arXiv:2101.11332

    cs.CL

    A phonetic model of non-native spoken word processing

    Authors: Yevgen Matusevych, Herman Kamper, Thomas Schatz, Naomi H. Feldman, Sharon Goldwater

    Abstract: Non-native speakers show difficulties with spoken word processing. Many studies attribute these difficulties to imprecise phonological encoding of words in the lexical memory. We test an alternative hypothesis: that some of these difficulties can arise from the non-native speakers' phonetic perception. We train a computational model of phonetic learning, which has no access to phonology, on either…

    Submitted 11 March, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in Proceedings of EACL-2021. 11 pages, 5 figures, 2 tables

  3. arXiv:2008.02888

    cs.CL cs.SD eess.AS

    Evaluating computational models of infant phonetic learning across languages

    Authors: Yevgen Matusevych, Thomas Schatz, Herman Kamper, Naomi H. Feldman, Sharon Goldwater

    Abstract: In the first year of life, infants' speech perception becomes attuned to the sounds of their native language. Many accounts of this early phonetic learning exist, but computational models predicting the attunement patterns observed in infants from the speech input they hear have been lacking. A recent study presented the first such model, drawing on algorithms proposed for unsupervised learning fr…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 7 pages, 1 figure

    Journal ref: 2020. In S. Denison, M. Mack, Y. Xu, and B. Armstrong (Eds.), Proceedings of the 42nd Annual Conference of the Cognitive Science Society (pp. 571-577). Austin, TX: Cognitive Science Society

  4. arXiv:1804.11297

    cs.CL cs.LG

    Sampling strategies in Siamese Networks for unsupervised speech representation learning

    Authors: Rachid Riad, Corentin Dancette, Julien Karadayi, Neil Zeghidour, Thomas Schatz, Emmanuel Dupoux

    Abstract: Recent studies have investigated siamese network architectures for learning invariant speech representations using same-different side information at the word level. Here we investigate systematically an often ignored component of siamese networks: the sampling procedure (how pairs of same vs. different tokens are selected). We show that sampling strategies taking into account Zipf's Law, the dist…

    Submitted 23 August, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: Conference paper at Interspeech 2018

  5. arXiv:1711.01161

    cs.CL

    Learning Filterbanks from Raw Speech for Phone Recognition

    Authors: Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux

    Abstract: We train a bank of complex filters that operates on the raw waveform and is fed into a convolutional neural network for end-to-end phone recognition. These time-domain filterbanks (TD-filterbanks) are initialized as an approximation of mel-filterbanks, and then fine-tuned jointly with the remaining convolutional architecture. We perform phone recognition experiments on TIMIT and show that for seve…

    Submitted 4 April, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Accepted at ICASSP 2018