Showing 1–2 of 2 results for author: Wrench, A

Search v0.5.6 released 2020-02-24

arXiv:2011.09804 [pdf, other]

eess.AS cs.CL cs.CV cs.SD eess.IV

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

Authors: Manuel Sam Ribeiro, Jennifer Sanger, Jing-Xuan Zhang, Aciel Eshky, Alan Wrench, Korin Richmond, Steve Renals

Abstract: We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of English; TaL80 is a set of recording sessions of 81 native speakers of English without voice talent experience. Overall, the corpus contains 24 hours of parallel ultrasound, video, and audio data, of which approximately 13.5 hours are speech. This paper describes the corpus and presents benchmark results for the tasks of speech recognition, speech synthesis (articulatory-to-acoustic mapping), and automatic synchronisation of ultrasound to audio. The TaL corpus is publicly available under the CC BY-NC 4.0 license.

Submitted 19 November, 2020; originally announced November 2020.

Comments: 8 pages, 4 figures, accepted to SLT2021, IEEE Spoken Language Technology Workshop

arXiv:1907.00835 [pdf, other]

cs.CL cs.CV cs.SD eess.AS eess.IV

doi 10.21437/Interspeech.2018-1736

UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

Authors: Aciel Eshky, Manuel Sam Ribeiro, Joanne Cleland, Korin Richmond, Zoe Roxburgh, James Scobbie, Alan Wrench

Abstract: We introduce UltraSuite, a curated repository of ultrasound and acoustic data, collected from recordings of child speech therapy sessions. This release includes three data collections, one from typically developing children and two from children with speech sound disorders. In addition, it includes a set of annotations, some manual and some automatically produced, and software tools to process, transform and visualise the data.

Submitted 1 July, 2019; originally announced July 2019.

Comments: 5 pages, 1 figure, 3 tables; accepted to Interspeech 2018: 19th Annual Conference of the International Speech Communication Association (ISCA)
