Skip to main content

Showing 1–3 of 3 results for author: Walker, D D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.12029  [pdf, other

    cs.CL cs.AI cs.LG

    MultiTurnCleanup: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup

    Authors: Hua Shen, Vicky Zayats, Johann C. Rocholl, Daniel D. Walker, Dirk Padfield

    Abstract: Current disfluency detection models focus on individual utterances each from a single speaker. However, numerous discontinuity phenomena in spoken conversational transcripts occur across multiple turns, hampering human readability and the performance of downstream NLP tasks. This study addresses these phenomena by proposing an innovative Multi-Turn Cleanup task for spoken conversational transcript… ▽ More

    Submitted 27 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 main conference. Dataset: https://github.com/huashen218/MultiTurnCleanup

  2. arXiv:2205.00620  [pdf, other

    cs.CL

    Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection

    Authors: Angelica Chen, Vicky Zayats, Daniel D. Walker, Dirk Padfield

    Abstract: In modern interactive speech-based systems, speech is consumed and transcribed incrementally prior to having disfluencies removed. This post-processing step is crucial for producing clean transcripts and high performance on downstream tasks (e.g. machine translation). However, most current state-of-the-art NLP models such as the Transformer operate non-incrementally, potentially causing unacceptab… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: To be published at NAACL 2022

  3. arXiv:2104.10769  [pdf, ps, other

    cs.CL

    Disfluency Detection with Unlabeled Data and Small BERT Models

    Authors: Johann C. Rocholl, Vicky Zayats, Daniel D. Walker, Noah B. Murad, Aaron Schneider, Daniel J. Liebling

    Abstract: Disfluency detection models now approach high accuracy on English text. However, little exploration has been done in improving the size and inference time of the model. At the same time, automatic speech recognition (ASR) models are moving from server-side inference to local, on-device inference. Supporting models in the transcription pipeline (like disfluency detection) must follow suit. In this… ▽ More

    Submitted 27 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: INTERSPEECH 2021