Skip to main content

Showing 1–4 of 4 results for author: Massey, N

Searching in archive cs. Search in all archives.
.
  1. A High Quality Text-To-Speech System Composed of Multiple Neural Networks

    Authors: Orhan Karaali, Gerald Corrigan, Noel Massey, Corey Miller, Otto Schnurr, Andrew Mackie

    Abstract: While neural networks have been employed to handle several different text-to-speech tasks, ours is the first system to use neural networks throughout, for both linguistic and acoustic processing. We divide the text-to-speech task into three subtasks, a linguistic module mapping from text to a linguistic representation, an acoustic module mapping from the linguistic representation to speech, and… ▽ More

    Submitted 4 December, 1998; originally announced December 1998.

    Comments: Source link (9812006.tar.gz) contains: 1 PostScript file (4 pages) and 3 WAV audio files. If your system does not support Windows WAV files, try a tool like "sox" to translate the audio into a format of your choice

    ACM Class: I.2.6; K.3.2

    Journal ref: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (1998) 2:1237-1240. Seattle, Washington

  2. arXiv:cs/9811032  [pdf, ps

    cs.NE cs.HC

    Text-To-Speech Conversion with Neural Networks: A Recurrent TDNN Approach

    Authors: Orhan Karaali, Gerald Corrigan, Ira Gerson, Noel Massey

    Abstract: This paper describes the design of a neural network that performs the phonetic-to-acoustic mapping in a speech synthesis system. The use of a time-domain neural network architecture limits discontinuities that occur at phone boundaries. Recurrent data input also helps smooth the output parameter tracks. Independent testing has demonstrated that the voice quality produced by this system compares… ▽ More

    Submitted 24 November, 1998; originally announced November 1998.

    Comments: 4 pages, PostScript

    ACM Class: I.2.6; K.3.2

    Journal ref: Proceedings of Eurospeech (1997) 561-564. Rhodes, Greece

  3. arXiv:cs/9811030  [pdf, ps

    cs.NE cs.HC

    Generating Segment Durations in a Text-To-Speech System: A Hybrid Rule-Based/Neural Network Approach

    Authors: Gerald Corrigan, Noel Massey, Orhan Karaali

    Abstract: A combination of a neural network with rule firing information from a rule-based system is used to generate segment durations for a text-to-speech system. The system shows a slight improvement in performance over a neural network system without the rule firing information. Synthesized speech using segment durations was accepted by listeners as having about the same quality as speech generated us… ▽ More

    Submitted 24 November, 1998; originally announced November 1998.

    Comments: 4 pages, PostScript

    ACM Class: I.2.6; K.3.2

    Journal ref: Proceedings of Eurospeech (1997) 2675-2678. Rhodes, Greece

  4. arXiv:cmp-lg/9711004  [pdf, ps

    cs.CL

    Variation and Synthetic Speech

    Authors: Corey Miller, Orhan Karaali, Noel Massey

    Abstract: We describe the approach to linguistic variation taken by the Motorola speech synthesizer. A pan-dialectal pronunciation dictionary is described, which serves as the training data for a neural network based letter-to-sound converter. Subsequent to dictionary retrieval or letter-to-sound generation, pronunciations are submitted a neural network based postlexical module. The postlexical module has… ▽ More

    Submitted 17 November, 1997; originally announced November 1997.

    Comments: 18 pages, 2 figures

    Report number: Motorola-SSML-1