Pronunciation of Digit Sequences in Text-to-Speech Systems
作者:
W. A. AINSWORTH,
N. P. WARREN,
期刊:
Connection Science
(Taylor Available online 1990)
卷期:
Volume 2,
issue 3
页码: 241-249
ISSN:0954-0091
年代: 1990
DOI:10.1080/09540099008915671
出版商: Taylor & Francis Group
数据来源: Taylor
摘要:
Text-to-speech systems usually consist of a preprocessor for expanding abbreviations, a system for converting orthographic text to a phonemic representation, rules for generating appropriate rhythm and intonation, and a speech synthesizer to generate an acoustic waveform from the phonemic representation. Multi-layer perceptrons have recently been used for the orthographic to phonemic conversion process. In this paper the possibility of using perceptrons in the preprocessor is explored. It is shown that single-layer perceptrons are sufficient for expanding 3-digit numbers, 4-digit numbers and cardinal numbers into appropriate orthographic text, but a multi-layer perceptron is required for expanding 12-hour clock times.
点击下载:
PDF (134KB)
返 回