NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

Pronunciation of Digit Sequences in Text-to-Speech Systems

Pronunciation of Digit Sequences in Text-to-Speech Systems

作者: W. A. AINSWORTH, N. P. WARREN,

期刊: Connection Science （Taylor Available online 1990）
卷期: Volume 2, issue 3

页码: 241-249

ISSN:0954-0091

年代: 1990

DOI:10.1080/09540099008915671

出版商: Taylor & Francis Group

数据来源: Taylor

摘要:

Text-to-speech systems usually consist of a preprocessor for expanding abbreviations, a system for converting orthographic text to a phonemic representation, rules for generating appropriate rhythm and intonation, and a speech synthesizer to generate an acoustic waveform from the phonemic representation. Multi-layer perceptrons have recently been used for the orthographic to phonemic conversion process. In this paper the possibility of using perceptrons in the preprocessor is explored. It is shown that single-layer perceptrons are sufficient for expanding 3-digit numbers, 4-digit numbers and cardinal numbers into appropriate orthographic text, but a multi-layer perceptron is required for expanding 12-hour clock times.

点击下载: PDF (134KB)