首页   按字顺浏览 期刊浏览 卷期浏览 Pronunciation of Digit Sequences in Text-to-Speech Systems
Pronunciation of Digit Sequences in Text-to-Speech Systems

 

作者: W. A. AINSWORTH,   N. P. WARREN,  

 

期刊: Connection Science  (Taylor Available online 1990)
卷期: Volume 2, issue 3  

页码: 241-249

 

ISSN:0954-0091

 

年代: 1990

 

DOI:10.1080/09540099008915671

 

出版商: Taylor & Francis Group

 

数据来源: Taylor

 

摘要:

Text-to-speech systems usually consist of a preprocessor for expanding abbreviations, a system for converting orthographic text to a phonemic representation, rules for generating appropriate rhythm and intonation, and a speech synthesizer to generate an acoustic waveform from the phonemic representation. Multi-layer perceptrons have recently been used for the orthographic to phonemic conversion process. In this paper the possibility of using perceptrons in the preprocessor is explored. It is shown that single-layer perceptrons are sufficient for expanding 3-digit numbers, 4-digit numbers and cardinal numbers into appropriate orthographic text, but a multi-layer perceptron is required for expanding 12-hour clock times.

 

点击下载:  PDF (134KB)



返 回