From theory to practice: A 10‐yr path

 

Author: M. H. O'Malley

 

Journal: The Journal of the Acoustical Society of America (AIP, available online 1990)
Volume/Issue: Volume 88, Issue S1

Page: 196

 

ISSN:0001-4966

 

Year: 1990

 

DOI:10.1121/1.2028880

 

Publisher: Acoustical Society of America

 

Data source: AIP

 

Abstract:

Berkeley Speech Technologies, Inc. has been developing commercial text to speech synthesis technology for over 10 yr. What started out as a quick “technology transfer” has grown to become a complex body of “intellectual property” that has been realized in such products as a 100 000‐word talking dictionary, a telephone response system with 16 T‐T‐S lines on one board, a satellite communication system for trucks, and a portable talking computer for blind users. Practical considerations caused modification of the initial theoretical assumptions. From the beginning, it was assumed that high intelligibility and high phoneme accuracy were essential, but it was soon learned that 700 words per minute with a 25‐ms start and stop are equally important for blind users. Similarly, academic research had assumed wide bandwidth and low noise, but telephone systems require that all of the speech information be packed into a 3.5‐kHz telephone bandwidth. Initially, the choice was made to use demi‐syllable synthesis because it seemed to be an “engineering shortcut” that might cover gaps in standard scientific descriptions. As the technology developed, however, the decision was made to convert to a more scientifically based synthesis model because it offered higher quality, greater flexibility, and faster development, especially of new languages. Our 10‐yr development could not have been justified on the basis of expected financial return. However, it was and is fun.

 
