首页   按字顺浏览 期刊浏览 卷期浏览 Models of speech production for speech analysis and synthesis
Models of speech production for speech analysis and synthesis

 

作者: M. M. Sondhi,  

 

期刊: The Journal of the Acoustical Society of America  (AIP Available online 1990)
卷期: Volume 87, issue S1  

页码: 14-14

 

ISSN:0001-4966

 

年代: 1990

 

DOI:10.1121/1.2028027

 

出版商: Acoustical Society of America

 

数据来源: AIP

 

摘要:

Coding (i.e., analysis and resynthesis) of speech signals on the basis of physiological models of speech production mechanisms has received considerable attention during the past several years. One of the reasons for this renewed interest is that the known coding algorithms give unacceptably poor quality of speech at low bit rates (e.g., at 2400 bits/s). A coding scheme that mimics human speech production may have advantages at such low bit rates. Basically, three aspects of speech production need to be modeled. First, the geometry of the vocal and nasal tracts needs to be parametrized. Second, a model must be selected to describe wave propagation in the tract. Finally, the sound sources (vocal cords and turbulent airflow) and their interactions with the tract must be modeled. In this talk the current models being used for each of these categories will be briefly described, and the way in which these models are employed for coding will also be described. Also, the use of such models for text‐to‐speech synthesis will be mentioned. Finally, some examples of speech produced by such models, both from text as well as by analysis and resynthesis of a given speech utterance will be played.

 

点击下载:  PDF (101KB)



返 回