NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

Models of speech production for speech analysis and synthesis

Models of speech production for speech analysis and synthesis

作者: M. M. Sondhi,

期刊: The Journal of the Acoustical Society of America （AIP Available online 1990）
卷期: Volume 87, issue S1

页码: 14-14

ISSN:0001-4966

年代: 1990

DOI:10.1121/1.2028027

出版商: Acoustical Society of America

数据来源: AIP

摘要:

Coding (i.e., analysis and resynthesis) of speech signals on the basis of physiological models of speech production mechanisms has received considerable attention during the past several years. One of the reasons for this renewed interest is that the known coding algorithms give unacceptably poor quality of speech at low bit rates (e.g., at 2400 bits/s). A coding scheme that mimics human speech production may have advantages at such low bit rates. Basically, three aspects of speech production need to be modeled. First, the geometry of the vocal and nasal tracts needs to be parametrized. Second, a model must be selected to describe wave propagation in the tract. Finally, the sound sources (vocal cords and turbulent airflow) and their interactions with the tract must be modeled. In this talk the current models being used for each of these categories will be briefly described, and the way in which these models are employed for coding will also be described. Also, the use of such models for text‐to‐speech synthesis will be mentioned. Finally, some examples of speech produced by such models, both from text as well as by analysis and resynthesis of a given speech utterance will be played.

点击下载: PDF (101KB)