Nonlinear analysis and classification of speech under stressed conditions

 

Authors: Douglas A. Cairns, John H. L. Hansen

 

Journal: The Journal of the Acoustical Society of America (AIP, available online 1994)
Volume and issue: Volume 96, Issue 6

Pages: 3392-3400

 

ISSN: 0001-4966

 

Year: 1994

 

DOI: 10.1121/1.410601

 

Publisher: Acoustical Society of America

 

Keywords: SPEECH PRODUCTION; STRESS ANALYSIS; NONLINEAR PROBLEMS; VOICES; LOUDNESS

 

Data source: AIP

 

Abstract:

The speech production system is capable of conveying an abundance of information with regard to sentence text, speaker identity, prosodics, as well as emotion and speaker stress. In an effort to better understand the mechanism of human voice communication, researchers have attempted to determine reliable acoustic indicators of stress using such speech production features as fundamental frequency (F0), intensity, spectral tilt, the distribution of spectral energy, and others. Their findings indicate that more work is necessary to propose a general solution. In this study, we hypothesize that speech consists of a linear and a nonlinear component, and that the nonlinear component changes markedly between normal and stressed speech. To quantify the changes between normal and stressed speech, a classification procedure was developed based on the nonlinear Teager Energy operator. The Teager Energy operator provides an indirect means of evaluating the nonlinear component of speech. The system was tested using VC and CVC utterances from native speakers of English across the following speaking styles: neutral, loud, angry, Lombard effect, and clear. Results of the system evaluation show that loud and angry speech can be differentiated from neutral speech, while clear speech is more difficult to differentiate. Results also show that reliable classification of Lombard effect speech is possible, but system performance varies across speakers.
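
For readers unfamiliar with the operator named in the abstract, the following is a minimal sketch of the commonly cited discrete-time Teager energy operator, psi[n] = x[n]^2 - x[n-1] * x[n+1]. The abstract does not give the exact formulation or the classification procedure used in the paper, so this is an illustration under that assumption, not the authors' implementation. The function name `teager_energy` and the test signal parameters are hypothetical.

```python
import math


def teager_energy(x):
    """Discrete Teager energy for the interior samples of x.

    Assumed standard form: psi[n] = x[n]^2 - x[n-1] * x[n+1].
    The first and last samples are skipped because they lack a neighbour.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Hypothetical test signal: a pure cosine, not speech data from the paper.
amplitude = 2.0
omega = 0.3  # digital frequency in radians per sample
signal = [amplitude * math.cos(omega * n) for n in range(50)]

psi = teager_energy(signal)

# For a pure sinusoid the operator output is constant: A^2 * sin^2(omega).
expected = amplitude ** 2 * math.sin(omega) ** 2
max_error = max(abs(p - expected) for p in psi)
```

For a pure sinusoid the output is the constant A^2 sin^2(omega), so it tracks both amplitude and frequency. Deviations from such smooth behaviour in real speech are what make the operator a candidate indirect probe of nonlinear effects, as the abstract describes.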

 
