首页   按字顺浏览 期刊浏览 卷期浏览 Speaker‐identifying features based on formant tracks
Speaker‐identifying features based on formant tracks

 

作者: Ursula G. Goldstein,  

 

期刊: The Journal of the Acoustical Society of America  (AIP Available online 1976)
卷期: Volume 59, issue 1  

页码: 176-182

 

ISSN:0001-4966

 

年代: 1976

 

DOI:10.1121/1.380837

 

出版商: Acoustical Society of America

 

关键词: 7065;7040

 

数据来源: AIP

 

摘要:

The formant structure of three diphthongs, four tense vowels, and three retroflex sounds was examined in detail for possible speaker‐identifying features. These sounds were spoken five times each in sentence context by ten speakers of General American on one day and by six of the speakers on a second day at least three weeks later. Formant tracks were computed for each sound under investigation using covariance‐type pitch‐asynchronous linear prediction together with a root‐finding algorithm. The interspeaker variability of about 200 measurements made on these formant tracks was compared initially with intraspeaker variability through the calculation ofFratios. Those with averageFratios greater than 60 were evaluated further with a probability‐of‐error criterion. Features that are potentially most effective in identifying speakers are the minimum second‐formant value in [‐r], the maximum first‐formant value in [‐r], the maximum second‐formant values of [o] and [‐I], and the minimum third‐formant value of [‐]. The individual differences apparent in these sounds presumably depend more on speaker habits than on vocal‐tract anatomy. The error bound predicted for a speaker identification procedure based on these five features is 0.24%. An identification experiment using only the best two features gave 12 errors out of 80 identifications.Subject Classification: [43]70.65, [43]70.40.

 

点击下载:  PDF (903KB)



返 回