NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

Speaker‐identifying features based on formant tracks

Speaker‐identifying features based on formant tracks

作者: Ursula G. Goldstein,

期刊: The Journal of the Acoustical Society of America （AIP Available online 1976）
卷期: Volume 59, issue 1

页码: 176-182

ISSN:0001-4966

年代: 1976

DOI:10.1121/1.380837

出版商: Acoustical Society of America

关键词: 7065;7040

数据来源: AIP

摘要:

The formant structure of three diphthongs, four tense vowels, and three retroflex sounds was examined in detail for possible speaker‐identifying features. These sounds were spoken five times each in sentence context by ten speakers of General American on one day and by six of the speakers on a second day at least three weeks later. Formant tracks were computed for each sound under investigation using covariance‐type pitch‐asynchronous linear prediction together with a root‐finding algorithm. The interspeaker variability of about 200 measurements made on these formant tracks was compared initially with intraspeaker variability through the calculation ofFratios. Those with averageFratios greater than 60 were evaluated further with a probability‐of‐error criterion. Features that are potentially most effective in identifying speakers are the minimum second‐formant value in [‐r], the maximum first‐formant value in [‐r], the maximum second‐formant values of [o] and [‐I], and the minimum third‐formant value of [‐]. The individual differences apparent in these sounds presumably depend more on speaker habits than on vocal‐tract anatomy. The error bound predicted for a speaker identification procedure based on these five features is 0.24%. An identification experiment using only the best two features gave 12 errors out of 80 identifications.Subject Classification: [43]70.65, [43]70.40.

点击下载: PDF (903KB)