Predictor codebook for speaker‐independent speech recognition
Author:
Takeshi Kawabata
Journal:
Systems and Computers in Japan
(WILEY Available online 1994)
Volume/Issue:
Volume 25, Issue 1
Pages: 37-46
ISSN: 0882-1666
Year: 1994
DOI: 10.1002/scj.4690250103
Publisher: Wiley Subscription Services, Inc., A Wiley Company
Keywords: Speech recognition; speaker‐independent recognition; predictor codebook; HMM
Data source: WILEY
Abstract:
This paper discusses a method for handling the diverse dynamic features of speech by representing them with spectrum predictors and constructing a codebook whose elements are predictors. The effectiveness of the method for speaker‐independent speech recognition is examined. Three kinds of predictor structures are considered: the forward predictor, the backward predictor, and the interpolator. The predictor codebook is constructed by the predictor quantization procedure, which is a small modification of the LBG algorithm. For evaluation at the phoneme recognition level, two kinds of statistical evaluation quantities and the phoneme recognition rate are considered. The results show that the predictor codebook achieves high phoneme separation capability and robustness against speaker variation. The method is also incorporated into a phrase recognition system, and its performance at the continuous speech recognition level is evaluated. In both cases, the codebook with backward predictors as its elements exhibited the highest performance.