NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

Segmentation Scheme for Use of a Speech Recognition Computer Program

Segmentation Scheme for Use of a Speech Recognition Computer Program

作者: James W. Forgie, Carma D. Forgie,

期刊: The Journal of the Acoustical Society of America （AIP Available online 1960）
卷期: Volume 32, issue 11

页码: 1516-1516

ISSN:0001-4966

年代: 1960

DOI:10.1121/1.1936353

出版商: Acoustical Society of America

数据来源: AIP

摘要:

A segmentation scheme is being developed to operate as a first step in a general speech recognition program. Input to the segmenter is sampled, quantized spectral data (35 frequency channels scanned every 5.5 msec). The output is a sequence of boundary markers together with a rough classification of the enclosed segments. The segments are generally phonemic, or smaller, in size, and the classification corresponds either to type of phoneme (e.g., fricative, nasal, vowel) or to sound characteristic (e.g., silence, aspiration). The segmenter operates by first classifying each 5.5‐msec scan of the spectral data. This classification is obtained by computing a number of measurements of different attributes of the spectral pattern and combining the results with suitable weighting functions. The number of such measurements is made larger than the logical minimum to avoid placing too much dependence on any one measurement. Segments are then built by grouping together scans which have the same individual classifications. Finally these primitive segments are subjected to an editing process which modifies and merges together certain of the primitive segments according to rules based on length, amplitude, the classification of neighboring segments, etc. The effect of the editing process is to insure that the output segment sequence does not violate any simple articulatory rules for the language. (Lincoln Laboratory is operated with support from the U. S. Army, Navy, and Air Force.)

点击下载: PDF (169KB)