NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

The HEARSAY Speech Understanding System

The HEARSAY Speech Understanding System

作者: D. Raj Reddy, Lee D. Erman, Richard D. Fennell, Bruce T. Lowerre, Richard B. Neely,

期刊: The Journal of the Acoustical Society of America （AIP Available online 1974）
卷期: Volume 55, issue 2

页码: 409-409

ISSN:0001-4966

年代: 1974

DOI:10.1121/1.3437267

出版商: Acoustical Society of America

数据来源: AIP

摘要:

This talk describes the present state of performance of the HEARSAY system. [For more complete descriptions of the system see D. R. Reddy, L. D. Erman, and R. D. Neely, “A Model and a System for Machine Recognition of Speech,” IEEE Trans. Audio Electroacoust. AU‐21, 229–238 (1973) and D. R. Reddy, L. D. Erman, R. D. Fennell, and R. B. Neely, “The HEARSAY Speech Understanding System : An Example of the Recognition Process,” Proc. 3rd Int. Joint Conf. on Artificial Intelligence (Aug. 1973)]. The system uses task and context‐dependent information to help in the recognition of the utterance; this system consists of a set of cooperating parallel processes, each representing a different source of knowledge (e.g., acoustic‐phonetic, syntactic, semantic). The knowledge is used either to predict what may appear in a given context or to verify an hypothesis resulting from a previous prediction. Performance data of the system on several tasks (e.g., medical diagnosis, news retrieval, chess, and programming) will be presented. For example: The voice‐chess task contains a 31‐word vocabulary with about 5 000 000 possible sentences. One particular set of data contains 19 utterances of three to nine (mean =4.5) words each. Seventy‐nine percent of the utterances are correctly recognized (at about four times real time on a PDP10 computer); removing the semantic source of knowledge (but leaving the syntactic and acoustic‐phonetic) reduces recognition to 42%.

点击下载: PDF (141KB)