NSTL回溯数据服务平台

首页

按字顺浏览

期刊浏览

卷期浏览

Decoding the speech code—Applications of temporal decomposition

Decoding the speech code—Applications of temporal decomposition

作者: Stephen M. Marcus, Bishnu S. Atal,

期刊: The Journal of the Acoustical Society of America （AIP Available online 1986）
卷期: Volume 80, issue S1

页码: 17-17

ISSN:0001-4966

年代: 1986

DOI:10.1121/1.2023680

出版商: Acoustical Society of America

数据来源: AIP

摘要:

Articulatory phonetics describes speech as a sequence of overlapping articulatory gestures, each of which may be associated with a characteristic ideal target spectrum. In normal speech, the idealized target gestures for each speech sound are often never attained, and the speech signal exhibits only transitions between such (implicit) targets. It has been suggested that the underlying speech sounds can only be recovered by reference to detailed knowledge of the gestures by which individual speech sounds are produced. It will be shown that it is possible to decompose the speech signal into overlapping “temporal transition functions” using techniques which make no assumptions about the phonetic structure of the signal or the articulatory constraints used in speech production. Previous work has shown that these techniques can produce a large reduction in the information rate needed to represent the spectral information in speech signals [B.S. Atal, Proc. ICASSP83, 2.6, 81–84 (1983)]. It will be shown that these methods are able to derive speech components of low bandwiths that vary on a time scale closely related to traditional phonetic events. Implications for perception and the application of such techniques both for speech coding and as a possible front end for speech recognition will be discussed.

点击下载: PDF (178KB)