|
61. |
Subjective sound quality of all‐pass networks |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 61-61
Paul Milner,
Benjamin B. Bauer,
Preview
|
PDF (209KB)
|
|
摘要:
In an earlier study [J. Acoust. Soc. Am.54, 334 (1973)] we reported that the subjective quality of music appeared to be unaffected by processing signals through all‐pass phase‐shift networks as used in the SQ quadraphonic matrix encoding system. These conclusions were based on the results of a same/different experiment in which the listeners appeared to be operating at a chance level for a two‐alternative forced‐choice paradigm. They responded with an equal likelihood (50%) of same and different regardless of the nature of the stimulus pairs. Additional experiments have been performed with speech samples using several lists of Harvard PB Sentences. In a same/different experiment, the listeners overwhelmingly judged “same” regardless of conditions. However, in an A‐B‐X experiment, the listeners favored the “B” choice, the one nearest the time to the unknown. Foil conditions of high‐pass filtering, low‐pass filtering, and peak‐clipping were included to verify the performance of the listeners. Correct responses to these conditions were provided except at near discrimination thresholds. One may therefore conclude that such all‐pass networks have no audible effect on subjective quality. In addition, whe presented with a difficult A‐B‐X discrimination task, a significant response bias results from a preference to the stimulus closest in time to the unknown.
ISSN:0001-4966
DOI:10.1121/1.2015807
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
62. |
Discrimination of a moving auditory “dot” from a stationary auditory “line” |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 62-62
D. Wesley Grantham,
Frederic L. Wightman,
Preview
|
PDF (203KB)
|
|
摘要:
In order to assess the binaural system's ability to detect varying interaural temporal differences (IATD's) we required observers to discriminate a binaural noise with a sinusoidally modulated IATD (a “moving dot”) from a binaural noise with random static IATD's (a “stationary line”). This latter stimulus was constructed so that its image presumably filled the same intracranial space as that described by the excursions of the “dot.” For several modulation frequencies (fm) from 0 to 200 Hz we determined the peak IATD (Δtmax, corresponding to the extent of movement) required for 75% discriminability of the dot from the line. Asfmincreased from 0 to 50 Hz, threshold Δtmaxincreased from 30 to 130 μsec, reflecting the decreasing ability of the binaural system to follow the fluctuating IATD's (forfm>50‐Hz thresholds decreased again, probably because cues other than movement were available at these high rates of modulation). A simple extension of the Jeffress coincidence network model for localization is proposed as a possible “movement detector.”
ISSN:0001-4966
DOI:10.1121/1.2015815
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
63. |
Relation of some personality types to annoyance response to noise |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 63-63
T. Shigehisa,
W. J. Gunn,
Preview
|
PDF (166KB)
|
|
摘要:
Subjects were classified into several groups based on their scores on extraversion, neuroticism, psychoticism, and lie scales of the Eysenck Personality Questionnaire. They were instructed to estimate the magnitude of annoyance of 27 aircraft noise stimuli while engaged in viewing television. Test/retest correlations between average annoyance estimations (over the 27 noises),slopes and intercepts of annoyancefunction in three test sessions (three different conditions of test room illumination), as well as split‐half reliabilities, showed that annoyance measures were stable and consistent among the subjects. The slope increased and the intercept decreased when the illumination was lower than ordinary in psychotics (but not in nonpsychotics) and in low‐Lscorers (but not in high‐Lscorers). The intercept tended to correlate positively while the slope correlated negatively with introversion, when illumination was brighter, and the slope correlated negatively with neuroticism whet illuminations was dimmer. The average estimation measures showed that hysterics (neurotic extroverts) are high annoyance estimators, where dysthymics (neurotic introverts) are low annoyance estimators of noise at relatively high levels (ranging from 84 to 92 dBA, peak value).
ISSN:0001-4966
DOI:10.1121/1.2015821
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
64. |
Consonant duration in speech production and perception of adults and children |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 64-64
Sarah Hawkins,
Preview
|
PDF (182KB)
|
|
摘要:
A method of adjustment to internal criterion was used to investigate the perception of consonant duration. Three adults and two children, 5 and 7 years old, adjusted the durations of /s/ and /t/ in the synthesised wordssay, tay, stay, andstray. The words were presented in two synthesised sentences, in both an adult male and a child's voice. The perceptual data were compared with speech produced by the same and different subjects. The just noticeable difference (jnd) for consonant segment duration was of the order of 25 msec; but the internal criterion for “acceptable” durations can shift over time by more than the jnd at any one time. Durations of /s,t/ in perception and production were strongly associated within and between subjects, but there were some interesting adult—child differences. Compared with adults, children appeared to lack sensitivity to differences between sentence contexts in the same speaker voice, but to differentiate more between /s,t/ durations they required for the child versus the adult speaker.The differences required in perception for the child speaker did not always mirror the child listener's own production durations. These data are interpreted with respect to possible processes governing the regulation of segmental timing and rhythm in perception and production. [Supported by SRC, Great Britian.]
ISSN:0001-4966
DOI:10.1121/1.2015827
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
65. |
Response bias and subjective estimation of consonant frequency |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 65-65
Louis Goldstein,
Marcel van den Broecke,
Preview
|
PDF (209KB)
|
|
摘要:
Do people have expectations about the frequency of occurrence of consonants in a given context and if so, do such expectations manifest themselves as consistent response biases in perceptual confusions? Subjective estimates of the frequency of selected /ιC/ syllables in English were measured. Subjects were auditorily presented with pairs of syllables and were asked to judge which member of the pair was the more frequent sound sequence in English. The scaled frequency judgments correlated highly (r= 0.84,p<0.001) with consonant frequencies in the same environment in published counts, Available data on perceptual confusions was analysed to determine if there was any consistent response bias. Consistent differences among various consonants in terms of their tendency to occur as responses would constitute a response bias. The number of times a certain consonant was given as a response in the material examined did not correlate with that consonant's frequency of occurrence. Such consistent response biases as did occur were due to asymmetric relations within a small subset of the total set of consonants. [Work supported by NSF, Z.W.O.]
ISSN:0001-4966
DOI:10.1121/1.2015831
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
66. |
Cross language differences in tone perception |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 66-66
J. Gandour,
R. Harshman,
Preview
|
PDF (178KB)
|
|
摘要:
Using an individual differences multidimensional scaling model of perception, the present crosslanguage investigation seeks to determine what dimensions underlie the perception of linguistic tone, and to what extent an individual's language background (Thai, Yoruba, or American English) influences his perception. Dissimilarities data were obtained from subject's paired—comparison judgments of thirteen different pitch patterns superimposed on a synthetic speechlike syllable. A multidimensional scaling analysis of the data for the total group revealed that five dimensions—interpretively labeled, AVERAGE PITCH, DIRECTION, LENGTH, EXTREME ENDPOINT and SLOPE—best summarize the perceptual structure underlying the dissimilarities data. Language subgroup variation in relative importance of these dimensions appears to be primarily related to subgroup differences in the way pitch is used to convey linguistic information. Discriminant analysis showed that most individual speakers of a tone language (Thai or Yoruba) can be easily distinguished from speakers of a nontone language (English) on the basis of their distinctive patterns of perceptual saliency for these five dimensions. [Research supported in part by fellowship grants provided by SSRC and NSF to first author. UCLA 1975–1976.]
ISSN:0001-4966
DOI:10.1121/1.2015836
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
67. |
Standards Committee S1, Acoustics |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 67-67
Preview
|
PDF (92KB)
|
|
摘要:
Working group chairpersons will report on their progress in the preparation of standards, methods of measurements and testing, and terminology in physical acoustics, electroacoustics, sonics, ultrasonics, and underwater sound. Work in progress includes measurements of noise sources, noise dosimeters, integrating sound‐level meters, and revision and extension of sound‐level meter specifications. Open discussion of committee reports is encouraged.
ISSN:0001-4966
DOI:10.1121/1.2015840
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
68. |
Synthesizing whistle stops |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 68-68
Jared Bernstein,
Preview
|
PDF (211KB)
|
|
摘要:
Certain human languages have a derived form in which sound is generated by whistling. The information found in syllabic portions of speech is usually conveyed in whistle language by the frequency of the whistle as it mimics either the fundamental frequency (for tones) or the second formant (for vowel qualities) in speech. Some of these whistle languages include consonant articulations. This paper describes techniques for hand synthesis of whistled [t] and [k]and automatic analysis—synthesis of whistle stops using Wm. Henke's MITSYN system. Characteristic bursts and frequency transitions for stops during bilabial whistling are discussed and compared to bursts and transitions in speech. [Work supported by NIH Grant NS 04332.]
ISSN:0001-4966
DOI:10.1121/1.2015846
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
69. |
Principal‐components analysis of speech spectra |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 69-69
Stephen Zahorian,
Martin Rothenberg,
Preview
|
PDF (208KB)
|
|
摘要:
The principal‐components method is sometimes employed to describe speech spectra in terms of a small number of factors, that is eigenvectors of the covariance matrix. In this study, this method has been used to analyze the spectral properties of continuous speech and to regenerate the spectra from a small set of factors. Spectral data from 0 to 5000 Hz was obtained using a linear‐predictive (LP) all‐pole model. This data was used to calculate the average energy in twenty bands, each about an auditory critical bandwidth wide. The analysis was applied to (1) the log‐coded band energies and (2) the band energies to the one‐third power. Continuous speech was synthesized using spectra computed from a small number of principal‐component factors and the LP residual signal. The quality of this speech is compared with speech synthesized using a low‐order LP vocoder. In addition, it is argued that the principal components might be more easily identifiable with linguistic categories than are the poles of the low‐order LP model.
ISSN:0001-4966
DOI:10.1121/1.2015853
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
70. |
Large speech data bases |
|
The Journal of the Acoustical Society of America,
Volume 61,
Issue S1,
1977,
Page 70-70
D. Raj Reddy,
Bruce T. Lowerre,
Preview
|
PDF (156KB)
|
|
摘要:
It is easy enough to digitize a large amount of speech data. But before it can be effectively used in speech research, it must be cataloged to indicate position and descriptions of words and phones that are present in the data. In the absence of adequate tools, experts must do these tasks manually. Given interactive, automatic, and semiautomatic speech analysis programs, one can significantly improve the quality of the data base and the productivity of the experts. This paper describes the structure and characteristics of programs developed at Carnegie‐Mellon University which interactively bootstrap themselves to generate symbolic descriptions of a given set of data. The present system contains programs for (1) generating environment and speaker adapted phone templates (2) interactive generation of a phone lexicon containing alternating pronunciation of words generated from data, and (3) a program for machine aided labeling of a given phrase or sentence giving the beginning and ending of each phone and each word. Using these programs, over three hours of connected speech data has been digitized, and analyzed, to generate symbolic descriptions of the data in terms of phones and words. Retrieval programs are then used to retrieve all symbols satisfying a given property.
ISSN:0001-4966
DOI:10.1121/1.2015861
出版商:Acoustical Society of America
年代:1977
数据来源: AIP
|
|