Return to search

Acoustic correlates of word stress in American English

Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2006. / Includes bibliographical references (p. 123-126). / Acoustic parameters that differentiate between primary stress and non-primary full vowels were determined using two-syllable real and novel words and specially constructed novel words with identical syllable compositions. The location of the high focal pitch accent within a declarative carrier phrase was varied using an innovative object naming task that allowed for a natural and spontaneous manipulation of phrase-level accentuation. Results from male native speakers of American English show that when the high focal pitch accent was on the novel word, vowel differences in pitch, intensity prominence, and amplitude of the first harmonic, H1 * (corrected for the effect of the vocal tract filter), accurately distinguished full vowel syllables carrying primary stress vs. non-primary stress. Acoustic parameters that correlated to word stress under all conditions tested were syllable duration, HI*-A3*, as a measurement of spectral tilt, and noise at high frequencies, determined by band-pass filtering the F3 region of the spectrum. Furthermore, the results indicate that word stress cues are augmented when the high focal pitch accent is on the target word. / (cont.) This became apparent after a formula was devised to correct for the masking effect of phrase-level accentuation on the spectral tilt measurement, Hi *-A3*. Perceptual experiments also show that male native speakers of American English utilized differences in syllable duration and spectral tilt, as controlled by the KLSYN88 parameters DU and TL, to assign prominence status to the syllables of a novel word embedded in a carrier phrase. Results from this study suggest that some correlates to word stress are produced in the laryngeal region and are due to vocal fold configuration. The model of word stress that emerges from this study has aspects that differ from other widely accepted models of prosody at the word level. The model can also be applied to improve the prosody of synthesized speech, as well as to improve machine recognition of speech. / by Anthony O. Okobi. / Ph.D.

Identiferoai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/37963
Date January 2006
CreatorsOkobi, Anthony O. (Anthony Obiesie), 1976-
ContributorsKenneth N. Stevens., Harvard University--MIT Division of Health Sciences and Technology., Harvard University--MIT Division of Health Sciences and Technology.
PublisherMassachusetts Institute of Technology
Source SetsM.I.T. Theses and Dissertation
LanguageEnglish
Detected LanguageEnglish
TypeThesis
Format126 p., application/pdf
RightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission., http://dspace.mit.edu/handle/1721.1/7582

Page generated in 0.183 seconds