The human voice is a source of important information regarding the physical, psychological, and mental health conditions of a speaker. Acoustic properties of speech have previously been reported as possible cues to risk of committing suicide in persons suffering from severe depression. Certain vocal parameters may be capable of objectively distinguishing depressive speech from near-term suicidal speech. Studies were performed to analyze and statistically compare the speech acoustics of separate female and male samples comprised of subjects attempting suicide and subjects carrying diagnoses of depression and remission (recovery from depression). In this study, two types of speech recordings, spontaneous and reading speech, were collected from each subject of diagnostic groups participating in interview and text-reading sessions. Acoustic analyses of energy distribution within a 0-2,000 Hz frequency range and energy concentration characterizing the vocal tract spectral response based on the Gaussian mixture model (GMM) were performed on speech samples. Discriminant analyses demonstrated the significance of energy distribution and GMM-based vocal features as being effective indicators of perceptual changes in speech production and articulation caused by the severity of psychological state, and as powerful discriminators of diagnostic groups in both female and male studies. Based on the most important pairwise study of depressed and suicidal speech, the 12-fold cross validations yielded the correct classification scores of 86% and 90.33% in classifying spontaneous and reading speech of females, and 86% and 88.50% in classifying male spontaneous and reading speech, respectively. Results suggest the investigated features derived from the reading speech capable of identifying the degree of psychological state as effective as those derived from the spontaneous speech among diagnostic groups.
Identifer | oai:union.ndltd.org:VANDERBILT/oai:VANDERBILTETD:etd-08152007-120523 |
Date | 20 August 2007 |
Creators | Yingthawornsuk, Thaweesak |
Contributors | Richard G.Shiavi, Ronald M. Salomon, D. Mitchell Wilkes, Ralph N. Ohde, A.B. Bonds III |
Publisher | VANDERBILT |
Source Sets | Vanderbilt University Theses |
Language | English |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.library.vanderbilt.edu/available/etd-08152007-120523/ |
Rights | unrestricted, I hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to Vanderbilt University or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report. |
Page generated in 0.0018 seconds