The diploma thesis deals with the analysis of human emotional states speaker by the help of analyse speech signals. The thesis has two parts. In the first part, the process of speech generating is described in addition to the description of the commonly used pre-processing methods such as denoising or preemphasis. The first part also deals with the major and minor prosody features, these features are: the fundamental frequency, energy, spectral features and time domain features such as the speech rate. The second part of this thesis deals with a task of emotion recognition from the speech signal. When we accumulate sufficient of the number of recordings emotive state will be able to rekognize emotive state with high probability. All project is prepared for use in real time. The last part of this thesis thesis contains description and results of the experiments made on a large number of speech records.
Identifer | oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:217263 |
Date | January 2008 |
Creators | Navrátil, Michal |
Contributors | Atassi, Hicham, Smékal, Zdeněk |
Publisher | Vysoké učení technické v Brně. Fakulta elektrotechniky a komunikačních technologií |
Source Sets | Czech ETDs |
Language | Czech |
Detected Language | English |
Type | info:eu-repo/semantics/masterThesis |
Rights | info:eu-repo/semantics/restrictedAccess |
Page generated in 0.0024 seconds