The objective of this thesis is to increase the correct recognition rate for two-word Mandarin phrases. Recognition errors arise from ambiguities in the syllables and in the tones. To address syllable ambiguity, a balanced speech training dataset is designed and the weights of the state observation probabilities for vowels and consonants are adjusted. To address tone ambiguity, both the pitch contour and the spectrum evolution property derived from the Karhunen-Loève transform are applied. The experimental results indicate that an 85% correct recognition rate can be achieved, a 6% increase over the system without the above improvements.
Identifier | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0906107-011936 |
Date | 06 September 2007 |
Creators | Jheng, He-de |
Contributors | Tsung Lee, Chii Maw Uang, Chih-Chien Chen |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0906107-011936 |
Rights | not_available, Copyright information available at source archive |