In this thesis, a three-word and four-word Mandarin phrases speech recognition system is developed. This system contains two recordings of twenty-four thousand three-word phrases and twenty-two thousand four-word phrases in the database. And it applies MFCC, mono-syllable HMM¡¦s and speech-text alignment scheme to select the initial phrase candidates. A wavelet transform based vowel segmentation technique and a Mandarin pitch identification method is then followed to increase the phrase correct identification rate and obtain the final answer. Experimental results indicate that 92% and 96% correct rates can be achieved for three-word and four-word phrases recognition problems respectively, under the conditions that the first recording of this database is used for training and the second one is for testing. For the speaker-dependent case, the correct phrase can be found within 1 second, using a PC with Intel Celeron 2.4 GHz CPU and RedHat Linux 9.0 Operation System.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0910106-211507 |
Date | 10 September 2006 |
Creators | Sue, Ji-sin |
Contributors | Chih-Chien Chen, Chii-Maw Uang, Tsung Lee |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0910106-211507 |
Rights | not_available, Copyright information available at source archive |
Page generated in 0.0016 seconds