The objective of this thesis is to reduce the training size for the Mandarin address inputting system and the Mandarin phrase recognition system. A set of two-word Mandarin phrases is developed by the balanced sieving and mixture training techniques. This greatly reduces the training data size for the systems. Hidden Markov model using both MFCC and LPCC features is proposed in this thesis. Speech-text alignment, frame overlapping and tone recognition are incorporated to increase the correct recognition rates. For the speaker-dependent case, any phrase in these two speech systems can be recognized within one second.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0826108-145039 |
Date | 26 August 2008 |
Creators | Lai, Jhao-Rong |
Contributors | Chii-Maw Uang, Chih-Chien Chen, Tsung Lee |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0826108-145039 |
Rights | not_available, Copyright information available at source archive |
Page generated in 0.0023 seconds