This thesis investigates the design and implementation strategies for a Taiwanese speech recognition system. It adopts a 4 plus 1¡]five times¡^recording strategy, where the 1st four recordings are used for speech feature training and the last recording for speech recognition simulation. Mel-frequency cepstrum coefficients and hidden Markov model are used as the feature model and the recognition model respectively. Under the Intel Celeron 2.4 GHz personal computer and Red Hat Linux 9.0 operating system environment, a correct phrase recognition rate of 90% can be reached for a 4200 Taiwanese phrase database.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824109-170104 |
Date | 24 August 2009 |
Creators | Jhu, Hao-fu |
Contributors | Chii-Maw Uang, Chih-Chien Chen, Tsung Lee, Sheau-Shong Bor |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824109-170104 |
Rights | not_available, Copyright information available at source archive |
Page generated in 0.0017 seconds