This thesis investigates the design and implementation strategies for a English speech recognition system. Two speech inputting methods, the spelling inputting and the reading inputting, are implemented for English word recognition and query. Mel-frequency cepstrum coefficients, linear predicted cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 1.6 GHz personal computer and Ubuntu 8.04 operating system environment, a 95% correct recognition rate can be obtained for a 110 thousand English word database by the spelling inputting method; and a 93% correct recognition rate can be achieved for a 1,500 English word database by the reading inputting method. The average computation time for each word using either inputting method is about 1.5 seconds.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824109-172533 |
Date | 24 August 2009 |
Creators | Chen, Yung-ming |
Contributors | Tsung Lee, Chih-Chien Chen, Chii-Maw Uang, Sheau-Shong Bor, Sheau-Shong Bor |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824109-172533 |
Rights | not_available, Copyright information available at source archive |
Page generated in 0.0017 seconds