This thesis investigates strategies for improving the recognition rate of a Japanese speech recognition system. Two approaches are studied: training data development and a consonant correction scheme. For training data development, a database of 995 two-syllable Japanese words is established by phonetically balanced sieving. Feature models for the 188 common Japanese mono-syllables are then derived through a mixed-position training scheme to increase the recognition rate. For consonant correction, a sub-syllable model is developed to enhance consonant recognition accuracy and thereby further improve the overall correct rate for whole Japanese phrases. Experimental results indicate that the average correct rate of a Japanese phrase recognition system with 34 thousand phrases can be improved from 86.91% to 92.38%.
Identifier | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824110-152453 |
Date | 24 August 2010 |
Creators | Lin, Cheng-Hung |
Contributors | Er-Hui Lu, Tsung Lee, Xiao-Song Bo, Chih-Chien Chen, Chii-Maw Uang |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | Cholon |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-152453 |
Rights | not_available, Copyright information available at source archive |