Spelling suggestions: "subject:"1inear predictive cepstral coefficients"" "subject:"cinear predictive cepstral coefficients""
1 |
A Design of French Speech Recognition SystemLi, Chun-Ching 24 August 2010 (has links)
This thesis investigates the design and implementation strategies for a French speech recognition system. It utilizes the speech features of the 425 common French mono-syllables as the major training and recognition methodology. A training database is established by reading each mono-syllable 12 times in 6 rounds. Every mono-syllable is consecutively read twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4. Mel-frequency cepstrum coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the AMD Athlon xp 2800+ with clock rate 2.2GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 86% can be reached for a 3850 French phrase database. The average computation time for each phrase is about 1.5 seconds.
|
2 |
A Design of Korean Speech Recognition SystemWu, Bing-Yang 24 August 2010 (has links)
This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds.
|
3 |
A Design of Recognition Rate Improving Strategy for Japanese Speech Recognition SystemLin, Cheng-Hung 24 August 2010 (has links)
This thesis investigates the recognition rate improvement strategies for a Japanese speech recognition system. Both training data development and consonant correction scheme are studied. For training data development, a database of 995 two-syllable Japanese words is established by phonetic balanced sieving. Furthermore, feature models for the 188 common Japanese mono-syllables are derived through mixed position training scheme to increase recognition rate. For consonant correction, a sub-syllable model is developed to enhance the consonant recognition accuracy, and hence further improve the overall correct rate for the whole Japanese phrases. Experimental results indicate that the average correct rate for Japanese phrase recognition system with 34 thousand phrases can be improved from 86.91% to 92.38%.
|
Page generated in 0.119 seconds