A Design of Korean Speech Recognition System

This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824110-153543
Date24 August 2010
CreatorsWu, Bing-Yang
ContributorsXiao-Song Bo, Chih-Chien Chen, Chii-Maw Uang, Tsung Lee, Er-Hui Lu
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageCholon
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-153543
Rightsnot_available, Copyright information available at source archive

Page generated in 0.0015 seconds