Global ETD Search

A Design of Korean Speech Recognition System

This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds.

http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-153543

Mel-frequency cepstral coefficients

Linear predictive cepstrum coefficients

Hidden Markov model

Identifer	oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824110-153543
Date	24 August 2010
Creators	Wu, Bing-Yang
Contributors	Xiao-Song Bo, Chih-Chien Chen, Chii-Maw Uang, Tsung Lee, Er-Hui Lu
Publisher	NSYSU
Source Sets	NSYSU Electronic Thesis and Dissertation Archive
Language	Cholon
Detected Language	English
Type	text
Format	application/pdf
Source	http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-153543
Rights	not_available, Copyright information available at source archive

Page generated in 0.0015 seconds

A Design of Korean Speech Recognition System

Description

Links & Downloads

Tags

Additional Fields