A Design of Recognition Rate Improving Strategy for Japanese Speech Recognition System

This thesis investigates strategies for improving the recognition rate of a Japanese speech recognition system. Two approaches are studied: training data development and a consonant correction scheme. For training data development, a database of 995 two-syllable Japanese words is built by phonetically balanced sieving, and feature models for the 188 common Japanese mono-syllables are derived through a mixed-position training scheme to increase the recognition rate. For consonant correction, a sub-syllable model is developed to improve consonant recognition accuracy and hence the overall correct rate for whole Japanese phrases. Experimental results indicate that the average correct rate of a Japanese phrase recognition system with 34 thousand phrases improves from 86.91% to 92.38%.
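To put the reported figures in perspective, the short sketch below works out the absolute gain and the relative error reduction implied by the two correct rates. It assumes that the error rate is simply 100% minus the correct rate; the function names are illustrative, and the relative figure is derived here rather than reported in the abstract.

```python
def absolute_gain(before_pct, after_pct):
    """Difference between the two correct rates, in percentage points."""
    return after_pct - before_pct


def relative_error_reduction(before_pct, after_pct):
    """Share of the original errors that were removed, in percent.

    Assumes error rate = 100 - correct rate (an assumption, not stated
    in the abstract).
    """
    error_before = 100.0 - before_pct
    error_after = 100.0 - after_pct
    return (error_before - error_after) / error_before * 100.0


# Correct rates quoted in the abstract.
before, after = 86.91, 92.38

print(f"absolute gain: {absolute_gain(before, after):.2f} percentage points")
print(f"relative error reduction: {relative_error_reduction(before, after):.1f}%")
```

Under that assumption, the gain is about 5.47 percentage points, which corresponds to removing roughly 42% of the original phrase errors.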

http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-152453

Phrase training

Sub-syllable model

Mel-frequency cepstral coefficients

Speech recognition

Linear predictive cepstrum coefficients

Hidden Markov model

Identifier	oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824110-152453
Date	24 August 2010
Creators	Lin, Cheng-Hung
Contributors	Er-Hui Lu, Tsung Lee, Xiao-Song Bo, Chih-Chien Chen, Chii-Maw Uang
Publisher	NSYSU
Source Sets	NSYSU Electronic Thesis and Dissertation Archive
Language	Cholon
Detected Language	English
Type	text
Format	application/pdf
Source	http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824110-152453
Rights	not_available, Copyright information available at source archive
