Return to search

A Design of Recognition Rate Improving Strategy for Mandarin Speech Recognition System - A Case Study on Address Inputting System and Phrase Recognition System

This thesis investigates the recognition rate improvement strategies for a Mandarin speech recognition system. Both automatic tone recognition and consonant correction schemes are studied and applied to the Mandarin address inputting system and the Mandarin 2, 3, 4-word phrase recognition systems. For automatic tone recognition scheme, the acoustic properties of the four tones in the Mandarin training database are estimated statistically by 4 sets of parameters within 6 minutes. These automatically generated parameters can greatly increase the tone recognition accuracy, and at the same time reduce the amount of time spent in the manual tone parameter adjustment, that is about 8 hours in general. For consonant correction scheme, the sub-syllable models are developed to enhance the consonant recognition accuracy, and hence further improve the overall correct rate for the whole Mandarin phrases. Experimental results indicate that over 90% correct rate can be achieved for the Mandarin address inputting system with 180 thousand place names by applying the above two schemes. Furthermore, the recognition rates for the Mandarin 2, 3, 4-word phrase recognition systems with 116 thousand phrases in total can be improved from 77%, 94% and 97.5%, to 85%, 96% and 98% respectively.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0824109-165741
Date24 August 2009
CreatorsHsieh, Wen-kuang
ContributorsChih-Chien Chen, Chii-Maw Uang, Tsung Lee, Sheau-Shong Bor
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageCholon
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0824109-165741
Rightsnot_available, Copyright information available at source archive

Page generated in 0.0022 seconds