A Design of Speech Recognition System for Two-Word Mandarin Phrases

The objective of this thesis is to increase the recognition rate for two-word Mandarin phrases. Recognition errors arise from ambiguities in the syllables and in the tones. To address syllable ambiguity, a balanced speech training dataset is designed and the weights of the state observation probabilities for vowels and consonants are adjusted. To address tone ambiguity, both the pitch contour and the spectrum evolution property derived from the Karhunen-Loève transform are applied. The experimental results indicate that an 85% correct recognition rate can be achieved, a 6% increase over the system without the above improvements.

Identifier: oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0906107-011936
Date: 06 September 2007
Creators: Jheng, He-de
Contributors: Tsung Lee, Chii Maw Uang, Chih-Chien Chen
Publisher: NSYSU
Source Sets: NSYSU Electronic Thesis and Dissertation Archive
Language: Cholon
Detected Language: English
Type: text
Format: application/pdf
Source: http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0906107-011936
Rights: not_available, Copyright information available at source archive
