
A Design of Trilingual Speech Recognition System for Chinese, English and Vietnamese

History, culture and economy constitute the foundation of language. Mandarin Chinese, our native language, is spoken by over 1.2 billion people, more than any other language in the world. In recent years, an emerging China has not only offered a large market and labor force but has also extended the Chinese cultural sphere in Asia. British history and American politics made English the most influential language of the 20th century. Vietnam has long been under the profound influence of Chinese culture, and its economic reform and opening over the past decade have brought in substantial foreign investment, including investment from Taiwan. Our objective is to establish a trilingual system for travel, daily living and speech learning.
This thesis investigates design and implementation strategies for a trilingual speech recognition system for Chinese, English and Vietnamese. Training and recognition are based on the speech features of 404 Chinese, 925 English and 154 Vietnamese monosyllables. Mel-frequency cepstral coefficients and linear prediction cepstral coefficients serve as the two syllable feature models, and a hidden Markov model serves as the recognition model. On an AMD XP 2800+ personal computer running Ubuntu 9.04, recognition rates of 88.16%, 82.74% and 87.45% are reached using phonotactic rules on phrase databases of 82,000 Chinese, 30,795 English and 3,300 Vietnamese phrases, respectively. Recognition in each system completes within 2 seconds. Furthermore, a trilingual language-speech recognition system is developed for 300 common words, 100 from each language. It achieves a 98% correct language-phrase recognition rate with a computation time of less than 2 seconds.
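As a rough illustration of how a hidden Markov model can act as the recognition model, the sketch below scores an observation sequence against several candidate models with the forward algorithm and picks the most likely one. It is not the thesis's implementation: it assumes discrete observation symbols (for example, vector-quantized feature frames) rather than the cepstral features used in the thesis, and the function names and the two toy models are hypothetical.

```python
import math

NEG_INF = float("-inf")

def _log(p):
    """Logarithm that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0 else NEG_INF

def _logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))

def log_likelihood(obs, start, trans, emit):
    """Forward algorithm: log P(obs | model) for a discrete-observation HMM.

    start[i]    : probability of starting in state i
    trans[i][j] : probability of moving from state i to state j
    emit[i][k]  : probability that state i emits symbol k
    """
    n = len(start)
    alpha = [_log(start[i]) + _log(emit[i][obs[0]]) for i in range(n)]
    for symbol in obs[1:]:
        alpha = [
            _logsumexp([alpha[i] + _log(trans[i][j]) for i in range(n)])
            + _log(emit[j][symbol])
            for j in range(n)
        ]
    return _logsumexp(alpha)

def recognize(obs, models):
    """Return the name of the model with the highest log-likelihood."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

# Two hypothetical left-to-right models over a two-symbol alphabet {0, 1}.
LEFT_TO_RIGHT = [[0.5, 0.5], [0.0, 1.0]]
TOY_MODELS = {
    "word_a": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.9, 0.1], [0.1, 0.9]]),
    "word_b": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.9], [0.9, 0.1]]),
}

if __name__ == "__main__":
    print(recognize([0, 0, 1, 1], TOY_MODELS))
```

In a trilingual setting, one plausible arrangement is to pool the word models of all three languages in a single dictionary, so that the winning model's name identifies both the language and the phrase; whether the thesis does this is not stated in the abstract.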

Identifier: oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0910112-154149
Date: 10 September 2012
Creators: Tzeng, Yi-Ying
Contributors: Chii-Maw Uang, Chih-Chien Chen, Sheau-Shong Bor
Publisher: NSYSU
Source Sets: NSYSU Electronic Thesis and Dissertation Archive
Language: Cholon
Detected Language: English
Type: text
Format: application/pdf
Source: http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0910112-154149
Rights: user_define, Copyright information available at source archive
