
A Design of Trilingual Speech Recognition System for Chinese, Taiwanese and Cantonese

Mandarin Chinese, Taiwanese and Cantonese all belong to the Chinese language family. According to statistics from the Summer Institute of Linguistics (USA), Chinese languages are spoken by over 1.2 billion people, the largest speaker population in the world. The regions where these three languages are spoken play an important role in the global economy. For example, Hong Kong and Taiwan both have flourishing harbors for international trade. Furthermore, Mandarin Chinese, Taiwanese and Cantonese are the most influential of the seven Chinese dialects. Mandarin Chinese was admitted as a language by the United Nations in its early years, while Cantonese was accepted in 2006. Cantonese is also spoken in many Western countries: it is the fourth language in Australia and the third language in Canada and America. From a phonetic point of view, all three are tonal languages, in which words or phrases uttered with different pitch or duration have distinct lexical meanings.
This thesis investigates design and implementation strategies for speech recognition of Mandarin Chinese, Taiwanese and Cantonese. Based on the pronunciation rules and tonal properties of each language, common mono-syllables are selected and used as the basic units for speech training and recognition. Mel-frequency cepstral coefficients and linear predictive cepstral coefficients are used as the two syllable feature models, and the hidden Markov model is used as the recognition model. On an AMD Athlon XP 2800+ personal computer running Ubuntu 9.04, correct recognition rates of 88.03%, 86.00% and 86.79% are reached using phonotactical rules for the 82,000-phrase Chinese, 5,129-phrase Taiwanese and 3,051-phrase Cantonese databases, respectively. Furthermore, a trilingual language-speech recognition system for 300 common words, composed of 100 words from each language, is developed. It obtains a 97.66% correct language-phrase recognition rate.
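
The idea of hidden Markov model recognition across several languages can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes discrete observation symbols in place of the cepstral feature vectors, and the three toy models, their probabilities and the phrase names (`phrase_a`, `phrase_b`, `phrase_c`) are hypothetical. Each model is scored with the scaled forward algorithm, and the (language, phrase) pair with the highest log-likelihood is returned.

```python
import math


def log_forward(obs, start, trans, emit):
    """Log-likelihood of a discrete observation sequence under one HMM,
    computed with the scaled forward algorithm."""
    n = len(start)
    # alpha[i] is proportional to P(o_1..o_t, state_t = i)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        log_prob += math.log(scale)
    return log_prob


def recognize(obs, models):
    """Return the (language, phrase) key whose HMM scores obs highest."""
    return max(models, key=lambda key: log_forward(obs, *models[key]))


# Hypothetical two-state left-to-right models over three symbols (0, 1, 2).
START = [1.0, 0.0]
TRANS = [[0.6, 0.4], [0.0, 1.0]]
MODELS = {
    ("Mandarin", "phrase_a"): (START, TRANS, [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1]]),
    ("Taiwanese", "phrase_b"): (START, TRANS, [[0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]),
    ("Cantonese", "phrase_c"): (START, TRANS, [[0.1, 0.1, 0.8], [0.8, 0.1, 0.1]]),
}

if __name__ == "__main__":
    language, phrase = recognize([1, 1, 2, 2], MODELS)
    print(language, phrase)
```

Because every phrase model carries its language label, a single maximum over all models identifies the language and the phrase together, which is one simple way to read the "language-phrase recognition" described above.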

Identifier: oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0910112-151648
Date: 10 September 2012
Creators: Zheng, Po-Xin
Contributors: Chii-Maw Uang, Chih-Chien Chen, Sheau-Shong Bor
Publisher: NSYSU
Source Sets: NSYSU Electronic Thesis and Dissertation Archive
Language: Cholon
Detected Language: English
Type: text
Format: application/pdf
Source: http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0910112-151648
Rights: user_define, Copyright information available at source archive
