Kam Patgi. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. / Includes bibliographical references (leaf 103). / Abstracts in English and Chinese.
Chapter 1. --- Introduction --- p.1 / Chapter 1.1 --- Automatic Speech Recognition --- p.1 / Chapter 1.2 --- Pronunciation Modeling in ASR --- p.2 / Chapter 1.3 --- Objectives of the Thesis --- p.5 / Chapter 1.4 --- Thesis Outline --- p.5 / Reference --- p.7
Chapter 2. --- The Cantonese Dialect --- p.9 / Chapter 2.1 --- Cantonese - A Typical Chinese Dialect --- p.10 / Chapter 2.1.1 --- Cantonese Phonology --- p.11 / Chapter 2.1.2 --- Cantonese Phonetics --- p.12 / Chapter 2.2 --- Pronunciation Variation in Cantonese --- p.13 / Chapter 2.2.1 --- Phone Change and Sound Change --- p.14 / Chapter 2.2.2 --- Notation for Different Sound Units --- p.16 / Chapter 2.3 --- Summary --- p.17 / Reference --- p.18
Chapter 3. --- Large-Vocabulary Continuous Speech Recognition for Cantonese --- p.19 / Chapter 3.1 --- Feature Representation of the Speech Signal --- p.20 / Chapter 3.2 --- Probabilistic Framework of ASR --- p.20 / Chapter 3.3 --- Hidden Markov Model for Acoustic Modeling --- p.21 / Chapter 3.4 --- Pronunciation Lexicon --- p.25 / Chapter 3.5 --- Statistical Language Model --- p.25 / Chapter 3.6 --- Decoding --- p.26 / Chapter 3.7 --- The Baseline Cantonese LVCSR System --- p.26 / Chapter 3.7.1 --- System Architecture --- p.26 / Chapter 3.7.2 --- Speech Databases --- p.28 / Chapter 3.8 --- Summary --- p.29 / Reference --- p.30
Chapter 4. --- Pronunciation Model --- p.32 / Chapter 4.1 --- Pronunciation Modeling at Different Levels --- p.33 / Chapter 4.2 --- Phone-level pronunciation model and its Application --- p.35 / Chapter 4.2.1 --- IF Confusion Matrix (CM) --- p.35 / Chapter 4.2.2 --- Decision Tree Pronunciation Model (DTPM) --- p.38 / Chapter 4.2.3 --- Refinement of Confusion Matrix --- p.41 / Chapter 4.3 --- Summary --- p.43 / References --- p.44
Chapter 5. --- Pronunciation Modeling at Lexical Level --- p.45 / Chapter 5.1 --- Construction of PVD --- p.46 / Chapter 5.2 --- PVD Pruning by Word Unigram --- p.48 / Chapter 5.3 --- Recognition Experiments --- p.49 / Chapter 5.3.1 --- Experiment 1 - Pronunciation Modeling in LVCSR --- p.49 / Chapter 5.3.2 --- Experiment 2 - Pronunciation Modeling in Domain Specific task --- p.58 / Chapter 5.3.3 --- Experiment 3 - PVD Pruning by Word Unigram --- p.62 / Chapter 5.4 --- Summary --- p.63 / Reference --- p.64
Chapter 6. --- Pronunciation Modeling at Acoustic Model Level --- p.66 / Chapter 6.1 --- Hierarchy of HMM --- p.67 / Chapter 6.2 --- Sharing of Mixture Components --- p.68 / Chapter 6.3 --- Adaptation of Mixture Components --- p.70 / Chapter 6.4 --- Combination of Mixture Component Sharing and Adaptation --- p.74 / Chapter 6.5 --- Recognition Experiments --- p.78 / Chapter 6.6 --- Result Analysis --- p.80 / Chapter 6.6.1 --- Performance of Sharing Mixture Components --- p.81 / Chapter 6.6.2 --- Performance of Mixture Component Adaptation --- p.84 / Chapter 6.7 --- Summary --- p.85 / Reference --- p.87
Chapter 7. --- Pronunciation Modeling at Decoding Level --- p.88 / Chapter 7.1 --- Search Process in Cantonese LVCSR --- p.88 / Chapter 7.2 --- Model-Level Search Space Expansion --- p.90 / Chapter 7.3 --- State-Level Output Probability Modification --- p.92 / Chapter 7.4 --- Recognition Experiments --- p.93 / Chapter 7.4.1 --- Experiment 1 - Model-Level Search Space Expansion --- p.93 / Chapter 7.4.2 --- Experiment 2 - State-Level Output Probability Modification --- p.94 / Chapter 7.5 --- Summary --- p.96 / Reference --- p.97
Chapter 8. --- Conclusions and Suggestions for Future Work --- p.98 / Chapter 8.1 --- Conclusions --- p.98 / Chapter 8.2 --- Suggestions for Future Work --- p.100 / Reference --- p.103
Appendix I Base Syllable Table --- p.104 / Appendix II Cantonese Initials and Finals --- p.105 / Appendix III IF confusion matrix --- p.106 / Appendix IV Phonetic Question Set --- p.112 / Appendix V CDDT and PCDT --- p.114
Identifier | oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_324321 |
Date | January 2003 |
Contributors | Kam, Patgi., Chinese University of Hong Kong Graduate School. Division of Electronic Engineering. |
Source Sets | The Chinese University of Hong Kong |
Language | English, Chinese |
Detected Language | English |
Type | Text, bibliography |
Format | print, xii, 115 leaves : ill. ; 30 cm. |
Rights | Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) |