
Prosody analysis and modeling for Cantonese text-to-speech.

Li Yu Jia. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. / Includes bibliographical references. / Abstracts in English and Chinese. / Chapter 1 --- Introduction --- p.1 / Chapter 1.1. --- TTS Technology --- p.1 / Chapter 1.2. --- Prosody --- p.2 / Chapter 1.2.1. --- What is Prosody --- p.2 / Chapter 1.2.2. --- Prosody from Different Perspectives --- p.3 / Chapter 1.2.3. --- Acoustical Parameters of Prosody --- p.3 / Chapter 1.2.4. --- Prosody in TTS --- p.5 / Chapter 1.2.4.1 --- Analysis --- p.5 / Chapter 1.2.4.2 --- Modeling --- p.6 / Chapter 1.2.4.3 --- Evaluation --- p.6 / Chapter 1.3. --- Thesis Objectives --- p.7 / Chapter 1.4. --- Thesis Outline --- p.7 / Reference --- p.8 / Chapter 2 --- Cantonese --- p.9 / Chapter 2.1. --- The Cantonese Dialect --- p.9 / Chapter 2.1.1. --- Phonology --- p.10 / Chapter 2.1.1.1 --- Initial --- p.11 / Chapter 2.1.1.2 --- Final --- p.12 / Chapter 2.1.1.3 --- Tone --- p.13 / Chapter 2.1.2. --- Phonological Constraints --- p.14 / Chapter 2.2. --- Tones in Cantonese --- p.15 / Chapter 2.2.1. --- Tone System --- p.15 / Chapter 2.2.2. --- Linguistic Significance --- p.18 / Chapter 2.2.3. --- Acoustical Realization --- p.18 / Chapter 2.3. --- Prosodic Variation in Continuous Cantonese Speech --- p.20 / Chapter 2.4. --- Cantonese Speech Corpus - CUProsody --- p.21 / Reference --- p.23 / Chapter 3 --- F0 Normalization --- p.25 / Chapter 3.1. --- F0 in Speech Production --- p.25 / Chapter 3.2. --- F0 Extraction --- p.27 / Chapter 3.3. --- Duration-normalized Tone Contour --- p.29 / Chapter 3.4. --- F0 Normalization --- p.30 / Chapter 3.4.1. --- Necessity and Motivation --- p.30 / Chapter 3.4.2. --- F0 Normalization --- p.33 / Chapter 3.4.2.1 --- Methodology --- p.33 / Chapter 3.4.2.2 --- Assumptions --- p.34 / Chapter 3.4.2.3 --- Estimation of Relative Tone Ratios --- p.35 / Chapter 3.4.2.4 --- Derivation of Phrase Curve --- p.37 / Chapter 3.4.2.5 --- Normalization of Absolute F0 Values --- p.39 / Chapter 3.4.3. --- Experiments and Discussion --- p.39 / Chapter 3.5. --- Conclusions --- p.44 / Reference --- p.45 / Chapter 4 --- Acoustical F0 Analysis --- p.48 / Chapter 4.1. --- Methodology of F0 Analysis --- p.48 / Chapter 4.1.1. --- Analysis-by-Synthesis --- p.48 / Chapter 4.1.2. --- Acoustical Analysis --- p.51 / Chapter 4.2. --- Acoustical F0 Analysis for Cantonese --- p.52 / Chapter 4.2.1. --- Analysis of Phrase Curves --- p.52 / Chapter 4.2.2. --- Analysis of Tone Contours --- p.55 / Chapter 4.2.2.1 --- Context-independent Single-tone Contours --- p.56 / Chapter 4.2.2.2 --- Contextual Variation --- p.58 / Chapter 4.2.2.3 --- Co-articulated Tone Contours of Disyllabic Word --- p.59 / Chapter 4.2.2.4 --- Cross-word Contours --- p.62 / Chapter 4.2.2.5 --- Phrase-initial Tone Contours --- p.65 / Chapter 4.3. --- Summary --- p.66 / Reference --- p.67 / Chapter 5 --- Prosody Modeling for Cantonese Text-to-Speech --- p.70 / Chapter 5.1. --- Parametric Model and Non-parametric Model --- p.70 / Chapter 5.2. --- Cantonese Text-to-Speech: Baseline System --- p.72 / Chapter 5.2.1. --- Sub-syllable Unit --- p.72 / Chapter 5.2.2. --- Text Analysis Module --- p.73 / Chapter 5.2.3. --- Acoustical Synthesis --- p.74 / Chapter 5.2.4. --- Prosody Module --- p.74 / Chapter 5.3. --- Enhanced Prosody Model --- p.74 / Chapter 5.3.1. --- Modeling Tone Contours --- p.75 / Chapter 5.3.1.1 --- Word-level F0 Contours --- p.76 / Chapter 5.3.1.2 --- Phrase-initial Tone Contours --- p.77 / Chapter 5.3.1.3 --- Tone Contours at Word Boundary --- p.78 / Chapter 5.3.2. --- Modeling Phrase Curves --- p.79 / Chapter 5.3.3. --- Generation of Continuous F0 Contours --- p.81 / Chapter 5.4. --- Summary --- p.81 / Reference --- p.82 / Chapter 6 --- Performance Evaluation --- p.83 / Chapter 6.1. --- Introduction to Perceptual Test --- p.83 / Chapter 6.1.1. --- Aspects of Evaluation --- p.84 / Chapter 6.1.2. --- Methods of Judgment Test --- p.84 / Chapter 6.1.3. --- Problems in Perceptual Test --- p.85 / Chapter 6.2. --- Perceptual Tests for Cantonese TTS --- p.86 / Chapter 6.2.1. --- Intelligibility Tests --- p.86 / Chapter 6.2.1.1 --- Method --- p.86 / Chapter 6.2.1.2 --- Results --- p.88 / Chapter 6.2.1.3 --- Analysis --- p.89 / Chapter 6.2.2. --- Naturalness Tests --- p.90 / Chapter 6.2.2.1 --- Word-level --- p.90 / Chapter 6.2.2.1.1 --- Method --- p.90 / Chapter 6.2.2.1.2 --- Results --- p.91 / Chapter 6.2.2.1.3 --- Analysis --- p.91 / Chapter 6.2.2.2 --- Sentence-level --- p.92 / Chapter 6.2.2.2.1 --- Method --- p.92 / Chapter 6.2.2.2.2 --- Results --- p.93 / Chapter 6.2.2.2.3 --- Analysis --- p.94 / Chapter 6.3. --- Conclusions --- p.95 / Chapter 6.4. --- Summary --- p.95 / Reference --- p.96 / Chapter 7 --- Conclusions and Future Work --- p.97 / Chapter 7.1. --- Conclusions --- p.97 / Chapter 7.2. --- Suggested Future Work --- p.99 / Appendix --- p.100 / Appendix 1 Linear Regression --- p.100 / Appendix 2 36 Templates of Cross-word Contours --- p.101 / Appendix 3 Word List for Word-level Tests --- p.102 / Appendix 4 Syllable Occurrence in Word List of Intelligibility Test --- p.108 / Appendix 5 Wrongly Identified Word List --- p.112 / Appendix 6 Confusion Matrix --- p.115 / Appendix 7 Unintelligible Word List --- p.117 / Appendix 8 Noisy Word List --- p.119 / Appendix 9 Sentence List for Naturalness Test --- p.120

Identifier oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_324229
Date January 2003
Contributors Li, Yu Jia., Chinese University of Hong Kong Graduate School. Division of Electronic Engineering.
Source Sets The Chinese University of Hong Kong
Language English, Chinese
Detected Language English
Type Text, bibliography
Format print, xi, 125 leaves : ill. ; 30 cm.
Rights Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)
