Global ETD Search

Return to search

HMM based connected speech recognition system for Cantonese =: 建基於隱馬爾可夫模型的粤語連續語音識別系統. / 建基於隱馬爾可夫模型的粤語連續語音識別系統 / An HMM based connected speech recognition system for Cantonese =: Jian ji yu Yin Ma'erkefu mo xing de Yue yu lian xu yu yin shi bie xi tong. / Jian ji yu Yin Ma'erkefu mo xing de Yue yu lian xu yu yin shi bie xi tong

by Chow Ka Fai. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1998. / Includes bibliographical references (leaves [124-132]). / Text in English; abstract also in Chinese. / by Chow Ka Fai. / Chapter 1 --- INTRODUCTION --- p.1 / Chapter 1.1 --- Speech Recognition Technology --- p.4 / Chapter 1.2 --- Automatic Recognition of Cantonese Speech --- p.6 / Chapter 1.3 --- Objectives of the thesis --- p.8 / Chapter 1.4 --- Thesis Outline --- p.11 / Chapter 2 --- FUNDAMENTALS OF HMM BASED RECOGNITION SYSTEM --- p.13 / Chapter 2.1 --- Introduction --- p.13 / Chapter 2.2 --- HMM Fundamentals --- p.13 / Chapter 2.2.1 --- HMM Structure and Behavior --- p.13 / Chapter 2.2.2 --- HMM-based Speech Modeling --- p.15 / Chapter 2.2.3 --- Mathematics --- p.18 / Chapter 2.3 --- hmm Based Speech Recognition System --- p.22 / Chapter 2.3.1 --- Isolated Speech Recognition --- p.23 / Chapter 2.3.2 --- Connected Speech Recognition --- p.25 / Chapter 2.4 --- Algorithms for Finding Hidden State Sequence --- p.28 / Chapter 2.4.1 --- Forward-backward algorithm --- p.29 / Chapter 2.4.2 --- Viterbi Decoder Algorithm --- p.31 / Chapter 2.5 --- Parameter Estimation --- p.32 / Chapter 2.5.1 --- Basic Ideas for Estimation --- p.32 / Chapter 2.5.2 --- Single Model Re-estimation Using Best State-Time Alignment (HINIT) --- p.36 / Chapter 2.5.3 --- Single Model Re-estimation Using Baum- Welch Method (HREST) --- p.39 / Chapter 2.5.4 --- HMM Embedded Re-estimation (HEREST) --- p.41 / Chapter 2.6 --- Feature Extraction --- p.42 / Chapter 2.7 --- Summary --- p.47 / Chapter 3 --- CANTONESE PHONOLOGY AND LANGUAGE PROPERTIES --- p.48 / Chapter 3.1 --- Introduction --- p.48 / Chapter 3.2 --- Cantonese and Chinese Language --- p.48 / Chapter 3.2.1 --- Chinese Words and Characters --- p.48 / Chapter 3.2.2 --- The Relationship between Cantonese and Chinese Characters --- p.50 / Chapter 3.3 --- Basic Syllable structure --- p.51 / Chapter 3.3.1 --- CVC structure --- p.51 / Chapter 3.3.2 --- Cantonese Phonemes --- p.52 / Chapter 3.3.3 --- The Initial-Final structure --- p.55 / Chapter 3.3.4 --- Cantonese Nine Tone System --- p.57 / Chapter 3.4 --- Acoustic Properties of Cantonese --- p.58 / Chapter 3.5 --- Cantonese Phonology for Speech Recognition --- p.60 / Chapter 3.6 --- Summary --- p.62 / Chapter 4 --- CANTONESE SPEECH DATABASES --- p.64 / Chapter 4.1 --- Introduction --- p.64 / Chapter 4.2 --- The Importance of Speech Data --- p.64 / Chapter 4.3 --- The Demands of Cantonese Speech Databases --- p.67 / Chapter 4.4 --- Principles in Cantonese Database Development --- p.67 / Chapter 4.5 --- Resources and Limitations for Database Designs --- p.69 / Chapter 4.6 --- Details of Speech Databases --- p.69 / Chapter 4.6.1 --- Multiple speakers' Speech Database (CUWORD) --- p.70 / Chapter 4.6.2 --- Single Speaker's Speech Database (MYVOICE) --- p.72 / Chapter 4.7 --- Difficulties and Solutions in Recording Process --- p.76 / Chapter 4.8 --- Verification of Phonetic Transcription --- p.78 / Chapter 4.9 --- Summary --- p.79 / Chapter 5 --- TRAINING OF AN HMM BASED CANTONESE SPEECH RECOGNITION SYSTEM --- p.80 / Chapter 5.1 --- Introduction --- p.80 / Chapter 5.2 --- Objectives of HMM Development --- p.81 / Chapter 5.3 --- The Design of Initial-Final Models --- p.83 / Chapter 5.4 --- Initialization of Basic Initial-Final Models --- p.84 / Chapter 5.4.1 --- The Initialization Training with HEREST --- p.85 / Chapter 5.4.2 --- Refinement of Initialized Models --- p.88 / Chapter 5.4.3 --- Evaluation of the Models --- p.90 / Chapter 5.5 --- Training of Connected Speech Speaker Dependent Models --- p.93 / Chapter 5.5.1 --- Training Strategy --- p.93 / Chapter 5.5.2 --- Preliminary Result --- p.94 / Chapter 5.6 --- Design and Training of Context Dependent Initial Final Models --- p.95 / Chapter 5.6.1 --- Intra-syllable Context Dependent Units --- p.96 / Chapter 5.6.2 --- The Inter-syllable Context Dependent Units --- p.97 / Chapter 5.6.3 --- Model Refinement by Using Mixture Incrementing --- p.98 / Chapter 5.7 --- Training of Speaker Independent Models --- p.99 / Chapter 5.8 --- Discussions --- p.100 / Chapter 5.9 --- Summary --- p.101 / Chapter 6 --- PERFORMANCE ANALYSIS --- p.102 / Chapter 6.1 --- Substitution Errors --- p.102 / Chapter 6.1.1 --- Confusion of Long Vowels and Short Vowels for Initial Stop Consonants --- p.102 / Chapter 6.1.2 --- Confusion of Nasal Endings --- p.103 / Chapter 6.1.3 --- Confusion of Final Stop Consonants --- p.104 / Chapter 6.2 --- Insertion Errors and Deletion Errors --- p.105 / Chapter 6.3 --- Accuracy of Individual Models --- p.106 / Chapter 6.4 --- The Impact of Individual Models --- p.107 / Chapter 6.4.1 --- The Expected Error Rate of Initial Models --- p.110 / Chapter 6.4.2 --- The Expected Error Rate of Final Models --- p.111 / Chapter 6.5 --- Suggested Solutions for Error Reduction --- p.113 / Chapter 6.5.1 --- Duration Constraints --- p.113 / Chapter 6.5.2 --- The Use of Language Model --- p.113 / Chapter 6.6 --- Summary --- p.114 / Chapter 7 --- APPLICATIONS EXAMPLES OF THE HMM RECOGNITION SYSTEM --- p.115 / Chapter 7.1 --- Introduction --- p.115 / Chapter 7.2 --- Application 1: A Hong Kong Stock Market Inquiry System --- p.116 / Chapter 7.3 --- Application 2： A Navigating System for Hong Kong Street Map --- p.117 / Chapter 7.4 --- Automatic Character-to-Phonetic Conversion --- p.118 / Chapter 7.5 --- Summary --- p.119 / Chapter 8 --- CONCLUSIONS AND SUGGESTIONS FOR FURTHER WORK --- p.120 / Chapter 8.1 --- Conclusions --- p.120 / Chapter 8.2 --- Suggestions for Future Work --- p.122 / Chapter 8.2.1 --- Development of Continuous Speech Recognition System --- p.122 / Chapter 8.2.2 --- Implementation of Statistical Language Models --- p.122 / Chapter 8.2.3 --- Tones for Continuous Speech --- p.123 / BIBILOGRAPHY / APPENDIX

Automatic speech recognition

Markov processes

Cantonese dialects--Data processing

Identifer	oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_322348
Date	January 1998
Contributors	Chow, Ka Fai., Chinese University of Hong Kong Graduate School. Division of Electronic Engineering.
Source Sets	The Chinese University of Hong Kong
Language	English, Chinese
Detected Language	English
Type	Text, bibliography
Format	print, 123, [19] leaves : ill. ; 30 cm.
Rights	Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0061 seconds

Description

Links & Downloads

Tags

Additional Fields