Return to search

Large vocabulary Cantonese speech recognition using neural networks.

Tsik Chung Wai Benjamin. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1994. / Includes bibliographical references (leaves 67-70). / Chapter 1 --- Introduction --- p.1 / Chapter 1.1 --- Automatic Speech Recognition --- p.1 / Chapter 1.2 --- Cantonese Speech Recognition --- p.3 / Chapter 1.3 --- Neural Networks --- p.4 / Chapter 1.4 --- About this Thesis --- p.5 / Chapter 2 --- The Phonology of Cantonese --- p.6 / Chapter 2.1 --- The Syllabic Structure of Cantonese Syllable --- p.7 / Chapter 2.2 --- The Tone System of Cantonese --- p.9 / Chapter 3 --- Review of Automatic Speech Recognition Systems --- p.12 / Chapter 3.1 --- Hidden Markov Model Approach --- p.12 / Chapter 3.2 --- Neural Networks Approach --- p.13 / Chapter 3.2.1 --- Multi-Layer Perceptrons (MLP) --- p.13 / Chapter 3.2.2 --- Time-Delay Neural Networks (TDNN) --- p.15 / Chapter 3.2.3 --- Recurrent Neural Networks --- p.17 / Chapter 3.3 --- Integrated Approach --- p.18 / Chapter 3.4 --- Mandarin and Cantonese Speech Recognition Systems --- p.19 / Chapter 4 --- The Speech Corpus and Database --- p.21 / Chapter 4.1 --- Design of the Speech Corpus --- p.21 / Chapter 4.2 --- Speech Database Acquisition --- p.23 / Chapter 5 --- Feature Parameters Extraction --- p.24 / Chapter 5.1 --- Endpoint Detection --- p.25 / Chapter 5.2 --- Speech Processing --- p.26 / Chapter 5.3 --- Speech Segmentation --- p.27 / Chapter 5.4 --- Phoneme Feature Extraction --- p.29 / Chapter 5.5 --- Tone Feature Extraction --- p.30 / Chapter 6 --- The Design of the System --- p.33 / Chapter 6.1 --- Towards Large Vocabulary System --- p.34 / Chapter 6.2 --- Overview of the Isolated Cantonese Syllable Recognition System --- p.36 / Chapter 6.3 --- The Primary Level: Phoneme Classifiers and Tone Classifier --- p.38 / Chapter 6.4 --- The Intermediate Level: Ending Corrector --- p.42 / Chapter 6.5 --- The Secondary Level: Syllable Classifier --- p.43 / Chapter 6.5.1 --- Concatenation with Correction Approach --- p.44 / Chapter 6.5.2 --- Fuzzy ART Approach --- p.45 / Chapter 7 --- Computer Simulation --- p.49 / Chapter 7.1 --- Experimental Conditions --- p.49 / Chapter 7.2 --- Experimental Results of the Primary Level Classifiers --- p.50 / Chapter 7.3 --- Overall Performance of the System --- p.57 / Chapter 7.4 --- Discussions --- p.61 / Chapter 8 --- Further Works --- p.62 / Chapter 8.1 --- Enhancement on Speech Segmentation --- p.62 / Chapter 8.2 --- Towards Speaker-Independent System --- p.63 / Chapter 8.3 --- Towards Speech-to-Text System --- p.64 / Chapter 9 --- Conclusions --- p.65 / Bibliography --- p.67 / Appendix A. Cantonese Syllable Full Set List --- p.71

Identiferoai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_318090
Date January 1994
ContributorsTsik, Chung Wai Benjamin., Chinese University of Hong Kong Graduate School. Division of Computer Science.
PublisherChinese University of Hong Kong
Source SetsThe Chinese University of Hong Kong
LanguageEnglish
Detected LanguageEnglish
TypeText, bibliography
Formatprint, v, 85 leaves : ill. ; 30 cm.
RightsUse of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0183 seconds