There are two research topics in this thesis. First, we implement a
highly efficient Chinese input method. Second, we apply a
divide-and-conquer scheme to the speaker diarization problem.
The implemented Chinese input method transforms an input first-symbol
sequence into a character string (a sentence). This means that a user
only needs to input a first Mandarin phonetic symbol per character,
which is very efficient compared to the current methods.
The implementation is based on a dynamic programming scheme
and language models. To reduce time complexity, the vocabulary for the
language model consists of 1-, 2-, and 3-character words only.
The speaker diarization system consists of segmentation and clustering
modules. The divide-and-conquer scheme is essentially implemented in
the clustering module. We evaluate the performance of our system using
the speaker diarization score defined in the 2003 Rich Transcription
Evaluation Plan. Compared to the baseline, our method significantly
reduces the processing time without compromising diarization accuracy.
Identifer | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0909108-140401 |
Date | 09 September 2008 |
Creators | Tseng, Chun-han |
Contributors | Chang-Biau Yang, Chia-Ping Chen, Hsin-Min Wang, Chung-Nan Lee |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | English |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0909108-140401 |
Rights | off_campus_withheld, Copyright information available at source archive |
Page generated in 0.0019 seconds