Global ETD Search

Return to search

Robust methods for Chinese spoken document retrieval.

Hui Pui Yu. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. / Includes bibliographical references (leaves 158-169). / Abstracts in English and Chinese. / Abstract --- p.2 / Acknowledgements --- p.6 / Chapter 1 --- Introduction --- p.23 / Chapter 1.1 --- Spoken Document Retrieval --- p.24 / Chapter 1.2 --- The Chinese Language and Chinese Spoken Documents --- p.28 / Chapter 1.3 --- Motivation --- p.33 / Chapter 1.3.1 --- Assisting the User in Query Formation --- p.34 / Chapter 1.4 --- Goals --- p.34 / Chapter 1.5 --- Thesis Organization --- p.35 / Chapter 2 --- Multimedia Repository --- p.37 / Chapter 2.1 --- The Cantonese Corpus --- p.37 / Chapter 2.1.1 --- The RealMedia´ёØCollection --- p.39 / Chapter 2.1.2 --- The MPEG-1 Collection --- p.40 / Chapter 2.2 --- The Multimedia Markup Language --- p.42 / Chapter 2.3 --- Chapter Summary --- p.44 / Chapter 3 --- Monolingual Retrieval Task --- p.45 / Chapter 3.1 --- Properties of Cantonese Video Archive --- p.45 / Chapter 3.2 --- Automatic Speech Transcription --- p.46 / Chapter 3.2.1 --- Transcription of Cantonese Spoken Documents --- p.47 / Chapter 3.2.2 --- Indexing Units --- p.48 / Chapter 3.3 --- Known-Item Retrieval Task --- p.49 / Chapter 3.3.1 --- Evaluation ´ؤ Average Inverse Rank --- p.50 / Chapter 3.4 --- Retrieval Model --- p.51 / Chapter 3.5 --- Experimental Results --- p.52 / Chapter 3.6 --- Chapter Summary --- p.53 / Chapter 4 --- The Use of Audio and Video Information for Monolingual Spoken Document Retrieval --- p.55 / Chapter 4.1 --- Video-based Segmentation --- p.56 / Chapter 4.1.1 --- Metric Computation --- p.57 / Chapter 4.1.2 --- Shot Boundary Detection --- p.58 / Chapter 4.1.3 --- Shot Transition Detection --- p.67 / Chapter 4.2 --- Audio-based Segmentation --- p.69 / Chapter 4.2.1 --- Gaussian Mixture Models --- p.69 / Chapter 4.2.2 --- Transition Detection --- p.70 / Chapter 4.3 --- Performance Evaluation --- p.72 / Chapter 4.3.1 --- Automatic Story Segmentation --- p.72 / Chapter 4.3.2 --- Video-based Segmentation Algorithm --- p.73 / Chapter 4.3.3 --- Audio-based Segmentation Algorithm --- p.74 / Chapter 4.4 --- Fusion of Video- and Audio-based Segmentation --- p.75 / Chapter 4.5 --- Retrieval Performance --- p.76 / Chapter 4.6 --- Chapter Summary --- p.78 / Chapter 5 --- Document Expansion for Monolingual Spoken Document Retrieval --- p.79 / Chapter 5.1 --- Document Expansion using Selected Field Speech Segments --- p.81 / Chapter 5.1.1 --- Annotations from MmML --- p.81 / Chapter 5.1.2 --- Selection of Cantonese Field Speech --- p.83 / Chapter 5.1.3 --- Re-weighting Different Retrieval Units --- p.84 / Chapter 5.1.4 --- Retrieval Performance with Document Expansion using Selected Field Speech --- p.84 / Chapter 5.2 --- Document Expansion using N-best Recognition Hypotheses --- p.87 / Chapter 5.2.1 --- Re-weighting Different Retrieval Units --- p.90 / Chapter 5.2.2 --- Retrieval Performance with Document Expansion using TV-best Recognition Hypotheses --- p.90 / Chapter 5.3 --- Document Expansion using Selected Field Speech and N-best Recognition Hypotheses --- p.92 / Chapter 5.3.1 --- Re-weighting Different Retrieval Units --- p.92 / Chapter 5.3.2 --- Retrieval Performance with Different Indexed Units --- p.93 / Chapter 5.4 --- Chapter Summary --- p.94 / Chapter 6 --- Query Expansion for Cross-language Spoken Document Retrieval --- p.97 / Chapter 6.1 --- The TDT-2 Corpus --- p.99 / Chapter 6.1.1 --- English Textual Queries --- p.100 / Chapter 6.1.2 --- Mandarin Spoken Documents --- p.101 / Chapter 6.2 --- Query Processing --- p.101 / Chapter 6.2.1 --- Query Weighting --- p.101 / Chapter 6.2.2 --- Bigram Formation --- p.102 / Chapter 6.3 --- Cross-language Retrieval Task --- p.103 / Chapter 6.3.1 --- Indexing Units --- p.104 / Chapter 6.3.2 --- Retrieval Model --- p.104 / Chapter 6.3.3 --- Performance Measure --- p.105 / Chapter 6.4 --- Relevance Feedback --- p.106 / Chapter 6.4.1 --- Pseudo-Relevance Feedback --- p.107 / Chapter 6.5 --- Retrieval Performance --- p.107 / Chapter 6.6 --- Chapter Summary --- p.109 / Chapter 7 --- Conclusions and Future Work --- p.111 / Chapter 7.1 --- Future Work --- p.114 / Chapter A --- XML Schema for Multimedia Markup Language --- p.117 / Chapter B --- Example of Multimedia Markup Language --- p.128 / Chapter C --- Significance Tests --- p.135 / Chapter C.1 --- Selection of Cantonese Field Speech Segments --- p.135 / Chapter C.2 --- Fusion of Video- and Audio-based Segmentation --- p.137 / Chapter C.3 --- Document Expansion with Reporter Speech --- p.137 / Chapter C.4 --- Document Expansion with N-best Recognition Hypotheses --- p.140 / Chapter C.5 --- Document Expansion with Reporter Speech and N-best Recognition Hypotheses --- p.140 / Chapter C.6 --- Query Expansion with Pseudo Relevance Feedback --- p.142 / Chapter D --- Topic Descriptions of TDT-2 Corpus --- p.145 / Chapter E --- Speech Recognition Output from Dragon in CLSDR Task --- p.148 / Chapter F --- Parameters Estimation --- p.152 / Chapter F.1 --- "Estimating the Number of Relevant Documents, Nr" --- p.152 / Chapter F.2 --- "Estimating the Number of Terms Added from Relevant Docu- ments, Nrt , to Original Query" --- p.153 / Chapter F.3 --- "Estimating the Number of Non-relevant Documents, Nn , from the Bottom-scoring Retrieval List" --- p.153 / Chapter F.4 --- "Estimating the Number of Terms, Selected from Non-relevant Documents (Nnt), to be Removed from Original Query" --- p.154 / Chapter G --- Abbreviations --- p.155 / Bibliography --- p.158

Automatic speech recognition

Speech processing systems

Cantonese dialects--Data processing

Identifer	oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_324388
Date	January 2003
Contributors	Hui, Pui Yu., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management.
Source Sets	The Chinese University of Hong Kong
Language	English, Chinese
Detected Language	English
Type	Text, bibliography
Format	print, 169 leaves : ill. ; 30 cm.
Rights	Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0022 seconds

Robust methods for Chinese spoken document retrieval.

Description

Links & Downloads

Tags

Additional Fields