Automatic construction of English/Chinese parallel corpus.

Li Kar Wing.
Thesis (M.Phil.)--Chinese University of Hong Kong, 2001.
Includes bibliographical references (leaves 88-96).
Abstracts in English and Chinese.

ABSTRACT --- p.i
ACKNOWLEDGEMENTS --- p.v
LIST OF TABLES --- p.viii
LIST OF FIGURES --- p.ix
CHAPTERS
Chapter 1. --- INTRODUCTION --- p.1
Chapter 1.1 --- Application of corpus-based techniques --- p.2
Chapter 1.1.1 --- Machine Translation (MT) --- p.2
Chapter 1.1.1.1 --- Linguistic --- p.3
Chapter 1.1.1.2 --- Statistical --- p.4
Chapter 1.1.1.3 --- Lexicon construction --- p.4
Chapter 1.1.2 --- Cross-lingual Information Retrieval (CLIR) --- p.6
Chapter 1.1.2.1 --- Controlled vocabulary --- p.6
Chapter 1.1.2.2 --- Free text --- p.7
Chapter 1.1.2.3 --- Application corpus-based approach in CLIR --- p.9
Chapter 1.2 --- Overview of linguistic resources --- p.10
Chapter 1.3 --- Written language corpora --- p.12
Chapter 1.3.1 --- Types of corpora --- p.13
Chapter 1.3.2 --- Limitation of comparable corpora --- p.16
Chapter 1.4 --- Outline of the dissertation --- p.17
Chapter 2. --- LITERATURE REVIEW --- p.19
Chapter 2.1 --- Research in automatic corpus construction --- p.20
Chapter 2.2 --- Research in translation alignment --- p.25
Chapter 2.2.1 --- Sentence alignment --- p.27
Chapter 2.2.2 --- Word alignment --- p.28
Chapter 2.3 --- Research in alignment of sequences --- p.33
Chapter 3. --- ALIGNMENT AT WORD LEVEL AND CHARACTER LEVEL --- p.35
Chapter 3.1 --- Title alignment --- p.35
Chapter 3.1.1 --- Lexical features --- p.37
Chapter 3.1.2 --- Grammatical features --- p.40
Chapter 3.1.3 --- The English/Chinese alignment model --- p.41
Chapter 3.2 --- Alignment at word level and character level --- p.42
Chapter 3.2.1 --- Alignment at word level --- p.42
Chapter 3.2.2 --- Alignment at character level: Longest matching --- p.44
Chapter 3.2.3 --- Longest common subsequence (LCS) --- p.46
Chapter 3.2.4 --- Applying LCS in the English/Chinese alignment model --- p.48
Chapter 3.3 --- Reduce overlapping ambiguity --- p.52
Chapter 3.3.1 --- Edit distance --- p.52
Chapter 3.3.2 --- Overlapping in the algorithm model --- p.54
Chapter 4. --- ALIGNMENT AT TITLE LEVEL --- p.59
Chapter 4.1 --- Review of score functions --- p.59
Chapter 4.2 --- The Score function --- p.60
Chapter 4.2.1 --- (C matches E) and (E matches C) --- p.60
Chapter 4.2.2 --- Length similarity --- p.63
Chapter 5. --- EXPERIMENTAL RESULTS --- p.69
Chapter 5.1 --- Hong Kong government press release articles --- p.69
Chapter 5.2 --- Hang Seng Bank economic monthly reports --- p.76
Chapter 5.3 --- Hang Seng Bank press release articles --- p.78
Chapter 5.4 --- Hang Seng Bank speech articles --- p.81
Chapter 5.5 --- Quality of the collections and future work --- p.84
Chapter 6. --- CONCLUSION --- p.87
Bibliography

Identifier: oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_323403
Date: January 2001
Contributors: Li, Kar Wing., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management.
Source Sets: The Chinese University of Hong Kong
Language: English, Chinese
Detected Language: English
Type: Text, bibliography
Format: print, ix, 96 leaves : ill. ; 30 cm.
Coverage: China, Hong Kong
Rights: Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)
