Global ETD Search

21	Query processing in Chiql: optimization and translation. January 1997 by Yip Suen-man. / Appendixes in Chinese and English. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1997. / Includes bibliographical references. / Acknowledgment --- p.1 / Abstract --- p.2 / Table of Contents --- p.3 / List of Tables --- p.5 / List of Figures --- p.6 / Chapter 1 --- Introduction --- p.7 / Chapter 1.1 --- Objectives --- p.9 / Chapter 1.2 --- Chapter Summary --- p.10 / Chapter 2 --- Related Work --- p.11 / Chapter 2.1 --- Relational Query Language --- p.11 / Chapter 2.1.1 --- Relational Algebra Vs Relational Calculus --- p.11 / Chapter 2.1.2 --- Procedural Vs Nonprocedural --- p.13 / Chapter 2.1.3 --- Natural Language (NL) Vs Restricted Natural Language (RNL) --- p.13 / Chapter 2.2 --- Existing Relational Query Language --- p.14 / Chapter 2.3 --- Chinese Related Work --- p.16 / Chapter 2.4 --- Chapter Summary --- p.17 / Chapter 3 --- Chinese Database Query Language : Chiql --- p.19 / Chapter 3.1 --- Naturalness --- p.19 / Chapter 3.2 --- Simplicity --- p.20 / Chapter 3.3 --- Procedural and Multi-statements Query Style --- p.21 / Chapter 3.4 --- Functional Completeness --- p.22 / Chapter 3.5 --- Chapter Summary --- p.25 / Chapter 4 --- Query Processing --- p.26 / Chapter 4.1 --- Query Optimization --- p.27 / Chapter 4.1.1 --- Query Representation --- p.27 / Chapter 4.1.2 --- Standardization --- p.28 / Chapter 4.1.3 --- Simplification --- p.29 / Chapter 4.1.4 --- Amelioration --- p.29 / Chapter 4.2 --- Query Translation of SQL --- p.29 / Chapter 4.3 --- Query Processing in Chiql --- p.33 / Chapter 4.3.1 --- Overview of the Query Processing --- p.33 / Chapter 4.3.2 --- Inter-Statement Dependency --- p.34 / Chapter 4.3.3 --- Translation flow of Chiql-to-SQL --- p.36 / Chapter 4.3.4 --- An Introductory Example --- p.37 / Chapter 4.4 --- Chapter Summary --- p.40 / Chapter 5 --- Statement Merging Algorithm (SMA) --- p.41 /
Chapter 5.1 --- Problems --- p.41 / Chapter 5.2 --- Definitions --- p.42 / Chapter 5.3 --- Linear Merging Algorithm (LMA) --- p.43 / Chapter 5.4 --- Tree Merging Algorithm (TMA) --- p.47 / Chapter 5.5 --- Statement Merging Algorithm (SMA) --- p.50 / Chapter 5.6 --- Improvement --- p.56 / Chapter 5.7 --- Chapter Summary --- p.57 / Chapter 6 --- Pattern Mapping Algorithm (PMA) --- p.58 / Chapter 6.1 --- Problem --- p.58 / Chapter 6.2 --- Type of Patterns --- p.61 / Chapter 6.3 --- Pre-requisite of Pattern Mapping --- p.65 / Chapter 6.4 --- Pattern Mapping Algorithm (PMA) --- p.65 / Chapter 6.5 --- An Illustration Example --- p.68 / Chapter 6.6 --- Chapter Summary --- p.72 / Chapter 7 --- Evaluation --- p.73 / Chapter 7.1 --- Testing the Correctness --- p.73 / Chapter 7.2 --- Comparison in Translation Power With Other Translator --- p.76 / Chapter 7.3 --- Chapter Summary --- p.78 / Chapter 8 --- Conclusion --- p.79 / Reference --- p.82 / Appendix --- p.86 Chinese language--Data processing Query languages (Computer science) Database searching
22	Breaking the learning barrier of Chinese Changjei input method. Wong Kun-wing, Peter, 黃冠榮. January 1998 published_or_final_version / Education / Master / Master of Education Chinese character sets (Data processing) Chinese language - Data processing.
23	A rule-based analysis system for Chinese sentences. 林碧, Lum, Bik. January 1989 published_or_final_version / Computer Science / Master / Master of Philosophy Chinese language - Data processing. Rule-based programming. Parsing (Computer grammar).
24	Towards discourse classification for Chinese, a resource-poor language. January 2014 Discourse raises issues about semantics, and especially the nature of coherence and cohesion of texts. Similar to part-of-speech tagging and syntactic parsing, discourse classification is fundamental in computational linguistics. However, it has received relatively little study. The lack of annotated corpora limits research on discourse classification for most languages other than English (e.g., Chinese). Manual annotation for discourse classification is complex, time-consuming and costly. To overcome this predicament, one alternative is to explore unsupervised learning methods. Nevertheless, previous work on English showed that unsupervised methods could only deal with coarse-grained discourse relations and suffered from low precision. Another possible way is to make use of discourse classification capabilities from other languages which have rich discourse corpora. But the problem of cross-language discourse classification is still very much open for investigation. Using Chinese as the target, this thesis presents the first study on discourse classification for a resource-poor language. Furthermore, we also annotate the first open discourse treebank for Chinese, which includes 890 news articles. / First, we propose a novel bootstrapping unsupervised method based on semantic sequential representation (SSR) for discourse classification. SSR is a new representation for discourse instances that integrates basic bag-of-words information with lexical, semantic and word sequential information. Our method starts with a small set of cue-phrase-based patterns to collect a large number of discourse instances, which are later converted to SSRs. We then propose an unsupervised SSR learner to generate, weigh and filter new SSRs without cue phrases for recognizing discourse relations.
Experimental results showed that our method outperformed the previous unsupervised method by 7% in F-score. We also show that SSRs are effective features for supervised learning methods. / The SSR-based method (F-score = 0.63) ignores the ambiguities of discourse connectives. As a result, it suffers from low recall (Recall = 0.49). To discover and eliminate these ambiguities, we further propose a cross-language framework for discourse classification. In our framework, discourse classification for Chinese is achieved in two steps: (1) Discourse connective/trigger identification and (2) Sense classification. English Penn Discourse Treebank 2 (PDTB2) and Chinese-English parallel data are coupled to provide the training data for a co-training based framework. Experimental results showed that our method achieved significant improvement compared to the SSR-based method. The proposed framework is practical and effective, especially in coping with the inter community problem, which is common in cross-language discourse classification. Moreover, the proposed framework does not integrate any language-specific features, making it theoretically applicable to other languages. / Every language has its unique characteristics, and our cross-language framework, which focuses on the common characteristics between languages, is ineffective in detecting characteristics specific to Chinese. As a result, we package the corpus we used in this research to form the Discourse Treebank for Chinese (DTBC). DTBC adopts the principles of PDTB2, and at the same time, it incorporates the linguistic characteristics of Chinese. The annotation work adds a discourse layer to 890 articles from the Penn Chinese Tree Bank 5 (CTB5). DTBC is the first ever open Chinese discourse treebank, which will be an invaluable linguistic resource for future research in Chinese discourse.
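/ Discourse raises questions about semantic understanding, particularly the cohesion and coherence of texts. Like lexical analysis and syntactic analysis, discourse classification is one of the fundamental problems of computational linguistics. Compared with other problems in the field, research on discourse classification is still at an early stage. For the vast majority of languages other than English, the lack of discourse-annotated data has greatly limited research on discourse classification. Annotating discourse data is well known to be complex and time-consuming. To overcome this difficulty, one approach is to explore unsupervised discourse classification methods. However, prior work on English shows that unsupervised methods suffer from low precision and can handle only coarse-grained discourse relations. Another approach is to transfer discourse classification techniques from a source language with abundant annotated data to other target languages. However, current cross-language discourse classification techniques are still immature. Taking Chinese as the target language, this thesis is the first to study discourse classification for Chinese when local annotated data is very limited (resource-poor). In addition, we annotate the first open discourse treebank for Chinese, containing 890 news articles. / To overcome the shortcomings of earlier unsupervised methods, we first propose a novel unsupervised method based on Semantic Sequential Representation (SSR). SSR is a new way of representing discourse instances that integrates bag-of-words, lexical, semantic and word-order information. Our method starts from a small set of patterns based on discourse connectives and collects a large number of discourse instances from raw Chinese text, which we represent as SSRs. We then propose an unsupervised method that mines, scores and filters SSRs without considering discourse connectives. Experimental results show that our method improves on the previous method by 7% in F-score. We also show that SSRs can serve as effective features for supervised discourse classification methods. / The unsupervised method based on mining SSRs (F-score = 0.63) ignores the ambiguity of discourse connectives, so its recall is low. To eliminate this ambiguity, we further propose a cross-language framework for discourse classification. In our framework, Chinese discourse classification consists of two steps: (1) discovery of discourse connectives/triggers; (2) discourse relation classification. We combine the English discourse treebank (PDTB2: Penn Discourse TreeBank 2.0) and the Chinese treebank (CTB5: Chinese TreeBank 5.0) as training data, which serves as input to a co-training framework. Experimental results show that the proposed cross-language method improves very significantly in F-score over the method that uses SSRs alone. This indicates that the proposed cross-language framework can effectively identify what discourse classification has in common across languages, using bilingual parallel data as a bridge. Notably, the proposed framework does not require specific, language-dependent features, so it can readily be extended and applied to other languages. / Every language has its own characteristics. The proposed cross-language method focuses mainly on the characteristics that languages share, so it cannot effectively capture the characteristics unique to Chinese discourse classification. We summarized and consolidated the Chinese discourse analysis data annotated in our experiments to form the Discourse TreeBank for Chinese (DTBC). DTBC inherits the construction principles of the English discourse treebank and, at the same time, is extensively localized for the characteristics unique to Chinese. Our annotation work adds a discourse information layer to all 890 news articles of the Chinese TreeBank 5.0 (CTB5). DTBC is the first open, large-scale Chinese discourse treebank corpus. It provides essential foundational annotated data for future research on Chinese discourse analysis. / Zhou, Lanjun. / Thesis (Ph.D.)--Chinese University of Hong Kong, 2014. / Includes bibliographical references (leaves 98-104). / Abstracts also in Chinese. / Title from PDF title page (viewed on 20, December, 2016). / Detailed summary in vernacular field only. Chinese language--Discourse analysis Chinese language--Data processing PL1271 .Z5494 2014eb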
25	The word segmentation & part-of-speech tagging system for the modern Chinese. / Word segmentation and part-of-speech tagging system for the modern Chinese. January 1994 Liu Hon-lung. / Title also in Chinese characters. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1994. / Includes bibliographical references (leaves [58-59]). / Chapter 1. --- Introduction --- p.1 / Chapter 2. --- "Word Segmentation and Part-of-Speech Tagging: Techniques, Current Researches and The Embraced Problems" --- p.6 / Chapter 2.1. --- Various Methods on Word Segmentation and Part-of-Speech Tagging --- p.6 / Chapter 2.2. --- Current Researches on Word Segmentation and Part-of-Speech Tagging --- p.9 / Chapter 2.3. --- Embraced Problems in Word Segmentation and Part-of-Speech Tagging --- p.9 / Chapter 3. --- Branch-and-Bound Algorithm for Combinational Optimization of the Probabilistic Scoring Function --- p.15 / Chapter 3.1. --- Definition of Word Segmentation and Part-of-Speech Tagging --- p.15 / Chapter 3.2. --- Framework --- p.17 / Chapter 3.3. --- "Weight Assignment, Intermediate Score Computation & Optimization" --- p.20 / Chapter 4. --- Implementation Issues of the Proposed Word Segmentation and Part-of-Speech Tagging System --- p.26 / Chapter 4.1. --- Design of System Dictionary and Data Structure --- p.30 / Chapter 4.2. --- Training Process --- p.33 / Chapter 4.3. --- Tagging Process --- p.35 / Chapter 4.4. --- Tagging Samples of the Word Segmentation & Part-of-Speech Tagging System --- p.39 / Chapter 5. --- Experiments on the Proposed Word Segmentation and Part-Of-Speech Tagging System --- p.41 / Chapter 5.1. --- Closed Test --- p.41 / Chapter 5.2. --- Open Test --- p.42 / Chapter 6. --- Testing and Statistics --- p.43 / Chapter 7. --- Conclusions and Discussions --- p.47 / References / Appendices / Appendix A: sysdict.tag Sample / Appendix B: econ.tag Sample / Appendix C: open.tag Sample / Appendix D: 漢語分詞及詞性標注系統 for Windows / Appendix E: Neural Network Chinese language--Data processing Chinese language--Machine translating
26	Prospects of Chinese computing in Hong Kong. January 1985 by Hui Chiu-kwong & Tsoi Po-tak. / Questionnaire in Chinese and English / Bibliography: leaf 89 / Thesis (M.B.A.)--Chinese University of Hong Kong, 1985 Computer programs--Marketing Computer software industry Chinese language--Data processing
27	Free-style phonetic input of Chinese. January 1993 by Lau Chi Ching, Donny. / Thesis (M.Sc.)--Chinese University of Hong Kong, 1993. / Includes bibliographical references (leaves [71]). / Chapter 1. --- Introduction / Chapter 1.1 --- Introduction --- p.1 / Chapter 1.2 --- Comparison of Phonetic and Written Character Input --- p.2 / Chapter 1.3 --- Significance of Phonetic Input --- p.4 / Chapter 1.4 --- Drawbacks of Current Phonetic Input Methods --- p.4 / Chapter 2. --- Objectives of the Research / Chapter 2.1 --- Main Objectives --- p.6 / Chapter 2.2 --- User Background Pre-requisite --- p.8 / Chapter 2.3 --- Roman-Spelling (Recommended Phonetic Scheme) --- p.9 / Chapter 2.4 --- User Input and the Output Scenario --- p.10 / Chapter 2.5 --- Outline of Free-Style Phonetic Input Processing --- p.15 / Chapter 3. --- Lexical Analyser / Chapter 3.1 --- Overview of Lexical Analyser --- p.17 / Chapter 3.2 --- Identification of Character Boundary --- p.19 / Chapter 3.3 --- Lexical Tree --- p.20 / Chapter 4. --- Selection Module / Chapter 4.1 --- Overview of Selection Module --- p.23 / Chapter 4.2 --- Fault-tolerance Capability --- p.24 / Chapter 4.3 --- Group Table (Groups of Similar Sounds) --- p.26 / Chapter 4.4 --- Distance Calculation Algorithm --- p.30 / Chapter 4.4.1 --- Character Dictionary --- p.31 / Chapter 4.4.2 --- Phrase Dictionary --- p.33 / Chapter 4.4.3 --- Hashing Key of the Dictionaries --- p.35 / Chapter 4.4.4 --- Maintenance of Dictionaries --- p.36 / Chapter 4.4.5 --- Distance Calculation of Character Input --- p.37 / Chapter 4.4.5.1 --- Examples of Character Output --- p.39 / Chapter 4.4.6 --- Distance Calculation of Phrase Input --- p.40 / Chapter 4.4.6.1 --- Examples of Phrase Output --- p.44 / Chapter 4.4.7 --- Explanation of Algorithm --- p.45 / Chapter 5.
--- Syntax Analyser / Chapter 5.1 --- Overview of Syntax Analyser --- p.46 / Chapter 5.2 --- Overview of a Chinese Simple Sentence --- p.47 / Chapter 5.3 --- Testing Simple Sentence Rules --- p.48 / Chapter 5.3.1 --- NDFA for Chinese Grammar Rules --- p.49 / Chapter 5.4 --- Syntax Analysis Algorithm --- p.51 / Chapter 5.4.1 --- Explanation of Algorithm --- p.52 / Chapter 5.4.2 --- Justification of Algorithm --- p.54 / Chapter 5.4.3 --- Examples of Syntax Analysis --- p.55 / Chapter 5.5 --- Parse Tree for Semantic Analysis --- p.59 / Chapter 6. --- Division of Technical Work --- p.61 / Chapter 7. --- Applied Areas of the Research / Chapter 7.1 --- Chinese User Interface with Operating System --- p.63 / Chapter 7.2 --- Bilingual Programming Language Editor --- p.64 / Chapter 7.3 --- Development of a Chinese Programming Language --- p.66 / Chapter 7.4 --- Putonghua Training --- p.67 / Chapter 8. --- Conclusions and Future Improvements / Chapter 8.1 --- Conclusions --- p.68 / Chapter 8.2 --- Future Improvements --- p.69 / References / Appendix A Input design, Computer Chinese character sets (Data processing) Chinese language--Data processing
28	An on-line handwritten Chinese input system using a "unique character mapping" algorithm. January 1987 by Chan Shing Chi, Michael. / Thesis (M.Ph.)--Chinese University of Hong Kong, 1987. / Bibliography: leaves [112]-[114] Input design, Computer Chinese character sets (Data processing) Chinese language--Data processing Online data processing
29	Chinese character processing. January 1987 by Yeung Chuen-sang. / Thesis (M.Ph.)--Chinese University of Hong Kong, 1987. / Bibliography: leaves 190-194. Chinese character sets (Data processing) Chinese language--Data processing Input design, Computer
30	A methodology for constructing compact Chinese font libraries by radical composition. January 1993 by Wai-Yip Tung. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1993. / Includes bibliographical references (leaves 55-56). / Chapter 1. --- Introduction --- p.1 / Chapter 1.1. --- Previous work --- p.2 / Chapter 1.1.1. --- A Chinese METAFONT --- p.2 / Chapter 1.1.2. --- Chinese character generator --- p.2 / Chapter 1.1.3. --- Chinese Character Design System CCDS --- p.2 / Chapter 1.2. --- Goals of the thesis --- p.3 / Chapter 1.3. --- Overview of the thesis --- p.3 / Chapter 2. --- Construction of Chinese Characters --- p.5 / Chapter 2.1 --- Introduction --- p.5 / Chapter 2.2. --- liu shu (六書) Six Principles of Chinese Character Construction --- p.5 / Chapter 2.3. --- Structural Analysis of Chinese Characters --- p.7 / Chapter 2.3.1. --- Left-Right Structure --- p.8 / Chapter 2.3.2. --- Top-Bottom Structure --- p.9 / Chapter 2.3.3. --- Inside-Outside Structure --- p.10 / Chapter 2.3.4. --- Singleton Structure --- p.10 / Chapter 2.4. --- Usage frequency of radicals --- p.11 / Chapter 2.5. --- Usage frequency of Bushou --- p.11 / Chapter 2.6. --- Usage frequency of Shengpang --- p.13 / Chapter 2.7. --- Summary --- p.15 / Chapter 3. --- Composition by Radicals --- p.17 / Chapter 3.1. --- Introduction --- p.17 / Chapter 3.2. --- Transforming radicals --- p.18 / Chapter 3.3. --- Quality of transformed radicals --- p.19 / Chapter 3.4. --- Lower level components --- p.20 / Chapter 3.5. --- Summary --- p.23 / Chapter 4. --- Automatic Hinting for Chinese Font --- p.24 / Chapter 4.1 --- Introduction --- p.24 / Chapter 4.2. --- Automatic hinting for Chinese font --- p.26 / Chapter 4.3. --- Stroke recognition --- p.30 / Chapter 4.3.1. --- Identify horizontal lines --- p.31 / Chapter 4.3.2. --- Identify stroke segments --- p.31 / Chapter 4.3.3. --- Stroke recognition --- p.32 / Chapter 4.4. --- Regularize stroke width --- p.33 / Chapter 4.5.
--- Grid-fitting horizontal and vertical strokes --- p.33 / Chapter 4.6. --- Grid-fitting radicals --- p.37 / Chapter 4.7. --- Summary --- p.39 / Chapter 5. --- RADIT - A Chinese Font Editor --- p.41 / Chapter 5.1. --- Introduction --- p.41 / Chapter 5.2. --- RADIT basics --- p.41 / Chapter 5.2.1. --- Character selection window --- p.42 / Chapter 5.2.2. --- Character window --- p.42 / Chapter 5.2.3. --- Tools Palette --- p.43 / Chapter 5.2.4. --- Toolbar --- p.43 / Chapter 5.2.5. --- Zooming the character window --- p.44 / Chapter 5.3. --- Editing a character --- p.44 / Chapter 5.3.1. --- Selecting handles --- p.44 / Chapter 5.3.2. --- Adding lines and curves --- p.45 / Chapter 5.3.3. --- Delete control points --- p.45 / Chapter 5.3.4. --- Moving control points --- p.45 / Chapter 5.3.5. --- Cut and paste --- p.46 / Chapter 5.3.6. --- Undo --- p.46 / Chapter 5.4. --- Adding radicals to a character --- p.46 / Chapter 5.5. --- Rasterizing and grid-fitting a character --- p.47 / Chapter 5.5.1. --- Rasterizing a character --- p.48 / Chapter 5.5.2. --- Stroke detection and regularization --- p.48 / Chapter 5.5.3. --- Grid-fitting and rasterizing a character --- p.49 / Chapter 6. --- Conclusions --- p.50 / Appendix A: Sample Fonts --- p.52 / References --- p.55 Chinese character sets (Data processing) Input design, Computer Chinese language--Data processing
