Return to search

Automatic index generation for the free-text based database.

by Leung Chi Hong. / Thesis (M.Phil.)--Chinese University of Hong Kong, 1992. / Includes bibliographical references (leaves 183-184). / Chapter Chapter one: --- Introduction --- p.1 / Chapter Chapter two: --- Background knowledge and linguistic approaches of automatic indexing --- p.5 / Chapter 2.1 --- Definition of index and indexing --- p.5 / Chapter 2.2 --- Indexing methods and problems --- p.7 / Chapter 2.3 --- Automatic indexing and human indexing --- p.8 / Chapter 2.4 --- Different approaches of automatic indexing --- p.10 / Chapter 2.5 --- Example of semantic approach --- p.11 / Chapter 2.6 --- Example of syntactic approach --- p.14 / Chapter 2.7 --- Comments on semantic and syntactic approaches --- p.18 / Chapter Chapter three: --- Rationale and methodology of automatic index generation --- p.19 / Chapter 3.1 --- Problems caused by natural language --- p.19 / Chapter 3.2 --- Usage of word frequencies --- p.20 / Chapter 3.3 --- Brief description of rationale --- p.24 / Chapter 3.4 --- Automatic index generation --- p.27 / Chapter 3.4.1 --- Training phase --- p.27 / Chapter 3.4.1.1 --- Selection of training documents --- p.28 / Chapter 3.4.1.2 --- Control and standardization of variants of words --- p.28 / Chapter 3.4.1.3 --- Calculation of associations between words and indexes --- p.30 / Chapter 3.4.1.4 --- Discarding false associations --- p.33 / Chapter 3.4.2 --- Indexing phase --- p.38 / Chapter 3.4.3 --- Example of automatic indexing --- p.41 / Chapter 3.5 --- Related researches --- p.44 / Chapter 3.6 --- Word diversity and its effect on automatic indexing --- p.46 / Chapter 3.7 --- Factors affecting performance of automatic indexing --- p.60 / Chapter 3.8 --- Application of semantic representation --- p.61 / Chapter 3.8.1 --- Problem of natural language --- p.61 / Chapter 3.8.2 --- Use of concept headings --- p.62 / Chapter 3.8.3 --- Example of using concept headings in automatic indexing --- p.65 / Chapter 3.8.4 --- Advantages of concept headings --- p.68 / Chapter 3.8.5 --- Disadvantages of concept headings --- p.69 / Chapter 3.9 --- Correctness prediction for proposed indexes --- p.78 / Chapter 3.9.1 --- Example of using index proposing rate --- p.80 / Chapter 3.10 --- Effect of subject matter on automatic indexing --- p.83 / Chapter 3.11 --- Comparison with other indexing methods --- p.85 / Chapter 3.12 --- Proposal for applying Chinese medical knowledge --- p.90 / Chapter Chapter four: --- Simulations of automatic index generation --- p.93 / Chapter 4.1 --- Training phase simulations --- p.93 / Chapter 4.1.1 --- Simulation of association calculation (word diversity uncontrolled) --- p.94 / Chapter 4.1.2 --- Simulation of association calculation (word diversity controlled) --- p.102 / Chapter 4.1.3 --- Simulation of discarding false associations --- p.107 / Chapter 4.2 --- Indexing phase simulation --- p.115 / Chapter 4.3 --- Simulation of using concept headings --- p.120 / Chapter 4.4 --- Simulation for testing performance of predicting index correctness --- p.125 / Chapter 4.5 --- Summary --- p.128 / Chapter Chapter five: --- Real case study in database of Chinese Medicinal Material Research Center --- p.130 / Chapter 5.1 --- Selection of real documents --- p.130 / Chapter 5.2 --- Case study one: Overall performance using real data --- p.132 / Chapter 5.2.1 --- Sample results of automatic indexing for real documents --- p.138 / Chapter 5.3 --- Case study two: Using multi-word terms --- p.148 / Chapter 5.4 --- Case study three: Using concept headings --- p.152 / Chapter 5.5 --- Case study four: Prediction of proposed index correctness --- p.156 / Chapter 5.6 --- Case study five: Use of (Σ ΔRij) Fi to determine false association --- p.159 / Chapter 5.7 --- Case study six: Effect of word diversity --- p.162 / Chapter 5.8 --- Summary --- p.166 / Chapter Chapter six: --- Conclusion --- p.168 / Appendix A: List of stopwords --- p.173 / Appendix B: Index terms used in case studies --- p.174 / References --- p.183

Identiferoai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_318984
Date January 1992
ContributorsLeung, Chi Hong., Chinese University of Hong Kong Graduate School. Division of Computer Science.
PublisherChinese University of Hong Kong
Source SetsThe Chinese University of Hong Kong
LanguageEnglish
Detected LanguageEnglish
TypeText, bibliography
Formatprint, [5], 184 leaves : ill. ; 30 cm.
RightsUse of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0019 seconds