Liu Nan. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2006. / Includes bibliographical references (leaves 84-88). / Abstracts in English and Chinese.
Abstract --- p.1 / Acknowledgements --- p.3 / Contents --- p.4 / List of Figures --- p.6 / List of Tables --- p.7
Chapter 1 --- Introduction --- p.8
Chapter 2 --- Related Work --- p.12 / Chapter 2.1 --- Web Structure Mining --- p.12 / Chapter 2.1.1 --- HITS Algorithm --- p.13 / Chapter 2.1.2 --- PageRank Algorithm --- p.13 / Chapter 2.2 --- Website Mining --- p.14 / Chapter 2.2.1 --- Website Classification --- p.14 / Chapter 2.2.2 --- Web Unit Mining --- p.16 / Chapter 2.2.3 --- Logical Domain Extraction --- p.16 / Chapter 2.2.4 --- Web Thesaurus Construction --- p.17
Chapter 3 --- Website Topic Hierarchy Generation --- p.19 / Chapter 3.1 --- Problem Definition --- p.19 / Chapter 3.2 --- Graph Based Algorithms --- p.21 / Chapter 3.2.1 --- Breadth First Search --- p.21 / Chapter 3.2.2 --- Shortest Path Search --- p.23 / Chapter 3.2.3 --- Minimum Directed Spanning Tree --- p.24 / Chapter 3.2.4 --- Discussion --- p.27 / Chapter 3.3 --- Edge Weight Function --- p.28 / Chapter 3.3.1 --- Relevance Method --- p.29 / Chapter 3.3.2 --- Machine Learning Method --- p.32 / Chapter 3.4 --- Experiments --- p.47 / Chapter 3.4.1 --- Data Preparation --- p.47 / Chapter 3.4.2 --- Performances of Breadth-first Search --- p.50 / Chapter 3.4.3 --- Performances of Shortest-path Search --- p.50 / Chapter 3.4.4 --- Performances of Directed Minimum Spanning Tree --- p.54 / Chapter 3.4.5 --- Comparison of Different Algorithms --- p.55
Chapter 4 --- Website Summarization Through Keyphrase Extraction --- p.58 / Chapter 4.1 --- Introduction --- p.58 / Chapter 4.2 --- Background --- p.60 / Chapter 4.3 --- Keyphrase Extraction --- p.69 / Chapter 4.3.1 --- Candidate Phrases Identification --- p.69 / Chapter 4.3.2 --- Feature Calculation without Topic Hierarchy --- p.70 / Chapter 4.3.3 --- Feature Calculation with Topic Hierarchy --- p.72 / Chapter 4.3.4 --- Extraction of Keyphrases --- p.75 / Chapter 4.4 --- Experiments --- p.76
Chapter 5 --- Conclusion and Future Work --- p.82
References --- p.84
Identifier | oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_325809 |
Date | January 2006 |
Contributors | Liu, Nan., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management. |
Source Sets | The Chinese University of Hong Kong |
Language | English, Chinese |
Detected Language | English |
Type | Text, bibliography |
Format | print, 88 leaves : ill. ; 30 cm. |
Rights | Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) |