Return to search

Website summarization: a topic hierarchy based approach.

Liu Nan. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2006. / Includes bibliographical references (leaves 84-88). / Abstracts in English and Chinese. / Abstract --- p.1 / Acknowledgements --- p.3 / Contents --- p.4 / List of Figures --- p.6 / List of Tables --- p.7 / Chapter Chapter 1 --- Introduction --- p.8 / Chapter Chapter 2 --- Related Work --- p.12 / Chapter 2.1 --- Web Structure Mining --- p.12 / Chapter 2.1.1 --- HITS Algorithm --- p.13 / Chapter 2.1.2 --- PageRank Algorithm --- p.13 / Chapter 2.2 --- Website Mining --- p.14 / Chapter 2.2.1 --- Website Classification --- p.14 / Chapter 2.2.2 --- Web Unit Mining --- p.16 / Chapter 2.2.3 --- Logical Domain Extraction --- p.16 / Chapter 2.2.4 --- Web Thesaurus Construction --- p.17 / Chapter Chapter 3 --- Website Topic Hierarchy Generation --- p.19 / Chapter 3.1 --- Problem Definition --- p.19 / Chapter 3.2 --- Graph Based Algorithms --- p.21 / Chapter 3.2.1 --- Breadth First Search --- p.21 / Chapter 3.2.2 --- Shortest Path Search --- p.23 / Chapter 3.2.3 --- Minimum Directed Spanning Tree --- p.24 / Chapter 3.2.4 --- Discussion --- p.27 / Chapter 3.3 --- Edge Weight Function --- p.28 / Chapter 3.3.1 --- Relevance Method --- p.29 / Chapter 3.3.2 --- Machine Learning Method --- p.32 / Chapter 3.4 --- Experiments --- p.47 / Chapter 3.4.1 --- Data Preparation --- p.47 / Chapter 3.4.2 --- Performances of Breadth-first Search --- p.50 / Chapter 3.4.3 --- Performances of Shortest-path Search --- p.50 / Chapter 3.4.4 --- Performances of Directed Minimum Spanning Tree --- p.54 / Chapter 3.4.5 --- Comparison of Different Algorithms --- p.55 / Chapter Chapter 4 --- Website Summarization Through Keyphrase Extraction --- p.58 / Chapter 4.1 --- Introduction --- p.58 / Chapter 4.2 --- Background --- p.60 / Chapter 4.3 --- Keyphrase Extraction --- p.69 / Chapter 4.3.1 --- Candidate Phrases Idenfication --- p.69 / Chapter 4.3.2 --- Feature Calculation without Topic Hierarchy --- p.70 / Chapter 4.3.3 --- Feature Calculation with Topic Hierarchy --- p.72 / Chapter 4.3.4 --- Extraction of Keyphrases --- p.75 / Chapter 4.4 --- Experiments --- p.76 / Chapter Chapter 5 --- Conclusion and Future Work --- p.82 / References: --- p.84

Identiferoai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_325809
Date January 2006
ContributorsLiu, Nan., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management.
Source SetsThe Chinese University of Hong Kong
LanguageEnglish, Chinese
Detected LanguageEnglish
TypeText, bibliography
Formatprint, 88 leaves : ill. ; 30 cm.
RightsUse of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.002 seconds