Chan Ka Yan. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2004. / Includes bibliographical references (leaves i-iv (3rd gp.)). / Abstracts in English and Chinese. / ACKNOWLEDGEMENT --- p.I / ABSTRACT --- p.II / 摘要 --- p.IV / TABLE OF CONTENT --- p.VI / LIST OF FIGURE --- p.VIII / LIST OF TABLE --- p.IX / Chapter CHAPTER 1. --- INTRODUCTION --- p.1 / Chapter 1.1 --- Background --- p.1 / Chapter 1.2 --- Importance of hyperlink analysis --- p.2 / Chapter CHAPTER 2. --- RELATED WORK --- p.4 / Chapter 2.1 --- Crawling --- p.4 / Chapter 2.1.1 --- Crawling method for HITS Algorithm --- p.4 / Chapter 2.1.2 --- Crawling method for Page Rank Algorithm --- p.7 / Chapter 2.2 --- Ranking --- p.7 / Chapter 2.2.1 --- Page Rank Algorithm --- p.8 / Chapter 2.2.2 --- HITS Algorithm --- p.11 / Chapter 2.2.3 --- PageRank-HITS Algorithm --- p.15 / Chapter 2.2.4 --- SALSA Algorithm --- p.16 / Chapter 2.2.5 --- Average and Sim --- p.18 / Chapter 2.2.6 --- Netscape Approach --- p.19 / Chapter 2.2.7 --- Cocitation Approach --- p.19 / Chapter 2.3 --- Multimedia Information Retrieval --- p.20 / Chapter 2.3.1 --- Octopus --- p.21 / Chapter CHAPTER 3. --- RESEARCH METHODOLOGY --- p.25 / Chapter 3.1 --- Research Objective --- p.25 / Chapter 3.2 --- Proposed Crawling Methodology --- p.26 / Chapter 3.2.1 --- Collecting Media Objects --- p.26 / Chapter 3.2.2 --- Filtering the collection of links --- p.29 / Chapter 3.3 --- Proposed Ranking Methodology --- p.34 / Chapter 3.3.1 --- Identifying the factors affect ranking --- p.34 / Chapter 3.3.2 --- Modified Ranking Algorithms --- p.37 / Chapter CHAPTER 4. --- EXPERIMENTAL RESULTS AND DISCUSSIONS --- p.52 / Chapter 4.1 --- Experimental Setup --- p.52 / Chapter 4.1.1 --- Assumptions for the Experiment --- p.53 / Chapter 4.2 --- Some Observations from Experiment --- p.54 / Chapter 4.2.1 --- Dangling links --- p.55 / Chapter 4.2.2 --- "Good Hub = bad Authority, Good Authority = bad Hub?" --- p.55 / Chapter 4.2.3 --- Setting of weights --- p.56 / Chapter 4.3 --- Discussion on Experimental Results --- p.57 / Chapter 4.3.1 --- Relevance --- p.57 / Chapter 4.3.2 --- Precision and recall --- p.58 / Chapter 4.3.3 --- Significance testing --- p.61 / Chapter 4.3.4 --- Ranking --- p.63 / Chapter 4.4 --- Limitations and Difficulties --- p.67 / Chapter 4.4.1 --- Small size of the base set --- p.68 / Chapter 4.4.2 --- Parameter settings --- p.68 / Chapter 4.4.3 --- Unable to remove all the meaningless links from base set --- p.68 / Chapter 4.4.4 --- Resources and time-consuming --- p.69 / Chapter 4.4.5 --- TKC Effect --- p.69 / Chapter 4.4.6 --- Continuously updated format of HTML codes and file types --- p.70 / Chapter 4.4.7 --- The object citation habit of authors --- p.70 / Chapter CHAPTER 5. --- CONCLUSION --- p.71 / Chapter 5.1 --- Contribution of our Methodology --- p.71 / Chapter 5.2 --- Possible Improvement --- p.71 / Chapter 5.3 --- Conclusion --- p.72 / BIBLIOGRAPHY --- p.I / APPENDIX --- p.A-I / Chapter A.1 --- One-tailed paired t-test results --- p.A-I / Chapter A2. --- Anova results --- p.A-IV
Identifer | oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_324722 |
Date | January 2004 |
Contributors | Chan, Ka Yan., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management. |
Source Sets | The Chinese University of Hong Kong |
Language | English, Chinese |
Detected Language | English |
Type | Text, bibliography |
Format | print, x, 73, iv, v leaves : ill. ; 30 cm. |
Rights | Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) |
Page generated in 0.002 seconds