With the rapid growth of the World Wide Web, many companies disseminate information, such as introductions to their products and services, through their commercial Web sites. A company's Web site is regarded as one of its business assets. Customers, suppliers, partners, associations, and other outsiders who seek access to these assets through the Web make up the company's external Web environment. From a strategic planning point of view, identifying a company's external environment helps create business value.
This research therefore focuses on helping a company identify its external Web environment using mining techniques. Several studies have pointed out that the hyperlink structure among Web pages can contribute to classifying the relationships within a company's external environment. We therefore propose a classifier, CNB-HI, that combines Web content mining with hyperlink structure for this purpose.
We apply the proposed approach to a real case to help identify the roles of customers, partners, media, and associations. Two experiments are conducted to examine its performance. In the first experiment, we compare CNB with other forms of Naïve Bayesian classifiers and conclude that CNB achieves better performance. However, even CNB's performance is not satisfactory when it relies exclusively on content classification. The second experiment examines the benefit of incorporating hyperlink information (CNB-HI). The results show that CNB-HI performs markedly better, which supports the feasibility of applying the proposed approach in real applications.
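
The abstract does not say how CNB-HI combines content and hyperlink evidence, so the following is only a minimal illustrative sketch of the general idea, not the thesis's method. It assumes a plain multinomial Naïve Bayes model over page text (not the CNB variant itself) and a simple smoothed vote over the labels of linked pages. The function names, the `link_weight` parameter, and the neighbor-vote rule are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """Fit a multinomial Naive Bayes model from (tokens, label) pairs."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def content_log_scores(model, tokens):
    """Return log P(label) + sum of log P(word | label), add-one smoothed."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label, n_docs in class_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)
        for tok in tokens:
            if tok in vocab:  # skip words never seen in training
                score += math.log(
                    (word_counts[label][tok] + 1) / (total_words + len(vocab))
                )
        scores[label] = score
    return scores


def classify_with_links(model, tokens, neighbor_labels, link_weight=1.0):
    """Assumed combination rule: content score plus a weighted, smoothed
    log-share of the labels found on pages linked to this one."""
    scores = content_log_scores(model, tokens)
    votes = Counter(neighbor_labels)
    n_labels = len(scores)
    for label in scores:
        share = (votes[label] + 1) / (len(neighbor_labels) + n_labels)
        scores[label] += link_weight * math.log(share)
    return max(scores, key=scores.get)
```

In this sketch, a page whose text is ambiguous between two roles can be tipped toward the role held by most of the pages it links to or is linked from. Setting `link_weight` to 0 reduces it to content-only classification, which mirrors the contrast between the two experiments described above.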
Identifier | oai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0117106-112531 |
Date | 17 January 2006 |
Creators | Chen, Hsaio |
Contributors | Wen-Feng Hsiao, Pei Chen Sun, Te-Min Chang |
Publisher | NSYSU |
Source Sets | NSYSU Electronic Thesis and Dissertation Archive |
Language | English |
Detected Language | English |
Type | text |
Format | application/pdf |
Source | http://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0117106-112531 |
Rights | restricted, Copyright information available at source archive |