Return to search

Concept Extraction With Change Detection From Navigated Information

To manage the information flood in the Internet, we usually navigate specific information using the provided search engines. Search engines are convenient but with limited functions. For example, it is impractical and impossible to browse through the entire collected information for us to gain an overall picture about what the navigated information stands for. To do so, we need an appropriate approach to automatically extracting concepts from the navigated information to assist users to easily and quickly gain the primary understanding toward a topic that interests users.
In this research, we propose an approach to extracting concepts from the navigated web information and detecting the concept changes over time. It basically includes two stages. In the first stage, information is decomposed into paragraphs and they are clustered with key terms identified through the aid of latent semantic indexing method. Concepts are represented in the form of paragraph summary and associated key terms, which allows the user to easily comprehend what they describe. The second stage is to adaptively modify the concept structure to detect concept changes. With new information added, the concepts could be merging, splitting, or even emerging with time.
Three experiments are conducted in this research to verify the proposed approach. Results of the first and second experiments show both high recall and high precision that matches the predefined concept categories. The last one is an illustrated real case application on the tsunami event. It shows that we can easily grasp different concepts of the tsunami reports and realize their changes by using our approach. The feasibility of employing our approach is thus justified.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0707105-132553
Date07 July 2005
CreatorsLin, Tzu-hsiang
ContributorsTe-min Chang, none, none
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageEnglish
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0707105-132553
Rightscampus_withheld, Copyright information available at source archive

Page generated in 0.0018 seconds