Return to search

On Travel Article Classification Based on Consumer Information Search Process Model

The information overload problem becomes imperative with the explosion of information, and people need some agents to facilitate them to filter the information to meet their personal need. In this work, we conduct a research for the article classification in the tourism domain so as to identify articles that meet users¡¦ information need. We propose an information need orientation model in tourism, which consists of four goals: Initiation, Attraction, Accommodation, and Route planning. These goals can be characterized by 13 features. Some of the identified features can be enhanced by WordNet and Named Entity Recognition techniques as supplement techniques. To test the effectiveness of using the 13 features for classification and the relevant methods, we collected 15,797 articles from TripAdvisor.com, the world's largest travel site, and randomly selected 600 articles as training data labeled by two labelers. The experimental results show that our approach generally has comparable or better performance than that of using purely lexical features, namely TF-IDF, for classification, with fewer features.

Identiferoai:union.ndltd.org:NSYSU/oai:NSYSU:etd-0727111-144058
Date27 July 2011
CreatorsHsiao, Yung-Lin
ContributorsTe-min Chang, Hsiang-Li Chiang, chenli.kuo@gmail.com, Wan-Shiou Yang, San-Yih Hwang
PublisherNSYSU
Source SetsNSYSU Electronic Thesis and Dissertation Archive
LanguageEnglish
Detected LanguageEnglish
Typetext
Formatapplication/pdf
Sourcehttp://etd.lib.nsysu.edu.tw/ETD-db/ETD-search/view_etd?URN=etd-0727111-144058
Rightswithheld, Copyright information available at source archive

Page generated in 0.002 seconds