The effectiveness of keyword-based search engines depends largely on the ability of a user to formulate proper queries that are both expressive and selective. However, web search queries issued by casual users are often short and with limited expressiveness. Query recommendation is a popular technique employed by search engines to help users refine their queries. Traditional similarity-based methods, however, often result in redundant and monotonic recommendations. We identify five basic requirements of a query recommendation system, namely relevancy, redundancy-free, diversity, ranking and efficiency. In particular, we focus on the requirements of redundancy-free and diversified recommendations.
We propose the DQR framework, which mines a search log to achieve two goals:
(1) It clusters search log queries to extract query concepts, based on which recommended queries are selected. Through query construction from the query concepts, we are able to avoid recommendation redundancy. (2) It employs a probabilistic model and a greedy heuristic algorithm to achieve recommendation diversification. Through a comprehensive user study we compare DQR against five other recommendation methods on real search log datasets. Our experiment shows that DQR outperforms the other methods in terms of relevancy, diversity, and ranking performance of the recommendations. At the same time, DQR also achieves high efficiency performance. / published_or_final_version / Computer Science / Master / Master of Philosophy
Identifer | oai:union.ndltd.org:HKU/oai:hub.hku.hk:10722/181541 |
Date | January 2012 |
Creators | Li, Ruirui., 李锐瑞. |
Contributors | Kao, CM |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Source Sets | Hong Kong University Theses |
Language | English |
Detected Language | English |
Type | PG_Thesis |
Source | http://hub.hku.hk/bib/B49799757 |
Rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works., Creative Commons: Attribution 3.0 Hong Kong License |
Relation | HKU Theses Online (HKUTO) |
Page generated in 0.0365 seconds