Global ETD Search

Return to search

Text mining of online book reviews for non-trivial clustering of books and users

Indiana University-Purdue University Indianapolis (IUPUI) / The classification of consumable media by mining relevant text for their identifying features is a subjective process. Previous attempts to perform this type of feature mining have generally been limited in scope due having limited access to user data. Many of these studies used human domain knowledge to evaluate the accuracy of features extracted using these methods. In this thesis, we mine book review text to identify nontrivial features of a set of similar books. We make comparisons between books by looking for books that share characteristics, ultimately performing clustering on the books in our data set. We use the same mining process to identify a corresponding set of characteristics in users. Finally, we evaluate the quality of our methods by examining the correlation between our similarity metric, and user ratings.

Information behavior -- Research

Cluster analysis -- Data processing

System analysis -- Data processing

Information retrieval -- Book reviews

Identifer	oai:union.ndltd.org:IUPUI/oai:scholarworks.iupui.edu:1805/3421
Date	14 August 2013
Creators	Lin, Eric
Contributors	Fang, Shiaofen, Mukhopadhyay, Snehasis, Du, Yingzi, 1975-
Source Sets	Indiana University-Purdue University Indianapolis
Language	en_US
Detected Language	English

Page generated in 0.0018 seconds

Text mining of online book reviews for non-trivial clustering of books and users

Description

Links & Downloads

Tags

Additional Fields