Return to search

Automatic identification of hot topics and user clusters from online discussion forums

With the advancement of Internet technology and the changes in the mode

of communications, it is found that much first-hand news have been discussed

in Internet forums well before they are reported in traditional mass media.

Also, this communication channel provides an effective channel for illegal activities

such as dissemination of copyrighted movies, threatening messages and

online gambling etc. The law enforcement agencies are looking for solutions to

monitor these discussion forums for possible criminal activities and download

suspected postings as evidence for investigation. The volume of postings is

huge, for 10 popular forums in Hong Kong; we found that there are 300,000

new messages every day. In this thesis, we propose an automatic system that

tackles this problem. Our proposed system downloads postings from selected

discussion forums continuously and employs data mining techniques to identify

hot topics and cluster authors into different groups using word based user

profiles. Using these data, we try to locate some useful trends and detect crime

from the data, the result is discussed afterward with include advantages and

limitations of different approaches and at the end, there is a conclusion of the

way to solve those problems and provide future direction of this research. / published_or_final_version / Computer Science / Master / Master of Philosophy

  1. 10.5353/th_b4784995
  2. b4784995
Identiferoai:union.ndltd.org:HKU/oai:hub.hku.hk:10722/174553
Date January 2011
CreatorsLai, Yiu-ming., 黎耀明.
ContributorsChow, KP, Hui, CK
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Source SetsHong Kong University Theses
LanguageEnglish
Detected LanguageEnglish
TypePG_Thesis
Sourcehttp://hub.hku.hk/bib/B47849952
RightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works., Creative Commons: Attribution 3.0 Hong Kong License
RelationHKU Theses Online (HKUTO)

Page generated in 0.002 seconds