With the advancement of Internet technology and the changes in the mode
of communications, it is found that much first-hand news have been discussed
in Internet forums well before they are reported in traditional mass media.
Also, this communication channel provides an effective channel for illegal activities
such as dissemination of copyrighted movies, threatening messages and
online gambling etc. The law enforcement agencies are looking for solutions to
monitor these discussion forums for possible criminal activities and download
suspected postings as evidence for investigation. The volume of postings is
huge, for 10 popular forums in Hong Kong; we found that there are 300,000
new messages every day. In this thesis, we propose an automatic system that
tackles this problem. Our proposed system downloads postings from selected
discussion forums continuously and employs data mining techniques to identify
hot topics and cluster authors into different groups using word based user
profiles. Using these data, we try to locate some useful trends and detect crime
from the data, the result is discussed afterward with include advantages and
limitations of different approaches and at the end, there is a conclusion of the
way to solve those problems and provide future direction of this research. / published_or_final_version / Computer Science / Master / Master of Philosophy
Identifer | oai:union.ndltd.org:HKU/oai:hub.hku.hk:10722/174553 |
Date | January 2011 |
Creators | Lai, Yiu-ming., 黎耀明. |
Contributors | Chow, KP, Hui, CK |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Source Sets | Hong Kong University Theses |
Language | English |
Detected Language | English |
Type | PG_Thesis |
Source | http://hub.hku.hk/bib/B47849952 |
Rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works., Creative Commons: Attribution 3.0 Hong Kong License |
Relation | HKU Theses Online (HKUTO) |
Page generated in 0.0019 seconds