As the web expands exponentially, the need to bring some order to its content becomes apparent. Hypertext categorization, that is, the automatic classification of web documents into predefined classes, emerged to relieve humans of that task. The extra information available in a hypertext document poses new challenges for automatic categorization. HTML tags and the linked neighbourhood both provide rich information for hypertext categorization that is not available in traditional text classification.
Identifier | oai:union.ndltd.org:bl.uk/oai:ethos.bl.uk:494145 |
Date | January 2008 |
Creators | Benbrahim, Houda |
Publisher | University of Portsmouth |
Source Sets | Ethos UK |
Detected Language | English |
Type | Electronic Thesis or Dissertation |