• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 1
  • Tagged with
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • 1
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

An Evolution-based Approach to Support Effective Document-Category Management

Lee, Yen-Hsien 10 August 2005 (has links)
Observations of textual document management by individuals and organizations have suggested the popularity of using categories to organize, archive and access documents. The adequacy of an existing category understandably may diminish as it includes influxes of new documents over time or retains only a part of existing documents, bringing about significant changes to its content. Thus, the existing document categories have to be evolved over time as new documents are acquired. Following an evolution-based approach for document-category management, this dissertation extends Category Evolution (CE) technique by addressing its inherent limitations. The proposed technique (namely, CE2) automatically re-organizes document categories while taking into account those previously established. Furthermore, we propose the Ontology-based Category Evolution technique (namely, ONCE) to overcome the problems of word mismatch and ambiguity encountered by the lexicon-based category evolution approach (e.g., CE and CE2). Facilitated by a domain ontology, ONCE can evolve document categories on the conceptual rather the lexical level. Finally, this dissertation further considers the evolution of category hierarchy and proposes Category Hierarchy Evolution technique (CHE) and Ontology-based Category Hierarchy Evolution technique (OCHE) to evolve from an existing category hierarchy. We empirically evaluate the effectiveness of our proposed CE2, ONCE, CHE, and OCHE in different category evolution scenarios, respectively. Our analysis results show CE2 to be more effective than CE and the category discovery approach (specifically, HAC). The ontology-based category evolution approach, ONCE, shows its advantage over CE2 which represents the lexicon-based approach. Finally, the effectiveness attained by CHE and OCHE are satisfactory; and similarly, the ontology-based approach, OCHE, also outperforms the lexicon-based one. This dissertation has contributed to the text mining, document management, and ontology learning research and practice.

Page generated in 0.079 seconds