The root is the primary lexical unit of Ontological terms, which carries the most significant aspects of semantic content and cannot be reduced into small constituents. It is the key of ontological term structure. After the identification of root, we can easily get the meaning of terms. According to the meaning, it’s helpful to identify the other parts of terms, such as the relation, definition and so on. We have generated a general classification model to identify the roots of terms in this master thesis. There are four features defined in our classification model: the Token, the POS, the Length and the Position. Implementation is followed using Java and algorithm is followed using Naïve Bayes. We implemented and evaluated the classification model using Gene Ontology (GO). The evaluation results showed that our framework and model were effective.
Identifer | oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:hj-20913 |
Date | January 2013 |
Creators | Chao, Yang, Zhang, Peng |
Publisher | Tekniska Högskolan, Högskolan i Jönköping, JTH. Forskningsmiljö Informationsteknik, Tekniska Högskolan, Högskolan i Jönköping, JTH. Forskningsmiljö Informationsteknik |
Source Sets | DiVA Archive at Upsalla University |
Language | English |
Detected Language | English |
Type | Student thesis, info:eu-repo/semantics/bachelorThesis, text |
Format | application/pdf |
Rights | info:eu-repo/semantics/openAccess |
Page generated in 0.0021 seconds