The aim of this study is to illustrate the state of the art of technical tools which allow the user to build the lexicon of a Swahili text. Different kinds of statistical information can also be extracted from the text with the aid of tailor-made software. The basic operation in building the lexicon of a text is lemmatization, i.e. extracting the lemma from the forms contained in the text. Once the lemma list is ready, it can be converted into a list of entries, to be filled in according to selected criteria.
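The workflow described in the abstract (lemmatize the forms, then turn the lemma list into entries to be filled in) can be sketched roughly as follows. This is only an illustrative sketch, not the software discussed in the article: the lookup table `FORM_TO_LEMMA`, the functions `lemmatize` and `build_lexicon`, and the entry fields `frequency` and `gloss` are all hypothetical, and a real tool would analyse Swahili morphology rather than rely on a hand-written table.

```python
from collections import Counter

# Hypothetical form-to-lemma table (assumption for illustration only).
# A real lemmatizer would derive lemmas by morphological analysis.
FORM_TO_LEMMA = {
    "anasoma": "soma",    # "he/she is reading" -> verb stem "soma"
    "walisoma": "soma",   # "they read (past)"  -> verb stem "soma"
    "kitabu": "kitabu",   # "book"
    "vitabu": "kitabu",   # "books" -> singular "kitabu"
}


def lemmatize(form):
    """Return the lemma for a word form; unknown forms are kept as-is."""
    key = form.lower()
    return FORM_TO_LEMMA.get(key, key)


def build_lexicon(text):
    """Count lemmas in a whitespace-tokenized text and create empty entries."""
    counts = Counter(lemmatize(token) for token in text.split())
    # Each entry carries its frequency plus a blank field to be filled in later.
    return {
        lemma: {"frequency": n, "gloss": ""}
        for lemma, n in sorted(counts.items())
    }


lexicon = build_lexicon("Anasoma kitabu walisoma vitabu")
for lemma, entry in lexicon.items():
    print(lemma, entry["frequency"])
```

The per-lemma counts are one simple instance of the statistical information mentioned above; tokenization here is plain whitespace splitting, which a real text with punctuation would need to replace.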
Identifier | oai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:10564 |
Date | January 1994 |
Creators | Toscana, Maddalena |
Contributors | University of Naples, Universität zu Köln |
Source Sets | Hochschulschriftenserver (HSSS) der SLUB Dresden |
Language | English |
Detected Language | English |
Type | doc-type:article, info:eu-repo/semantics/article, doc-type:Text |
Source | Swahili Forum; 1 (1994), S. 181-195 |
Rights | info:eu-repo/semantics/openAccess |
Relation | urn:nbn:de:bsz:15-qucosa-94963, qucosa:11611 |