From text to dictionary.

The aim of this study is to illustrate the state of-the art of technical tools which allow the user to build the lexicon of a Swahili text. Different kinds of statistical information can also be extracted from the text with the aid of tailor made software. The basic operation in building the lexicon of a text is lemmatization, i. e extracting the lemma from the forms contained in the text. Once the lemma list is ready it can be converted into a list of entties, to be filled according to selected criteria.

Identiferoai:union.ndltd.org:DRESDEN/oai:qucosa.de:bsz:15-qucosa-95184
Date15 October 2012
CreatorsToscana, Maddalena
ContributorsUniversity of Naples, Istituto Universitario Orientate, Universität zu Köln, Institut für Afrikanistik
PublisherUniversitätsbibliothek Leipzig
Source SetsHochschulschriftenserver (HSSS) der SLUB Dresden
LanguageEnglish
Detected LanguageEnglish
Typedoc-type:article
Formatapplication/pdf
SourceSwahili Forum; 1 (1994), S. 181-195

Page generated in 0.0018 seconds