The field of Information Retrieval recognizes the importance of stemming in improving retrieval effectiveness. Applied to searches conducted in the Arabic language, stemming increases the relevance of the documents returned and expands a search to encompass the general meaning of a word rather than only the word itself. Because Arabic relies mainly on triconsonantal roots for verb forms and derives nouns by adding affixes, words that share the same root consonants are closely related in meaning. Stemming lets a search focus on the meaning of a term and its closely related terms rather than on exact character matches. This paper discusses the strengths of light stemming, the best techniques, and the components of algorithmic affix-based stemmers used in keyword searching in the Arabic language.
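As a rough illustration of the affix-based light stemming the abstract describes, the following Python sketch strips at most one common prefix and one common suffix from a word. The affix lists, the minimum stem length, and the function name `light_stem` are illustrative assumptions and are not taken from the thesis; a production stemmer would also normalize spelling variants and diacritics first.

```python
# Minimal light-stemming sketch. The affix lists below are illustrative
# assumptions, not the lists evaluated in the thesis.
PREFIXES = ["وال", "بال", "كال", "فال", "ال", "لل", "و"]  # longer prefixes first
SUFFIXES = ["ها", "ان", "ات", "ون", "ين", "يه", "ية", "ه", "ة", "ي"]
MIN_STEM_LEN = 3  # hypothetical guard against stripping short words too far


def light_stem(word: str) -> str:
    """Remove at most one prefix and one suffix, keeping MIN_STEM_LEN letters."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM_LEN:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            word = word[:-len(suffix)]
            break
    return word


# "the book" and "her book" reduce to the same stem, so a keyword
# search for one form can match the other.
print(light_stem("الكتاب"))    # كتاب
print(light_stem("كتابها"))    # كتاب
print(light_stem("والمكتبة"))  # مكتب
```

Note that this sketch only removes affixes. It does not reduce words to their triconsonantal root, so كتاب and مكتب stay distinct stems even though both derive from the same root.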
Identifier | oai:union.ndltd.org:UNC_CH/oai:etd.ils.unc.edu:1901/572 |
Date | 17 November 2008 |
Creators | Brittany E. Rogerson |
Contributors | Ronald E. Bergquist |
Publisher | School of Information and Library Science |
Source Sets | University of North Carolina-Chapel Hill |
Language | en_US |
Detected Language | English |
Type | Electronic Theses and Dissertations |
Format | application/pdf, 331500 bytes |