Unscharfe Suche für Terme geringer Frequenz in einem großen Korpus / Fuzzy Search for Infrequent Terms in a Large Corpus

Until now, infrequent terms have been neglected in searching in order to save time
and memory. With the help of a cascaded index and the algorithms introduced here,
such trade-offs are no longer necessary.
A fast and efficient method was developed to find all terms in the
largest freely available corpus of German-language texts by exact search,
partial-word search and fuzzy search.
The process can be extended to include transliterated passages.
In addition, documents that contain the term in a modified spelling can
also be found by a fuzzy search.
Time and memory requirements are determined and fall considerably below
those of common search engines.

Identifier: oai:union.ndltd.org:uni-osnabrueck.de/oai:repositorium.ub.uni-osnabrueck.de:urn:nbn:de:gbv:700-201101107278
Date: 10 January 2011
Creators: Gerhards, Karl
Contributors: Prof. Dr. Kai-Uwe Kühnberger, PD Dr. Helmar Gust
Source Sets: Universität Osnabrück
Language: German
Detected Language: English
Type: doc-type:doctoralThesis
Format: application/pdf, application/zip
Rights: Attribution-NonCommercial-NoDerivs 3.0 Unported (Namensnennung-NichtKommerziell-KeineBearbeitung), http://creativecommons.org/licenses/by-nc-nd/3.0/
