Global ETD Search

Return to search

Evaluation of word segmentation algorithms applied on handwritten text

The aim of this thesis is to build and evaluate how a word segmentation algorithm performs when extracting words from historical handwritten documents. Since historical documents often consist of background noise, the aim will also be to investigate whether applying a background removal algorithm will affect the final result or not. Three different types of historical handwritten documents are used to be able to compare the output when applying two different word segmentation algorithms. The result attained indicates that the background removal algorithm increases the accuracy obtained when using the word segmentation algorithm. The word segmentation algorithm developed successfully manages to extract a majority of the words while the obtained algorithm has difficulties for some documents. A conclusion made was that the type of document plays the key role in whether a poor result will be obtained or not. Hence, different algorithms may be needed rather than using one for all types of documents.

http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-414609

word segmentation

historical documents

handwritten documents

Engineering and Technology

Teknik och teknologier

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:uu-414609
Date	January 2020
Creators	Isaac, Andreas
Publisher	Uppsala universitet, Avdelningen för visuell information och interaktion
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess
Relation	UPTEC F, 1401-5757 ; 20015

Page generated in 0.0019 seconds

Evaluation of word segmentation algorithms applied on handwritten text

Description

Links & Downloads

Tags

Additional Fields