Automating the process of dividing a map image into sections : Using Tesseract OCR and pixel traversing / Automatisering av processen att dela in en kartbild i sektioner : Med hjälp av Tesseract OCR och pixel traversering

This paper presents an algorithm for automatically dividing a simple floor plan into sections. Each section has a name, a size and a location on the image, all of which the algorithm extracts automatically as a step toward converting a simple image into an interactive map. Section labels are read with Tesseract.js, a JavaScript wrapper for Tesseract OCR, which extracts both the text and its location. Section borders are found by pixel traversal, with CIE76 used for color comparison, which yields the size and location of each section. The performance of the algorithm was measured on three different maps using correctness, quality, completeness, Jaccard index and name accuracy as metrics. The results indicate the potential of such an algorithm for automating the task of sectioning an image: across the three maps, correctness, quality, completeness, average Jaccard index and average name accuracy per map ranged from a low of 48% to a high of 100%.
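The border-finding step and the Jaccard metric mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 4-connected breadth-first traversal, the threshold value and the assumption that pixels are already given as CIE L\*a\*b\* triples are all illustrative choices. The only part taken directly from the definitions is that CIE76 is the Euclidean distance between two L\*a\*b\* colors and that the Jaccard index is intersection over union.

```python
from collections import deque
from math import sqrt


def delta_e_cie76(c1, c2):
    """CIE76 color difference: Euclidean distance between two L*a*b* colors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(c1, c2)))


def traverse_section(image, seed, threshold=2.3):
    """Collect the pixels reachable from `seed` whose color stays close to the seed color.

    `image` is a list of rows of (L, a, b) tuples (assumed already converted from RGB).
    `threshold` is a hypothetical tolerance, not a value taken from the thesis.
    """
    rows, cols = len(image), len(image[0])
    seed_color = image[seed[0]][seed[1]]
    seen = {seed}
    queue = deque([seed])
    while queue:
        r, c = queue.popleft()
        # Visit the four direct neighbors; a wall shows up as a large color difference.
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= nr < rows and 0 <= nc < cols) or (nr, nc) in seen:
                continue
            if delta_e_cie76(image[nr][nc], seed_color) <= threshold:
                seen.add((nr, nc))
                queue.append((nr, nc))
    return seen


def bounding_box(pixels):
    """Return (min_row, min_col, max_row, max_col) of a set of (row, col) pixels."""
    rs = [r for r, _ in pixels]
    cs = [c for _, c in pixels]
    return min(rs), min(cs), max(rs), max(cs)


def jaccard(a, b):
    """Jaccard index of two pixel sets: |A intersect B| / |A union B|."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Toy floor plan: two white rooms separated by a one-pixel black wall in column 2.
WHITE, BLACK = (100.0, 0.0, 0.0), (0.0, 0.0, 0.0)
image = [[WHITE, WHITE, BLACK, WHITE, WHITE] for _ in range(4)]

region = traverse_section(image, seed=(0, 0))  # seed could be an OCR label position
box = bounding_box(region)                     # location and extent of the section
score = jaccard(region, region | {(0, 2)})     # compare against a hand-made reference
```

In this toy example the traversal stops at the wall, so the bounding box describes the left room only; in practice the seed would plausibly come from the location of a recognized label.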

Identifier oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-148319
Date January 2018
Creators Skoglund, Jesper; Vikström, Lukas
Publisher Linköpings universitet, Institutionen för datavetenskap
Source Sets DiVA Archive at Upsalla University
Language English
Detected Language English
Type Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format application/pdf
Rights info:eu-repo/semantics/openAccess
