OCR of hand-written transcriptions of hieroglyphic text

Encoding hieroglyphic texts is time-consuming. If a text already exists as a hand-written transcription, OCR offers an alternative. Off-the-shelf OCR systems, however, seem difficult to adapt to the peculiarities of Ancient Egyptian. This paper presents a proof-of-concept tool designed to digitize texts of Urkunden IV in the handwriting of Kurt Sethe. It automatically recognizes signs and produces a normalized encoding that requires little manual correction and is suitable for storage in a database or for rendering on screen or on paper.
The hieroglyphic text is encoded in RES (Revised Encoding Scheme) rather than in (common dialects of) MdC (Manuel de Codage). Earlier papers argued against MdC and in favour of RES for corpus development. Arguments in favour of RES include the longevity of the encoding, as its semantics are font-independent. The present study provides evidence that RES is also much preferable to MdC in the context of OCR: with a well-understood parsing technique, the relative positioning of scanned signs can be straightforwardly mapped to suitable primitives of the encoding.

optical character recognition

handwritten text recognition

hieroglyphs

corpora

image distortion models

ddc:930

Identifier	oai:union.ndltd.org:DRESDEN/oai:qucosa.de:bsz:15-qucosa-201704
Date	20 April 2016
Creators	Nederhof, Mark-Jan
Contributors	Universität Leipzig, Ägyptologisches Institut; Universität Leipzig, Digital Humanities
Publisher	Universitätsbibliothek Leipzig
Source Sets	Hochschulschriftenserver (HSSS) der SLUB Dresden
Language	English
Detected Language	English
Type	doc-type:conferenceObject
Format	application/pdf
Source	Altertumswissenschaften in a Digital Age : Egyptology, Papyrology and beyond ; proceedings of a conference and workshop in Leipzig, November 4-6, 2015 / edited by Monica Berti and Franziska Naether. Leipzig, 2016. Beitrag 17