• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 2
  • 1
  • 1
  • Tagged with
  • 4
  • 4
  • 4
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
1

Anagnosis

Damiani, Vincenzo 20 April 2016 (has links) (PDF)
In recent years many institutions holding papyri have put images of their collections online, while transcriptions previously published in print are now hosted in the Digital Corpus of Literary Papyri. Anagnosis aims to provide an intuitive and easy-to-use web interface between those images and related digitized texts. The main goal lies in automatic data processing and text-recognition accuracy: Through a dedicated OCR algorithm, letters on the image are identified with single boxes and thus linked to the transcription. A coordinates system of the glyphs on the image can then be transferred and applied to each new image uploaded for the same text section. Once all character boxes are generated, Anagnosis can extract a sample alphabet that users may rearrange to virtually restore lost parts of text directly on the image.
2

OCR of hand-written transcriptions of hieroglyphic text

Nederhof, Mark-Jan 20 April 2016 (has links) (PDF)
Encoding hieroglyphic texts is time-consuming. If a text already exists as hand-written transcription, there is an alternative, namely OCR. Off-the-shelf OCR systems seem difficult to adapt to the peculiarities of Ancient Egyptian. Presented is a proof-of-concept tool that was designed to digitize texts of Urkunden IV in the hand-writing of Kurt Sethe. It automatically recognizes signs and produces a normalized encoding, suitable for storage in a database, or for printing on a screen or on paper, requiring little manual correction. The encoding of hieroglyphic text is RES (Revised Encoding Scheme) rather than (common dialects of) MdC (Manuel de Codage). Earlier papers argued against MdC and in favour of RES for corpus development. Arguments in favour of RES include longevity of the encoding, as its semantics are font-independent. The present study provides evidence that RES is also much preferable to MdC in the context of OCR. With a well-understood parsing technique, relative positioning of scanned signs can be straightforwardly mapped to suitable primitives of the encoding.
3

Anagnosis: automatisierte Buchstabenverknüpfung von Transkript und Papyrusabbildung

Damiani, Vincenzo January 2016 (has links)
In recent years many institutions holding papyri have put images of their collections online, while transcriptions previously published in print are now hosted in the Digital Corpus of Literary Papyri. Anagnosis aims to provide an intuitive and easy-to-use web interface between those images and related digitized texts. The main goal lies in automatic data processing and text-recognition accuracy: Through a dedicated OCR algorithm, letters on the image are identified with single boxes and thus linked to the transcription. A coordinates system of the glyphs on the image can then be transferred and applied to each new image uploaded for the same text section. Once all character boxes are generated, Anagnosis can extract a sample alphabet that users may rearrange to virtually restore lost parts of text directly on the image.
4

OCR of hand-written transcriptions of hieroglyphic text

Nederhof, Mark-Jan January 2016 (has links)
Encoding hieroglyphic texts is time-consuming. If a text already exists as hand-written transcription, there is an alternative, namely OCR. Off-the-shelf OCR systems seem difficult to adapt to the peculiarities of Ancient Egyptian. Presented is a proof-of-concept tool that was designed to digitize texts of Urkunden IV in the hand-writing of Kurt Sethe. It automatically recognizes signs and produces a normalized encoding, suitable for storage in a database, or for printing on a screen or on paper, requiring little manual correction. The encoding of hieroglyphic text is RES (Revised Encoding Scheme) rather than (common dialects of) MdC (Manuel de Codage). Earlier papers argued against MdC and in favour of RES for corpus development. Arguments in favour of RES include longevity of the encoding, as its semantics are font-independent. The present study provides evidence that RES is also much preferable to MdC in the context of OCR. With a well-understood parsing technique, relative positioning of scanned signs can be straightforwardly mapped to suitable primitives of the encoding.

Page generated in 0.0737 seconds