Global ETD Search

Rozpoznávání textu pomocí konvolučních sítí / Optical Character Recognition Using Convolutional Networks

This thesis aims to create new datasets for text recognition machine learning tasks and to experiment with convolutional neural networks on these datasets. It describes the architecture of convolutional networks, the difficulties of recognizing text in photographs, and contemporary works that use these networks. It then describes the creation of annotations, using Tesseract OCR, for a dataset named Mobile Page Photos, which consists of photos of document pages taken with mobile phones. Two additional datasets are derived from it by cropping characters out of its photos and formatting them in the same way as the Street View House Numbers dataset. The Mobile Nice Page Photos Characters dataset contains readable characters, and the Mobile Page Photos Characters dataset adds hardly readable and unreadable ones. Three convolutional network models are created and used for text recognition experiments on these datasets and for estimating the annotation error.

http://www.nusl.cz/ntk/nusl-255303

Identifier	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:255303
Date	January 2016
Creators	Csóka, Pavel
Contributors	Behúň, Kamil, Hradiš, Michal
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess
