Global ETD Search

Active Learning pro zpracování archivních pramenů / Active Learning for Processing of Archive Sources

This thesis deals with the creation of a system for uploading and annotating scans of historical documents and for the subsequent active learning of optical character recognition (OCR) models from the available annotations (marked lines and their transcriptions). It describes the character recognition process, classifies its techniques, and presents an existing character recognition system, with emphasis on machine learning methods. It then explains active learning methods and proposes a method for the active learning of available OCR models from annotated scans. The rest of the thesis covers the system design, implementation, available datasets, evaluation of a custom-built OCR model, and testing of the entire system.

http://www.nusl.cz/ntk/nusl-445535

Identifier	oai:union.ndltd.org:nusl.cz/oai:invenio.nusl.cz:445535
Date	January 2021
Creators	Hříbek, David
Contributors	Zbořil, František; Rozman, Jaroslav
Publisher	Vysoké učení technické v Brně. Fakulta informačních technologií
Source Sets	Czech ETDs
Language	Czech
Detected Language	English
Type	info:eu-repo/semantics/masterThesis
Rights	info:eu-repo/semantics/restrictedAccess

