Global ETD Search

71	Evaluating Methods for Optical Character Recognition on a Mobile Platform : comparing standard computer vision techniques with deep learning in the context of scanning prescription medicine labels Bisiach, Jonathon, Zabkar, Matej January 2020 (has links) Deep learning has become ubiquitous as part of Optical Character Recognition (OCR), but there are few examples of research into whether the two technologies are feasible for deployment on a mobile platform. This study examines which particular method of OCR would be best suited for a mobile platform in the specific context of a prescription medication label scanner. A case study using three different methods of OCR – classic computer vision techniques, standard deep learning and specialised deep learning – tested against 100 prescription medicine label images shows that the method that provides the best combination of accuracy, speed and resource using has proven to be standard seep learning, or Tesseract 4.1.1 in this particular case. Tesseract 4.1.1 tested with 76% accuracy with a further 10% of results being one character away from being accurate. Additionally, 9% of images were processed in less than one second and 41% were processed in less than 10 seconds. Tesseract 4.1.1 also had very reasonable resource costs, comparable to methods that did not utilise deep learning. Optical character recognition deep learning Tesseract EAST testing performance Android Computer Sciences Datavetenskap (datalogi)
72	Form data enriching using a post OCR clustering process : Measuring accuracy of field names and field values clustering Aboulkacim, Adil January 2022 (has links) Med OCR teknologier kan innehållet av ett formulär läsas in, positionen av varje ord och dess innehåll kan extraheras, dock kan relationen mellan orden ej förstås. Denna rapport siktar på att lösa problemet med att berika data från ett strukturerat formulär utan någon förinställd konfiguration genom användandet utav klustring. Detta görs med en kvantitativ metod där mätning av en utvecklad prototyp som räknar antal korrekt klustrade textrutor och en kvalitativ utvärdering. Prototypen fungerar genom att mata en bild av ett ofyllt formulär och en annan bild av ett ifyllt formulär och en annan bild av ett ifyllt formulär som innehåller informationen som ska berikas till en OCR-motor. Utdatan från OCR-motorn körs genom ett efterbearbetningssteg som tillsammans med en modifierad euklidisk algoritm och en oskarp strängsökningsalgoritm kan klustra fältnamn och fältvärden i den ifyllda formulärbilden. Resultatet av prototypen för tre olika formulärstrukturer och 15 olika bilder vardera gav en träffsäkerhet från 100% till 92% beroende på formulärstruktur. Denna rapport kunde visa möjligheten att grupper ihop fältnamn och fältvärden i ett formulera, med andra ord utvinna information från formuläret / With OCR technologies the text in a form can be read, the position of each word and its contents can be extracted, however the relation between the words cannot be understood. This thesis aims to solve the problem of enriching data from a structured form without any pre-set configuration using clustering. This is done using the method of a quantitative measurement of a developed prototype counting correctly clustered text boxes and a qualitative evaluation. The prototype works by feeding an image of an unfilled form and another image of a filled form which contains the data to be enriched to an OCR engine. The OCR engine extracts the text and its positions which is then run through a post-processing step which together with a modified Euclidean and fuzzy string search algorithm, both together is able to cluster field names and field values in the filled in form image. The result of the prototype for three different form structures and 15 different images for each structure ranges from 100% to 92% accuracy depending on form structure. This thesis successfully was able to show the possibility of clustering together names and values in a form i.e., enriching data from the form. Optical Character Recognition Form Processing Data enrichment Optisk teckenläsning Formulärbearbetning Databerikning Software Engineering Programvaruteknik
73	Detection and Recognition of U.S. Speed Signs from Grayscale Images for Intelligent Vehicles Kanaparthi, Pradeep Kumar January 2012 (has links) No description available. Engineering Information Technology Technology Speed sign connected component labeling regions optical character recognition neural network.
74	Underwater Document Recognition Shah, Jaimin Nitesh 18 May 2021 (has links) No description available. Computer Science image denoising image quality assessment - IQA optical character recognition - OCR
75	The Convolutional Recurrent Structure in Computer Vision Applications Xie, Dong 12 1900 (has links) By organically fusing the methods of convolutional neural network (CNN) and recurrent neural network (RNN), this dissertation focuses on the application of optical character recognition and image classification processing. The first part of this dissertation presents an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR). The main contributions of this research part are divided into three parts. First, this research develops a preprocessing method for receipt images. Second, the modified connectionist text proposal network is introduced to execute text detection. Third, the CEIR combines the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The CEIR system is validated with the scanned receipts optical character recognition and information extraction (SROIE) database. Furthermore, the CEIR system has strong robustness and can be extended to a variety of different scenarios beyond receipts. For the convolutional recurrent structure application of land use image classification, this dissertation comes up with a novel deep learning model for land use classification, the convolutional recurrent land use classifier (CRLUC), which further improves the accuracy in classifying remote sensing land use images. Besides, the convolutional fully-connected neural networks with hard sample memory pool structure (CFMP) is invented to tackle the remote sensing land use image classification tasks. The CRLUC and CFMP algorithm performances are tested in popular datasets. Experimental studies show the proposed algorithms can classify images with higher accuracy and fewer training episodes compared to popular image classification algorithms. Computer Vision Convolutional Neural Network Recurrent Neural Network Optical Character Recognition Land Use Classification
76	Ocr: A Statistical Model Of Multi-engine Ocr Systems McDonald, Mercedes Terre 01 January 2004 (has links) This thesis is a benchmark performed on three commercial Optical Character Recognition (OCR) engines. The purpose of this benchmark is to characterize the performance of the OCR engines with emphasis on the correlation of errors between each engine. The benchmarks are performed for the evaluation of the effect of a multi-OCR system employing a voting scheme to increase overall recognition accuracy. This is desirable since currently OCR systems are still unable to recognize characters with 100% accuracy. The existing error rates of OCR engines pose a major problem for applications where a single error can possibly effect significant outcomes, such as in legal applications. The results obtained from this benchmark are the primary determining factor in the decision of implementing a voting scheme. The experiment performed displayed a very high accuracy rate for each of these commercial OCR engines. The average accuracy rate found for each engine was near 99.5% based on a less than 6,000 word document. While these error rates are very low, the goal is 100% accuracy in legal applications. Based on the work in this thesis, it has been determined that a simple voting scheme will help to improve the accuracy rate. Character recognition accuracy Machine readability Optical character recognition (OCR) Voting scheme Electrical and Computer Engineering Engineering
77	The CAR (Confront, Address, Replace) Strategy: An Antiracist Engineering Pedagogy Asfaw, Amman Fasil 01 June 2021 (has links) (PDF) The CAR (confront, address, replace) Strategy is an antiracist pedagogy aiming to drive out exclusionary terminology in engineering education. “Master-slave” terminology is still commonplace in engineering education and industry. However, questions have been raised about the negative impacts of such language. Usage of exclusionary terminology such as “master-slave” in academia can make students—especially those who identify as women and/or Black/African-American—feel uncomfortable, potentially evoking Stereotype Threat (Danowitz, 2020) and/or Curriculum Trauma (Buul, 2020). Indeed, prior research shows that students from a number of backgrounds find non-inclusive terminologies such as “master-slave” to be a major problem (Danowitz, 2020). Currently, women-identifying and gender nonbinary students are underrepresented in the engineering industry (ASEE, 2020) while Black/African-American students are underrepresented in the entire higher education system, including engineering fields (NSF, 2019). The CAR Strategy, introduced here, stands for: 1) confront; 2) address; 3) replace and aims to provide a framework for driving out iniquitous terminologies in engineering education such as “master-slave.” The first step is to confront the historical significance of the terminology in question. The second step is to address the technical inaccuracies of the legacy terminology. Lastly, replace the problematic terminology with an optional but recommended replacement. This thesis reports on student perceptions and the effectiveness of The CAR Strategy piloted as a teaching framework in the computer engineering department of Cal Poly. Of 64 students surveyed: 70% either agree or strongly agree that The CAR Strategy is an effective framework for driving out exclusionary terminologies. Amman Asfaw first presented certain portions of this thesis at the virtual 2021 American Society for Engineering Education (ASEE) Annual Conference and Exposition. The original publication’s copyright is held by ASEE (Asfaw, 2021); secondary authors included Storm Randolph, Victoria Siaumau, Yumi Aguilar, Emily Flores, Dr. Jane Lehr, and Dr. Andrew Danowitz. Engineering Education Reform Antiracist Pedagogy Master-Slave Female-Male Optical Character Recognition Engineering Education
78	A New Approach to Synthetic Image Evaluation Memari, Majid 01 December 2023 (has links) (PDF) This study is dedicated to enhancing the effectiveness of Optical Character Recognition (OCR) systems, with a special emphasis on Arabic handwritten digit recognition. The choice to focus on Arabic handwritten digits is twofold: first, there has been relatively less research conducted in this area compared to its English counterparts; second, the recognition of Arabic handwritten digits presents more challenges due to the inherent similarities between different Arabic digits.OCR systems, engineered to decipher both printed and handwritten text, often face difficulties in accurately identifying low-quality or distorted handwritten text. The quality of the input image and the complexity of the text significantly influence their performance. However, data augmentation strategies can notably improve these systems' performance. These strategies generate new images that closely resemble the original ones, albeit with minor variations, thereby enriching the model's learning and enhancing its adaptability. The research found Conditional Variational Autoencoders (C-VAE) and Conditional Generative Adversarial Networks (C-GAN) to be particularly effective in this context. These two generative models stand out due to their superior image generation and feature extraction capabilities. A significant contribution of the study has been the formulation of the Synthetic Image Evaluation Procedure, a systematic approach designed to evaluate and amplify the generative models' image generation abilities. This procedure facilitates the extraction of meaningful features, computation of the Fréchet Inception Distance (LFID) score, and supports hyper-parameter optimization and model modifications. Data Augmentation Generative Adversarial Networks Generative Models Optical Character Recognition Synthetic Image Evaluation Variational Autoencoders
79	Mathematical Expression Detection and Segmentation in Document Images Bruce, Jacob Robert 19 March 2014 (has links) Various document layout analysis techniques are employed in order to enhance the accuracy of optical character recognition (OCR) in document images. Type-specific document layout analysis involves localizing and segmenting specific zones in an image so that they may be recognized by specialized OCR modules. Zones of interest include titles, headers/footers, paragraphs, images, mathematical expressions, chemical equations, musical notations, tables, circuit diagrams, among others. False positive/negative detections, oversegmentations, and undersegmentations made during the detection and segmentation stage will confuse a specialized OCR system and thus may result in garbled, incoherent output. In this work a mathematical expression detection and segmentation (MEDS) module is implemented and then thoroughly evaluated. The module is fully integrated with the open source OCR software, Tesseract, and is designed to function as a component of it. Evaluation is carried out on freely available public domain images so that future and existing techniques may be objectively compared. / Master of Science document layout analysis optical character recognition document image type-specific layout analysis
80	Gerçek zamanlı taşıt plaka tanıma sistemi / Boztoprak, Halime. Merdan, Mustafa. January 2007 (has links) (PDF) Tez (Yüksek Lisans) - Süleyman Demirel Üniversitesi, Fen Bilimleri Enstitüsü, Elektronik ve Haberleşme Mühendisliği Anabilim Dalı, 2007. / Kaynakça var.

Search results