Spelling suggestions: "subject:"text arecognition"" "subject:"text 2recognition""
1 |
The contribution of semantics to automatic text processingJobbins, Amanda Caryn January 1999 (has links)
No description available.
|
2 |
Processo automático de reconhecimento de texto em imagens de documentos de identificação genéricos. / Automatic text recognition process in identification document images.Romero, Rodolfo Valiente 12 December 2017 (has links)
Existe uma busca crescente por métodos de extração de texto em imagens de documentos. O uso de imagens digitais tem se tornado cada vez mais frequente em diversas áreas. O mundo moderno está cheio de texto, que os seres humanos usam para identificar objetos, navegar e tomar decisões. Embora o problema do reconhecimento de texto tenha sido amplamente estudado dentro de determinados domínios, detectar e ler texto em documentos de identificação, continua sendo um desafio aberto. Apresenta-se uma arquitetura que integra os diferentes algoritmos de localização, extração e reconhecimento aplicados à extração de texto em documentos de identificação genéricos. O método de localização proposto usa o algoritmo MSER junto com uma melhoria do contraste e a informação das bordas dos objetos da imagem, para localizar os possíveis caracteres. A etapa de seleção desenvolveu-se mediante a busca de heurísticas, capazes de classificar as regiões localizadas como textuais e não-textuais. Na etapa de reconhecimento é proposto um método iterativo para melhorar o desempenho do OCR. O processo foi avaliado usando as métricas precisão e revocação e foi realizada uma prova de conceito do sistema em um ambiente real. A abordagem proposta é robusta na detecção de textos oriundos de imagens complexas com diferentes orientações, dimensões e cores. O sistema de reconhecimento de texto proposto apresenta resultados competitivos, tanto em precisão e taxa de reconhecimento, quando comparados com outros sistemas. Mostrando excelente desempenho e viabilidade de sua implementação em sistemas reais. / The use of digital images has become more and more frequent in several areas. The modern world is full of text, which humans use to identify objects, navigate and make decisions. Although the problem of text recognition has been extensively studied within certain domains, detecting and recognizing text in identification documents remains an open challenge. We present an architecture that integrates the different localization, extraction and recognition algorithms applied to extracting text in generic identification documents. The proposed localization method uses the MSER algorithm together to contrast enhance and edge detection to find the possible characters. The selection stage was developed through the search for heuristics, capable of classifying the located regions in textual and non-textual. In the recognition step, an iterative method is proposed to improve OCR performance. The process was evaluated using the metrics precision and recall and a proof of concept of the system was performed in a real environment. The proposed approach is robust in detecting texts from complex images with different orientations, dimensions and colors. The text recognition system presents competitive results, both in accuracy and recognition rate, when compared with other systems in the current technical literature. Showing excellent performance and feasibility of its implementation in real systems.
|
3 |
Processo automático de reconhecimento de texto em imagens de documentos de identificação genéricos. / Automatic text recognition process in identification document images.Rodolfo Valiente Romero 12 December 2017 (has links)
Existe uma busca crescente por métodos de extração de texto em imagens de documentos. O uso de imagens digitais tem se tornado cada vez mais frequente em diversas áreas. O mundo moderno está cheio de texto, que os seres humanos usam para identificar objetos, navegar e tomar decisões. Embora o problema do reconhecimento de texto tenha sido amplamente estudado dentro de determinados domínios, detectar e ler texto em documentos de identificação, continua sendo um desafio aberto. Apresenta-se uma arquitetura que integra os diferentes algoritmos de localização, extração e reconhecimento aplicados à extração de texto em documentos de identificação genéricos. O método de localização proposto usa o algoritmo MSER junto com uma melhoria do contraste e a informação das bordas dos objetos da imagem, para localizar os possíveis caracteres. A etapa de seleção desenvolveu-se mediante a busca de heurísticas, capazes de classificar as regiões localizadas como textuais e não-textuais. Na etapa de reconhecimento é proposto um método iterativo para melhorar o desempenho do OCR. O processo foi avaliado usando as métricas precisão e revocação e foi realizada uma prova de conceito do sistema em um ambiente real. A abordagem proposta é robusta na detecção de textos oriundos de imagens complexas com diferentes orientações, dimensões e cores. O sistema de reconhecimento de texto proposto apresenta resultados competitivos, tanto em precisão e taxa de reconhecimento, quando comparados com outros sistemas. Mostrando excelente desempenho e viabilidade de sua implementação em sistemas reais. / The use of digital images has become more and more frequent in several areas. The modern world is full of text, which humans use to identify objects, navigate and make decisions. Although the problem of text recognition has been extensively studied within certain domains, detecting and recognizing text in identification documents remains an open challenge. We present an architecture that integrates the different localization, extraction and recognition algorithms applied to extracting text in generic identification documents. The proposed localization method uses the MSER algorithm together to contrast enhance and edge detection to find the possible characters. The selection stage was developed through the search for heuristics, capable of classifying the located regions in textual and non-textual. In the recognition step, an iterative method is proposed to improve OCR performance. The process was evaluated using the metrics precision and recall and a proof of concept of the system was performed in a real environment. The proposed approach is robust in detecting texts from complex images with different orientations, dimensions and colors. The text recognition system presents competitive results, both in accuracy and recognition rate, when compared with other systems in the current technical literature. Showing excellent performance and feasibility of its implementation in real systems.
|
4 |
Automatiserad matchning av relaterad data från olika datakällor / Automated matching of related data from different data sourcesHarch, Gais, Ullström, Robin January 2014 (has links)
Sociala medier innehåller idag massor av information som kan bidra till att ge applikationer och produkter ett stort mervärde genom att ge en förbättrad användarupplevelse. I vissa fall kan sådan information inte erhållas utan att först matcha data från en eller flera datakällor genom en data fusion. Eniro Initiatives AB vill undersöka möjligheter för att genomföra en automatiserad data fusion genom att koppla företag från sitt API till motsvarande företag på sociala medier. Problematiken ligger i att den enda helt säkra källan till matchning av alla svenska företag är dess organisationsnummer, vilket är data som inte finns tillgänglig hos API:er från utländska företag. Syftet var att undersöka möjligheter för att på automatiserat sätt kunna matcha relaterad data från olika datakällor. I detta examensarbete har en prototyp utvecklats som matchar företag från Eniros API med företags sidor från Facebooks API. Resultatet från tester av denna prototyp visar dock brister, då det uppkom fall där redundant information bidrog till att prototypen kunde godkänna inofficiella sidor med koppling till det relevanta företaget, vilket inte var önskvärt. / Social media today contains a lot of information that can add a great value for applications and products by achieve an improved user experience. In some cases, such information cannot be obtained without matching data from one or several data sources through a data fusion. Eniro Initiatives AB wants to explore opportunities to implement an automated data fusion model by matching companies from its own API to the corresponding company on social media. The problem is that the only completely secured data of matching of all Swedish companies is its corporate identity, which is data that is not available with APIs that origin from foreign companies. The aim was to explore possibilities for the automated way to match related data from different data sources. In this thesis, a prototype was developed to match companies from Eniro’s API with company pages from Facebook's API. The results from the tests of this prototype shows small deficiencies where redundant information made the prototype able to approve unofficial pages with links to the relevant company, which was not desirable.
|
5 |
Interpreting the Script : Image Analysis and Machine Learning for Quantitative Studies of Pre-modern ManuscriptsWahlberg, Fredrik January 2017 (has links)
The humanities have for a long time been a collection of fields that have not gained from the advancements in computational power, as predicted by Moore´s law. Fields like medicine, biology, physics, chemistry, geology and economics have all developed quantitative tools that take advantage of the exponential increase of processing power over time. Recent advances in computerized pattern recognition, in combination with a rapid digitization of historical document collections around the world, is about to change this. The first part of this dissertation focuses on constructing a full system for finding handwritten words in historical manuscripts. A novel segmentation algorithm is presented, capable of finding and separating text lines in pre-modern manuscripts. Text recognition is performed by translating the image data of the text lines into sequences of numbers, called features. Commonly used features are analysed and evaluated on manuscript sources from the Uppsala University library Carolina Rediviva and the US Library of Congress. Decoding the text in the vast number of photographed manuscripts from our libraries makes computational linguistics and social network analysis directly applicable to historical sources. Hence, text recognition is considered a key technology for the future of computerized research methods in the humanities. The second part of this thesis addresses digital palaeography, using a computers superior capacity for endlessly performing measurements on ink stroke shapes. Objective criteria of character shapes only partly catches what a palaeographer use for assessing similarity. The palaeographer often gets a feel for the scribe's style. This is, however, hard to quantify. A method for identifying the scribal hands of a pre-modern copy of the revelations of saint Bridget of Sweden, using semi-supervised learning, is presented. Methods for production year estimation are presented and evaluated on a collection with close to 11000 medieval charters. The production dates are estimated using a Gaussian process, where the uncertainty is inferred together with the most likely production year. In summary, this dissertation presents several novel methods related to image analysis and machine learning. In combination with recent advances of the field, they enable efficient computational analysis of very large collections of historical documents. / q2b
|
6 |
Automated system tests with image recognition : focused on text detection and recognition / Automatiserat systemtest med bildigenkänning : fokuserat på text detektering och igenkänningOlsson, Oskar, Eriksson, Moa January 2019 (has links)
Today’s airplanes and modern cars are equipped with displays to communicate important information to the pilot or driver. These displays needs to be tested for safety reasons; displays that fail can be a huge safety risk and lead to catastrophic events. Today displays are tested by checking the output signals or with the help of a person who validates the physical display manually. However this technique is very inefficient and can lead to important errors being unnoticed. MindRoad AB is searching for a solution where validation of the display is made from a camera pointed at it, text and numbers will then be recognized using a computer vision algorithm and validated in a time efficient and accurate way. This thesis compares the three different text detection algorithms, EAST, SWT and Tesseract to determine the most suitable for continued work. The chosen algorithm is then optimized and the possibility to develop a program which meets MindRoad ABs expectations is investigated. As a result several algorithms were combined to a fully working program to detect and recognize text in industrial displays.
|
7 |
Rozpoznávání ručně psaného textu pomocí hlubokých neuronových sítí / Deep Networks for Handwriting RecognitionRichtarik, Lukáš January 2020 (has links)
The work deals with the issue of handrwritten text recognition problem with deep neural networks. It focuses on the use of sequence to sequence method using encoder-decoder model. It also includes design of encoder-decoder model for handwritten text recognition using a transformer instead of recurrent neurons and a set of experiments that were performed on it.
|
8 |
Svět kolem nás jako hyperlink / Local Environment as HyperlinkMešár, Marek January 2013 (has links)
Document describes selected techniques and approaches to problem of text detection, extraction and recognition on modern mobile devices. It also describes their proper presentation to the user interface and their conversion to hyperlinks as a source of information about surrounding world. The paper outlines text detection and recognition technique based on MSER detection and also describes the use of image features tracking method for text motion estimation.
|
9 |
Technological solution for the identification and reduction of stress level using wearablesRaymondi, Luis Guillermo Antezana, Guzman, Fabricio Eduardo Aguirre, Armas-Aguirre, Jimmy, Agonzalez, Paola 01 June 2020 (has links)
El texto completo de este trabajo no está disponible en el Repositorio Académico UPC por restricciones de la casa editorial donde ha sido publicado. / In this article, a technological solution is proposed to identify and reduce the level of mental stress of a person through a wearable device. The proposal identifies a physiological variable: Heart rate, through the integration between a wearable and a mobile application through text recognition using the back camera of a smartphone. As part of the process, the technological solution shows a list of guidelines depending on the level of stress obtained in a given time. Once completed, it can be measured again in order to confirm the evolution of your stress level. This proposal allows the patient to keep his stress level under control in an effective and accessible way in real time. The proposal consists of four phases: 1. Collection of parameters through the wearable; 2. Data reception by the mobile application; 3. Data storage in a cloud environment and 4. Data collection and processing; this last phase is divided into 4 sub-phases: 4.1. Stress level analysis, 4.2. Recommendations to decrease the level obtained, 4.3. Comparison between measurements and 4.4. Measurement history per day. The proposal was validated in a workplace with people from 20 to 35 years old located in Lima, Peru. Preliminary results showed that 80% of patients managed to reduce their stress level with the proposed solution. / Revisión por pares
|
10 |
Word Recognition in Nutrition Labels with Convolutional Neural NetworkKhasgiwala, Anuj 01 August 2018 (has links)
Nowadays, everyone is very busy and running around trying to maintain a balance between their work life and family, as the working hours are increasing day by day. In such hassled life people either ignore or do not give enough attention to a healthy diet. An imperative part of a healthy eating routine is the cognizance and maintenance of nourishing data and comprehension of how extraordinary sustenance and nutritious constituents influence our bodies. Besides in the USA, in many other countries, nutritional information is fundamentally passed on to consumers through nutrition labels (NLs) which can be found in all packaged food products in the form of nutrition table. However, sometimes it turns out to be challenging to utilize this information available in these NLs notwithstanding for consumers who are health conscious as they may not be familiar with nutritional terms and discover it hard to relate nutritional information into their day by day activities because of lack of time, inspiration, or training. So it is essential to automate this information gathering and interpretation procedure by incorporating Machine Learning based algorithm to abstract nutritional information from NLs on the grounds that it enhances the consumer’s capacity to participate in nonstop nutritional information gathering and analysis.
|
Page generated in 0.0694 seconds