Global ETD Search

31	Ontologias no processo de indexação automática de documentos textuais / Ontologies in automatic indexing process of textual documents
Pansani Junior, Eder Antonio [UNESP]; 06 May 2016
Despite the technological advances of recent decades, searching for relevant information is still an arduous task. Information retrieval involves, on the one hand, a document collection that must be represented by linguistic expressions summarizing its thematic content. On the other hand, people try to describe their information needs linguistically in order to obtain documents relevant to those needs. An information retrieval system is therefore a mediator between a document collection and its users. One of the aspects that directly affects its efficiency is how documents are represented. Research on automatic indexing therefore gains importance, particularly in environments with large-scale production and dissemination of documents, such as the Web. Controlled vocabularies, used as elements of terminological normalization, are one resource for improving the results of the indexing process.
This study aims to propose, develop, and evaluate a method for using ontologies in the automatic indexing of textual documents. The method draws on the logical and conceptual structure of domain ontologies and enables automatic indexing systems to perform automatic inferences, favoring a more semantic and comprehensive representation of documents. The study concludes that using ontologies as controlled vocabularies in automatic indexing systems can offer promising results, allowing the automatic discovery of terms and the resolution of some of the language-related problems that permeate the whole information retrieval process.
Subjects: Automatic indexing; Controlled vocabulary; Ontology; Information retrieval
32	Detecção rápida de legendas em vídeos utilizando o ritmo visual / Fast video caption detection based on visual rhythm
Valio, Felipe Braunger, 1984-; 19 August 2018
Advisors: Neucimar Jerônimo Leite, Hélio Pedrini. Master's dissertation, Universidade Estadual de Campinas, Instituto de Computação. Previous issue date: 2011.
Abstract: Detecting text in images is a problem that has been studied for several decades. Many works extend existing methods to video analysis; however, few of them create or adapt approaches that take into account characteristics inherent to video, such as temporal information. A problem particular to video, and the focus of this work, is caption detection. A fast approach for locating video frames that contain captions is proposed, based on a special data structure called visual rhythm. The method is robust with respect to the alphabet used, font style, color intensity, and caption orientation. Several test sets were used in our experiments to demonstrate the effectiveness of the method.
Degree: Master's degree in Computer Science
Subjects: Text processing (Computer science); Automatic indexing; Automatic tracking
33	Semantic search of multimedia data objects through collaborative intelligence
Chan, Wing Sze; 01 January 2010
No description available.
Subjects: Audio-visual materials; Automatic indexing; Data processing; Multimedia systems; Semantic computing
34	Automatic index generation for the free-text based database
Leung Chi Hong; January 1992
Thesis (M.Phil.), Chinese University of Hong Kong, 1992. Includes bibliographical references (leaves 183-184).
Contents:
Chapter one: Introduction
Chapter two: Background knowledge and linguistic approaches of automatic indexing
  2.1 Definition of index and indexing
  2.2 Indexing methods and problems
  2.3 Automatic indexing and human indexing
  2.4 Different approaches of automatic indexing
  2.5 Example of semantic approach
  2.6 Example of syntactic approach
  2.7 Comments on semantic and syntactic approaches
Chapter three: Rationale and methodology of automatic index generation
  3.1 Problems caused by natural language
  3.2 Usage of word frequencies
  3.3 Brief description of rationale
  3.4 Automatic index generation
    3.4.1 Training phase
      3.4.1.1 Selection of training documents
      3.4.1.2 Control and standardization of variants of words
      3.4.1.3 Calculation of associations between words and indexes
      3.4.1.4 Discarding false associations
    3.4.2 Indexing phase
    3.4.3 Example of automatic indexing
  3.5 Related researches
  3.6 Word diversity and its effect on automatic indexing
  3.7 Factors affecting performance of automatic indexing
  3.8 Application of semantic representation
    3.8.1 Problem of natural language
    3.8.2 Use of concept headings
    3.8.3 Example of using concept headings in automatic indexing
    3.8.4 Advantages of concept headings
    3.8.5 Disadvantages of concept headings
  3.9 Correctness prediction for proposed indexes
    3.9.1 Example of using index proposing rate
  3.10 Effect of subject matter on automatic indexing
  3.11 Comparison with other indexing methods
  3.12 Proposal for applying Chinese medical knowledge
Chapter four: Simulations of automatic index generation
  4.1 Training phase simulations
    4.1.1 Simulation of association calculation (word diversity uncontrolled)
    4.1.2 Simulation of association calculation (word diversity controlled)
    4.1.3 Simulation of discarding false associations
  4.2 Indexing phase simulation
  4.3 Simulation of using concept headings
  4.4 Simulation for testing performance of predicting index correctness
  4.5 Summary
Chapter five: Real case study in database of Chinese Medicinal Material Research Center
  5.1 Selection of real documents
  5.2 Case study one: Overall performance using real data
    5.2.1 Sample results of automatic indexing for real documents
  5.3 Case study two: Using multi-word terms
  5.4 Case study three: Using concept headings
  5.5 Case study four: Prediction of proposed index correctness
  5.6 Case study five: Use of (Σ ΔRij) Fi to determine false association
  5.7 Case study six: Effect of word diversity
  5.8 Summary
Chapter six: Conclusion
Appendix A: List of stopwords
Appendix B: Index terms used in case studies
References
Subjects: Automatic indexing; Medicine, Chinese Traditional--abstracts
35	Child's play: activity recognition for monitoring children's developmental progress with augmented toys
Westeyn, Tracy Lee; 20 May 2010
The way in which infants play with objects can be indicative of their developmental progress and may serve as an early indicator of developmental delays. However, observing children interacting with toys for the purpose of quantitative analysis can be a difficult task. To better quantify how play may serve as an early indicator, researchers have conducted retrospective studies examining the differences in object play behaviors among infants. Such studies require that researchers repeatedly inspect videos of play, often at speeds much slower than real time, to mark points of interest. The research presented in this dissertation examines whether a combination of sensors embedded within toys and automatic pattern recognition of object play behaviors can help expedite this process. For my dissertation, I developed the Child'sPlay system, which uses augmented toys and statistical models to automatically provide quantitative measures of object play interactions, together with the PlayView interface for viewing annotated play data for later analysis. In this dissertation, I examine the hypothesis that sensors embedded in objects can provide sufficient data for automatic recognition of certain exploratory, relational, and functional object play behaviors in semi-naturalistic environments, and that a continuum of recognition accuracy exists which allows automatic indexing to be useful for retrospective review. I designed several augmented toys and used them to collect object play data from more than fifty play sessions. I conducted pattern recognition experiments on this data to produce statistical models that automatically classify children's object play behaviors.
In addition, I conducted a user study with twenty participants to determine whether annotations automatically generated from these models help improve performance in retrospective review tasks. My results indicate that these statistical models increase user performance and decrease perceived effort when combined with the PlayView interface during retrospective review. Users prefer high-quality annotations, and their presence increases the effective retrieval rates of object play behaviors.
Subjects: Retrospective review; Wireless sensing; Child development; Object play; Pattern recognition; Automatic indexing; Human activity recognition; Pattern recognition systems
36	Lingo – ein System zur automatischen Indexierung – Anwendung und Einsatzmöglichkeiten / Lingo, a system for automatic indexing: application and possible uses
Müller, Thomas; 26 January 2011
The heterogeneous museum holdings (text, images, physical objects) of the Haus der Geschichte der Bundesrepublik Deutschland currently comprise more than 365,000 object descriptions of objects from contemporary history. Based on the open source indexing system lingo, an automatic indexing process is being developed that, building on the existing framework, generates standardized descriptive features and makes them available as index terms for retrieval. The goal is a uniform search across the object descriptions, achieved by linguistically and semantically unifying the index terms.
Subjects: Object description; Information retrieval; Automatic indexing; Database retrieval; Linguistics; Lingo; Database; Indexing (subject cataloguing); ddc:020
37	Ranking de publicações baseado na extração de textos da Internet / Ranking of publications based on extraction of texts of the Internet
Oliveira, Henrique Przibisczki de; 12 April 2009
Advisor: Ricardo de Oliveira Anido. Master's dissertation, Universidade Estadual de Campinas, Instituto de Computação. Previous issue date: 2009.
Abstract: Several current ranking methods compare publication venues with respect to quality or impact. This information is very important for researchers choosing renowned venues in which to publish their research, and institutions may promote their researchers based on the quality of the venues where they publish. It can also be valuable for a government allocating resources to institutions, or for a company evaluating the quality of a job candidate. There are several distinct metrics for ranking venues, but what most of them have in common is the use of citations. Therefore, however prestigious a venue is among researchers, if it is not indexed in a citation database its quality will not be considered. This work proposes a method for ranking publication venues that obtains its information not from an existing citation database but from another data source: the Web. The web pages of university professors are visited and their publications are extracted. The venue is extracted from each publication, and the venues are then ranked based on those that researchers chose to show on their pages. This method covers publication venues absent from current databases, creating a new ranking of publications.
Several interesting computational problems are addressed in this work: information search on the Internet, text segmentation, extraction of components from a bibliographic reference, and clustering.
Degree: Master's degree in Computer Science (area: Computing Methodology and Techniques)
Subjects: Science publishing; Bibliographic classification; Bibliometrics; Automatic indexing; Information retrieval; Bibliographic references
38	A Framework of Automatic Subject Term Assignment: An Indexing Conception-Based Approach
Chung, EunKyung; 12 1900
The purpose of this dissertation is to examine whether an understanding of the subject indexing processes conducted by human indexers has a positive impact on the effectiveness of automatic subject term assignment through text categorization (TC). More specifically, human indexers' subject indexing approaches or conceptions, in conjunction with semantic sources, were explored in the context of a typical scientific journal article data set. Based on the premise that subject indexing approaches or conceptions with semantic sources are important for automatic subject term assignment through TC, this study proposed an indexing conception-based framework. Three hypotheses were tested: 1) the effectiveness of semantic sources, 2) the effectiveness of an indexing conception-based framework, and 3) the effectiveness of each of three indexing conception-based approaches (the content-oriented, the document-oriented, and the domain-oriented approaches). The experiments were conducted using a support vector machine implementation in WEKA (Witten & Frank, 2000). The results showed that cited works, source title, and title were as effective as the full text, while keyword was more effective than the full text. In addition, the indexing conception-based framework was more effective than the full text. In particular, the content-oriented and the document-oriented indexing approaches were more effective than the full text. Among the three indexing conception-based approaches, the content-oriented and the document-oriented approaches were more effective than the domain-oriented approach. In other words, in the context of a typical scientific journal article data set, the objective contents and the authors' intentions received more focus than the possible users' needs.
The findings of this study support the view that incorporating human indexers' indexing approaches or conceptions, in conjunction with semantic sources, has a positive impact on the effectiveness of automatic subject term assignment.
Subjects: Automatic indexing; Indexing; Subject headings; subject indexing processes; text categorization (TC); automatic subject term assignment; subject indexing approaches
39	Regensburger Verbundklassifikation und Schlagwortnormdatei im Tandem / Regensburg Union Classification and Subject Headings Authority File in tandem
Probstmeyer, Judith; 24 January 2011
In the catalogue of the Südwestverbund, numerous publications carry both SWD subject headings and heading strings and notations of the Regensburger Verbundklassifikation (RVK). On this data basis, automatic correlations between SWD and RVK were generated at Mannheim University Library and analysed as part of a bachelor's thesis at the Hochschule der Medien Stuttgart. The talk presents the results of the analysis and considers possible practical applications of such correlations.
Subjects: info:eu-repo/classification/ddc/020; ddc:020; RVK; SWD; Automatic subject indexing
40	A novel sentence-based approach for extracting modifying adjective-noun pairs from a corpus
Shen, Rong; 01 October 2002
No description available.
Subjects: Automatic indexing; Information retrieval; Subject headings; Thesauri; Electrical and Computer Engineering; Engineering; Systems and Communications
