21 |
Semantic Integration across Heterogeneous Databases: Finding Data Correspondences using Agglomerative Hierarchical Clustering and Artificial Neural Networks / Semantisk integrering mellan heterogena databaser: Hitta datakopplingar med hjälp av hierarkisk klustring och artificiella neuronnät
Hobro, Mark. January 2018
Data integration is an important part of the database field when it comes to database migrations and the merging of data. Research in the area has grown over the last 20 years with the addition of machine learning approaches. Because of the complexity of the field, no go-to solution has appeared; instead, a wide variety of ways of enhancing database migrations have emerged. This thesis examines how well a learning-based solution performs on the semantic integration problem in database migrations. Two algorithms are implemented. The first is based on information retrieval theory and yields a matching result that serves as a benchmark for measuring the performance of the machine learning algorithm. The machine learning approach groups data with agglomerative hierarchical clustering and then trains a neural network to recognize patterns in the data, which allows predictions about potential data correspondences across two databases. The results show that agglomerative hierarchical clustering performs well at grouping the data into classes, which can in turn be used for training a neural network. The matching algorithm gives a high recall of matching tables, but improvements are needed to achieve both high recall and high precision. The conclusion is that the proposed learning-based approach, using agglomerative hierarchical clustering and a neural network, works as a solid base for semi-automating the data integration problem seen in this thesis, but the solution needs to be enhanced with scenario-specific algorithms and rules to reach the desired performance.
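As a rough illustration of two ideas named in this abstract, the sketch below groups feature vectors with a small agglomerative (average-linkage) clustering routine and scores a set of predicted table correspondences by precision and recall. It is a minimal sketch under stated assumptions, not the thesis's implementation: the function names, the choice of average linkage and Euclidean distance, and the example data are all hypothetical.

```python
from itertools import combinations
from math import dist


def agglomerative_clusters(points, n_clusters):
    """Repeatedly merge the two closest clusters until n_clusters remain.

    Average linkage and Euclidean distance are assumptions made for this
    sketch; the abstract does not state which linkage the thesis used.
    """
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Mean pairwise distance between the members of clusters a and b.
        total = sum(dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > max(n_clusters, 1):
        # combinations() yields index pairs (a, b) with a < b, so deleting
        # clusters[b] after the merge never shifts clusters[a].
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


def precision_recall(predicted, actual):
    """Precision and recall of predicted matches against a reference set."""
    predicted, actual = set(predicted), set(actual)
    hits = len(predicted & actual)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical feature vectors for four database columns.
columns = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
groups = agglomerative_clusters(columns, n_clusters=2)

# Hypothetical table correspondences: every true match is found (high
# recall), but one spurious match lowers the precision.
found = {("customer", "client"), ("orders", "purchases"), ("orders", "invoices")}
truth = {("customer", "client"), ("orders", "purchases")}
precision, recall = precision_recall(found, truth)
```

The second half mirrors the situation the abstract reports: a matcher can recover all true table matches while still returning extra, incorrect ones, which is why recall and precision have to be read together.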
|
22 |
APPLYING ENTERPRISE MODELS AS INTERFACE FOR INFORMATION SEARCHING
MATONGO, Tanguy; DEGBELO, Auriol. January 2009
Nowadays, more and more companies use Enterprise Models to integrate and coordinate their business processes in order to remain competitive on the market. Enterprise Models therefore play a critical role in this integration, making it possible to improve the objectives of the enterprise and the ways to reach them within a given period of time. Through Enterprise Models, companies are able to improve the management of their operations, actors and processes, and to improve communication within the organisation.
This thesis describes another use of Enterprise Models: applying them as an interface for information searching. The underlying motivation is to show that Enterprise Models can be more than just models and can be used in a more dynamic way, namely through a software program for information searching. The program first extracts the information contained in the Enterprise Models, which are stored in an XML file on the system. The extracted information is then used to express a query that is sent to a search engine, which retrieves documents relevant to the query and returns them to the user.
The thesis was carried out over an entire academic semester. The results of this work are a report summarizing the knowledge gained in the field of study and a software program built as a proof of concept to test the theories.
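The extract-then-query pipeline described in this abstract could look roughly like the sketch below. The abstract does not give the structure of the XML file, so the element and attribute names (`enterpriseModel`, `component`, `type`, `name`) and both helper functions are invented for illustration, and the query is only built as a string rather than sent to any particular search engine.

```python
import xml.etree.ElementTree as ET

# Hypothetical enterprise-model file. The element and attribute names are
# assumptions for this sketch and do not come from the thesis.
MODEL_XML = """
<enterpriseModel>
  <component type="process" name="Order handling"/>
  <component type="actor" name="Sales department"/>
  <component type="goal" name="Reduce delivery time"/>
</enterpriseModel>
"""


def extract_terms(xml_text, wanted_types):
    """Return the names of all model components whose type is wanted."""
    root = ET.fromstring(xml_text)
    return [
        component.get("name")
        for component in root.iter("component")
        if component.get("type") in wanted_types
    ]


def build_query(terms):
    """Join the terms into one query, quoting each so that multi-word
    component names are treated as phrases."""
    return " OR ".join(f'"{term}"' for term in terms)


terms = extract_terms(MODEL_XML, {"process", "goal"})
query = build_query(terms)
```

Keeping extraction and query construction in separate functions means the same model file could feed different search back ends; which back end the thesis used is not stated in the abstract.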
|
23 |
Semantics-enabled framework for knowledge discovery from Earth observation data
Durbha, Surya Srinivas. January 2006
Thesis (Ph.D.) -- Mississippi State University. Department of Electrical and Computer Engineering. / Title from title screen. Includes bibliographical references.
|
24 |
[en] SEMANTIC INTEGRATION OF INFORMATION SYSTEMS / [pt] INTEGRAÇÃO SEMÂNTICA DE SISTEMAS DE INFORMAÇÃO
MARCOS MAGALHAES MOREIRA. 21 January 2004
This work presents a semantic approach to information integration based on the OWL ontology language, proposed as a standard language to facilitate the integration of different information sources. The information integration problem is first presented, and the use of ontologies to solve it is then addressed. Next, strategies to obtain and extract ontologies are identified, with emphasis on database systems. Alternative mappings between classes, properties and instances of the resulting ontologies are also proposed. Finally, a case study is developed to apply and validate the strategies presented. As a result, an integrator system architecture is proposed and the implementation of some of its components is discussed.
|
26 |
DISCOVERY AND PRIORITIZATION OF BIOLOGICAL ENTITIES UNDERLYING COMPLEX DISORDERS BY PHENOME-GENOME NETWORK INTEGRATION
GUDIVADA, RANGA CHANDRA. January 2007
No description available.
|
27 |
Semantic interoperability in ad-hoc computing environments
Rendo Fernandez, Jose Ignacio. January 2007
This thesis introduces a novel approach in which multiple heterogeneous devices collaborate to provide useful applications in an ad-hoc network. It proposes a smart home as a particular ubiquitous computing scenario, considering all the requirements that the literature gives for such systems to succeed. To that end, we envision a horizontally integrated smart home built up from independent components that provide services. These components are described with enough syntactic, semantic and pragmatic knowledge to accomplish spontaneous collaboration. The objective of this collaboration is domestic use, that is, the provision of valuable services for home residents, capable of supporting users in their daily activities. Moreover, for the system to be attractive to potential customers, it should offer high levels of trust and reliability without an excessive price. To achieve this goal, this thesis studies the synergies available when an ontological description of home device functionality is paired with a formal method. We propose an ad-hoc home network in which components are home devices modelled as processes and represented as semantic services by means of the Web Service Ontology (OWL-S). In addition, such services are specified, verified and implemented by means of Communicating Sequential Processes (CSP), a process algebra for describing concurrent systems. The use of an ontology brings the desired levels of knowledge for a system to compose services in an ad-hoc environment. Services are composed by a goal-based system in order to satisfy user needs. This system is capable of understanding both service representations and user context information. Furthermore, the inclusion of a formal method contributes additional semantics to check that such compositions will be correctly implemented and executed, achieving the levels of reliability and cost reduction (costs derived from the design, development and implementation of the system) needed for a smart home to succeed.
|
28 |
Discourse Comprehension in L1 and L2 / The role of context in the processing of discourse-bound words
Kohlstedt, Tatiana. 22 September 2014
This dissertation contributes to research on foreign-language processing by investigating the influence of growing discourse on the semantic processing of words by advanced learners of German (L2) compared with native speakers (L1). The cognitive processing of known words and the inference of the meaning of unknown words with the help of contextual cues in the preceding context were investigated under laboratory conditions during reading and listening comprehension. In the EEG study, participants' brain responses to the critical word in the text were measured during silent reading. The analysis of the N400 event-related potential, which reflects semantic processing, implied that even a reduced supportive context facilitates the processing of the critical known word. In contrast, a coherent neutral context that contains no cues to the meaning of the critical known word does not promote its cognitive processing. When an unknown word occurs in a text, however, only the native speakers, and not the L2 learners, appear to benefit from an extended supportive context in inferring the meaning of this word in real time.
The original research question was extended to test whether people are able to predict an upcoming word in context on the basis of the preceding semantic cues. This ability was investigated during auditory-visual discourse processing. In the eye-tracking study, participants' eye movements towards the critical pictures on the monitor were recorded before the corresponding critical word was named. The data show that native speakers can predict the upcoming input even on the basis of a reduced supportive context. For the L2 learners, a reduced supportive context appears to have no predictive power.
In contrast to the results of the earlier reading study, both participant groups demonstrated a similarly pronounced ability to infer the meaning of an unknown word from the semantic signals of the preceding extended context. This could mean that, in a more natural language-processing situation, especially when they receive information via different sensory channels, advanced L2 learners can use the contextual signals of an extended supportive context as effectively as native speakers to process subsequent words more easily. The reported results were discussed against the background of memory-based theories of discourse processing. It was assumed that the measured differences in context-bound word processing could be related to an unequal ability to build mental models of the processed discourse in L1 and L2.
The findings of this dissertation contribute to knowledge about discourse-processing strategies with new empirical data that provide insight into how written and spoken texts are processed similarly or differently by L2 speakers and native speakers. From the standpoint of foreign-language teaching, the results could be interpreted as evidence for the effectiveness of context-based semantization strategies. The findings can be applied in foreign-language teaching to promote discourse processing in L2 speakers.
|
29 |
Semantic integration of thematic geographic information in a multimedia context
Navarrete Terrassa, Antonio, 1973-. 05 September 2006
Geographic datasets represent reality through a set of thematic entities that are often not precisely defined and that different people may understand in different ways. Integrating geographic information from diverse sources presents an important challenge from the semantic point of view. This thesis proposes a solution to this problem based on the definition of a semantic framework whose core is an ontology that represents the thematic concepts in a repository of datasets, as well as the relations between those concepts. A semi-automatic method is also proposed to merge the application ontologies of the datasets in the repository. The semantic framework supports the definition of semantic services, in particular the integration of thematic information from diverse sources into a new dataset. Finally, the semantic framework and its services have been used in a system for indexing and retrieving geo-referenced multimedia elements based on their thematic geographic content.
|