Global ETD Search

91	Techniques d'optimisation pour des données semi-structurées du web sémantique / Database techniques for semantics-rich semi-structured Web data Leblay, Julien 27 September 2013 (has links) RDF et SPARQL se sont imposés comme modèle de données et langage de requêtes standard pour décrire et interroger les données sur la Toile. D’importantes quantités de données RDF sont désormais disponibles, sous forme de jeux de données ou de méta-données pour des documents semi-structurés, en particulier XML. La coexistence et l’interdépendance grandissantes entre RDF et XML rendent de plus en plus pressant le besoin de représenter et interroger ces données conjointement. Bien que de nombreux travaux couvrent la production et la publication, manuelles ou automatiques, d’annotations pour données semi-structurées, peu de recherches ont été consacrées à l’exploitation de telles données. Cette thèse pose les bases de la gestion de données hybrides XML-RDF. Nous présentons XR, un modèle de données accommodant l’aspect structurel d’XML et la sémantique de RDF. Le modèle est suffisamment général pour représenter des données indépendantes ou interconnectées, pour lesquelles chaque nœud XML est potentiellement une ressource RDF. Nous introduisons le langage XRQ, qui combine les principales caractéristiques des langages XQuery et SPARQL. Le langage permet d’interroger la structure des documents ainsi que la sémantique de leurs annotations, mais aussi de produire des données semi-structurées annotées. Nous introduisons le problème de composition de requêtes dans le langage XRQ et étudions de manière exhaustive les techniques d’évaluation de requêtes possibles. Nous avons développé la plateforme XRP, implantant les algorithmes d’évaluation de requêtes dont nous comparons les performances expérimentalement. Nous présentons une application reposant sur cette plateforme pour l’annotation automatique et manuelle de pages trouvées sur la Toile. Enfin, nous présentons une technique pour l’inférence RDFS dans les systèmes de gestion de données RDF (et par extension XR). / Since the beginning of the Semantic Web, RDF and SPARQL have become the standard data model and query language to describe resources on the Web. Large amounts of RDF data are now available either as stand-alone datasets or as metadata over semi-structured documents, typically XML. The ability to apply RDF annotations over XML data emphasizes the need to represent and query data and metadata simultaneously. While significant efforts have been invested into producing and publishing annotations manually or automatically, little attention has been devoted to exploiting such data. This thesis aims at setting database foundations for the management of hybrid XML-RDF data. We present a data model capturing the structural aspects of XML data and the semantics of RDF. Our model is general enough to describe pure XML or RDF datasets, as well as RDF-annotated XML data, where any XML node can act as a resource. We also introduce the XRQ query language that combines features of both XQuery and SPARQL. XRQ not only allows querying the structure of documents and the semantics of their annotations, but also producing annotated semi-structured data on-the-fly. We introduce the problem of query composition in XRQ, and exhaustively study query evaluation techniques for XR data to demonstrate the feasibility of this data management setting. We have developed an XR platform on top of well-known data management systems for XML and RDF. The platform features several query processing algorithms, whose performance is experimentally compared. We present an application built on top of the XR platform. The application provides manual and automatic annotation tools, and an interface to query annotated Web page and publicly available XML and RDF datasets concurrently. As a generalization of RDF and SPARQL, XR and XRQ enables RDFS-type of query answering. In this respect, we present a technique to support RDFS-entailments in RDF (and by extension XR) data management systems. Web sémantique XML RDF Linked Data Modèles de données Langages de requêtes Composition de requêtes Réponse aux requêtes Optimisation de requêtes Semantic Web XML RDF Linked Data Data models Query languages Query composition Query answering Query optimization
92	CircularTrip and ArcTrip:effective grid access methods for continuous spatial queries. Cheema, Muhammad Aamir, Computer Science & Engineering, Faculty of Engineering, UNSW January 2007 (has links) A k nearest neighbor query q retrieves k objects that lie closest to the query point q among a given set of objects P. With the availability of inexpensive location aware mobile devices, the continuous monitoring of such queries has gained lot of attention and many methods have been proposed for continuously monitoring the kNNs in highly dynamic environment. Multiple continuous queries require real-time results and both the objects and queries issue frequent location updates. Most popular spatial index, R-tree, is not suitable for continuous monitoring of these queries due to its inefficiency in handling frequent updates. Recently, the interest of database community has been shifting towards using grid-based index for continuous queries due to its simplicity and efficient update handling. For kNN queries, the order in which cells of the grid are accessed is very important. In this research, we present two efficient and effective grid access methods, CircularTrip and ArcTrip, that ensure that the number of cells visited for any continuous kNN query is minimum. Our extensive experimental study demonstrates that CircularTrip-based continuous kNN algorithm outperforms existing approaches in terms of both efficiency and space requirement. Moreover, we show that CircularTrip and ArcTrip can be used for many other variants of nearest neighbor queries like constrained nearest neighbor queries, farthest neighbor queries and (k + m)-NN queries. All the algorithms presented for these queries preserve the properties that they visit minimum number of cells for each query and the space requirement is low. Our proposed techniques are flexible and efficient and can be used to answer any query that is hybrid of above mentioned queries. For example, our algorithms can easily be used to efficiently monitor a (k + m) farthest neighbor query in a constrained region with the flexibility that the spatial conditions that constrain the region can be changed by the user at any time. Nearest neighbor analysis (Statistics) Querying (Computer science) Query languages (Computer science) Data structures (Computer science) Real-time data processing. Computational grids (Computer systems) Computer algorithms.
93	CircularTrip and ArcTrip:effective grid access methods for continuous spatial queries. Cheema, Muhammad Aamir, Computer Science & Engineering, Faculty of Engineering, UNSW January 2007 (has links) A k nearest neighbor query q retrieves k objects that lie closest to the query point q among a given set of objects P. With the availability of inexpensive location aware mobile devices, the continuous monitoring of such queries has gained lot of attention and many methods have been proposed for continuously monitoring the kNNs in highly dynamic environment. Multiple continuous queries require real-time results and both the objects and queries issue frequent location updates. Most popular spatial index, R-tree, is not suitable for continuous monitoring of these queries due to its inefficiency in handling frequent updates. Recently, the interest of database community has been shifting towards using grid-based index for continuous queries due to its simplicity and efficient update handling. For kNN queries, the order in which cells of the grid are accessed is very important. In this research, we present two efficient and effective grid access methods, CircularTrip and ArcTrip, that ensure that the number of cells visited for any continuous kNN query is minimum. Our extensive experimental study demonstrates that CircularTrip-based continuous kNN algorithm outperforms existing approaches in terms of both efficiency and space requirement. Moreover, we show that CircularTrip and ArcTrip can be used for many other variants of nearest neighbor queries like constrained nearest neighbor queries, farthest neighbor queries and (k + m)-NN queries. All the algorithms presented for these queries preserve the properties that they visit minimum number of cells for each query and the space requirement is low. Our proposed techniques are flexible and efficient and can be used to answer any query that is hybrid of above mentioned queries. For example, our algorithms can easily be used to efficiently monitor a (k + m) farthest neighbor query in a constrained region with the flexibility that the spatial conditions that constrain the region can be changed by the user at any time. Nearest neighbor analysis (Statistics) Querying (Computer science) Query languages (Computer science) Data structures (Computer science) Real-time data processing. Computational grids (Computer systems) Computer algorithms.
94	Query Segmentation For E-Commerce Sites Gong, Xiaojing 12 July 2013 (has links) Indiana University-Purdue University Indianapolis (IUPUI) / Query segmentation module is an integral part of Natural Language Processing which analyzes users' query and divides them into separate phrases. Published works on the query segmentation focus on the web search using Google n-gram frequencies corpus or text retrieval from relational databases. However, this module is also useful in the domain of E-Commerce for product search. In this thesis, we will discuss query segmentation in the context of the E-Commerce area. We propose a hybrid unsupervised segmentation methodology which is based on prefix tree, mutual information and relative frequency count to compute the score of query pairs and involve Wikipedia for new words recognition. Furthermore, we use two unique E-Commerce evaluation methods to quantify the accuracy of our query segmentation method. Query Segmentation prefix tree unsupervised segmentation E-Commerce Query languages (Computer science) Electronic commerce -- Computer programs Computational linguistics Pattern recognition systems
95	Distributed Immersive Participation : Realising Multi-Criteria Context-Centric Relationships on an Internet of Things Walters, Jamie January 2014 (has links) Advances in Internet-of-Things integrate sensors and actuators in everyday items or even people transforming our society at an accelerated pace. This occurs in areas such as agriculture, logistics, transport, healthcare, and smart cities and has created new ways to interact with and experience entertainment, (serious) games, education, etc. Common to these domains is the challenge to realize and maintain complex relations with any object or individual globally, with the requirement for immediacy in maintaining relations of varying complexity. Existing architectures for maintaining relations on the Internet, e.g., DNS and search engines are insufficient in meeting these challenges. Their deficiencies mandate the research presented in this dissertation enabling the maintenance of dynamic and multi-criteria relationships among entities in real-time in an Internet-of-Things while minimizing the overall cost for maintaining such context-centric relationships. A second challenge is the need to represent nearness in context-centric relationships, since solutions need to build on what is closely related. The dissertation shows that the proximity on relations can be used to bring about the scalability of maintaining relationships across the IoT. It successfully demonstrates the concept and feasibility of self-organizing context-centric overlay networks for maintaining scalable and real-time relationships between endpoints co-located with associated physical entities. This is complemented by an object model for annotating objects and their relationships as derived and defined over the underpinning context interactions. Complementing measures of nearness are added through a non-metric multi-criteria approach to evaluating the notion of context proximity. A query language and an extension to the publish-subscribe approaches achieves distributed support for discovering such relationships; locating entities relative to a defined hyper-sphere of interest. Furthermore, it introduces adaptive algorithms for maintaining such relationships at minimal overall costs. The results demonstrate the feasibility of moving towards context-centric approaches to immersion and that such approaches are realizable over vast and distributed heterogeneous collections of user and their associated context information. pervasive computing ubiquitous computing mediasense context context awareness context centric peer-to-peer immersive participation multi-criteria context context proximity context distance evolving context context-aware query languages publish.subscribe distributed context awareness sensor information context information context models context relationships context relations

Page generated in 0.1099 seconds