Information integration has been widely addressed over the last several decades. However, it is far from solved due to the complexity of resolving schema and data heterogeneities. In this paper, we present our attempt to alleviate this difficulty by providing keyword search functionality for integrating information from heterogeneous databases. Our solution does not require a predefined global schema or any mappings between databases. Rather, it relies on an operator called keyword join, which takes a set of lists of partial answers from different data sources as input and outputs a list of integrated results, each formed by joining tuples from the input lists based on predefined similarity measures. Our system allows source databases to remain autonomous and the system itself to be dynamic and extensible. We have tested our system with a real dataset and a benchmark; the results show that our proposed method is practical and effective. / Singapore-MIT Alliance (SMA)
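The keyword join operator described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the function names (`jaccard`, `tuple_similarity`, `keyword_join`), the token-level Jaccard measure, the 0.5 threshold, and the toy tuples are all assumptions chosen for illustration. The abstract only states that tuples from the input lists are joined based on predefined similarity measures.

```python
from itertools import product


def jaccard(a, b):
    """Token-set Jaccard similarity between two strings (assumed measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def tuple_similarity(t1, t2):
    """Best similarity over all pairs of attribute values of two tuples.

    Tuples are plain dicts; attribute names are ignored, so no schema
    mapping between the sources is needed.
    """
    return max(
        (jaccard(str(v1), str(v2)) for v1 in t1.values() for v2 in t2.values()),
        default=0.0,
    )


def keyword_join(partial_lists, threshold=0.5):
    """Combine one tuple from each input list into an integrated result.

    A combination is kept when every adjacent pair of tuples reaches the
    similarity threshold. Results are sorted by their weakest link,
    best first. The threshold value is hypothetical.
    """
    results = []
    for combo in product(*partial_lists):
        scores = [tuple_similarity(a, b) for a, b in zip(combo, combo[1:])]
        if scores and min(scores) >= threshold:
            results.append((min(scores), combo))
    results.sort(key=lambda r: r[0], reverse=True)
    return results


# Toy partial answers from two hypothetical sources with different schemas.
db1 = [
    {"title": "Keyword Search", "author": "Ada Lovelace"},
    {"title": "Query Optimization", "author": "Alan Turing"},
]
db2 = [
    {"name": "Ada Lovelace", "affiliation": "Example University"},
    {"name": "Grace Hopper", "affiliation": "Sample Institute"},
]

results = keyword_join([db1, db2])
```

In this sketch the two sources share no attribute names, yet the first tuple of each list is joined because their values overlap. Enumerating every combination is exponential in the number of lists, so a practical system would need a more selective strategy than this brute-force loop.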
Identifier | oai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/30263 |
Date | 01 1900 |
Creators | Yu, Bei, Liu, Ling, Ooi, Beng Chin, Tan, Kian Lee |
Source Sets | M.I.T. Theses and Dissertation |
Language | English |
Detected Language | English |
Type | Article |
Format | 236260 bytes, application/pdf |
Relation | Computer Science (CS) |