Open Access
XML data integration with identification
Antonella Poggi,Serge Abiteboul +1 more
- 01 Jan 2005
- pp 348-355
22
TL;DR: This work proposes a formal framework for XML data integration systems based on an expressive XML global schema, a set of XML data sources and a set of mappings specified by means of a simple tree language, and defines an identification function that aims at globally identifying nodes coming from different sources.
Abstract: Data integration is the problem of combining data residing at different sources and providing the user with a virtual view, called the global schema, which is independent of the model and the physical origin of the sources. Whereas many data integration systems and theoretical works have been proposed for relational data, XML data integration has received little investigation so far. Our goal is therefore to address some of its related issues. In particular, we highlight two major issues that emerge in the XML context: (i) the global schema may be characterized by a set of constraints, expressed by means of a DTD and XML integrity constraints; (ii) the concept of node identity requires introducing semantic criteria to identify nodes coming from different sources. We propose a formal framework for XML data integration systems based on an expressive XML global schema, a set of XML data sources and a set of mappings specified by means of a simple tree language. Then, we define an identification function that aims at globally identifying nodes coming from different sources. Finally, we propose algorithms to answer queries under different assumptions for the mappings.
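To make the idea of an identification function more concrete, the following is a minimal Python sketch, not the paper's actual definition. It assumes that nodes are identified by a key value, here a hypothetical `ssn` child element, so that nodes from different sources with the same tag and key receive the same global identifier. The source documents, the element names and the helper functions `identify` and `group_by_identity` are all invented for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Two hypothetical sources describing overlapping people (invented data).
SOURCE_A = "<people><person><ssn>123</ssn><name>Ann</name></person></people>"
SOURCE_B = (
    "<staff>"
    "<person><ssn>123</ssn><email>ann@example.org</email></person>"
    "<person><ssn>456</ssn><name>Bob</name></person>"
    "</staff>"
)


def identify(node, key_child="ssn"):
    """Hypothetical identification function: (tag, key value) is the global id.

    Returns None when the node has no key child, i.e. it cannot be identified.
    """
    key = node.findtext(key_child)
    return (node.tag, key) if key is not None else None


def group_by_identity(sources):
    """Group nodes from all sources that receive the same global identifier."""
    groups = defaultdict(list)
    for source_name, xml_text in sources.items():
        root = ET.fromstring(xml_text)
        for node in root.iter("person"):
            gid = identify(node)
            if gid is not None:
                groups[gid].append((source_name, node))
    return dict(groups)


groups = group_by_identity({"A": SOURCE_A, "B": SOURCE_B})
for gid, members in sorted(groups.items()):
    # Print each global identifier with the sources that contribute a node to it.
    print(gid, [source_name for source_name, _ in members])
```

In this sketch, nodes grouped under one identifier would be treated as the same node of the global view. The semantic criteria used in the paper, which are tied to the constraints of the global schema, may be considerably richer than a single key element.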
Citations
Heterogeneous fuzzy XML data integration based on structural and semantic similarities
Zongmin Ma,Zhen Zhao,Li Yan +2 more
TL;DR: A new fuzzy XML tree model is proposed, and an effective algorithm based on the tree edit distance is presented to identify the structural and semantic similarities between the fuzzy documents represented in the proposed fuzzy XML tree model.
17
Transition of Keys in XML Data Transformation
Md. Sumon Shahriar,Jixue Liu +1 more
- 13 Oct 2008
TL;DR: This work defines XML keys and XML functional dependencies (XFDs), shows key preservation in transformation and how to capture keys as XFDs, and refers to this as key transition, a step towards handling the issues of integrity constraints in XML data integration.
8
•Dissertation
From Relations to XML: Cleaning, Integrating and Securing Data
Xibei Jia
- 01 Jan 2008
TL;DR: This thesis presents an integrated framework for cleaning, integrating and securing data using both relational and XML technologies, and provides a uniform approach to clean, integrate and secure data.
7
•Proceedings Article
Towards the Preservation of Keys in XML Data Transformation for Integration.
Md. Sumon Shahriar
- 01 Jan 2008
TL;DR: This paper studies how XML keys are transformed, and whether the transformed keys are valid and preserved in the target schema, and shows that the important XML transformation operations are key preserving under necessary and sufficient conditions.
7
XML data fusion
Frantchesco Cecchin,Cristina Dutra de Aguiar Ciferri,Carmem S. Hara +2 more
- 30 Aug 2010
TL;DR: This paper proposes a model for XML data fusion, which allows the integrator to define data cleaning rules for solving value conflicts that may have been detected during the integration process, and develops XFusion, a rule-based cleaning tool that stores curated data in an integrated repository.
References
Data integration: a theoretical perspective
Maurizio Lenzerini
- 03 Jun 2002
TL;DR: The tutorial is focused on some of the theoretical issues that are relevant for data integration: modeling a data integration application, processing queries in data integration, dealing with inconsistent data sources, and reasoning on queries.
Answering queries using views: A survey
Alon Halevy
- 01 Dec 2001
TL;DR: The state of the art on the problem of answering queries using views is surveyed, the algorithms proposed to solve it are described, and the disparate works are synthesized into a coherent framework.
Incomplete Information in Relational Databases
Tomasz Imielinski,Witold Lipski +1 more
TL;DR: There are precise conditions that should be satisfied in a semantically meaningful extension of the usual relational operators, such as projection, selection, union, and join, from operators on relations to operators on tables with “null values” of various kinds allowed.
Information integration using logical views
Jeffrey D. Ullman
- 08 Jan 1997
TL;DR: The formal basis of information-integration techniques, which are closely related to containment algorithms for conjunctive queries and/or Datalog programs, is reviewed.
XML data exchange: consistency and query answering
Marcelo Arenas,Leonid Libkin +1 more
- 13 Jun 2005
TL;DR: This paper starts looking into the basic properties of XML data exchange, that is, restructuring of XML documents that conform to a source DTD under a target DTD, and answering queries written over the target schema, and proves a dichotomy theorem that classifies data exchange settings into those over which query answering is tractable, and those over which it is coNP-complete.