Journal Article, DOI: 10.1007/S00778-005-0172-6
Query optimization in XML structured-document databases
Dunren Che, Karl Aberer, Tamer Özsu
- 01 Sep 2006
- Vol. 15, Iss: 3, pp 263-289
TL;DR: A novel approach to XML query optimization performs heuristic-based algebraic transformations on XPath queries, represented as PAT algebraic expressions, using a large set of deterministic algebraic transformation rules.
Abstract: As the volume of information published as XML-compliant documents keeps growing rapidly, efficient and effective query processing and optimization for XML have become more important than ever. This article reports our recent advances in XML structured-document query optimization and elaborates on a novel approach and the techniques developed for it. Our approach performs heuristic-based algebraic transformations on XPath queries, represented as PAT algebraic expressions, to achieve query optimization. This article first presents a comprehensive set of general equivalences with regard to XML documents and XML queries. Based on these equivalences, we developed a large set of deterministic algebraic transformation rules for XML query optimization. Our approach is unique in that it performs exclusively deterministic transformations on queries for fast optimization. The deterministic nature of the approach yields high optimization efficiency and a simple implementation. Our approach works at the logical level and is independent of any particular storage model. Therefore, optimizers based on our approach can be easily adapted to a broad range of XML data/information servers to achieve fast query optimization. An experimental study confirms the validity and effectiveness of the proposed approach.
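The idea of exclusively deterministic transformations can be illustrated with a minimal sketch. It is not the paper's rule set: the operator names (`name`, `union`, `intersect`, `select`), the `Expr` type, and the two rules below are hypothetical stand-ins built on ordinary set-algebra identities, chosen only to show the control flow. Rules are tried in a fixed order and the first match is applied, so the optimizer never enumerates or costs alternative plans.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class Expr:
    """Toy algebraic expression node (hypothetical, not the PAT algebra itself)."""
    op: str                          # "name", "empty", "union", "intersect", "select"
    args: Tuple["Expr", ...] = ()
    label: str = ""                  # element type name or selection predicate


EMPTY = Expr("empty")


def name(n: str) -> Expr:
    return Expr("name", (), n)


def union(a: Expr, b: Expr) -> Expr:
    return Expr("union", (a, b))


def intersect(a: Expr, b: Expr) -> Expr:
    return Expr("intersect", (a, b))


def select(pred: str, e: Expr) -> Expr:
    return Expr("select", (e,), pred)


Rule = Callable[[Expr], Optional[Expr]]


def idempotence(e: Expr) -> Optional[Expr]:
    # E union E -> E and E intersect E -> E (plain set identities).
    if e.op in ("union", "intersect") and e.args[0] == e.args[1]:
        return e.args[0]
    return None


def empty_absorption(e: Expr) -> Optional[Expr]:
    # Anything intersected with, or selected from, the empty set is empty;
    # the empty set is the identity element for union.
    if e.op == "intersect" and EMPTY in e.args:
        return EMPTY
    if e.op == "union" and EMPTY in e.args:
        a, b = e.args
        return b if a == EMPTY else a
    if e.op == "select" and e.args[0] == EMPTY:
        return EMPTY
    return None


# Fixed rule order: this is what makes the rewriting deterministic.
RULES: Tuple[Rule, ...] = (idempotence, empty_absorption)


def rewrite(e: Expr) -> Expr:
    """Rewrite bottom-up until no rule applies. Every rule shrinks the tree,
    so the recursion terminates."""
    e = Expr(e.op, tuple(rewrite(a) for a in e.args), e.label)
    for rule in RULES:
        out = rule(e)
        if out is not None:
            return rewrite(out)
    return e


query = union(
    intersect(select("p", name("a")), select("p", name("a"))),
    intersect(name("b"), EMPTY),
)
optimized = rewrite(query)
print(optimized)
```

Here the duplicated intersection collapses to a single selection, the intersection with the empty set vanishes, and the result is `select("p", name("a"))`. Because each input has exactly one rewrite sequence, optimization time depends on the size of the expression rather than on a search space of candidate plans.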
Citations
•Proceedings Article
Inferring XML schema definitions from XML data
Geert Jan Bex, Frank Neven, Stijn Vansummeren
- 23 Sep 2007
TL;DR: A theoretically complete algorithm is provided that always infers the correct XSD when a sufficiently large corpus of XML documents is available and a variant of this algorithm is presented that works well on real-world data sets.
Learning deterministic regular expressions for the inference of schemas from XML data
Geert Jan Bex, Wouter Gelade, Frank Neven, Stijn Vansummeren
- 21 Apr 2008
TL;DR: A probabilistic algorithm that learns k-occurrence regular expressions for increasing values of k, and selects the one that best describes the sample based on a Minimum Description Length argument is provided.
Data storage practices and query processing in XML databases: a survey
Su-Cheng Haw, Chien-Sing Lee
TL;DR: An indexing classification scheme is suggested, and some of the current trends in indexing methods, which indicate a clear shift towards hybrid indexing, are discussed.
Patent
Method and apparatus for XML query evaluation using early-outs and multiple passes
Fatma Ozcan, Edison Lao Ting
- 20 Dec 2006
TL;DR: A method and apparatus for evaluating an XML query using early-outs and multiple passes is described, which allows the evaluation to exit early when the veracity of the query is determined, not necessarily when every component has been evaluated.
Simplifying XML schema: effortless handling of nondeterministic regular expressions
Geert Jan Bex, Wouter Gelade, Wim Martens, Frank Neven
- 29 Jun 2009
TL;DR: Solutions to relieve users from the burden of the Unique Particle Attribution (UPA) constraint are investigated by automatically transforming nondeterministic expressions into concise deterministic ones that define the same language or constitute good approximations.
References
•Proceedings Article
Relational Databases for Querying XML Documents: Limitations and Opportunities
Jayavel Shanmugasundaram, Kristin Tufte, Chun Zhang, Gang He, David J. DeWitt, Jeffrey F. Naughton
- 07 Sep 1999
TL;DR: It turns out that the relational approach can handle most (but not all) of the semantics of semi-structured queries over XML data, but is likely to be effective only in some cases.
On supporting containment queries in relational database management systems
Chun Zhang, Jeffrey F. Naughton, David J. DeWitt, Qiong Luo, Guy M. Lohman
- 01 May 2001
TL;DR: The results suggest that, contrary to most expectations, with some modifications, a native implementation in an RDBMS can support this class of query much more efficiently.
Structural joins: a primitive for efficient XML query pattern matching
Shurug Al-Khalifa, H. V. Jagadish, Nick Koudas, Jignesh M. Patel, Divesh Srivastava, Yuqing Wu
- 07 Aug 2002
TL;DR: It is shown that, while tree-merge algorithms can in some cases have performance comparable to stack-tree algorithms, in many cases they are considerably worse. This behavior is explained by analytical results demonstrating that, on sorted inputs, the stack-tree algorithms have worst-case I/O and CPU complexities linear in the sum of the sizes of inputs and output, while the tree-merge algorithms do not have the same guarantee.
•Proceedings Article
Indexing and Querying XML Data for Regular Path Expressions
Quanzhong Li, Bongki Moon
- 11 Sep 2001
TL;DR: A new system for indexing and storing XML data is proposed, based on a numbering scheme for elements that quickly determines the ancestor-descendant relationship between elements in the hierarchy of XML data.
Efficient algorithms for processing XPath queries
TL;DR: It is shown that XPath can be processed much more efficiently, and main-memory algorithms with polynomial-time combined query evaluation complexity are proposed that can be profitably integrated into existing XPath processors.