Parallel processing of XML databases

doi:10.1109/CCECE.2005.1557377

Proceedings Article10.1109/CCECE.2005.1557377

Parallel processing of XML databases

Ghassan Z. Qadah

- 01 May 2005

- pp 2000-2004

7

TL;DR: This paper examines several techniques for structuring and storing XML data across the different cluster nodes and develops a number of algorithms suitable for processing a certain class of queries, namely, the containment queries, against the parallel XML database.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/IPDPS.2008.4536240

Simultaneous transducers for data-parallel XML parsing

Yinfei Pan, +2 more

- 14 Apr 2008

TL;DR: This work parallelize the preparsing pass itself by using a simultaneous finite transducer (SFT), which implicitly maintains multiple preparser results and addresses the challenge of determining the correct initial state at beginning of a chunk by simply considering all possible initial states simultaneously.

...read moreread less

28

Proceedings Article•10.1109/ICWS.2008.107

Hybrid Parallelism for XML SAX Parsing

Yinfei Pan, +2 more

- 23 Sep 2008

TL;DR: To handle inherent data dependencies in XML while still allowing reasonable scalability, this work uses a 4-stage software pipeline with a combination of strictly sequential stages and stages that can be further data-parallelized within the stage, a hybrid between pipelined parallelism and data parallelism.

...read moreread less

22

Proceedings Article•10.1109/HIPC.2009.5433187

Speculative p-DFAs for parallel XML parsing

Ying Zhang, +2 more

- 01 Dec 2009

TL;DR: This paper explores the use of speculation to improve the performance of parallel XML parsing by using an initial preparsing stage to build a sketch of the document which is called the skeleton, and shows good performance and scalability on both a 30 CPU Sun E6500 machine running Solaris and a Linux machine with two Intel Xeon L5320 CPUs.

...read moreread less

15

•Proceedings Article•10.5555/1791889.1791908

Parsing XML using parallel traversal of streaming trees

Yinfei Pan, +2 more

- 17 Dec 2008

TL;DR: This paper investigates parallel, SAX-style parsing of XML via a parallel, depth-first traversal of the streaming document, and shows good scalability up to about 6 cores on a Linux platform.

...read moreread less

10

Practical and Theoretical Aspects of a Parallel Twig Join Algorithm for XML Processing using a GPGPU

Lila Shnaiderman, +1 more

- 01 Jan 2012

TL;DR: GPU-Twig as discussed by the authors uses the data and task parallelism of the GPU to perform memory-intensive tasks whereas the CPU is used to perform I/O and resource management, which reduces the running time of queries in comparison with other algorithms on CPU based platforms and multicore based platforms.

...read moreread less

7

References

A Relational Model of Data Large Shared Data Banks

E. F. Codd

- 01 Jan 1970

TL;DR: In this paper, a model based on n-ary relations, a normal form for data base relations, and the concept of a universal data sublanguage are introduced, and certain operations on relations are discussed and applied to the problems of redundancy and consistency in the user's model.

...read moreread less

4.4K

A Relational Model of Data for Large Shared Data Banks (Original Manuscript)

E. F. Codd

- 01 Jan 1970

TL;DR: A model based on n-ary relations, a normal form for data base relations, and the concept of a universal data sublanguage are introduced and certain operations on relations are discussed and applied to the problems of redundancy and consistency in the user's model.

...read moreread less

2.6K

Proceedings Article•10.1145/375663.375722

On supporting containment queries in relational database management systems

Chun Zhang, +4 more

- 01 May 2001

TL;DR: The results suggest that contrary to most expectations, with some modifications, a native implementations in an RDBMS can support this class of query much more efficiently.

...read moreread less

955

Proceedings Article•10.1145/308386.308429

Object-oriented database systems

François Banciihon

- 01 Mar 1988

TL;DR: This paper describes the vision of the current state of object-oriented database research, and describes what it considers to be the main characteristics of an object oriented system: encapsulation, object identity, classes or types, inheritance, overriding and late binding.

...read moreread less

293

•Book

Object-Oriented Database Systems

Elisa Bertino, +1 more

- 01 Jan 1993

Abstract: Object-oriented data models query languages versions evolution authorization query processing storage management and indexing techniques systems definition of covariance and contravariance formulation of derived parameters for the cost model concludions and future developments.

...read moreread less

201

Parallel processing of XML databases

Chat with Paper

AI Agents for this Paper

Citations

Simultaneous transducers for data-parallel XML parsing

Hybrid Parallelism for XML SAX Parsing

Speculative p-DFAs for parallel XML parsing

Parsing XML using parallel traversal of streaming trees

Practical and Theoretical Aspects of a Parallel Twig Join Algorithm for XML Processing using a GPGPU

References

A Relational Model of Data Large Shared Data Banks

A Relational Model of Data for Large Shared Data Banks (Original Manuscript)

On supporting containment queries in relational database management systems

Object-oriented database systems

Object-Oriented Database Systems

Related Papers (5)

Querying XML Data using PC Cluster System

Processing XPath Queries in PC-Clusters Using XML Data Partitioning

Incremental fusion of XML fragments through semantic identifiers

Processing Queries over Distributed XML Databases

A Parallel Approach to XML Parsing