Top 7 papers published on the topic of Streaming XML in 2019

Showing papers on "Streaming XML" published in 2019

Journal Article•10.1007/S00450-013-0253-5•

Using versioned trees, change detection and node identity for three-way XML merging

Cheng Thao¹, Ethan V. Munson²•Institutions (2)

University of Wisconsin–Whitewater¹, University of Wisconsin–Milwaukee²

01 Mar 2019-Computer Science - Research and Development

TL;DR: Presents a three-way XML merge algorithm that is faster, uses less memory, and is more precise than previous algorithms, built on a specialized versioning tree data structure that supports node identity and change detection.

Abstract: XML has become the standard document representation for many popular tools in various domains. When multiple authors collaborate to produce a document, they must be able to work in parallel and periodically merge their efforts into a single work. While there exist a small number of three-way XML merging tools, their performance could be improved in several areas. We present a three-way XML merge algorithm that is faster, uses less memory and is more precise than previous algorithms. It uses a specialized versioning tree data structure that supports node identity and change detection. The algorithm applies the traditional three-way merge found in GNU diff3 to the children of changed nodes. The editing operations it supports are addition, deletion, update, and move. The algorithm is evaluated by comparing its performance to that of the previous algorithms, using synthetically generated XML documents of a range of sizes and modified by varying numbers of random editing operations. The prototype merge tool used in these tests also includes a simple graphical interface for visualizing and resolving conflicts.
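The three-way decision rule mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes each version is a flat mapping from a stable node identifier to node content (the `Version` type and the `three_way_merge` function are hypothetical names), it ignores tree structure and move operations, and it keeps the base content when both sides change a node differently.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: stable node id -> node content.
Version = Dict[str, str]


def three_way_merge(
    base: Version, left: Version, right: Version
) -> Tuple[Version, List[str]]:
    """Merge two edited versions against their common base, node by node."""
    merged: Version = {}
    conflicts: List[str] = []
    for node_id in sorted(set(base) | set(left) | set(right)):
        b = base.get(node_id)
        l = left.get(node_id)
        r = right.get(node_id)
        if l == r:
            chosen = l  # both sides agree (or both deleted the node)
        elif l == b:
            chosen = r  # only the right side changed this node
        elif r == b:
            chosen = l  # only the left side changed this node
        else:
            # Both sides changed the node differently: report a conflict
            # and fall back to the base content (an assumption of this sketch).
            conflicts.append(node_id)
            chosen = b
        if chosen is not None:  # None means the node is absent or deleted
            merged[node_id] = chosen
    return merged, conflicts
```

Because nodes are matched by identifier rather than by position, an update on one side and a deletion on the other are each detected with a single comparison against the base.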

5 citations

Journal Article•10.1002/DAC.4122•

An improvised indexing technique for XML data over multiple channels in wireless environment: (1, Xm) method

Vikas Goel¹, Deepali Gautam¹, Amit Kumar Gupta², Sachin Kumar¹•Institutions (2)

Ajay Kumar Garg Engineering College¹, Krishna Institute of Engineering and Technology²

05 Aug 2019-International Journal of Communication Systems

TL;DR: This paper proposes the (1, Xm) method, which distributes XML data over multiple channels and uses indexing to send it on two different channels: an index channel and a data channel.

1 citation

Repository•10.5281/zenodo.12770398•

Leveraging Apache Hive External Tables for Efficient XML Data Processing

Pankaj Dureja

28 Feb 2019

Abstract: This document explores the utilization of Apache Hive external tables for efficient XML data processing. XML (eXtensible Markup Language) is a widely used format for data interchange, and processing XML data efficiently poses challenges, especially when dealing with large datasets. Apache Hive, a data warehousing infrastructure built on top of Hadoop, offers a solution for processing structured data by providing a SQL-like interface. By leveraging Apache Hive external tables, XML data can be efficiently processed and queried in a distributed environment. This paper discusses the benefits of using external tables for XML data processing, provides a step-by-step guide for setting up and querying XML data in Apache Hive, and presents performance benchmarks demonstrating the efficiency of this approach.

Repository•10.5281/zenodo.3592306•

Schema based storage of xml documents in relational databases

Dr. Pushpa Suri, Divyesh Sharma

24 Dec 2019

Abstract: XML (Extensible Markup Language) is emerging as a tool for representing and exchanging data over the internet. To store and query XML data, we can use one of two approaches: native XML databases or XML-enabled databases. In this paper we deal with XML-enabled databases, using relational databases to store XML documents. We focus on mapping an XML DTD into relations. The mapping needs three steps: 1) simplify complex DTDs, 2) build a DTD graph from the simplified DTDs, 3) generate the relational schema. We present an inlining algorithm for generating relational schemas from available DTDs. This algorithm also handles recursion in an XML document.
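The last step of the mapping (DTD graph to relational schema) can be sketched as follows. This is an illustration only, loosely in the style of inlining approaches, and not the algorithm from the paper. It assumes a simplified DTD graph given as a dictionary from element name to `(child, occurrence)` pairs, where occurrence is `"1"` or `"*"`; the names `DtdGraph` and `generate_schema` and the column naming are hypothetical. An element gets its own relation if it is the root, can repeat under a parent, or has more than one parent; every other element is inlined into its ancestor's relation.

```python
from typing import Dict, List, Tuple

# Hypothetical simplified DTD graph: element -> [(child, occurrence)],
# where occurrence is "1" (at most once) or "*" (repeatable).
DtdGraph = Dict[str, List[Tuple[str, str]]]


def generate_schema(graph: DtdGraph, root: str) -> Dict[str, List[str]]:
    """Return relation name -> column names for a simplified DTD graph."""
    in_degree: Dict[str, int] = {}
    repeated = set()
    for children in graph.values():
        for child, occurrence in children:
            in_degree[child] = in_degree.get(child, 0) + 1
            if occurrence == "*":
                repeated.add(child)

    # Root, repeatable, and shared elements are stored in their own relation.
    shared = {name for name, degree in in_degree.items() if degree > 1}
    own_table = {root} | repeated | shared

    def inline(element: str, prefix: str, columns: List[str]) -> None:
        for child, _ in graph.get(element, []):
            if child in own_table:
                continue  # lives in its own relation, linked via parent_id
            path = prefix + child
            if graph.get(child):
                inline(child, path + "_", columns)  # inline nested children
            else:
                columns.append(path)  # leaf element becomes a column

    schema: Dict[str, List[str]] = {}
    for element in sorted(own_table):
        columns = ["id"]
        if in_degree.get(element, 0) > 0:
            columns.append("parent_id")
        if not graph.get(element):
            columns.append("value")
        inline(element, "", columns)
        schema[element] = columns
    return schema
```

Recursion does not cause unbounded inlining in this sketch, because any element on a cycle reachable from the root is either the root itself or has more than one incoming edge, so it receives its own relation and the inlining stops there.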

Repository•10.5281/zenodo.3563455•

Relational storage for xml rules

A. A. Abd El Aziz, A. Kannan

27 Dec 2019

Abstract: Very little research has been done on XML security over relational databases, even though XML has become the de facto standard for data representation and exchange on the internet and many XML documents are stored in RDBMSs. In [14], the author proposed an access control model for schema-based storage of XML documents in relational storage, translating XML access control rules to relational access control rules. However, the proposed algorithms had performance drawbacks. In this paper, we use the same access control model as [14] and try to overcome its drawbacks by proposing an efficient technique to store the XML access control rules in a relational storage of the XML DTD. The mapping of the XML DTD to a relational schema is proposed in [7]. We also propose an algorithm to translate XPath queries to SQL queries based on the mapping algorithm in [7].

Repository•10.5281/zenodo.3375193•

XML Document Probabilistic Clustering Based on Structure and Content

Hassan Naderi, Mojtaba Rashidi

23 Aug 2019

Abstract: A large volume of information is stored in XML format on the Web, and clustering is a management method for these documents. Most current methods for clustering XML documents consider only one of two aspects, structure or content. In this paper, we propose SCEM (Expectation Maximization Structure and Content), which effectively clusters XML documents by combining content and structural features. The other contribution of this paper is that we use probabilistic distributions in such a way that they have probability parameters corresponding to one cluster. In this way, we obtain better effectiveness than other clustering methods, owing to generality. Experimental results on real datasets show the effectiveness of the proposed method, particularly when it is applied to large XML documents without a schema. It can also be used to improve the accuracy and effectiveness of XML information retrieval.

Dataset•10.7910/dvn/2jt5jk/4xp0jj•

States1840.xml.xml

David Bateman

01 Jan 2019

Abstract: Shapefiles and related files