Top 307 papers published in the topic of Streaming XML in 2011

Showing papers on "Streaming XML published in 2011"

Journal Article•10.1016/J.KNOSYS.2011.06.006•

Data storage practices and query processing in XML databases: a survey

[...]

Su-Cheng Haw¹, Chien-Sing Lee¹•Institutions (1)

01 Dec 2011-Knowledge Based Systems

TL;DR: An indexing classification scheme is suggested and some of the current trends in indexing methods, which indicate a clear shift towards hybrid indexing are discussed, are discussed.

...read moreread less

Abstract: With the rapid emergence of XML as a data exchange standard over the Web, storing and querying XML data have become critical issues. The two main approaches to storing XML data are (1) to employ traditional storage such as relational database, object-oriented database and so on, and (2) to create an XML-specific native storage. The storage representation affects the efficiency of query processing. In this paper, firstly, we review the two approaches for storing XML data. Secondly, we review various query optimization techniques such as indexing, labeling and join algorithms to enhance query processing in both approaches. Next, we suggest an indexing classification scheme and discuss some of the current trends in indexing methods, which indicate a clear shift towards hybrid indexing.

...read moreread less

61 citations

Journal Article•10.1007/S11280-011-0128-2•

Processing keyword search on XML: a survey

[...]

Ziyang Liu¹, Yi Chen¹•Institutions (1)

Arizona State University¹

01 Oct 2011-World Wide Web

TL;DR: This survey divides the existing approaches to keyword search on XML into several classes based on the problem they tackled, and performs a comprehensive analysis of these works.

...read moreread less

Abstract: Keyword search is a user-friendly approach for users to retrieve information from XML data. Since an XML document can have a large size and contain a lot of information, an XML keyword search result should be a fragment of an XML document dynamically constructed at query time, which is achievable due to the structuredness of XML. Processing keyword searches on XML has several challenges, e.g., what are the elements in the XML document that are relevant to the query? How to generate the results efficiently and rank the results meaningfully? How to present the results to the user in a way such that the user can quickly find the desired information? In this survey, we review the papers in the literature that attempted to address these problems. We divide the existing approaches into several classes based on the problem they tackled, and perform a comprehensive analysis of these works.

...read moreread less

53 citations

Journal Article•10.1016/J.DATAK.2010.08.001•

Indexing and querying XML using extended Dewey labeling scheme

[...]

Jiaheng Lu¹, Xiaofeng Meng¹, Tok Wang Ling¹•Institutions (1)

Renmin University of China¹

1 Jan 2011

TL;DR: A novel labeling scheme is introduced, called extended Dewey, which effectively extends the existing Dewey labeling scheme to combine the types and identifiers of elements in a label, and to avoid the scan of labels for internal query nodes to accelerate query processing (in I/O cost).

...read moreread less

Abstract: Finding all the occurrences of a tree pattern in an XML database is a core operation for efficient evaluation of XML queries. The Dewey labeling scheme is commonly used to label an XML document to facilitate XML query processing by recording information on the path of an element. In order to improve the efficiency of XML tree pattern matching, we introduce a novel labeling scheme, called extended Dewey, which effectively extends the existing Dewey labeling scheme to combine the types and identifiers of elements in a label, and to avoid the scan of labels for internal query nodes to accelerate query processing (in I/O cost). Based on extended Dewey, we propose a series of holistic XML tree pattern matching algorithms. We first present TJFast to answer an XML twig pattern query. To efficiently answer a generalized XML tree pattern, we then propose GTJFast, an optimization that exploits the non-output nodes. In addition, we propose TJFastTL and GTJFastTL based on the tag+level data partition scheme to further reduce I/O costs by level pruning. Finally, we report our comprehensive experimental results to show that our set of XML tree pattern matching algorithms are superior to existing approaches in terms of the number of elements scanned, the size of intermediate results and query performance.

...read moreread less

38 citations

Proceedings Article•10.1109/IWSSCLOUD.2011.6049019•

On the effectiveness of XML Schema validation for countering XML Signature Wrapping attacks

[...]

Meiko Jensen¹, Christopher Meyer¹, Juraj Somorovsky¹, Jörg Schwenk¹•Institutions (1)

Ruhr University Bochum¹

20 Oct 2011

TL;DR: It is concluded that xml Schema validation with a hardened XML Schema is capable of fending XML Signature Wrapping attacks, but bears some pitfalls and disadvantages as well.

...read moreread less

Abstract: In the context of security of Web Services, the XML Signature Wrapping attack technique has lately received increasing attention. Following a broad range of real-world exploits, general interest in applicable countermeasures rises. However, few approaches for countering these attacks have been investigated closely enough to make any claims about their effectiveness. In this paper, we analyze the effectiveness of the specific countermeasure of XML Schema validation in terms of fending Signature Wrapping attacks. We investigate the problems of XML Schema validation for Web Services messages, and discuss the approach of Schema Hardening, a technique for strengthening XML Schema declarations. We conclude that XML Schema validation with a hardened XML Schema is capable of fending XML Signature Wrapping attacks, but bears some pitfalls and disadvantages as well.

...read moreread less

37 citations

Having a ChuQL at XML on the Cloud.

[...]

Shahan Khatchadourian¹, Mariano P. Consens¹, Jérôme Siméon²•Institutions (2)

University of Toronto¹, IBM²

1 Jan 2011

TL;DR: The ChuQL language incorporates records to support the key/value data model of MapReduce, leverages higher-order functions to provide clean semantics, and exploits side-effects to fully expose to XQuery developers the Hadoop framework.

...read moreread less

Abstract: MapReduce/Hadoop has gained acceptance as a framework to process, transform, integrate, and analyze massive amounts of Web data on the Cloud The MapReduce model (simple, fault tolerant, data parallelism on elastic clouds of commodity servers) is also attractive for processing enterprise and scientific data Despite XML ubiquity, there is yet little support for XML processing on top of MapReduce In this paper, we describe ChuQL, a MapReduce extension to XQuery, with its corresponding Hadoop implementation The ChuQL language incorporates records to support the key/value data model of MapReduce, leverages higher-order functions to provide clean semantics, and exploits side-effects to fully expose to XQuery developers the Hadoop framework The ChuQL implementation distributes computation to multiple XQuery engines, providing developers with an expressive language to describe tasks over big data

...read moreread less

30 citations

Proceedings Article•

XSLT transformation generating OWL ontologies automatically based on XML Schemas

[...]

Thomas Bosch¹, Brigitte Mathiak¹•Institutions (1)

Leibniz Association¹

1 Dec 2011

TL;DR: The aim of this paper is to show the implementation of the general approach transforming any XML Schemas into generated ontologies automatically using XSLT.

...read moreread less

Abstract: Designing domain ontologies from scratch is a time-consuming process. In many cases, both the terminologies and the syntactic structures of domain data models are already described in form of XML Schemas. XSLT transformations are used to lift the syntactic level of XML documents to the semantic level of OWL ontologies by mapping any XML Schemas to generated ontologies automatically. Ontology engineers base domain ontologies on generated ontologies to enrich the information located in the XML schemas with additional domain specific semantic information. The aim of this paper is to show the implementation of the general approach transforming any XML Schemas into generated ontologies automatically using XSLT.

...read moreread less

22 citations

Proceedings Article•10.1109/ETFA.2011.6059002•

Optimized XML-based Web service generation for service communication in restricted embedded environments

[...]

Sebastian Käbisch¹, Daniel Peintner¹, Jörg Heuer¹, Harald Kosch²•Institutions (2)

Siemens¹, University of Passau²

24 Oct 2011

TL;DR: Evaluation results based on the dataset from the ISO/IEC standardization of the vehicle to grid communication interface (V2G CI) prove the applicability of the generated XML-based Web services of restricted devices in terms of message size, performance, and code footprint.

...read moreread less

Abstract: Embedded network programming remains a highly complex task for developers since unique characteristics of such networks have to be faced: one of them is the communication between a diversity of resource constraint nodes. Another one is the infrastructure dynamics. The widely-used standardized Web service technologies would perfectly meet such unique characteristics and ease the development of applications. Such technologies that enable, e.g., requesting or subscribing service data, however, process usually plain XML documents which are not suitable for small embedded devices with very limited resources. This is due to XML's verbosity, its bandwidth usage, and its associated processing overhead. The paper addresses these issues and describes an innovative and optimized source code generation technique by means of W3C's Efficient XML Interchange (EXI) format for developing XML-based Web services for the embedded domain. This offers developers a seamless use of the wide-spread service protocols in the embedded domain as well. Evaluation results based on the dataset from the ISO/IEC standardization of the vehicle to grid communication interface (V2G CI) prove the applicability of the generated XML-based Web services of restricted devices in terms of message size, performance, and code footprint.

...read moreread less

21 citations

Proceedings Article•10.1109/ICDE.2011.5767951•

Updating XML schemas and associated documents through exup

[...]

Federico Cavalieri¹, Giovanna Guerrini¹, Marco Mesiti²•Institutions (2)

University of Genoa¹, University of Milan²

11 Apr 2011

TL;DR: An overview of the facilities of the XSUpdate language and of the Eχup system is provided to provide an insight into the functioning of this engine for processing schema modification and document adaptation statements.

...read moreread less

Abstract: Data on the Web mostly are in XML format and the need often arises to update their structure, commonly described by an XML Schema. When a schema is modified the effects of the modification on documents need to be faced. XSUpdate is a language that allows to easily identify parts of an XML Schema, apply a modification primitive on them and finally define an adaptation for associated documents, while Eχup is the corresponding engine for processing schema modification and document adaptation statements. Purpose of this demonstration is to provide an overview of the facilities of the XSUpdate language and of the Eχup system.

...read moreread less

21 citations

Patent•

System and method for non-programmers to dynamically manage multiple sets of XML document data

[...]

Richard William VanderDrift¹•Institutions (1)

Wilmington University¹

3 Jun 2011

TL;DR: A system and method for dynamically retrieving, manipulating, updating, creating, and displaying data from sources of Extensible Markup Language (XML) documents is presented in this article.

...read moreread less

Abstract: A system and method for dynamically retrieving, manipulating, updating, creating, and displaying data from sources of Extensible Markup Language (XML) documents. The program memory comprises system-user entered data definitions and business rules. The system imports XML document data into the system data definitions, processes the data using the business rules definitions and exports XML documents. The system can automatically create XML document formats from its data definitions and can automatically create its data definitions from XML document formats. The system-user can also define the mapping between XML document formats and the system data definitions. The system data definition is the combination of a Relational data model, an Object data model, and an XML data model.

...read moreread less

19 citations

Book Chapter•10.1007/978-3-642-23737-9_27•

XML data transformations as schema evolves

[...]

Jakub Malý¹, Irena Mlýnková¹, Martin Nečaský¹•Institutions (1)

Charles University in Prague¹

20 Sep 2011

TL;DR: The approach presented in this paper extends an existing XML conceptual model with the support for multiple versions of the model, and it is possible to define a set of changes between two versions of a schema.

...read moreread less

Abstract: One of the key characteristics of XML applications is their dynamic nature. When a system grows and evolves, old user requirements change and/or new requirements accumulate. Apart from changes in the interface, it is also necessary to modify the existing documents with each new version, so they are valid against the new specification. The approach presented in this paper extends an existing XML conceptual model with the support for multiple versions of the model. Thanks to this extension, it is possible to define a set of changes between two versions of a schema. This work contains an outline of an algorithm that compares two versions of a schema and produces a revalidation script in XSL.

...read moreread less

17 citations

Journal Article•10.1016/J.CAGEO.2010.09.010•

Providing access to satellite imagery through OGC catalog service interfaces in support of the Global Earth Observation System of Systems

[...]

Yuqi Bai¹, Liping Di¹•Institutions (1)

George Mason University¹

01 Apr 2011-Computers & Geosciences

TL;DR: This study investigates the characteristics and challenges in building Open Geospatial Consortium Inc. (OGC) catalog service, and presents a general lightweight XML adapter for relational tables, followed by a general OGC catalog service solution based on this adapter.

...read moreread less

Journal Article•10.1016/J.DATAK.2011.07.005•

Mapping between heterogeneous XML and OWL transaction representations in B2B integration

[...]

Jorge Cardoso¹, Christoph Bussler•Institutions (1)

University of Coimbra¹

1 Dec 2011

TL;DR: A conceptual approach, and its implementation, to integrate external syntactic data representations with organizational internal semantic data representations by using the notion of heterogeneous mappings which are established between the two types of representations are presented.

...read moreread less

Abstract: XML-based standards have been widely used to enable and ease Business-to-Business (B2B) integration. Examples of standards include cXML, CIDX and ebXML. While these XML-based standards are syntactic, contemporary organizations have available new means to structure their internal data representations using semantic descriptions, such as RDF(S) and OWL. This scenario poses an interesting challenge: ''How to reconcile external XML-based standards and internal OWL-based representations in B2B integration scenarios?'' In this paper, we present a conceptual approach, and its implementation, to integrate external syntactic data representations with organizational internal semantic data representations by using the notion of heterogeneous mappings which are established between the two types of representations. The application developed, B2BISS, enables an effective management of mappings. As the number of mappings stored in the repository increases over time, organizations can gradually rely on a semi-automatic to automatic B2B integration.

...read moreread less

Journal Article•10.2478/S13537-011-0005-1•

XML document-grammar comparison: related problems and applications

[...]

Joe Tekli¹, Richard Chbeir, Agma J. M. Traina¹, Caetano Traina¹•Institutions (1)

University of São Paulo¹

25 Mar 2011

TL;DR: An overview on existing research related to XML document/grammar comparison is provided, presenting the background and discussing the various techniques related to the problem, as well as discussing some prominent application domains.

...read moreread less

Abstract: XML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to the problem. We also discuss some prominent application domains, ranging over document classification and clustering, document transformation, grammar evolution, selective dissemination of XML information, XML querying, as well as alert filtering in intrusion detection systems and Web Services matching and communications.

...read moreread less

Proceedings Article•10.1145/1951365.1951388•

Algebraic incremental maintenance of XML views

[...]

Angela Bonifati¹, Martin Hugh Goodfellow², Ioana Manolescu¹, Domenica Sileo³•Institutions (3)

French Institute for Research in Computer Science and Automation¹, University of Strathclyde², University of Basilicata³

21 Mar 2011

TL;DR: This work presents an algebraic approach for propagating source updates to XML materialized views expressed in a powerful XML tree pattern formalism and highlights the benefits of this approach over existing algorithms through a series of experiments.

...read moreread less

Abstract: Materialized views can bring important performance benefits when querying XML documents. In the presence of XML document changes, materialized views need to be updated to faithfully reflect the changed document. In this work, we present an algebraic approach for propagating source updates to XML materialized views expressed in a powerful XML tree pattern formalism. Our approach differs from the state of the art in the area in two important ways. First, it relies on set-oriented, algebraic operations, to be contrasted with node-based previous approaches. Second, it exploits state-of-the-art features of XML stores and XML query evaluation engines, notably XML structural identifiers and associated structural join algorithms. We present algorithms for determining how updates should be propagated to views, and highlight the benefits of our approach over existing algorithms through a series of experiments.

...read moreread less

Patent•

System and Method Using A Simplified XML Format for Real-Time Content Publication

[...]

Sameer Merchant, Gerald Bueshel, Jules Michael McLeod, John Marshall

1 Aug 2011

TL;DR: In this paper, a system and method for delivering content in real-time using advanced messaging technology that reduces the risk of content being lost or dropped in transmission is presented, using a custom simplified XML format to deliver realtime textual, numeric, and metadata content directly to subscribers.

...read moreread less

Abstract: A system and method for delivering content in real-time using advanced messaging technology that reduces the risk of content being lost or dropped in transmission. The system and method utilize a custom, simplified XML format to deliver real-time textual, numeric, and metadata content directly to subscribers. The XML tag set specifies all of the information needed to package, process, and distribute real-time content messages and includes an advanced tagging structure that allows granular content customization. Messages are built on the fly using multi-channel data processing techniques. The XML delivery system and method offers an array of real-time market-specific page-based “Alert” services and aggregated newswires with accompanying real-time numeric data feeds. These feeds contain proprietary assessments and other price data across a broad spectrum of global and regional commodity markets, including oil, petrochemicals, metals, electric power, natural gas, coal, and risk.

...read moreread less

Book Chapter•10.1007/978-3-642-22630-4_4•

TwigTable: using semantics in XML twig pattern query processing

[...]

Huayu Wu¹, Tok Wang Ling¹, Bo Chen¹, Liang Xu¹•Institutions (1)

National University of Singapore¹

01 Jan 2011-Journal on Data Semantics

TL;DR: This paper designs TwigTable algorithm to incorporate property and value information into query processing, and proposes three object-based optimization techniques to Twig table that can be correctly discovered in any XML data.

...read moreread less

Abstract: In this paper, we demonstrate how the semantic information, such as value, property, object class and relationship between object classes in XML data impacts XML query processing. We show that the lack of using semantics causes different problems in value management and content search in existing approaches. Motivated on solving these problems, we propose a semantic approach for XML twig pattern query processing. In particular, we design TwigTable algorithm to incorporate property and value information into query processing. This information can be correctly discovered in any XML data. In addition, we propose three object-based optimization techniques to TwigTable. If more semantics of object classes are known in an XML document, we can process queries more efficiently with these semantic optimizations. Last, we show the benefits of our approach by a comprehensive experimental study.

...read moreread less

Journal Article•10.1016/J.TCS.2011.05.047•

Rewriting of visibly pushdown languages for XML data integration

[...]

Alex Thomo¹, S. Venkatesh¹•Institutions (1)

University of Victoria¹

09 Sep 2011-Theoretical Computer Science

TL;DR: This work focuses on XML data integration by studying rewritings of XML target schemas in terms of source schemas, and considers Visibly pushdown Automata (VPAs), which accept Visibly Pushdown Languages (VPLs), which are the basis of formalisms for specifying XML schemas.

...read moreread less

Dissertation•

Maintainability of XML Transformations

[...]

Siim Karus

30 May 2011

Patent•

Hybrid binary XML storage model for efficient XML processing

[...]

Sam Idicula¹, Balasubramanyam Sthanikam¹, Nipun Agarwal¹•Institutions (1)

Business International Corporation¹

5 Dec 2011

TL;DR: In this paper, a hybrid navigation/streaming format for XML documents is proposed to allow efficient storage and processing of queries on the XML data that provides the benefits of both navigation and streaming and ameliorates the disadvantages of each.

...read moreread less

Abstract: A method for storing XML documents a hybrid navigation/streaming format is provided to allow efficient storage and processing of queries on the XML data that provides the benefits of both navigation and streaming and ameliorates the disadvantages of each. Each XML document to be stored is independently analyzed to determine a combination of navigable and streamable storage format that optimizes the processing of the data for anticipated access patterns.

...read moreread less

Patent•

Markup language based query and file generation

[...]

Arnab Sinha¹, Bharat Kumar Thakur¹•Institutions (1)

Tata Consultancy Services¹

21 Jun 2011

TL;DR: In this paper, an XML template having one or more nodes is received and mapping information indicating an association of data and nodes of the uploaded XML template is obtained. Once the mapping is received, the structure of the XML template was determined.

...read moreread less

Abstract: An XML template having one or more nodes is received. Mapping information indicating an association of data and nodes of the uploaded XML template is obtained. Once the mapping is received, the structure of the XML template is determined. Based on the determined structure and the mapping provided, an XML based SQL query is generated. The generated SQL query can be executed to provide the XML document.

...read moreread less

Book Chapter•10.1007/978-3-642-20039-7_10•

A comparative analysis of managing XML data in relational database

[...]

Kamsuriah Ahmad¹•Institutions (1)

National University of Malaysia¹

20 Apr 2011

TL;DR: A new mapping method is developed to overcome the limitations the limitations and shows that it is efficient in terms of removing relation redundancy.

...read moreread less

Abstract: The eXtensible Markup Language (XML) has recently emerged as a standard for data representation and interchange on the web. Based on its popularity used in most application, the critical issues are to store and to query XML data to exploit the full power of this technology. Since relational database is widely used technology for storing and querying, therefore replacing it with pure XML database is not a good choice and very expensive process. It is thus crucial to map XML data into relational data and this process is one that occurs frequently. Many existing methods exist in the literature, and defining what the best mapping method is explicitly important. The intention of this paper is to the existing mapping methods in terms of generating good relational schema. At the end a new mapping method is developed to overcome the limitations the limitations and shows that it is efficient in terms of removing relation redundancy.

...read moreread less

Journal Article•10.1016/J.SCICO.2009.11.007•

XML graphs in program analysis

[...]

Anders Møller¹, Michael I. Schwartzbach¹•Institutions (1)

Aarhus University¹

01 Jun 2011-Science of Computer Programming

TL;DR: A unified definition is presented, the key properties including validation of XML graphs against different XML schema languages are outlined, and a software package is provided that enables others to make use of these ideas.

...read moreread less

XML data model

[...]

Bama

28 Feb 2011

Proceedings Article•10.1109/IRI.2011.6009523•

Mapping OWL ontologies to relational schemas

[...]

Deise de Brum Saccol¹, Tobias de Campos Andrade¹, Eduardo Kessler Piveta¹•Institutions (1)

Universidade Federal de Santa Maria¹

6 Sep 2011

TL;DR: This paper proposes a mechanism for generating the relational schema from a set of integrated XML files, which includes defining aset of mapping rules from the OWL (Ontology Web Language) ontology to the relational format.

...read moreread less

Abstract: Many applications require storing XML data, which can be achieved by using a relational database (RDB). In order to accomplish that, we need a set of transformation rules that maps the XML structure to a collection of relations. However, XML files from the same application domain might have different structures, making the mapping process to a unique relational schema more difficult. To overcome this, we can previously generate an integrated schema that represents the individual XML structures, and then map it to the relational format. Afterwards, the original XML files are stored into the database. In our proposal, the integrated schema is represented as an ontology. In this paper, we propose a mechanism for generating the relational schema from a set of integrated XML files, which includes defining a set of mapping rules from the OWL (Ontology Web Language) ontology to the relational format. The mapping process is implemented in OntoRel tool.

...read moreread less

Journal Article•10.1016/J.TCS.2011.04.037•

Evolving schemas for streaming XML

[...]

Maryam Shoaran¹, Alex Thomo¹•Institutions (1)

University of Victoria¹

01 Aug 2011-Theoretical Computer Science

TL;DR: It is shown that Visibly Pushdown Languages are closed under the defined language operators and this enables us to expand the schemas (for XML) in order to account for flexible or constrained evolution.

...read moreread less

Proceedings Article•10.1109/TIME.2011.17•

Efficient Encoding of Temporal XML Documents

[...]

Mohamed Amine Baazizi, Nicole Bidoit-Tollu, Dario Colazzo

12 Sep 2011

TL;DR: A notion of compactness is formally defined which allows for comparing documents and shows that the update-based method produces time-stamped XML documents that are more satisfactory wrt space-efficiency than the general method.

...read moreread less

Abstract: The management of temporal data is a crucial issue in many applications. Recently, XML has become the standard for data exchange and representation. Consequently, important efforts have been made on the development of temporal extensions for XML. This paper investigates how to generate or maintain space-efficient time-stamped documents. We formally define a notion of compactness which allows for comparing documents. Then, we present two methods. For the first one, called general method, no restriction is made on the evolution of the XML documents whereas for the second one, called update-based method, changes are assumed to be specified by updates. For both methods, the issue is to enable processing very large documents, to use existing engines and to comply to Xquery Update Facility. The two methods are compared in terms of space-efficiency. The update-based method produces time-stamped XML documents that are more satisfactory wrt space-efficiency than the general method. This goes to show that the update-based method effectively takes advantage of the updates.

...read moreread less

Journal Article•

XCleaner: A New Method for Clustering XML Documents by Structure

[...]

Dariusz Brzezinski, Anna Leśniewska, Tadeusz Morzy, Maciej Piernik

01 Jan 2011-Control and Cybernetics

TL;DR: A new XML clustering algorithm that relies solely on document structure and the use of maximal frequent subtrees and an operator called Satisfy/Violate to divide documents into groups is put forward.

...read moreread less

Abstract: With the vastly growing data resources on the In- ternet, XML is one of the most important standards for document management. Not only does it provide enhancements to document exchange and storage, but it is also helpful in a variety of informa- tion retrieval tasks. Document clustering is one of the most inter- esting research areas that utilize XML's semi-structural nature. In this paper, we put forward a new XML clustering algorithm that relies solely on document structure. We propose the use of maximal frequent subtrees and an operator called Satisfy/Violate to divide documents into groups. The algorithm is experimentally evaluated on real and synthetic data sets with promising results.

...read moreread less

Proceedings Article•10.1109/WAINA.2011.13•

Generating Lowering and Lifting Schema Mappings for Semantic Web Services

[...]

Jakub Klímek¹, Martin Nečaský¹•Institutions (1)

Charles University in Prague¹

22 Mar 2011

TL;DR: A conceptual model for XML data is exploited to generate SAWSDL enriched XML schemas, but mainly to automatically generate the so called Lifting and Lowering schema mappings in a form of XSLT scripts.

...read moreread less

Abstract: With the introduction of the SAWSDL W3C recommendation, the possibility of enriching web service interfaces with semantic model references surfaced as a foundation for semantic web services. However, the recommendation says neither what the semantic model should be nor what to do with the actual XML data. In this paper, we exploit our conceptual model for XML data to generate SAWSDL enriched XML schemas, but mainly to automatically generate the so called Lifting and Lowering schema mappings in a form of XSLT scripts. These scripts can be used to transform the XML data produced by the web service into RDF data (lifting) and vice versa (lowering). In the RDF data state the data can be manipulated using a knowledge given by a corresponding ontology mapped to our model. Also the reasoning power granted by the ontology description can be exploited.

...read moreread less

Proceedings Article•10.1109/PAAP.2011.30•

Parallel Optimization of Queries in XML Dataset Using GPU

[...]

Xujie Si¹, Airu Yin¹, Xiaocheng Huang¹, Xiaojie Yuan¹, Xiaoguang Liu¹, Gang Wang¹ - Show less +2 more•Institutions (1)

Nankai University¹

9 Dec 2011

TL;DR: This work has developed a parallel simplified XPath language using Compute Unified Device Architecture (CUDA) on GPU, and evaluates the model on a recent NVIDIA GPU in comparison with its counterpart on eight-core CPU.

...read moreread less

Abstract: As XML is playing a crucial role in web services, databases, and document processing, efficient processing of XML queries has become an important issue. On the other hand, due to the increasing number of users, high throughput of XML queries is also required to execute tens of thousands of queries in a short time. Given the great success of GPGPU (General-Purpose computations on the Graphics Processors), we propose a parallel XML query model based on GPU, which mainly consists of two efficient task distribution strategies, to improve the efficiency and throughput of XML queries. We have developed a parallel simplified XPath language using Compute Unified Device Architecture (CUDA) on GPU, and evaluate our model on a recent NVIDIA GPU in comparison with its counterpart on eight-core CPU. The experiment results show that our model achieves both higher throughput and efficiency than CPU-based XML query.

...read moreread less

Proceedings Article•10.1145/2063576.2063813•

Tractable XML data exchange via relations

[...]

Rada Chirkova¹, Leonid Libkin², Juan L. Reutter²•Institutions (2)

North Carolina State University¹, University of Edinburgh²

24 Oct 2011

TL;DR: This work isolates a set of five requirements that must be fulfilled in order to have a faithful representation of the XML data-exchange problem by a relational translation, and demonstrates that these requirements naturally suggest the inlining technique for dataexchange tasks.

...read moreread less

Abstract: We consider data exchange for XML documents: given source and target schemas, a mapping between them, and a document conforming to the source schema, construct a target document and answer target queries in a way that is consistent with source information. The problem has primarily been studied in the relational context, in which data-exchange systems have also been built. Since many XML documents are stored in relations, it is natural to consider using a relational system for XML data exchange. However, there is a complexity mismatch between query answering in relational and XML data exchange, which indicates that restrictions have to be imposed on XML schemas and mappings, and on XML shredding schemes, to make the use of relational systems possible. We isolate a set of five requirements that must be fulfilled in order to have a faithful representation of the XML data-exchange problem by a relational translation. We then demonstrate that these requirements naturally suggest the inlining technique for dataexchange tasks. Our key contribution is to provide shredding algorithms for schemas, documents, mappings and queries, and demonstrate that they enable us to correctly perform XML data-exchange tasks using a relational system.

...read moreread less

...

Expand