Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Efficient XML Interchange
  4. 2006
  1. Home
  2. Topics
  3. Efficient XML Interchange
  4. 2006
Showing papers on "Efficient XML Interchange published in 2006"
Web application description language (WADL)

[...]

Marc J. Hadley1•
Sun Microsystems1
1 Jan 2006
TL;DR: This article describes the Web Application Description Language (WADL), designed to provide a machine processable protocol description format for use with such HTTP-based Web applications, especially those using XML.
Abstract: This article describes the Web Application Description Language (WADL). An increasing number of Web-based enterprises (Google, Yahoo, Amazon, Flickr - to name but a few) are developing HTTP-based applications that provide access to their internal data using XML. Typically these applications are described using a combination of textual protocol descriptions combined with XML schema-based data format descriptions; WADL is designed to provide a machine processable protocol description format for use with such HTTP-based Web applications, especially those using XML.

321 citations

Journal Article•10.1109/TKDE.2006.61•
Keyword proximity search in XML trees

[...]

Vagelis Hristidis1, Nick Koudas, Yannis Papakonstantinou2, Divesh Srivastava•
University of Miami1, University of California, Los Angeles2
01 Apr 2006-IEEE Transactions on Knowledge and Data Engineering
TL;DR: This paper defines XML keyword, proximity queries to return the (possibly heterogeneous) set of minimum connecting trees (MCTs) of the matches to the individual keywords in the query to efficiently execute keyword proximity queries on labeled trees (XML).
Abstract: Recent works have shown the benefits of keyword proximity search in querying XML documents in addition to text documents. For example, given query keywords over Shakespeare's plays in XML, the user might be interested in knowing how the keywords cooccur. In this paper, we focus on XML trees and define XML keyword, proximity queries to return the (possibly heterogeneous) set of minimum connecting trees (MCTs) of the matches to the individual keywords in the query. We consider efficiently executing keyword proximity queries on labeled trees (XML) in various settings: 1) when the XML database has been preprocessed and 2) when no indices are available on the XML database. We perform a detailed experimental evaluation to study the benefits of our approach and show that our algorithms considerably outperform prior algorithms and other applicable approaches.

189 citations

Proceedings Article•10.1109/ICGRID.2006.311019•
A Parallel Approach to XML Parsing

[...]

Wei Lu1, Kenneth Chiu2, Yinfei Pan2•
Indiana University1, Binghamton University2
28 Sep 2006
TL;DR: The design and implementation of an initial preparsing phase to determine the structure of the XML document, followed by a full, parallel parse, which shows that the approach applies to real-world, production quality parsers.
Abstract: A language for semi-structured documents, XML has emerged as the core of the Web services architecture, and is playing crucial roles in messaging systems, databases, and document processing However, the processing of XML documents has a reputation for poor performance, and a number of optimizations have been developed to address this performance problem from different perspectives, none of which have been entirely satisfactory In this paper, we present a seemingly quixotic, but novel approach: parallel XML parsing Parallel XML parsing leverages the growing prevalence of multicore architectures in all sectors of the computer market, and yields significant performance improvements This paper presents our design and implementation of parallel XML parsing Our design consists of an initial preparsing phase to determine the structure of the XML document, followed by a full, parallel parse The results of the preparsing phase are used to help partition the XML document for data parallel processing Our parallel parsing phase is a modification of the libxml2 in Veillard, D (2004) XML parser, which shows that our approach applies to real-world, production quality parsers Our empirical study shows our parallel XML parsing algorithm can improved the XML parsing performance significantly and scales well

169 citations

Advances in XML Information Retrieval and Evaluation, 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl Castle, Germany, November 28-30, 2005, Revised Selected Papers

[...]

Gabriella Kazai
1 May 2006
TL;DR: The INEX Environment is used as a Test Bed for Various User Models for XML Retrieval and the Interpretation of CAS is studied.

121 citations

Proceedings Article•10.1145/1135777.1135796•
XML screamer: an integrated approach to high performance XML parsing, validation and deserialization

[...]

Margaret Gaitatzes Kostoulas1, Morris Matsa1, Noah R. Mendelsohn1, Eric Perkins1, Abraham Heifets1, Martha Mercaldi2 •
IBM1, University of Washington2
23 May 2006
TL;DR: This paper describes an experimental system in which customized high performance XML parsers are prepared using parser generation and compilation techniques, and the resulting validating processors are shown to be as fast as or in many cases significantly faster than traditional nonvalidating parsers.
Abstract: This paper describes an experimental system in which customized high performance XML parsers are prepared using parser generation and compilation techniques. Parsing is integrated with Schema-based validation and deserialization, and the resulting validating processors are shown to be as fast as or in many cases significantly faster than traditional nonvalidating parsers. High performance is achieved by integration across layers of software that are traditionally separate, by avoiding unnecessary data copying and transformation, and by careful attention to detail in the generated code. The effect of API design on XML performance is also briefly discussed..

89 citations

Journal Article•10.1145/1228268.1228271•
XML search: languages, INEX and scoring

[...]

Sihem Amer-Yahia1, Mounia Lalmas2•
AT&T Labs1, University of London2
1 Dec 2006
TL;DR: Support for a combination of "structured" and full-text search for effectively querying XML documents was unanimous in a recent panel at SIGMOD 2005, and is being widely studied in the IR community.
Abstract: The development of approaches to access XML content has generated a wealth of issues in information retrieval (IR) and database (DB) (e.g., [2, 15, 17, 20, 19, 47, 26, 32, 24]). While the IR community has traditionally focused on searching unstructured content, and has developed various techniques for ranking query results and evaluating their effectiveness, the DB community has focused on developing query languages and efficient evaluation algorithms for highly structured content. Recent trends in DB and IR research demonstrate a growing interest in merging IR and DB techniques for accessing XML content. Support for a combination of "structured" and full-text search for effectively querying XML documents was unanimous in a recent panel at SIGMOD 2005 [3], and is being widely studied in the IR community [20].

87 citations

Journal Article•10.1145/1178618.1178621•
XML access control using static analysis

[...]

Makoto Murata1, Akihiko Tozawa1, Michiharu Kudo1, Satoshi Hada1•
IBM1
01 Aug 2006-ACM Transactions on Information and System Security
TL;DR: In this paper, a static analysis for XML access-control policies is presented, where the static analysis determines if a query expression is guaranteed not to access elements or attributes that are hidden by the access control policy but permitted by the schema.
Abstract: Access control policies for XML typically use regular path expressions such as XPath for specifying the objects for access-control policies. However such access-control policies are burdens to the query engines for XML documents. To relieve this burden, we introduce static analysis for XML access-control. Given an access-control policy, query expression, and an optional schema, static analysis determines if this query expression is guaranteed not to access elements or attributes that are hidden by the access-control policy but permitted by the schema. Static analysis can be performed without evaluating any query expression against actual XML documents. Run-time checking is required only when static analysis is unable to determine whether to grant or deny access requests. A side effect of static analysis is query optimization: access-denied expressions in queries can be evaluated to empty lists at compile time. We further extend static analysis for handling value-based access-control policies and introduce view schemas.

87 citations

Patent•
Efficient replication of XML data in a relational database management system

[...]

James W. Warner1, Zhen Hua Liu1, Sundeep Abraham1, Muralidhar Krishnaprasad1, Geeta Arora1, Ravi Murthy1, Sivasankaran Chandrasekar1, Lik Wong1, Nimar S. Arora1 •
Business International Corporation1
3 May 2006
TL;DR: In this paper, the authors propose techniques for replicating XML data involved with an insert operation, an update operation, and a sequence of XQuery Data Model sequences to reduce the amount of information that would otherwise need to be transmitted over a network for XML data replication.
Abstract: Efficiently replicating XML data among databases includes techniques for (a) replicating XML data involved with an insert operation; (b) replicating XML data involved with an update operation; (c) leveraging existing relational replication techniques for XML data stored in shredded form using object-relational constructs; and (d) replicating XQuery Data Model sequences. Each technique reduces the amount of information that would otherwise need to be transmitted over a network for XML data replication purposes.

72 citations

Journal Article•10.1007/S10994-006-5832-2•
XRules: An effective algorithm for structural classification of XML data

[...]

Mohammed J. Zaki1, Charu C. Aggarwal2•
Rensselaer Polytechnic Institute1, IBM2
01 Feb 2006-Machine Learning
TL;DR: This paper discusses the problem of rule based classification of XML data by using frequent discriminatory substructures within XML documents and shows the effectiveness of the method with respect to other classifiers.
Abstract: XML documents have recently become ubiquitous because of their varied applicability in a number of applications. Classification is an important problem in the data mining domain, but current classification methods for XML documents use IR-based methods in which each document is treated as a bag of words. Such techniques ignore a significant amount of information hidden inside the documents. In this paper we discuss the problem of rule based classification of XML data by using frequent discriminatory substructures within XML documents. Such a technique is more capable of finding the classification characteristics of documents. In addition, the technique can also be extended to cost sensitive classification. We show the effectiveness of the method with respect to other classifiers. We note that the methodology discussed in this paper is applicable to any kind of semi-structured data.

70 citations

Patent•
Techniques of efficient XML query using combination of XML table index and path/value index

[...]

Zhen Hua Liu1, Muralidhar Krishnaprasad1, Hui Joe Chang1, Vikas Arora1•
Business International Corporation1
17 Nov 2006
TL;DR: In this article, a mechanism for accessing XML data in a database system using a combination of a XML Table Index table and a XML Path Index table is presented, where the XML Path index table gives the database system an ability to navigate to a specific location given a path expression.
Abstract: A mechanism is provided for accessing XML data in a database system using a combination of a XML Table Index table and a XML Path Index table. By using a combination of a XML Table Index and a XML Path Index, both selection access and navigational access involved in a query can be optimized. For example, the XML Table Index gives the database system an ability to readily evaluate the predicate expression, thereby improving the selection access. Moreover, in some embodiments, the selection access can be further improved by using secondary indexes on columns contained in the XML Table Index table. In a complementary manner, the XML Path Index table gives the database system an ability to navigate to a specific location given a path expression, thereby improving the navigational access. Thus, by combining both tables, both selection and navigational accesses are improved.

70 citations

PFTijah: text search in an XML database system

[...]

Djoerd Hiemstra, Henning Rode, R. van Os, Jan Flokstra
1 Jan 2006
TL;DR: The PFTijah system is introduced, a text search system that is integrated with an XML/X query database management system and is part of the open source release of MonetDB/XQuery.
Abstract: This paper introduces the PFTijah system, a text search system that is integrated with an XML/XQuery database management system. We present examples of its use, we explain some of the system internals, and discuss plans for future work. PFTijah is part of the open source release of MonetDB/XQuery.
Journal Article•10.1016/J.CSI.2005.02.004•
XML-based e-business frameworks and standardization

[...]

Juha-Miikka Nurmilaakso1, Paavo Kotinurmi1, Hannu Laesvuori1•
Helsinki University of Technology1
01 Jun 2006-Computer Standards & Interfaces
TL;DR: This paper analyzes the properties and standardization of 12 prominent XML-based e-business frameworks and focuses on the commonalities, differences and regularities between these e- business frameworks and their standardization.
Patent•
Techniques of efficient XML meta-data query using XML table index

[...]

Muralidhar Krishnaprasad1, Zhen Hua Liu1, Hui Joe Chang1, Vikas Arora1, Susan Kotsovolos1 •
Business International Corporation1
31 Mar 2006
TL;DR: XML table indexes as mentioned in this paper are a set of tables created to project out in column form commonly sought metadata from stored XML documents, which can leverage the enhanced functionality provided by the database tables.
Abstract: XML table indexes provide a more efficient mechanism for searching data stored in aggregate form. XML table indexes are a set of tables created to project out in column form commonly sought metadata from stored XML documents. By projecting the data includes into column form, queries on the XML documents can be efficiently processed as they can leverage the enhanced functionality provided by the database tables. The XML table indexes may use aliases, partitioning, constraints and other functions to further improve query flexibility and performance.
Proceedings Article•10.1109/ICDE.2006.175•
XCluster Synopses for Structured XML Content

[...]

Neoklis Polyzotis1, Minos Garofalakis2•
University of California, Santa Cruz1, Intel2
3 Apr 2006
TL;DR: This work introduces a novel XML-summarization model, termed XCLUSTERs, that enables accurate selectivity estimates for the class of twig queries with numeric-range, substring, and textual IR predicates over the content of XML elements.
Abstract: We tackle the difficult problem of summarizing the path/branching structure and value content of an XML database that comprises both numeric and textual values. We introduce a novel XML-summarization model, termed XCLUSTERs, that enables accurate selectivity estimates for the class of twig queries with numeric-range, substring, and textual IR predicates over the content of XML elements. In a nutshell, an XCLUSTER synopsis represents an effective clustering of XML elements based on both their structural and value-based characteristics. By leveraging techniques for summarizing XML-document structure as well as numeric and textual data distributions, our XCLUSTER model provides the first known unified framework for handling path/branching structure and different types of element values. We detail the XCLUSTER model, and develop a systematic framework for the construction of effective XCLUSTER summaries within a specified storage budget. Experimental results on synthetic and real-life data verify the effectiveness of our XCLUSTER synopses, clearly demonstrating their ability to accurately summarize XML databases with mixed-value content. To the best of our knowledge, ours is the first work to address the summarization problem for structured XML content in its full generality.
Journal Article•
GPX - Gardens Point XML IR at INEX 2005

[...]

Shlomo Geva
01 Jan 2006-Lecture Notes in Computer Science
TL;DR: This paper described the approach that was adopted to satisfy the requirements of all the tasks, CAS and CO, in Thorough, Focused, and Fetch Browse mode, using the same underlying system.
Abstract: The INEX 2005 evaluation consisted of numerous tasks that required different approaches. In this paper we described the approach that we adopted to satisfy the requirements of all the tasks, CAS and CO, in Thorough, Focused, and Fetch Browse mode, using the same underlying system The retrieval approach is based on the construction of a collection sub-tree, consisting of all nodes that contain one or more of the search terms. Nodes containing search terms are then assigned a score using a TF_IDF variant, scores are propagated upwards in the document XML tree, and finally all XML elements are ranked. We present results that demonstrate that the approach is versatile and produces consistently good performance across all INEX 2005 tasks.
Patent•
XML message validation in a network infrastructure element

[...]

Sandeep Kumar1, Karempudi Ramarao1, Yuquan Jiang1, Yi Jin1, Tefcros Anthias1 •
Cisco Systems, Inc.1
21 Jun 2006
TL;DR: In this article, a network infrastructure element such as a router or switch performs transparent and optimized validation of XML schemas of XML payloads received in the network element, which comprises logic for receiving and storing one or more validation scope rules that define a portion of an extensible markup language (XML) schema for validation.
Abstract: A network infrastructure element such as a router or switch performs transparent and optimized validation of XML schemas of XML payloads received in the network element. The network element comprises logic for receiving and storing one or more validation scope rules that define a portion of an extensible markup language (XML) schema for validation; receiving and storing the XML schema; receiving over the network an application-layer message comprising one or more of the packets; identifying a particular XML element in an XML payload of the application-layer message, wherein the particular XML element is within the portion of the XML schema defined in the one or more validation scope rules; determining whether the particular XML element conforms to the XML schema; and performing a responsive action based on whether the particular XML element conforms to the XML schema.
Book Chapter•10.1007/11896548_16•
Validity-sensitive querying of XML databases

[...]

Slawomir Staworko1, Jan Chomicki1•
University at Buffalo1
26 Mar 2006
TL;DR: A validity-sensitive method of querying XML documents, which extracts more information from invalid XML documents than does the standard query evaluation, is presented.
Abstract: We consider the problem of querying XML documents which are not valid with respect to given DTDs. We propose a framework for measuring the invalidity of XML documents and compactly representing minimal repairing scenarios. Furthermore, we present a validity-sensitive method of querying XML documents, which extracts more information from invalid XML documents than does the standard query evaluation. Finally, we provide experimental results which validate our approach.
Proceedings Article•
Toward Semantic XML Clustering.

[...]

Andrea Tagarelli1, Sergio Greco1•
University of Calabar1
1 Jan 2006
TL;DR: This work proposes a framework for clustering semantically cohesive XML structures based on a transactional representation model and experiments give evidence that the proposed approach is highly effective in detecting groups of XML data that exhibit structure and/or content affinities.
Abstract: The increasing availability of heterogeneous XML informative sources has raised a number of issues concerning how to represent and manage semistructured data. Although XML sources can exhibit proper structures and contents, differently annotated XML documents may in principle encode related semantics due to subjective definitions of markup tags. Discovering knowledge to infer semantic organization of XML documents has become a major challenge in XML data management. In this context, we address the problem of clustering XML data according to structure as well as content features enriched with lexical ontology knowledge. We propose a framework for clustering semantically cohesive XML structures based on a transactional representation model. Experiments on large real datasets give evidence that the proposed approach is highly effective in detecting groups of XML data that exhibit structure and/or content affinities.
Journal Article•10.1147/SJ.452.0299•
Cost-based optimization in DB2 XML

[...]

Andrey Balmin1, Tom Eliaz1, J. Hornibrook1, Lipyeow Lim1, Guy M. Lohman1, David E. Simmen1, M. Wang1, C. Zhang •
IBM1
01 Jan 2006-Ibm Systems Journal
TL;DR: The challenses to the relational infrastructure that supporting XQuery and SQL/XML poses are described and the rationale for the extensions that were made to the three main parts of the optimizer: the plan operators, the cardinality and cost model, and statistics collection are provided.
Abstract: DB2 XML is a hybrid database system that combines the relational capabilities of DB2 Universal DatabaseTM (UDB) with comprehensive native XML support. DB2 XML augments DB2® UDB with a native XML store, XML indexes, and query processing capabilities for both XQuery and SQL/XML that are integrated with those of SQL. This paper presents the extensions made to the DB2 UDB compiler, and especially its cost-based query optimizer, to support XQuery and SQL/XML queries, using much of the same infrastructure developed for relational data queried by SQL. It describes the challenses to the relational infrastructure that supporting XQuery and SQL/XML poses and provides the rationale for the extensions that were made to the three main parts of the optimizer: the plan operators, the cardinality and cost model, and statistics collection.
Proceedings Article•10.1109/ICWS.2006.15•
A Table-Driven Streaming XML Parsing Methodology for High-Performance Web Services

[...]

Wei Zhang1, R.A. van Engelen1•
Florida State University1
18 Sep 2006
TL;DR: This paper presents a table-driven streaming XML parsing methodology, called TDX, which expedites XML parsing by pre-recording the states of an XML parser in tabular form and by utilizing an efficient runtime streaming parsing engine based on a push-down automaton.
Abstract: This paper presents a table-driven streaming XML parsing methodology, called TDX. TDX expedites XML parsing by pre-recording the states of an XML parser in tabular form and by utilizing an efficient runtime streaming parsing engine based on a push-down automaton. The parsing tables are automatically produced from the XML schemas of a WSDL service description. Because the schema constraints are pre-encoded in a parsing table, the approach effectively implements a schema-specific XML parsing technique that combines parsing and validation into a single pass. This significantly increases the performance of XML Web services, which results in better response time and may reduce the impact of the flash-crowd effect. To implement TDX, we developed a parser construction toolkit to automatically construct parsers in C code from WSDLs and XML schemas. We applied the toolkit to an example Web services application and measured the raw performance compared to popular high-performance parsers written in C/C++, such as eXpat, gSOAP, and Xerces. The performance results show that TDX can be an order of magnitude faster.
Patent•
Automated interactive visual mapping utility and method for validation and storage of XML data

[...]

Farzad Farahbod1•
IBM1
30 Mar 2006
TL;DR: In this article, a method for automatic validation of Extensible Markup Language (XML) data through interactive visual mapping, before storing the validated XML data into a target destination.
Abstract: A method, apparatus and computer program product is provided for automatic validation of Extensible Markup Language (XML) data through interactive visual mapping, before storing the validated XML data into a target destination. The method allows a user to interact with a graphic user interface (GUI) to obtain mapping of an XML source document or its fragment and the target destination through at least one XML validation schema, and to automatically obtain XML data validations. The GUI is menu-driven and displays the XML data in the native format and a selection of the XML validation schemas. Interactions with the GUI automatically generate an XML validation mapping definition document and a validation code. The target destination can be one or more XML columns, a web service call, a standalone application call, a function's or stored procedure's input parameter of XML type and it may be located in a heterogeneous RDBMS environment.
Patent•
Method for combining input data with run-time parameters into xml output using xsl/xslt

[...]

Robert J Anderson
29 Dec 2006
TL;DR: In this paper, a system and method for adding dynamic information to XML source data without substantial, or preferably any, modification of the underlying source code and without requiring the writing of any XSLT extension functions is presented.
Abstract: A system and method is disclosed for adding dynamic information to XML source data without substantial, or preferably any, modification of the underlying source code and without requiring the writing of any XSLT extension functions. The addition of dynamic information enables a user to customize XML source data for display based on parameters known only at run-time. The present invention receives various run-time and system parameters, uses the parameters to construct queries, and executes the queries against internal and/or external data sources. Datasets that are generated based on query execution are converted to XML document fragments and an XSL Transform converts the XML document fragments into a desired markup language file for presentation within a browser application.
Patent•
Computer directory system having an application integration driver infrastructure

[...]

Nick N. Nikols1, Daniel Wallace Rapp•
Novell1
18 Oct 2006
TL;DR: An application integration driver infrastructure for facilitating the use of a distributed directory running in a computer network is provided in this article, which can transform specific directory events into a vendor-neutral data identification system and then use vendor neutral transformation technologies to transform the neutral data identification into a specific application's data format, and vice-versa.
Abstract: An application integration driver infrastructure for facilitating the use of a distributed directory running in a computer network is provided. The infrastructure can transform specific directory events into a vendor-neutral data identification system and then use vendor-neutral transformation technologies to transform the neutral data identification into a specific application's data format, and vice-versa. The infrastructure receives an event from the distributed directory into a markup language generation system, such as an extensible markup language (“XML”) generator. The XML generator converts the event into XML data and provides the XML data to a transformation processing system, such as an extensible transformation language (“XSLT”) processor. The XSLT processor transforms the XML data to a predetermined format. The format can be dictated by a stylesheet provided to the XSLT processor, the stylesheet being responsive to requirements of a computer application. The transformed data is then provided to the application for the application to use in a conventional manner. The application may use an application shim to convert the transformed data into a native application program interface (“API”) for the application.
Patent•
Method and apparatus for providing XML signature service in wireless environment

[...]

Jae Lee1, Soo Woong Kim, Ki Hak Moon, Chung Kyo Il, Sohn Sung Won •
Electronics and Telecommunications Research Institute1
7 Dec 2006
TL;DR: In this article, a mobile extensible Markup Language (XML) signature service providing apparatus and method is proposed, which includes an XML message analyzing unit authenticating a mobile client, according to an XML signature template generation request or an XML verification request received from the mobile client.
Abstract: Provided are a mobile extensible Markup Language (XML) signature service providing apparatus and method. The mobile XML signature service providing apparatus includes: an XML message analyzing unit authenticating a mobile client, according to an XML signature template generation request or an XML signature verification request received from the mobile client; an XML signature processor generating an XML signature template and a SignedInfo element in a canonicalized format if the authentication is successful, and verifying an XML signature; and an encoder providing key information and at least one setting value for the generation of the XML signature template and verification of the XML signature, to the XML signature processor. Therefore, the mobile XML signature service providing apparatus and method provide authentication, integrity, non-repudiation, etc. with respect to messages received/transmitted in a wireless environment, are applied to a wireless environment having limited resources, are compatible with an XML signature for an existing wired environment that is to be applied to wired-and-wireless integration electronic commerce, and minimizes a change in an existing wired environment when a mobile XML signature is applied.
Focused information access using XML element retrieval

[...]

Börkur Sigurbjörnsson
1 Jan 2006
TL;DR: This dissertation aims to provide a history of web exceptionalism from 1989 to 2002, a period chosen in order to explore its roots as well as specific cases up to and including the year in which descriptions of “Web 2.0” began to circulate.
Abstract: Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: http://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.
Patent•
XML-based control and customization of application programs

[...]

Dror Cohen, David Rachamim, Yiftach Nun, Ronen Rubinfeld, Vishal Trivedi 
14 Apr 2006
TL;DR: In this article, an XML document is marked as editable during installation, and an installation manager review the base XML document, identifies editable portions therein and reads installation settings data therefor.
Abstract: Operation of a computer system is governed by an executable application and settings set forth in an XML document to which the application refers during execution. When an application is published for installation, it includes the application and a base XML document. Portions of the base XML document are marked as editable during installation. During installation, an installation manager review the base XML document, identifies editable portions therein and reads installation settings data therefor. The installation manager may generate a modified XML document representing the settings of the base XML document and any modified settings obtained for installation. Modified settings data may be obtained from an operator, from a computer system on which installation occurs or from a dataset representing settings data obtained in prior installations.
Journal Article•
Transforming XML trees for efficient classification and clustering

[...]

Laurent Candillier, Isabelle Tellier, Fabien Torre
01 Jan 2006-Lecture Notes in Computer Science
TL;DR: This paper investigates the use of a different kind of representation for the manipulation of XML documents to transform the trees into sets of attribute-values, so as to be able to apply various existing methods of classification and clustering on such data, and benefit from their strengths.
Abstract: Most of the existing methods we know to tackle datasets of XML documents directly work on the trees representing these XML documents. We investigate in this paper the use of a different kind of representation for the manipulation of XML documents. Our idea is to transform the trees into sets of attribute-values, so as to be able to apply various existing methods of classification and clustering on such data, and benefit from their strengths. We apply this strategy both for the classification task and for the clustering task using the structural description of XML documents alone. For instance, we show that the use of boosted C5 [1] leads to very good results in the classification task of XML documents transformed in this way. The use of SSC [2] in the clustering task benefits from its ability to provide as output an interpretable representation of the clusters found. Finally, we also propose an adaptation of SSC for the classification of XML documents, so that the produced classifier is understandable.
Conceptual Modeling for XML: A Survey.

[...]

Martin Nečaský1•
Charles University in Prague1
1 Jan 2006
TL;DR: The current approaches to the conceptual modeling of XML data are described in an uniform style and described approaches are compared on the base of the requirements.
Abstract: Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. In this pa- per, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.
Book Chapter•10.1007/11733836_59•
DTD-Diff: a change detection algorithm for DTDs

[...]

Erwin Leonardi1, Tran T. Hoai1, Sourav S. Bhowmick1, Sanjay Kumar Madria2•
Nanyang Technological University1, Missouri University of Science and Technology2
12 Apr 2006
TL;DR: This paper presents a novel algorithm called DTD-DIFF to detect the changes to DTDs that defines the structure of a set of XML documents, and shows that converting DTD to XML Schema (XSD) and detecting the changes using existing XML change detection algorithms is not a feasible option.
Abstract: The DTD of a set of XML documents may change due to many reasons such as changes to the real world events, changes to the user's requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML Schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5–325 times faster than X-Diff when it detects the changes to the XSD files. We also study the result quality of detected deltas.
Proceedings Article•10.1109/ICCIAS.2006.295443•
Design and Implementation of an XML Firewall

[...]

Yin-soon Loh1, Wei-Chuen Yau1, Chien-thang Wong1, Wai-chuen Ho1•
Multimedia University1
1 Nov 2006
TL;DR: The design of the architecture and filtering policies for an XML firewall is presented and it is shown that the firewall is capable of allowing valid SOAP messages while blocking malicious SoAP messages that contain attacks such as oversized payloads, recursive pay loads, and SQL injections.
Abstract: Web services provide a means to communicate easily between applications to exchange information. However, the lack in security features provided by Web services creates a window of opportunities for attackers. This paper presents the design of the architecture and filtering policies for an XML firewall. The firewall is implemented using Java language. We conduct a series of tests for verifying the functionality of the firewall. The results of the tests show that the firewall is capable of allowing valid SOAP messages while blocking malicious SOAP messages that contain attacks such as oversized payloads, recursive pay loads, and SQL injections
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve