TL;DR: This work introduces VisGets - interactive query visualizations of Web-based information that operate with online information within a Web browser and facilitates the construction of dynamic search queries that combine filters from more than one data dimension.
Abstract: In common Web-based search interfaces, it can be difficult to formulate queries that simultaneously combine temporal, spatial, and topical data filters. We investigate how coordinated visualizations can enhance search and exploration of information on the World Wide Web by easing the formulation of these types of queries. Drawing from visual information seeking and exploratory search, we introduce VisGets - interactive query visualizations of Web-based information that operate with online information within a Web browser. VisGets provide the information seeker with visual overviews of Web resources and offer a way to visually filter the data. Our goal is to facilitate the construction of dynamic search queries that combine filters from more than one data dimension. We present a prototype information exploration system featuring three linked VisGets (temporal, spatial, and topical), and used it to visually explore news items from online RSS feeds.
TL;DR: This paper details the application of a particular interlinking algorithm, taking into account both the similarities of web resources and of their neighbours, in two contexts: to link a Creative Commons music dataset to an editorial one, and tolink a personal music collection to corresponding web identifiers.
Abstract: In this paper, we describe current efforts towards interlinking music-related datasets on the Web. We first explain some initial interlinking experiences, and the poor results obtained by taking a naive approach. We then detail a particular interlinking algorithm, taking into account both the similarities of web resources and of their neighbours. We detail the application of this algorithm in two contexts: to link a Creative Commons music dataset to an editorial one, and to link a personal music collection to corresponding web identifiers. The latter provides a user with personally meaningful entry points for exploring the web of data, and we conclude by describing some concrete tools built to generate and use such links.
TL;DR: The NeOn project aims at advancing the state of the art in using ontologies for large-scale semantic applications by providing an infrastructure for networked ontology management and engineering capable of suiting the community’s needs, and features methods and tools for managing knowledge that is distributed, heterogeneous, contextualized, and developed collaboratively.
Abstract: Ontologies are considered a key technology enabling semantic interoperability and integration of data and processes. We are now entering a phase of knowledge system development, in which ontologies are produced in larger numbers and exhibit greater complexity. Also we have witnessed an increasing need for technologies that enable the reuse of existing (Semantic) Web resources from within the ontology engineering environment itself. In the NeOn project we aim at advancing the state of the art in using ontologies for large-scale semantic applications. This is realized by providing an infrastructure for networked ontology management and engineering capable of suiting the community’s needs [1]. The heart of this infrastructure is the NeOn Toolkit for engineering contextualized networked ontologies and semantic applications. The shift from closed semantic applications, characterized by a monolithic corporate ontology, to open semantic applications, characterized by networks of ontologies, implies that a number of other aspects like dealing with context, collaboration or data and web integration become crucial. The NeOn Toolkit was built with the web-centric elements of semantic technologies in mind: much like Web 2.0 environments emphasize distributed content production, the NeOn Toolkit features methods and tools for managing knowledge that is distributed, heterogeneous, contextualized, and developed collaboratively.
TL;DR: An interview study investigated why and how independent learners use Web-based resources, exploring not only the academic context of the courses studied, but also any relevant personal, domestic and employment-related circumstances.
Abstract: Undergraduate students access the Internet for a range of purposes, many unrelated to their studies. Increasingly, learners are using the Internet to find information and resources for coursework, whether or not this is promoted or endorsed by their teachers. This article reports an interview study that investigated why and how independent learners use Web-based resources, exploring not only the academic context of the courses studied, but also any relevant personal, domestic and employment-related circumstances. Factors were identified, which enhanced or competed with study activities, acting as incentives or disincentives for learners. The findings suggest that it is not technologies per se, but a combination of various contextual factors, that determine students' use of Web resources for learning. Of the academic factors that emerged from the interviews, assessment requirements and pedagogic approach were particularly important.
TL;DR: The Art of Capacity Planning is written by the manager of data operations for the world-famous photo-sharing site Flickr, which draws on years of valuable experience to help plan, deploy, and manage web application infrastructure.
Abstract: Success on the web is measured by usage and growth. Web-based companies live or die by the ability to scale their infrastructure to accommodate increasing demand. This book is a hands-on and practical guide to planning for such growth, with many techniques and considerations to help you plan, deploy, and manage web application infrastructure. The Art of Capacity Planning is written by the manager of data operations for the world-famous photo-sharing site Flickr.com, now owned by Yahoo! John Allspaw combines personal anecdotes from many phases of Flickr's growth with insights from his colleagues in many other industries to give you solid guidelines for measuring your growth, predicting trends, and making cost-effective preparations. Topics include: Evaluating tools for measurement and deployment Capacity analysis and prediction for storage, database, and application servers Designing architectures to easily add and measure capacity Handling sudden spikes Predicting exponential and explosive growth How cloud services such as EC2 can fit into a capacity strategy In this book, Allspaw draws on years of valuable experience, starting from the days when Flickr was relatively small and had to deal with the typical growth pains and cost/performance trade-offs of a typical company with a Web presence. The advice he offers in The Art of Capacity Planning will not only help you prepare for explosive growth, it will save you tons of grief.
TL;DR: The OAI Object Reuse and Exchange (OAI-ORE) framework recasts the repository-centric notion of digital object to a bounded aggregation of Web resources that conforms more closely with notions in eScience and eScholarship.
Abstract: The OAI Object Reuse and Exchange (OAI-ORE) framework recasts the repository-centric notion of digital object to a bounded aggregation of Web resources. In this manner, digital library content is more integrated with the Web architecture, and thereby more accessible to Web applications and clients. This generalized notion of an aggregation that is independent of repository containment conforms more closely with notions in eScience and eScholarship, where content is distributed across multiple services and databases. We provide a motivation for the OAI-ORE project, review previous interoperability efforts, describe draft ORE specifications and report on promising results from early experimentation that illustrate improved interoperability and reuse of digital objects.
TL;DR: Analysis of students' log of a Learning Management System (LMS) of two subjects at a university in Bangladesh, taught over six weeks duration shows that students having 'Low' access obtained poor grade, on campus access was higher than access from home.
Abstract: Application of data mining for web log analysis has received significant attention in finding customers' behavioral pattern in e-commerce and learners' behavioral pattern in e- learning. While hit-counts indicate customers' interest in the product or purchasing behavior, a student's visits to a Learning Management System (LMS) do not necessarily involve transfer of learning. Addressing such complexity in e-learning, this study analyzed students' log of a Learning Management System (LMS) of two subjects at a university in Bangladesh, taught over six weeks duration. Data mining and statistical tools have been used to find relationships between students' LMS access behavior and overall performances. Results show that students having 'Low' access obtained poor grade, on campus access was higher than access from home. Background of students is very important for effective usage of web resources. Majority of the student considered LMS to be a quite helpful tool as teaching-learning method. Preparation and cleaning of the web-log files as well as application of data mining algorithms is important for learners' web usage analysis.
TL;DR: It is shown that WIP provides a SOA that can be extended seamlessly from a primitive P2P end point for voice and video communication to an end point with advanced call control and switching capabilities typically requiring the assistance of a dedicated PBX.
Abstract: This article describes a service-oriented communication (SOC) paradigm based on Web services for real-time communication and converged communication services over IP. This approach extends Web services from a methodology for service integration to a framework for SOC. In particular, we introduce the generic Web services-based application session management (WS-session), the two-way full duplex Web services interaction for communication, and most importantly, the development of Web Services Initiation Protocol. WIP is a full-featured Web services and SOA-based communication paradigm for multimedia and voice communication-over-IP. In the SOC paradigm of WIP, each WIP communication end point may be exposed as a UDDI-publishable Web resource, searchable by Web search engines, and capable of being integrated into a communication-enabled business process as services. We show that WIP provides a SOA that can be extended seamlessly from a primitive P2P end point for voice and video communication to an end point with advanced call control and switching capabilities typically requiring the assistance of a dedicated PBX. A prototype WIP system is fully developed. Architectural design and a system implementation of Web services-based communication are studied and applied to real scenarios of converged communication services. The advance of WIP indicates the beginning of a full Web service and SOA-based communication paradigm that can reshape the direction of communication over IP.
TL;DR: In this article, the coding logic needed to implement a custom web spider is outlined, and the ethical and operational issues associated with Web spiders are addressed, as well as a solution for automated data collection using web spiders.
Abstract: Data collection is always a challenge. Researchers in the hospitality and tourism industry can benefit from the extensive amount of industry related data that are publicly available on the Internet. The issue holding most researchers back is how to efficiently collect such data. This article provides a solution: automated data collection using Web spiders. The coding logic needed to implement a custom Web spider is outlined. Also addressed are the ethical and operational issues associated with Web spiders.
TL;DR: In this paper, an overview of approaches in this context to manage probabilistic uncertainty, possibilistic uncertainty and vagueness in expressive description logics for the SemanticWeb is presented.
Abstract: Ontologies play a crucial role in the development of the Semantic Web as a means for defining shared terms in web resources. They are formulated in web ontology languages, which are based on expressive description logics. Significant research efforts in the semantic web community are recently directed towards representing and reasoning with uncertainty and vagueness in ontologies for the SemanticWeb. In this paper, we give an overview of approaches in this context to managing probabilistic uncertainty, possibilistic uncertainty, and vagueness in expressive description logics for the SemanticWeb.
TL;DR: This article proposes an ontology, IRE, that provides a formal way to model both the problem and the solution spaces of the identity of resources, and proposes a formal pattern for modeling and comparing different solutions to the problems of the Identity of resources.
Abstract: One of the main strengths of the Web is that it allows any party of its global community to share information with any other party. This goal has been achieved by making use of a unique and uniform mechanism of identification, the uniform resource identifiers (URI). Although URIs succeed when used for retrieving resources on the Web, their suitability for identifying any kind of thing, for example, resources that are not on the Web, is not guaranteed. In this article we investigate the meaning of the identity of a Web resource, and how the current situation, as well as existing and possible future improvements, can be modeled and implemented on the Web. In particular, we propose an ontology, IRE, that provides a formal way to model both the problem and the solution spaces. IRE describes the concept of resource from the viewpoint of the Web, by reusing an ontology of information objects, built on top of DOLCE+ and its extensions. In particular, we formalize the concept of Web resource, as distinguished from the concept of a generic entity, and how those and other concepts are related, for example, by different proxy for relations. Based on the analysis formalized in IRE, we propose a formal pattern for modeling and comparing different solutions to the problems of the identity of resources.
TL;DR: In this article, an electronic document for updating resource files for an application comprises logical names used by the application for referring to resource files and parameters for specifying constraints for the resource files.
Abstract: In one aspect, an electronic document for updating resource files for an application comprises logical names used by the application for referring to resource files and parameters for specifying constraints for the resource files. In another aspect, the electronic document comprises an application identifier identifying an application to which the electronic document corresponds, and a universal resource identifier identifying a resource file for use with the application.
TL;DR: A method to detect place names employed by people submitting information to Web resources is presented and Web mining is proposed as a means to decide whether a given particular named entity is in fact a place.
Abstract: With the aim to improve the quality of gazetteers for geographic information retrieval systems, we present a method to detect place names employed by people submitting information to Web resources. We investigate how often people refer to a place using locative phrases in web queries and address the problem of defining cognitively significant place names. We propose Web mining as a means to decide whether a given particular named entity is in fact a place.
TL;DR: In this article, the authors present an analysis of uniform resource identifiers associated with web pages, and further discuss gathering information about web pages by analyzing the uniform resources identifiers, which may relate to analyses of web pages.
Abstract: Subject matter disclosed herein may relate to analyses of uniform resource identifiers associated with web pages, and further may relate to gathering information about web pages by analyzing the uniform resource identifiers.
TL;DR: This work analyzes two scenarios related to e-government services through identifying goals and activities of administrations and their clients, on the basis of which it claims that such an analysis narrows down development and use of metadata, lowers the barriers of ontology usage (for metadata production), and efficiently guides development of e- government ontologies towards the needs of citizens, businesses and administrations alike.
Abstract: In order to assist bottom-up efforts by administrations annotating their Web resources with adequate metadata, we introduce and follow an activity-based approach towards development and use of ontologies for e-government services. Specifically, we analyze two scenarios related to e-government services through identifying goals and activities of administrations and their clients, on the basis of which we determine the need for sharing perspectives and (electronic) boundary objects. We claim that such an analysis narrows down development and use of metadata, lowers the barriers of ontology usage (for metadata production), and efficiently guides development of e-government ontologies towards the needs of citizens, businesses and administrations alike.
TL;DR: In this article, the alignment of uniform resource identifiers associated with web pages is discussed, which may provide improved performance of applications such as, for example, uniform resource identifier normalization, sitemap construction, etc.
Abstract: Subject matter disclosed herein may relate to alignment of uniform resource identifiers associated with web pages, and further may relate to multiple sequence alignment of uniform resource identifiers In one or more example embodiments, multiple sequence alignment techniques may provide improved tokenization of uniform resource identifiers associated with web pages, which may provide improved performance of applications such as, for example, uniform resource identifier normalization, sitemap construction, etc
TL;DR: The implementation of the ASL-STEM Forum is discussed, the approach to building a community using the site is described, and the unique opportunities it offers for observing a language developing from the bottom-up are overviewed.
Abstract: Deaf and hard of hearing students studying advanced topics in Science, Technology, Engineering, and Mathematics (STEM) lack standard terminology to enable them to learn, discuss and contribute to their chosen fields. The ASL-STEM Forum enables the diverse, thinly-spread groups that are independently creating and us1ing terminology to come together using a community-based, video-enabled web resource. A common vocabulary would provide interpreters with consistent terminology and enable deaf scientists to more easily converse from a common basis. This paper discusses the implementation of the ASL-STEM Forum, describes our approach to building a community using the site, and overviews the unique opportunities it offers for observing a language developing from the bottom-up.
TL;DR: The use of social bookmarking at a medium-sized comprehensive college library for the creation and maintenance of modern languages subject guides is described and a brief technical description is outlined that outlines the JavaScript code that needs to be included in the subject guides.
Abstract: By using Web 2.0 social bookmarking sites, libraries can more easily manage subject guides and other lists of Web resources. Social bookmarking services such as Delicious provide a one-click method to bookmark a Web site, allowing librarians to describe and categorize Web sites. Using a small amount of JavaScript, these bookmarked resources can be dynamically included into subject guides and other Web-based library resources.
This paper describes and analyses the use of social bookmarking at a medium-sized comprehensive college library for the creation and maintenance of modern languages subject guides. A brief technical description outlining necessary JavaScript code provides a way for librarians to try this idea elsewhere.
This paper examines the initiative at The College of New Jersey (TCNJ) Library to utilize the social bookmarking Web site del.icio.us to easily update and maintain modern language subject-specific guides on the library Web site. Previously, the subject guides have proven difficult to maintain and a solution that allowed subject librarians to quickly and easily update subject guides from anywhere was desired. By using del.icio.us, librarians at TCNJ have been able to stream line subject guide maintenance. This paper describes the process used to include resources bookmarked on del.icio.us by librarians from both the subject librarians' and systems librarian's perspectives. Included is a brief technical description that outlines the JavaScript code that needs to be included in the subject guides that other libraries can use as an example if they choose to embark on a similar project. The response from librarians and teaching faculty has been positive. Librarians appreciate the ease of use while teaching faculty appreciate the constantly evolving nature of the subject guides.
Recommendations on ways to expand this project, including methods to allowing students and faculty to identify content to be included in subject guides using del.icio.us are also discussed.
TL;DR: GroupMe! extends the idea of current social tagging systems by enabling users to not only tag Web resources they are interested in, but also to create collections (groups) of these Web resources by simple drag & drop operations.
Abstract: Common social tagging systems like Flickr, del.icio.us and others lately became very popular. The key benefits of these systems include that users get involved in the content creation process, can easily — without overhead – comment or annotate Web content, share resources with fellow users, and benefit from the comments / annotations of other users with improved retrieval support. With GroupMe! we extend the idea of current social tagging systems by enabling users to not only tag Web resources they are interested in, but also to create collections (groups) of these Web resources by simple drag & drop operations. The grouping metaphor is intuitive and easy for the users, and our evaluation shows that users appreciate the grouping facility, and use this feature to organize and structure Web content. Technically, the grouping of resources carries valuable information about Web resources and their relations, and can be exploited to improve the mining of Web content, e.g. for search and retrieval.
TL;DR: A method for linking tags from a folksonomy to concepts of an existing ontology, adopting a statistic approach is proposed, to capture emerging semantics in social tagging systems and to overcome the gap between Semantic Web and Web 2.0.
Abstract: This work focusses on bridging between folksonomies, which provide social but mainly flat and unstructured metadata on web resources, and semantic web ontologies, which instead design structured, machine-processable knowledge spaces. The main purpose is to capture emerging semantics in social tagging systems and to overcome the gap between Semantic Web and Web 2.0, by preserving the complementary advantages of social and ontology-driven methods for describing, categorizing and processing web content. As a way to bridge this gap, we propose a method for linking tags from a folksonomy to concepts of an existing ontology, adopting a statistic approach. We have applied the proposed method to the data collected through the art portal Arsmeteo, relating them to the concepts of an OWL ontology of emotions. Intuitively, by our method we try to capture the latent emotional semantics of the tags. Some of the artworks in Arsmeteo could be visited in real exhibitions. In order to capture the emotional potential of the tagging activity during the visit, we explored the possibility to enable tagging of artifacts in real spaces, by using Semacode technology.
TL;DR: Methods to evaluate web sites and techniques for efficient access to web resources are presented and discussed and methods for determining bias, validity, and fit to a patient population are discussed.
TL;DR: In this paper, a method for marking up a plurality of electronic documents, comprising: semantic marking-up electronic documents according to a predetermined domain model presented in the form of domain ontology, thereby creating mark-up results represented in Ontology Web Language, OWL, format.
Abstract: The present invention provides method and apparatus, including computer program products, A method for marking-up a plurality of electronic documents, comprising: semantic marking-up electronic documents according to a predetermined domain model presented in the form of domain ontology, thereby creating mark-up results represented in Ontology Web Language, OWL, format; and storing the mark-up results and the links to the marked-up electronic documents represented by Universal Resource Locators, URLs, in a Resource Description Framework, RDF, storage.
TL;DR: The bottom‐up analysis of processing web resources shows that a balanced description of tasks and their distribution over staff members was established, and that the present workflow meets the requirements of efficient processing of web resources.
Abstract: Purpose – The purpose of this paper is to assess costs in the National and University Library of Croatia for processing Croatian web resources and the maintenance and development of the service, and to analyse the present organisation and workflow of their processing, and to propose improvements.Design/methodology/approach – The assessment period was two months, during which the members of staff involved minutely monitored their tasks. The results were compared to the same exercise reported by the National Library of Australia and processing costs of cataloguing Croatian print publications.Findings – The bottom‐up analysis of processing web resources shows that a balanced description of tasks and their distribution over staff members was established, and that the present workflow meets the requirements of efficient processing of web resources. As a general finding, approximately the same time was spent on archiving new items, as on the control and maintenance of the already archived ones due to the change...
TL;DR: The Kinds of Tags project as discussed by the authors identifies elements required for metadata originating from folksonomies and proposes an application profile for DC Social Tagging, which provides information that may be used by software applications to assign tags to metadata elements and, therefore, means for tags to be conveniently gathered by metadata interoperability tools.
Abstract: Folksonomy is the result of describing Web resources with tags created by Web users. Although it has become a popular application for the description of resources, in general terms Folksonomies are not being conveniently integrated in metadata. However, if the appropriate metadata elements are identified, then further work may be conducted to automatically assign tags to these elements (RDF properties) and use them in Semantic Web applications. This article presents research carried out to continue the project Kinds of Tags, which intends to identify elements required for metadata originating from folksonomies and to propose an application profile for DC Social Tagging. The work provides information that may be used by software applications to assign tags to metadata elements and, therefore, means for tags to be conveniently gathered by metadata interoperability tools. Despite the unquestionably high value of DC and the significance of the already existing properties in DC Terms, the pilot study show revealed a significant number of tags for which no corresponding properties yet existed. A need for new properties, such as Action, Depth, Rate, and Utility was determined. Those potential new properties will have to be validated in a later stage by the DC Social Tagging Community.
TL;DR: In this article, the authors explored how a relatively insignificant business can gain and maintain access to a major supermarket chain on the other side of the globe in a world oversupplied with fine wines.
Abstract: Purpose – The purpose of this case study is to explore how a relatively economically insignificant business can gain and maintain access to a major supermarket chain on the other side of the globe in a world oversupplied with fine wines.Design/methodology/approach – Based on the approach to case studies recommended by Lyons (2005), this case study is built on semi‐structured interviews with key informants, previous experience, observations, documentary and web resources, combined in a process of triangulation to ensure reliability and content validity.Findings – The nature of the problems facing a small wine producer are described, followed by an account of how access to Tesco was achieved and maintained. Personnel with previous Tesco contact were found to be vital to gaining access. Providing exactly what the supermarket wanted, when it wanted it and with reliability to continue supply over time were found to be critical as was the role of the channel coordinator. Supplier/supermarket loyalty was able to...
TL;DR: The purpose of this article is to describe a web-based resource for the UK's long-term individual-based time-series (LITS) data, and introduce the LITS project, to discuss several issues the project raises that may influence future moves to make more ecological information available on the web.
Abstract: Online data resources are becoming increasingly important for ecologists. However, compared with fields such as molecular biology and physical environmental science, there are few resources containing raw ecological data that can be used for further research. Reasons for this include the greater heterogeneity of most ecological data compared with, say, nucleic acid sequences, or remote-sensing data, and the fact that many data are collected by individual research laboratories rather than by a community effort. Examples of web-based ecological data resources are the Long-term Ecological Research (LTER) catalogue of the data collected across the US network of environmental monitoring sites (http://www.lternet.edu/), and the Global Population Dynamics Database (GPDD) (NERC Centre for Population Biology at Imperial College, 1999) which makes available 5000 time-series of species popu lation densities, each over 10 years duration. The Ecological Society of America established the Ecological Archives (Peet 1998) as a depositary for electronic information. Fields that overlap with ecology are perhaps richer in web-based data resources: Treebase (http://www.treebase.org/treebase/) is a database of phytogenies, while an increasing number of taxonomic catalogues are now web-based. Much biodiversity information including distributional and ecological data are being collected digitally, or are being digitized, and the goal of the Global Biodiversity Information Facility (GBIF) is to provide a seamless portal for access to this distributed resource. The purpose of this article is to describe a web-based resource for the UK's long-term individual-based time-series (LITS) data. In the next section we explain what we mean by LITS data, introduce the LITS project, and then describe the web resource. In the concluding part we discuss several issues the project raises that may influence future moves to make more ecological information available on the web.
TL;DR: In this article, the authors propose a simple model for everyday web sites which takes advantage of the web server itself to help prepare the site's resources for preservation, which is accomplished by having metadata utilities analyze the resource at the time of dissemination.
Abstract: There are innumerable departmental, community, and personal web sites worthy of long-term preservation but proportionally fewer archivists available to properly prepare and process such sites. We propose a simple model for such everyday web sites which takes advantage of the web server itself to help prepare the site's resources for preservation. This is accomplished by having metadata utilities analyze the resource at the time of dissemination. The web server responds to the archiving repository crawler by sending both the resource and the just-in-time generated metadata as a straight-forward XML-formatted response. We call this complex object (resource + metadata) a CRATE. In this paper we discuss modoai, the web server module we developed to support this approach, and we describe the process of harvesting preservationready resources using this technique.
TL;DR: Application of ICT in libraries has become inevitable in an era of information explosion and widespread use of digital information resources and helps in performing their operations and services most efficiently.
Abstract: Application of ICT in libraries has become inevitable in an era of information explosion and widespread use of digital information resources. Effective application of ICT in libraries helps in performing their operations and services most efficiently. The modernization of libraries and information centers enabled information transfer and access, meeting objectives and there by establishes a network of libraries and information centres. This initiative saw a major shift in resource development, resource sharing and their utilization at various levels. They subscribe to e-journals, CD-ROM databases, online databases, web-based resources, and a variety of other electronic resources. They participate in library consortia and build digital libraries.
TL;DR: This chapter surveys techniques for ontology evolutio n, detailing the various existing languages and techniques devoted to Web data evolution, with particular atte ntion to Semantic Web concepts, and how these language and techniques can be adapted to ev olving data in order to improve the quality of Web Information Systems applications.
Abstract: The evolution of Web information is of utmost impor tance in the design of good Web Information Systems applications. New emerging para digms, like the Semantic Web, use ontologies for describing metadata and are defined, i part to aid in Web evolution. In this chapter, we survey techniques for ontology evolutio n. After identifying the different kinds of evolution the Web is confronted with, we detail the various existing languages and techniques devoted to Web data evolution, with particular atte ntion to Semantic Web concepts, and how these languages and techniques can be adapted to ev olving data in order to improve the quality of Web Information Systems applications. Towards the Adaptive Web using Metadata Evolution When we consider the evolution of the World Wide We b (WWW) and its development over the last decade, we can see drastic changes ha v taken place. Originally, the Web was built on static HTML documents and used as experimental c ommunication means for specific communities in educational institutions and governm e t defence agencies. With its ever increasing popularity, the WWW is now the largest r esource of information in the world. As such, it contains either durable or volatile inform ation that needs to be maintained easily and often with predefined time delays. For example, thi s is the case for day-to-day news diffusion. The emergence of new paradigms like the Semantic We b (B rners-Lee, Hendler, & Lassila, 2001) has further intensified the evolution of the W b. Since Semantic Web has the ability to make Web information machines understandable, it co rrelates to an improvement in quality of Web content and Web services and the result unloads user of tedious search tasks. These changes are accompanied by the manifestation of new application families such as Web Information System (WIS) applications. WIS appl ications are developed based on Web content and consequently must adapt to its evolutio n. C nsider, for example, a tourism application for helping users prepare their holiday s. If the company introduces a new kind of vacation (e.g., social holidays for elderly people) then the application needs to be maintained in order to cope with the evolution of this domain. Th e development of WIS applications is not an easy task since it is difficult to clearly catch th e semantics of actual Web content, which in turn limits the retrieval of relevant Web information (G uelfi & Pruski, 2006). Parallel to this development, the use of metadata t o drive the advancement of Web-based applications has proven to be a very promising fiel d of research (“Web Engineering: Principles and Techniques,” 2005) and we believe that the use of metadata can improve the quality of WIS applications. Metadata is useful to describe Web re sources in a structured and precise way. It can be used, for instance, to describe the content of t ourism resources in our application example. Consequently, languages are needed to define metada ta nd techniques are required for the metadata to smoothly follow the evolution of the de scribed resource. To this end, the Semantic Web paradigm can be the key to success. It consists in using ontologies (Gruber, 1993) to define the semantics of a particular domain and then uses the vocabulary contained in these ontologies as metadata to annotate Web content. In this context, many ontology languages have been int sively studied in the literature (Antoniou & van Harmelen, 2004; Charlet, Laublet, & Reynaud, 2004). Although they provide basic modelling features, they do not offer enough properties to design and maintain ontologies that have the ability to evolve over time. Static o ntologies designed with existing languages seem to be unsuitable for today’s dynamic Web and what i t requires instead are “dynamic” ones, which are built using new kinds of languages and te chniques in order to give them the property of being evolvable over time according to modificat ion of Web content. Since these new languages and techniques will have an important imp act on Web-based applications and mainly WIS applications, we need to carefully identify the different kinds of evolution that can affect Web content and to evaluate the current capabilitie s of existing languages. In this chapter we propose to survey state-of-the-a rt dynamic ontology languages and techniques. This study also contains notes concerni ng knowledge evolution in other paradigms connected to the WWW, like reactivity (Alferes et a l., 2004) and Adaptive Hypermedia Systems (AHS) (Brusilovsky, 2001), both of which have eleme nts that should be integrated in the development of advanced WIS applications. Such adap t tion techniques can be used, for example, to adapt the presentation of content to us ers. Imagine our tourism application able to display in bold red font a particular offer, like a fee reduction on a particular service. In order to cope with various kinds of Web content evolution, w e ill provide some elements regarding the improvement of these languages. Based on our survey and the proposals we make, we believe that it will be easier to understand the impact of languages and technologies on the quality of future WIS applications. We first discuss existing types of evolution impact ing the Web as well as related work in the field of languages and techniques for Web data evolution. Dealing with the evolutionary aspect of Web content, two important research field s stand out: reactivity and AHS. Within the framework of reactivity, several interesting EventConditions-Actions (ECA) languages have been proposed to not only make Semantic Web concept s r active but also classic web sites. AHS is a young research field that we will talk about i n detail since it takes into account knowledge evolution and introduces user adaptation, which is important for future versions of the Web. Following this, we discuss related work in the fiel d of languages and techniques for ontology evolution of the Semantic Web. Although a set of standards have being established by the World Wide Web Consortium (W3C) and intensively studied in literature, new ones are under investigation. In this section we will presen t these emergent languages and techniques. We will briefly introduce standards like Web Ontology Language (OWL) and focus on the description of new languages like dOWL and the pres entation of methodologies that have been proposed to deal with ontology evolution. We also p r pose some perspectives about the extension of the studied languages with respect to on logy evolution. The next section discusses the impact of the concep ts we have presented on the quality of WIS applications. Based on our remarks, we illustra te what has to be done to increase the quality of WIS applications at each stage of its lifecycle, i. . from design to runtime, mainly by taking into account user preferences, experiences and beha viour. Finally, we wrap up with concluding remarks. WEB DATA EVOLUTION: PRINCIPLES AND TECHNIQUES