TL;DR: A new hypertext resource discovery system called a Focused Crawler is presented. It is robust against large perturbations in the starting set of URLs and can discover valuable resources that are dozens of links away from the start set, while carefully pruning the millions of pages that may lie within the same radius.
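The explore-and-prune behaviour described in this summary can be illustrated with a small best-first crawl. This is a minimal sketch under stated assumptions, not the system from the paper: `focused_crawl`, the in-memory `links` graph, the `relevance` scoring callback, and the `threshold` value are all hypothetical stand-ins for a real fetcher and topic classifier.

```python
import heapq

def focused_crawl(seeds, links, relevance, threshold=0.5, max_pages=100):
    """Best-first crawl over a toy link graph (illustrative only).

    links:     dict mapping a URL to the URLs it links to (hypothetical data)
    relevance: callable returning a topic score in [0, 1] for a fetched page
    """
    # An unfetched URL inherits the score of the page that linked to it;
    # seeds start with the highest priority. heapq is a min-heap, so
    # scores are stored negated.
    frontier = [(-1.0, url) for url in seeds]
    heapq.heapify(frontier)
    seen = set(seeds)
    relevant = []
    fetched = 0
    while frontier and fetched < max_pages:
        _, url = heapq.heappop(frontier)
        fetched += 1
        score = relevance(url)  # stands in for "fetch the page and classify it"
        if score < threshold:
            continue  # prune: links on off-topic pages are never followed
        relevant.append(url)
        for nxt in links.get(url, ()):
            if nxt not in seen:
                seen.add(nxt)
                heapq.heappush(frontier, (-score, nxt))
    return relevant

# Toy usage: page "x" is off-topic, so "y" behind it is never fetched.
toy_links = {"a": ["b", "x"], "b": ["c"], "x": ["y"], "c": []}
toy_scores = {"a": 0.9, "b": 0.8, "x": 0.1, "c": 0.7, "y": 0.9}
result = focused_crawl(["a"], toy_links, toy_scores.get)
```

Pruning at off-topic pages is what keeps the frontier small in this sketch. It also means that relevant pages reachable only through off-topic ones are missed, which is a trade-off of this simplified version.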
TL;DR: In this paper, the authors present a web resource for the prediction of the biological activity spectra of organic compounds based on their structural formulas for more than 4000 types of biological activity with average accuracy above 95% (http://www.way2drug.com/passonline).
Abstract: The freely accessible web resource PASS Online is presented. This resource is designed for the prediction of the biological activity spectra of organic compounds based on their structural formulas for more than 4000 types of biological activity with average accuracy above 95% (http://www.way2drug.com/passonline). The prediction is based on an analysis of the structure-activity relationships in a training set containing information on the structure and biological activity of more than 300000 organic compounds. The possibilities and limitations of this approach are described. Recommendations are given for interpreting the prediction results. Examples are given of the practical use of the PASS Online web resource to establish priorities for chemical synthesis and biological testing of substances on the basis of prediction results. Further prospects are considered for using PASS Online as an Internet platform for joint projects of academic researchers in the search for and development of new pharmaceutical agents.
TL;DR: MnM is presented, an annotation tool that provides both automated and semi-automated support for annotating web pages with semantic contents. It integrates a web browser with an ontology editor and provides open APIs for linking to ontology servers and for integrating information extraction tools.
Abstract: An important precondition for realizing the goal of a semantic web is the ability to annotate web resources with semantic information. In order to carry out this task, users need appropriate representation languages, ontologies, and support tools. In this paper we present MnM, an annotation tool which provides both automated and semi-automated support for annotating web pages with semantic contents. MnM integrates a web browser with an ontology editor and provides open APIs to link to ontology servers and for integrating information extraction tools. MnM can be seen as an early example of the next generation of ontology editors, being web-based, oriented to semantic markup and providing mechanisms for large-scale automatic markup of web pages.
TL;DR: The DNS Security Extensions (DNSSEC) are a collection of resource records and protocol modifications that provide source authentication for the DNS. This document defines the public key (DNSKEY), delegation signer (DS), resource record digital signature (RRSIG), and authenticated denial of existence (NSEC) resource records.
Abstract: This document is part of a family of documents that describe the DNS Security Extensions (DNSSEC). The DNS Security Extensions are a collection of resource records and protocol modifications that provide source authentication for the DNS. This document defines the public key (DNSKEY), delegation signer (DS), resource record digital signature (RRSIG), and authenticated denial of existence (NSEC) resource records. The purpose and format of each resource record is described in detail, and an example of each resource record is given. This document obsoletes RFC 2535 and incorporates changes from all updates to RFC 2535. [STANDARDS-TRACK]
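To make the record formats a little more concrete, the following sketch models the DNSKEY RDATA fields (flags, protocol, algorithm, public key) and computes the key tag that DS and RRSIG records use to refer to a DNSKEY. The class and function names are illustrative assumptions, and the key bytes in the usage lines are made up for the example. The checksum applies to algorithms other than 1 (RSA/MD5), which uses a different key tag rule.

```python
import struct
from dataclasses import dataclass

@dataclass
class DNSKEY:
    """DNSKEY RDATA fields (illustrative container, not a full parser)."""
    flags: int         # 16 bits; 256 = zone key, 257 = zone key with SEP bit
    protocol: int      # 8 bits; must be 3
    algorithm: int     # 8 bits; DNSSEC algorithm number
    public_key: bytes  # raw public key material

    def rdata(self) -> bytes:
        # Wire format: flags (2 bytes), protocol (1), algorithm (1), key.
        header = struct.pack("!HBB", self.flags, self.protocol, self.algorithm)
        return header + self.public_key

def key_tag(rdata: bytes) -> int:
    """16-bit key tag over DNSKEY RDATA (for algorithms other than 1)."""
    acc = 0
    for i, byte in enumerate(rdata):
        # Even-offset bytes are the high octet of a 16-bit word,
        # odd-offset bytes the low octet.
        acc += (byte << 8) if i % 2 == 0 else byte
    acc += (acc >> 16) & 0xFFFF  # fold the carry back in
    return acc & 0xFFFF

# Usage with made-up key bytes (not a real key):
example_key = DNSKEY(flags=256, protocol=3, algorithm=8, public_key=b"\x01\x02")
example_tag = key_tag(example_key.rdata())
```

The key tag is only a hint for selecting candidate keys. Different keys can share a tag, so a validator still has to verify the signature itself.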
TL;DR: Sindice, a lookup index over resources crawled on the Semantic Web, allows applications to automatically retrieve sources with information about a given resource and allows resource retrieval through inverse-functional properties.
Abstract: Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The "linked data" approach, which mandates that resource URIs should be de-referenced and yield meta-data about the resource, helps but is only a partial solution. We present Sindice, a lookup index over resources crawled on the Semantic Web. Our index allows applications to automatically retrieve sources with information about a given resource. In addition we allow resource retrieval through inverse-functional properties, offer full-text search and index SPARQL endpoints.
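The lookup idea in this abstract can be sketched as a pair of inverted indexes: one from resource URIs to the documents that mention them, and one from (inverse-functional property, value) pairs to documents. This is a toy in-memory sketch, not Sindice's implementation or API. `LookupIndex` and its method names are hypothetical, only subject URIs are indexed, and the example documents are invented.

```python
from collections import defaultdict

FOAF_MBOX = "http://xmlns.com/foaf/0.1/mbox"  # an inverse-functional property

class LookupIndex:
    """Toy index from resources to the source documents describing them."""

    def __init__(self, inverse_functional=(FOAF_MBOX,)):
        self.inverse_functional = set(inverse_functional)
        self.by_uri = defaultdict(set)  # subject URI -> source URLs
        self.by_ifp = defaultdict(set)  # (property, value) -> source URLs

    def add_document(self, source_url, triples):
        # triples: iterable of (subject, predicate, object) strings.
        # Assumption: only subjects are indexed, to keep the sketch short.
        for subject, predicate, obj in triples:
            self.by_uri[subject].add(source_url)
            if predicate in self.inverse_functional:
                self.by_ifp[(predicate, obj)].add(source_url)

    def sources_for(self, uri):
        return sorted(self.by_uri.get(uri, ()))

    def sources_for_ifp(self, prop, value):
        return sorted(self.by_ifp.get((prop, value), ()))

# Usage with invented documents: two sources describe the same person
# under different URIs but share a mailbox value.
index = LookupIndex()
index.add_document("http://example.org/a.rdf",
                   [("http://example.org/people#alice", FOAF_MBOX, "mailto:alice@example.org")])
index.add_document("http://example.net/b.rdf",
                   [("http://example.net/id/al", FOAF_MBOX, "mailto:alice@example.org")])
by_uri = index.sources_for("http://example.org/people#alice")
by_mbox = index.sources_for_ifp(FOAF_MBOX, "mailto:alice@example.org")
```

The second lookup shows why inverse-functional properties matter here: the shared mailbox value connects sources that use different URIs for the same resource, which a lookup by URI alone would miss.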