Topic

Web resource

About: Web resource is a research topic. Over the lifetime, 1867 publications have been published within this topic receiving 30710 citations.

Papers

Journal Article • DOI: 10.1016/S1389-1286(99)00052-3

Focused crawling: a new approach to topic-specific Web resource discovery

Soumen Chakrabarti¹, Martin van den Berg², Byron Dom³

Institutions: Indian Institute of Technology Bombay¹, FX Palo Alto Laboratory², IBM³

17 May 1999

TL;DR: A new hypertext resource discovery system called a Focused Crawler that is robust against large perturbations in the starting set of URLs, and capable of exploring out and discovering valuable resources that are dozens of links away from the start set, while carefully pruning the millions of pages that may lie within this same radius.

Abstract: The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. In this paper we describe a new hypertext resource discovery system called a Focused Crawler. The goal of a focused crawler is to selectively seek out pages that are relevant to a pre-defined set of topics. The topics are specified not using keywords, but using exemplary documents. Rather than collecting and indexing all accessible Web documents to be able to answer all possible ad-hoc queries, a focused crawler analyzes its crawl boundary to find the links that are likely to be most relevant for the crawl, and avoids irrelevant regions of the Web. This leads to significant savings in hardware and network resources, and helps keep the crawl more up-to-date. To achieve such goal-directed crawling, we designed two hypertext mining programs that guide our crawler: a classifier that evaluates the relevance of a hypertext document with respect to the focus topics, and a distiller that identifies hypertext nodes that are great access points to many relevant pages within a few links. We report on extensive focused-crawling experiments using several topics at different levels of specificity. Focused crawling acquires relevant pages steadily while standard crawling quickly loses its way, even though they are started from the same root set. Focused crawling is robust against large perturbations in the starting set of URLs. It discovers largely overlapping sets of resources in spite of these perturbations. It is also capable of exploring out and discovering valuable resources that are dozens of links away from the start set, while carefully pruning the millions of pages that may lie within this same radius. Our anecdotes suggest that focused crawling is very effective for building high-quality collections of Web documents on specific topics, using modest desktop hardware. © 1999 Published by Elsevier Science B.V. All rights reserved.

1,790 citations
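
The crawl-boundary idea in the focused crawling abstract above can be illustrated with a minimal sketch. It is not the authors' system: the `relevance` function is a toy keyword-overlap stand-in for their classifier, the distiller is omitted, and `pages`, `links`, `budget`, and `threshold` are hypothetical names for an in-memory test graph rather than the live Web.

```python
import heapq

def relevance(text, topic_terms):
    """Toy stand-in for the classifier: fraction of topic terms found in the page."""
    words = set(text.lower().split())
    return len(words & topic_terms) / len(topic_terms)

def focused_crawl(pages, links, seeds, topic_terms, budget, threshold=0.5):
    """Best-first crawl over an in-memory graph (all inputs are hypothetical).

    pages: url -> page text; links: url -> list of outgoing urls.
    The frontier is a priority queue keyed by the relevance of the page that
    linked to each URL; links out of irrelevant pages are never enqueued.
    """
    frontier = [(-1.0, url) for url in seeds]  # seeds get top priority
    heapq.heapify(frontier)
    seen = set(seeds)
    collected = []
    while frontier and budget > 0:
        _, url = heapq.heappop(frontier)
        budget -= 1  # each fetch consumes one unit of crawl budget
        score = relevance(pages.get(url, ""), topic_terms)
        if score < threshold:
            continue  # prune: do not expand an irrelevant page
        collected.append(url)
        for nxt in links.get(url, []):
            if nxt not in seen:
                seen.add(nxt)
                heapq.heappush(frontier, (-score, nxt))
    return collected
```

Keying the queue on the linking page's score is one simple way to approximate "links that are likely to be most relevant"; a real crawler would also need politeness delays, fetching, and a trained classifier.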

Journal Article • DOI: 10.1007/S10593-014-1496-1

Prediction of the Biological Activity Spectra of Organic Compounds Using the Pass Online Web Resource

Dmitry Filimonov¹, Alexey Lagunin¹, Tatyana A. Gloriozova¹, A. V. Rudik¹, D. S. Druzhilovskii¹, Pavel V. Pogodin¹,², Vladimir Poroikov¹,²

Institutions: Russian Academy¹, Russian National Research Medical University²

28 May 2014 • Chemistry of Heterocyclic Compounds

TL;DR: The authors present PASS Online, a web resource that predicts the biological activity spectra of organic compounds from their structural formulas for more than 4000 types of biological activity with average accuracy above 95% (http://www.way2drug.com/passonline).

Abstract: The freely accessible web resource PASS Online is presented. This resource is designed for the prediction of the biological activity spectra of organic compounds based on their structural formulas for more than 4000 types of biological activity with average accuracy above 95% (http://www.way2drug.com/passonline). The prediction is based on an analysis of the structure-activity relationships in the training set containing information on the structure and biological activity of more than 300000 organic compounds. The possibilities and limitations of this approach are described. Recommendations are given for interpreting the prediction results. Examples are given for the practical use of the PASS Online web resource in order to establish priorities for chemical synthesis and biological testing of substances on the basis of prediction results. Further prospects are considered for using PASS Online as an Internet platform for joint projects of academic researchers for the search and development of new pharmaceutical agents.

957 citations

Book Chapter • DOI: 10.1007/3-540-45810-7_34

MnM: Ontology Driven Semi-automatic and Automatic Support for Semantic Markup

Maria Vargas-Vera¹, Enrico Motta¹, John Domingue¹, Mattia Lanzoni¹, Arthur Stutt¹, Fabio Ciravegna²

Institutions: Open University¹, University of Sheffield²

1 Oct 2002

TL;DR: MnM is presented, an annotation tool that provides both automated and semi-automated support for annotating web pages with semantic contents, integrates a web browser with an ontology editor, and provides open APIs to link to ontology servers and to integrate information extraction tools.

Abstract: An important precondition for realizing the goal of a semantic web is the ability to annotate web resources with semantic information. In order to carry out this task, users need appropriate representation languages, ontologies, and support tools. In this paper we present MnM, an annotation tool which provides both automated and semi-automated support for annotating web pages with semantic contents. MnM integrates a web browser with an ontology editor and provides open APIs to link to ontology servers and for integrating information extraction tools. MnM can be seen as an early example of the next generation of ontology editors, being web-based, oriented to semantic markup and providing mechanisms for large-scale automatic markup of web pages.

365 citations

Resource Records for the DNS Security Extensions

Matt Larson, Dan Massey, Scott Rose, Roy Arends, Rob Austein

1 Mar 2005

TL;DR: The DNS Security Extensions (DNSSEC) are a collection of resource records and protocol modifications that provide source authentication for the DNS; this document defines the public key (DNSKEY), delegation signer (DS), resource record digital signature (RRSIG), and authenticated denial of existence (NSEC) resource records.

Abstract: This document is part of a family of documents that describe the DNS Security Extensions (DNSSEC). The DNS Security Extensions are a collection of resource records and protocol modifications that provide source authentication for the DNS. This document defines the public key (DNSKEY), delegation signer (DS), resource record digital signature (RRSIG), and authenticated denial of existence (NSEC) resource records. The purpose and format of each resource record is described in detail, and an example of each resource record is given. This document obsoletes RFC 2535 and incorporates changes from all updates to RFC 2535. [STANDARDS-TRACK]

334 citations
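
As an illustration of the record formats the DNSSEC abstract above refers to, here is a minimal sketch that decodes a DNSKEY RDATA field and computes its key tag. The field layout (16-bit flags, 8-bit protocol, 8-bit algorithm, then the public key) and the key tag checksum follow RFC 4034, not the abstract itself; the function names are hypothetical, and the checksum shown does not apply to algorithm 1 (RSA/MD5), which uses a different rule.

```python
import struct

def parse_dnskey_rdata(rdata: bytes) -> dict:
    """Split DNSKEY RDATA into its fixed header fields and the public key."""
    flags, protocol, algorithm = struct.unpack("!HBB", rdata[:4])
    return {
        "flags": flags,
        "zone_key": bool(flags & 0x0100),  # Zone Key flag (bit 7)
        "sep": bool(flags & 0x0001),       # Secure Entry Point flag (bit 15)
        "protocol": protocol,              # 3 for DNSSEC
        "algorithm": algorithm,
        "public_key": rdata[4:],
    }

def key_tag(rdata: bytes) -> int:
    """16-bit key tag over the whole DNSKEY RDATA (not valid for algorithm 1)."""
    acc = 0
    for i, byte in enumerate(rdata):
        # Even offsets contribute the high octet, odd offsets the low octet.
        acc += byte if i & 1 else byte << 8
    acc += (acc >> 16) & 0xFFFF
    return acc & 0xFFFF
```

The key tag is only a hint for selecting candidate keys; it is not unique, so a validator still has to check the signature against each matching key.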

Book Chapter • DOI: 10.1007/978-3-540-76298-0_40

Sindice.com: weaving the open linked data

Giovanni Tummarello¹, Renaud Delbru¹, Eyal Oren¹

Institutions: National University of Ireland, Galway¹

11 Nov 2007

TL;DR: Sindice, a lookup index over resources crawled on the Semantic Web, allows applications to automatically retrieve sources with information about a given resource and allows resource retrieval through inverse-functional properties.

Abstract: Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The "linked data" approach, which mandates that resource URIs should be de-referenced and yield meta-data about the resource, helps but is only a partial solution. We present Sindice, a lookup index over resources crawled on the Semantic Web. Our index allows applications to automatically retrieve sources with information about a given resource. In addition we allow resource retrieval through inverse-functional properties, offer full-text search and index SPARQL endpoints.

329 citations
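
The lookup idea in the Sindice abstract above can be sketched as two inverted indexes: one from resource URIs to the documents that mention them, and one from inverse-functional property values to documents. This is an assumption-laden toy, not Sindice's implementation: the `LookupIndex` class, its method names, and the in-memory triple format are all hypothetical, and full-text search and SPARQL endpoint indexing are left out.

```python
from collections import defaultdict

class LookupIndex:
    """Toy lookup index over (subject, predicate, object) triples per source document."""

    def __init__(self, inverse_functional_properties):
        self.ifps = set(inverse_functional_properties)
        self.by_uri = defaultdict(set)  # resource URI -> source documents
        self.by_ifp = defaultdict(set)  # (property, value) -> source documents

    def add_document(self, source, triples):
        for subject, predicate, obj in triples:
            self.by_uri[subject].add(source)
            if obj.startswith(("http://", "https://")):
                self.by_uri[obj].add(source)  # objects that are resources count too
            if predicate in self.ifps:
                self.by_ifp[(predicate, obj)].add(source)

    def lookup_uri(self, uri):
        """Sources that say something about the given resource URI."""
        return sorted(self.by_uri.get(uri, ()))

    def lookup_ifp(self, predicate, value):
        """Sources describing whichever resource carries this identifying value."""
        return sorted(self.by_ifp.get((predicate, value), ()))
```

Because an inverse-functional property value identifies at most one resource, the second index can find documents about the same entity even when they use different URIs for it.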

Performance Metrics

Papers: 1,907

Citations: 14,288

No. of papers in the topic in previous years
Year	Papers
2025	2
2024	3
2023	7
2022	11
2021	37
2020	53
