Automatically indexing documents: content vs. reference

doi:10.1145/502716.502746

Proceedings Article•10.1145/502716.502746

Shannon Bradshaw, +1 more

- 13 Jan 2002

- pp 180-181

24

TL;DR: The paper indicates that reference identifies the value of documents more accurately and with a greater diversity of language than content does, so indexing documents by reference is superior to indexing them by their content.

Citations

Patent

Automatic method and system for formulating and transforming representations of context used by information services

Kristian J. Hammond, +2 more

- 30 Jul 2003

TL;DR: An information retrieval system automatically retrieves information related to the context of an active task being manipulated by a user. The system observes the operation of the active task and user interactions and uses predetermined criteria to generate a context representation.

206

Patent

Method and system for assessing relevant properties of work contexts for use by information services

Kristian J. Hammond, +2 more

- 11 Apr 2014

TL;DR: An information retrieval system automatically retrieves information related to the context of an active task being manipulated by a user. The system observes the operation of the active task and user interactions and uses predetermined criteria to generate a context representation, from which it generates queries or search terms for conducting an information search.

125

Patent

Query preprocessing and pipelining

Eric B. Watson, +3 more

- 26 Jan 2004

TL;DR: The authors modify queries by grouping terms as phrases, correcting spelling errors, and augmenting the query with category terms that trigger query execution on certain data sources to better reflect the user's intent.

49

Patent

Index partitioning based on document relevance for document indexes

Darren A. Shakib, +2 more

- 22 Jan 2004

TL;DR: Index queries reference the first partition and move to a subsequent partition when a static rank for the subsequent partition is higher than a weighted portion of the target score added to a weighted portion of a dynamic rank corresponding to the relevance of the result set generated thus far.

45

Journal Article•10.1177/0165551511417785

An unsupervised approach to automatic classification of scientific literature utilizing bibliographic metadata

Arash Joorabchi, +1 more

- 01 Oct 2011

- Journal of Information Science

TL;DR: An unsupervised approach is presented for automatic classification of scientific literature archived in digital libraries and repositories according to a standard library classification scheme. The approach is based on identifying all the references cited in the document to be classified.

35

References

Journal Article•10.1016/S0169-7552(98)00110-X

The anatomy of a large-scale hypertextual Web search engine

Sergey Brin, +1 more

- 01 Apr 1998

TL;DR: This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.

16.6K

Journal Article

The Anatomy of a Large-Scale Hypertextual Web Search Engine.

Sergey Brin, +1 more

- 01 Jan 1998

- Computer Networks

TL;DR: Google is presented as a prototype of a large-scale search engine that makes heavy use of the structure present in hypertext and is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.

13.3K

Proceedings Article

The Anatomy of Large-scale Hypertextual Web Search Engine

S. Brin

- 01 Jan 1998

TL;DR: We present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext to produce better search results.

9.7K

Journal Article•10.1145/32206.32212

The vocabulary problem in human-system communication

George W. Furnas, +3 more

- 01 Nov 1987

- Communications of The ACM

TL;DR: It is shown how the variability of people's word choice, a fundamental property of language, limits the success of various design methodologies for vocabulary-driven interaction, and an optimal strategy, unlimited aliasing, is derived and shown to be capable of several-fold improvements.

1.6K

Journal Article•10.1002/1097-4571(2000)9999:9999<::AID-ASI1591>3.3.CO;2-I

Searching the Web: the public and their queries

Amanda Spink, +3 more

- 01 Feb 2001

- Journal of the Association for Informati...

TL;DR: It is found that most people use few search terms, few modified queries, view few Web pages, and rarely use advanced search features, and the language of Web queries is distinctive.

1.1K

Related Papers (5)

A vector space model for automatic indexing

Syskill & Webert: Identifying interesting web sites

SAVVYSEARCH: A Metasearch Engine That Learns Which Search Engines to Query

Systems, methods, and interfaces for providing personalized search and information access

The Lumière project: Bayesian user modeling for inferring the goals and needs of software users