Journal Article, DOI: 10.1145/297117.297123
Methods for information server selection
TL;DR: A novel method using Lightweight Probe queries (LWP method) is compared with several methods based on data from past query processing, while Random and Optimal server rankings serve as controls.
Abstract: This paper addresses the problem of using a broker to select a subset of available information servers so as to achieve a good trade-off between document retrieval effectiveness and cost. It investigates server selection methods that can operate in the absence of global information and where servers have no knowledge of brokers. A novel method using Lightweight Probe queries (LWP method) is compared with several methods based on data from past query processing, while Random and Optimal server rankings serve as controls. Methods are evaluated, using TREC data and relevance judgments, by computing ratios, both empirical and ideal, of recall and early precision for the subset versus the complete set of available servers. Estimates are also made of the best-possible performance of each of the methods. The LWP and Topic Similarity methods achieved the best results, each being capable of retrieving about 60% of the relevant documents for only one-third of the cost of querying all servers. Subject to the applicable cost model, the LWP method is likely to be preferred because it is suited to dynamic environments. The good results obtained with a simple automatic LWP implementation were replicated using different data and a larger set of query topics.
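The evaluation idea in the abstract, comparing recall for a selected subset of servers against recall for the complete set, can be illustrated with a small sketch. This is only a simplified reading of the abstract, not the paper's actual procedure: the function names `rank_servers` and `recall_ratio`, the per-server probe scores, and the toy relevance data are all hypothetical, and the ratio computed here assumes each selected server returns every relevant document it holds (roughly an "ideal" rather than "empirical" ratio).

```python
from typing import Dict, List, Set


def rank_servers(probe_scores: Dict[str, float]) -> List[str]:
    """Order servers by descending score; ties are broken by server name.

    The scores are assumed to come from some selection method (for example,
    a lightweight probe query); how they are computed is not modelled here.
    """
    return sorted(probe_scores, key=lambda name: (-probe_scores[name], name))


def recall_ratio(selected: List[str],
                 relevant_by_server: Dict[str, Set[str]]) -> float:
    """Share of all known relevant documents held by the selected servers."""
    all_relevant: Set[str] = set().union(*relevant_by_server.values())
    if not all_relevant:
        return 0.0
    found: Set[str] = set()
    for name in selected:
        found |= relevant_by_server.get(name, set())
    return len(found) / len(all_relevant)


# Toy data (hypothetical): three servers and four relevant documents.
probe_scores = {"A": 0.9, "B": 0.1, "C": 0.5}
relevant_by_server = {"A": {"d1", "d2"}, "B": {"d3"}, "C": {"d4"}}

ranking = rank_servers(probe_scores)
top_one = ranking[:1]  # query one server out of three
ratio = recall_ratio(top_one, relevant_by_server)
print(ranking, ratio)
```

With this toy data, querying only the top-ranked server reaches half of the relevant documents at one-third of the cost of querying all three, if cost is taken to be the number of servers queried. Random and Optimal rankings could be plugged into the same `recall_ratio` function as controls.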