A constraint to automatically regulate document-length normalisation

doi:10.1145/2396761.2398662

Proceedings Article10.1145/2396761.2398662

A constraint to automatically regulate document-length normalisation

Ronan Cummins, +1 more

- 29 Oct 2012

- pp 2443-2446

16

TL;DR: This paper formally describes the interaction between query-terms and document length normalisation using a constraint, and develops a general pre-retrieval approach to adapt a number of state-of-the-art ranking functions so that they adhere to the constraint.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1145/3471158.3472256

Towards Axiomatic Explanations for Neural Ranking Models

Michael Völske, +6 more

- 11 Jul 2021

TL;DR: In this article, the authors investigate whether neural ranking models can be explained in terms of well-studied principles of document ranking by using established theories from axiomatic~IR, and propose a set of axioms to reproduce ranking decisions based on combinations of elementary constraints.

...read moreread less

27

Proceedings Article•10.1145/3077136.3080761

Improving Retrieval Performance for Verbose Queries via Axiomatic Analysis of Term Discrimination Heuristic

Mozhdeh Ariannezhad, +3 more

- 07 Aug 2017

TL;DR: This paper proposes a constraint to model the interaction between query length and IDF, and suggests a modification to adapt BM25 so that it adheres to the new constraint.

...read moreread less

14

•Proceedings Article•10.1145/2872427.2883009

A Study of Retrieval Models for Long Documents and Queries in Information Retrieval

Ronan Cummins

- 11 Apr 2016

TL;DR: This paper formally analyse two important but distinct reasons for normalising documents with respect to length, namely verbosity and scope, and develops a new discriminative query language modelling approach that demonstrates improved performance on long verbose queries by appropriately weighting salient aspects of the query.

...read moreread less

11

Proceedings Article•10.1145/2806416.2806592

A Study of Query Length Heuristics in Information Retrieval

Yuanhua Lv

- 17 Oct 2015

TL;DR: It is revealed that query length actually interacts with term frequency (TF) normalization, a key component of all effective retrieval models and that, in order to solve this problem, the TF normalization component in a retrieval function should be adapted to query length.

...read moreread less

8

Journal Article•10.1016/J.IPM.2017.09.006

Verbosity normalized pseudo-relevance feedback in information retrieval

Seung-Hoon Na, +1 more

- 01 Mar 2018

- Information Processing and Management

TL;DR: The results of the experiments show that the proposed verbosity normalized pseudo-relevance feedback consistently provides statistically significant improvements over conventional methods, under the settings of the relevance model and latent concept expansion.

...read moreread less

7

...

Expand

References

Journal Article•10.1016/S0306-4573(00)00015-7

A probabilistic model of information retrieval: development and comparative experiments

K. Sparck Jones, +3 more

- 06 Nov 2000

- Information Processing and Management

TL;DR: The paper combines a comprehensive account of the probabilistic model of retrieval with new systematic experiments on TREC Programme material, and presents the model from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions.

...read moreread less

1.2K

Journal Article•10.1145/582415.582416

Probabilistic models of information retrieval based on measuring the divergence from randomness

Gianni Amati, +1 more

- 01 Oct 2002

- ACM Transactions on Information Systems

TL;DR: A framework for deriving probabilistic models of Information Retrieval using term-weighting models obtained in the language model approach by measuring the divergence of the actual term distribution from that obtained under a random process is introduced.

...read moreread less

1K

•Journal Article•10.1145/3130348.3130365

Pivoted document length normalization

Amit Singhal, +2 more

- 18 Aug 1996

TL;DR: Pivoted normalization is presented, a technique that can be used to modify any normalization function thereby reducing the gap between the relevance and the retrieval probabilities, and two new normalization functions--pivoted unique normalization and piuotert byte size normalization are presented.

...read moreread less

989

•Proceedings Article•10.1145/1008992.1009004

A formal study of information retrieval heuristics

Hui Fang, +2 more

- 25 Jul 2004

TL;DR: A formal study of retrieval heuristics is presented and it is found that the empirical performance of a retrieval formula is tightly related to how well it satisfies basic desirable constraints.

...read moreread less

396

Proceedings Article•10.1145/1076034.1076116

An exploration of axiomatic approaches to information retrieval

Hui Fang, +1 more

- 15 Aug 2005

TL;DR: This paper proposes a new axiomatic approach to developing retrieval models based on direct modeling of relevance with formalized retrieval constraints defined at the level of terms, and derives several new retrieval functions using this framework.

...read moreread less

204