Learning Neural Ranking Models Online from Implicit User Feedback

doi:10.1145/3485447.3512250

Open Access•Proceedings Article•10.1145/3485447.3512250

- 17 Jan 2022

7

TL;DR: This work proposes to directly learn a neural ranking model from users’ implicit feedback, focusing on RankNet and LambdaRank, and proves that under standard assumptions the OL2R solution achieves a gap-dependent upper regret bound of $O(\log^2(T))$, in which the regret is defined on the total number of mis-ordered pairs over T rounds.

Abstract: Existing online learning to rank (OL2R) solutions are limited to linear models, which are incompetent to capture possible non-linear relations between queries and documents. In this work, to unleash the power of representation learning in OL2R, we propose to directly learn a neural ranking model from users’ implicit feedback (e.g., clicks) collected on the fly. We focus on RankNet and LambdaRank, due to their great empirical success and wide adoption in offline settings, and control the notorious explore-exploit trade-off based on the convergence analysis of neural networks using neural tangent kernel. Specifically, in each round of result serving, exploration is only performed on document pairs where the predicted rank order between the two documents is uncertain; otherwise, the ranker’s predicted order will be followed in result ranking. We prove that under standard assumptions our OL2R solution achieves a gap-dependent upper regret bound of $O(\log^2(T))$, in which the regret is defined on the total number of mis-ordered pairs over T rounds. Comparisons against an extensive set of state-of-the-art OL2R baselines on two public learning to rank benchmark datasets demonstrate the effectiveness of the proposed solution.
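The pairwise exploration rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the constant confidence `width`, and the adjacent-swap exploration step are all assumptions made for illustration, whereas the paper derives its per-pair confidence from a neural tangent kernel analysis. The sketch shows a RankNet-style pairwise loss, a test for whether a pair's predicted order is certain, a ranking step that only perturbs uncertain pairs, and a per-round count of mis-ordered pairs (the quantity the regret sums over rounds).

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def ranknet_pair_loss(s_i, s_j):
    """RankNet cross-entropy loss when document i is preferred over document j."""
    return math.log1p(math.exp(-(s_i - s_j)))


def certain_pairs(scores, width):
    """Return pairs (i, j) whose predicted order 'i above j' is treated as certain.

    `width` is a hypothetical constant confidence width (an assumption);
    a pair is certain when the lower bound of sigmoid(s_i - s_j) exceeds 0.5.
    """
    n = len(scores)
    return {
        (i, j)
        for i in range(n)
        for j in range(n)
        if i != j and sigmoid(scores[i] - scores[j]) - width > 0.5
    }


def rank_with_exploration(scores, width, rng):
    """Start from the score-sorted order and randomly swap adjacent uncertain pairs.

    Only uncertain pairs are ever swapped, so every certain order is preserved.
    This is a simplification, not the paper's ranking procedure.
    """
    certain = certain_pairs(scores, width)
    ranking = sorted(range(len(scores)), key=lambda d: -scores[d])
    for k in range(len(ranking) - 1):
        upper, lower = ranking[k], ranking[k + 1]
        if (upper, lower) not in certain and rng.random() < 0.5:
            ranking[k], ranking[k + 1] = lower, upper
    return ranking


def misordered_pairs(ranking, relevance):
    """Count pairs where a less relevant document is ranked above a more relevant one."""
    count = 0
    for a in range(len(ranking)):
        for b in range(a + 1, len(ranking)):
            if relevance[ranking[a]] < relevance[ranking[b]]:
                count += 1
    return count


if __name__ == "__main__":
    rng = random.Random(0)
    scores = [2.5, 2.4, 0.1]          # hypothetical ranker outputs
    relevance = [1, 2, 0]             # hypothetical ground-truth labels
    ranking = rank_with_exploration(scores, width=0.2, rng=rng)
    print(ranking, misordered_pairs(ranking, relevance))
```

In this toy setup the first two documents have nearly equal scores, so their pair is uncertain and may be swapped, while the third document always stays last because both pairs involving it are certain.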

Citations

Journal Article•10.1109/access.2023.3295776

Information Retrieval: Recent Advances and Beyond

Kailash A. Hambarde, +1 more

- 01 Jan 2023

- IEEE Access

TL;DR: Information retrieval models and techniques overviewed. Covers state-of-the-art models and challenges faced in the field.

27

Journal Article•10.1016/j.ipm.2023.103525

GPR-OPT: A Practical Gaussian optimization criterion for implicit recommender systems

Ting Bai, +5 more

- 01 Jan 2024

TL;DR: GPR-OPT, a novel Gaussian optimization criterion, learns user preference confidence from implicit feedback, assuming user interests follow Gaussian distributions, and achieves 11.64% average improvement over collaborative filtering algorithms on three real-world datasets.

4

Proceedings Article•10.1145/3543507.3583429

TRAVERS: A Diversity-Based Dynamic Approach to Iterative Relevance Search over Knowledge Graphs

Ziyang Li, +4 more

- 30 Apr 2023

TL;DR: TRAVERS is a labeling-based iterative relevance search approach in which the user labels the current answer entities and is rewarded with improved answer entities in the next iteration.

1

Journal Article•10.1016/j.knosys.2023.111299

A holistic view on positive and negative implicit feedback for micro-video recommendation

Pan Gu, +1 more

- 01 Dec 2023

- Knowledge Based Systems

TL;DR: This study proposes HMNet, a holistic framework for micro-video recommendation, leveraging multiplex networks to capture sequential and non-sequential dependencies between positive and negative implicit feedback, outperforming state-of-the-art methods on two real-world datasets.

Book Chapter•10.1007/978-3-031-56063-7_7

How to Forget Clients in Federated Online Learning to Rank?

Shuyi Wang, +2 more

- 01 Jan 2024

- Lecture Notes in Computer Science

TL;DR: Studies how to forget clients in Federated Online Learning to Rank (FORL) systems while maintaining model effectiveness and privacy, unlearning a client's contributions without re-training the global model.

References

•Proceedings Article

Neural Tangent Kernel: Convergence and Generalization in Neural Networks

Arthur Jacot, +2 more

- 20 Jun 2018

TL;DR: This talk will introduce this formalism and give a number of results on the Neural Tangent Kernel and explain how they give us insight into the dynamics of neural networks during training and into their generalization features.

2.8K

•Posted Content

Introduction to the non-asymptotic analysis of random matrices

Roman Vershynin

- 12 Nov 2010

- arXiv: Probability

TL;DR: A tutorial on some basic nonasymptotic methods and concepts in random matrix theory can be found in this paper, where the reader can learn several tools for the analysis of the extreme singular values of random matrices with independent rows or columns.

2K

•Proceedings Article

Improved Algorithms for Linear Stochastic Bandits

Yasin Abbasi-Yadkori, +2 more

- 12 Dec 2011

TL;DR: A simple modification of Auer's UCB algorithm achieves with high probability constant regret and improves the regret bound by a logarithmic factor, though experiments show a vast improvement.

1.7K

•Proceedings Article•10.1145/1076034.1076063

Accurately interpreting clickthrough data as implicit feedback

Thorsten Joachims, +3 more

- 15 Aug 2005

TL;DR: It is concluded that clicks are informative but biased, and while this makes the interpretation of clicks as absolute relevance judgments difficult, it is shown that relative preferences derived from clicks are reasonably accurate on average.

1.6K

From RankNet to LambdaRank to LambdaMART: An Overview

Christopher J. C. Burges

- 23 Jun 2010

TL;DR: RankNet, LambdaRank, and LambdaMART have proven to be very successful algorithms for solving real world ranking problems and the details are spread across several papers and reports, so here is a self-contained, detailed and complete description of them.

1.3K

Related Papers (5)

Northeastern University in TREC 2009 Million Query Track

LETOR: A benchmark collection for research on learning to rank for information retrieval

Learning to rank with document ranks and scores

Tri-Rank: An Authority Ranking Framework in Heterogeneous Academic Networks by Mutual Reinforce

Scaling Learning to Rank to Big Data: Using MapReduce to Parallelise Learning to Rank