Learning Neural Ranking Models Online from Implicit User Feedback
Yiling Jia, Hongning Wang
- 17 Jan 2022
TL;DR: This work proposes to directly learn a neural ranking model from users’ implicit feedback, focusing on RankNet and LambdaRank, and proves that under standard assumptions the OL2R solution achieves a gap-dependent upper regret bound of O(log²(T)), in which the regret is defined on the total number of mis-ordered pairs over T rounds.
Abstract: Existing online learning to rank (OL2R) solutions are limited to linear models, which cannot capture possible non-linear relations between queries and documents. In this work, to unleash the power of representation learning in OL2R, we propose to directly learn a neural ranking model from users’ implicit feedback (e.g., clicks) collected on the fly. We focus on RankNet and LambdaRank, due to their great empirical success and wide adoption in offline settings, and control the notorious explore-exploit trade-off based on the convergence analysis of neural networks using the neural tangent kernel. Specifically, in each round of result serving, exploration is performed only on document pairs where the predicted rank order between the two documents is uncertain; otherwise, the ranker’s predicted order is followed in result ranking. We prove that under standard assumptions our OL2R solution achieves a gap-dependent upper regret bound of O(log²(T)), in which the regret is defined on the total number of mis-ordered pairs over T rounds. Comparisons against an extensive set of state-of-the-art OL2R baselines on two public learning to rank benchmark datasets demonstrate the effectiveness of the proposed solution.
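The pairwise logic described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the confidence width is passed in as a plain number (the paper derives its own width from a neural tangent kernel analysis, which is not reproduced here), and the scores stand in for the outputs of some neural ranker. The sketch shows a RankNet-style pairwise probability, the split of document pairs into certain and uncertain rank orders, and a count of mis-ordered pairs of the kind the regret is defined on.

```python
import math
from itertools import combinations


def pair_prob(score_i, score_j):
    """RankNet-style probability that document i should rank above document j."""
    return 1.0 / (1.0 + math.exp(-(score_i - score_j)))


def split_pairs(scores, width):
    """Split document pairs into certain and uncertain rank orders.

    `width` is a hypothetical confidence width supplied by the caller;
    it is an assumption of this sketch, not the paper's derived quantity.
    """
    certain, uncertain = [], []
    for i, j in combinations(range(len(scores)), 2):
        p = pair_prob(scores[i], scores[j])
        # The order counts as certain only if the whole interval
        # [p - width, p + width] lies on one side of 1/2.
        if p - width > 0.5:
            certain.append((i, j))    # i is placed above j
        elif p + width < 0.5:
            certain.append((j, i))    # j is placed above i
        else:
            uncertain.append((i, j))  # candidate pair for exploration
    return certain, uncertain


def misordered_pairs(ranking, relevance):
    """Count pairs where a less relevant document sits above a more relevant one."""
    count = 0
    for a, b in combinations(range(len(ranking)), 2):
        if relevance[ranking[a]] < relevance[ranking[b]]:
            count += 1
    return count
```

For example, with scores `[2.0, 0.0, 0.1]` and a width of `0.1`, the first document is confidently ranked above the other two, while the order between the second and third remains uncertain and would be the only pair eligible for exploration.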
Citations
Information Retrieval: Recent Advances and Beyond
Kailash A. Hambarde, Hugo Proença
TL;DR: An overview of information retrieval models and techniques, covering state-of-the-art models and the challenges faced in the field.
27
GPR-OPT: A Practical Gaussian optimization criterion for implicit recommender systems
Ting Bai, Xiaotong Wang, Zhenhao Zhang, Wei Song, Bin Wu, Jian-Yun Nie
- 01 Jan 2024
TL;DR: GPR-OPT, a novel Gaussian optimization criterion, learns user preference confidence from implicit feedback, assuming user interests follow Gaussian distributions, and achieves 11.64% average improvement over collaborative filtering algorithms on three real-world datasets.
4
TRAVERS: A Diversity-Based Dynamic Approach to Iterative Relevance Search over Knowledge Graphs
Ziyang Li, Yu Gu, Yulin Shen, Wei Hu, Gong Cheng
- 30 Apr 2023
TL;DR: TRAVERS is a labeling-based iterative relevance search approach in which the user effortlessly labels the current answer entities and is rewarded with improved answer entities in the next iteration.
1
A holistic view on positive and negative implicit feedback for micro-video recommendation
Pan Gu, Haiyang Hu
TL;DR: This study proposes HMNet, a holistic framework for micro-video recommendation, leveraging multiplex networks to capture sequential and non-sequential dependencies between positive and negative implicit feedback, outperforming state-of-the-art methods on two real-world datasets.
How to Forget Clients in Federated Online Learning to Rank?
Shuyi Wang, Bing Liu, Guido Zuccon
TL;DR: This work studies how to forget clients in Federated Online Learning to Rank (FORL) systems while maintaining model effectiveness and privacy, unlearning a client's contributions without re-training the global model.
References
•Proceedings Article
Neural Tangent Kernel: Convergence and Generalization in Neural Networks
Arthur Jacot, Franck Gabriel, Clément Hongler
- 20 Jun 2018
TL;DR: This talk will introduce this formalism and give a number of results on the Neural Tangent Kernel and explain how they give us insight into the dynamics of neural networks during training and into their generalization features.
•Posted Content
Introduction to the non-asymptotic analysis of random matrices
TL;DR: A tutorial on some basic non-asymptotic methods and concepts in random matrix theory, giving the reader several tools for analyzing the extreme singular values of random matrices with independent rows or columns.
2K
•Proceedings Article
Improved Algorithms for Linear Stochastic Bandits
Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári
- 12 Dec 2011
TL;DR: A simple modification of Auer's UCB algorithm achieves with high probability constant regret and improves the regret bound by a logarithmic factor, though experiments show a vast improvement.
Accurately interpreting clickthrough data as implicit feedback
Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke
- 15 Aug 2005
TL;DR: It is concluded that clicks are informative but biased, and while this makes the interpretation of clicks as absolute relevance judgments difficult, it is shown that relative preferences derived from clicks are reasonably accurate on average.
From RankNet to LambdaRank to LambdaMART: An Overview
Christopher J. C. Burges
- 23 Jun 2010
TL;DR: RankNet, LambdaRank, and LambdaMART have proven to be very successful algorithms for solving real world ranking problems and the details are spread across several papers and reports, so here is a self-contained, detailed and complete description of them.