Proceedings Article10.1145/1458082.1458233
Trada: tree based ranking function adaptation
Keke Chen,Rongqing Lu,C. K. Wong,Gordon Sun,Larry Heck,Belle L. Tseng +5 more
- 26 Oct 2008
- pp 1143-1152
TL;DR: Tree adaptation assumes that ranking functions are trained with regression-tree based modeling methods, such as Gradient Boosting Trees, and takes such a ranking function from one domain and tunes its tree-based structure with a small amount of training data from the target domain.
read more
Abstract: Machine Learned Ranking approaches have shown successes in web search engines. With the increasing demands on developing effective ranking functions for different search domains, we have seen a big bottleneck, i.e., the problem of insufficient training data, which has significantly limited the fast development and deployment of machine learned ranking functions for different web search domains. In this paper, we propose a new approach called tree based ranking function adaptation ("tree adaptation") to address this problem. Tree adaptation assumes that ranking functions are trained with regression-tree based modeling methods, such as Gradient Boosting Trees. It takes such a ranking function from one domain and tunes its tree-based structure with a small amount of training data from the target domain. The unique features include (1) it can automatically identify the part of model that needs adjustment for the new domain, (2) it can appropriately weight training examples considering both local and global distributions. Experiments are performed to show that tree adaptation can provide better-quality ranking functions for a new domain, compared to other modeling methods.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Adapting boosting for information retrieval measures
TL;DR: This work presents a new ranking algorithm that combines the strengths of two previous methods: boosted tree classification, and LambdaRank, and shows how to find the optimal linear combination for any two rankers, and uses this method to solve the line search problem exactly during boosting.
Adapting boosting for information retrieval measures
Qiang Wu,C. Burges,Krysta M. Svore,Jianfeng Gao +3 more
- 01 Jun 2010
TL;DR: This work presents a new ranking algorithm that combines the strengths of two previous methods: boosted tree classification, and LambdaRank, and shows how to find the optimal linear combination for any two rankers, and uses this method to solve the line search problem exactly during boosting.
415
Stochastic gradient boosted distributed decision trees
Jerry Ye,Jyh-Herng Chow,Jiang Chen,Zhaohui Zheng +3 more
- 02 Nov 2009
TL;DR: Two different distributed methods that generates exact stochastic GBDT models are presented, the first is a MapReduce implementation and the second utilizes MPI on the Hadoop grid environment.
384
•Proceedings Article
Domain Adaptation with Coupled Subspaces
John Blitzer,Sham M. Kakade,Dean P. Foster +2 more
- 14 Jun 2011
TL;DR: This work formalizes the intuition that if the authors can link target-specific features to source features, they can learn effectively using only source labeled data and gives finite sample target error bounds and an algorithm which performs at the state-of-the-art on two natural language processing adaptation tasks which are characterized by novel target features.
Learning to rank with extremely randomized trees.
Pierre Geurts,Gilles Louppe +1 more
- 26 Jan 2011
TL;DR: This article reported on their experiments on the Yahoo! Labs Learning to Rank challenge organized in the context of the 23rd International Conference of Machine Learning (ICML 2010) and showed that ensembles of randomized trees are quite competitive for the learning to rank problem.
References
•Book
The Elements of Statistical Learning
Trevor Hastie,Robert Tibshirani,Jerome H. Friedman +2 more
- 01 Jan 2001
29.4K
Greedy function approximation: A gradient boosting machine.
TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
The Elements of Statistical Learning
TL;DR: Chapter 11 includes more case studies in other areas, ranging from manufacturing to marketing research, and a detailed comparison with other diagnostic tools, such as logistic regression and tree-based methods.
15.5K
•Book
Modern Information Retrieval
Ricardo Baeza-Yates,Berthier Ribeiro-Neto +1 more
- 15 May 1999
TL;DR: In this article, the authors present a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective, which provides an up-to-date student oriented treatment of the subject.
Optimizing search engines using clickthrough data
Thorsten Joachims
- 23 Jul 2002
TL;DR: The goal of this paper is to develop a method that utilizes clickthrough data for training, namely the query-log of the search engine in connection with the log of links the users clicked on in the presented ranking.
4.9K