Trada: tree based ranking function adaptation

doi:10.1145/1458082.1458233

Proceedings Article•10.1145/1458082.1458233

Keke Chen, +5 more

- 26 Oct 2008

- pp 1143-1152

54

TL;DR: Tree adaptation assumes that ranking functions are trained with regression-tree based modeling methods, such as Gradient Boosting Trees, and takes such a ranking function from one domain and tunes its tree-based structure with a small amount of training data from the target domain.

Abstract: Machine-learned ranking approaches have proven successful in web search engines. With the increasing demand for effective ranking functions for different search domains, insufficient training data has become a major bottleneck, significantly limiting the fast development and deployment of machine-learned ranking functions for different web search domains. In this paper, we propose a new approach called tree based ranking function adaptation ("tree adaptation") to address this problem. Tree adaptation assumes that ranking functions are trained with regression-tree based modeling methods, such as Gradient Boosting Trees. It takes such a ranking function from one domain and tunes its tree-based structure with a small amount of training data from the target domain. Its unique features are that (1) it can automatically identify the part of the model that needs adjustment for the new domain, and (2) it can appropriately weight training examples considering both local and global distributions. Experiments show that tree adaptation can provide better-quality ranking functions for a new domain than other modeling methods.
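The idea of tuning a source-domain tree with a small target sample can be illustrated with a minimal sketch. The abstract does not give the adaptation formulas, so everything below is an assumption made for illustration: the `Node` class, the `adapt_leaves` function, the `beta` parameter, and the count-weighted blending rule are hypothetical and are not taken from the paper. The sketch keeps the source tree's splits fixed and only re-estimates leaf responses, weighting source and target evidence by how many examples reached each leaf (a local effect) and by a global factor `beta`.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Node:
    # Internal nodes use feature/threshold; leaves use value.
    feature: Optional[int] = None
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    value: float = 0.0   # leaf response fitted on the source domain
    n_source: int = 0    # source examples that reached this leaf

    @property
    def is_leaf(self) -> bool:
        return self.left is None and self.right is None


def predict(node: Node, x: Sequence[float]) -> float:
    """Route one example down the tree and return the leaf response."""
    while not node.is_leaf:
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.value


def adapt_leaves(node: Node, xs: List[Sequence[float]], ys: List[float],
                 beta: float = 1.0) -> None:
    """Blend each leaf's source response with the target mean (illustrative rule).

    Assumed rule, not the paper's: new = (n_s * v_s + beta * n_t * v_t) / (n_s + beta * n_t).
    """
    if node.is_leaf:
        n_t = len(ys)
        w_s, w_t = node.n_source, beta * n_t
        if n_t == 0 or w_s + w_t == 0:
            return  # no usable target evidence: keep the source response
        target_mean = sum(ys) / n_t
        node.value = (w_s * node.value + w_t * target_mean) / (w_s + w_t)
        return
    # Split the target examples with the unchanged source-domain split.
    go_left = [x[node.feature] <= node.threshold for x in xs]
    adapt_leaves(node.left,
                 [x for x, g in zip(xs, go_left) if g],
                 [y for y, g in zip(ys, go_left) if g], beta)
    adapt_leaves(node.right,
                 [x for x, g in zip(xs, go_left) if not g],
                 [y for y, g in zip(ys, go_left) if not g], beta)


# Toy source tree: one split on feature 0 at 0.5.
tree = Node(feature=0, threshold=0.5,
            left=Node(value=1.0, n_source=9),
            right=Node(value=3.0, n_source=2))

# Three labeled examples from the (hypothetical) target domain.
target_x = [[0.2], [0.9], [0.8]]
target_y = [2.0, 1.0, 2.0]
adapt_leaves(tree, target_x, target_y, beta=1.0)

print(predict(tree, [0.2]), predict(tree, [0.9]))
```

In this toy run the left leaf, backed by nine source examples, moves only slightly toward the single target label, while the right leaf, backed by two source examples, moves halfway toward the target mean. A fuller adaptation in the spirit of the abstract would also revisit the splits themselves, which this sketch deliberately leaves out.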

Citations

Journal Article•10.1007/S10791-009-9112-1

Adapting boosting for information retrieval measures

Qiang Wu, +3 more

- 01 Jun 2010

- Information Retrieval

TL;DR: This work presents a new ranking algorithm that combines the strengths of two previous methods, boosted tree classification and LambdaRank. It also shows how to find the optimal linear combination for any two rankers and uses this method to solve the line search problem exactly during boosting.

655

Proceedings Article•10.1145/1645953.1646301

Stochastic gradient boosted distributed decision trees

Jerry Ye, +3 more

- 02 Nov 2009

TL;DR: Two different distributed methods that generate exact stochastic GBDT models are presented: the first is a MapReduce implementation, and the second uses MPI on the Hadoop grid environment.

384

Proceedings Article

Domain Adaptation with Coupled Subspaces

John Blitzer, +2 more

- 14 Jun 2011

TL;DR: This work formalizes the intuition that if target-specific features can be linked to source features, one can learn effectively using only source labeled data. It gives finite sample target error bounds and an algorithm that performs at the state of the art on two natural language processing adaptation tasks characterized by novel target features.

140

Learning to rank with extremely randomized trees.

Pierre Geurts, +1 more

- 26 Jan 2011

TL;DR: This article reports on experiments for the Yahoo! Labs Learning to Rank challenge, organized in the context of the 27th International Conference on Machine Learning (ICML 2010), and shows that ensembles of randomized trees are quite competitive for the learning to rank problem.

49

References

Book

The Elements of Statistical Learning

Trevor Hastie, +2 more

- 01 Jan 2001

29.4K

Journal Article•10.1214/AOS/1013203451

Greedy function approximation: A gradient boosting machine.

Jerome H. Friedman

- 01 Oct 2001

- Annals of Statistics

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.

26.4K

Journal Article•10.1198/TECH.2003.S770

The Elements of Statistical Learning

Eric R. Ziegel

- 01 Aug 2003

- Technometrics

TL;DR: Chapter 11 includes more case studies in other areas, ranging from manufacturing to marketing research, and a detailed comparison with other diagnostic tools, such as logistic regression and tree-based methods.

15.5K

Book

Modern Information Retrieval

Ricardo Baeza-Yates, +1 more

- 15 May 1999

TL;DR: This book is a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective, providing an up-to-date, student-oriented treatment of the subject.

11.6K

Proceedings Article•10.1145/775047.775067

Optimizing search engines using clickthrough data

Thorsten Joachims

- 23 Jul 2002

TL;DR: The goal of this paper is to develop a method that utilizes clickthrough data for training, namely the query-log of the search engine in connection with the log of links the users clicked on in the presented ranking.

4.9K

Related Papers (5)

Learning to rank using gradient descent

Domain Adaptation with Structural Correspondence Learning

Greedy function approximation: A gradient boosting machine.

AdaRank: a boosting algorithm for information retrieval

Learning to rank: from pairwise approach to listwise approach