Web query classification

Topic Tools

Papers published on a yearly basis

1 / 2

Papers

Journal Article•10.4018/JDWM.2007070101•

Multi-label classification: An overview

[...]

Grigorios Tsoumakas¹, Ioannis Katakis¹•Institutions (1)

Aristotle University of Thessaloniki¹

01 Jul 2007-International Journal of Data Warehousing and Mining

TL;DR: The task of multi-label classification is introduced, the sparse related literature is organizes into a structured presentation and comparative experimental results of certain multilabel classification methods are performed.

...read moreread less

Abstract: Nowadays, multi-label classification methods are increasingly required by modern applications, such as protein function classification, music categorization and semantic scene classification. This paper introduces the task of multi-label classification, organizes the sparse related literature into a structured presentation and performs comparative experimental results of certain multi-label classification methods. It also contributes the definition of concepts for the quantification of the multi-label nature of a data set.

...read moreread less

3,062 citations

Proceedings Article•10.1145/511446.511513•

Topic-sensitive PageRank

[...]

Taher H. Haveliwala¹•Institutions (1)

Stanford University¹

7 May 2002

TL;DR: A set of PageRank vectors are proposed, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic, and are shown to generate more accurate rankings than with a single, generic PageRank vector.

...read moreread less

Abstract: In the original PageRank algorithm for improving the ranking of search-query results, a single PageRank vector is computed, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. By using these (precomputed) biased PageRank vectors to generate query-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared.

...read moreread less

2,074 citations

Journal Article•10.1007/S10817-007-9078-X•

Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family

[...]

Diego Calvanese¹, Giuseppe De Giacomo², Domenico Lembo², Maurizio Lenzerini², Riccardo Rosati² - Show less +1 more•Institutions (2)

Free University of Bozen-Bolzano¹, Sapienza University of Rome²

01 Oct 2007-Journal of Automated Reasoning

TL;DR: It is shown that, for the DLs of the DL-Lite family, the usual DL reasoning tasks are polynomial in the size of the TBox, and query answering is LogSpace in thesize of the ABox, which is the first result ofPolynomial-time data complexity for query answering over DL knowledge bases.

...read moreread less

Abstract: We propose a new family of description logics (DLs), called DL-Lite, specifically tailored to capture basic ontology languages, while keeping low complexity of reasoning. Reasoning here means not only computing subsumption between concepts and checking satisfiability of the whole knowledge base, but also answering complex queries (in particular, unions of conjunctive queries) over the instance level (ABox) of the DL knowledge base. We show that, for the DLs of the DL-Lite family, the usual DL reasoning tasks are polynomial in the size of the TBox, and query answering is LogSpace in the size of the ABox (i.e., in data complexity). To the best of our knowledge, this is the first result of polynomial-time data complexity for query answering over DL knowledge bases. Notably our logics allow for a separation between TBox and ABox reasoning during query evaluation: the part of the process requiring TBox reasoning is independent of the ABox, and the part of the process requiring access to the ABox can be carried out by an SQL engine, thus taking advantage of the query optimization strategies provided by current database management systems. Since even slight extensions to the logics of the DL-Lite family make query answering at least NLogSpace in data complexity, thus ruling out the possibility of using on-the-shelf relational technology for query processing, we can conclude that the logics of the DL-Lite family are the maximal DLs supporting efficient query answering over large amounts of instances.

...read moreread less

1,691 citations

Proceedings Article•10.1145/500141.500159•

Support vector machine active learning for image retrieval

[...]

Simon Tong¹, Edward Y. Chang²•Institutions (2)

Stanford University¹, University of California, Santa Barbara²

1 Oct 2001

TL;DR: This work proposes the use of a support vector machine active learning algorithm for conducting effective relevance feedback for image retrieval and achieves significantly higher search accuracy than traditional query refinement schemes after just three to four rounds of relevance feedback.

...read moreread less

Abstract: Relevance feedback is often a critical component when designing image databases. With these databases it is difficult to specify queries directly and explicitly. Relevance feedback interactively determinines a user's desired output or query concept by asking the user whether certain proposed images are relevant or not. For a relevance feedback algorithm to be effective, it must grasp a user's query concept accurately and quickly, while also only asking the user to label a small number of images. We propose the use of a support vector machine active learning algorithm for conducting effective relevance feedback for image retrieval. The algorithm selects the most informative images to query a user and quickly learns a boundary that separates the images that satisfy the user's query concept from the rest of the dataset. Experimental results show that our algorithm achieves significantly higher search accuracy than traditional query refinement schemes after just three to four rounds of relevance feedback.

...read moreread less

1,593 citations

Journal Article•10.1109/TKDE.2003.1208999•

Topic-sensitive PageRank: a context-sensitive ranking algorithm for Web search

[...]

Taher H. Haveliwala¹•Institutions (1)

Stanford University¹

01 Jul 2003-IEEE Transactions on Knowledge and Data Engineering

TL;DR: It is shown that using linear combinations of these (precomputed) biased PageRank vectors to generate context-specific importance scores for pages at query time, can generate more accurate rankings than with a single, generic PageRank vector.

...read moreread less

Abstract: The original PageRank algorithm for improving the ranking of search-query results computes a single vector, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared. By using linear combinations of these (precomputed) biased PageRank vectors to generate context-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. We describe techniques for efficiently implementing a large-scale search system based on the topic-sensitive PageRank scheme.

...read moreread less

1,309 citations

...

Expand

Year	Papers
2025	9
2024	30
2023	53
2022	88
2021	16
2020	15

Topic Tools

Papers published on a yearly basis

Papers

Multi-label classification: An overview

Topic-sensitive PageRank

Tractable Reasoning and Efficient Query Answering in Description Logics: The DL-Lite Family

Support vector machine active learning for image retrieval

Topic-sensitive PageRank: a context-sensitive ranking algorithm for Web search

Related Topics (5)

Performance Metrics