Graph-based entity-oriented search

Question

1. What was the common approach for scoring the entity type?

2. What are the future works in this paper?

3. What are the contributions in this paper?

4. What types of knowledge bases are commonly used to augment a corpus?

Accepted Answer

A filtering approach, based on the Kullback-Leibler divergence between the probability distributions of the entity and query types, was used for scoring the entity type.

Accepted Answer

In this section, the authors present several ideas for the future, both at a high-level of abstraction, and at a high-level of detail.. 3. 4 ]. 256 11. 3 future work 11.. More importantly, future work should focus on alternative approaches to reduce the complexity of the representation model, such as the document profiles based on keywords that the authors proposed here.. Apart from reducing the model by pruning redundancies, the authors would, on the other hand, like to extend it with synonyms for verbs, adjectives and adverbs, measuring the impact in effectiveness, and understanding whether the usage of synsets for nouns had been sufficient.

Accepted Answer

With the goal of harnessing all available information to optimize retrieval, the authors explore joint representation models of documents and entities, while taking a step towards the definition of a more general retrieval approach.. Specifically, the authors propose that graphs should be used to incorporate explicit and implicit information derived from the relations between text found in corpora and entities found in knowledge bases.. The authors also take advantage of this framework to elaborate a general model for entity-oriented search, proposing a universal ranking function for the tasks of ad hoc document retrieval ( leveraging entities ), ad hoc entity retrieval, and entity list completion.. The authors introduce the entity weight as the corresponding ranking function, relying on the idea of seed nodes for representing the query, either directly through term nodes, or based on the expansion to adjacent entity nodes.. The authors introduce the random walk score as the corresponding ranking function, relying on the same idea of seed nodes, similar to the entity weight in the graph-of-entity.. Scoring based on this function is highly reliant on the structure of the hypergraph, which the authors call representation-driven retrieval.. The authors also propose TF-bins as a discretization for representing term frequency in the hypergraph-of-entity.. For the random walk score, the authors propose and explore several parameters, including length and repeats, with or without seed node expansion, direction, or weights, and with or without a certain degree of node and/or hyperedge fatigue, a concept that they also propose.. For evaluation, the authors took advantage of TREC 2017 OpenSearch track, which relied on an online evaluation process based on the Living Labs API, and they also participated in TREC 2018 Common Core track, which was based on the newly introduced TREC Washington Post Corpus.

Accepted Answer

Knowledge bases like Wikipedia (semi-structured), DBpedia (structured), or Wikidata (structured) are frequently used to augment a corpus.

Accepted Answer

For assessment, 82 individual relevance judgments were provided, with entities graded from 0 to 4, from least to most useful in the context of the given document.

Accepted Answer

In this work the authors rely on the degree, clustering coefficient, average path length, diameter and density to characterize the hypergraph-of-entity.

Accepted Answer

The process requires issuing SQL queries to fetch the flattened relational data, which is then filtered and selected in order to be converted into semantic triples, that are stored in a quad store.

Accepted Answer

A fair evaluation of a general retrieval model requires a dataset, usually semistructured data, that can be interpreted as, or processed to become, combined data (e.g., entity-annotated text, where entities are linked to a knowledge base).

Accepted Answer

Their ubiquity across the relevant areas of entity-oriented search led us to propose a graph-based representation and retrieval model that combines terms, entities, and their relations.

Accepted Answer

Despite the disregard for efficiency, at this stage, the complexity of the model and its inefficient implementation supported on a graph database were critical challenges in setting up an evaluation workbench with acceptable run times.

Accepted Answer

The hypothesis is that this might improve effectiveness for search queries in the long tail [307], in particular by increasing recall without decreasing precision.

Accepted Answer

The clear advantage of reusing classical information retrieval models for entity-oriented search is the support on well-established approaches that have been researched for text-based applications over the years.

Accepted Answer

In order to improve on the low scalability of the graph-of-entity, the authors then redesigned this model in a way that reduced the number of edges in relation to the number of nodes, by relying on the hypergraph data structure.

Accepted Answer

The authors propose two classes of indicators:ranking indicators Structural features that can be used to rank different graphbased models in regard to their predicted retrieval performance.

Graph-based entity-oriented search

Chat with Paper

AI Agents for this Paper

Most frequently asked questions

1. What was the common approach for scoring the entity type?

2. What are the future works in this paper?

3. What are the contributions in this paper?

4. What types of knowledge bases are commonly used to augment a corpus?

5. How many individual relevance judgments were provided for the ad hoc document retrieval task?

6. What are the parameters used to characterize the hypergraph-of-entity?

7. What is the process of converting the flattened relational data into semantic triples?

8. What is the definition of a fair evaluation of a general retrieval model?

9. What was the main idea behind the graph-based representation and retrieval model?

10. What were the challenges in setting up an evaluation workbench?

11. What is the hypothesis that this might improve the effectiveness of search queries in the long tail?

12. What is the advantage of reusing classical information retrieval models for entity-oriented search?

13. How did the authors reduce the number of edges in relation to the number of nodes?

14. What are the two classes of indicators that can be used to rank different graphbased models?

Figures

Citations

Automation of a network of problems using programming tools

A Review of Graph-Based Models for Entity-Oriented Search

An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis

References

Hypergraph-of-entity: A unified representation model for the retrieval of text and knowledge

Army ANT: A Workbench for Innovation in Entity-Oriented Search

Characterizing the hypergraph-of-entity and the structural impact of its extensions

Related Papers (5)

Entity Came to Rescue - Leveraging Entities to Minimize Risks in Web Search

Representation learning for entity type ranking

Identifying and exploiting target entity type information for ad hoc entity retrieval

Autoregressive Entity Retrieval

Random walk-based entity representation learning and re-ranking for entity search