Active Ensemble Learning for Knowledge Graph Error Detection

doi:10.1145/3539597.3570368

Open AccessProceedings Article10.1145/3539597.3570368

Active Ensemble Learning for Knowledge Graph Error Detection

- 27 Feb 2023

23

TL;DR: In this article , an ensemble learning framework for KG error detection is proposed, which adaptively updates the ensemble learning policy in each iteration based on active queries, i.e., the answers from experts.

Abstract: Knowledge graphs (KGs) could effectively integrate a large number of real-world assertions, and improve the performance of various applications, such as recommendation and search. KG error detection has been intensively studied since real-world KGs inevitably contain erroneous triples. While existing studies focus on developing a novel algorithm dedicated to one or a few data characteristics, we explore advancing KG error detection by assembling a set of state-of-the-art (SOTA) KG error detectors. However, it is nontrivial to develop a practical ensemble learning framework for KG error detection. Existing ensemble learning models heavily rely on labels, while it is expensive to acquire labeled errors in KGs. Also, KG error detection itself is challenging since triples contain rich semantic information and might be false because of various reasons. To this end, we propose to leverage active learning to minimize human efforts. Our proposed framework - KAEL, could effectively assemble a set of off-the-shelf error detection algorithms, by actively using a limited number of manual annotations. It adaptively updates the ensemble learning policy in each iteration based on active queries, i.e., the answers from experts. After all annotation budget is used, KAEL utilizes the trained policy to identify remaining suspicious triples. Experiments on real-world KGs demonstrate that we can achieve significant improvement when applying KAEL to assemble SOTA error detectors. KAEL also outperforms SOTA ensemble learning baselines significantly.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2303.10158

Data-centric Artificial Intelligence: A Survey

Daochen Zha, +6 more

- 17 Mar 2023

- arXiv.org

TL;DR: Data-centric AI as mentioned in this paper provides a comprehensive survey that provides a global view of a spectrum of tasks across various stages of the data lifecycle, and equip the readers with the techniques and further research ideas to systematically engineer data for building AI systems.

...read moreread less

110

Book Chapter•10.1137/1.9781611977653.ch106

Data-centric AI: Perspectives and Challenges

01 Jan 2023

54

Journal Article•10.48550/arXiv.2301.04819

Data-centric AI: Perspectives and Challenges

Daochen Zha, +4 more

- 12 Jan 2023

- arXiv.org

TL;DR: Data-centric AI (DCAI) as discussed by the authors advocates a fundamental shift from model advancements to ensuring data quality and reliability, and draws a big picture and brings together three general missions: training data development, inference data development and data maintenance.

...read moreread less

47

•Proceedings Article•10.1145/3539597.3570407

Bring Your Own View: Graph Neural Networks for Link Prediction with Personalized Subgraph Selection

Qiaoyu Tan, +7 more

- 23 Dec 2022

TL;DR: In this article , a Personalized Subgraph Selector (PS2) is proposed to automatically, personally, and inductively identify optimal subgraphs for different edges when performing GNNLP.

...read moreread less

33

Journal Article•10.48550/arxiv.2312.06185

KnowGPT: Black-Box Knowledge Injection for Large Language Models

Qinggang Zhang, +5 more

- 11 Dec 2023

- arXiv.org

TL;DR: This work introduces KnowGPT, a black-box knowledge injection framework for LLMs in question answering that leverages deep reinforcement learning to extract relevant knowledge from Knowledge Graphs and uses Multi-Armed Bandit to construct the most suitable prompt for each question.

...read moreread less

8

...

Expand

References

•Proceedings Article

Translating Embeddings for Modeling Multi-relational Data

Antoine Bordes, +4 more

- 05 Dec 2013

TL;DR: TransE is proposed, a method which models relationships by interpreting them as translations operating on the low-dimensional embeddings of the entities, which proves to be powerful since extensive experiments show that TransE significantly outperforms state-of-the-art methods in link prediction on two knowledge bases.

...read moreread less

7.6K

•Journal Article•10.1093/NAR/GKH061

The Unified Medical Language System (UMLS): integrating biomedical terminology

Olivier Bodenreider

- 01 Jan 2004

- Nucleic Acids Research

TL;DR: The Unified Medical Language System is a repository of biomedical vocabularies developed by the US National Library of Medicine and includes tools for customizing the Metathesaurus (MetamorphoSys), for generating lexical variants of concept names (lvg) and for extracting UMLS concepts from text (MetaMap).

...read moreread less

4.7K

•Proceedings Article

Knowledge graph embedding by translating on hyperplanes

Zhen Wang, +3 more

- 27 Jul 2014

TL;DR: This paper proposes TransH which models a relation as a hyperplane together with a translation operation on it and can well preserve the above mapping properties of relations with almost the same model complexity of TransE.

...read moreread less

4.1K

•Proceedings Article

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

Bishan Yang, +4 more

- 01 May 2015

TL;DR: It is found that embeddings learned from the bilinear objective are particularly good at capturing relational semantics and that the composition of relations is characterized by matrix multiplication.

...read moreread less

3.2K

Journal Article•10.1109/TKDE.2017.2754499

Knowledge Graph Embedding: A Survey of Approaches and Applications

Quan Wang, +3 more

- 01 Dec 2017

- IEEE Transactions on Knowledge and Data ...

TL;DR: This article provides a systematic review of existing techniques of Knowledge graph embedding, including not only the state-of-the-arts but also those with latest trends, based on the type of information used in the embedding task.

...read moreread less

2.8K