Active Ensemble Learning for Knowledge Graph Error Detection
Junnan Dong, Qinggang Zhang, Xiao Huang, Qiaoyu Tan, Daochen Zha, Zhao Zihao
- 27 Feb 2023
TL;DR: This article proposes an ensemble learning framework for KG error detection that adaptively updates the ensemble learning policy in each iteration based on active queries, i.e., the answers from experts.
Abstract: Knowledge graphs (KGs) can effectively integrate a large number of real-world assertions and improve the performance of various applications, such as recommendation and search. KG error detection has been intensively studied because real-world KGs inevitably contain erroneous triples. While existing studies focus on developing novel algorithms dedicated to one or a few data characteristics, we explore advancing KG error detection by assembling a set of state-of-the-art (SOTA) KG error detectors. However, developing a practical ensemble learning framework for KG error detection is nontrivial. Existing ensemble learning models rely heavily on labels, while labeled errors in KGs are expensive to acquire. Also, KG error detection itself is challenging, since triples contain rich semantic information and may be false for various reasons. To this end, we propose to leverage active learning to minimize human effort. Our proposed framework, KAEL, can effectively assemble a set of off-the-shelf error detection algorithms by actively using a limited number of manual annotations. It adaptively updates the ensemble learning policy in each iteration based on active queries, i.e., the answers from experts. After the entire annotation budget is used, KAEL uses the trained policy to identify the remaining suspicious triples. Experiments on real-world KGs demonstrate that applying KAEL to assemble SOTA error detectors achieves significant improvement. KAEL also significantly outperforms SOTA ensemble learning baselines.
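The query-then-update loop described in the abstract can be illustrated with a minimal sketch. The abstract does not specify KAEL's actual policy, so the multiplicative-weights update below is a stand-in assumption, not the authors' method; the names `active_ensemble`, `scores`, `oracle`, `budget`, and `eta`, as well as the toy triples and detector scores, are hypothetical.

```python
import math
from typing import Callable, Dict, List, Tuple

Triple = Tuple[str, str, str]


def active_ensemble(
    scores: Dict[str, Dict[Triple, float]],  # detector -> triple -> suspicion in [0, 1]
    oracle: Callable[[Triple], bool],        # expert answer: True if the triple is an error
    budget: int,                             # number of expert queries allowed
    eta: float = 1.0,                        # learning rate (assumed hyperparameter)
) -> Tuple[Dict[str, float], List[Triple], Dict[Triple, bool]]:
    detectors = list(scores)
    triples = list(next(iter(scores.values())))
    # Start with a uniform policy over the off-the-shelf detectors.
    weights = {d: 1.0 / len(detectors) for d in detectors}
    labels: Dict[Triple, bool] = {}

    def combined(t: Triple) -> float:
        # Ensemble suspicion: weighted sum of the detectors' scores.
        return sum(weights[d] * scores[d][t] for d in detectors)

    for _ in range(min(budget, len(triples))):
        unlabeled = [t for t in triples if t not in labels]
        query = max(unlabeled, key=combined)  # most suspicious unlabeled triple
        is_error = oracle(query)              # active query to the expert
        labels[query] = is_error
        target = 1.0 if is_error else 0.0
        # Down-weight each detector in proportion to its error on this answer.
        for d in detectors:
            weights[d] *= math.exp(-eta * abs(scores[d][query] - target))
        total = sum(weights.values())
        for d in detectors:
            weights[d] /= total

    # After the budget is spent, rank the remaining triples with the learned weights.
    remaining = sorted((t for t in triples if t not in labels), key=combined, reverse=True)
    return weights, remaining, labels


# Toy data (hypothetical): True marks an erroneous triple.
truth = {
    ("Paris", "capital_of", "France"): False,
    ("Berlin", "capital_of", "Spain"): True,
    ("Rome", "capital_of", "Italy"): False,
    ("Madrid", "capital_of", "Italy"): True,
    ("Lisbon", "capital_of", "Peru"): True,
}
toy_scores = {
    "good": {t: (0.9 if err else 0.1) for t, err in truth.items()},  # mostly right
    "bad": {t: (0.2 if err else 0.8) for t, err in truth.items()},   # mostly wrong
}
weights, remaining, labels = active_ensemble(toy_scores, lambda t: truth[t], budget=2)
```

In this toy run the expert's answers shift weight toward the detector that agrees with them, so the unlabeled erroneous triple is ranked first among the remaining ones. A real system would replace `oracle` with human annotation and `toy_scores` with the outputs of actual KG error detectors.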
Citations
Data-centric Artificial Intelligence: A Survey
TL;DR: This paper provides a comprehensive survey of data-centric AI, giving a global view of a spectrum of tasks across various stages of the data lifecycle and equipping readers with the techniques and further research ideas to systematically engineer data for building AI systems.
Data-centric AI: Perspectives and Challenges
TL;DR: Data-centric AI (DCAI) advocates a fundamental shift from model advancements to ensuring data quality and reliability; the authors draw a big picture that brings together three general missions: training data development, inference data development, and data maintenance.
Bring Your Own View: Graph Neural Networks for Link Prediction with Personalized Subgraph Selection
Qiaoyu Tan, Xin Zhang, Ninghao Liu, Daochen Zha, Li Lei, Rui Chen, Soo-Hyun Choi, Xia Hu
- 23 Dec 2022
TL;DR: This article proposes a Personalized Subgraph Selector (PS2) that automatically, personally, and inductively identifies optimal subgraphs for different edges when performing GNNLP.
KnowGPT: Black-Box Knowledge Injection for Large Language Models
Qinggang Zhang, Junnan Dong, Hao Chen, Xiao Huang, Daochen Zha, Zailiang Yu
TL;DR: This work introduces KnowGPT, a black-box knowledge injection framework for LLMs in question answering that leverages deep reinforcement learning to extract relevant knowledge from Knowledge Graphs and uses Multi-Armed Bandit to construct the most suitable prompt for each question.