Retrieval-Augmented Generation-based Relation Extraction

doi:10.48550/arxiv.2404.13397

Journal Article10.48550/arxiv.2404.13397

Retrieval-Augmented Generation-based Relation Extraction

Sefika Efeoglu, +1 more

- 20 Apr 2024

- arXiv.org

- Vol. abs/2404.13397

3

TL;DR: Retrieval-Augmented Generation-based Relation Extraction (RAG4RE) significantly enhances relation extraction performance by leveraging retrieved information and augmented generation techniques, surpassing the performance of traditional RE approaches based solely on LLMs.

Abstract: Information Extraction (IE) is a transformative process that converts unstructured text data into a structured format by employing entity and relation extraction (RE) methodologies. The identification of the relation between a pair of entities plays a crucial role within this framework. Despite the existence of various techniques for relation extraction, their efficacy heavily relies on access to labeled data and substantial computational resources. In addressing these challenges, Large Language Models (LLMs) emerge as promising solutions; however, they might return hallucinating responses due to their own training data. To overcome these limitations, Retrieved-Augmented Generation-based Relation Extraction (RAG4RE) in this work is proposed, offering a pathway to enhance the performance of relation extraction tasks. This work evaluated the effectiveness of our RAG4RE approach utilizing different LLMs. Through the utilization of established benchmarks, such as TACRED, TACREV, Re-TACRED, and SemEval RE datasets, our aim is to comprehensively evaluate the efficacy of our RAG4RE approach. In particularly, we leverage prominent LLMs including Flan T5, Llama2, and Mistral in our investigation. The results of our study demonstrate that our RAG4RE approach surpasses performance of traditional RE approaches based solely on LLMs, particularly evident in the TACRED dataset and its variations. Furthermore, our approach exhibits remarkable performance compared to previous RE methodologies across both TACRED and TACREV datasets, underscoring its efficacy and potential for advancing RE tasks in natural language processing.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Table 5 Comparing RAG4RE to Relation Extraction approaches using Language Models (LLMs)

Table 4 Comparing our best-performing results with State-of-the-Art (SoTA) Systems’ results.

Fig. 2. RAG-based Relation Extraction pipeline.

Fig. 6. The number of False Negatives and Positives in results of experiments conducted on different benchmark datasets.

Fig. 4. A regenerated prompt is illustrated.

Citations

Journal Article•10.48550/arxiv.2312.17617

Large Language Models for Generative Information Extraction: A Survey

Derong Xu, +8 more

- 29 Dec 2023

- arXiv.org

TL;DR: This study surveys the most recent advancements in generative Large Language Models efforts for IE tasks and empirically analyzes the most advanced methods to discover the emerging trend of IE tasks with LLMs.

...read moreread less

64

Journal Article•10.1007/978-3-031-94575-5_6

Kastor: Fine-Tuned Small Language Models for Shape-Based Active Relation Extraction

Célian Ringwald, +4 more

TL;DR: Kastor is a framework that fine-tunes small language models for shape-based active relation extraction, enhancing model generalization and performance by evaluating all possible property combinations and refining noisy knowledge bases through iterative learning.

...read moreread less

Preprint•10.48550/arxiv.2406.14745

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

Şefika Efeoğlu, +1 more

- 20 Jun 2024

TL;DR: Fine-tuned large language models enhance relation extraction performance by addressing domain adaptation challenges and identifying implicit relations in sentences, particularly when integrated into the Retrieval Augmented-based (RAG) framework.

...read moreread less

References

•Proceedings Article•10.18653/V1/D19-1410

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Nils Reimers, +1 more

- 14 Aug 2019

TL;DR: Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity is presented.

...read moreread less

12K

Llama 2: Open Foundation and Fine-Tuned Chat Models

Hugo Touvron, +57 more

- 18 Jul 2023

TL;DR: This article developed and released Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.

...read moreread less

5.7K

•Posted Content

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Patrick S. H. Lewis, +11 more

- 22 May 2020

- arXiv: Computation and Language

TL;DR: A general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation, and finds that RAG models generate more specific, diverse and factual language than a state-of-the-art parametric-only seq2seq baseline.

...read moreread less

3.6K

•Proceedings Article•10.18653/V1/D17-1004

Position-aware Attention and Supervised Data Improve Slot Filling

Yuhao Zhang, +4 more

- 01 Sep 2017

TL;DR: An effective new model is proposed, which combines an LSTM sequence model with a form of entity position-aware attention that is better suited to relation extraction that builds TACRED, a large supervised relation extraction dataset obtained via crowdsourcing and targeted towards TAC KBP relations.

...read moreread less

1.1K

...

Expand

Retrieval-Augmented Generation-based Relation Extraction

Chat with Paper

AI Agents for this Paper

Figures

Citations

Large Language Models for Generative Information Extraction: A Survey

Kastor: Fine-Tuned Small Language Models for Shape-Based Active Relation Extraction

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

References

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Llama 2: Open Foundation and Fine-Tuned Chat Models

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Scaling Instruction-Finetuned Language Models

Position-aware Attention and Supervised Data Improve Slot Filling