Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph

doi:10.48550/arXiv.2305.12900

Journal Article10.48550/arXiv.2305.12900

Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph

Jennifer D'Souza, +1 more

- 22 May 2023

- arXiv.org

- Vol. abs/2305.12900

2

TL;DR: The authors adopted prompt-based training of transformer language models for new text genres in low-resource settings, and found that the models outperformed on a new domain of data, and achieved performance boosts of up to 40% in a relaxed evaluation setting, and testing the models on a starkly different domain even with a clever training objective in a low resource setting.

Abstract: There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings. The prompt-based training approach has been found to be effective in generalizing pre-trained or fine-tuned models for transfer to resource-scarce settings. This work, for the first time, reports results on adopting prompt-based training of transformers for \textit{scholarly knowledge graph object prediction}. The work is unique in the following two main aspects. 1) It deviates from the other works proposing entity and relation extraction pipelines for predicting objects of a scholarly knowledge graph. 2) While other works have tested the method on text genera relatively close to the general knowledge domain, we test the method for a significantly different domain, i.e. scholarly knowledge, in turn testing the linguistic, probabilistic, and factual generalizability of these large-scale transformer models. We find that (i) per expectations, transformer models when tested out-of-the-box underperform on a new domain of data, (ii) prompt-based training of the models achieve performance boosts of up to 40\% in a relaxed evaluation setting, and (iii) testing the models on a starkly different domain even with a clever training objective in a low resource setting makes evident the domain knowledge capture gap offering an empirically-verified incentive for investing more attention and resources to the scholarly domain in the context of transformer models.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Peer Review

Large language models shape and are shaped by society: A survey of arXiv publication patterns

S. Balachandar, +3 more

- 20 Jul 2023

TL;DR: In this paper , the authors' research topics correlate with their backgrounds, the factors distinguishing highly cited LLM papers, and the patterns of international collaboration in large language model (LLM) research.

...read moreread less

Journal Article•10.48550/arxiv.2409.06433

Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for Scholarly Knowledge Organization

Gollam Rabby, +3 more

- 10 Sep 2024

- arXiv.org

TL;DR: This research leverages large language models and cognitive knowledge graphs to organize scholarly knowledge, utilizing fine-tuning and prompt engineering to enhance accuracy in article categorization and predicate recommendation, facilitating domain-independent knowledge exchange and dissemination.

...read moreread less

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

•Posted Content

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, +9 more

- 26 Jul 2019

- arXiv: Computation and Language

TL;DR: It is found that BERT was significantly undertrained, and can match or exceed the performance of every model published after it, and the best model achieves state-of-the-art results on GLUE, RACE and SQuAD.

...read moreread less

26.2K

•Posted Content

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016

- arXiv: Computation and Language

TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

...read moreread less

5.8K

Proceedings Article

Chain of Thought Prompting Elicits Reasoning in Large Language Models

Jason Loh Seong Wei, +7 more

- 28 Jan 2022

TL;DR: Experiments on three large language models show that chain-of-thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks.

...read moreread less

4.8K

...

Expand