Journal Article10.48550/arXiv.2305.12900
Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph
Jennifer D'Souza,Sören Auer +1 more
2
TL;DR: The authors adopted prompt-based training of transformer language models for new text genres in low-resource settings, and found that the models outperformed on a new domain of data, and achieved performance boosts of up to 40% in a relaxed evaluation setting, and testing the models on a starkly different domain even with a clever training objective in a low resource setting.
read more
Abstract: There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings. The prompt-based training approach has been found to be effective in generalizing pre-trained or fine-tuned models for transfer to resource-scarce settings. This work, for the first time, reports results on adopting prompt-based training of transformers for \textit{scholarly knowledge graph object prediction}. The work is unique in the following two main aspects. 1) It deviates from the other works proposing entity and relation extraction pipelines for predicting objects of a scholarly knowledge graph. 2) While other works have tested the method on text genera relatively close to the general knowledge domain, we test the method for a significantly different domain, i.e. scholarly knowledge, in turn testing the linguistic, probabilistic, and factual generalizability of these large-scale transformer models. We find that (i) per expectations, transformer models when tested out-of-the-box underperform on a new domain of data, (ii) prompt-based training of the models achieve performance boosts of up to 40\% in a relaxed evaluation setting, and (iii) testing the models on a starkly different domain even with a clever training objective in a low resource setting makes evident the domain knowledge capture gap offering an empirically-verified incentive for investing more attention and resources to the scholarly domain in the context of transformer models.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Peer Review
Large language models shape and are shaped by society: A survey of arXiv publication patterns
S. Balachandar,Kenny Peng,Nikhil Garg,Emma Pierson +3 more
- 20 Jul 2023
TL;DR: In this paper , the authors' research topics correlate with their backgrounds, the factors distinguishing highly cited LLM papers, and the patterns of international collaboration in large language model (LLM) research.
Fine-tuning and Prompt Engineering with Cognitive Knowledge Graphs for Scholarly Knowledge Organization
Gollam Rabby,Sören Auer,Jennifer D'Souza,Allard Oelen +3 more
TL;DR: This research leverages large language models and cognitive knowledge graphs to organize scholarly knowledge, utilizing fine-tuning and prompt engineering to enhance accuracy in article categorization and predicate recommendation, facilitating domain-independent knowledge exchange and dissemination.
References
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
•Posted Content
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu,Myle Ott,Naman Goyal,Jingfei Du,Mandar Joshi,Danqi Chen,Omer Levy,Michael Lewis,Luke Zettlemoyer,Veselin Stoyanov +9 more
TL;DR: It is found that BERT was significantly undertrained, and can match or exceed the performance of every model published after it, and the best model achieves state-of-the-art results on GLUE, RACE and SQuAD.
•Posted Content
SQuAD: 100,000+ Questions for Machine Comprehension of Text
TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.
5.8K
Proceedings Article
Chain of Thought Prompting Elicits Reasoning in Large Language Models
Jason Loh Seong Wei,Xuezhi Wang,D. Schuurmans,Maarten Bosma,Ed H. Chi,Fei Xia,Quoc Le,Denny Zhou +7 more
- 28 Jan 2022
TL;DR: Experiments on three large language models show that chain-of-thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks.