Know What You Don't Know: Unanswerable Questions for SQuAD

doi:10.18653/V1/P18-2124

Open AccessProceedings Article10.18653/V1/P18-2124

Know What You Don't Know: Unanswerable Questions for SQuAD

Pranav Rajpurkar, +2 more

- 11 Jun 2018

- Vol. 2, pp 784-789

2.3K

TL;DR: SQuADRUn as discussed by the authors is a new dataset that combines the existing Stanford Question Answering Dataset with over 50,000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Proceedings Article•10.1109/IALP51396.2020.9310487

HisBERT for Conversational Reading Comprehension

Chuang Liu, +4 more

- 04 Dec 2020

TL;DR: In this paper, the authors propose to integrate conversation history into the architecture of BERT for conversational reading comprehension (CRC) by using adversarial training to disturb the word embedding layer, which improves the robustness of the proposed model.

...read moreread less

7

•Journal Article•10.3390/app13042577

Evaluating Deep Learning Techniques for Natural Language Inference

Petros Eleftheriadis, +2 more

- 16 Feb 2023

- Applied Sciences

TL;DR: In this paper , the authors compare five deep learning models, such as BERT, RoBERTa, and ALBERT, on eight widely used NLI datasets, including the BreakingNLI dataset.

...read moreread less

7

Journal Article•10.18653/v1/2022.emnlp-main.465

IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension

Rifki Afina Putri, +1 more

- 01 Jan 2022

TL;DR: The lack of unanswerable questions in Indonesian Machine Reading Comprehension (MRC) datasets leads to poor model performance. IDK-MRC is a new dataset that includes a large number of unanswerable questions, improving the performance of Indonesian MRC models.

...read moreread less

7

•Posted Content

Selective Question Answering under Domain Shift

Amita Kamath, +2 more

- 16 Jun 2020

- arXiv: Computation and Language

TL;DR: In this paper, a QA model is tested on a mixture of in-domain and out-of-domain data, and must answer (i.e., not abstain on) as many questions as possible while maintaining high accuracy.

...read moreread less

7

•Posted Content

Open-Domain Question-Answering for COVID-19 and Other Emergent Domains

Sharon Levy, +3 more

- 13 Oct 2021

- arXiv: Computation and Language

TL;DR: In this article, an open-domain question-answering system for the emergent domain of COVID-19 has been proposed to find answers to free-text questions from a large set of documents.

...read moreread less

7

...

Expand

References

•Proceedings Article•10.18653/V1/N18-1202

Deep contextualized word representations

Matthew E. Peters, +6 more

- 15 Feb 2018

TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).

...read moreread less

11.7K

•Proceedings Article•10.18653/V1/D16-1264

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016

TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

...read moreread less

6.3K

•Posted Content

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016

- arXiv: Computation and Language

TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

...read moreread less

5.8K

•Proceedings Article•10.18653/V1/D15-1075

A large annotated corpus for learning natural language inference

Samuel R. Bowman, +3 more

- 21 Aug 2015

TL;DR: The Stanford Natural Language Inference (SNLI) corpus as discussed by the authors is a large-scale collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning.

...read moreread less

5.2K

•Proceedings Article

Teaching machines to read and comprehend

Karl Moritz Hermann, +6 more

- 07 Dec 2015

TL;DR: A new methodology is defined that resolves this bottleneck and provides large scale supervised reading comprehension data that allows a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure to be developed.

...read moreread less

3.4K