Know What You Don't Know: Unanswerable Questions for SQuAD

doi:10.18653/V1/P18-2124

Open AccessProceedings Article10.18653/V1/P18-2124

Know What You Don't Know: Unanswerable Questions for SQuAD

Pranav Rajpurkar, +2 more

- 11 Jun 2018

- Vol. 2, pp 784-789

2.3K

TL;DR: SQuADRUn as discussed by the authors is a new dataset that combines the existing Stanford Question Answering Dataset with over 50,000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.25073/2588-1086/vnucsce.340

VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension

Kiet Van Nguyen, +5 more

- 22 Mar 2022

- VNU Journal of Science: Computer Science...

TL;DR: The UIT-ViQuAD 2.0 dataset motivates researchers to further explore the Vietnamese machine reading comprehension task and related tasks such as question answering, question generation, and natural language inference.

...read moreread less

18

•Journal Article•10.1109/access.2022.3190408

On the Effectiveness of Pre-Trained Language Models for Legal Natural Language Processing: An Empirical Study

Dezhao Song, +3 more

- 01 Jan 2022

- IEEE Access

TL;DR: The first comprehensive empirical evaluation of pre-trained language models for legal natural language processing (NLP) in order to examine their effectiveness in this domain suggests that both general-domain and domain-specific PLM-based methods generally achieve better results than simpler methods on most tasks.

...read moreread less

18

•Proceedings Article•10.18653/V1/2020.FINDINGS-EMNLP.115

Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach

Maosen Zhang, +3 more

- 01 Nov 2020

TL;DR: This work proposes TSMC, an efficient method to generate high likelihood sentences with respect to a pre-trained language model while satisfying the constraints, which is highly flexible, requires no task-specific train- ing, and leverages efficient constraint satisfaction solving techniques.

...read moreread less

18

•Posted Content

Why Do Masked Neural Language Models Still Need Common Sense Knowledge

Sunjae Kwon, +3 more

- 08 Nov 2019

- arXiv: Computation and Language

TL;DR: A test that measures what types of common sense knowledge do pretrained MNLMs understand is proposed and it is experimentally demonstrated that existing MNLM-based models can be elevated by combining knowledge from an external common sense repository.

...read moreread less

18

•Posted Content

BERT Based Multilingual Machine Comprehension in English and Hindi.

Somil Gupta, +1 more

- 02 Jun 2020

- arXiv: Computation and Language

TL;DR: Experiments show that m-BERT, with fine-tuning, improves performance on all evaluation settings across both the datasets used by the prior model, therefore establishing m-BerT based MMC as the new state-of-the-art for English and Hindi.

...read moreread less

18

...

Expand

References

•Proceedings Article•10.18653/V1/N18-1202

Deep contextualized word representations

Matthew E. Peters, +6 more

- 15 Feb 2018

TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).

...read moreread less

11.7K

•Proceedings Article•10.18653/V1/D16-1264

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016

TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

...read moreread less

6.3K

•Posted Content

SQuAD: 100,000+ Questions for Machine Comprehension of Text

Pranav Rajpurkar, +3 more

- 16 Jun 2016

- arXiv: Computation and Language

TL;DR: The Stanford Question Answering Dataset (SQuAD) as mentioned in this paper is a reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

...read moreread less

5.8K

•Proceedings Article•10.18653/V1/D15-1075

A large annotated corpus for learning natural language inference

Samuel R. Bowman, +3 more

- 21 Aug 2015

TL;DR: The Stanford Natural Language Inference (SNLI) corpus as discussed by the authors is a large-scale collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning.

...read moreread less

5.2K

•Proceedings Article

Teaching machines to read and comprehend

Karl Moritz Hermann, +6 more

- 07 Dec 2015

TL;DR: A new methodology is defined that resolves this bottleneck and provides large scale supervised reading comprehension data that allows a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure to be developed.

...read moreread less

3.4K