Dense Passage Retrieval for Open-Domain Question Answering

doi:10.18653/V1/2020.EMNLP-MAIN.550

Open Access•Proceedings Article•10.18653/V1/2020.EMNLP-MAIN.550

Vladimir Karpukhin, +7 more

- 10 Apr 2020

- pp 6769-6781

1.9K

TL;DR: This paper proposes a simple dual-encoder framework that learns dense representations from a small number of questions and passages; the resulting retriever substantially outperforms a strong Lucene-BM25 system.

Citations

•Posted Content

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Tianyu Gao, +2 more

- 18 Apr 2021

- arXiv: Computation and Language

TL;DR: SimCSE is a contrastive learning framework for sentence embeddings that takes an input sentence and predicts itself in a contrastive objective, with only standard dropout used as noise.

1.7K

•Posted Content

On the Opportunities and Risks of Foundation Models.

Rishi Bommasani, +113 more

- 16 Aug 2021

- arXiv: Learning

TL;DR: This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications.

1.3K

•Journal Article•10.1109/TKDE.2021.3090866

Self-supervised Learning: Generative or Contrastive.

Xiao Liu, +6 more

- 15 Jun 2020

- arXiv: Learning

TL;DR: This survey examines new self-supervised learning methods for representation in computer vision, natural language processing, and graph learning, and comprehensively reviews the existing empirical methods, grouping them into three main categories according to their objectives.

1.1K

•Posted Content

Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval

Lee Xiong, +7 more

- 01 Jul 2020

- arXiv: Information Retrieval

TL;DR: This work presents Approximate nearest neighbor Negative Contrastive Estimation (ANCE), a training mechanism that constructs negatives from an Approximate Nearest Neighbor (ANN) index of the corpus; the index is updated in parallel with the learning process to select more realistic negative training instances.

917

•Proceedings Article•10.18653/V1/2020.FINDINGS-EMNLP.171

UNIFIEDQA: Crossing Format Boundaries with a Single QA System

Daniel Khashabi, +6 more

- 02 May 2020

TL;DR: This work uses the latest advances in language modeling to build a single pre-trained QA model, UNIFIEDQA, that performs well across 19 QA datasets spanning 4 diverse formats, and results in a new state of the art on 10 factoid and commonsense question answering datasets.

883

References

•Proceedings Article

Learning Discriminative Projections for Text Similarity Measures

Wen-tau Yih, +3 more

- 23 Jun 2011

TL;DR: This paper proposes a novel discriminative training method that projects raw term vectors into a common, low-dimensional vector space; it not only outperforms existing state-of-the-art approaches but also achieves high accuracy at low dimensions and is thus more efficient.

328

•Proceedings Article

Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring

Samuel Humeau, +3 more

- 30 Apr 2020

TL;DR: This work develops a new transformer architecture, the Poly-encoder, that learns global rather than token-level self-attention features, and shows that the resulting models achieve state-of-the-art results on four tasks.

323

•Proceedings Article•10.18653/V1/N19-4013

End-to-End Open-Domain Question Answering with BERTserini

Wei Yang, +7 more

- 05 Feb 2019

- arXiv: Computation and Language

TL;DR: This paper presents an end-to-end question answering system that integrates BERT with the open-source Anserini information retrieval toolkit, combining best practices from IR with a BERT-based reader to identify answers from a large corpus of Wikipedia articles.

303

•Journal Article•10.1023/A:1010516712215

Information science as "Little Science": The implications of a bibliometric analysis of the Journal of the American Society for Information Science

Wallace Koehler

- 01 Apr 2001

- Scientometrics

TL;DR: Based on analysis of articles published in AD and JASIS from 1950 to 1999, it is found that there has been a slow but perhaps inevitable shift based first on the single nonfunded researcher and author to a much wider research and publishing participation among authors, regions, corporate authors, and countries.

247

•Proceedings Article•10.18653/V1/D19-1599

Multi-passage BERT: A Globally Normalized BERT Model for Open-domain Question Answering

Zhiguo Wang, +4 more

- 22 Aug 2019

TL;DR: The authors propose a multi-passage BERT model that globally normalizes answer scores across all passages of the same question, which enables the QA model to find better answers by utilizing more passages.

243