HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification

Open Access•Posted Content

- 05 Nov 2020

95

TL;DR: The performance of an existing state-of-the-art semantic-matching model degrades significantly on this dataset as the number of reasoning hops increases, demonstrating the necessity of many-hop reasoning to achieve strong results.

Citations

Journal Article•10.48550/arxiv.2401.15391

MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries

Yixuan Tang, +1 more

- 27 Jan 2024

- arXiv.org

TL;DR: This paper introduces MultiHop-RAG, a novel dataset consisting of a knowledge base, a large collection of multi-hop queries, their ground-truth answers, and the associated supporting evidence. The authors hope MultiHop-RAG will be a valuable resource for the community in developing effective RAG systems, thereby facilitating greater adoption of LLMs in practice.

29

Proceedings Article•10.18653/V1/2021.NAACL-MAIN.121

How Robust are Fact Checking Systems on Colloquial Claims

Byeongchang Kim, +3 more

- 01 Jun 2021

TL;DR: Existing fact checking systems that perform well on claims in formal style degrade significantly on colloquial claims with the same semantics; document retrieval is the weakest spot in the system, vulnerable even to filler words such as “yeah” and “you know”.

27

Journal Article•10.1162/tacl_a_00486

Fact Checking with Insufficient Evidence

P H Atanasova, +3 more

- 05 Apr 2022

- Transactions of the Association for Comp...

TL;DR: This work is the first to study what information fact checking (FC) models consider sufficient, introducing a novel task and advancing it with three main contributions. It finds that models are least successful in detecting missing evidence when adverbial modifiers are omitted.

21

Journal Article•10.3390/systems11090458

Sustainable Development of Information Dissemination: A Review of Current Fake News Detection Research and Practice

Lu Yuan, +4 more

- 04 Sep 2023

- Systems

TL;DR: The survey covers fake news datasets, research methods for fake news detection, general technical models, and multimodal technical methods. It also proposes an explainable human-machine-theory triangle communication system aimed at establishing a people-centered, sustainable human–machine interaction information dissemination system.

21

Proceedings Article•10.48550/arXiv.2210.09306

Mitigating Covertly Unsafe Text within Natural Language Systems

Alex Mei, +9 more

- 17 Oct 2022

TL;DR: This work distinguishes types of text that can lead to physical harm and establishes one particularly underexplored category: covertly unsafe text. It further breaks this category down with respect to the system’s information and discusses solutions to mitigate the generation of text in each of these subcategories.

11

References

Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: BERT is a new language representation model designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. The pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

81.7K

Proceedings Article•10.18653/V1/N19-1423

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. The pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

24.6K

Proceedings Article•10.18653/V1/N18-1101

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference

Adina Williams, +2 more

- 01 Jun 2018

TL;DR: This paper introduces the Multi-Genre Natural Language Inference corpus, a dataset designed for use in the development and evaluation of machine learning models for sentence understanding, and shows that it represents a substantially more difficult task than the Stanford NLI corpus.

5.4K

Proceedings Article•10.18653/V1/D15-1075

A large annotated corpus for learning natural language inference

Samuel R. Bowman, +3 more

- 21 Aug 2015

TL;DR: The Stanford Natural Language Inference (SNLI) corpus is a large-scale collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning.

5.2K

Posted Content

Reading Wikipedia to Answer Open-Domain Questions

Danqi Chen, +3 more

- 31 Mar 2017

- arXiv: Computation and Language

TL;DR: The proposed approach combines a search component based on bigram hashing and TF-IDF matching with a multi-layer recurrent neural network model trained to detect answer spans in Wikipedia paragraphs.

1.4K