HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification

Open Access•Posted Content

- 05 Nov 2020

95

TL;DR: It is shown that the performance of an existing state-of-the-art semantic-matching model degrades significantly on this dataset as the number of reasoning hops increases, hence demonstrating the necessity of many-hop reasoning to achieve strong results.

Citations

•Posted Content

The Web as a Knowledge-base for Answering Complex Questions

Alon Talmor, +1 more

- 18 Mar 2018

- arXiv: Computation and Language

TL;DR: This paper proposes decomposing complex questions into a sequence of simple questions and computing the final answer from the sequence of answers. It empirically demonstrates that question decomposition improves performance from 20.8 precision@1 to 27.5 precision@1 on this new dataset.

384

•Proceedings Article•10.18653/V1/2021.NAACL-MAIN.52

Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence.

Tal Schuster, +2 more

- 01 Jun 2021

TL;DR: VitaminC is presented, a benchmark infused with challenging cases that require fact verification models to discern and adjust to slight factual changes, and it is shown that training using this design increases robustness, improving accuracy by 10% on adversarial fact verification and 6% on adversarial natural language inference (NLI).

237

•Journal Article•10.48550/arxiv.2310.07521

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Cunxiang Wang, +15 more

- 11 Oct 2023

- arXiv.org

TL;DR: This survey offers a structured guide for researchers aiming to fortify the factual reliability of LLMs, focusing on two primary LLM configurations: standalone LLMs and retrieval-augmented LLMs that utilize external data.

120

•Journal Article•10.1016/j.ipm.2022.103219

The State of Human-centered NLP Technology for Fact-checking

Anubrata Das, +3 more

- 08 Jan 2023

- Information Processing and Management

TL;DR: This paper reviewed the capabilities and limitations of the current NLP technologies for fact-checking and further charted the design space for how these technologies can be harnessed and refined in order to better meet the needs of human fact-checkers.

55

•Journal Article•10.48550/arXiv.2305.11859

Complex Claim Verification with Evidence Retrieved in the Wild

Jifan Chen, +3 more

- 19 May 2023

- arXiv.org

TL;DR: In this paper, the authors present the first fully automated pipeline to check real-world claims by retrieving raw evidence from the web. They restrict the retriever to documents available before the claim was made, modeling the realistic scenario in which an emerging claim needs to be checked.

34

References

•Proceedings Article•10.18653/V1/2020.ACL-MAIN.774

Uncertain Natural Language Inference

Tongfei Chen, +4 more

- 01 Jul 2020

TL;DR: The feasibility of collecting annotations for UNLI is demonstrated by relabeling a portion of the SNLI dataset under a probabilistic scale, where items even with the same categorical label differ in how likely people judge them to be true given a premise.

79

•Proceedings Article•10.3115/1631862.1631868

Local Textual Inference: Can it be Defined or Circumscribed?

Annie Zaenen, +2 more

- 30 Jun 2005

TL;DR: It is argued that local textual inferences come in three well-defined varieties (entailments, conventional implicatures/presuppositions, and conversational implicatures) and one less clearly defined one, generally available world knowledge.

72

•Proceedings Article•10.18653/V1/2020.ACL-MAIN.761

DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking

Christopher Hidey, +6 more

- 01 Jul 2020

TL;DR: This work shows that current systems for FEVER are vulnerable to three categories of realistic challenges for fact-checking – multiple propositions, temporal reasoning, and ambiguity and lexical variation – and introduces a resource with these types of claims, and presents a system designed to be resilient to these “attacks”.

59

•Proceedings Article•10.18653/V1/W18-1703

Multi-hop inference for sentence-level textgraphs: How challenging is meaningfully combining information for science question answering?

Peter Jansen

- 29 May 2018

TL;DR: The authors empirically characterize the difficulty of building or traversing a graph of sentences connected by lexical overlap, by evaluating chance sentence aggregation quality through 9,784 manually-annotated judgements across knowledge graphs built from three free-text corpora (including study guides and Simple Wikipedia).

30

•Proceedings Article

Understanding the Impact of Text Highlighting in Crowdsourcing Tasks

Jorge Ramírez, +3 more

- 28 Oct 2019

TL;DR: In this article, the authors investigate if and under what conditions highlighting selected parts of the text can (or cannot) improve classification cost and/or accuracy, and in general how it affects the process and outcome of the human intelligence tasks.

12