COMET: Commonsense Transformers for Automatic Knowledge Graph Construction

Open Access•Posted Content

- 12 Jun 2019

576

TL;DR: The authors propose COMmonsEnse Transformers (COMET), which learn to generate rich and diverse commonsense descriptions in natural language, and report promising results when implicit knowledge from deep pre-trained language models is transferred to generate explicit knowledge in commonsense knowledge graphs.

Citations

Journal Article•10.18653/v1/2020.emnlp-demos.6

Transformers: State-of-the-Art Natural Language Processing

Thomas Wolf, +21 more

- 01 Oct 2020

TL;DR: Transformers is an open-source library of carefully engineered state-of-the-art Transformer architectures under a unified API, together with a curated collection of pretrained models made by and available for the community.

1.9K

Journal Article•10.1609/AAAI.V34I05.6239

PIQA: Reasoning about Physical Commonsense in Natural Language

Yonatan Bisk, +4 more

- 03 Apr 2020

TL;DR: This work introduces the task of physical commonsense reasoning and a corresponding benchmark dataset, Physical Interaction: Question Answering (PIQA), and provides an analysis of the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.

1.2K

Journal Article•10.1609/AAAI.V34I03.5681

K-BERT: Enabling Language Representation with Knowledge Graph

Weijie Liu, +6 more

- 03 Apr 2020

TL;DR: This work proposes K-BERT, a knowledge-enabled language representation model in which triples from knowledge graphs (KGs) are injected into sentences as domain knowledge; the model significantly outperforms BERT and shows promising results on twelve NLP tasks.

937

Posted Content

Calibrate Before Use: Improving Few-Shot Performance of Language Models

Tony Z. Zhao, +4 more

- 19 Feb 2021

- arXiv: Computation and Language

TL;DR: This work first estimates the model's bias towards each answer by asking for its prediction when given the training prompt and a content-free test input such as "N/A", and then fits calibration parameters that cause the prediction for this input to be uniform across answers.

930

Proceedings Article•10.18653/V1/2021.NAACL-MAIN.45

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

Michihiro Yasunaga, +4 more

- 01 Jun 2021

TL;DR: This work proposes a new model, QA-GNN, which addresses the problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) through two key innovations: relevance scoring and joint reasoning.

602

References

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

99K

Proceedings Article

Attention Is All You Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposes the Transformer, a simple network architecture based solely on attention mechanisms that dispenses with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.

94.2K

Proceedings Article•10.3115/V1/D14-1162

GloVe: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global log-bilinear regression model is introduced that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.

41.6K

Proceedings Article•10.18653/V1/N19-1423

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

24.6K

Journal Article•10.2307/4615733

A Simple Sequentially Rejective Multiple Test Procedure

Sture Holm

- 01 Jan 1979

- Scandinavian Journal of Statistics

TL;DR: This paper presents a simple and widely accepted multiple test procedure of the sequentially rejective type, i.e. hypotheses are rejected one at a time until no further rejections can be made.

23.4K
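
The sequentially rejective procedure summarized in the last entry (Holm, 1979) can be illustrated with a minimal sketch. The function name `holm_reject`, the default `alpha`, and the sample p-values are illustrative assumptions, not taken from any of the papers listed here. The sketch sorts the p-values in ascending order, compares the k-th smallest against `alpha / (m - k + 1)`, and stops at the first hypothesis that cannot be rejected.

```python
def holm_reject(p_values, alpha=0.05):
    """Return a list of booleans: True where the hypothesis is rejected.

    Sketch of Holm's step-down procedure (hypothetical helper name).
    """
    m = len(p_values)
    # Indices of the hypotheses, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # rank is 0-based, so the threshold alpha / (m - rank)
        # equals alpha / (m - k + 1) for the k-th smallest p-value.
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            # The first failure ends the procedure; all remaining
            # hypotheses (with larger p-values) are retained.
            break
    return reject


if __name__ == "__main__":
    # Illustrative p-values only.
    print(holm_reject([0.01, 0.04, 0.03, 0.005]))
    # → [True, False, False, True]
```

In the example, the two smallest p-values (0.005 and 0.01) fall below their thresholds (0.0125 and about 0.0167), while 0.03 exceeds 0.025, so the procedure stops there and retains the remaining hypotheses.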