Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations

doi:10.18653/V1/2020.COLING-MAIN.291

Open AccessProceedings Article10.18653/V1/2020.COLING-MAIN.291

Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations

Simone Conia, +1 more

- 01 Dec 2020

- pp 3268-3284

15

TL;DR: This paper proposes Conception, a novel technique for building language-independent vector representations of concepts which places multilinguality at its core while retaining explicit relationships between concepts, and results in high-coverage representations that outperform the state of the art in multilingual and cross-lingual Semantic Word Similarity and Word Sense Disambiguation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Do Multi-Sense Embeddings Improve Natural Language Understanding?

Jiwei Li, +1 more

- 02 Jun 2015

- arXiv: Computation and Language

TL;DR: This paper proposed a multi-sense embedding model based on Chinese Restaurant Processes that achieves state-of-the-art performance on matching human word similarity judgments, and proposed a pipelined architecture for incorporating multisense embeddings into language understanding.

...read moreread less

167

•Proceedings Article•10.24963/IJCAI.2021/593

Recent Trends in Word Sense Disambiguation: A Survey

Michele Bevilacqua, +3 more

- 01 Aug 2021

155

•Proceedings Article•10.18653/V1/2020.EMNLP-MAIN.285

With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All-Round Word Sense Disambiguation

Bianca Scarlini, +2 more

- 01 Nov 2020

TL;DR: ARES representations enable a simple 1 Nearest-Neighbour algorithm to outperform state-of-the-art models, not only in the English Word Sense Disambiguation task, but also in the multilingual one, whilst training on sense-annotated data in English only.

...read moreread less

127

•Proceedings Article•10.24963/IJCAI.2021/620

Ten Years of BabelNet: A Survey

Roberto Navigli, +4 more

- 09 Aug 2021

42

•Proceedings Article•10.18653/V1/2021.EACL-MAIN.286

Framing Word Sense Disambiguation as a Multi-Label Problem for Model-Agnostic Knowledge Integration

Simone Conia, +1 more

- 01 Apr 2021

TL;DR: The authors proposed a multi-label classification approach in which multiple senses can be assigned to each target word and achieved state-of-the-art results in English all-words WSD.

...read moreread less

37

References

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

Proceedings Article•10.3115/V1/D14-1162

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

41.6K

•Proceedings Article

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

- 16 Jan 2013

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.

...read moreread less

27.5K

•Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

- 01 Jan 2015

TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

...read moreread less

25.7K

Proceedings Article•10.18653/V1/N19-1423

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

24.6K