End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

doi:10.18653/V1/P16-1101

Open Access•Proceedings Article•10.18653/V1/P16-1101

Xuezhe Ma, +1 more

- 04 Mar 2016

- Vol. 1, pp 1064-1074

2.9K

TL;DR: This paper combines a bidirectional LSTM, a CNN, and a CRF for sequence labeling and reports state-of-the-art performance on two datasets: the Penn Treebank WSJ corpus for POS tagging and the CoNLL 2003 corpus for NER.

Citations

•Proceedings Article•10.18653/V1/N18-1202

Deep contextualized word representations

Matthew E. Peters, +6 more

- 15 Feb 2018

TL;DR: This paper introduced a new type of deep contextualized word representation that models both complex characteristics of word use (e.g., syntax and semantics), and how these uses vary across linguistic contexts (i.e., to model polysemy).

11.7K

•Proceedings Article•10.18653/V1/N16-1030

Neural Architectures for Named Entity Recognition

Guillaume Lample, +4 more

- 04 Mar 2016

TL;DR: Paper presented at the 2016 Conference of the North American Chapter of the Association for Computational Linguistics, held in San Diego (CA, USA), June 12 to 17, 2016.

5.3K

•Journal Article•10.1145/3560815

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

- 16 Jan 2023

- ACM Computing Surveys

TL;DR: The authors survey and organize research on a new paradigm in natural language processing, which they dub "prompt-based learning", and describe a unified set of mathematical notations that can cover a wide variety of existing work.

1.7K

•Proceedings Article

Contextual String Embeddings for Sequence Labeling

Alan Akbik, +2 more

- 01 Aug 2018

TL;DR: This paper proposes to leverage the internal states of a trained character language model to produce a novel type of word embedding, which the authors refer to as contextual string embeddings. These embeddings fundamentally model words as sequences of characters and are contextualized by their surrounding text.

1.4K

•Journal Article•10.1109/TKDE.2020.2981314

A Survey on Deep Learning for Named Entity Recognition

Jing Li, +3 more

- 17 Mar 2020

- IEEE Transactions on Knowledge and Data ...

TL;DR: A comprehensive review on existing deep learning techniques for NER is provided in this paper, where the authors systematically categorize existing works based on a taxonomy along three axes: distributed representations for input, context encoder, and tag decoder.

1.1K

References

•Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 05 Dec 2013

TL;DR: This paper presents a simple method for finding phrases in text, shows that learning good vector representations for millions of phrases is possible, and describes a simple alternative to the hierarchical softmax called negative sampling.

24.1K

•Proceedings Article•10.1109/ICCV.2015.123

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

Kaiming He, +3 more

- 07 Dec 2015

TL;DR: This paper proposes a Parametric Rectified Linear Unit (PReLU) that improves model fitting with nearly zero extra computational cost and little overfitting risk; the resulting networks achieve a 4.94% top-5 test error on the ImageNet 2012 classification dataset.

18.2K

•Proceedings Article

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

John Lafferty, +2 more

- 28 Jun 2001

TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.

15.4K

•Posted Content

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

Kaiming He, +3 more

- 06 Feb 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work proposes a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit and derives a robust initialization method that particularly considers the rectifier nonlinearities.

15.1K

•Journal Article•10.1162/NECO.1989.1.4.541

Backpropagation applied to handwritten zip code recognition

Yann LeCun, +6 more

- 01 Dec 1989

- Neural Computation

TL;DR: This paper demonstrates how constraints from the task domain can be integrated into a backpropagation network through the architecture of the network, and applies the approach successfully to the recognition of handwritten zip code digits provided by the U.S. Postal Service.

12.5K

Related Papers (5)

GloVe: Global Vectors for Word Representation

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

Long short-term memory

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Deep contextualized word representations