Multiword expression

Topic Tools

Papers published on a yearly basis

Papers

Book Chapter•10.1007/3-540-45715-1_1•

Multiword Expressions: A Pain in the Neck for NLP

[...]

Ivan A. Sag¹, Timothy Baldwin¹, Francis Bond, Ann Copestake², Dan Flickinger¹ - Show less +1 more•Institutions (2)

Stanford University¹, University of Cambridge²

17 Feb 2002

TL;DR: The various kinds of multiword expressions should be analyzed in distinct ways, including listing "words with spaces", hierarchically organized lexicons, restricted combinatoric rules, lexical selection, "idiomatic constructions" and simple statistical affinity.

...read moreread less

Abstract: Multiword expressions are a key problem for the development of large-scale, linguistically sound natural language processing technology. This paper surveys the problem and some currently available analytic techniques. The various kinds of multiword expressions should be analyzed in distinct ways, including listing "words with spaces", hierarchically organized lexicons, restricted combinatoric rules, lexical selection, "idiomatic constructions" and simple statistical affinity. An adequate comprehensive analysis of multiword expressions must employ both symbolic and statistical techniques.

...read moreread less

1,233 citations

Journal Article•10.1162/COLI_A_00302•

Multiword expression processing: A survey

[...]

Mathieu Constant¹, Gülşen Eryiğit², Johanna Monti³, Lonneke van der Plas⁴, Carlos Ramisch⁵, Mike Rosner⁴, Amalia Todirascu⁶ - Show less +3 more•Institutions (6)

University of Lorraine¹, Istanbul Technical University², University of Naples Federico II³, University of Malta⁴, Aix-Marseille University⁵, University of Strasbourg⁶

01 Dec 2017-Computational Linguistics

TL;DR: A shared understanding of what is meant by “MWE processing” is offered, distinguishing the subtasks of MWE discovery and identification, and the interactions between MWE processing and two use cases: Parsing and machine translation are elucidated.

...read moreread less

Abstract: Multiword expressions MWEs are a class of linguistic forms spanning conventional word boundaries that are both idiosyncratic and pervasive across different languages. The structure of linguistic processing that depends on the clear distinction between words and phrases has to be re-thought to accommodate MWEs. The issue of MWE handling is crucial for NLP applications, where it raises a number of challenges. The emergence of solutions in the absence of guiding principles motivates this survey, whose aim is not only to provide a focused review of MWE processing, but also to clarify the nature of interactions between MWE processing and downstream applications. We propose a conceptual framework within which challenges and research contributions can be positioned. It offers a shared understanding of what is meant by "MWE processing," distinguishing the subtasks of MWE discovery and identification. It also elucidates the interactions between MWE processing and two use cases: Parsing and machine translation. Many of the approaches in the literature can be differentiated according to how MWE processing is timed with respect to underlying use cases. We discuss how such orchestration choices affect the scope of MWE-aware systems. For each of the two MWE processing subtasks and for each of the two use cases, we conclude on open issues and research perspectives.

...read moreread less

240 citations

Patent•

Method and computer system for part-of-speech tagging of incomplete sentences

[...]

Nelly Tarbouriech¹, Herve Poirier¹•Institutions (1)

Xerox¹

19 Dec 2000

TL;DR: In this article, a method and a computer system for enhanced part-of-speech tagging as well as grammatically disambiguating a phrase is described. But this method is limited to a single phrase.

...read moreread less

Abstract: The invention relates to a method and a computer system for enhanced part-of-speech (POS-) tagging as well as grammatically disambiguating a phrase. A phrase is usually a short multiword expression that may be ambiguous. By introducing grammatical constraints the invention supports POS-tagging as well as grammatically disambiguating the phrase. According to an identifier for the phrase, the phrase is supplemented with artificial context information. The supplemented phrase is then POS-tagged or grammatically disambiguated. Important applications are POS-tagging, Automatic Term Encoding, Headword Detection and Information Retrieval.

...read moreread less

151 citations

Journal Article•10.1162/TACL_A_00176•

Discriminative Lexical Semantic Segmentation with Gaps: Running the MWE Gamut

[...]

Nathan Schneider¹, Emily Danchik¹, Chris Dyer¹, Noah A. Smith¹•Institutions (1)

Carnegie Mellon University¹

30 Apr 2014-Transactions of the Association for Computational Linguistics

TL;DR: A novel representation, evaluation measure, and supervised models are presented for the task of identifying the multiword expressions (MWEs) in a sentence, resulting in a lexical semantic segmentation, enabling efficient sequence tagging algorithms for feature-rich discriminative models.

...read moreread less

Abstract: We present a novel representation, evaluation measure, and supervised models for the task of identifying the multiword expressions (MWEs) in a sentence, resulting in a lexical semantic segmentation . Our approach generalizes a standard chunking representation to encode a subset of projective MWEs containing gaps, thereby enabling efficient sequence tagging algorithms for feature-rich discriminative models. Experiments on a new dataset of English web text offer the first linguistically-driven evaluation of MWE identification with truly heterogeneous expression types. Our statistical sequence model greatly outperforms a lookup-based segmentation procedure, achieving 60% F 1 for MWE identification.

...read moreread less

112 citations

Proceedings Article•

Task-based Evaluation of Multiword Expressions: a Pilot Study in Statistical Machine Translation

[...]

Marine Carpuat¹, Mona Diab¹•Institutions (1)

Columbia University¹

2 Jun 2010

TL;DR: Two different integration strategies for MWE inSMT are proposed, which take advantage of different degrees of MWE semantic compositionality and yield complementary improvements in SMT quality on a large-scale translation task.

...read moreread less

Abstract: We conduct a pilot study for task-oriented evaluation of Multiword Expression (MWE) in Statistical Machine Translation (SMT). We propose two different integration strategies for MWE in SMT, which take advantage of different degrees of MWE semantic compositionality and yield complementary improvements in SMT quality on a large-scale translation task.

...read moreread less

110 citations

...

Expand

Year	Papers
2021	9
2020	16
2019	5
2018	15
2017	25
2016	11

Topic Tools

Papers published on a yearly basis

Papers

Multiword Expressions: A Pain in the Neck for NLP

Multiword expression processing: A survey

Method and computer system for part-of-speech tagging of incomplete sentences

Discriminative Lexical Semantic Segmentation with Gaps: Running the MWE Gamut

Task-based Evaluation of Multiword Expressions: a Pilot Study in Statistical Machine Translation

Related Topics (5)

Performance Metrics