Book Chapter, DOI: 10.1007/978-3-540-30211-7_58
A nearest-neighbor method for resolving PP-Attachment ambiguity
Shaojun Zhao, Dekang Lin
- 22 Mar 2004
- pp 545-554
38
TL;DR: Presents a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities and finds that the cosine of pointwise mutual information vectors is a significantly better similarity measure than several other commonly used similarity measures.
Abstract: We present a nearest-neighbor algorithm for resolving prepositional phrase attachment ambiguities. Its performance is significantly higher than previous corpus-based methods for PP-attachment that do not rely on manually constructed knowledge bases. We will also show that the PP-attachment task provides a way to evaluate methods for computing distributional word similarities. Our experiments indicate that the cosine of pointwise mutual information vector is a significantly better similarity measure than several other commonly used similarity measures.
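As a rough illustration of the similarity measure named in the abstract, the following is a minimal sketch of computing the cosine between pointwise mutual information (PMI) vectors. It assumes each word is described by counts of (word, context feature) co-occurrences; the feature strings, the toy data, and the function names `pmi_vectors` and `cosine` are hypothetical and not taken from the paper, which may use different features, weighting, or smoothing.

```python
import math
from collections import Counter, defaultdict


def pmi_vectors(pairs):
    """Build a sparse PMI vector for each word from (word, feature) pairs."""
    pairs = list(pairs)
    total = len(pairs)
    pair_counts = Counter(pairs)
    word_counts = Counter(w for w, _ in pairs)
    feat_counts = Counter(f for _, f in pairs)
    vectors = defaultdict(dict)
    for (w, f), n in pair_counts.items():
        # PMI(w, f) = log( P(w, f) / (P(w) * P(f)) ), estimated from raw counts
        vectors[w][f] = math.log(n * total / (word_counts[w] * feat_counts[f]))
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v[f] for f, x in u.items() if f in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy co-occurrence data (hypothetical): each pair is (word, context feature).
pairs = [
    ("beer", "obj-of:drink"), ("wine", "obj-of:drink"),
    ("beer", "mod:cold"), ("wine", "mod:red"),
    ("car", "obj-of:drive"), ("car", "mod:red"),
]
vectors = pmi_vectors(pairs)
sim_beer_wine = cosine(vectors["beer"], vectors["wine"])
sim_beer_car = cosine(vectors["beer"], vectors["car"])
```

In this toy data, "beer" and "wine" share a context feature while "beer" and "car" share none, so the first pair scores higher. A nearest-neighbor classifier could use such word similarities to compare a new attachment instance against labeled training instances, though how the paper combines them is not specified in the abstract.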
Citations
Named entity recognition in biomedical texts using an HMM model
Shaojun Zhao
- 28 Aug 2004
TL;DR: It is shown that word similarity is a potential method for automatically obtaining word formation, prefix, suffix and abbreviation information from biomedical texts, as well as useful word distribution information.
The Notion of Argument in Prepositional Phrase Attachment
Paola Merlo, Eva Esteve Ferrer
TL;DR: This article refines the formulation of the problem of prepositional phrase (PP) attachment as a four-way disambiguation problem, and introduces a method to learn arguments and adjuncts based on a definition of arguments as a vector of features.
Priberam's question answering system in a cross-language environment
Adán Cassan, Helena Figueira, André F. T. Martins, Afonso Mendes, Pedro Mendes, Cláudia Pinto, Daniel Vidal
- 20 Sep 2006
TL;DR: The improvements and changes implemented in Priberam's QA system since its last CLEF participation are described, detailing the work involved in its cross-lingual extension and discussing the results of the runs submitted for evaluation.
References
Introduction to WordNet: An On-line Lexical Database
TL;DR: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.
Divergence measures based on the Shannon entropy
TL;DR: A novel class of information-theoretic divergence measures based on the Shannon entropy is introduced, which do not require the condition of absolute continuity to be satisfied by the probability distributions involved and are established in terms of bounds.
Automatic Retrieval and Clustering of Similar Words
Dekang Lin
- 10 Aug 1998
TL;DR: A word similarity measure based on the distributional pattern of words allows the automatically constructed thesaurus to be significantly closer to WordNet than Roget Thesaurus is.
1.8K
Journal Article
Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging
TL;DR: Presents transformation-based error-driven learning and applies it to part-of-speech tagging as a case study.
1.7K
Mining the web for synonyms: PMI-IR versus LSA on TOEFL
Peter D. Turney
- 05 Sep 2001
TL;DR: This paper presents PMI-IR, an unsupervised learning algorithm for recognizing synonyms that uses Pointwise Mutual Information (PMI) and Information Retrieval (IR), based on statistical data acquired by querying a web search engine, to measure the similarity of pairs of words.