Journal Article10.1142/S0219427907001640
Pattern Dictionary Development based on Non-Compositional Language Model for Japanese Compound and Complex Sentences
Satoru Ikehara,Masato Tokuhisa,Jin'ichi Murakami,Masashi Saraki,Masahiro Miyazaki,Naoshi Ikeda +5 more
1
TL;DR: A large-scale sentence pattern dictionary (SP-dictionary) for Japanese compound and complex sentences has been developed and achieved a syntactic coverage of 92% and a semantic coverage of 70%.
read more
Abstract: A large-scale sentence pattern dictionary (SP-dictionary) for Japanese compound and complex sentences has been developed. The dictionary has been compiled based on the non-compositional language model. Sentences with 2 or 3 predicates are extracted from a Japanese-to-English parallel corpus of 1 million sentences, and the compositional constituents contained within them are generalized to produce a SP-dictionary containing a total of 215,000 pattern pairs. In evaluation tests, the SP-dictionary achieved a syntactic coverage of 92% and a semantic coverage of 70%.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Phrase-Level Pattern-Based Machine Translation
Jun Sakata,Masato Tokuhisa +1 more
- 01 Jan 2012
TL;DR: The results of trans- lation experiments show that the quality of phrase translation is still low, but these problems are to be resolved in the future work.
References
A statistical approach to machine translation
Peter Fitzhugh Brown,John Cocke,Stephen A. Della Pietra,Vincent J. Della Pietra,F. Jelinek,John Lafferty,Robert Leroy Mercer,Paul S. Roossin +7 more
TL;DR: The application of the statistical approach to translation from French to English and preliminary results are described and the results are given.
Natural language processing
TL;DR: $&% ' () * + " , .*/ 0 .
955
The CMU statistical machine translation system
Stephan Vogel,Ying Zhang,Fei Huang,Alicia Tribble,Ashish Venugopal,Bing Zhao,Alex Waibel +6 more
- 23 Sep 2003
TL;DR: This paper describes the components of a statistical machine translation system that combines phrase-to-phrase translations extracted from a bilingual corpus using different alignment approaches and shows how a manual lexicon can be incorporated into the statistical system in an optimized way.
Adding Linguistic Knowledge to a Lexical Example-Based Translation System
Ralf D. Brown
- 01 Jan 2006
TL;DR: A modest investment of time on the order of two person-weeks adding linguistic knowledge reduces the required example text by a factor of six or more, while retaining comparable translation quality, which makes EBMT more attractive for so-called "low-density" languages for which little data is available.
105
A Pattern-based Machine Translation System Extended by Example-based Processing
Hideo Watanabe,Koichi Takeda +1 more
- 10 Aug 1998
TL;DR: This paper describes a machine translation system called PalmTree which uses the "pattern based" approach as a fundamental framework, and describes several methods; pruning techniques for the former, and introduction of example-based processing for the latter.