Improving Statistical Machine Translation Accuracy Using Bilingual Lexicon Extractionwith Paraphrases

Open AccessProceedings Article

Improving Statistical Machine Translation Accuracy Using Bilingual Lexicon Extractionwith Paraphrases

- 01 Dec 2014

- pp 262-271

4

TL;DR: Paraphrases are used to smooth the vectors used in comparable feature estimation with BLE and improve the qual- ity of comparable features, which can improve the accuracy of the translation model thus improving SMT performance.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1007/S42979-021-00723-4

A Survey of Orthographic Information in Machine Translation

Bharathi Raja Chakravarthi, +3 more

- 04 Aug 2020

- arXiv: Computation and Language

TL;DR: A survey of research regarding orthography’s influence on machine translation of under-resourced languages is offered and how orthographic information can be utilised to improve machine translation is described.

...read moreread less

36

Journal Article•10.1016/J.CSL.2018.07.002

Extracting parallel fragments from comparable documents using a generative model

Somayeh Bakhshaei, +2 more

- 01 Jan 2019

- Computer Speech & Language

TL;DR: The experimental results show significant improvement if the extracted fragments generated by the proposed method are used for augmenting an existing parallel corpus in an statistical machine translation system.

...read moreread less

3

•Journal Article•10.1007/S42979-021-00723-4

A Survey of Orthographic Information in Machine Translation.

Bharathi Raja Chakravarthi, +3 more

- 01 Jan 2021

TL;DR: In this paper, a survey of orthographic influence on machine translation of under-resourced languages is presented, focusing on multilingual settings and bilingual lexicon induction, and a recent trend that links orthographic information with well-established machine translation methods is discussed.

...read moreread less

2

Journal Article•10.1145/3329713

Matching Graph, a Method for Extracting Parallel Information from Comparable Corpora

Somayeh Bakhshaei, +2 more

- 25 Jul 2019

TL;DR: A generative model is proposed for efficient extraction of parallel fragments from a pair of comparable documents that is a graph called the Matching Graph that can be trained on a small initial seed and shown to perform significantly better than other recently published models.

...read moreread less

1

References

•Proceedings Article•10.3115/1557769.1557821

Moses: Open Source Toolkit for Statistical Machine Translation

Philipp Koehn, +13 more

- 25 Jun 2007

TL;DR: An open-source toolkit for statistical machine translation whose novel contributions are support for linguistically motivated factors, confusion network decoding, and efficient data formats for translation models and language models.

...read moreread less

6.3K

•Journal Article

The mathematics of statistical machine translation: parameter estimation

Peter Fitzhugh Brown, +3 more

- 01 Jun 1993

- Computational Linguistics

TL;DR: The authors describe a series of five statistical models of the translation process and give algorithms for estimating the parameters of these models given a set of pairs of sentences that are translations of one another.

...read moreread less

4.9K

Journal Article•10.1080/00437956.1954.11659520

Distributional Structure

Zellig S. Harris

- 01 Jan 1954

- WORD

TL;DR: This discussion will discuss how each language can be described in terms of a distributional structure, i.e. in Terms of the occurrence of parts relative to other parts, and how this description is complete without intrusion of other features such as history or meaning.

...read moreread less

4.2K

•Proceedings Article•10.3115/1073445.1073462

Statistical phrase-based translation

Philipp Koehn, +2 more

- 27 May 2003

TL;DR: The empirical results suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translation.

...read moreread less

4.1K

Book Chapter•10.1007/978-94-009-8467-7_1

Distributional Structure

Zellig S. Harris

- 01 Jan 1981

TL;DR: This discussion will discuss how each language can be described in terms of a distributional structure, i.e. in Terms of the occurrence of parts relative to other parts, and how this description is complete without intrusion of other features such as history or meaning.

...read moreread less

3.6K