Topic

Text simplification

About: Text simplification is a research topic. Over its lifetime, 660 publications have been published within this topic, receiving 13,806 citations.

Papers

Journal Article•10.3758/BF03195564•

Coh-Metrix: Analysis of text on cohesion and language

Arthur C. Graesser¹, Danielle S. McNamara¹, Max M. Louwerse¹, Zhiqiang Cai¹•Institutions (1)

University of Memphis¹

01 May 2004-Behavior Research Methods Instruments & Computers

TL;DR: Standard text readability formulas scale texts on difficulty by relying on word length and sentence length, whereas Coh-Metrix is sensitive to cohesion relations, world knowledge, and language and discourse characteristics.

Abstract: Advances in computational linguistics and discourse processing have made it possible to automate many language- and text-processing mechanisms. We have developed a computer tool called Coh-Metrix, which analyzes texts on over 200 measures of cohesion, language, and readability. Its modules use lexicons, part-of-speech classifiers, syntactic parsers, templates, corpora, latent semantic analysis, and other components that are widely used in computational linguistics. After the user enters an English text, Coh-Metrix returns measures requested by the user. In addition, a facility allows the user to store the results of these analyses in data files (such as Text, Excel, and SPSS). Standard text readability formulas scale texts on difficulty by relying on word length and sentence length, whereas Coh-Metrix is sensitive to cohesion relations, world knowledge, and language and discourse characteristics.

1,559 citations
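
The Coh-Metrix abstract above contrasts its cohesion measures with standard readability formulas that rely on word length and sentence length. As a rough illustration of that baseline (not of Coh-Metrix itself), here is a minimal sketch of the Flesch Reading Ease formula. The syllable counter is a crude vowel-group heuristic assumed for this example, and the function names and sample sentences are hypothetical.

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate: count runs of vowels (a heuristic, not a dictionary lookup)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: higher scores indicate easier text.

    Uses only average sentence length (words per sentence) and
    average word length (syllables per word).
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Hypothetical sample texts for comparison.
simple = "The cat sat on the mat. It was warm."
complex_ = "Computational linguistics facilitates automated discourse analysis considerably."

print(flesch_reading_ease(simple))
print(flesch_reading_ease(complex_))
```

Because the score depends only on surface counts, two texts with the same word and sentence lengths get the same score regardless of how well their sentences connect, which is the limitation the abstract points to.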

Journal Article•10.1162/TACL_A_00107•

Optimizing Statistical Machine Translation for Text Simplification

Wei Xu¹, Courtney Napoles², Ellie Pavlick¹, Quanze Chen¹, Chris Callison-Burch¹•Institutions (2)

University of Pennsylvania¹, Johns Hopkins University²

27 Jul 2016-Transactions of the Association for Computational Linguistics

TL;DR: This work is the first to design automatic metrics that are effective for tuning and evaluating simplification systems, which will facilitate iterative development for this task.

Abstract: Most recent sentence simplification systems use basic machine translation models to learn lexical and syntactic paraphrases from a manually simplified parallel corpus. These methods are limited by the quality and quantity of manually simplified corpora, which are expensive to build. In this paper, we conduct an in-depth adaptation of statistical machine translation to perform text simplification, taking advantage of large-scale paraphrases learned from bilingual texts and a small amount of manual simplifications with multiple references. Our work is the first to design automatic metrics that are effective for tuning and evaluating simplification systems, which will facilitate iterative development for this task.

698 citations

Journal Article•10.1162/TACL_A_00139•

Problems in Current Text Simplification Research: New Data Can Help

Wei Xu¹, Chris Callison-Burch¹, Courtney Napoles²•Institutions (2)

University of Pennsylvania¹, Johns Hopkins University²

24 May 2015-Transactions of the Association for Computational Linguistics

TL;DR: This opinion paper argues that focusing on Wikipedia limits simplification research, and introduces a new simplification dataset that is a significant improvement over Simple Wikipedia, and presents a novel quantitative-comparative approach to study the quality of simplification data resources.

Abstract: Simple Wikipedia has dominated simplification research in the past 5 years. In this opinion paper, we argue that focusing on Wikipedia limits simplification research. We back up our arguments with corpus analysis and by highlighting statements that other researchers have made in the simplification literature. We introduce a new simplification dataset that is a significant improvement over Simple Wikipedia, and present a novel quantitative-comparative approach to study the quality of simplification data resources.

528 citations

Proceedings Article•

A Monolingual Tree-based Translation Model for Sentence Simplification

Zhemin Zhu¹, Delphine Bernhard², Iryna Gurevych¹•Institutions (2)

Technische Universität Darmstadt¹, Centre national de la recherche scientifique²

23 Aug 2010

TL;DR: A Tree-based Simplification Model (TSM) is proposed, which, to the authors' knowledge, is the first statistical simplification model covering splitting, dropping, reordering and substitution integrally.

Abstract: In this paper, we consider sentence simplification as a special form of translation with the complex sentence as the source and the simple sentence as the target. We propose a Tree-based Simplification Model (TSM), which, to our knowledge, is the first statistical simplification model covering splitting, dropping, reordering and substitution integrally. We also describe an efficient method to train our model with a large-scale parallel dataset obtained from the Wikipedia and Simple Wikipedia. The evaluation shows that our model achieves better readability scores than a set of baseline systems.

482 citations

Proceedings Article•10.18653/V1/D17-1062•

Sentence Simplification with Deep Reinforcement Learning

Xingxing Zhang¹, Mirella Lapata¹•Institutions (1)

University of Edinburgh¹

11 Sep 2017

TL;DR: This paper proposes a deep reinforcement learning framework for sentence simplification, which explores the space of possible simplifications while learning to optimize a reward function that encourages outputs that are simple, fluent, and preserve the meaning of the input.

Abstract: Sentence simplification aims to make sentences easier to read and understand. Most recent approaches draw on insights from machine translation to learn simplification rewrites from monolingual corpora of complex and simple sentences. We address the simplification problem with an encoder-decoder model coupled with a deep reinforcement learning framework. Our model, which we call DRESS (as shorthand for Deep REinforcement Sentence Simplification), explores the space of possible simplifications while learning to optimize a reward function that encourages outputs which are simple, fluent, and preserve the meaning of the input. Experiments on three datasets demonstrate that our model outperforms competitive simplification systems.

384 citations

Performance Metrics

Papers: 699

Citations: 3,616

No. of papers in the topic in previous years
Year	Papers
2025	1
2024	8
2023	13
2022	17
2021	61
2020	69
