Multi-Document Summarization Using Cross-Language Texts

Open Access

- 01 Jan 2004

13

TL;DR: This work attempts to generate a summary in the source language, using documents translated by a machine translator and a summarization system for the target language, and shows that multi-document summarization with cross-language texts is possible.

Citations

•Proceedings Article•10.18653/V1/2020.FINDINGS-EMNLP.360

WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization

Faisal Ladhak, +3 more

- 07 Oct 2020

TL;DR: WikiLingua is a large-scale, multilingual dataset for evaluating cross-lingual abstractive summarization systems; it contains how-to guides on a diverse set of topics written by human authors.

136

•Posted Content

WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization

Faisal Ladhak, +3 more

- 07 Oct 2020

- arXiv: Computation and Language

TL;DR: This work proposes a method for direct cross-lingual summarization that requires no translation at inference time, leveraging synthetic data and neural machine translation as a pre-training step; it significantly outperforms the baseline approaches while being more cost-efficient during inference.

125

•Journal Article•10.1609/aaai.v36i10.21359

Improving Neural Cross-Lingual Abstractive Summarization via Employing Optimal Transport Distance for Knowledge Distillation

Thong Nguyen, +1 more

- 28 Jun 2022

- Proceedings of the ... AAAI Conference o...

TL;DR: This paper proposes a knowledge distillation loss using Sinkhorn divergence, an optimal-transport distance, to estimate the discrepancy between the teacher and student representations; distilling the knowledge of the summarization teacher into the student in this way can explicitly construct cross-lingual correlation.

41

•Posted Content

NCLS: Neural Cross-Lingual Summarization

Junnan Zhu, +6 more

- 31 Aug 2019

- arXiv: Computation and Language

TL;DR: This paper proposes an end-to-end neural cross-lingual summarization (NCLS) framework with multi-task learning to improve the quality of generated summaries.

17

•Proceedings Article•10.48550/arXiv.2204.13512

Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization

Ruipeng Jia, +5 more

- 28 Apr 2022

TL;DR: NLSSum (Neural Label Search for Summarization) jointly learns hierarchical weights for different sets of labels together with the summarization model, and achieves state-of-the-art results under both human and automatic evaluation across two datasets.

4

References

•Journal Article

Accurate methods for the statistics of surprise and coincidence

Ted Dunning

- 01 Mar 1993

- Computational Linguistics

TL;DR: This paper describes the basis of a measure based on likelihood ratios that can be applied to the analysis of text; in cases where traditional contingency table methods work well, the likelihood ratio tests described here give nearly identical results.

2.9K

•Journal Article•10.1016/J.IPM.2003.10.006

Centroid-based summarization of multiple documents

Dragomir R. Radev, +3 more

- 01 Nov 2004

- Information Processing and Management

TL;DR: A multi-document summarizer, MEAD, is presented, which generates summaries using cluster centroids produced by a topic detection and tracking system and an evaluation scheme based on sentence utility and subsumption is applied.

1.2K

•Proceedings Article•10.3115/1117575.1117578

Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies

Dragomir R. Radev, +2 more

- 30 Apr 2000

TL;DR: A multi-document summarizer, called MEAD, is presented, which generates summaries using cluster centroids produced by a topic detection and tracking system and two new techniques, based on sentence utility and subsumption, are described.

511

•Proceedings Article•10.7916/D8DV1T7T

Experiments in multidocument summarization

Barry Schiffman, +2 more

- 24 Mar 2002

TL;DR: A multidocument summarizer built upon research into the detection of new information uses several new strategies to select interesting and informative sentences, including an innovative measure of importance derived from the analysis of a large corpus.

79

A Summarization System with Categorization of Document Sets.

Chikashi Nobata, +3 more

- 01 Oct 2002

TL;DR: Two modules are incorporated into the authors' earlier summarization system, which is based on a sentence-extraction technique, so that the system can be applied to the multi-document summarization task.

21