Automatic Text Summarization Based on the Global Document Annotation
Katashi Nagao, Koiti Hasida +1 more
- 10 Aug 1998
- Vol. 2, pp 917-921
TL;DR: The main features are a domain/style-free algorithm and personalized summarization that reflects readers' interests and preferences; the proposed method is flexible enough to dynamically generate summaries of various sizes.
Abstract: The GDA (Global Document Annotation) project proposes a tag set that allows machines to automatically infer the underlying semantic and pragmatic structure of documents. Its objective is to promote the development and spread of NLP/AI applications that turn GDA-tagged documents into versatile, intelligent content, which should in turn motivate WWW (World Wide Web) users to tag their documents as part of content authoring. This paper discusses automatic text summarization based on GDA. Its main features are a domain/style-free algorithm and personalized summarization that reflects readers' interests and preferences. To calculate the importance score of a text element, the algorithm uses spreading activation on an intradocument network that connects text elements via thematic, rhetorical, and coreferential relations. The proposed method is flexible enough to dynamically generate summaries of various sizes. A summary browser supporting personalization is also reported.
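The scoring idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (spread_activation, summarize), the decay factor, the fixed iteration count, the edge weights, and the keyword-based interest boost are all assumptions chosen for illustration. Text elements are nodes, relations are weighted undirected links, each node repeatedly receives a damped share of its neighbors' activation, and the highest-scoring elements are returned in document order for whatever summary size is requested.

```python
from collections import defaultdict


def spread_activation(edges, external, decay=0.5, iterations=50):
    """Return an activation score per node (hypothetical helper).

    edges:    iterable of (node_a, node_b, weight) undirected links
    external: dict mapping node -> constant external input
    """
    neighbors = defaultdict(list)
    for a, b, w in edges:
        neighbors[a].append((b, w))
        neighbors[b].append((a, w))
    nodes = set(neighbors) | set(external)

    # Dividing by the largest weighted degree keeps the update a
    # contraction, so the scores settle instead of growing without bound.
    max_degree = max(
        (sum(w for _, w in neighbors[n]) for n in nodes), default=0.0
    ) or 1.0

    score = {n: external.get(n, 0.0) for n in nodes}
    for _ in range(iterations):
        score = {
            n: external.get(n, 0.0)
            + decay * sum(w * score[m] for m, w in neighbors[n]) / max_degree
            for n in nodes
        }
    return score


def summarize(elements, edges, interests=(), size=2):
    """Pick the `size` highest-scoring elements, kept in document order.

    Personalization is approximated (an assumption of this sketch) by
    giving extra external input to elements that mention an interest term.
    """
    external = {}
    for idx, text in enumerate(elements):
        boost = sum(1.0 for term in interests if term.lower() in text.lower())
        external[idx] = 1.0 + boost
    score = spread_activation(edges, external)
    top = sorted(score, key=lambda i: (-score[i], i))[:size]
    return [elements[i] for i in sorted(top)]


# Toy document and relation links, invented for this example.
DOC = [
    "GDA defines a tag set for documents.",
    "The tags expose rhetorical and coreferential relations.",
    "A summarizer scores elements using those relations.",
    "A browser lets readers personalize the summary.",
]
LINKS = [(0, 1, 1.0), (1, 2, 1.0), (2, 3, 0.5), (0, 2, 0.5)]

if __name__ == "__main__":
    print(summarize(DOC, LINKS, size=2))
    print(summarize(DOC, LINKS, interests=("browser",), size=2))
```

Changing the size argument yields shorter or longer summaries from the same scores, and the interests argument shifts which elements are selected, which mirrors the two properties the abstract emphasizes.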