Journal Article10.1117/1.1344590
Document compression using rate-distortion optimized segmentation
Hui Cheng,Charles A. Bouman +1 more
TL;DR: A multilayer compression algorithm for document images that can achieve a much higher subjec- tive quality than state-of-the-art compression algorithms, such as DjVu and SPIHT.
read more
Abstract: Effective document compression algorithms require that scanned document images be first segmented into regions such as text, pictures, and background. In this paper, we present a multilayer compression algorithm for document images. This compression al- gorithm first segments a scanned document image into different classes, then compresses each class using an algorithm specifically designed for that class. Two algorithms are investigated for seg- menting document images: a direct image segmentation algorithm called the trainable sequential MAP (TSMAP) segmentation algo- rithm, and a rate-distortion optimized segmentation (RDOS) algo- rithm. The RDOS algorithm works in a closed loop fashion by apply- ing each coding method to each region of the document and then selecting the method that yields the best rate-distortion trade-off. Compared with the TSMAP algorithm, the RDOS algorithm can of- ten result in a better rate-distortion trade-off, and produce more ro- bust segmentations by eliminating those misclassifications which can cause severe artifacts. At similar bit rates, the multilayer com- pression algorithm using RDOS can achieve a much higher subjec- tive quality than state-of-the-art compression algorithms, such as DjVu and SPIHT. © 2001 SPIE and IS&T. (DOI: 10.1117/1.1344590)
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Compound image compression for real-time computer screen image transmission
Tong Lin,Pengwei Hao +1 more
TL;DR: Experimental results show that the SPEC has very low complexity and provides visually lossless quality while keeping competitive compression ratios.
Patent
Scalable layered coding in a multi-layer, compound-image data transmission system
Xin Li,Louis Joseph Kerofsky,Kristine E. Matthews +2 more
- 06 Mar 2002
TL;DR: In this article, a data coder prepares a frame of data for transmission over a data channel, and the frame is first broken into a series of non-overlapping blocks, which are analyzed to determine if they are a picture block or a non-picture block.
70
Fast search for best representations in multitree dictionaries
TL;DR: A new framework of multitree dictionaries is developed, which includes some previously proposed dictionaries as special cases and shows how to efficiently find the best representation in a multitree dictionary using a recursive tree-pruning algorithm.
Text Segmentation for MRC Document Compression
Eri Haneda,Charles A. Bouman +1 more
TL;DR: This paper proposes a novel multiscale segmentation scheme for MRC document encoding based upon the sequential application of two algorithms and shows that the new algorithm achieves greater accuracy of text detection but with a lower false detection rate of nontext features.
TSA-SCC: Text Semantic-Aware Screen Content Coding With Ultra Low Bitrate
TL;DR: In this paper , a general text semantic-aware screen content coding scheme (TSA-SCC) was proposed for ultra low bitrate setting, which detects the abrupt picture in a screen content video (or image), recognizes textual information (including word, position, font type, font size and font color) in abrupt picture based on neural networks, and encodes texts with text coding tools.
36
References
A new, fast, and efficient image codec based on set partitioning in hierarchical trees
Amir Said,William A. Pearlman +1 more
TL;DR: The image coding results, calculated from actual file sizes and images reconstructed by the decoding algorithm, are either comparable to or surpass previous results obtained through much more sophisticated and computationally complex methods.
6.1K
•Book
JPEG: Still Image Data Compression Standard
William B. Pennebaker,Joan L. Mitchell +1 more
- 31 Dec 1992
TL;DR: This chapter discusses JPEG Syntax and Data Organization, the history of JPEG, and some of the aspects of the Human Visual Systems that make up JPEG.
3.3K
•Book
Data Compression Book
Mark Nelson
- 01 Jan 1991
TL;DR: In this article, the authors present a guide to data compression techniques, including Shannon-Fano and Huffman coding techniques, lossy compression, JPEG compression algorithm, and fractal compression.
655
Color quantization of images
TL;DR: The authors develop algorithms for the design of hierarchical tree structured color palettes incorporating performance criteria which reflect subjective evaluations of image quality, which produce higher-quality displayed images and require fewer computations than previously proposed methods.
•Book
The Data Compression Book
Mark Nelson
- 01 Jul 1991
TL;DR: In this paper, the authors present a guide to data compression techniques, including Shannon-Fano and Huffman coding techniques, lossy compression, JPEG compression algorithm, and fractal compression.
550